Top 6 Arxiv Papers Today in Applications


2.014 Mikeys
#1. Modelling Diffusion through Statistical Network Analysis: A Simulation Study
Johan A. Elkink, Thomas U. Grund
The study of international relations by definition deals with interdependencies among countries. One form of interdependence between countries is the diffusion of country-level features, such as policies, political regimes, or conflict. In these studies, the outcome variable tends to be categorical, and the primary concern is the clustering of the outcome variable among connected countries. Statistically, such clustering is studied with spatial econometric models. This paper instead proposes the use of a statistical network approach to model diffusion with a binary outcome variable. Using statistical network instead of spatial econometric models allows for a more natural specification of the diffusion process, assuming autocorrelation in the outcomes rather than the corresponding latent variable, and it simplifies the inclusion of temporal dynamics, higher level interdependencies and interactions between network ties and country-level features. In our simulations, the performance of the Stochastic Actor-Oriented Model...
more | pdf | html
Figures
Tweets
arxiv_org: Modelling Diffusion through Statistical Network Analysis: A Simulation Study. https://t.co/yQPeKIaoJl https://t.co/Ogb8nm6EiV
jelkink: Working paper with @thomasgrundUCD on using Stochastic Actor-Oriented Models for (static) spatial diffusion analysis now online at https://t.co/zf6yn2NHPr @ucdpolitics @ucdsociology
StatsPapers: Modelling Diffusion through Statistical Network Analysis: A Simulation Study. https://t.co/6Q0PTXYkkN
BrianKrent: RT @arxiv_org: Modelling Diffusion through Statistical Network Analysis: A Simulation Study. https://t.co/yQPeKIaoJl https://t.co/Ogb8nm6EiV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 11344
Unqiue Words: 3189

2.01 Mikeys
#2. Prescriptive Cluster-Dependent Support Vector Machines with an Application to Reducing Hospital Readmissions
Taiyao Wang, Ioannis Ch. Paschalidis
We augment linear Support Vector Machine (SVM) classifiers by adding three important features: (i) we introduce a regularization constraint to induce a sparse classifier; (ii) we devise a method that partitions the positive class into clusters and selects a sparse SVM classifier for each cluster; and (iii) we develop a method to optimize the values of controllable variables in order to reduce the number of data points which are predicted to have an undesirable outcome, which, in our setting, coincides with being in the positive class. The latter feature leads to personalized prescriptions/recommendations. We apply our methods to the problem of predicting and preventing hospital readmissions within 30-days from discharge for patients that underwent a general surgical procedure. To that end, we leverage a large dataset containing over 2.28 million patients who had surgeries in the period 2011--2014 in the U.S. The dataset has been collected as part of the American College of Surgeons National Surgical Quality Improvement Program (NSQIP).
more | pdf | html
Figures
Tweets
arxivml: "Prescriptive Cluster-Dependent Support Vector Machines with an Application to Reducing Hospital Readmissions", Tai… https://t.co/qtDNwoPRD7
arxiv_cs_LG: Prescriptive Cluster-Dependent Support Vector Machines with an Application to Reducing Hospital Readmissions. Taiyao Wang and Ioannis Ch. Paschalidis https://t.co/bnCIlev7Qm
Memoirs: Prescriptive Cluster-Dependent Support Vector Machines with an Application to Reducing Hospital Readmissions. https://t.co/19tU3dPkMc
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 5373
Unqiue Words: 1971

2.009 Mikeys
#3. Large-Scale Online Experimentation with Quantile Metrics
Min Liu, Xiaohui Sun, Maneesh Varshney, Ya Xu
Online experimentation (or A/B testing) has been widely adopted in industry as the gold standard for measuring product impacts. Despite the wide adoption, few literatures discuss A/B testing with quantile metrics. Quantile metrics, such as 90th percentile page load time, are crucial to A/B testing as many key performance metrics including site speed and service latency are defined as quantiles. However, with LinkedIn's data size, quantile metric A/B testing is extremely challenging because there is no statistically valid and scalable variance estimator for the quantile of dependent samples: the bootstrap estimator is statistically valid, but takes days to compute; the standard asymptotic variance estimate is scalable but results in order-of-magnitude underestimation. In this paper, we present a statistically valid and scalable methodology for A/B testing with quantiles that is fully generalizable to other A/B testing platforms. It achieves over 500 times speed up compared to bootstrap and has only $2\%$ chance to differ from...
more | pdf | html
Figures
None.
Tweets
arxiv_org: Large-Scale Online Experimentation with Quantile Metrics. https://t.co/hkDKE7onLm https://t.co/yEUfICfT5N
StatsPapers: Large-Scale Online Experimentation with Quantile Metrics. https://t.co/yW1mFXiShj
JDRedding: RT @arxiv_org: Large-Scale Online Experimentation with Quantile Metrics. https://t.co/hkDKE7onLm https://t.co/yEUfICfT5N
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 4715
Unqiue Words: 1442

2.006 Mikeys
#4. A Method for Measuring Network Effects of One-to-One Communication Features in Online A/B Tests
Guillaume Saint-Jacques, James Eric Sorenson, Nanyu Chen, Ya Xu
A/B testing is an important decision making tool in product development because can provide an accurate estimate of the average treatment effect of a new features, which allows developers to understand how the business impact of new changes to products or algorithms. However, an important assumption of A/B testing, Stable Unit Treatment Value Assumption (SUTVA), is not always a valid assumption to make, especially for products that facilitate interactions between individuals. In contexts like one-to-one messaging we should expect network interference; if an experimental manipulation is effective, behavior of the treatment group is likely to influence members in the control group by sending them messages, violating this assumption. In this paper, we propose a novel method that can be used to account for network effects when A/B testing changes to one-to-one interactions. Our method is an edge-based analysis that can be applied to standard Bernoulli randomized experiments to retrieve an average treatment effect that is not...
more | pdf | html
Figures
Tweets
arxiv_org: A Method for Measuring Network Effects of One-to-One Communication Features in Online A/B... https://t.co/22IQigF5xE https://t.co/Jd7FEhXt4k
StatsPapers: A Method for Measuring Network Effects of One-to-One Communication Features in Online A/B Tests. https://t.co/JUBSK1s7Nf
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 8584
Unqiue Words: 2207

2.001 Mikeys
#5. Optimal Intermittent Measurements for Tumor Tracking in X-ray Guided Radiotherapy
Antoine Aspeel, Damien Dasnoy, Raphaël M. Jungers, Benoît Macq
In radiation therapy, tumor tracking is a challenging task that allows a better dose delivery. One practice is to acquire X-ray images in real-time during treatment, that are used to estimate the tumor location. These informations are used to predict the close future tumor trajectory. Kalman prediction is a classical approach for this task. The main drawback of X-ray acquisition is that it irradiates the patient, including its healthy tissues. In the classical Kalman framework, X-ray measurements are taken regularly, i.e. at a constant rate. In this paper, we propose a new approach which relaxes this constraint in order to take measurements when they are the most useful. Our aim is for a given budget of measurements to optimize the tracking process. This idea naturally brings to an optimal intermittent Kalman predictor for which measurement times are selected to minimize the mean squared prediction error over the complete fraction. This optimization problem can be solved directly when the respiratory model has been identified and...
more | pdf | html
Figures
None.
Tweets
StatsPapers: Optimal Intermittent Measurements for Tumor Tracking in X-ray Guided Radiotherapy. https://t.co/zPXnXLwkx9
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 3985
Unqiue Words: 1374

2.001 Mikeys
#6. Is Basketball a Game of Runs?
Mark F. Schilling
Basketball is often referred to as "a game of runs." We investigate the appropriateness of this claim using data from the full NBA 2016-17 season, comparing actual longest runs of scoring events to what long run theory predicts under the assumption that team "momentum" is not present. We provide several different variations of the analysis. Our results consistently indicate that the lengths of longest runs in NBA games are no longer than those that would occur naturally when scoring events are generated by a random process, rather than one that is influenced by "momentum".
more | pdf | html
Figures
None.
Tweets
StatsPapers: Is Basketball a Game of Runs?. https://t.co/AqCd7il3u2
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 2964
Unqiue Words: 1052

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 99,599 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 99,599 papers.