Top 10 Arxiv Papers Today in Statistics


2.245 Mikeys
#1. A General Method for Amortizing Variational Filtering
Joseph Marino, Milan Cvitkovic, Yisong Yue
We introduce the variational filtering EM algorithm, a simple, general-purpose method for performing variational inference in dynamical latent variable models using information from only past and present variables, i.e. filtering. The algorithm is derived from the variational objective in the filtering setting and consists of an optimization procedure at each time step. By performing each inference optimization procedure with an iterative amortized inference model, we obtain a computationally efficient implementation of the algorithm, which we call amortized variational filtering. We present experiments demonstrating that this general-purpose method improves performance across several deep dynamical latent variable models.
more | pdf | html
Figures
Tweets
BrundageBot: A General Method for Amortizing Variational Filtering. Joseph Marino, Milan Cvitkovic, and Yisong Yue https://t.co/ZSknyhPjJW
Memoirs: A General Method for Amortizing Variational Filtering. https://t.co/V8T5zidWDf
Github

PyTorch implementation of AVF

Repository: amortized-variational-filtering
User: joelouismarino
Language: Python
Stargazers: 8
Subscribers: 4
Forks: 1
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6968
Unqiue Words: 2126

2.027 Mikeys
#2. The Augmented Synthetic Control Method
Eli Ben-Michael, Avi Feller, Jesse Rothstein
The synthetic control method (SCM) is a popular approach for estimating the impact of a treatment on a single unit in panel data settings. The "synthetic control" is a weighted average of control units that balances the treated unit's pre-treatment outcomes as closely as possible. The curse of dimensionality, however, means that SCM does not generally achieve exact balance, which can bias the SCM estimate. We propose an extension, Augmented SCM, which uses an outcome model to estimate the bias due to covariate imbalance and then de-biases the original SCM estimate, analogous to bias correction for inexact matching. We motivate this approach by showing that SCM is a (regularized) inverse propensity score weighting estimator, with pre-treatment outcomes as covariates and a ridge penalty on the propensity score coefficients. We give theoretical guarantees for specific cases and propose a new inference procedure. We demonstrate gains from Augmented SCM with extensive simulation studies and apply this framework to canonical...
more | pdf | html
Figures
None.
Tweets
F_Bethke: New paper explaining the "Augmented Synthetic Control Method" by Ben-Michael et al. Also provides new augsynth #rstats package. https://t.co/2Mok5we0bQ #DataScience #Econometrics
StatsPapers: The Augmented Synthetic Control Method. https://t.co/L1Yigm3QJV
econometriclub: RT @F_Bethke: New paper explaining the "Augmented Synthetic Control Method" by Ben-Michael et al. Also provides new augsynth #rstats package. https://t.co/HrNqJVLnN7 #DataScience #Econometrics
lihua_lei_stat: RT @StatsPapers: The Augmented Synthetic Control Method. https://t.co/L1Yigm3QJV
Github

Augmented Synthetic Control Method

Repository: augsynth
User: ebenmichael
Language: R
Stargazers: 2
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 19346
Unqiue Words: 3589

2.013 Mikeys
#3. Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods
Arnold Salas, Stefan Zohren, Stephen Roberts
We introduce a novel framework for the estimation of the posterior distribution of the weights of a neural network, based on a new probabilistic interpretation of adaptive subgradient algorithms such as AdaGrad and Adam. Having a confidence measure of the weights allows several shortcomings of neural networks to be addressed. In particular, the robustness of the network can be improved by performing weight pruning based on signal-to-noise ratios from the weight posterior distribution. Using the MNIST dataset, we demonstrate that the empirical performance of Badam, a particular instance of our framework based on Adam, is competitive in comparison to related Bayesian approaches such as Bayes By Backprop.
more | pdf | html
Figures
Tweets
arxiv_org: Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. https://t.co/nHhGN1lYFo https://t.co/G38HypOqTg
arxivml: "Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods", Arnold Salas, Stefan Zohren, Ste… https://t.co/3kXbNOjrmQ
nmfeeds: [O] https://t.co/EoqqDFYWuc Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. We introduce ...
hg_phys: RT @arxiv_org: Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. https://t.co/nHhGN1lYFo https://t.co/G38Hyp…
AssistedEvolve: RT @arxiv_org: Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. https://t.co/nHhGN1lYFo https://t.co/G38Hyp…
DataForager: RT @arxiv_org: Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. https://t.co/nHhGN1lYFo https://t.co/G38Hyp…
shubh_300595: RT @arxiv_org: Practical Bayesian Learning of Neural Networks via Adaptive Subgradient Methods. https://t.co/nHhGN1lYFo https://t.co/G38Hyp…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 5967
Unqiue Words: 1882

2.01 Mikeys
#4. A Fundamental Measure of Treatment Effect Heterogeneity
Jonathan Levy, Mark van der Laan, Alan Hubbard, Romain Pirracchio
In this paper we offer an asymptotically efficient, non-parametric way to assess treatment effect variability via the conditional average treatment effect (CATE) which is a function of the measured confounders or strata, giving the average treatment effect for a given stratum. We can ask the two main questions of the CATE function: What are its mean and variance? The mean gives the more easily estimable and well-studied average treatment effect whereas CATE variance measures reliability of treatment or the extent of effect modification. With the knowledge of CATE variance and hence, CATE standard deviation, a doctor or policy analyst can give a precise statement as to what an individual patient can expect, which we distinguish as clinical effect heterogeneity. We can also assess how much precision in treatment can be gained in assigning treatments based on patient covariates. Through simulations we will verify some of the theoretical properties of our proposed estimator and we will also point out some of the challenges in...
more | pdf | html
Figures
Tweets
arxiv_org: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/h792OV4p6p https://t.co/tllcbh2ncR
StatsPapers: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/0YRnlDiJl4
SLaurelWeldon: RT @StatsPapers: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/0YRnlDiJl4
DrPjenFI: RT @arxiv_org: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/h792OV4p6p https://t.co/tllcbh2ncR
RexDouglass: RT @StatsPapers: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/0YRnlDiJl4
MaverickPramit: RT @arxiv_org: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/h792OV4p6p https://t.co/tllcbh2ncR
danielk_oxsci: RT @arxiv_org: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/h792OV4p6p https://t.co/tllcbh2ncR
shubh_300595: RT @arxiv_org: A Fundamental Measure of Treatment Effect Heterogeneity. https://t.co/h792OV4p6p https://t.co/tllcbh2ncR
Github

controlling the noise on Simulations

Repository: Simulations
User: jlstiles
Language: R
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 1
Youtube
None.
Other stats
Sample Sizes : [1000]
Authors: 4
Total Words: 13370
Unqiue Words: 2983

2.001 Mikeys
#5. The doctrinal paradox: ROC analysis in a probabilistic framework
Aureli Alabert, Mercè Farré
The doctrinal paradox is analysed from a probabilistic point of view assuming a simple parametric model for the committee's behaviour. The well known issue-by-issue and case-by-case majority rules are compared in this model, by means of the concepts of false positive rate (FPR), false negative rate (FNR) and Receiver Operating Characteristics (ROC) space. We introduce also a new rule that we call path-by-path, which is somehow halfway between the other two. Under our model assumptions, the issue-by-issue rule is shown to be the best of the three according to an optimality criterion based in ROC maps, for all values of the model parameters (committee size and competence of its members), when equal weight is given to FPR an FNR. For unequal weights, the relative goodness of the rules depends on the values of the competence and the weights, in a way which is precisely described. The results are illustrated with some numerical examples.
more | pdf | html
Figures
None.
Tweets
StatsPapers: The doctrinal paradox: ROC analysis in a probabilistic framework. https://t.co/fBoqHivb2t
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 13346
Unqiue Words: 2763

2.001 Mikeys
#6. Adversarial Uncertainty Quantification in Physics-Informed Neural Networks
Yibo Yang, Paris Perdikaris
We present a deep learning framework for quantifying and propagating uncertainty in systems governed by non-linear differential equations using physics-informed neural networks. Specifically, we employ latent variable models to construct probabilistic representations for the system states, and put forth an adversarial inference procedure for training them on data, while constraining their predictions to satisfy given physical laws expressed by partial differential equations. Such physics-informed constraints provide a regularization mechanism for effectively training deep generative models as surrogates of physical systems in which the cost of data acquisition is high, and training data-sets are typically small. This provides a flexible framework for characterizing uncertainty in the outputs of physical systems due to randomness in their inputs or noise in their observations that entirely bypasses the need for repeatedly sampling expensive experiments or numerical simulators. We demonstrate the effectiveness of our approach...
more | pdf | html
Figures
Tweets
BrundageBot: Adversarial Uncertainty Quantification in Physics-Informed Neural Networks. Yibo Yang and Paris Perdikaris https://t.co/gCDmwRdFsm
arxivml: "Adversarial Uncertainty Quantification in Physics-Informed Neural Networks", Yibo Yang, Paris Perdikaris https://t.co/gXktyBgX1n
nmfeeds: [O] https://t.co/8IP7QgFApC Adversarial Uncertainty Quantification in Physics-Informed Neural Networks. We present a deep ...
Memoirs: Adversarial Uncertainty Quantification in Physics-Informed Neural Networks. https://t.co/12eOeG6ODR
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 9971
Unqiue Words: 2649

2.0 Mikeys
#7. Variational Bayesian hierarchical regression for data analysis
Dennis Becker
Collected data, which is used for analysis or prediction tasks, often have a hierarchical structure, for example, data from various people performing the same task. Modeling the data's structure can improve the reliability of the derived results and prediction performance of newly unobserved data. Bayesian modeling provides a tool-kit for designing hierarchical models. However, Markov Chain Monte Carlo methods which are commonly used for parameter estimation are computationally expensive. This often renders its use for many applications not applicable. However, variational Bayesian methods allow to derive an approximation with much less computational effort. This document describes the derivation of a variational approximation for a hierarchical linear Bayesian regression and demonstrates its application to data analysis.
more | pdf | html
Figures
Tweets
StatsPapers: Variational Bayesian hierarchical regression for data analysis. https://t.co/t68MXqK3Hd
Github

Hierarchical Bayesian Regression

Repository: hBReg
User: dennisthemenace2
Language: R
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 6431
Unqiue Words: 1710

1.998 Mikeys
#8. Ball: An R package for detecting distribution difference and association in metric spaces
Jin Zhu, Wenliang Pan, Wei Zheng, Xueqin Wang
The rapid development of modern technology facilitates the appearance of numerous unprecedented complex data which do not satisfy the axioms of Euclidean geometry, while most of the statistical hypothesis tests are available in Euclidean or Hilbert spaces. To properly analyze the data of more complicated structures, efforts have been made to solve the fundamental test problems in more general spaces. In this paper, a publicly available R package Ball is provided to implement Ball statistical test procedures for K-sample distribution comparison and test of mutual independence in metric spaces, which extend the test procedures for two sample distribution comparison and test of independence. The tailormade algorithms as well as engineering techniques are employed on the Ball package to speed up computation to the best of our ability. Two real data analyses and several numerical studies have been performed and the results certify the powerfulness of Ball package in analyzing complex data, e.g., spherical data and symmetric positive...
more | pdf | html
Figures
Tweets
StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
RexDouglass: RT @StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
madsyair: RT @StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 9591
Unqiue Words: 2672

1.997 Mikeys
#9. Post-randomization Biomarker Effect Modification in an HIV Vaccine Clinical Trial
Peter B. Gilbert, Bryan S. Blette, Bryan E. Shepherd, Michael G. Hudgens
While the HVTN 505 trial showed no overall efficacy of the tested vaccine to prevent HIV infection over placebo, previous studies, biological theories, and the finding that immune response markers strongly correlated with infection in vaccine recipients generated the hypothesis that a qualitative interaction occurred. This hypothesis can be assessed with statistical methods for studying treatment effect modification by an intermediate response variable (i.e., principal stratification effect modification (PSEM) methods). However, available PSEM methods make untestable structural risk assumptions, such that assumption-lean versions of PSEM methods are needed in order to surpass the high bar of evidence to demonstrate a qualitative interaction. Fortunately, the survivor average causal effect (SACE) literature is replete with assumption-lean methods that can be readily adapted to the PSEM application for the special case of a binary intermediate response variable. We map this adaptation, opening up a host of new PSEM methods for a...
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : [400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 1600, 400, 800, 1600, 400, 800, 1600, 800, 400, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 8000, 2000, 4000, 8000, 2000, 4000, 8000, 4000, 2000]
Authors: 4
Total Words: 15312
Unqiue Words: 2785

1.997 Mikeys
#10. An external validation of Thais' cardiovascular 10-year risk assessment in the southern Thailand
Suthara Aramcharoen, Ponlapat Satian, Ponlachart Chotikarn, Sipat Triukose
Cardiovascular diseases (CVDs) is a number one cause of death globally. WHO estimated that CVD is a cause of 17.9 million deaths (or 31% of all global deaths) in 2016. It may seem surprising, CVDs can be easily prevented by altering lifestyle to avoid risk factors. The only requirement needed is to know your risk prior. Thai CV Risk score is a trustworthy tool to forecast risk of having cardiovascular event in the future for Thais. This study is an external validation of the Thai CV risk score. We aim to answer two key questions. Firstly, Can Thai CV Risk score developed using dataset of people from central and north western parts of Thailand is applicable to people from other parts of the country? Secondly, Can Thai CV Risk score developed for general public works for hospital's patients who tend to have higher risk? We answer these two questions using a dataset of 1,025 patients (319 males, 35-70 years old) from Lansaka Hospital in the southern Thailand. In brief, we find that the Thai CV risk score works for southern Thais...
more | pdf | html
Figures
Tweets
StatsPapers: An external validation of Thais' cardiovascular 10-year risk assessment in the southern Thailand. https://t.co/RhQH7Hwyox
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 1531
Unqiue Words: 645

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 56,474 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 56,474 papers.