Top 10 Arxiv Papers Today in Statistics


2.293 Mikeys
#1. Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects
Zoe Guan, Giovanni Parmigiani, Prasad Patil
A critical decision point when training predictors using multiple studies is whether these studies should be combined or treated separately. We compare two multi-study learning approaches in the presence of potential heterogeneity in predictor-outcome relationships across datasets. We consider 1) merging all of the datasets and training a single learner, and 2) cross-study learning, which involves training a separate learner on each dataset and combining the resulting predictions. In a linear regression setting, we show analytically and confirm via simulation that merging yields lower prediction error than cross-study learning when the predictor-outcome relationships are relatively homogeneous across studies. However, as heterogeneity increases, there exists a transition point beyond which cross-study learning outperforms merging. We provide analytic expressions for the transition point in various scenarios and study asymptotic properties.
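A rough way to see the trade-off described in this abstract is a toy random-effects simulation. The sketch below is not the authors' code or their simulation design. It assumes a single predictor with no intercept, study-specific slopes drawn as beta0 + tau * N(0, 1), unit-variance noise, and an equal-weight ensemble; the names `fit_slope` and `simulate` and all parameter values are hypothetical.

```python
import random


def fit_slope(xs, ys):
    """Least-squares slope for the no-intercept model y = b * x."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


def simulate(tau, n_studies=5, n_per_study=5, reps=4000, beta0=1.0, seed=0):
    """Return (merged_error, ensemble_error), averaged over `reps` replicates.

    tau is the standard deviation of the study-specific slopes around beta0
    (the heterogeneity). Error is the squared distance of the estimated slope
    from beta0; for a new study drawn from the same distribution, prediction
    error would add the same tau-dependent term to both methods.
    """
    rng = random.Random(seed)
    err_merged = 0.0
    err_ensemble = 0.0
    for _ in range(reps):
        all_x, all_y, slopes = [], [], []
        for _ in range(n_studies):
            beta_k = beta0 + tau * rng.gauss(0.0, 1.0)  # random-effect slope
            xs = [rng.gauss(0.0, 1.0) for _ in range(n_per_study)]
            ys = [beta_k * x + rng.gauss(0.0, 1.0) for x in xs]
            slopes.append(fit_slope(xs, ys))  # one learner per study
            all_x.extend(xs)
            all_y.extend(ys)
        merged = fit_slope(all_x, all_y)      # one learner on pooled data
        ensemble = sum(slopes) / n_studies    # equal-weight average
        err_merged += (merged - beta0) ** 2
        err_ensemble += (ensemble - beta0) ** 2
    return err_merged / reps, err_ensemble / reps


if __name__ == "__main__":
    for tau in (0.0, 0.5, 1.0, 5.0):
        merged_err, ensemble_err = simulate(tau)
        print(f"tau={tau}: merged={merged_err:.4f} ensemble={ensemble_err:.4f}")
```

Under these assumptions, tau = 0 should tend to favour merging and a large tau should tend to favour the ensemble. Where the crossover falls depends on the made-up settings here, not on the paper's analytic expressions.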
Figures
None.
Tweets
BrundageBot: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/hhbn0gH40e
arxiv_cs_LG: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/djAxH5mAud
StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
SantchiWeb: RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
RexDouglass: RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.277 Mikeys
#2. Dream Distillation: A Data-Independent Model Compression Framework
Kartikeya Bhardwaj, Naveen Suda, Radu Marculescu
Model compression is eminently suited for deploying deep learning on IoT devices. However, existing model compression techniques rely on access to the original or some alternate dataset. In this paper, we address the model compression problem when no real data is available, e.g., when data is private. To this end, we propose Dream Distillation, a data-independent model compression framework. Our experiments show that Dream Distillation can achieve 88.5% accuracy on the CIFAR-10 test set without actually training on the original data!
Figures
None.
Tweets
BrundageBot: Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/pCaYJKRHdJ
arxiv_cs_LG: Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/3LEtKATLpA
StatsPapers: Dream Distillation: A Data-Independent Model Compression Framework. https://t.co/GdE0nj5nO0
arxiv_cscv: Dream Distillation: A Data-Independent Model Compression Framework https://t.co/OR7SdUWPEL
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.102 Mikeys
#3. Pair Matching: When bandits meet stochastic block model
Christophe Giraud, Yann Issartel, Luc Lehéricy, Matthieu Lerasle
The pair-matching problem appears in many applications where one wants to discover good matches between pairs of individuals. Formally, the set of individuals is represented by the nodes of a graph where the edges, unobserved at first, represent the good matches. The algorithm queries pairs of nodes and observes the presence/absence of edges. Its goal is to discover as many edges as possible with a fixed budget of queries. Pair-matching is a particular instance of multi-armed bandit problem in which the arms are pairs of individuals and the rewards are edges linking these pairs. This bandit problem is non-standard though, as each arm can only be played once. Given this last constraint, sublinear regret can be expected only if the graph presents some underlying structure. This paper shows that sublinear regret is achievable in the case where the graph is generated according to a Stochastic Block Model (SBM) with two communities. Optimal regret bounds are computed for this pair-matching problem. They exhibit a phase...
Figures
None.
Tweets
arxiv_cs_LG: Pair Matching: When bandits meet stochastic block model. Christophe Giraud, Yann Issartel, Luc Lehéricy, and Matthieu Lerasle https://t.co/EYpEGe368l
StatsPapers: Pair Matching: When bandits meet stochastic block model. https://t.co/r5uUgHYVul
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

2.102 Mikeys
#4. Online Distributed Estimation of Principal Eigenspaces
Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis
Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications. In this paper, an online distributed algorithm is proposed for recovering the principal eigenspaces. We further establish its rate of convergence and show how it relates to the number of nodes employed in the distributed computation, the effective rank of the data matrix under consideration, and the gap in the spectrum of the underlying population covariance matrix. The proposed algorithm is illustrated on low-rank approximation and $k$-means clustering tasks. The numerical results show a substantial computational speed-up vis-a-vis standard distributed PCA algorithms, without compromising learning accuracy.
Figures
None.
Tweets
arxiv_cs_LG: Online Distributed Estimation of Principal Eigenspaces. Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, and George Michailidis https://t.co/FKqeOm9dtD
StatsPapers: Online Distributed Estimation of Principal Eigenspaces. https://t.co/6gMItVqJY5
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.102 Mikeys
#5. Comparison of Machine Learning Models in Food Authentication Studies
Manokamna Singh, Katarina Domijan
The underlying objective of food authentication studies is to determine whether unknown food samples have been correctly labelled. In this paper we study three near infrared (NIR) spectroscopic datasets from food samples of different types: meat samples (labelled by species), olive oil samples (labelled by their geographic origin) and honey samples (labelled as pure or adulterated by different adulterants). We apply and compare a large number of classification, dimension reduction and variable selection approaches to these datasets. NIR data pose specific challenges to classification and variable selection: the datasets are high-dimensional, with the number of cases ($n$) $\ll$ the number of features ($p$), and the recorded features are highly serially correlated. In this paper we carry out a comparative analysis of different approaches and find that partial least squares, a classic tool employed for these types of data, outperforms all the other approaches considered.
Figures
None.
Tweets
arxiv_cs_LG: Comparison of Machine Learning Models in Food Authentication Studies. Manokamna Singh and Katarina Domijan https://t.co/Pkbp02wttP
StatsPapers: Comparison of Machine Learning Models in Food Authentication Studies. https://t.co/klONbwU5vu
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.102 Mikeys
#6. Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models
Mor Shpigel Nacson, Suriya Gunasekar, Jason D. Lee, Nathan Srebro, Daniel Soudry
With an eye toward understanding complexity control in deep learning, we study how infinitesimal regularization or gradient descent optimization lead to margin maximizing solutions in both homogeneous and non-homogeneous models, extending previous work that focused on infinitesimal regularization only in homogeneous models. To this end we study the limit of loss minimization with a diverging norm constraint (the "constrained path"), relate it to the limit of a "margin path" and characterize the resulting solution. For non-homogeneous ensemble models, whose output is a sum of homogeneous sub-models, we show that this solution discards the shallowest sub-models if they are unnecessary. For homogeneous models, we show convergence to a "lexicographic max-margin solution", and provide conditions under which max-margin solutions are also attained as the limit of unconstrained gradient descent.
Figures
None.
Tweets
arxiv_cs_LG: Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. Mor Shpigel Nacson, Suriya Gunasekar, Jason D. Lee, Nathan Srebro, and Daniel Soudry https://t.co/jkgY1TRfXR
StatsPapers: Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. https://t.co/59DQssal4J
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 10006
Unique Words: 1898

2.021 Mikeys
#7. Reduced-order modeling using Dynamic Mode Decomposition and Least Angle Regression
John Graff, Xianzhang Xu, Francis D. Lagor, Tarunraj Singh
Dynamic Mode Decomposition (DMD) yields a linear, approximate model of a system's dynamics that is built from data. We seek to reduce the order of this model by identifying a reduced set of modes that best fit the output. We adopt a model selection algorithm from statistics and machine learning known as Least Angle Regression (LARS). We modify LARS to be complex-valued and utilize LARS to select DMD modes. We refer to the resulting algorithm as Least Angle Regression for Dynamic Mode Decomposition (LARS4DMD). Sparsity-Promoting Dynamic Mode Decomposition (DMDSP), a popular mode-selection algorithm, serves as a benchmark for comparison. Numerical results from a Poiseuille flow test problem show that LARS4DMD yields reduced-order models that have comparable performance to DMDSP. LARS4DMD has the added benefit that the regularization weighting parameter required for DMDSP is not needed.
Figures
Tweets
StatsPapers: Reduced-order modeling using Dynamic Mode Decomposition and Least Angle Regression. https://t.co/E4NGkKiR5a
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7360
Unique Words: 1801

2.021 Mikeys
#8. Non-negative matrix factorization based on generalized dual divergence
Karthik Devarajan
A theoretical framework for non-negative matrix factorization based on generalized dual Kullback-Leibler divergence, which includes members of the exponential family of models, is proposed. A family of algorithms is developed using this framework and its convergence proven using the Expectation-Maximization algorithm. The proposed approach generalizes some existing methods for different noise structures and contrasts with the recently proposed quasi-likelihood approach, thus providing a useful alternative for non-negative matrix factorizations. A measure to evaluate the goodness-of-fit of the resulting factorization is described. This framework can be adapted to include penalty, kernel and discriminant functions as well as tensors.
Figures
None.
Tweets
StatsPapers: Non-negative matrix factorization based on generalized dual divergence. https://t.co/7Crm0cJqS8
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 4815
Unique Words: 1362

2.021 Mikeys
#9. Model interpretation through lower-dimensional posterior summarization
Spencer Woody, Carlos M. Carvalho, Jared S. Murray
Nonparametric regression models have recently surged in their power and popularity, accompanying the trend of increasing dataset size and complexity. While these models have proven their predictive ability in empirical settings, they are often difficult to interpret, and by themselves often do not address the underlying inferential goals of the analyst or decision maker. In this paper, we propose a modular two-stage approach for creating parsimonious, interpretable summaries of complex models which allow freedom in the choice of modeling technique and the inferential target. In the first stage, a flexible model is fit which is believed to be as accurate as possible. Then, in the second stage, a lower-dimensional summary model is fit which is suited to interpretably explain global or local predictive trends in the original model. The summary is refined and refitted as necessary to give adequate explanations of the original model, and we provide heuristics for this summary search. Our methodology is an example of posterior...
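The two-stage idea in this abstract can be illustrated with a small point-estimate sketch. This is not the authors' method, which works with posterior summaries. Here a k-nearest-neighbour regressor stands in for the flexible first-stage model, and a least-squares line fitted to that model's predictions serves as the lower-dimensional summary. The function names `knn_predict` and `linear_summary`, and the toy data, are hypothetical.

```python
def knn_predict(train_x, train_y, x, k=3):
    """Stage 1 stand-in: average the outcomes of the k nearest training points."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / k


def linear_summary(xs, preds):
    """Stage 2: fit a line to the model's predictions (not to the raw data).

    Returns (intercept, slope, r2), where r2 measures how much of the
    variation in the flexible model's predictions the line reproduces.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_p = sum(preds) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxp = sum((x - mean_x) * (p - mean_p) for x, p in zip(xs, preds))
    slope = sxp / sxx
    intercept = mean_p - slope * mean_x
    ss_res = sum((p - (intercept + slope * x)) ** 2 for x, p in zip(xs, preds))
    ss_tot = sum((p - mean_p) ** 2 for p in preds)
    return intercept, slope, 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Toy data: a curved relationship that a single line cannot fully capture.
    train_x = [i / 10 for i in range(50)]
    train_y = [x ** 2 for x in train_x]
    grid = [i / 5 for i in range(25)]
    preds = [knn_predict(train_x, train_y, x) for x in grid]
    intercept, slope, r2 = linear_summary(grid, preds)
    print(f"summary: {intercept:.2f} + {slope:.2f} * x, R^2 vs model = {r2:.3f}")
```

A low R^2 against the model's predictions would suggest the summary is too coarse and should be refined, which loosely mirrors the refinement step the abstract mentions.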
Figures
None.
Tweets
StatsPapers: Model interpretation through lower-dimensional posterior summarization. https://t.co/xlNAciQFgB
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.021 Mikeys
#10. Colombian Women's Life Patterns: A Multivariate Density Regression Approach
Sara Wade, Raffaella Piccarreta, Andrea Cremaschi, Isadora Antoniano-Villalobos
Women in Latin America and the Caribbean face difficulties related to the patriarchal traits of their societies. In Colombia, the well-known conflict afflicting the country since 1948 has increased the risk for vulnerable groups. It is important to determine if recent efforts to improve the welfare of women have had a positive effect extending beyond the capital, Bogota. In an initial endeavor to shed light on this matter, we analyze cross-sectional data arising from the Demographic and Health Survey Program. Our aim is to study the relationship between baseline socio-demographic factors and variables associated with fertility, partnership patterns, and work activity. To best exploit the explanatory structure, we propose a Bayesian multivariate density regression model, which can capture nonlinear regression functions and allow for non-standard features in the errors, such as asymmetry or multi-modality. The model has interpretable covariate-dependent weights constructed through normalization, allowing for combinations of...
Figures
None.
Tweets
StatsPapers: Colombian Women's Life Patterns: A Multivariate Density Regression Approach. https://t.co/DXKnTp0qHF
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 128,326 papers.
