### Top 10 arXiv Papers Today in Statistics

##### #1. Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects
###### Zoe Guan, Giovanni Parmigiani, Prasad Patil
A critical decision point when training predictors using multiple studies is whether these studies should be combined or treated separately. We compare two multi-study learning approaches in the presence of potential heterogeneity in predictor-outcome relationships across datasets. We consider 1) merging all of the datasets and training a single learner, and 2) cross-study learning, which involves training a separate learner on each dataset and combining the resulting predictions. In a linear regression setting, we show analytically and confirm via simulation that merging yields lower prediction error than cross-study learning when the predictor-outcome relationships are relatively homogeneous across studies. However, as heterogeneity increases, there exists a transition point beyond which cross-study learning outperforms merging. We provide analytic expressions for the transition point in various scenarios and study asymptotic properties.
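The trade-off described in this abstract can be explored with a small simulation. The sketch below is a toy illustration only, not the authors' setup: it assumes a one-dimensional, no-intercept linear model, study-specific slopes drawn from a normal random-effects distribution, and equal ensemble weights. All function names and parameter values are hypothetical.

```python
import random

def fit_slope(xs, ys):
    """Least-squares slope for the no-intercept model y = b * x."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

def merged_predictor(studies):
    """Merging: pool every study and fit a single slope."""
    xs = [x for sx, _ in studies for x in sx]
    ys = [y for _, sy in studies for y in sy]
    b = fit_slope(xs, ys)
    return lambda x: b * x

def ensemble_predictor(studies):
    """Cross-study learning: one slope per study, predictions averaged equally."""
    slopes = [fit_slope(sx, sy) for sx, sy in studies]
    return lambda x: sum(b * x for b in slopes) / len(slopes)

def simulate_studies(n_studies, n_per_study, mean_slope, slope_sd, noise_sd, rng):
    """Each study gets its own slope; slope_sd controls heterogeneity (assumed model)."""
    studies = []
    for _ in range(n_studies):
        b = rng.gauss(mean_slope, slope_sd)
        xs = [rng.gauss(0.0, 1.0) for _ in range(n_per_study)]
        ys = [b * x + rng.gauss(0.0, noise_sd) for x in xs]
        studies.append((xs, ys))
    return studies

def mean_squared_error(predict, xs, ys):
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def compare(slope_sd, n_reps=200, seed=0):
    """Average error of both approaches on a freshly drawn, unseen study."""
    rng = random.Random(seed)
    totals = {"merged": 0.0, "ensemble": 0.0}
    for _ in range(n_reps):
        train = simulate_studies(4, 25, 1.0, slope_sd, 1.0, rng)
        test_x, test_y = simulate_studies(1, 50, 1.0, slope_sd, 1.0, rng)[0]
        totals["merged"] += mean_squared_error(merged_predictor(train), test_x, test_y)
        totals["ensemble"] += mean_squared_error(ensemble_predictor(train), test_x, test_y)
    return {name: total / n_reps for name, total in totals.items()}

if __name__ == "__main__":
    # Sweep the heterogeneity level and print both average errors.
    for slope_sd in (0.0, 0.5, 1.0, 2.0):
        print(slope_sd, compare(slope_sd))
```

Sweeping `slope_sd` lets a reader look for a crossover between the two approaches in this toy setting. The analytic transition point itself comes from the paper and is not reproduced here.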
###### Tweets
BrundageBot: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/hhbn0gH40e
arxiv_cs_LG: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/djAxH5mAud
StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
SantchiWeb: RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
RexDouglass: RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

##### #2. Dream Distillation: A Data-Independent Model Compression Framework
###### Kartikeya Bhardwaj, Naveen Suda, Radu Marculescu
Model compression is eminently suited for deploying deep learning on IoT devices. However, existing model compression techniques rely on access to the original or some alternate dataset. In this paper, we address the model compression problem when no real data is available, e.g., when data is private. To this end, we propose Dream Distillation, a data-independent model compression framework. Our experiments show that Dream Distillation can achieve 88.5% accuracy on the CIFAR-10 test set without actually training on the original data!
###### Tweets
BrundageBot: Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/pCaYJKRHdJ
arxiv_cs_LG: Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/3LEtKATLpA
StatsPapers: Dream Distillation: A Data-Independent Model Compression Framework. https://t.co/GdE0nj5nO0
arxiv_cscv: Dream Distillation: A Data-Independent Model Compression Framework https://t.co/OR7SdUWPEL
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

##### #3. Pair Matching: When bandits meet stochastic block model
###### Christophe Giraud, Yann Issartel, Luc Lehéricy, Matthieu Lerasle
The pair-matching problem appears in many applications where one wants to discover good matches between pairs of individuals. Formally, the set of individuals is represented by the nodes of a graph where the edges, unobserved at first, represent the good matches. The algorithm queries pairs of nodes and observes the presence or absence of edges. Its goal is to discover as many edges as possible with a fixed budget of queries. Pair-matching is a particular instance of the multi-armed bandit problem in which the arms are pairs of individuals and the rewards are edges linking these pairs. This bandit problem is non-standard though, as each arm can only be played once. Given this last constraint, sublinear regret can be expected only if the graph presents some underlying structure. This paper shows that sublinear regret is achievable in the case where the graph is generated according to a Stochastic Block Model (SBM) with two communities. Optimal regret bounds are computed for this pair-matching problem. They exhibit a phase...
###### Tweets
arxiv_cs_LG: Pair Matching: When bandits meet stochastic block model. Christophe Giraud, Yann Issartel, Luc Lehéricy, and Matthieu Lerasle https://t.co/EYpEGe368l
StatsPapers: Pair Matching: When bandits meet stochastic block model. https://t.co/r5uUgHYVul
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

##### #4. Online Distributed Estimation of Principal Eigenspaces
###### Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, George Michailidis
Principal components analysis (PCA) is a widely used dimension reduction technique with an extensive range of applications. In this paper, an online distributed algorithm is proposed for recovering the principal eigenspaces. We further establish its rate of convergence and show how it relates to the number of nodes employed in the distributed computation, the effective rank of the data matrix under consideration, and the gap in the spectrum of the underlying population covariance matrix. The proposed algorithm is illustrated on low-rank approximation and $k$-means clustering tasks. The numerical results show a substantial computational speed-up vis-a-vis standard distributed PCA algorithms, without compromising learning accuracy.
###### Tweets
arxiv_cs_LG: Online Distributed Estimation of Principal Eigenspaces. Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, and George Michailidis https://t.co/FKqeOm9dtD
StatsPapers: Online Distributed Estimation of Principal Eigenspaces. https://t.co/6gMItVqJY5
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

##### #5. Comparison of Machine Learning Models in Food Authentication Studies
###### Manokamna Singh, Katarina Domijan
The underlying objective of food authentication studies is to determine whether unknown food samples have been correctly labelled. In this paper we study three near infrared (NIR) spectroscopic datasets from food samples of different types: meat samples (labelled by species), olive oil samples (labelled by their geographic origin) and honey samples (labelled as pure or adulterated by different adulterants). We apply and compare a large number of classification, dimension reduction and variable selection approaches to these datasets. NIR data pose specific challenges to classification and variable selection: the datasets are high-dimensional, with the number of cases ($n$) $\ll$ the number of features ($p$), and the recorded features are highly serially correlated. In this paper we carry out a comparative analysis of different approaches and find that partial least squares, a classic tool employed for these types of data, outperforms all the other approaches considered.
###### Tweets
arxiv_cs_LG: Comparison of Machine Learning Models in Food Authentication Studies. Manokamna Singh and Katarina Domijan https://t.co/Pkbp02wttP
StatsPapers: Comparison of Machine Learning Models in Food Authentication Studies. https://t.co/klONbwU5vu
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

##### #6. Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models
###### Mor Shpigel Nacson, Suriya Gunasekar, Jason D. Lee, Nathan Srebro, Daniel Soudry
With an eye toward understanding complexity control in deep learning, we study how infinitesimal regularization or gradient descent optimization leads to margin maximizing solutions in both homogeneous and non-homogeneous models, extending previous work that focused on infinitesimal regularization only in homogeneous models. To this end we study the limit of loss minimization with a diverging norm constraint (the "constrained path"), relate it to the limit of a "margin path" and characterize the resulting solution. For non-homogeneous ensemble models, whose output is a sum of homogeneous sub-models, we show that this solution discards the shallowest sub-models if they are unnecessary. For homogeneous models, we show convergence to a "lexicographic max-margin solution", and provide conditions under which max-margin solutions are also attained as the limit of unconstrained gradient descent.
###### Tweets
arxiv_cs_LG: Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. Mor Shpigel Nacson, Suriya Gunasekar, Jason D. Lee, Nathan Srebro, and Daniel Soudry https://t.co/jkgY1TRfXR
StatsPapers: Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. https://t.co/59DQssal4J
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 10006
Unique Words: 1898

##### #7. Reduced-order modeling using Dynamic Mode Decomposition and Least Angle Regression
###### John Graff, Xianzhang Xu, Francis D. Lagor, Tarunraj Singh
Dynamic Mode Decomposition (DMD) yields a linear, approximate model of a system's dynamics that is built from data. We seek to reduce the order of this model by identifying a reduced set of modes that best fit the output. We adopt a model selection algorithm from statistics and machine learning known as Least Angle Regression (LARS). We modify LARS to be complex-valued and utilize LARS to select DMD modes. We refer to the resulting algorithm as Least Angle Regression for Dynamic Mode Decomposition (LARS4DMD). Sparsity-Promoting Dynamic Mode Decomposition (DMDSP), a popular mode-selection algorithm, serves as a benchmark for comparison. Numerical results from a Poiseuille flow test problem show that LARS4DMD yields reduced-order models that have comparable performance to DMDSP. LARS4DMD has the added benefit that the regularization weighting parameter required for DMDSP is not needed.
###### Tweets
StatsPapers: Reduced-order modeling using Dynamic Mode Decomposition and Least Angle Regression. https://t.co/E4NGkKiR5a
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7360
Unique Words: 1801

##### #8. Non-negative matrix factorization based on generalized dual divergence
###### Karthik Devarajan
A theoretical framework for non-negative matrix factorization based on generalized dual Kullback-Leibler divergence, which includes members of the exponential family of models, is proposed. A family of algorithms is developed using this framework and its convergence proven using the Expectation-Maximization algorithm. The proposed approach generalizes some existing methods for different noise structures and contrasts with the recently proposed quasi-likelihood approach, thus providing a useful alternative for non-negative matrix factorizations. A measure to evaluate the goodness-of-fit of the resulting factorization is described. This framework can be adapted to include penalty, kernel and discriminant functions as well as tensors.
###### Tweets
StatsPapers: Non-negative matrix factorization based on generalized dual divergence. https://t.co/7Crm0cJqS8
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 4815
Unique Words: 1362

##### #9. Model interpretation through lower-dimensional posterior summarization
###### Spencer Woody, Carlos M. Carvalho, Jared S. Murray
Nonparametric regression models have recently surged in their power and popularity, accompanying the trend of increasing dataset size and complexity. While these models have proven their predictive ability in empirical settings, they are often difficult to interpret, and by themselves often do not address the underlying inferential goals of the analyst or decision maker. In this paper, we propose a modular two-stage approach for creating parsimonious, interpretable summaries of complex models which allow freedom in the choice of modeling technique and the inferential target. In the first stage, a flexible model is fit which is believed to be as accurate as possible. Then, in the second stage, a lower-dimensional summary model is fit which is suited to interpretably explain global or local predictive trends in the original model. The summary is refined and refitted as necessary to give adequate explanations of the original model, and we provide heuristics for this summary search. Our methodology is an example of posterior...
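The two stages described in this abstract can be illustrated with a deliberately simple sketch. It is not the authors' method: a k-nearest-neighbour average stands in for the flexible first-stage model, the summary is a straight line fitted to that model's predictions, and only point predictions are used, so the posterior aspect named in the title is not attempted. All names below are hypothetical.

```python
def knn_predict(train_x, train_y, x, k=5):
    """Stage 1 stand-in: average the outcomes of the k nearest training points."""
    nearest = sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - x))[:k]
    return sum(train_y[i] for i in nearest) / len(nearest)

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def linear_summary(train_x, train_y, grid, k=5):
    """Stage 2: fit a line to the flexible model's predictions, not to the raw data.

    Returns (intercept, slope, r2), where r2 measures how much of the variation
    in the model's predictions over the grid the linear summary reproduces.
    """
    fitted = [knn_predict(train_x, train_y, x, k) for x in grid]
    intercept, slope = fit_line(grid, fitted)
    ss_res = sum((f - (intercept + slope * x)) ** 2 for x, f in zip(grid, fitted))
    mean_f = sum(fitted) / len(fitted)
    ss_tot = sum((f - mean_f) ** 2 for f in fitted)
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else 1.0
    return intercept, slope, r2
```

A low `r2` in this sketch would suggest that the linear summary is too coarse and should be refined, which loosely mirrors the refine-and-refit step mentioned in the abstract.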
###### Tweets
StatsPapers: Model interpretation through lower-dimensional posterior summarization. https://t.co/xlNAciQFgB
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

##### #10. Colombian Women's Life Patterns: A Multivariate Density Regression Approach
Women in Latin America and the Caribbean face difficulties related to the patriarchal traits of their societies. In Colombia, the well-known conflict afflicting the country since 1948 has increased the risk for vulnerable groups. It is important to determine if recent efforts to improve the welfare of women have had a positive effect extending beyond the capital, Bogota. In an initial endeavor to shed light on this matter, we analyze cross-sectional data arising from the Demographic and Health Survey Program. Our aim is to study the relationship between baseline socio-demographic factors and variables associated with fertility, partnership patterns, and work activity. To best exploit the explanatory structure, we propose a Bayesian multivariate density regression model, which can capture nonlinear regression functions and allow for non-standard features in the errors, such as asymmetry or multi-modality. The model has interpretable covariate-dependent weights constructed through normalization, allowing for combinations of...
###### Tweets
StatsPapers: Colombian Women's Life Patterns: A Multivariate Density Regression Approach. https://t.co/DXKnTp0qHF
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 128,326 papers.
