A critical decision point when training predictors using multiple studies is
whether these studies should be combined or treated separately. We compare two
multi-study learning approaches in the presence of potential heterogeneity in
predictor-outcome relationships across datasets. We consider 1) merging all of
the datasets and training a single learner, and 2) cross-study learning, which
involves training a separate learner on each dataset and combining the
resulting predictions. In a linear regression setting, we show analytically and
confirm via simulation that merging yields lower prediction error than
cross-study learning when the predictor-outcome relationships are relatively
homogeneous across studies. However, as heterogeneity increases, there exists a
transition point beyond which cross-study learning outperforms merging. We
provide analytic expressions for the transition point in various scenarios and
study asymptotic properties.

more |
pdf
| html
None.

BrundageBot:
Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/hhbn0gH40e

arxiv_cs_LG:
Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. Zoe Guan, Giovanni Parmigiani, and Prasad Patil https://t.co/djAxH5mAud

StatsPapers:
Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF

SantchiWeb:
RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF

RexDouglass:
RT @StatsPapers: Merging versus Ensembling in Multi-Study Machine Learning: Theoretical Insight from Random Effects. https://t.co/ntKkrgSDdF

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

Model compression is eminently suited for deploying deep learning on
IoT-devices. However, existing model compression techniques rely on access to
the original or some alternate dataset. In this paper, we address the model
compression problem when no real data is available, e.g., when data is private.
To this end, we propose Dream Distillation, a data-independent model
compression framework. Our experiments show that Dream Distillation can achieve
88.5% accuracy on the CIFAR-10 test set without actually training on the
original data!

more |
pdf
| html
None.

BrundageBot:
Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/pCaYJKRHdJ

arxiv_cs_LG:
Dream Distillation: A Data-Independent Model Compression Framework. Kartikeya Bhardwaj, Naveen Suda, and Radu Marculescu https://t.co/3LEtKATLpA

StatsPapers:
Dream Distillation: A Data-Independent Model Compression Framework. https://t.co/GdE0nj5nO0

arxiv_cscv:
Dream Distillation: A Data-Independent Model Compression Framework https://t.co/OR7SdUWPEL

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

The underlying objective of food authentication studies is to determine
whether unknown food samples have been correctly labelled. In this paper we
study three near infrared (NIR) spectroscopic datasets from food samples of
different types: meat samples (labelled by species), olive oil samples
(labelled by their geographic origin) and honey samples (labelled as pure or
adulterated by different adulterants). We apply and compare a large number of
classification, dimension reduction and variable selection approaches to these
datasets. NIR data pose specific challenges to classification and variable
selection: the datasets are high - dimensional where the number of cases ($n$)
$<<$ number of features ($p$) and the recorded features are highly serially
correlated. In this paper we carry out comparative analysis of different
approaches and find that partial least squares, a classic tool employed for
these types of data, outperforms all the other approaches considered.

more |
pdf
| html
None.

arxiv_cs_LG:
Comparison of Machine Learning Models in Food Authentication Studies. Manokamna Singh and Katarina Domijan https://t.co/Pkbp02wttP

StatsPapers:
Comparison of Machine Learning Models in Food Authentication Studies. https://t.co/klONbwU5vu

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

With an eye toward understanding complexity control in deep learning, we
study how infinitesimal regularization or gradient descent optimization lead to
margin maximizing solutions in both homogeneous and non-homogeneous models,
extending previous work that focused on infinitesimal regularization only in
homogeneous models. To this end we study the limit of loss minimization with a
diverging norm constraint (the "constrained path"), relate it to the limit of a
"margin path" and characterize the resulting solution. For non-homogeneous
ensemble models, which output is a sum of homogeneous sub-models, we show that
this solution discards the shallowest sub-models if they are unnecessary. For
homogeneous models, we show convergence to a "lexicographic max-margin
solution", and provide conditions under which max-margin solutions are also
attained as the limit of unconstrained gradient descent.

more |
pdf
| html
None.

arxiv_cs_LG:
Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. Mor Shpigel Nacson, Suriya Gunasekar, Jason D. Lee, Nathan Srebro, and Daniel Soudry https://t.co/jkgY1TRfXR

StatsPapers:
Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models. https://t.co/59DQssal4J

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 10006

Unqiue Words: 1898

Principal components analysis (PCA) is a widely used dimension reduction
technique with an extensive range of applications. In this paper, an online
distributed algorithm is proposed for recovering the principal eigenspaces. We
further establish its rate of convergence and show how it relates to the number
of nodes employed in the distributed computation, the effective rank of the
data matrix under consideration, and the gap in the spectrum of the underlying
population covariance matrix. The proposed algorithm is illustrated on low-rank
approximation and $\boldsymbol{k}$-means clustering tasks. The numerical
results show a substantial computational speed-up vis-a-vis standard
distributed PCA algorithms, without compromising learning accuracy.

more |
pdf
| html
None.

arxiv_cs_LG:
Online Distributed Estimation of Principal Eigenspaces. Davoud Ataee Tarzanagh, Mohamad Kazem Shirani Faradonbeh, and George Michailidis https://t.co/FKqeOm9dtD

StatsPapers:
Online Distributed Estimation of Principal Eigenspaces. https://t.co/6gMItVqJY5

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 0

Unqiue Words: 0

The pair-matching problem appears in many applications where one wants to
discover good matches between pairs of individuals. Formally, the set of
individuals is represented by the nodes of a graph where the edges, unobserved
at first, represent the good matches. The algorithm queries pairs of nodes and
observes the presence/absence of edges. Its goal is to discover as many edges
as possible with a fixed budget of queries. Pair-matching is a particular
instance of multi-armed bandit problem in which the arms are pairs of
individuals and the rewards are edges linking these pairs. This bandit problem
is non-standard though, as each arm can only be played once.
Given this last constraint, sublinear regret can be expected only if the
graph presents some underlying structure. This paper shows that sublinear
regret is achievable in the case where the graph is generated according to a
Stochastic Block Model (SBM) with two communities. Optimal regret bounds are
computed for this pair-matching problem. They exhibit a phase...

more |
pdf
| html
None.

arxiv_cs_LG:
Pair Matching: When bandits meet stochastic block model. Christophe Giraud, Yann Issartel, Luc Lehéricy, and Matthieu Lerasle https://t.co/EYpEGe368l

StatsPapers:
Pair Matching: When bandits meet stochastic block model. https://t.co/r5uUgHYVul

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 0

Unqiue Words: 0

A theoretical framework for non-negative matrix factorization based on
generalized dual Kullback-Leibler divergence, which includes members of the
exponential family of models, is proposed. A family of algorithms is developed
using this framework and its convergence proven using the
Expectation-Maximization algorithm. The proposed approach generalizes some
existing methods for different noise structures and contrasts with the recently
proposed quasi-likelihood approach, thus providing a useful alternative for
non-negative matrix factorizations. A measure to evaluate the goodness-of-fit
of the resulting factorization is described. This framework can be adapted to
include penalty, kernel and discriminant functions as well as tensors.

more |
pdf
| html
None.

StatsPapers:
Non-negative matrix factorization based on generalized dual divergence. https://t.co/7Crm0cJqS8

None.

None.

Sample Sizes : None.

Authors: 1

Total Words: 4815

Unqiue Words: 1362

Dynamic Mode Decomposition (DMD) yields a linear, approximate model of a
system's dynamics that is built from data. We seek to reduce the order of this
model by identifying a reduced set of modes that best fit the output. We adopt
a model selection algorithm from statistics and machine learning known as Least
Angle Regression (LARS). We modify LARS to be complex-valued and utilize LARS
to select DMD modes. We refer to the resulting algorithm as Least Angle
Regression for Dynamic Mode Decomposition (LARS4DMD). Sparsity-Promoting
Dynamic Mode Decomposition (DMDSP), a popular mode-selection algorithm, serves
as a benchmark for comparison. Numerical results from a Poiseuille flow test
problem show that LARS4DMD yields reduced-order models that have comparable
performance to DMDSP. LARS4DMD has the added benefit that the regularization
weighting parameter required for DMDSP is not needed.

more |
pdf
| html
StatsPapers:
Reduced-order modeling using Dynamic Mode Decomposition and Least Angle Regression. https://t.co/E4NGkKiR5a

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 7360

Unqiue Words: 1801

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 128,327 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible