This paper studies a classic maximum entropy sampling problem (MESP), which
aims to select the most informative principal submatrix of a prespecified size
from a covariance matrix. MESP has been widely applied to many areas, including
healthcare, power system, manufacturing and data science. By investigating its
Lagrangian dual and primal characterization, we derive a novel convex integer
program for MESP and show that its continuous relaxation yields a near-optimal
solution. The results motivate us to study an efficient sampling algorithm and
develop its approximation bound for MESP, which improves the best-known bound
in literature. We then provide an efficient deterministic implementation of the
sampling algorithm with the same approximation bound. By developing new
mathematical tools for the singular matrices and analyzing the Lagrangian dual
of the proposed convex integer program, we investigate the widely-used local
search algorithm and prove its first-known approximation bound for MESP. The
proof techniques further inspire...

more |
pdf
| html
arxivml:
"Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance G…
https://t.co/jP8dBZadmp

arxiv_cs_LG:
Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees. Yongchun Li and Weijun Xie https://t.co/aBjOCZmtgW

StatsPapers:
Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees. https://t.co/YYsCgoXULK

Stargazers: 0

Subscribers: 1

Subscribers: 1

Forks: 0

Open Issues: 0

Open Issues: 0

None.

Sample Sizes : [100, 1000, 90, 124, 90, 124, 90, 124, 2000]

Authors: 2

Total Words: 25093

Unqiue Words: 3448

Financial institutions obtain enormous amounts of data about user
transactions and money transfers, which can be considered as a large graph
dynamically changing in time. In this work, we focus on the task of predicting
new interactions in the network of bank clients and treat it as a link
prediction problem. We propose a new graph neural network model, which uses not
only the topological structure of the network but rich time-series data
available for the graph nodes and edges. We evaluate the developed method using
the data provided by a large European bank for several years. The proposed
model outperforms the existing approaches, including other neural network
models, with a significant gap in ROC AUC score on link prediction problem and
also allows to improve the quality of credit scoring.

more |
pdf
| html
None.

arxiv_org:
Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LU1aA3VsKa https://t.co/s8xowDoGzr

arxivml:
"Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data",
Valentina Shumovskaia, Kiril…
https://t.co/dmg60L2aQr

arxiv_cs_LG:
Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. Valentina Shumovskaia, Kirill Fedyanin, Ivan Sukharev, Dmitry Berestnev, and Maxim Panov https://t.co/OlYUoDIEuj

StatsPapers:
Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LjpufdjCN4

morioka:
RT @arxiv_org: Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LU1aA3VsKa https://t.co/s8…

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

Autoencoder-based learning has emerged as a staple for disciplining
representations in unsupervised and semi-supervised settings. This paper
analyzes a framework for improving generalization in a purely supervised
setting, where the target space is high-dimensional. We motivate and formalize
the general framework of target-embedding autoencoders (TEA) for supervised
prediction, learning intermediate latent representations jointly optimized to
be both predictable from features as well as predictive of targets---encoding
the prior that variations in targets are driven by a compact set of underlying
factors. As our theoretical contribution, we provide a guarantee of
generalization for linear TEAs by demonstrating uniform stability, interpreting
the benefit of the auxiliary reconstruction task as a form of regularization.
As our empirical contribution, we extend validation of this approach beyond
existing static classification applications to multivariate sequence
forecasting, verifying their advantage on both linear and nonlinear...

more |
pdf
| html
None.

arxiv_org:
Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

BrundageBot:
Target-Embedding Autoencoders for Supervised Representation Learning. Daniel Jarrett and Mihaela van der Schaar https://t.co/c72QNDnwkd

arxivml:
"Target-Embedding Autoencoders for Supervised Representation Learning",
Daniel Jarrett, Mihaela van der Schaar
https://t.co/1FgeYDvzyt

arxiv_cs_LG:
Target-Embedding Autoencoders for Supervised Representation Learning. Daniel Jarrett and Mihaela van der Schaar https://t.co/3K7sPP8Q4N

StatsPapers:
Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/jEOHEpqfMM

RexDouglass:
RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

morioka:
RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

shubh_300595:
RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

MishakinSergey:
RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

IaXZnumDR5D0mAa:
RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

We present an algorithm for supervised learning using tensor networks,
employing a step of preprocessing the data by coarse-graining through a
sequence of wavelet transformations. We represent these transformations as a
set of tensor network layers identical to those in a multi-scale entanglement
renormalization ansatz (MERA) tensor network, and perform supervised learning
and regression tasks through a model based on a matrix product state (MPS)
tensor network acting on the coarse-grained data. Because the entire model
consists of tensor contractions (apart from the initial non-linear feature
map), we can adaptively fine-grain the optimized MPS model backwards through
the layers with essentially no loss in performance. The MPS itself is trained
using an adaptive algorithm based on the density matrix renormalization group
(DMRG) algorithm. We test our methods by performing a classification task on
audio data and a regression task on temperature time-series data, studying the
dependence of training accuracy on the number of...

more |
pdf
| html
None.

arxiv_org:
A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk

arxivml:
"A Multi-Scale Tensor Network Architecture for Classification and Regression",
Justin Reyes, Miles Stoudenmire
https://t.co/5jQuCx6oE6

MLSTjournal:
Interesting new work from @MStoudenmire @FlatironCCQ @FlatironInst - A Multi-Scale Tensor Network Architecture for Classification and Regression - https://t.co/3d6j6rDPPW #ML #MachineLearning #networks

StatsPapers:
A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/HIsO6QjQPV

puneethmishra:
RT @arxiv_org: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk

shubh_300595:
RT @arxiv_org: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

In this paper, we study zeroth-order algorithms for minimax optimization
problems that are nonconvex in one variable and strongly-concave in the other
variable. Such minimax optimization problems have attracted significant
attention lately due to their applications in modern machine learning tasks. We
first design and analyze the Zeroth-Order Gradient Descent Ascent
(\texttt{ZO-GDA}) algorithm, and provide improved results compared to existing
works, in terms of oracle complexity. Next, we propose the Zeroth-Order
Gradient Descent Multi-Step Ascent (\texttt{ZO-GDMSA}) algorithm that
significantly improves the oracle complexity of \texttt{ZO-GDA}. We also
provide stochastic version of \texttt{ZO-GDA} and \texttt{ZO-GDMSA} to handle
stochastic nonconvex minimax problems, and provide oracle complexity results.

more |
pdf
| html
None.

BrundageBot:
Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, and Meisam Razaviyayn https://t.co/s0OLQZvku8

arxiv_cs_LG:
Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, and Meisam Razaviyayn https://t.co/ZzjIXq0aQe

StatsPapers:
Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. https://t.co/DQBnj4ARwH

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 14036

Unqiue Words: 2541

Computer simulations are invaluable tools for scientific discovery. However,
accurate simulations are often slow to execute, which limits their
applicability to extensive parameter exploration, large-scale data analysis,
and uncertainty quantification. A promising route to accelerate simulations by
building fast emulators with machine learning requires large training datasets,
which can be prohibitively expensive to obtain with slow simulations. Here we
present a method based on neural architecture search to build accurate
emulators even with a limited number of training data. The method successfully
accelerates simulations by up to 2 billion times in 10 scientific cases
including astrophysics, climate science, biogeochemistry, high energy density
physics, fusion energy, and seismology, using the same super-architecture,
algorithm, and hyperparameters. Our approach also inherently provides emulator
uncertainty estimation, adding further confidence in their use. We anticipate
this work will accelerate research involving expensive...

more |
pdf
| html
HNTweets:
Up to two billion times acceleration of scientific simulations: https://t.co/Y6xS0xZgQG Comments: https://t.co/dmd1k1Xqtb

hn_frontpage:
Up to two billion times acceleration of scientific simulations
L: https://t.co/xk8dEo2eiy
C: https://t.co/Y0JF3O42nL

hacker_news_hir:
Up to two billion times acceleration of scientific simulations : https://t.co/0uPtvRwJno Comments: https://t.co/qoDB9FZhd5

angsuman:
Up to two billion times acceleration of scientific simulations https://t.co/pXvQbtadXH

StatsPapers:
Up to two billion times acceleration of scientific simulations with deep neural architecture search. https://t.co/dRNCfMIkGk

StarshipBuilder:
Up to two billion times acceleration of scientific simulations with deep neural architecture search
https://t.co/DFD102F6Hq

jreuben1:
Up to two billion times acceleration of scientific simulations with deep neural architecture search https://t.co/DeVEIzkZ2z

zhaffsky:
https://t.co/C4LP1bwAFr https://t.co/C4LP1bwAFr

None.

None.

Sample Sizes : None.

Authors: 13

Total Words: 5209

Unqiue Words: 2036

Value-at-Risk (VaR) and Expected Shortfall (ES) are widely used in the
financial sector to measure the market risk and manage the extreme market
movement. The recent link between the quantile score function and the
Asymmetric Laplace density has led to a flexible likelihood-based framework for
joint modelling of VaR and ES. It is of high interest in financial applications
to be able to capture the underlying joint dynamics of these two quantities. We
address this problem by developing a hybrid model that is based on the
Asymmetric Laplace quasi-likelihood and employs the Long Short-Term Memory
(LSTM) time series modelling technique from Machine Learning to capture
efficiently the underlying dynamics of VaR and ES. We refer to this model as
LSTM-AL. We adopt the adaptive Markov chain Monte Carlo (MCMC) algorithm for
Bayesian inference in the LSTM-AL model. Empirical results show that the
proposed LSTM-AL model can improve the VaR and ES forecasting accuracy over a
range of well-established competing models.

more |
pdf
| html
arxiv_org:
A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Fo... https://t.co/4HHTOFH79U https://t.co/sdSQRJjKu2

arxivml:
"A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting",
Zhengkun Li, …
https://t.co/w7f43KRBXl

arxiv_cs_LG:
A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. Zhengkun Li, Minh-Ngoc Tran, Chao Wang, Richard Gerlach, and Junbin Gao https://t.co/Y4BNuzly79

StatsPapers:
A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38

JAdP:
RT @StatsPapers: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38

ankiytweets:
RT @StatsPapers: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 5886

Unqiue Words: 1759

Large-scale collections of electronic records constitutes both an opportunity
for the development of more accurate prediction models and a threat for
privacy. To limit privacy exposure new privacy-enhancing techniques are
emerging such as federated learning which enables large-scale data analysis
while avoiding the centralization of records in a unique database that would
represent a critical point of failure. Although promising regarding privacy
protection, federated learning prevents using some data-cleaning algorithms
thus inducing new biases. In this work we focus on the recurrent problem of
duplicated records that, if not handled properly, may cause over-optimistic
estimations of a model's performances. We introduce and discuss stratified
cross-validation, a validation methodology that leverages stratification
techniques to prevent data leakage in federated learning settings without
relying on demanding deduplication algorithms.

more |
pdf
| html
None.

arxiv_cs_LG:
Stratified cross-validation for unbiased and privacy-preserving federated learning. R. Bey, R. Goussault, M. Benchoufi, and R. Porcher https://t.co/tLdp8FRDPH

StatsPapers:
Stratified cross-validation for unbiased and privacy-preserving federated learning. https://t.co/Kt6mmHSMor

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 7150

Unqiue Words: 2559

Uncertainty quantification for deep learning is a challenging open problem.
Bayesian statistics offer a mathematically grounded framework to reason about
uncertainties; however, approximate posteriors for modern neural networks still
require prohibitive computational costs. We propose a family of algorithms
which split the classification task into two stages: representation learning
and uncertainty estimation. We compare four specific instances, where
uncertainty estimation is performed via either an ensemble of Stochastic
Gradient Descent or Stochastic Gradient Langevin Dynamics snapshots, an
ensemble of bootstrapped logistic regressions, or via a number of Monte Carlo
Dropout passes. We evaluate their performance in terms of \emph{selective}
classification (risk-coverage), and their ability to detect out-of-distribution
samples. Our experiments suggest there is limited value in adding multiple
uncertainty layers to deep classifiers, and we observe that these simple
methods strongly outperform a vanilla point-estimate SGD in some...

more |
pdf
| html
None.

BrundageBot:
On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. Nicolas Brosse, Carlos Riquelme, Alice Martin, Sylvain Gelly, and Éric Moulines https://t.co/HMEJHH7Mw2

arxiv_cs_LG:
On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. Nicolas Brosse, Carlos Riquelme, Alice Martin, Sylvain Gelly, and Éric Moulines https://t.co/deQxrplllh

StatsPapers:
On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. https://t.co/4NaPAyjC8Z

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

Certain type of documents such as tweets are collected by specifying a set of
keywords. As topics of interest change with time it is beneficial to adjust
keywords dynamically. The challenge is that these need to be specified ahead of
knowing the forthcoming documents and the underlying topics. The future topics
should mimic past topics of interest yet there should be some novelty in them.
We develop a keyword-based topic model that dynamically selects a subset of
keywords to be used to collect future documents. The generative process first
selects keywords and then the underlying documents based on the specified
keywords. The model is trained by using a variational lower bound and
stochastic gradient optimization. The inference consists of finding a subset of
keywords where given a subset the model predicts the underlying topic-word
matrix for the unknown forthcoming documents. We compare the keyword topic
model against a benchmark model using viral predictions of tweets combined with
a topic model. The keyword-based topic model...

more |
pdf
| html
BrundageBot:
Keyword-based Topic Modeling and Keyword Selection. Xingyu Wang, Lida Zhang, and Diego Klabjan https://t.co/Ye8WtOui3r

arxiv_cs_LG:
Keyword-based Topic Modeling and Keyword Selection. Xingyu Wang, Lida Zhang, and Diego Klabjan https://t.co/LC98ttzlrz

StatsPapers:
Keyword-based Topic Modeling and Keyword Selection. https://t.co/lBwB9FOLET

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 11004

Unqiue Words: 2714

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 257,976 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible