Top 10 Arxiv Papers Today in Machine Learning


0.0 Mikeys
#1. Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees
Yongchun Li, Weijun Xie
This paper studies a classic maximum entropy sampling problem (MESP), which aims to select the most informative principal submatrix of a prespecified size from a covariance matrix. MESP has been widely applied to many areas, including healthcare, power system, manufacturing and data science. By investigating its Lagrangian dual and primal characterization, we derive a novel convex integer program for MESP and show that its continuous relaxation yields a near-optimal solution. The results motivate us to study an efficient sampling algorithm and develop its approximation bound for MESP, which improves the best-known bound in literature. We then provide an efficient deterministic implementation of the sampling algorithm with the same approximation bound. By developing new mathematical tools for the singular matrices and analyzing the Lagrangian dual of the proposed convex integer program, we investigate the widely-used local search algorithm and prove its first-known approximation bound for MESP. The proof techniques further inspire...
more | pdf | html
Figures
Tweets
arxivml: "Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance G… https://t.co/jP8dBZadmp
arxiv_cs_LG: Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees. Yongchun Li and Weijun Xie https://t.co/aBjOCZmtgW
StatsPapers: Best Principal Submatrix Selection for the Maximum Entropy Sampling Problem: Scalable Algorithms and Performance Guarantees. https://t.co/YYsCgoXULK
Github
Repository: Approximation-Algorithms-for-MESP
User: yongchunli-13
Language: Python
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : [100, 1000, 90, 124, 90, 124, 90, 124, 2000]
Authors: 2
Total Words: 25093
Unqiue Words: 3448

0.0 Mikeys
#2. Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data
Valentina Shumovskaia, Kirill Fedyanin, Ivan Sukharev, Dmitry Berestnev, Maxim Panov
Financial institutions obtain enormous amounts of data about user transactions and money transfers, which can be considered as a large graph dynamically changing in time. In this work, we focus on the task of predicting new interactions in the network of bank clients and treat it as a link prediction problem. We propose a new graph neural network model, which uses not only the topological structure of the network but rich time-series data available for the graph nodes and edges. We evaluate the developed method using the data provided by a large European bank for several years. The proposed model outperforms the existing approaches, including other neural network models, with a significant gap in ROC AUC score on link prediction problem and also allows to improve the quality of credit scoring.
more | pdf | html
Figures
None.
Tweets
arxiv_org: Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LU1aA3VsKa https://t.co/s8xowDoGzr
arxivml: "Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data", Valentina Shumovskaia, Kiril… https://t.co/dmg60L2aQr
arxiv_cs_LG: Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. Valentina Shumovskaia, Kirill Fedyanin, Ivan Sukharev, Dmitry Berestnev, and Maxim Panov https://t.co/OlYUoDIEuj
StatsPapers: Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LjpufdjCN4
morioka: RT @arxiv_org: Linking Bank Clients using Graph Neural Networks Powered by Rich Transactional Data. https://t.co/LU1aA3VsKa https://t.co/s8…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#3. Target-Embedding Autoencoders for Supervised Representation Learning
Daniel Jarrett, Mihaela van der Schaar
Autoencoder-based learning has emerged as a staple for disciplining representations in unsupervised and semi-supervised settings. This paper analyzes a framework for improving generalization in a purely supervised setting, where the target space is high-dimensional. We motivate and formalize the general framework of target-embedding autoencoders (TEA) for supervised prediction, learning intermediate latent representations jointly optimized to be both predictable from features as well as predictive of targets---encoding the prior that variations in targets are driven by a compact set of underlying factors. As our theoretical contribution, we provide a guarantee of generalization for linear TEAs by demonstrating uniform stability, interpreting the benefit of the auxiliary reconstruction task as a form of regularization. As our empirical contribution, we extend validation of this approach beyond existing static classification applications to multivariate sequence forecasting, verifying their advantage on both linear and nonlinear...
more | pdf | html
Figures
None.
Tweets
arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
BrundageBot: Target-Embedding Autoencoders for Supervised Representation Learning. Daniel Jarrett and Mihaela van der Schaar https://t.co/c72QNDnwkd
arxivml: "Target-Embedding Autoencoders for Supervised Representation Learning", Daniel Jarrett, Mihaela van der Schaar https://t.co/1FgeYDvzyt
arxiv_cs_LG: Target-Embedding Autoencoders for Supervised Representation Learning. Daniel Jarrett and Mihaela van der Schaar https://t.co/3K7sPP8Q4N
StatsPapers: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/jEOHEpqfMM
RexDouglass: RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
morioka: RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
shubh_300595: RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
MishakinSergey: RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
IaXZnumDR5D0mAa: RT @arxiv_org: Target-Embedding Autoencoders for Supervised Representation Learning. https://t.co/wZPwRpsk5j https://t.co/aZVvuESpYS
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#4. A Multi-Scale Tensor Network Architecture for Classification and Regression
Justin Reyes, Miles Stoudenmire
We present an algorithm for supervised learning using tensor networks, employing a step of preprocessing the data by coarse-graining through a sequence of wavelet transformations. We represent these transformations as a set of tensor network layers identical to those in a multi-scale entanglement renormalization ansatz (MERA) tensor network, and perform supervised learning and regression tasks through a model based on a matrix product state (MPS) tensor network acting on the coarse-grained data. Because the entire model consists of tensor contractions (apart from the initial non-linear feature map), we can adaptively fine-grain the optimized MPS model backwards through the layers with essentially no loss in performance. The MPS itself is trained using an adaptive algorithm based on the density matrix renormalization group (DMRG) algorithm. We test our methods by performing a classification task on audio data and a regression task on temperature time-series data, studying the dependence of training accuracy on the number of...
more | pdf | html
Figures
None.
Tweets
arxiv_org: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk
arxivml: "A Multi-Scale Tensor Network Architecture for Classification and Regression", Justin Reyes, Miles Stoudenmire https://t.co/5jQuCx6oE6
MLSTjournal: Interesting new work from @MStoudenmire @FlatironCCQ @FlatironInst - A Multi-Scale Tensor Network Architecture for Classification and Regression - https://t.co/3d6j6rDPPW #ML #MachineLearning #networks
StatsPapers: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/HIsO6QjQPV
puneethmishra: RT @arxiv_org: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk
shubh_300595: RT @arxiv_org: A Multi-Scale Tensor Network Architecture for Classification and Regression. https://t.co/kT8IypVNxa https://t.co/GYwqSKt4wk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#5. Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities
Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, Meisam Razaviyayn
In this paper, we study zeroth-order algorithms for minimax optimization problems that are nonconvex in one variable and strongly-concave in the other variable. Such minimax optimization problems have attracted significant attention lately due to their applications in modern machine learning tasks. We first design and analyze the Zeroth-Order Gradient Descent Ascent (\texttt{ZO-GDA}) algorithm, and provide improved results compared to existing works, in terms of oracle complexity. Next, we propose the Zeroth-Order Gradient Descent Multi-Step Ascent (\texttt{ZO-GDMSA}) algorithm that significantly improves the oracle complexity of \texttt{ZO-GDA}. We also provide stochastic version of \texttt{ZO-GDA} and \texttt{ZO-GDMSA} to handle stochastic nonconvex minimax problems, and provide oracle complexity results.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, and Meisam Razaviyayn https://t.co/s0OLQZvku8
arxiv_cs_LG: Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, and Meisam Razaviyayn https://t.co/ZzjIXq0aQe
StatsPapers: Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities. https://t.co/DQBnj4ARwH
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 14036
Unqiue Words: 2541

0.0 Mikeys
#6. Up to two billion times acceleration of scientific simulations with deep neural architecture search
M. F. Kasim, D. Watson-Parris, L. Deaconu, S. Oliver, P. Hatfield, D. H. Froula, G. Gregori, M. Jarvis, S. Khatiwala, J. Korenaga, J. Topp-Mugglestone, E. Viezzer, S. M. Vinko
Computer simulations are invaluable tools for scientific discovery. However, accurate simulations are often slow to execute, which limits their applicability to extensive parameter exploration, large-scale data analysis, and uncertainty quantification. A promising route to accelerate simulations by building fast emulators with machine learning requires large training datasets, which can be prohibitively expensive to obtain with slow simulations. Here we present a method based on neural architecture search to build accurate emulators even with a limited number of training data. The method successfully accelerates simulations by up to 2 billion times in 10 scientific cases including astrophysics, climate science, biogeochemistry, high energy density physics, fusion energy, and seismology, using the same super-architecture, algorithm, and hyperparameters. Our approach also inherently provides emulator uncertainty estimation, adding further confidence in their use. We anticipate this work will accelerate research involving expensive...
more | pdf | html
Figures
Tweets
HNTweets: Up to two billion times acceleration of scientific simulations: https://t.co/Y6xS0xZgQG Comments: https://t.co/dmd1k1Xqtb
hn_frontpage: Up to two billion times acceleration of scientific simulations L: https://t.co/xk8dEo2eiy C: https://t.co/Y0JF3O42nL
hacker_news_hir: Up to two billion times acceleration of scientific simulations : https://t.co/0uPtvRwJno Comments: https://t.co/qoDB9FZhd5
angsuman: Up to two billion times acceleration of scientific simulations https://t.co/pXvQbtadXH
StatsPapers: Up to two billion times acceleration of scientific simulations with deep neural architecture search. https://t.co/dRNCfMIkGk
StarshipBuilder: Up to two billion times acceleration of scientific simulations with deep neural architecture search https://t.co/DFD102F6Hq
jreuben1: Up to two billion times acceleration of scientific simulations with deep neural architecture search https://t.co/DeVEIzkZ2z
zhaffsky: https://t.co/C4LP1bwAFr https://t.co/C4LP1bwAFr
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 13
Total Words: 5209
Unqiue Words: 2036

0.0 Mikeys
#7. A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting
Zhengkun Li, Minh-Ngoc Tran, Chao Wang, Richard Gerlach, Junbin Gao
Value-at-Risk (VaR) and Expected Shortfall (ES) are widely used in the financial sector to measure the market risk and manage the extreme market movement. The recent link between the quantile score function and the Asymmetric Laplace density has led to a flexible likelihood-based framework for joint modelling of VaR and ES. It is of high interest in financial applications to be able to capture the underlying joint dynamics of these two quantities. We address this problem by developing a hybrid model that is based on the Asymmetric Laplace quasi-likelihood and employs the Long Short-Term Memory (LSTM) time series modelling technique from Machine Learning to capture efficiently the underlying dynamics of VaR and ES. We refer to this model as LSTM-AL. We adopt the adaptive Markov chain Monte Carlo (MCMC) algorithm for Bayesian inference in the LSTM-AL model. Empirical results show that the proposed LSTM-AL model can improve the VaR and ES forecasting accuracy over a range of well-established competing models.
more | pdf | html
Figures
Tweets
arxiv_org: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Fo... https://t.co/4HHTOFH79U https://t.co/sdSQRJjKu2
arxivml: "A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting", Zhengkun Li, … https://t.co/w7f43KRBXl
arxiv_cs_LG: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. Zhengkun Li, Minh-Ngoc Tran, Chao Wang, Richard Gerlach, and Junbin Gao https://t.co/Y4BNuzly79
StatsPapers: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38
JAdP: RT @StatsPapers: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38
ankiytweets: RT @StatsPapers: A Bayesian Long Short-Term Memory Model for Value at Risk and Expected Shortfall Joint Forecasting. https://t.co/A4Ekxhiy38
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 5886
Unqiue Words: 1759

0.0 Mikeys
#8. Stratified cross-validation for unbiased and privacy-preserving federated learning
R. Bey, R. Goussault, M. Benchoufi, R. Porcher
Large-scale collections of electronic records constitutes both an opportunity for the development of more accurate prediction models and a threat for privacy. To limit privacy exposure new privacy-enhancing techniques are emerging such as federated learning which enables large-scale data analysis while avoiding the centralization of records in a unique database that would represent a critical point of failure. Although promising regarding privacy protection, federated learning prevents using some data-cleaning algorithms thus inducing new biases. In this work we focus on the recurrent problem of duplicated records that, if not handled properly, may cause over-optimistic estimations of a model's performances. We introduce and discuss stratified cross-validation, a validation methodology that leverages stratification techniques to prevent data leakage in federated learning settings without relying on demanding deduplication algorithms.
more | pdf | html
Figures
None.
Tweets
arxiv_cs_LG: Stratified cross-validation for unbiased and privacy-preserving federated learning. R. Bey, R. Goussault, M. Benchoufi, and R. Porcher https://t.co/tLdp8FRDPH
StatsPapers: Stratified cross-validation for unbiased and privacy-preserving federated learning. https://t.co/Kt6mmHSMor
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7150
Unqiue Words: 2559

0.0 Mikeys
#9. On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation
Nicolas Brosse, Carlos Riquelme, Alice Martin, Sylvain Gelly, Éric Moulines
Uncertainty quantification for deep learning is a challenging open problem. Bayesian statistics offer a mathematically grounded framework to reason about uncertainties; however, approximate posteriors for modern neural networks still require prohibitive computational costs. We propose a family of algorithms which split the classification task into two stages: representation learning and uncertainty estimation. We compare four specific instances, where uncertainty estimation is performed via either an ensemble of Stochastic Gradient Descent or Stochastic Gradient Langevin Dynamics snapshots, an ensemble of bootstrapped logistic regressions, or via a number of Monte Carlo Dropout passes. We evaluate their performance in terms of \emph{selective} classification (risk-coverage), and their ability to detect out-of-distribution samples. Our experiments suggest there is limited value in adding multiple uncertainty layers to deep classifiers, and we observe that these simple methods strongly outperform a vanilla point-estimate SGD in some...
more | pdf | html
Figures
None.
Tweets
BrundageBot: On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. Nicolas Brosse, Carlos Riquelme, Alice Martin, Sylvain Gelly, and Éric Moulines https://t.co/HMEJHH7Mw2
arxiv_cs_LG: On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. Nicolas Brosse, Carlos Riquelme, Alice Martin, Sylvain Gelly, and Éric Moulines https://t.co/deQxrplllh
StatsPapers: On Last-Layer Algorithms for Classification: Decoupling Representation from Uncertainty Estimation. https://t.co/4NaPAyjC8Z
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#10. Keyword-based Topic Modeling and Keyword Selection
Xingyu Wang, Lida Zhang, Diego Klabjan
Certain type of documents such as tweets are collected by specifying a set of keywords. As topics of interest change with time it is beneficial to adjust keywords dynamically. The challenge is that these need to be specified ahead of knowing the forthcoming documents and the underlying topics. The future topics should mimic past topics of interest yet there should be some novelty in them. We develop a keyword-based topic model that dynamically selects a subset of keywords to be used to collect future documents. The generative process first selects keywords and then the underlying documents based on the specified keywords. The model is trained by using a variational lower bound and stochastic gradient optimization. The inference consists of finding a subset of keywords where given a subset the model predicts the underlying topic-word matrix for the unknown forthcoming documents. We compare the keyword topic model against a benchmark model using viral predictions of tweets combined with a topic model. The keyword-based topic model...
more | pdf | html
Figures
Tweets
BrundageBot: Keyword-based Topic Modeling and Keyword Selection. Xingyu Wang, Lida Zhang, and Diego Klabjan https://t.co/Ye8WtOui3r
arxiv_cs_LG: Keyword-based Topic Modeling and Keyword Selection. Xingyu Wang, Lida Zhang, and Diego Klabjan https://t.co/LC98ttzlrz
StatsPapers: Keyword-based Topic Modeling and Keyword Selection. https://t.co/lBwB9FOLET
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 11004
Unqiue Words: 2714

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 257,976 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 257,976 papers.