Top 10 Arxiv Papers Today in Machine Learning


2.105 Mikeys
#1. Additive Bayesian Network Modelling with the R Package abn
Gilles Kratzer, Fraser Iain Lewis, Arianna Comin, Marta Pittavino, Reinhard Furrer
The R package abn is designed to fit additive Bayesian models to observational datasets. It contains routines to score Bayesian networks based on Bayesian or information theoretic formulations of generalized linear models. It is equipped with exact search and greedy search algorithms to select the best network. It supports a possible blend of continuous, discrete and count data and input of prior knowledge at a structural level. The Bayesian implementation supports random effects to control for one-layer clustering. In this paper, we give an overview of the methodology and illustrate the package's functionalities using a veterinary dataset about respiratory diseases in commercial swine production.
more | pdf | html
Figures
None.
Tweets
arxivml: "Additive Bayesian Network Modelling with the R Package abn", Gilles Kratzer, Fraser Iain Lewis, Arianna Comin, Mar… https://t.co/qDIRn2hdQx
GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.co/hkTtXlRvG3 1⃣ Fitting Bayesian Networks with MLE and Bayesian methods 2⃣ Handy visualization/summary tools 3⃣ Tutorials https://t.co/jAHZFb3b5K #rstats
arxiv_cs_LG: Additive Bayesian Network Modelling with the R Package abn. Gilles Kratzer, Fraser Iain Lewis, Arianna Comin, Marta Pittavino, and Reinhard Furrer https://t.co/bnVWjDFYp8
StatsPapers: Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc
rstatstweet: RT @GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.…
SGruninger: RT @GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.…
BlasBenito: RT @GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.…
HugDorothea: RT @StatsPapers: Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc
GillesKratzer: RT @StatsPapers: Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc
d8aninja: RT @GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.…
CelineFaverjon: RT @GillesKratzer: 📚 #preprint 🚨 "Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR R📦{abn} https://t.…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.067 Mikeys
#2. Bayesian interpretation of SGD as Ito process
Soma Yokoi, Issei Sato
The current interpretation of stochastic gradient descent (SGD) as a stochastic process lacks generality in that its numerical scheme restricts continuous-time dynamics as well as the loss function and the distribution of gradient noise. We introduce a simplified scheme with milder conditions that flexibly interprets SGD as a discrete-time approximation of an Ito process. The scheme also works as a common foundation of SGD and stochastic gradient Langevin dynamics (SGLD), providing insights into their asymptotic properties. We investigate the convergence of SGD with biased gradient in terms of the equilibrium mode and the overestimation problem of the second moment of SGLD.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/C4COfOJOEK
arxivml: "Bayesian interpretation of SGD as Ito process", Soma Yokoi, Issei Sato https://t.co/MrdQsNghvv
arxiv_cs_LG: Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/hoIuWIsm1s
StatsPapers: Bayesian interpretation of SGD as Ito process. https://t.co/AAuMGZrLuK
li_sigmath: RT @StatsPapers: Bayesian interpretation of SGD as Ito process. https://t.co/AAuMGZrLuK
kinematicpath: RT @BrundageBot: Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/C4COfOJOEK
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.007 Mikeys
#3. SimVAE: Simulator-Assisted Training forInterpretable Generative Models
Akash Srivastava, Jessie Rosenberg, Dan Gutfreund, David D. Cox
This paper presents a simulator-assisted training method (SimVAE) for variational autoencoders (VAE) that leads to a disentangled and interpretable latent space. Training SimVAE is a two-step process in which first a deep generator network(decoder) is trained to approximate the simulator. During this step, the simulator acts as the data source or as a teacher network. Then an inference network (encoder)is trained to invert the decoder. As such, upon complete training, the encoder represents an approximately inverted simulator. By decoupling the training of the encoder and decoder we bypass some of the difficulties that arise in training generative models such as VAEs and generative adversarial networks (GANs). We show applications of our approach in a variety of domains such as circuit design, graphics de-rendering and other natural science problems that involve inference via simulation.
more | pdf | html
Figures
None.
Tweets
BrundageBot: SimVAE: Simulator-Assisted Training forInterpretable Generative Models. Akash Srivastava, Jessie Rosenberg, Dan Gutfreund, and David D. Cox https://t.co/jzIxcDxKcq
StatsPapers: SimVAE: Simulator-Assisted Training forInterpretable Generative Models. https://t.co/3YPSQ5oFTv
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

2.007 Mikeys
#4. Deep Unsupervised Clustering with Clustered Generator Model
Dandan Zhu, Tian Han, Linqi Zhou, Xiaokang Yang, Ying Nian Wu
This paper addresses the problem of unsupervised clustering which remains one of the most fundamental challenges in machine learning and artificial intelligence. We propose the clustered generator model for clustering which contains both continuous and discrete latent variables. Discrete latent variables model the cluster label while the continuous ones model variations within each cluster. The learning of the model proceeds in a unified probabilistic framework and incorporates the unsupervised clustering as an inner step without the need for an extra inference model as in existing variational-based models. The latent variables learned serve as both observed data embedding or latent representation for data distribution. Our experiments show that the proposed model can achieve competitive unsupervised clustering accuracy and can learn disentangled latent representations to generate realistic samples. In addition, the model can be naturally extended to per-pixel unsupervised clustering which remains largely unexplored.
more | pdf | html
Figures
Tweets
BrundageBot: Deep Unsupervised Clustering with Clustered Generator Model. Dandan Zhu, Tian Han, Linqi Zhou, Xiaokang Yang, and Ying Nian Wu https://t.co/EK4hhxSDpl
StatsPapers: Deep Unsupervised Clustering with Clustered Generator Model. https://t.co/N6Z1HeDEsk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 6406
Unqiue Words: 1951

2.001 Mikeys
#5. Learning Weighted Submanifolds with Variational Autoencoders and Riemannian Variational Autoencoders
Nina Miolane, Susan Holmes
Manifold-valued data naturally arises in medical imaging. In cognitive neuroscience, for instance, brain connectomes base the analysis of coactivation patterns between different brain regions on the analysis of the correlations of their functional Magnetic Resonance Imaging (fMRI) time series - an object thus constrained by construction to belong to the manifold of symmetric positive definite matrices. One of the challenges that naturally arises consists of finding a lower-dimensional subspace for representing such manifold-valued data. Traditional techniques, like principal component analysis, are ill-adapted to tackle non-Euclidean spaces and may fail to achieve a lower-dimensional representation of the data - thus potentially pointing to the absence of lower-dimensional representation of the data. However, these techniques are restricted in that: (i) they do not leverage the assumption that the connectomes belong on a pre-specified manifold, therefore discarding information; (ii) they can only fit a linear subspace to the data....
more | pdf | html
Figures
None.
Tweets
StatsPapers: Learning Weighted Submanifolds with Variational Autoencoders and Riemannian Variational Autoencoders. https://t.co/FtTxWYPJaK
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.001 Mikeys
#6. Stochastic Gradient Annealed Importance Sampling for Efficient Online Marginal Likelihood Estimation
Scott A. Cameron, Hans C. Eggers, Steve Kroon
We consider estimating the marginal likelihood in settings with independent and identically distributed (i.i.d.) data. We propose estimating the predictive distributions in a sequential factorization of the marginal likelihood in such settings by using stochastic gradient Markov Chain Monte Carlo techniques. This approach is far more efficient than traditional marginal likelihood estimation techniques such as nested sampling and annealed importance sampling due to its use of mini-batches to approximate the likelihood. Stability of the estimates is provided by an adaptive annealing schedule. The resulting stochastic gradient annealed importance sampling (SGAIS) technique, which is the key contribution of our paper, enables us to estimate the marginal likelihood of a number of models considerably faster than traditional approaches, with no noticeable loss of accuracy. An important benefit of our approach is that the marginal likelihood is calculated in an online fashion as data becomes available, allowing the estimates to be used...
more | pdf | html
Figures
Tweets
tweet_nakasho: 確率的勾配アニーリングによる重要なサンプリングについての論文。 新しいアニーリング手法の論文? https://t.co/FUnEKZw2gQ
StatsPapers: Stochastic Gradient Annealed Importance Sampling for Efficient Online Marginal Likelihood Estimation. https://t.co/Q9GUs44zJV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 7267
Unqiue Words: 2089

2.001 Mikeys
#7. A Simple Heuristic for Bayesian Optimization with A Low Budget
Masahiro Nomura, Kenshi Abe
The aim of black-box optimization is to optimize an objective function within the constraints of a given evaluation budget. In this problem, it is generally assumed that the computational cost for evaluating a point is large; thus, it is important to search efficiently with as low budget as possible. Bayesian optimization is an efficient method for black-box optimization and provides exploration-exploitation trade-off by constructing a surrogate model that considers uncertainty of the objective function. However, because Bayesian optimization should construct the surrogate model for the entire search space, it does not exhibit good performance when points are not sampled sufficiently. In this study, we develop a heuristic method refining the search space for Bayesian optimization when the available evaluation budget is low. The proposed method refines a promising region by dividing the original region so that Bayesian optimization can be executed with the promising region as the initial search space. We confirm that Bayesian...
more | pdf | html
Figures
Tweets
BrundageBot: A Simple Heuristic for Bayesian Optimization with A Low Budget. Masahiro Nomura and Kenshi Abe https://t.co/xcasp1CKeB
tweet_nakasho: ベイジアン最適化のためのヒューリスティクス手法の論文。 サイバーエージェントの人たちの仕事。 https://t.co/rIlrnmTwcm
StatsPapers: A Simple Heuristic for Bayesian Optimization with A Low Budget. https://t.co/FnLsxLkR8m
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 5337
Unqiue Words: 1638

1.998 Mikeys
#8. Defending Against Model Stealing Attacks with Adaptive Misinformation
Sanjay Kariyappa, Moinuddin K Qureshi
Deep Neural Networks (DNNs) are susceptible to model stealing attacks, which allows a data-limited adversary with no knowledge of the training dataset to clone the functionality of a target model, just by using black-box query access. Such attacks are typically carried out by querying the target model using inputs that are synthetically generated or sampled from a surrogate dataset to construct a labeled dataset. The adversary can use this labeled dataset to train a clone model, which achieves a classification accuracy comparable to that of the target model. We propose "Adaptive Misinformation" to defend against such model stealing attacks. We identify that all existing model stealing attacks invariably query the target model with Out-Of-Distribution (OOD) inputs. By selectively sending incorrect predictions for OOD queries, our defense substantially degrades the accuracy of the attacker's clone model (by up to 40%), while minimally impacting the accuracy (<0.5%) for benign users. Compared to existing defenses, our defense has a...
more | pdf | html
Figures
Tweets
BrundageBot: Defending Against Model Stealing Attacks with Adaptive Misinformation. Sanjay Kariyappa and Moinuddin K Qureshi https://t.co/V8OkxIdcWI
StatsPapers: Defending Against Model Stealing Attacks with Adaptive Misinformation. https://t.co/jwKcrpLgZL
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 6788
Unqiue Words: 1696

1.998 Mikeys
#9. Benchmarking time series classification -- Functional data vs machine learning approaches
Florian Pfisterer, Laura Beggel, Xudong Sun, Fabian Scheipl, Bernd Bischl
Time series classification problems have drawn increasing attention in the machine learning and statistical community. Closely related is the field of functional data analysis (FDA): it refers to the range of problems that deal with the analysis of data that is continuously indexed over some domain. While often employing different methods, both fields strive to answer similar questions, a common example being classification or regression problems with functional covariates. We study methods from functional data analysis, such as functional generalized additive models, as well as functionality to concatenate (functional-) feature extraction or basis representations with traditional machine learning algorithms like support vector machines or classification trees. In order to assess the methods and implementations, we run a benchmark on a wide variety of representative (time series) data sets, with in-depth analysis of empirical results, and strive to provide a reference ranking for which method(s) to use for non-expert...
more | pdf | html
Figures
None.
Tweets
StatsPapers: Benchmarking time series classification -- Functional data vs machine learning approaches. https://t.co/trdT72xWpt
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

1.998 Mikeys
#10. How data, synapses and neurons interact with each other: a variational principle marrying gradient ascent and message passing
Haiping Huang
Unsupervised learning requiring only raw data is not only a fundamental function of the cerebral cortex, but also a foundation for a next generation of artificial neural networks. However, a unified theoretical framework to treat sensory inputs, synapses and neural activity together is still lacking. The computational obstacle originates from the discrete nature of synapses, and complex interactions among these three essential elements of learning. Here, we propose a variational mean-field theory in which only the distribution of synaptic weight is considered. The unsupervised learning can then be decomposed into two interwoven steps: a maximization step is carried out as a gradient ascent of the lower-bound on the data log-likelihood, and an expectation step is carried out as a message passing procedure on an equivalent or dual neural network whose parameter is specified by the variational parameter of the weight distribution. Therefore, our framework explains how data (or sensory inputs), synapses and neural activities interact...
more | pdf | html
Figures
None.
Tweets
arxivml: "How data, synapses and neurons interact with each other: a variational principle marrying gradient ascent and mess… https://t.co/4yz8LAJ0Ol
BioPapers: How data, synapses and neurons interact with each other: a variational principle marrying gradient ascent and message passing. https://t.co/Rut8pUrfK0
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 5519
Unqiue Words: 1689

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 225,779 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 225,779 papers.