The R package abn is designed to fit additive Bayesian models to
observational datasets. It contains routines to score Bayesian networks based
on Bayesian or information theoretic formulations of generalized linear models.
It is equipped with exact search and greedy search algorithms to select the
best network. It supports a possible blend of continuous, discrete and count
data and input of prior knowledge at a structural level. The Bayesian
implementation supports random effects to control for one-layer clustering. In
this paper, we give an overview of the methodology and illustrate the package's
functionalities using a veterinary dataset about respiratory diseases in
commercial swine production.

more |
pdf
| html
None.

arxivml:
"Additive Bayesian Network Modelling with the R Package abn",
Gilles Kratzer, Fraser Iain Lewis, Arianna Comin, Mar…
https://t.co/qDIRn2hdQx

GillesKratzer:
📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.co/hkTtXlRvG3
1⃣ Fitting Bayesian Networks with MLE and Bayesian methods
2⃣ Handy visualization/summary tools
3⃣ Tutorials https://t.co/jAHZFb3b5K
#rstats

arxiv_cs_LG:
Additive Bayesian Network Modelling with the R Package abn. Gilles Kratzer, Fraser Iain Lewis, Arianna Comin, Marta Pittavino, and Reinhard Furrer https://t.co/bnVWjDFYp8

StatsPapers:
Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc

rstatstweet:
RT @GillesKratzer: 📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.…

SGruninger:
RT @GillesKratzer: 📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.…

BlasBenito:
RT @GillesKratzer: 📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.…

HugDorothea:
RT @StatsPapers: Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc

GillesKratzer:
RT @StatsPapers: Additive Bayesian Network Modelling with the R Package abn. https://t.co/ENKFgq3zuc

d8aninja:
RT @GillesKratzer: 📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.…

CelineFaverjon:
RT @GillesKratzer: 📚 #preprint 🚨
"Additive Bayesian Network Modelling with the R Package abn" https://t.co/ncHNhtp4BR
R📦{abn} https://t.…

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

The current interpretation of stochastic gradient descent (SGD) as a
stochastic process lacks generality in that its numerical scheme restricts
continuous-time dynamics as well as the loss function and the distribution of
gradient noise. We introduce a simplified scheme with milder conditions that
flexibly interprets SGD as a discrete-time approximation of an Ito process. The
scheme also works as a common foundation of SGD and stochastic gradient
Langevin dynamics (SGLD), providing insights into their asymptotic properties.
We investigate the convergence of SGD with biased gradient in terms of the
equilibrium mode and the overestimation problem of the second moment of SGLD.

more |
pdf
| html
None.

BrundageBot:
Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/C4COfOJOEK

arxivml:
"Bayesian interpretation of SGD as Ito process",
Soma Yokoi, Issei Sato
https://t.co/MrdQsNghvv

arxiv_cs_LG:
Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/hoIuWIsm1s

StatsPapers:
Bayesian interpretation of SGD as Ito process. https://t.co/AAuMGZrLuK

li_sigmath:
RT @StatsPapers: Bayesian interpretation of SGD as Ito process. https://t.co/AAuMGZrLuK

kinematicpath:
RT @BrundageBot: Bayesian interpretation of SGD as Ito process. Soma Yokoi and Issei Sato https://t.co/C4COfOJOEK

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

This paper presents a simulator-assisted training method (SimVAE) for
variational autoencoders (VAE) that leads to a disentangled and interpretable
latent space. Training SimVAE is a two-step process in which first a deep
generator network(decoder) is trained to approximate the simulator. During this
step, the simulator acts as the data source or as a teacher network. Then an
inference network (encoder)is trained to invert the decoder. As such, upon
complete training, the encoder represents an approximately inverted simulator.
By decoupling the training of the encoder and decoder we bypass some of the
difficulties that arise in training generative models such as VAEs and
generative adversarial networks (GANs). We show applications of our approach in
a variety of domains such as circuit design, graphics de-rendering and other
natural science problems that involve inference via simulation.

more |
pdf
| html
None.

BrundageBot:
SimVAE: Simulator-Assisted Training forInterpretable Generative Models. Akash Srivastava, Jessie Rosenberg, Dan Gutfreund, and David D. Cox https://t.co/jzIxcDxKcq

StatsPapers:
SimVAE: Simulator-Assisted Training forInterpretable Generative Models. https://t.co/3YPSQ5oFTv

None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 0

Unqiue Words: 0

This paper addresses the problem of unsupervised clustering which remains one
of the most fundamental challenges in machine learning and artificial
intelligence. We propose the clustered generator model for clustering which
contains both continuous and discrete latent variables. Discrete latent
variables model the cluster label while the continuous ones model variations
within each cluster. The learning of the model proceeds in a unified
probabilistic framework and incorporates the unsupervised clustering as an
inner step without the need for an extra inference model as in existing
variational-based models. The latent variables learned serve as both observed
data embedding or latent representation for data distribution. Our experiments
show that the proposed model can achieve competitive unsupervised clustering
accuracy and can learn disentangled latent representations to generate
realistic samples. In addition, the model can be naturally extended to
per-pixel unsupervised clustering which remains largely unexplored.

more |
pdf
| html
BrundageBot:
Deep Unsupervised Clustering with Clustered Generator Model. Dandan Zhu, Tian Han, Linqi Zhou, Xiaokang Yang, and Ying Nian Wu https://t.co/EK4hhxSDpl

StatsPapers:
Deep Unsupervised Clustering with Clustered Generator Model. https://t.co/N6Z1HeDEsk

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 6406

Unqiue Words: 1951

Manifold-valued data naturally arises in medical imaging. In cognitive
neuroscience, for instance, brain connectomes base the analysis of coactivation
patterns between different brain regions on the analysis of the correlations of
their functional Magnetic Resonance Imaging (fMRI) time series - an object thus
constrained by construction to belong to the manifold of symmetric positive
definite matrices. One of the challenges that naturally arises consists of
finding a lower-dimensional subspace for representing such manifold-valued
data. Traditional techniques, like principal component analysis, are
ill-adapted to tackle non-Euclidean spaces and may fail to achieve a
lower-dimensional representation of the data - thus potentially pointing to the
absence of lower-dimensional representation of the data. However, these
techniques are restricted in that: (i) they do not leverage the assumption that
the connectomes belong on a pre-specified manifold, therefore discarding
information; (ii) they can only fit a linear subspace to the data....

more |
pdf
| html
None.

StatsPapers:
Learning Weighted Submanifolds with Variational Autoencoders and Riemannian Variational Autoencoders. https://t.co/FtTxWYPJaK

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 0

Unqiue Words: 0

We consider estimating the marginal likelihood in settings with independent
and identically distributed (i.i.d.) data. We propose estimating the predictive
distributions in a sequential factorization of the marginal likelihood in such
settings by using stochastic gradient Markov Chain Monte Carlo techniques. This
approach is far more efficient than traditional marginal likelihood estimation
techniques such as nested sampling and annealed importance sampling due to its
use of mini-batches to approximate the likelihood. Stability of the estimates
is provided by an adaptive annealing schedule. The resulting stochastic
gradient annealed importance sampling (SGAIS) technique, which is the key
contribution of our paper, enables us to estimate the marginal likelihood of a
number of models considerably faster than traditional approaches, with no
noticeable loss of accuracy. An important benefit of our approach is that the
marginal likelihood is calculated in an online fashion as data becomes
available, allowing the estimates to be used...

more |
pdf
| html
tweet_nakasho:
確率的勾配アニーリングによる重要なサンプリングについての論文。
新しいアニーリング手法の論文？
https://t.co/FUnEKZw2gQ

StatsPapers:
Stochastic Gradient Annealed Importance Sampling for Efficient Online Marginal Likelihood Estimation. https://t.co/Q9GUs44zJV

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 7267

Unqiue Words: 2089

The aim of black-box optimization is to optimize an objective function within
the constraints of a given evaluation budget. In this problem, it is generally
assumed that the computational cost for evaluating a point is large; thus, it
is important to search efficiently with as low budget as possible. Bayesian
optimization is an efficient method for black-box optimization and provides
exploration-exploitation trade-off by constructing a surrogate model that
considers uncertainty of the objective function. However, because Bayesian
optimization should construct the surrogate model for the entire search space,
it does not exhibit good performance when points are not sampled sufficiently.
In this study, we develop a heuristic method refining the search space for
Bayesian optimization when the available evaluation budget is low. The proposed
method refines a promising region by dividing the original region so that
Bayesian optimization can be executed with the promising region as the initial
search space. We confirm that Bayesian...

more |
pdf
| html
BrundageBot:
A Simple Heuristic for Bayesian Optimization with A Low Budget. Masahiro Nomura and Kenshi Abe https://t.co/xcasp1CKeB

tweet_nakasho:
ベイジアン最適化のためのヒューリスティクス手法の論文。
サイバーエージェントの人たちの仕事。
https://t.co/rIlrnmTwcm

StatsPapers:
A Simple Heuristic for Bayesian Optimization with A Low Budget. https://t.co/FnLsxLkR8m

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 5337

Unqiue Words: 1638

Deep Neural Networks (DNNs) are susceptible to model stealing attacks, which
allows a data-limited adversary with no knowledge of the training dataset to
clone the functionality of a target model, just by using black-box query
access. Such attacks are typically carried out by querying the target model
using inputs that are synthetically generated or sampled from a surrogate
dataset to construct a labeled dataset. The adversary can use this labeled
dataset to train a clone model, which achieves a classification accuracy
comparable to that of the target model. We propose "Adaptive Misinformation" to
defend against such model stealing attacks. We identify that all existing model
stealing attacks invariably query the target model with Out-Of-Distribution
(OOD) inputs. By selectively sending incorrect predictions for OOD queries, our
defense substantially degrades the accuracy of the attacker's clone model (by
up to 40%), while minimally impacting the accuracy (<0.5%) for benign users.
Compared to existing defenses, our defense has a...

more |
pdf
| html
BrundageBot:
Defending Against Model Stealing Attacks with Adaptive Misinformation. Sanjay Kariyappa and Moinuddin K Qureshi https://t.co/V8OkxIdcWI

StatsPapers:
Defending Against Model Stealing Attacks with Adaptive Misinformation. https://t.co/jwKcrpLgZL

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 6788

Unqiue Words: 1696

Time series classification problems have drawn increasing attention in the
machine learning and statistical community. Closely related is the field of
functional data analysis (FDA): it refers to the range of problems that deal
with the analysis of data that is continuously indexed over some domain. While
often employing different methods, both fields strive to answer similar
questions, a common example being classification or regression problems with
functional covariates. We study methods from functional data analysis, such as
functional generalized additive models, as well as functionality to concatenate
(functional-) feature extraction or basis representations with traditional
machine learning algorithms like support vector machines or classification
trees. In order to assess the methods and implementations, we run a benchmark
on a wide variety of representative (time series) data sets, with in-depth
analysis of empirical results, and strive to provide a reference ranking for
which method(s) to use for non-expert...

more |
pdf
| html
None.

StatsPapers:
Benchmarking time series classification -- Functional data vs machine learning approaches. https://t.co/trdT72xWpt

None.

None.

Sample Sizes : None.

Authors: 5

Total Words: 0

Unqiue Words: 0

Unsupervised learning requiring only raw data is not only a fundamental
function of the cerebral cortex, but also a foundation for a next generation of
artificial neural networks. However, a unified theoretical framework to treat
sensory inputs, synapses and neural activity together is still lacking. The
computational obstacle originates from the discrete nature of synapses, and
complex interactions among these three essential elements of learning. Here, we
propose a variational mean-field theory in which only the distribution of
synaptic weight is considered. The unsupervised learning can then be decomposed
into two interwoven steps: a maximization step is carried out as a gradient
ascent of the lower-bound on the data log-likelihood, and an expectation step
is carried out as a message passing procedure on an equivalent or dual neural
network whose parameter is specified by the variational parameter of the weight
distribution. Therefore, our framework explains how data (or sensory inputs),
synapses and neural activities interact...

more |
pdf
| html
None.

arxivml:
"How data, synapses and neurons interact with each other: a variational principle marrying gradient ascent and mess…
https://t.co/4yz8LAJ0Ol

BioPapers:
How data, synapses and neurons interact with each other: a variational principle marrying gradient ascent and message passing. https://t.co/Rut8pUrfK0

None.

None.

Sample Sizes : None.

Authors: 1

Total Words: 5519

Unqiue Words: 1689

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 225,779 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible