We introduce the variational filtering EM algorithm, a simple,
general-purpose method for performing variational inference in dynamical latent
variable models using information from only past and present variables, i.e.
filtering. The algorithm is derived from the variational objective in the
filtering setting and consists of an optimization procedure at each time step.
By performing each inference optimization procedure with an iterative amortized
inference model, we obtain a computationally efficient implementation of the
algorithm, which we call amortized variational filtering. We present
experiments demonstrating that this general-purpose method improves performance
across several deep dynamical latent variable models.

Authors: 3

Total Words: 6968

Unique Words: 2126

The synthetic control method (SCM) is a popular approach for estimating the
impact of a treatment on a single unit in panel data settings. The "synthetic
control" is a weighted average of control units that balances the treated
unit's pre-treatment outcomes as closely as possible. The curse of
dimensionality, however, means that SCM does not generally achieve exact
balance, which can bias the SCM estimate. We propose an extension, Augmented
SCM, which uses an outcome model to estimate the bias due to covariate
imbalance and then de-biases the original SCM estimate, analogous to bias
correction for inexact matching. We motivate this approach by showing that SCM
is a (regularized) inverse propensity score weighting estimator, with
pre-treatment outcomes as covariates and a ridge penalty on the propensity
score coefficients. We give theoretical guarantees for specific cases and
propose a new inference procedure. We demonstrate gains from Augmented SCM with
extensive simulation studies and apply this framework to canonical...

Authors: 3

Total Words: 19346

Unique Words: 3589

We introduce a novel framework for the estimation of the posterior
distribution of the weights of a neural network, based on a new probabilistic
interpretation of adaptive subgradient algorithms such as AdaGrad and Adam.
Having a confidence measure of the weights allows several shortcomings of
neural networks to be addressed. In particular, the robustness of the network
can be improved by performing weight pruning based on signal-to-noise ratios
from the weight posterior distribution. Using the MNIST dataset, we demonstrate
that the empirical performance of Badam, a particular instance of our framework
based on Adam, is competitive in comparison to related Bayesian approaches such
as Bayes By Backprop.
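
The pruning idea mentioned above can be illustrated with a minimal sketch. It assumes the signal-to-noise ratio of a weight is the absolute posterior mean divided by the posterior standard deviation, a common convention that the abstract itself does not spell out. The function name `snr_prune`, the threshold, and the toy numbers are hypothetical and are not taken from the paper.

```python
import math


def snr_prune(means, stds, threshold):
    """Zero out weights whose |mean| / std falls below `threshold`.

    Assumption: SNR is defined as |posterior mean| / posterior std.
    """
    pruned = []
    for mu, sigma in zip(means, stds):
        # A weight with zero posterior spread is treated as fully certain.
        snr = abs(mu) / sigma if sigma > 0 else math.inf
        pruned.append(mu if snr >= threshold else 0.0)
    return pruned


# Toy posterior summaries for four weights (illustrative values only).
means = [0.8, -0.05, 0.3, -1.2]
stds = [0.1, 0.2, 0.6, 0.3]
print(snr_prune(means, stds, threshold=1.0))  # [0.8, 0.0, 0.0, -1.2]
```

Weights with a large mean relative to their uncertainty survive; the two weights whose ratio is below 1 are set to zero.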

Authors: 3

Total Words: 5967

Unique Words: 1882

In this paper we offer an asymptotically efficient, non-parametric way to
assess treatment effect variability via the conditional average treatment
effect (CATE) which is a function of the measured confounders or strata, giving
the average treatment effect for a given stratum. We ask two main
questions of the CATE function: what are its mean and its variance? The mean gives
the more easily estimable and well-studied average treatment effect whereas
CATE variance measures reliability of treatment or the extent of effect
modification. With knowledge of the CATE variance, and hence the CATE standard
deviation, a doctor or policy analyst can give a precise statement as to what
an individual patient can expect, which we distinguish as clinical effect
heterogeneity. We can also assess how much precision in treatment can be gained
in assigning treatments based on patient covariates. Through simulations we
will verify some of the theoretical properties of our proposed estimator and we
will also point out some of the challenges in...
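
To make the two target quantities concrete, here is a small worked sketch. It assumes a discrete set of strata with known probabilities and known CATE values, so it computes population quantities directly and is not the estimator proposed in the paper. The function name and the numbers are hypothetical.

```python
import math


def cate_mean_and_variance(strata):
    """Return (mean, variance) of the CATE over strata.

    `strata` is a list of (probability, cate) pairs whose
    probabilities are assumed to sum to 1.
    """
    # The mean of the CATE over strata is the average treatment effect.
    mean = sum(p * b for p, b in strata)
    # The variance measures how much the effect differs across strata.
    var = sum(p * (b - mean) ** 2 for p, b in strata)
    return mean, var


# Three strata with effects 2, 0 and 4 (illustrative values only).
strata = [(0.5, 2.0), (0.25, 0.0), (0.25, 4.0)]
ate, cate_var = cate_mean_and_variance(strata)
print(ate, cate_var, math.sqrt(cate_var))
```

Here the average treatment effect is 2.0 and the CATE variance is 2.0, so the effect in a given stratum typically differs from the average by about 1.41.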

Sample Sizes : [1000]

Authors: 4

Total Words: 13370

Unique Words: 2983

The doctrinal paradox is analysed from a probabilistic point of view assuming
a simple parametric model for the committee's behaviour. The well known
issue-by-issue and case-by-case majority rules are compared in this model, by
means of the concepts of false positive rate (FPR), false negative rate (FNR)
and Receiver Operating Characteristics (ROC) space. We also introduce a new
rule that we call path-by-path, which is in a sense halfway between the other two.
Under our model assumptions, the issue-by-issue rule is shown to be the best of
the three according to an optimality criterion based on ROC maps, for all
values of the model parameters (committee size and competence of its members),
when equal weight is given to FPR and FNR. For unequal weights, the relative
goodness of the rules depends on the values of the competence and the weights,
in a way which is precisely described. The results are illustrated with some
numerical examples.
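
The difference between the two classical rules can be seen in a minimal sketch. It assumes the textbook form of the paradox, in which each committee member votes on two premises and the conclusion is their conjunction; the abstract does not state the exact setup, and the path-by-path rule is not reproduced here because it is not defined in the abstract. All names and the example profile are hypothetical.

```python
def majority(votes):
    """True when strictly more than half of the votes are True."""
    return sum(votes) * 2 > len(votes)


def issue_by_issue(profile):
    """Take a majority on each premise, then combine the two outcomes."""
    p = majority([member[0] for member in profile])
    q = majority([member[1] for member in profile])
    return p and q


def case_by_case(profile):
    """Let each member reach a conclusion, then take a majority of those."""
    return majority([p and q for p, q in profile])


# Each tuple is one member's opinion on (premise P, premise Q).
profile = [(True, True), (True, False), (False, True)]
print(issue_by_issue(profile))  # True
print(case_by_case(profile))    # False
```

Both premises are accepted by two of the three members, so the issue-by-issue rule accepts the conclusion, while only one member individually accepts it, so the case-by-case rule rejects it.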

Authors: 2

Total Words: 13346

Unique Words: 2763

We present a deep learning framework for quantifying and propagating
uncertainty in systems governed by non-linear differential equations using
physics-informed neural networks. Specifically, we employ latent variable
models to construct probabilistic representations for the system states, and
put forth an adversarial inference procedure for training them on data, while
constraining their predictions to satisfy given physical laws expressed by
partial differential equations. Such physics-informed constraints provide a
regularization mechanism for effectively training deep generative models as
surrogates of physical systems in which the cost of data acquisition is high,
and training data-sets are typically small. This provides a flexible framework
for characterizing uncertainty in the outputs of physical systems due to
randomness in their inputs or noise in their observations that entirely
bypasses the need for repeatedly sampling expensive experiments or numerical
simulators. We demonstrate the effectiveness of our approach...

Authors: 2

Total Words: 9971

Unique Words: 2649

Collected data used for analysis or prediction tasks often have a
hierarchical structure, for example, data from various people performing the
same task. Modeling the data's structure can improve the reliability of the
derived results and the prediction performance on new, unobserved data. Bayesian
modeling provides a tool-kit for designing hierarchical models. However, Markov
chain Monte Carlo methods, which are commonly used for parameter estimation, are
computationally expensive, which often makes them impractical for many
applications. Variational Bayesian methods, by contrast, allow an
approximation to be derived with much less computational effort. This document describes the
derivation of a variational approximation for a hierarchical linear Bayesian
regression and demonstrates its application to data analysis.

Authors: 1

Total Words: 6431

Unique Words: 1710

The rapid development of modern technology facilitates the appearance of
numerous unprecedented complex data which do not satisfy the axioms of
Euclidean geometry, while most of the statistical hypothesis tests are
available in Euclidean or Hilbert spaces. To properly analyze the data of more
complicated structures, efforts have been made to solve the fundamental test
problems in more general spaces. In this paper, a publicly available R package
Ball is provided to implement Ball statistical test procedures for K-sample
distribution comparison and test of mutual independence in metric spaces, which
extend the test procedures for two sample distribution comparison and test of
independence. Tailor-made algorithms as well as engineering techniques are
employed in the Ball package to speed up computation to the best of our
ability. Two real data analyses and several numerical studies have been
performed, and the results confirm the power of the Ball package in analyzing
complex data, e.g., spherical data and symmetric positive...

Authors: 4

Total Words: 9591

Unique Words: 2672

While the HVTN 505 trial showed no overall efficacy of the tested vaccine to
prevent HIV infection over placebo, previous studies, biological theories, and
the finding that immune response markers strongly correlated with infection in
vaccine recipients generated the hypothesis that a qualitative interaction
occurred. This hypothesis can be assessed with statistical methods for studying
treatment effect modification by an intermediate response variable (i.e.,
principal stratification effect modification (PSEM) methods). However,
available PSEM methods make untestable structural risk assumptions, such that
assumption-lean versions of PSEM methods are needed in order to surpass the
high bar of evidence to demonstrate a qualitative interaction. Fortunately, the
survivor average causal effect (SACE) literature is replete with
assumption-lean methods that can be readily adapted to the PSEM application for
the special case of a binary intermediate response variable. We map this
adaptation, opening up a host of new PSEM methods for a...

Sample Sizes : [400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 400, 800, 1600, 400, 800, 1600, 1600, 800, 400, 1600, 400, 800, 1600, 400, 800, 1600, 800, 400, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 2000, 4000, 8000, 2000, 4000, 8000, 8000, 4000, 2000, 8000, 2000, 4000, 8000, 2000, 4000, 8000, 4000, 2000]

Authors: 4

Total Words: 15312

Unique Words: 2785

Cardiovascular diseases (CVDs) are the number one cause of death globally. WHO
estimated that CVDs caused 17.9 million deaths (or 31% of all global
deaths) in 2016. Perhaps surprisingly, CVDs can be easily prevented by
altering lifestyle to avoid risk factors. The only requirement is to
know one's risk beforehand. The Thai CV risk score is a trustworthy tool for
forecasting the risk of a future cardiovascular event in Thais. This study is an
external validation of the Thai CV risk score. We aim to answer two key
questions. First, is the Thai CV risk score, developed using a dataset of people
from the central and northwestern parts of Thailand, applicable to people from
other parts of the country? Second, does the Thai CV risk score, developed for
the general public, work for hospital patients, who tend to have higher risk? We
answer these two questions using a dataset of 1,025 patients (319 males, 35-70
years old) from Lansaka Hospital in southern Thailand. In brief, we find
that the Thai CV risk score works for southern Thais...

Authors: 4

Total Words: 1531

Unique Words: 645

