Top 10 Arxiv Papers Today in Computation


0.0 Mikeys
#1. Stable Multiple Time Step Simulation/Prediction from Lagged Dynamic Network Regression Models
Abhirup Mallik, Zack W. Almquist
Recent developments in computers and automated data collection strategies have greatly increased the interest in statistical modeling of dynamic networks. Many of the statistical models employed for inference on large-scale dynamic networks suffer from limited forward simulation/prediction ability. A major problem with many of the forward simulation procedures is the tendency for the model to become degenerate in only a few time steps, i.e., the simulation/prediction procedure results in either null graphs or complete graphs. Here, we describe an algorithm for simulating a sequence of networks generated from lagged dynamic network regression models DNR(V), a sub-family of TERGMs. We introduce a smoothed estimator for forward prediction based on smoothing of the change statistics obtained for a dynamic network regression model. We focus on the implementation of the algorithm, providing a series of motivating examples with comparisons to dynamic network models from the literature. We find that our algorithm significantly improves...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 10831
Unqiue Words: 2978

0.0 Mikeys
#2. A practical example for the non-linear Bayesian filtering of model parameters
Matthieu Bulté, Jonas Latz, Elisabeth Ullmann
In this tutorial we consider the non-linear Bayesian filtering of static parameters in a time-dependent model. We outline the theoretical background and discuss appropriate solvers. We focus on particle-based filters and present Sequential Importance Sampling (SIS) and Sequential Monte Carlo (SMC). Throughout the paper we illustrate the concepts and techniques with a practical example using real-world data. The task is to estimate the gravitational acceleration of the Earth $g$ by using observations collected from a simple pendulum. Importantly, the particle filters enable the adaptive updating of the estimate for $g$ as new observations become available. For tutorial purposes we provide the data set and a Python implementation of the particle filters.
more | pdf | html
Figures
Tweets
matthieubulte: Very proud to annouce our new tutorial paper: “A practical example for the non-linear Bayesian filtering of model parameters”. An accessible and reproducible intro Bayesian filtering and Sequential Monte Carlo. Preprint: https://t.co/cwp0xN59AJ Code: https://t.co/Bza1jFsewi
StatsPapers: A practical example for the non-linear Bayesian filtering of model parameters. https://t.co/KV5OUMTJ4V
matthieubulte: RT @StatsPapers: A practical example for the non-linear Bayesian filtering of model parameters. https://t.co/KV5OUMTJ4V
Github

Additional material for Bulté, Latz, Ullmann (2018): A practical example of non-linear Bayesian filtering of model parameters

Repository: PenduSMC
User: BayesianLearning
Language: Jupyter Notebook
Stargazers: 1
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : [1]
Authors: 3
Total Words: 11368
Unqiue Words: 2662

0.0 Mikeys
#3. Fast computation of p-values for the permutation test based on Pearson's correlation coefficient and other statistical tests
Jean-Marie Droz
Permutation tests are among the simplest and most widely used statistical tools. Their p-values can be computed by a straightforward sampling of permutations. However, this way of computing p-values is often so slow that it is replaced by an approximation, which is accurate only for part of the interesting range of parameters. Moreover, the accuracy of the approximation can usually not be improved by increasing the computation time. We introduce a new sampling-based algorithm which uses the fast Fourier transform to compute p-values for the permutation test based on Pearson's correlation coefficient. The algorithm is practically and asymptotically faster than straightforward sampling. Typically, its complexity is logarithmic in the input size, while the complexity of straightforward sampling is linear. The idea behind the algorithm can also be used to accelerate the computation of p-values for many other common statistical tests. The algorithm is easy to implement, but its analysis involves results from the representation theory...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 5482
Unqiue Words: 1472

0.0 Mikeys
#4. A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression
Qoua L. Her, Yury Vilk, Jessica Young, Zilu Zhang, Jessica M. Malenfant, Sarah Malek, Sengwee Toh
Previous work has demonstrated the feasibility and value of conducting distributed regression analysis (DRA), a privacy-protecting analytic method that performs multivariable-adjusted regression analysis with only summary-level information from participating sites. To our knowledge, there are no DRA applications in SAS, the statistical software used by several large national distributed data networks (DDNs), including the Sentinel System and PCORnet. SAS/IML is available to perform the required matrix computations for DRA in the SAS system. However, not all data partners in these large DDNs have access to SAS/IML, which is licensed separately. In this first article of a two-paper series, we describe a DRA application developed for use in Base SAS and SAS/STAT modules for linear and logistic DRA within horizontally partitioned DDNs and its successful tests.
more | pdf | html
Figures
Tweets
darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https://t.co/sRGW9Kkb3z SAS packages and test datasets available here: https://t.co/fimoh8txSp #sentinelinitiative #distributedanalysis
StatsPapers: A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression. https://t.co/qDtDMngieG
DeptPopMed: RT @darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https:…
jess_malenfant: RT @darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https:…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 13959
Unqiue Words: 3124

0.0 Mikeys
#5. Correlated pseudo-marginal Metropolis-Hastings using quasi-Newton proposals
Johan Dahlin, Adrian Wills, Brett Ninness
Pseudo-marginal Metropolis-Hastings (pmMH) is a versatile algorithm for sampling from target distributions which are not easy to evaluate point-wise. However, pmMH requires good proposal distributions to sample efficiently from the target, which can be problematic to construct in practice. This is especially a problem for high-dimensional targets when the standard random-walk proposal is inefficient. We extend pmMH to allow for constructing the proposal based on information from multiple past iterations. As a consequence, quasi-Newton (qN) methods can be employed to form proposals which utilize gradient information to guide the Markov chain to areas of high probability and to construct approximations of the local curvature to scale step sizes. The proposed method is demonstrated on several problems which indicate that qN proposals can perform better than other common Hessian-based proposals.
more | pdf | html
Figures
None.
Tweets
Github

Correlated pseudo-marginal Metropolis-Hastings using quasi-Newton proposals

Repository: pmmh-qn
User: compops
Language: Python
Stargazers: 1
Subscribers: 2
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 13469
Unqiue Words: 2892

0.0 Mikeys
#6. MPS: An R package for modelling new families of distributions
Mahdi Teimouri
We introduce an \verb|R| package, called \verb|MPS|, for computing the probability density function, computing the cumulative distribution function, computing the quantile function, simulating random variables, and estimating the parameters of 24 new shifted families of distributions. By considering an extra shift (location) parameter for each family more flexibility yields. Under some situations, since the maximum likelihood estimators may fail to exist, we adopt the well-known maximum product spacings approach to estimate the parameters of shifted 24 new families of distributions. The performance of the \verb|MPS| package for computing the cdf, pdf, and simulating random samples will be checked by examples. The performance of the maximum product spacings approach is demonstrated by executing \verb|MPS| package for three sets of real data. As it will be shown, for the first set, the maximum likelihood estimators break down but \verb|MPS| package find them. For the second set, adding the location parameter leads to acceptance the...
more | pdf | html
Figures
None.
Tweets
siero5335: RT @StatsPapers: MPS: An R package for modelling new families of distributions. https://t.co/bBR4Zqjhtt
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 13349
Unqiue Words: 3143

0.0 Mikeys
#7. Adaptive Approximation Error Models for Efficient Uncertainty Quantification with Application to Multiphase Subsurface Fluid Flow
Tiangang Cui, Colin Fox, Michael J O'Sullivan
Sample-based Bayesian inference provides a route to uncertainty quantification in the geosciences, though is very computationally demanding in the na\"ive form that requires simulating an accurate computer model at each iteration. We present a new approach that adaptively builds a stochastic model for the error induced by a reduced model. This enables sampling from the correct target distribution at reduced computational cost, while avoiding appreciable loss of statistical efficiency. We build on recent simplified conditions for adaptive Markov chain Monte Carlo algorithms to give practical approximation schemes and algorithms with guaranteed convergence. We demonstrate the efficacy of our new approach on two computational examples, including calibration of a large-scale numerical model of a real geothermal reservoir, that show good computational and statistical efficiencies on both synthetic and measured data sets.
more | pdf | html
Figures
Tweets
StatsPapers: Adaptive Approximation Error Models for Efficient Uncertainty Quantification with Application to Multiphase Subsurface Fluid Flow. https://t.co/5KjSPxsmYB
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 12392
Unqiue Words: 2969

0.0 Mikeys
#8. Iterative proportional scaling revisited: a modern optimization perspective
Yiyuan She, Shao Tang
This paper revisits the classic iterative proportional scaling (IPS) from a modern optimization perspective. In contrast to the criticisms made in the literature, we show that based on a coordinate descent characterization, IPS can be slightly modified to deliver coefficient estimates, and from a majorization-minimization standpoint, IPS can be extended to handle log-affine models with features not necessarily binary-valued or nonnegative. Furthermore, some state-of-the-art optimization techniques such as block-wise computation, randomization and momentum-based acceleration can be employed to provide more scalable IPS algorithms, as well as some regularized variants of IPS for concurrent feature selection.
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 11832
Unqiue Words: 3305

0.0 Mikeys
#9. Biologically Plausible Online Principal Component Analysis Without Recurrent Neural Dynamics
Victor Minden, Cengiz Pehlevan, Dmitri B. Chklovskii
Artificial neural networks that learn to perform Principal Component Analysis (PCA) and related tasks using strictly local learning rules have been previously derived based on the principle of similarity matching: similar pairs of inputs should map to similar pairs of outputs. However, the operation of these networks (and of similar networks) requires a fixed-point iteration to determine the output corresponding to a given input, which means that dynamics must operate on a faster time scale than the variation of the input. Further, during these fast dynamics such networks typically "disable" learning, updating synaptic weights only once the fixed-point iteration has been resolved. Here, we derive a network for PCA-based dimensionality reduction that avoids this fast fixed-point iteration. The key novelty of our approach is a modification of the similarity matching objective to encourage near-diagonality of a synaptic weight matrix. We then approximately invert this matrix using a Taylor series approximation, replacing the previous...
more | pdf | html
Figures
Tweets
BrundageBot: Biologically Plausible Online Principal Component Analysis Without Recurrent Neural Dynamics. Victor Minden, Cengiz Pehlevan, and Dmitri B. Chklovskii https://t.co/fRQi1HWga7
chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementations, these are free from recurrent neural dynamics largely absent in early sensory systems.
StatsPapers: Biologically Plausible Online Principal Component Analysis Without Recurrent Neural Dynamics. https://t.co/B7pcjmd1Vb
VenkRamaswamy: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
AdamHantman: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
SCglobalbrain: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
shlizee: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
GunnarBlohm: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
DanBumbarger: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
ConstantineDovr: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
johntigue: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
OestlundMartin: RT @chklovskii: New biologically plausible neural networks for dimensionality reduction: https://t.co/Wbmrms8yjU Unlike previous implementa…
vdf12827: RT @StatsPapers: Biologically Plausible Online Principal Component Analysis Without Recurrent Neural Dynamics. https://t.co/B7pcjmd1Vb
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6265
Unqiue Words: 1735

0.0 Mikeys
#10. plsRglm: Partial least squares linear and generalized linear regression for processing incomplete datasets by cross-validation and bootstrap techniques with R
F. Bertrand, M. Maumy-Bertrand
The aim of the plsRglm package is to deal with complete and incomplete datasets through several new techniques or, at least, some which were not yet implemented in R. Indeed, not only does it make available the extension of the PLS regression to the generalized linear regression models, but also bootstrap techniques, leave-one-out and repeated $k$-fold cross-validation. In addition, graphical displays help the user to assess the significance of the predictors when using bootstrap techniques. Biplots (Fig. 4) can be used to delve into the relationship between individuals and variables.
more | pdf | html
Figures
None.
Tweets
StatsPapers: plsRglm: Partial least squares linear and generalized linear regression for processing incomplete datasets by cross-validation and bootstrap techniques with R. https://t.co/DSVlMa6iSq
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 2212
Unqiue Words: 976

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.