Top 10 arXiv Papers Today in Computation


1.998 Mikeys
#1. Estimation of Multivariate Wrapped Models for Data in Torus
Anahita Nodehi, Mousa Golalizadeh, Mehdi Maadooliat, Claudio Agostinelli
Multivariate circular observations, i.e. points on a torus, are nowadays very common. Multivariate wrapped models are often appropriate to describe data points scattered on a p-dimensional torus. However, statistical inference based on this model is quite complicated since each contribution to the log-likelihood involves an infinite sum over indices in Z^p, where p is the dimension of the problem. To overcome this, two estimation procedures based on Expectation Maximization and Classification Expectation Maximization algorithms are proposed that work well in moderate dimensions. The performance of the introduced methods is studied by Monte Carlo simulation and illustrated on three real data sets.
Figures
None.
Tweets
StatsPapers: Estimation of Multivariate Wrapped Models for Data in Torus. https://t.co/spePfKvGa3
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 6907
Unique Words: 1847
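
The infinite sum mentioned in the abstract of #1 can be illustrated in one dimension. The sketch below evaluates a univariate wrapped normal density by truncating the sum at a cutoff `K`. The function names and the cutoff are assumptions made for illustration; this is not the EM or CEM procedure proposed in the paper.

```python
import math


def wrapped_normal_pdf(theta, mu, sigma, K=10):
    """Wrapped normal density at angle theta, truncating the sum to k = -K..K."""
    total = 0.0
    for k in range(-K, K + 1):
        # Each term is a normal density evaluated at theta shifted by 2*pi*k.
        z = (theta + 2.0 * math.pi * k - mu) / sigma
        total += math.exp(-0.5 * z * z)
    return total / (sigma * math.sqrt(2.0 * math.pi))


def integrate_over_circle(mu, sigma, n=2000):
    """Midpoint-rule integral of the truncated density over [0, 2*pi)."""
    h = 2.0 * math.pi / n
    return h * sum(wrapped_normal_pdf((i + 0.5) * h, mu, sigma) for i in range(n))
```

In p dimensions the same truncation ranges over a grid in Z^p, so the number of terms grows as (2K + 1)^p, which is one way to see why direct likelihood evaluation becomes expensive.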

0.0 Mikeys
#2. A Scalable MCEM Estimator for Spatio-Temporal Autoregressive Models
Philipp Hunziker, Julian Wucherpfennig, Aya Kachi, Nils-Christian Bormann
Very large spatio-temporal lattice data are becoming increasingly common across a variety of disciplines. However, estimating interdependence across space and time in large areal datasets remains challenging, as existing approaches are often (i) not scalable, (ii) designed for conditionally Gaussian outcome data, or (iii) limited to cross-sectional and univariate outcomes. This paper proposes an MCEM estimation strategy for a family of latent-Gaussian multivariate spatio-temporal models that addresses these issues. The proposed estimator is applicable to a wide range of non-Gaussian outcomes, and implementations for binary and count outcomes are discussed explicitly. The methodology is illustrated on simulated data, as well as on weekly data of IS-related events in Syrian districts.
Figures
None.
Tweets
arxiv_org: A Scalable MCEM Estimator for Spatio-Temporal Autoregressive Models. https://t.co/lTAyjCRs2k https://t.co/JrRuCuAhiV
kachi_aya: New working paper up: "A Scalable MCEM Estimator for Spatio-Temporal Autoregressive Models" w/ Julian Nils & Philipp https://t.co/Vsp1gEEYkt https://t.co/BinQJIksTb
DrPjenFI: RT @arxiv_org: A Scalable MCEM Estimator for Spatio-Temporal Autoregressive Models. https://t.co/lTAyjCRs2k https://t.co/JrRuCuAhiV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 9948
Unique Words: 2831

0.0 Mikeys
#3. Scalable Gaussian Process Computations Using Hierarchical Matrices
Christopher J. Geoga, Mihai Anitescu, Michael L. Stein
We present a kernel-independent method that applies hierarchical matrices to the problem of maximum likelihood estimation for Gaussian processes. The proposed approximation provides natural and scalable stochastic estimators for its gradient and Hessian, as well as the expected Fisher information matrix, that are computable in quasilinear $O(n \log^2 n)$ complexity for a large range of models. To accomplish this, we (i) choose a specific hierarchical approximation for covariance matrices that enables the computation of their exact derivatives and (ii) use a stabilized form of the Hutchinson stochastic trace estimator. Since both the observed and expected information matrices can be computed in quasilinear complexity, covariance matrices for MLEs can also be estimated efficiently. After discussing the associated mathematics, we demonstrate the scalability of the method, discuss details of its implementation, and validate that the resulting MLEs and confidence intervals based on the inverse Fisher information matrix faithfully...
Figures
Tweets
dan_p_simpson: O(n log^2 n) scaling for maximum likelihood estimation for Gaussian processes. Pretty cool, although it's unclear how to extend it to non-Gaussian observations. https://t.co/9o1fjWx4aj
StatsPapers: Scalable Gaussian Process Computations Using Hierarchical Matrices. https://t.co/vSchlXb31R
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 9003
Unique Words: 2406
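
The abstract of #3 relies on the Hutchinson stochastic trace estimator. A minimal sketch of the plain estimator with Rademacher probe vectors follows. It is not the stabilized form the authors describe, and the helper names (`hutchinson_trace`, `dense_matvec`) are hypothetical.

```python
import random


def hutchinson_trace(matvec, n, num_probes=100, seed=0):
    """Estimate tr(A) as the average of z^T A z over random +/-1 vectors z."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_probes):
        z = [rng.choice((-1.0, 1.0)) for _ in range(n)]
        Az = matvec(z)  # only matrix-vector products with A are needed
        total += sum(zi * ai for zi, ai in zip(z, Az))
    return total / num_probes


def dense_matvec(A):
    """Wrap a dense list-of-lists matrix as a matrix-vector product function."""
    def mv(v):
        return [sum(a * x for a, x in zip(row, v)) for row in A]
    return mv
```

Because the estimator only touches the matrix through products with vectors, it pairs naturally with any representation that makes those products cheap, which is presumably where the hierarchical approximation enters.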

0.0 Mikeys
#4. Iterative proportional scaling revisited: a modern optimization perspective
Yiyuan She, Shao Tang
This paper revisits the classic iterative proportional scaling (IPS) from a modern optimization perspective. In contrast to the criticisms made in the literature, we show that based on a coordinate descent characterization, IPS can be slightly modified to deliver coefficient estimates, and from a majorization-minimization standpoint, IPS can be extended to handle log-affine models with features not necessarily binary-valued or nonnegative. Furthermore, some state-of-the-art optimization techniques such as block-wise computation, randomization and momentum-based acceleration can be employed to provide more scalable IPS algorithms, as well as some regularized variants of IPS for concurrent feature selection.
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 11832
Unique Words: 3305
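
For readers unfamiliar with the classic procedure that #4 revisits, here is a minimal sketch of iterative proportional scaling on a two-way table: rows and columns are rescaled in turn until the table matches the target margins. This is the textbook version under the assumption that the row and column targets share the same total. It is not one of the modified or accelerated variants proposed in the paper, and the function name `ipf` is an illustrative choice.

```python
def ipf(table, row_targets, col_targets, iters=100, tol=1e-10):
    """Rescale a nonnegative two-way table to match target row and column sums."""
    fit = [row[:] for row in table]  # work on a copy
    for _ in range(iters):
        # Row step: scale each row to its target sum.
        for i, target in enumerate(row_targets):
            s = sum(fit[i])
            if s > 0:
                fit[i] = [v * target / s for v in fit[i]]
        # Column step: scale each column to its target sum.
        for j, target in enumerate(col_targets):
            s = sum(row[j] for row in fit)
            if s > 0:
                for row in fit:
                    row[j] *= target / s
        # Columns now match exactly, so convergence is checked on the rows.
        err = max(abs(sum(fit[i]) - t) for i, t in enumerate(row_targets))
        if err < tol:
            break
    return fit
```

Starting from a table of ones, the fitted table is the independence table implied by the margins.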

0.0 Mikeys
#5. corr2D - Implementation of Two-Dimensional Correlation Analysis in R
Robert Geitner, Robby Fritzsch, Jürgen Popp, Thomas W. Bocklitz
In the package corr2D two-dimensional correlation analysis is implemented in R. This paper describes how two-dimensional correlation analysis is done in the package and how the mathematical equations are translated into R code. The paper features a simple tutorial with executable code for beginners, insight into the calculations done before the correlation analysis, a detailed look at the parallelization of the fast Fourier transformation based correlation analysis and a speed test of the calculation. The package corr2D offers the possibility to preprocess, correlate and postprocess spectroscopic data using exclusively the R language. Thus, corr2D is a welcome addition to the toolbox of spectroscopists and makes two-dimensional correlation analysis more accessible and transparent.
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 13641
Unique Words: 3189
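
As a rough illustration of the analysis in #5, the sketch below computes a synchronous two-dimensional correlation spectrum in the usual generalized form: each spectral variable is mean-centered across the perturbation series, and the centered values are cross-multiplied and averaged. It is written in Python rather than R, covers only the synchronous part, and does not reproduce the corr2D package's API or its FFT-based implementation. The function name is hypothetical.

```python
def synchronous_spectrum(spectra):
    """Synchronous 2D correlation matrix from m spectra (rows) of n variables."""
    m = len(spectra)
    n = len(spectra[0])
    # Mean spectrum across the perturbation series.
    mean = [sum(row[j] for row in spectra) / m for j in range(n)]
    # Dynamic spectra: deviations from the mean spectrum.
    dyn = [[row[j] - mean[j] for j in range(n)] for row in spectra]
    # Phi[a][b] = 1/(m-1) * sum over spectra of dyn[a] * dyn[b].
    return [
        [sum(d[a] * d[b] for d in dyn) / (m - 1) for b in range(n)]
        for a in range(n)
    ]
```

The result is symmetric, with the diagonal holding the variance of each spectral variable across the series.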

0.0 Mikeys
#6. A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression
Qoua L. Her, Yury Vilk, Jessica Young, Zilu Zhang, Jessica M. Malenfant, Sarah Malek, Sengwee Toh
Previous work has demonstrated the feasibility and value of conducting distributed regression analysis (DRA), a privacy-protecting analytic method that performs multivariable-adjusted regression analysis with only summary-level information from participating sites. To our knowledge, there are no DRA applications in SAS, the statistical software used by several large national distributed data networks (DDNs), including the Sentinel System and PCORnet. SAS/IML is available to perform the required matrix computations for DRA in the SAS system. However, not all data partners in these large DDNs have access to SAS/IML, which is licensed separately. In this first article of a two-paper series, we describe a DRA application developed for use in Base SAS and SAS/STAT modules for linear and logistic DRA within horizontally partitioned DDNs and its successful tests.
Figures
Tweets
darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https://t.co/sRGW9Kkb3z SAS packages and test datasets available here: https://t.co/fimoh8txSp #sentinelinitiative #distributedanalysis
StatsPapers: A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression. https://t.co/qDtDMngieG
DeptPopMed: RT @darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https:…
jess_malenfant: RT @darrentoh_epi: Distributed linear, logistic, and Cox regression based on #SAS, preprints available here: https://t.co/uGfKQCXRXW https:…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 13959
Unique Words: 3124
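
The idea in #6 of fitting a regression from summary-level information only can be sketched for simple linear regression with one covariate. Each site shares a handful of sums, and a coordinating site combines them and solves the normal equations. This is an illustrative Python sketch under the assumption of horizontally partitioned data (each site holds different rows); it is not the SAS application described in the paper, and the function names are hypothetical.

```python
def site_summary(xs, ys):
    """Summary statistics a site shares instead of individual-level rows."""
    return {
        "n": len(xs),
        "sx": sum(xs),
        "sy": sum(ys),
        "sxx": sum(x * x for x in xs),
        "sxy": sum(x * y for x, y in zip(xs, ys)),
    }


def combine_and_fit(summaries):
    """Pool site summaries and solve the normal equations for y = a + b*x."""
    n = sum(s["n"] for s in summaries)
    sx = sum(s["sx"] for s in summaries)
    sy = sum(s["sy"] for s in summaries)
    sxx = sum(s["sxx"] for s in summaries)
    sxy = sum(s["sxy"] for s in summaries)
    det = n * sxx - sx * sx  # zero only if every x is identical
    slope = (n * sxy - sx * sy) / det
    intercept = (sy - slope * sx) / n
    return intercept, slope
```

The pooled estimates equal those from a single regression on the stacked data, because the normal equations depend on the data only through these sums. Logistic regression has no such closed form and would need the sites to exchange summaries over several iterations.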

0.0 Mikeys
#7. A distributed regression analysis application based on SAS software Part II: Cox proportional hazards regression
Yury Vilk, Zilu Zhang, Jessica Young, Qoua L. Her, Jessica M. Malenfant, Sarah Malek, Sengwee Toh
Previous work has demonstrated the feasibility and value of conducting distributed regression analysis (DRA), a privacy-protecting analytic method that performs multivariable-adjusted regression analysis with only summary-level information from participating sites. To our knowledge, there are no DRA applications in SAS, the statistical software used by several large national distributed data networks (DDNs), including the Sentinel System and PCORnet. SAS/IML is available to perform the required matrix computations for DRA in the SAS system. However, not all data partners in these large DDNs have access to SAS/IML, which is licensed separately. In this second article of a two-paper series, we describe a DRA application developed using Base SAS and SAS/STAT modules for distributed Cox proportional hazards regression within horizontally partitioned DDNs and its successful tests.
Figures
None.
Tweets
kaz_yos: [1808.02392] A distributed regression analysis application based on SAS software Part II: Cox proportional hazards https://t.co/TEIyrzy3i9
StatsPapers: A distributed regression analysis application based on SAS software Part II: Cox proportional hazards regression. https://t.co/mhMYgbrFYh
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 8517
Unique Words: 2160

0.0 Mikeys
#8. Optimal allocation of subjects in a cluster randomized trial with fixed number of clusters when the ICCs or costs are heterogeneous over clusters
Satya Prakash Singh, Pradeep Yadav
The intra-cluster correlation coefficient (ICC) plays an important role in designing cluster randomized trials (CRTs). Often optimal CRTs are designed assuming that the magnitude of the ICC is constant across the clusters. However, this assumption is rarely satisfied. In some applications, precise information about the cluster-specific correlation is known in advance. In this article, we propose an optimal design with non-constant ICC across the clusters. Also, in many situations, the cost of sampling an observation from a particular cluster may differ from that of some other cluster. An optimal design in those scenarios is also obtained assuming unequal costs of sampling from different clusters. The theoretical findings are supplemented by thorough numerical examples.
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 6934
Unique Words: 1913

0.0 Mikeys
#9. An Empirical Evaluation of the Approximation of Subjective Logic Operators Using Monte Carlo Simulations
Fabio Massimo Zennaro, Magdalena Ivanovska, Audun Jøsang
In this paper we analyze the use of subjective logic as a framework for performing approximate transformations over probability distribution functions. As for any approximation, we evaluate subjective logic in terms of computational efficiency and bias. However, while the computational cost may be easily estimated, the bias of subjective logic operators has not yet been investigated. In order to evaluate this bias, we propose an experimental protocol that exploits Monte Carlo simulations and their properties to assess the distance between the result produced by subjective logic operators and the true result of the corresponding transformation over probability distributions. This protocol allows a modeler to get an estimate of the degree of approximation she must be ready to accept as a trade-off for the computational efficiency and the interpretability of the subjective logic framework. Concretely, we apply our method to the relevant case study of the subjective logic operator for binomial multiplication and we study empirically...
Figures
Tweets
Github

Empirical Evaluation of the Approximation of Binomial Multiplication in Subjective Logic using Monte Carlo Simulations

Repository: SLBinomialProduct
User: FMZennaro
Language: None
Stargazers: 0
Subscribers: 0
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 10657
Unique Words: 1953

0.0 Mikeys
#10. Ball: An R package for detecting distribution difference and association in metric spaces
Jin Zhu, Wenliang Pan, Wei Zheng, Xueqin Wang
The rapid development of modern technology facilitates the appearance of numerous unprecedented complex data which do not satisfy the axioms of Euclidean geometry, while most of the statistical hypothesis tests are available in Euclidean or Hilbert spaces. To properly analyze data with more complicated structures, efforts have been made to solve the fundamental test problems in more general spaces. In this paper, a publicly available R package Ball is provided to implement Ball statistical test procedures for K-sample distribution comparison and test of mutual independence in metric spaces, which extend the test procedures for two-sample distribution comparison and test of independence. Tailor-made algorithms as well as engineering techniques are employed in the Ball package to speed up computation to the best of our ability. Two real data analyses and several numerical studies have been performed and the results demonstrate the power of the Ball package in analyzing complex data, e.g., spherical data and symmetric positive...
Figures
Tweets
StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
RexDouglass: RT @StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
madsyair: RT @StatsPapers: Ball: An R package for detecting distribution difference and association in metric spaces. https://t.co/WuGgyaYUyk
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 9591
Unique Words: 2672

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 58,338 papers.
