Top 5 arXiv Papers Today in Methodology


2.009 Mikeys
#1. Principal nested shape space analysis of molecular dynamics data
Ian L. Dryden, Kwang-Rae Kim, Charles A. Laughton, Huiling Le
Molecular dynamics simulations produce huge datasets of temporal sequences of molecules. It is of interest to summarize the shape evolution of the molecules in a succinct, low-dimensional representation. However, Euclidean techniques such as principal components analysis (PCA) can be problematic as the data may lie far from a flat manifold. Principal nested spheres gives a fundamentally different decomposition of data from the usual Euclidean sub-space based PCA (Jung, Dryden and Marron, 2012, Biometrika). Sub-spaces of successively lower dimension are fitted to the data in a backwards manner, with the aim of retaining signal and dispensing with noise at each stage. We adapt the methodology to 3D sub-shape spaces and provide some practical fitting algorithms. The methodology is applied to cluster analysis of peptides, where different states of the molecules can be identified. Also, the temporal transitions between cluster states are explored.
Tweets
razoralign: Principal nested shape space analysis of molecular dynamics data https://t.co/JQsbawhazl https://t.co/TLNib8U67o
StatsPapers: Principal nested shape space analysis of molecular dynamics data. https://t.co/AS7ozhb1b4
MattChallacombe: RT @StatsPapers: Principal nested shape space analysis of molecular dynamics data. https://t.co/AS7ozhb1b4
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 8362
Unique Words: 2093

2.007 Mikeys
#2. Differentially Private Nonparametric Hypothesis Testing
Simon Couch, Zeki Kazan, Kaiyan Shi, Andrew Bray, Adam Groce
Hypothesis tests are a crucial statistical tool for data mining and are the workhorse of scientific research in many fields. Here we study differentially private tests of independence between a categorical and a continuous variable. We take as our starting point traditional nonparametric tests, which require no distributional assumption (e.g., normality) about the data distribution. We present private analogues of the Kruskal-Wallis, Mann-Whitney, and Wilcoxon signed-rank tests, as well as the parametric one-sample t-test. These tests use novel test statistics developed specifically for the private setting. We compare our tests to prior work, both on parametric and nonparametric tests. We find that in all cases our new nonparametric tests achieve large improvements in statistical power, even when the assumptions of parametric tests are met.
Tweets
ArtofWarm: Differentially Private Nonparametric Hypothesis Testing https://t.co/NpshVtkiyp https://t.co/1tlWXapAeq
StatsPapers: Differentially Private Nonparametric Hypothesis Testing. https://t.co/y65LADh4dp
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 5
Total Words: 24614
Unique Words: 4060

2.004 Mikeys
#3. Estimating the sample mean and standard deviation from commonly reported quantiles in meta-analysis
Sean McGrath, XiaoFei Zhao, Russell Steele, Brett D. Thombs, Andrea Benedetti, the DEPRESsion Screening Data Collaboration
Researchers increasingly use meta-analysis to synthesize the results of several studies in order to estimate a common effect. When the outcome variable is continuous, standard meta-analytic approaches assume that the primary studies report the sample mean and standard deviation of the outcome. However, when the outcome is skewed, authors sometimes summarize the data by reporting the sample median and one or both of (i) the minimum and maximum values and (ii) the first and third quartiles, but do not report the mean or standard deviation. To include these studies in meta-analysis, several methods have been developed to estimate the sample mean and standard deviation from the reported summary data. A major limitation of these widely used methods is that they assume that the outcome distribution is normal, which is unlikely to be tenable for studies reporting medians. We propose two novel approaches to estimate the sample mean and standard deviation when data are suspected to be non-normal. Our simulation results and empirical...
Tweets
StatsPapers: Estimating the sample mean and standard deviation from commonly reported quantiles in meta-analysis. https://t.co/DIYw9KzDay
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 7
Total Words: 10687
Unique Words: 2546

2.003 Mikeys
#4. Doubly stochastic models for replicated spatio-temporal point processes
Daniel Gervini
This paper proposes a log-linear model for the latent intensity functions of a replicated spatio-temporal point process. By simultaneously fitting correlated spatial and temporal Karhunen-Loève expansions, the model produces spatial and temporal components that are usually easy to interpret and capture the most important modes of variation and spatio-temporal correlation of the process. The asymptotic distribution of the estimators is derived. The finite sample properties are studied by simulations. As an example of application, we analyze bike usage patterns on the Divvy bike sharing system of the city of Chicago.
Figures
None.
Tweets
StatsPapers: Doubly stochastic models for replicated spatio-temporal point processes. https://t.co/KNm52HLIJD
HirokiT57858674: RT @StatsPapers: Doubly stochastic models for replicated spatio-temporal point processes. https://t.co/KNm52HLIJD
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 1
Total Words: 8045
Unique Words: 2458

2.001 Mikeys
#5. The Hitchhikers Guide to Nonlinear Filtering
Anna Kutschireiter, Simone Carlo Surace, Jean-Pascal Pfister
Nonlinear filtering is the problem of online estimation of a dynamic hidden variable from incoming data and has vast applications in different fields, ranging from engineering and machine learning to economic science and the natural sciences. We start our review of the theory on nonlinear filtering from the simplest filtering task we can think of, namely static Bayesian inference. From there we continue our journey through discrete-time models, which are usually encountered in machine learning, and generalize to and further emphasize continuous-time filtering theory. The idea of changing the probability measure connects and elucidates several aspects of the theory, such as the similarities between the discrete and continuous time nonlinear filtering equations, as well as formulations of these for different observation models. Furthermore, it gives insight into the construction of particle filtering algorithms. This tutorial is targeted at researchers in machine learning, time series analysis, and the natural sciences, and should serve...
Tweets
StatsPapers: The Hitchhikers Guide to Nonlinear Filtering. https://t.co/qVuNJlWYWw
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 17355
Unique Words: 3648

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 100,377 papers.
