Top 10 Arxiv Papers Today in Methodology


0.0 Mikeys
#1. Reconciliation of probabilistic forecasts with an application to wind power
Jooyoung Jeon, Anastasios Panagiotelis, Fotios Petropoulos
New methods are proposed for adjusting probabilistic forecasts to ensure coherence with the aggregation constraints inherent in temporal hierarchies. The different approaches nested within this framework include methods that exploit information at all levels of the hierarchy as well as a novel method based on cross-validation. The methods are evaluated using real data from two wind farms in Crete, an application where it is imperative for optimal decisions related to grid operations and bidding strategies to be based on coherent probabilistic forecasts of wind power. Empirical evidence is also presented showing that probabilistic forecast reconciliation improves the accuracy of both point forecasts and probabilistic forecasts.
more | pdf | html
Figures
None.
Tweets
StatsPapers: Reconciliation of probabilistic forecasts with an application to wind power. https://t.co/CvojlntByj
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 10643
Unqiue Words: 2571

0.0 Mikeys
#2. Simultaneous Inference for Best Linear Predictor of the Conditional Average Treatment Effect and Other Structural Functions
Victor Chernozhukov, Vira Semenova
This paper provides estimation and inference methods for a structural function, such as Conditional Average Treatment Effect (CATE), based on modern machine learning (ML) tools. We assume that such function can be represented as an expectation g(x) of a signal Y conditional on X that depends on an unknown nuisance function. In addition to CATE, examples of such functions include regression function with Partially Missing Outcome and Conditional Average Partial Derivative. We approximate g(x) by a linear form that is a product of a vector of the approximating basis functions p(x) and the Best Linear Predictor (BLP), which we refer to a pseudo-target. Plugging in the first-stage estimate of the nuisance function into the signal, we estimate BLP via ordinary least squares. We deliver a high-quality estimate of the pseudo-target function that features (a) a pointwise Gaussian approximation, (b) a simultaneous Gaussian approximation, and (c) optimal rate of simultaneous convergence. In the case, the misspecification error of the linear...
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 15984
Unqiue Words: 3104

0.0 Mikeys
#3. Inference via low-dimensional couplings
Alessio Spantini, Daniele Bigoni, Youssef Marzouk
We investigate the low-dimensional structure of deterministic transformations between random variables, i.e., transport maps between probability measures. In the context of statistics and machine learning, these transformations can be used to couple a tractable "reference" measure (e.g., a standard Gaussian) with a target measure of interest. Direct simulation from the desired measure can then be achieved by pushing forward reference samples through the map. Yet characterizing such a map---e.g., representing and evaluating it---grows challenging in high dimensions. The central contribution of this paper is to establish a link between the Markov properties of the target measure and the existence of low-dimensional couplings, induced by transport maps that are sparse and/or decomposable. Our analysis not only facilitates the construction of transformations in high-dimensional settings, but also suggests new inference methodologies for continuous non-Gaussian graphical models. For instance, in the context of nonlinear state-space...
more | pdf | html
Figures
None.
Tweets
sam_power_825: @ChadScherrer @junpenglao @aseyboldt not quite NUTS/ADVI, but this work - https://t.co/GJdwCTYqNx - might be of interest. they use Markov structure on the graphical model to build localised (normalising flows) / (transport maps) to approximate the posterior. nice talk about it here - https://t.co/jWh52v9VnA.
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 33261
Unqiue Words: 5530

0.0 Mikeys
#4. Joint species distribution modeling with additive multivariate Gaussian process priors and heteregenous data
Jarno Vanhatalo, Marcelo Hartmann, Lari Veneranta
In this work, we propose JSDMs where the responses to environmental covariates are modeled with multivariate additive Gaussian processes. These allow inference for wide range of functional forms and interspecific correlations between the responses. We propose also an efficient approach for inference by utilizing Laplace approximation with a parameterization of the interspecific covariance matrices on the euclidean space. We demonstrate the benefits of our model with two small scale examples and one real world case study. We use cross-validation to compare the proposed model to analogous single species models in interpolation and extrapolation tasks. The proposed model outperforms the single species models in both cases. We also show that the proposed model can be seen as an extension of the current state-of-the-art JSDMs to semiparametric models.
more | pdf | html
Figures
Tweets
StatsPapers: Joint species distribution modeling with additive multivariate Gaussian process priors and heteregenous data. https://t.co/s8YMseWFKJ
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 16757
Unqiue Words: 4041

0.0 Mikeys
#5. On a Loss-based prior for the number of components in mixture models
Clara Grazian, Cristiano Villa, Brunero Liseo
We propose a prior distribution for the number of components of a finite mixture model. The novelty is that the prior distribution is obtained by considering the loss one would incur if the true value representing the number of components were not considered. The prior has an elegant and easy to implement structure, which allows to naturally include any prior information one may have as well as to opt for a default solution in cases where this information is not available. The performance of the prior, and comparison with existing alternatives, is studied through the analysis of both real and simulated data.
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 4765
Unqiue Words: 1566

0.0 Mikeys
#6. Modelling Preference Data with the Wallenius Distribution
Clara Grazian, Fabrizio Leisen, Brunero Liseo
The Wallenius distribution is a generalisation of the Hypergeometric distribution where weights are assigned to balls of different colours. This naturally defines a model for ranking categories which can be used for classification purposes. Since, in general, the resulting likelihood is not analytically available, we adopt an approximate Bayesian computational (ABC) approach for estimating the importance of the categories. We illustrate the performance of the estimation procedure on simulated datasets. Finally, we use the new model for analysing two datasets about movies ratings and Italian academic statisticians' journal preferences. The latter is a novel dataset collected by the authors.
more | pdf | html
Figures
None.
Tweets
ABC_Research: Grazian et al use ABC to Model Preference Data with the Wallenius Distribution https://t.co/affCS13XcM
bayesian_stats: RT @ABC_Research: Grazian et al use ABC to Model Preference Data with the Wallenius Distribution https://t.co/affCS13XcM
AlexanderLyNL: RT @ABC_Research: Grazian et al use ABC to Model Preference Data with the Wallenius Distribution https://t.co/affCS13XcM
clarabayes: RT @ABC_Research: Grazian et al use ABC to Model Preference Data with the Wallenius Distribution https://t.co/affCS13XcM
Beta_GOS: RT @ABC_Research: Grazian et al use ABC to Model Preference Data with the Wallenius Distribution https://t.co/affCS13XcM
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 7703
Unqiue Words: 2435

0.0 Mikeys
#7. Node-specific effects in latent space modelling of multidimensional networks
Silvia D'Angelo, Marco Alfò, Thomas Brendan Murphy
Observed multidimensional network data can have different levels of complexity, as nodes may be characterized by heterogeneous individual-specific features. Also, such characteristics may vary across the networks. This article discusses a novel class of models for multidimensional networks, able to deal with different levels of heterogeneity within and between networks. The proposed framework is developed within the family of latent space models, in order to distinguish recurrent symmetrical relations between the nodes from node-specific features in the different views. Models parameters are estimated via a Markov Chain Monte Carlo algorithm. Simulated data and also FAO fruits import/export data are analysed to illustrate the performances of the proposed models.
more | pdf | html
Figures
None.
Tweets
BayeSNA: Node-specific effects in latent space modelling of multidimensional networks https://t.co/nOSvTmKXXU
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 13715
Unqiue Words: 2522

0.0 Mikeys
#8. The conditional permutation test
Thomas B. Berrett, Yi Wang, Rina Foygel Barber, Richard J. Samworth
We propose a general new method, the \emph{conditional permutation test}, for testing the conditional independence of variables $X$ and $Y$ given a potentially high-dimensional random vector $Z$ that may contain confounding factors. The proposed test permutes entries of $X$ non-uniformly, so as to respect the existing dependence between $X$ and $Z$ and thus account for the presence of these confounders. Like the conditional randomization test of \citet{candes2018panning}, our test relies on the availability of an approximation to the distribution of $X \mid Z$---while \citet{candes2018panning}'s test uses this estimate to draw new $X$ values, for our test we use this approximation to design an appropriate non-uniform distribution on permutations of the $X$ values already seen in the true data. We provide an efficient Markov Chain Monte Carlo sampler for the implementation of our method, and establish bounds on the Type~I error in terms of the error in the approximation of the conditional distribution of $X\mid Z$, finding that,...
more | pdf | html
Figures
None.
Tweets
wittawatj: » [1807.05405] The conditional permutation test https://t.co/J4Ufuollfe
MarcosMatabuena: RT @StatsPapers: The conditional permutation test. https://t.co/wz0ogY42eK
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 10934
Unqiue Words: 2331

0.0 Mikeys
#9. Assessing Method Agreement for Paired Repeated Binary Measurements
Wei Wang, Nan Lin, Jordan D. Oberhaus, Michael S. Avidan
Method comparison studies are essential for development in medical and clinical fields. These studies often compare a cheaper, faster, or less invasive measuring method with a widely used one to see if they have sufficient agreement for interchangeable use. In the clinical and medical context, the response measurement is usually impacted not only by the measuring method but by the rater as well. This paper proposes a model-based approach to assess agreement of two measuring methods for paired repeated binary measurements under the scenario when the agreement between two measuring methods and the agreement among raters are required to be studied in a unified framework. Based upon the generalized linear mixed models (GLMM), the decision on the adequacy of interchangeable use is made by testing the equality of fixed effects of methods. Approaches for assessing method agreement, such as the Bland-Altman diagram and Cohen's kappa, are also developed for repeated binary measurements based upon the latent variables in GLMMs. We assess...
more | pdf | html
Figures
Tweets
StatsPapers: Assessing Method Agreement for Paired Repeated Binary Measurements. https://t.co/tgffbp0FOd
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7250
Unqiue Words: 1853

0.0 Mikeys
#10. A flexible sequential Monte Carlo algorithm for shape-constrained regression
Kenyon Ng, Kevin Murray, Berwin A. Turlach
We propose an algorithm that is capable of imposing shape constraints on regression curves, without requiring the constraints to be written as closed-form expressions, nor assuming the functional form of the loss function. Our algorithm, which is based on Sequential Monte Carlo-Simulated Annealing, only relies on an indicator function that assesses whether or not the constraints are fulfilled, thus allowing us to enforce various complex constraints by specifying an appropriate indicator function without altering other parts of the algorithm. We demonstrate our algorithm by fitting rational function models subject to monotonicity and continuity constraints. The algorithm was implemented using R (R Core Team, 2018) and the code is freely available on GitHub.
more | pdf | html
Figures
None.
Tweets
StatsPapers: A flexible sequential Monte Carlo algorithm for shape-constrained regression. https://t.co/B2ZYUVdKLz
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 5846
Unqiue Words: 1872

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.