Top 10 Arxiv Papers Today in Applications


2.008 Mikeys
#1. A new centered spatio-temporal autologistic regression model. Application to spatio-temporal analysis of esca disease in a vineyard
Anne Gégout-Petit, Lucia Guérin-Dubrana, Shuxian Li
We propose a new centered autologistic spatio-temporal model for binary data on a lattice. The centering allows the interpretation of the autoregression coefficients by separating the large-scale structure of the model, corresponding to an expected mean, from the small-scale structure, corresponding to the auto-correlation. We discuss the existence of the joint law of the process and show by simulation the interest of this kind of centering. We propose and show the efficiency of the maximum pseudo-likelihood estimator and also a method to choose the best neighborhood structure. The method is applied to model and fit epidemiological data about Esca disease in a vineyard of the Bordeaux region.
Figures
None.
Tweets
StatsPapers: A new centered spatio-temporal autologistic regression model. Application to spatio-temporal analysis of esca disease in a vineyard. https://t.co/dnqdGyZr2i
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 10114
Unique Words: 2272

2.008 Mikeys
#2. Estimation from Quantized Gaussian Measurements: When and How to Use Dither
Joshua Rapp, Robin M. A. Dawson, Vivek K Goyal
Subtractive dither is a powerful method for removing the signal dependence of quantization noise for coarsely-quantized signals. However, estimation from dithered measurements often naively applies the sample mean or midrange, even when the total noise is not well described with a Gaussian or uniform distribution. We show that the generalized Gaussian distribution approximately describes subtractively-dithered, quantized samples of a Gaussian distribution. Furthermore, a generalized Gaussian fit leads to simple estimators based on order statistics that match the performance of more complicated maximum likelihood estimators requiring iterative solvers. The order statistics-based estimators outperform both the sample mean and midrange for nontrivial sums of Gaussian and uniform noise. Additional analysis of the generalized Gaussian approximation yields rules of thumb for determining when and how to apply dither to quantized measurements.
Figures
None.
Tweets
StatsPapers: Estimation from Quantized Gaussian Measurements: When and How to Use Dither. https://t.co/BCdGhWuLax
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 8922
Unique Words: 2255
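
The abstract of #2 describes subtractive dither: a uniform dither is added before quantization and subtracted afterwards, which removes the signal dependence of the quantization error. The following is a minimal sketch of that idea using only the two baseline estimators the abstract names (sample mean and midrange). It is not the authors' order-statistics estimator; the function names and the values of MU, SIGMA, STEP and N are assumptions chosen for illustration.

```python
import random

def quantize(x, step):
    """Uniform quantizer: round x to the nearest multiple of step."""
    return step * round(x / step)

def measure(mu, sigma, step, n, dither, seed=0):
    """Return (signal, measurement) pairs for n Gaussian samples.

    With dither=True a uniform dither d is added before quantization and
    subtracted afterwards (subtractive dither).
    """
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        x = rng.gauss(mu, sigma)
        d = rng.uniform(-step / 2, step / 2) if dither else 0.0
        pairs.append((x, quantize(x + d, step) - d))
    return pairs

def sample_mean(values):
    return sum(values) / len(values)

def midrange(values):
    return (min(values) + max(values)) / 2

# Illustrative values only (assumed, not from the paper): coarse quantization,
# where the step is much larger than the signal's standard deviation.
MU, SIGMA, STEP, N = 0.2, 0.05, 1.0, 20000

plain = [m for _, m in measure(MU, SIGMA, STEP, N, dither=False)]
dithered_pairs = measure(MU, SIGMA, STEP, N, dither=True)
dithered = [m for _, m in dithered_pairs]

print("undithered mean:  ", sample_mean(plain))
print("dithered mean:    ", sample_mean(dithered))
print("dithered midrange:", midrange(dithered))
```

In this setting nearly every undithered sample quantizes to the same level, so the undithered mean cannot recover MU, while the dithered measurements behave like the signal plus independent uniform noise of width STEP.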

2.0 Mikeys
#3. Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions
Fredrik D. Johansson
We study heterogeneity in the effect of a mindset intervention on student-level performance through an observational dataset from the National Study of Learning Mindsets (NSLM). Our analysis uses machine learning (ML) to address the following associated problems: assessing treatment group overlap and covariate balance, imputing conditional average treatment effects, and interpreting imputed effects. By comparing several different model families we illustrate the flexibility of both off-the-shelf and purpose-built estimators. We find that the mindset intervention has a positive average effect of 0.26, 95%-CI [0.22, 0.30], and that heterogeneity in the range of [0.1, 0.4] is moderated by school-level achievement level, poverty concentration, urbanicity, and student prior expectations.
Figures
None.
Tweets
arxiv_org: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/tj6sDRUQta https://t.co/r29taskwod
arxivml: "Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions", Fredrik D. Johansson https://t.co/5qAiD7mLDB
nmfeeds: [O] https://t.co/GQWPohQn61 Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. We ...
StatsPapers: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/C9866hhCc0
PerthMLGroup: RT @arxiv_org: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/tj6sDRUQta https://t…
tomhardyofmaths: RT @arxiv_org: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/tj6sDRUQta https://t…
AssistedEvolve: RT @arxiv_org: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/tj6sDRUQta https://t…
danielk_oxsci: RT @arxiv_org: Machine Learning Analysis of Heterogeneity in the Effect of Student Mindset Interventions. https://t.co/tj6sDRUQta https://t…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 4534
Unique Words: 1558

0.0 Mikeys
#4. An external validation of Thais' cardiovascular 10-year risk assessment in the southern Thailand
Suthara Aramcharoen, Ponlapat Satian, Ponlachart Chotikarn, Sipat Triukose
Cardiovascular diseases (CVDs) are the number one cause of death globally. The WHO estimated that CVDs caused 17.9 million deaths (31% of all global deaths) in 2016. Surprising as it may seem, CVDs can be easily prevented by altering lifestyle to avoid risk factors. The only requirement is to know one's risk beforehand. The Thai CV Risk score is a trustworthy tool to forecast the risk of a future cardiovascular event for Thais. This study is an external validation of the Thai CV Risk score. We aim to answer two key questions. First, is the Thai CV Risk score, developed using a dataset of people from the central and north-western parts of Thailand, applicable to people from other parts of the country? Second, does the Thai CV Risk score, developed for the general public, work for hospital patients, who tend to have higher risk? We answer these two questions using a dataset of 1,025 patients (319 males, 35-70 years old) from Lansaka Hospital in southern Thailand. In brief, we find that the Thai CV Risk score works for southern Thais...
Figures
Tweets
StatsPapers: An external validation of Thais' cardiovascular 10-year risk assessment in the southern Thailand. https://t.co/RhQH7Hwyox
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 1531
Unique Words: 645

0.0 Mikeys
#5. A pliable lasso for the Cox model
Wenfei Du, Rob Tibshirani
We introduce a pliable lasso method for estimation of interaction effects in the Cox proportional hazards model framework. The pliable lasso is a linear model that includes interactions between covariates X and a set of modifying variables Z and assumes sparsity of the main effects and interaction effects. The hierarchical penalty excludes interaction effects when the corresponding main effects are zero: this avoids overfitting and an explosion of model complexity. We extend this method to the Cox model for survival data, incorporating modifiers that are either fixed or varying in time into the partial likelihood. For example, this allows modeling of survival times that differ based on interactions of genes with age, gender, or other demographic information. The optimization is done by blockwise coordinate descent on a second order approximation of the objective.
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 5845
Unique Words: 1331

0.0 Mikeys
#6. More investment in Research and Development for better Education in the future?
Rim Lahmandi-Ayed, Dhafer Malouche
The question in this paper is whether R&D efforts affect education performance in small classes. Merging two datasets collected from the PISA studies and the World Development Indicators and using Learning Bayesian Networks, we prove the existence of a statistical causal relationship between investment in R&D of a country and its education performance (PISA scores). We also prove that the effect of R&D on Education is long term as a country has to invest at least 10 years before beginning to improve the level of young pupils.
Figures
None.
Tweets
StatsPapers: More investment in Research and Development for better Education in the future?. https://t.co/ZXeQKnf87c
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 10182
Unique Words: 2954

0.0 Mikeys
#7. Bias Correction For Paid Search In Media Mix Modeling
Aiyou Chen, David Chan, Mike Perry, Yuxue Jin, Yunting Sun, Yueqing Wang, Jim Koehler
Evaluating the return on ad spend (ROAS), the causal effect of advertising on sales, is critical to advertisers for understanding the performance of their existing marketing strategy as well as how to improve and optimize it. Media Mix Modeling (MMM) has been used as a convenient analytical tool to address the problem using observational data. However, it is well recognized that MMM suffers from various fundamental challenges: data collection, model specification and selection bias due to ad targeting, among others [chan2017, wolfe2016]. In this paper, we study the challenge associated with measuring the impact of search ads in MMM, namely the selection bias due to ad targeting. Using causal diagrams of the search ad environment, we derive a statistically principled method for bias correction based on the back-door criterion [pearl2013causality]. We use case studies to show that the method provides promising results by comparison with results from randomized experiments. We also report a more complex case...
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 11025
Unique Words: 2676

0.0 Mikeys
#8. Judging the Judges: Evaluating the Performance of International Gymnastics Judges
Hugues Mercier, Sandro Heiniger
Judging a gymnastics routine is a noisy process, and the performance of judges varies widely. In collaboration with the Fédération Internationale de Gymnastique (FIG) and Longines, we are designing and implementing an improved statistical engine to analyze the performance of gymnastics judges during and after major competitions like the Olympic Games and the World Championships. The engine, called the Judge Evaluation Program (JEP), has three objectives: (1) provide constructive feedback to judges, executive committees and national federations; (2) assign the best judges to the most important competitions; (3) detect bias and outright cheating. Using data from international gymnastics competitions held during the 2013–2016 Olympic cycle, we first develop a marking score evaluating the accuracy of the marks given by gymnastics judges. Judging a gymnastics routine is a random process, and we can model this process very accurately using heteroscedastic random variables. The marking score scales the difference between the mark...
Figures
None.
Tweets
arxiv_org: Judging the Judges: Evaluating the Performance of International Gymnastics Judges. https://t.co/H57T3gm8ad https://t.co/RAHGsX4Y0w
StatsPapers: Judging the Judges: Evaluating the Performance of International Gymnastics Judges. https://t.co/YzPKnwgVg7
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 9117
Unique Words: 2472

0.0 Mikeys
#9. A Nonparametric Bayesian Model for Synthesising Residential Solar Generation and Demand
Thomas Power, Gregor Verbič, Archie C. Chapman
Increasing installations of distributed electricity generation have vastly increased the need for stochastic generation and demand data. However, the effects of such installations are uncertain, as high-quality data is not always available before an installation is completed. In particular, there is a need for stochastic models of demand and generation profiles for unobserved prosumers. The model formulated in this paper bridges the gap between the limited available empirical data and the large amount of high-quality, stochastic demand and generation data required for network and system analysis. The approach employs clustering analysis and a Dirichlet-categorical hierarchical model of the features of unobserved prosumers. Based on the data of clusters of prosumers, Markov chain models of demand and generation profiles are constructed from empirical data, and synthetic demand profiles are subsequently sampled from these. The sampled traces are cross-validated and show a good statistical fit to the observed data, and then two case...
Figures
Tweets
arxiv_org: A Nonparametric Bayesian Model for Synthesising Residential Solar Generation and Demand. https://t.co/Xi5xSKZviw https://t.co/Apj8rCeE6U
gaialive: RT @arxiv_org: A Nonparametric Bayesian Model for Synthesising Residential Solar Generation and Demand. https://t.co/Xi5xSKZviw https://t.c…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 7750
Unique Words: 2292
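
The abstract of #9 mentions building Markov chain models of profiles from empirical data and then sampling synthetic profiles from them. As a rough sketch of that one step only (the clustering and the Dirichlet-categorical model are not shown), the code below fits first-order transition probabilities to a discretized trace and samples a new trace. The state labels, the toy `observed` sequence, and both function names are hypothetical.

```python
import random
from collections import defaultdict

def fit_transition_probs(trace):
    """Estimate first-order Markov transition probabilities from a state sequence."""
    counts = defaultdict(lambda: defaultdict(int))
    for current, following in zip(trace, trace[1:]):
        counts[current][following] += 1
    probs = {}
    for current, successors in counts.items():
        total = sum(successors.values())
        probs[current] = {s: c / total for s, c in successors.items()}
    return probs

def sample_trace(probs, start, length, seed=0):
    """Sample a synthetic state sequence of at most `length` steps."""
    rng = random.Random(seed)
    state = start
    trace = [state]
    for _ in range(length - 1):
        successors = probs.get(state)
        if not successors:
            # State was never observed with a successor; stop early.
            break
        states = list(successors)
        weights = [successors[s] for s in states]
        state = rng.choices(states, weights=weights, k=1)[0]
        trace.append(state)
    return trace

# Toy discretized demand levels (made up for illustration).
observed = ["low", "low", "mid", "high", "mid", "low", "low",
            "mid", "mid", "high", "high", "mid", "low"]

probs = fit_transition_probs(observed)
synthetic = sample_trace(probs, start="low", length=24)
print(probs["low"])
print(synthetic)
```

Every transition in the sampled trace is one that occurred in the observed trace, so the synthetic profile reproduces the empirical one-step dynamics; a real profile model would need finer state discretization and time-of-day dependence.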

0.0 Mikeys
#10. Optimal Design in Hierarchical Models with application in Multi-center Trials
Maryna Prus, Norbert Benda, Rainer Schwabe
Hierarchical random effect models are used for different purposes in clinical research and other areas. In general, the main focus is on population parameters related to the expected treatment effects or group differences among all units of an upper level (e.g. subjects in many settings). Optimal designs for estimation of population parameters are well established for many models. However, optimal designs for the prediction for the individual units may be different. Several settings are identified in which individual prediction may be of interest. In this paper we determine optimal designs for the individual predictions, e.g. in multi-center trials, and compare them to a conventional balanced design with respect to treatment allocation. Our investigations show that balanced designs are far from optimal if the treatment effects vary strongly as compared to the residual error and more subjects should be recruited to the active (new) treatment in multi-center trials. Nevertheless, efficiency loss may be limited resulting in a moderate...
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 3448
Unique Words: 901

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

Tracking 58,338 papers.
