Top 10 Arxiv Papers Today in Statistics


2.155 Mikeys
#1. Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models
Vincent Le Guen, Nicolas Thome
This paper addresses the problem of time series forecasting for non-stationary signals and multiple future steps prediction. To handle this challenging task, we introduce the Shape and Time Distortion Loss (STDL), a new objective function dedicated to training deep neural networks. STDL aims at accurately predicting sudden changes, and explicitly incorporates two terms supporting precise shape and temporal change detection. We introduce a differentiable loss function suitable for training deep neural nets, and provide a custom back-prop implementation for speeding up optimization. We also introduce a variant of STDL, which provides a smooth generalization of temporally-constrained Dynamic Time Warping (DTW). Experiments carried out on various non-stationary datasets reveal the very good behaviour of STDL compared to models trained with the standard Mean Squared Error (MSE) loss function, and also to DTW and variants. STDL is also agnostic to the choice of the model, and we highlight its benefit for training fully connected...
Figures
None.
Tweets
BrundageBot: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models. Vincent Le Guen and Nicolas Thome https://t.co/OuStdjLCja
evolvingstuff: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models paper: https://t.co/jgRT3tC5Df code: https://t.co/NIC6XjXlyn https://t.co/dopIL5qYYu
StatsPapers: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models. https://t.co/qJqJfAKPDz
iamknighton: RT @evolvingstuff: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models paper: https://t.co/jgRT3tC5Df code: ht…
treasured_write: RT @evolvingstuff: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models paper: https://t.co/jgRT3tC5Df code: ht…
jeandut14000: RT @evolvingstuff: Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models paper: https://t.co/jgRT3tC5Df code: ht…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.103 Mikeys
#2. Explaining Visual Models by Causal Attribution
Álvaro Parafita, Jordi Vitrià
Model explanations based on pure observational data cannot compute the effects of features reliably, due to their inability to estimate how each factor alteration could affect the rest. We argue that explanations should be based on the causal model of the data and the derived intervened causal models, that represent the data distribution subject to interventions. With these models, we can compute counterfactuals, new samples that will inform us how the model reacts to feature changes on our input. We propose a novel explanation methodology based on Causal Counterfactuals and identify the limitations of current Image Generative Models in their application to counterfactual creation.
Figures
None.
Tweets
BrundageBot: Explaining Visual Models by Causal Attribution. Álvaro Parafita and Jordi Vitrià https://t.co/HjWaNXBqPe
alvaro_parafita: Our paper "Explaining Visual Models by Causal Attribution" got accepted for the #ICCV2019 Workshop on Interpreting and Explaining Visual Artificial Intelligence Models! @bitenmascarado https://t.co/PLDfReecKC
StatsPapers: Explaining Visual Models by Causal Attribution. https://t.co/XMSeSUyfWX
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.097 Mikeys
#3. Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks
Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, Shuichi Adachi
We propose Absum, which is a regularization method for improving adversarial robustness of convolutional neural networks (CNNs). Although CNNs can accurately recognize images, recent studies have shown that the convolution operations in CNNs commonly have structural sensitivity to specific noise composed of Fourier basis functions. By exploiting this sensitivity, they proposed a simple black-box adversarial attack: Single Fourier attack. To reduce structural sensitivity, we can use regularization of convolution filter weights since the sensitivity of linear transform can be assessed by the norm of the weights. However, standard regularization methods can prevent minimization of the loss function because they impose a tight constraint for obtaining high robustness. To solve this problem, Absum imposes a loose constraint; it penalizes the absolute values of the summation of the parameters in the convolution layers. Absum can improve robustness against single Fourier attack while being as simple and efficient as standard...
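As a rough illustration of the penalty described in the abstract above, the following minimal sketch contrasts an Absum-style term with a standard L1 penalty. The function names, the nested-list kernel representation, and the choice to sum weights per kernel are assumptions made for illustration; this is not the authors' implementation.

```python
def absum_penalty(kernels):
    """Absum-style penalty (assumed form): for each kernel, take the absolute
    value of the SUM of its weights, then add these up over all kernels."""
    return sum(abs(sum(w for row in kernel for w in row)) for kernel in kernels)


def l1_penalty(kernels):
    """Standard L1 penalty for comparison: the sum of the absolute values
    of every individual weight."""
    return sum(abs(w) for kernel in kernels for row in kernel for w in row)


# A hypothetical 2x2 kernel whose weights cancel out when summed.
checker = [[[1.0, -1.0], [-1.0, 1.0]]]
print(absum_penalty(checker))  # weights sum to zero, so this term vanishes
print(l1_penalty(checker))     # every weight is still penalized individually
```

The example shows why the constraint is looser: a kernel with large weights can still incur no Absum-style penalty as long as its weights sum to zero, whereas L1 penalizes each weight.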
Figures
None.
Tweets
BrundageBot: Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, and Shuichi Adachi https://t.co/jjVBYKbzLo
StatsPapers: Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. https://t.co/xW8tUg4g4s
arxiv_cs_cv_pr: Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, and Shuichi Adachi https://t.co/3DdW0YLCSt
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unique Words: 0

2.051 Mikeys
#4. Relaxed Softmax for learning from Positive and Unlabeled data
Ugo Tanielian, Flavian Vasile
In recent years, the softmax model and its fast approximations have become the de-facto loss functions for deep neural networks when dealing with multi-class prediction. This loss has been extended to language modeling and recommendation, two fields that fall into the framework of learning from Positive and Unlabeled data. In this paper, we stress the different drawbacks of the current family of softmax losses and sampling schemes when applied in a Positive and Unlabeled learning setup. We propose both a Relaxed Softmax loss (RS) and a new negative sampling scheme based on Boltzmann formulation. We show that the new training objective is better suited for the tasks of density estimation, item similarity and next-event prediction by driving uplifts in performance on textual and recommendation datasets against classical softmax.
Figures
None.
Tweets
arxiv_org: Relaxed Softmax for learning from Positive and Unlabeled data. https://t.co/hqt7lJwSG8 https://t.co/rBaR0bfd6q
BrundageBot: Relaxed Softmax for learning from Positive and Unlabeled data. Ugo Tanielian and Flavian Vasile https://t.co/54GDoHiLvB
arxivml: "Relaxed Softmax for learning from Positive and Unlabeled data", Ugo Tanielian, Flavian Vasile https://t.co/R5QJwKDj0R
arxiv_cs_LG: Relaxed Softmax for learning from Positive and Unlabeled data. Ugo Tanielian and Flavian Vasile https://t.co/PZdn5X5AKj
StatsPapers: Relaxed Softmax for learning from Positive and Unlabeled data. https://t.co/UhevoW5sOm
arxiv_cscl: Relaxed Softmax for learning from Positive and Unlabeled data https://t.co/itM4losNsT
treasured_write: RT @BrundageBot: Relaxed Softmax for learning from Positive and Unlabeled data. Ugo Tanielian and Flavian Vasile https://t.co/54GDoHiLvB
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.042 Mikeys
#5. Semantic and Cognitive Tools to Aid Statistical Inference: Replace Confidence and Significance by Compatibility and Surprise
Zad R. Chow, Sander Greenland
Researchers often misinterpret and misrepresent statistical outputs. This abuse has led to a large literature on modification or replacement of testing thresholds and P-values with confidence intervals, Bayes factors, and other devices. Because the core problems appear cognitive rather than statistical, we review some simple proposals to aid researchers in interpreting statistical outputs. These proposals emphasize logical and information concepts over probability, and thus may be more robust to common misinterpretations than are traditional descriptions. The latter treat statistics as referring to targeted hypotheses conditional on background assumptions. In contrast, we advise reinterpretation of P-values and interval estimates in unconditional terms, in which they describe compatibility of data with the entire set of analysis assumptions. We use the Shannon transform of the P-value $p$, also known as the surprisal or S-value $s=-\log(p)$, to provide a measure of the information supplied by the testing procedure against these...
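To make the S-value transform mentioned in the abstract concrete, here is a minimal sketch. The abstract does not state the base of the logarithm; base 2 (so the result is in bits) is assumed here, and the function name `s_value` is hypothetical.

```python
import math


def s_value(p: float) -> float:
    """Shannon transform of a P-value: s = -log(p).

    Base 2 is an assumption made for this sketch, so the result is in bits.
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must be in the interval (0, 1]")
    return -math.log2(p)


# Smaller P-values correspond to more bits of information against the model.
for p in (0.5, 0.25, 0.05):
    print(p, round(s_value(p), 2))
```

Under the base-2 assumption, halving the P-value adds exactly one bit, so p = 0.05 corresponds to a little over four bits.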
Figures
None.
Tweets
dailyzad: Also, S-values are one of the topics we do a deep dive on in the first of our recent pair of papers about improving statistical interpretations https://t.co/Cyv3u2knX7 https://t.co/C9tb8NnEjT
dailyzad: In paper 1, we discuss: - A comprehensive discussion of P-value issues and their reconciliation with S-values - Testing alternatives rather than just the null - Graphical functions/tables to present alternative results 2/7 https://t.co/Cyv3u2knX7 https://t.co/EBWhPPpVpG
dailyzad: THREAD Happy to say that two papers by @Lester_Domes and I, on how we all can improve statistical teaching, reviewing, and practice via cognitive/semantic tools are up on arXiv 1/7 1: https://t.co/Cyv3u2knX7 2: https://t.co/la8HpJXmMr #statstwitter #epitwitter #datascience https://t.co/ghtZCXITZm
rtorkar: Very nice paper by Zad R. Chow and @Lester_Domes where they discuss the S-value, i.e., S=-log(p). I will introduce this in a course since I believe it conceptually makes the whole p-value thingy easier to understand! https://t.co/fZzXihrxs1 Quotes: 1/3
Mr1Paleo: #RT @PaleoFoundation: RT @dailyzad: THREAD Happy to say that two papers by @Lester_Domes and I, on how we all can improve statistical teaching, reviewing, and practice via cognitive/semantic tools are up on arXiv 1/7 1: https://t.co/LNd26oVyfx 2: … https://t.co/BoT9uaNb1L
BioPapers: Semantic and Cognitive Tools to Aid Statistical Inference: Replace Confidence and Significance by Compatibility and Surprise. https://t.co/T6zGsmOSaV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 8692
Unique Words: 3076

2.031 Mikeys
#6. Bayesian Analysis of Multidimensional Functional Data
John Shamshoian, Damla Senturk, Shafali Jeste, Donatello Telesca
Multi-dimensional functional data arises in numerous modern scientific experimental and observational studies. In this paper we focus on longitudinal functional data, a structured form of multidimensional functional data. Operating within a longitudinal functional framework we aim to capture low dimensional interpretable features. We propose a computationally efficient nonparametric Bayesian method to simultaneously smooth observed data, estimate conditional functional means and functional covariance surfaces. Statistical inference is based on Monte Carlo samples from the posterior measure through adaptive blocked Gibbs sampling. Several operative characteristics associated with the proposed modeling framework are assessed comparatively in a simulated environment. We illustrate the application of our work in two case studies. The first case study involves age-specific fertility collected over time for various countries. The second case study is an implicit learning experiment in children with Autism Spectrum Disorder (ASD).
Figures
None.
Tweets
StatsPapers: Bayesian Analysis of Multidimensional Functional Data. https://t.co/VVRfsekO2H
389jan: RT @StatsPapers: Bayesian Analysis of Multidimensional Functional Data. https://t.co/VVRfsekO2H
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

2.031 Mikeys
#7. Properties of Laplacian Pyramids for Extension and Denoising
William Leeb
We analyze the Laplacian pyramids algorithm of Rabin and Coifman for extending and denoising a function sampled on a discrete set of points. We provide mild conditions under which the algorithm converges, and prove stability bounds on the extended function. We also consider the iterative application of truncated Laplacian pyramids kernels for denoising signals by non-local means.
Figures
None.
Tweets
arxiv_org: Properties of Laplacian Pyramids for Extension and Denoising. https://t.co/WShqSkMQkZ https://t.co/9QQCAQEEdr
arxivml: "Properties of Laplacian Pyramids for Extension and Denoising", William Leeb https://t.co/cnLERscw6J
arxiv_cs_LG: Properties of Laplacian Pyramids for Extension and Denoising. William Leeb https://t.co/Xhfj1jVhM4
StatsPapers: Properties of Laplacian Pyramids for Extension and Denoising. https://t.co/KPW0E94YmQ
udmrzn: RT @arxiv_org: Properties of Laplacian Pyramids for Extension and Denoising. https://t.co/WShqSkMQkZ https://t.co/9QQCAQEEdr
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unique Words: 0

2.019 Mikeys
#8. Robust statistical modeling of monthly rainfall: The minimum density power divergence approach
Arnab Hazra, Abhik Ghosh
Statistical modeling of rainfall is an important challenge in meteorology, particularly from the perspective of rainfed agriculture where a proper assessment of the future availability of rainwater is necessary. The probability models mostly used for this purpose are exponential, gamma, Weibull and lognormal distributions, where the unknown model parameters are routinely estimated using the maximum likelihood estimator (MLE). However, presence of outliers or extreme observations is quite common in rainfall data and the MLEs being highly sensitive to them often leads to spurious inference. In this paper, we discuss a robust parameter estimation approach based on the minimum density power divergence estimators (MDPDEs) which provides a class of estimates through a tuning parameter including the MLE as a special case. The underlying tuning parameter controls the trade-offs between efficiency and robustness of the resulting inference; we also discuss a procedure for data-driven optimal selection of this tuning parameter as well as...
Figures
None.
Tweets
arxiv_org: Robust statistical modeling of monthly rainfall: The minimum density power divergence app... https://t.co/BZtWdMpM97 https://t.co/33hyT4zk2t
StatsPapers: Robust statistical modeling of monthly rainfall: The minimum density power divergence approach. https://t.co/doVnZhgcAd
Rosenchild: RT @arxiv_org: Robust statistical modeling of monthly rainfall: The minimum density power divergence app... https://t.co/BZtWdMpM97 https:/…
subhobrata1: RT @arxiv_org: Robust statistical modeling of monthly rainfall: The minimum density power divergence app... https://t.co/BZtWdMpM97 https:/…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.013 Mikeys
#9. Stacking Models for Nearly Optimal Link Prediction in Complex Networks
Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, Aaron Clauset
Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speed up the collection of network data and improve the validity of network models. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability varies across methods and networks from different domains, and how close to optimality current methods are. We answer these questions by systematically evaluating 203 individual link predictor algorithms, representing three popular families of methods, applied to a large corpus of 548 structurally diverse networks from six scientific domains. We first show that individual algorithms exhibit a broad diversity of prediction errors, such that no one predictor or family is best, or worst, across all realistic inputs. We then exploit this diversity via meta-learning to construct a series of "stacked" models that combine predictors into a single...
Figures
Tweets
alexvespi: Stacking Models for Nearly Optimal Link Prediction in Complex Networks “Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy” https://t.co/hLCtygSOFx https://t.co/VOhwtXIO4F
aaronclauset: Excited to share a new preprint "Stacking models for nearly optimal link prediction in complex networks," led by @Amir_Ghasemian and @HomaHosseinmar1, with @aram_galstyan and @eairoldi: https://t.co/iCxftB6xjF Here’s a little summary: 1/7 https://t.co/k0VVdsV4ce
net_science: Stacking Models for Nearly Optimal Link Prediction in Complex Networks. (arXiv:1909.07578v1 [https://t.co/E3LUKJpMju]) https://t.co/T1CFh8xujP
BrundageBot: Stacking Models for Nearly Optimal Link Prediction in Complex Networks. Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset https://t.co/tFGrLqHQwv
arxiv_cs_LG: Stacking Models for Nearly Optimal Link Prediction in Complex Networks. Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset https://t.co/WOKu8OXKCf
Github

This page is a companion for our paper on optimal link prediction, written by Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset. (arXiv:1909.07578)

Repository: OptimalLinkPrediction
User: Aghasemian
Language: Python
Stargazers: 1
Subscribers: 2
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 19905
Unique Words: 3619

2.011 Mikeys
#10. On Efficient Multilevel Clustering via Wasserstein Distances
Viet Huynh, Nhat Ho, Nhan Dam, XuanLong Nguyen, Mikhail Yurochkin, Hung Bui, and Dinh Phung
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grouping patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose several variants of this problem, which admit fast optimization algorithms, by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, the experimental results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.
Figures
Tweets
StatsPapers: On Efficient Multilevel Clustering via Wasserstein Distances. https://t.co/F4vZehw8CR
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 13997
Unique Words: 2823

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 192,929 papers.
