This paper addresses the problem of forecasting non-stationary time series
several steps into the future. To handle this
challenging task, we introduce the Shape and Time Distortion Loss (STDL), a new
objective function dedicated to training deep neural networks. STDL aims at
accurately predicting sudden changes, and explicitly incorporates two terms
supporting precise shape and temporal change detection. We introduce a
differentiable loss function suitable for training deep neural nets, and
provide a custom back-prop implementation for speeding up optimization. We also
introduce a variant of STDL, which provides a smooth generalization of
temporally-constrained Dynamic Time Warping (DTW). Experiments carried out on
various non-stationary datasets reveal the very good behaviour of STDL compared
to models trained with the standard Mean Squared Error (MSE) loss function, and
also to DTW and variants. STDL is also agnostic to the choice of the model, and
we highlight its benefit for training fully connected...
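
To make the DTW baseline mentioned in the abstract concrete, here is a minimal sketch of classic, unconstrained DTW with a squared pointwise cost. It is not the STDL loss and not the authors' code; the names `dtw_distance` and `mse` and the toy series are illustrative assumptions.

```python
def dtw_distance(a, b):
    """Classic DTW with squared pointwise cost, via dynamic programming."""
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = (a[i - 1] - b[j - 1]) ** 2
            # extend the cheapest of the three admissible predecessor alignments
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def mse(a, b):
    """Mean squared error between two equal-length series."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


# Toy example (assumed data): the same spike, predicted one step too early.
target = [0.0, 0.0, 1.0, 0.0]
shifted = [0.0, 1.0, 0.0, 0.0]
print(dtw_distance(target, shifted))  # 0.0: warping absorbs the time shift
print(mse(target, shifted))           # 0.5: MSE penalizes the shift
```

In this toy case plain DTW scores a mistimed spike as a perfect match while MSE does not, which gives some intuition for why a loss would want separate shape and temporal terms.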

BrundageBot:
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models. Vincent Le Guen and Nicolas Thome https://t.co/OuStdjLCja

evolvingstuff:
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models
paper: https://t.co/jgRT3tC5Df
code: https://t.co/NIC6XjXlyn https://t.co/dopIL5qYYu

StatsPapers:
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models. https://t.co/qJqJfAKPDz

Sample Sizes : None.

Authors: 2

Total Words: 0

Unique Words: 0

Model explanations based on pure observational data cannot compute the
effects of features reliably, due to their inability to estimate how each
factor alteration could affect the rest. We argue that explanations should be
based on the causal model of the data and the derived intervened causal models,
that represent the data distribution subject to interventions. With these
models, we can compute counterfactuals, new samples that will inform us how the
model reacts to feature changes on our input. We propose a novel explanation
methodology based on Causal Counterfactuals and identify the limitations of
current Image Generative Models in their application to counterfactual
creation.

BrundageBot:
Explaining Visual Models by Causal Attribution. Álvaro Parafita and Jordi Vitrià https://t.co/HjWaNXBqPe

alvaro_parafita:
Our paper "Explaining Visual Models by Causal Attribution" got accepted for the #ICCV2019 Workshop on Interpreting and Explaining Visual Artificial Intelligence Models! @bitenmascarado https://t.co/PLDfReecKC

StatsPapers:
Explaining Visual Models by Causal Attribution. https://t.co/XMSeSUyfWX

Sample Sizes : None.

Authors: 2

Total Words: 0

Unique Words: 0

We propose Absum, which is a regularization method for improving adversarial
robustness of convolutional neural networks (CNNs). Although CNNs can
accurately recognize images, recent studies have shown that the convolution
operations in CNNs commonly have structural sensitivity to specific noise
composed of Fourier basis functions. By exploiting this sensitivity, they
proposed a simple black-box adversarial attack: Single Fourier attack. To
reduce structural sensitivity, we can use regularization of convolution filter
weights since the sensitivity of linear transform can be assessed by the norm
of the weights. However, standard regularization methods can prevent
minimization of the loss function because they impose a tight constraint for
obtaining high robustness. To solve this problem, Absum imposes a loose
constraint; it penalizes the absolute values of the summation of the parameters
in the convolution layers. Absum can improve robustness against single Fourier
attack while being as simple and efficient as standard...
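
As a rough illustration of the "loose constraint" described in the abstract, the sketch below compares a penalty on the absolute value of each filter's weight sum with an ordinary L1 penalty. Grouping the sum per 2-D kernel, the `strength` parameter, and the function names are assumptions made for this example, not details confirmed by the abstract.

```python
def absum_penalty(filters, strength=1.0):
    """Sum over filters of |sum of that filter's weights|, scaled by `strength`.

    Assumption: the summation is taken per 2-D kernel.
    """
    total = 0.0
    for kernel in filters:
        weight_sum = sum(w for row in kernel for w in row)
        total += abs(weight_sum)
    return strength * total


def l1_penalty(filters, strength=1.0):
    """Standard L1 penalty: sum of absolute values of all weights."""
    return strength * sum(abs(w) for kernel in filters for row in kernel for w in row)


# Hypothetical 2x2 kernels used only for illustration.
edge_filter = [[1.0, -1.0], [-1.0, 1.0]]      # weights cancel out
blur_filter = [[0.25, 0.25], [0.25, 0.25]]    # weights add up

print(absum_penalty([edge_filter]), l1_penalty([edge_filter]))  # 0.0 4.0
print(absum_penalty([blur_filter]), l1_penalty([blur_filter]))  # 1.0 1.0
```

The edge-like kernel keeps large individual weights yet incurs no sum-based penalty, whereas L1 charges for every weight; that is one way to read "loose" here.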

BrundageBot:
Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, and Shuichi Adachi https://t.co/jjVBYKbzLo

StatsPapers:
Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. https://t.co/xW8tUg4g4s

arxiv_cs_cv_pr:
Absum: Simple Regularization Method for Reducing Structural Sensitivity of Convolutional Neural Networks. Sekitoshi Kanai, Yasutoshi Ida, Yasuhiro Fujiwara, Masanori Yamada, and Shuichi Adachi https://t.co/3DdW0YLCSt

Sample Sizes : None.

Authors: 5

Total Words: 0

Unique Words: 0

In recent years, the softmax model and its fast approximations have become
the de-facto loss functions for deep neural networks when dealing with
multi-class prediction. This loss has been extended to language modeling and
recommendation, two fields that fall into the framework of learning from
Positive and Unlabeled data. In this paper, we stress the different drawbacks
of the current family of softmax losses and sampling schemes when applied in a
Positive and Unlabeled learning setup. We propose both a Relaxed Softmax loss
(RS) and a new negative sampling scheme based on Boltzmann formulation. We show
that the new training objective is better suited for the tasks of density
estimation, item similarity and next-event prediction by driving uplifts in
performance on textual and recommendation datasets against classical softmax.

arxiv_org:
Relaxed Softmax for learning from Positive and Unlabeled data. https://t.co/hqt7lJwSG8 https://t.co/rBaR0bfd6q

BrundageBot:
Relaxed Softmax for learning from Positive and Unlabeled data. Ugo Tanielian and Flavian Vasile https://t.co/54GDoHiLvB

arxivml:
"Relaxed Softmax for learning from Positive and Unlabeled data",
Ugo Tanielian, Flavian Vasile
https://t.co/R5QJwKDj0R

arxiv_cs_LG:
Relaxed Softmax for learning from Positive and Unlabeled data. Ugo Tanielian and Flavian Vasile https://t.co/PZdn5X5AKj

StatsPapers:
Relaxed Softmax for learning from Positive and Unlabeled data. https://t.co/UhevoW5sOm

arxiv_cscl:
Relaxed Softmax for learning from Positive and Unlabeled data https://t.co/itM4losNsT

Sample Sizes : None.

Authors: 2

Total Words: 0

Unique Words: 0

Researchers often misinterpret and misrepresent statistical outputs. This
abuse has led to a large literature on modification or replacement of testing
thresholds and P-values with confidence intervals, Bayes factors, and other
devices. Because the core problems appear cognitive rather than statistical, we
review some simple proposals to aid researchers in interpreting statistical
outputs. These proposals emphasize logical and information concepts over
probability, and thus may be more robust to common misinterpretations than are
traditional descriptions. The latter treat statistics as referring to targeted
hypotheses conditional on background assumptions. In contrast, we advise
reinterpretation of P-values and interval estimates in unconditional terms, in
which they describe compatibility of data with the entire set of analysis
assumptions. We use the Shannon transform of the P-value $p$, also known as the
surprisal or S-value $s=-\log(p)$, to provide a measure of the information
supplied by the testing procedure against these...
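
The surprisal transform in the abstract is easy to compute. The sketch below assumes base-2 logarithms, so that the result is measured in bits; the abstract writes the logarithm without a base, so treat the `base` default and the function name `s_value` as assumptions.

```python
import math


def s_value(p, base=2.0):
    """Surprisal (S-value) of a P-value: s = -log_base(p).

    Assumption: base 2, giving the information in bits.
    """
    if not 0.0 < p <= 1.0:
        raise ValueError("p must lie in (0, 1]")
    return -math.log(p) / math.log(base)


print(round(s_value(0.05), 2))  # 4.32
print(round(s_value(0.5), 2))   # 1.0
```

Under the base-2 assumption, p = 0.05 carries about 4.3 bits of information against the tested model, roughly as surprising as seeing four heads in a row from a coin assumed fair.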

dailyzad:
Also, S-values are one of the topics we do a deep dive on in the first of our recent pair of papers about improving statistical interpretations
https://t.co/Cyv3u2knX7 https://t.co/C9tb8NnEjT

dailyzad:
In paper 1, we discuss:
- A comprehensive discussion of P-value issues and their reconciliation with S-values
- Testing alternatives rather than just the null
- Graphical functions/tables to present alternative results 2/7
https://t.co/Cyv3u2knX7 https://t.co/EBWhPPpVpG

dailyzad:
THREAD
Happy to say that two papers by @Lester_Domes and I, on how we all can improve statistical teaching, reviewing, and practice via cognitive/semantic tools are up on arXiv 1/7
1: https://t.co/Cyv3u2knX7
2: https://t.co/la8HpJXmMr
#statstwitter #epitwitter #datascience https://t.co/ghtZCXITZm

rtorkar:
Very nice paper by Zad R. Chow and @Lester_Domes where they discuss the S-value, i.e., S=-log(p). I will introduce this in a course since I believe it conceptually makes the whole p-value thingy easier to understand! https://t.co/fZzXihrxs1 Quotes: 1/3

BioPapers:
Semantic and Cognitive Tools to Aid Statistical Inference: Replace Confidence and Significance by Compatibility and Surprise. https://t.co/T6zGsmOSaV

Sample Sizes : None.

Authors: 2

Total Words: 8692

Unique Words: 3076

Multi-dimensional functional data arises in numerous modern scientific
experimental and observational studies. In this paper we focus on longitudinal
functional data, a structured form of multidimensional functional data.
Operating within a longitudinal functional framework we aim to capture low
dimensional interpretable features. We propose a computationally efficient
nonparametric Bayesian method to simultaneously smooth observed data, estimate
conditional functional means and functional covariance surfaces. Statistical
inference is based on Monte Carlo samples from the posterior measure through
adaptive blocked Gibbs sampling. Several operative characteristics associated
with the proposed modeling framework are assessed comparatively in a simulated
environment. We illustrate the application of our work in two case studies. The
first case study involves age-specific fertility collected over time for
various countries. The second case study is an implicit learning experiment in
children with Autism Spectrum Disorder (ASD).

StatsPapers:
Bayesian Analysis of Multidimensional Functional Data. https://t.co/VVRfsekO2H

Sample Sizes : None.

Authors: 4

Total Words: 0

Unique Words: 0

We analyze the Laplacian pyramids algorithm of Rabin and Coifman for
extending and denoising a function sampled on a discrete set of points. We
provide mild conditions under which the algorithm converges, and prove
stability bounds on the extended function. We also consider the iterative
application of truncated Laplacian pyramids kernels for denoising signals by
non-local means.

arxiv_org:
Properties of Laplacian Pyramids for Extension and Denoising. https://t.co/WShqSkMQkZ https://t.co/9QQCAQEEdr

arxivml:
"Properties of Laplacian Pyramids for Extension and Denoising",
William Leeb
https://t.co/cnLERscw6J

arxiv_cs_LG:
Properties of Laplacian Pyramids for Extension and Denoising. William Leeb https://t.co/Xhfj1jVhM4

StatsPapers:
Properties of Laplacian Pyramids for Extension and Denoising. https://t.co/KPW0E94YmQ

Sample Sizes : None.

Authors: 1

Total Words: 0

Unique Words: 0

Statistical modeling of rainfall is an important challenge in meteorology,
particularly from the perspective of rainfed agriculture where a proper
assessment of the future availability of rainwater is necessary. The
probability models mostly used for this purpose are exponential, gamma, Weibull
and lognormal distributions, where the unknown model parameters are routinely
estimated using the maximum likelihood estimator (MLE). However, outliers or
extreme observations are quite common in rainfall data, and the MLE, being
highly sensitive to them, often leads to spurious inference. In this
paper, we discuss a robust parameter estimation approach based on the minimum
density power divergence estimators (MDPDEs) which provides a class of
estimates through a tuning parameter including the MLE as a special case. The
underlying tuning parameter controls the trade-offs between efficiency and
robustness of the resulting inference; we also discuss a procedure for
data-driven optimal selection of this tuning parameter as well as...
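
For orientation, the density power divergence objective in its usual textbook form is sketched below. The notation ($f_\theta$ for the model density, $\alpha$ for the tuning parameter, $x_1,\dots,x_n$ for the observations) is assumed here and is not taken from the abstract.

```latex
H_{n,\alpha}(\theta)
  = \int f_\theta^{\,1+\alpha}(x)\,dx
  - \left(1 + \frac{1}{\alpha}\right) \frac{1}{n} \sum_{i=1}^{n} f_\theta^{\,\alpha}(x_i),
  \qquad \alpha > 0 .
```

The MDPDE is the value of $\theta$ minimizing $H_{n,\alpha}(\theta)$. As $\alpha \to 0$ the estimator approaches the MLE, while larger $\alpha$ downweights observations with low model density, trading efficiency for robustness.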

arxiv_org:
Robust statistical modeling of monthly rainfall: The minimum density power divergence app... https://t.co/BZtWdMpM97 https://t.co/33hyT4zk2t

StatsPapers:
Robust statistical modeling of monthly rainfall: The minimum density power divergence approach. https://t.co/doVnZhgcAd

Sample Sizes : None.

Authors: 2

Total Words: 0

Unique Words: 0

Most real-world networks are incompletely observed. Algorithms that can
accurately predict which links are missing can dramatically speed up the
collection of network data and improve the validity of network models. Many
algorithms now exist for predicting missing links, given a partially observed
network, but it has remained unknown whether a single best predictor exists,
how link predictability varies across methods and networks from different
domains, and how close to optimality current methods are. We answer these
questions by systematically evaluating 203 individual link predictor
algorithms, representing three popular families of methods, applied to a large
corpus of 548 structurally diverse networks from six scientific domains. We
first show that individual algorithms exhibit a broad diversity of prediction
errors, such that no one predictor or family is best, or worst, across all
realistic inputs. We then exploit this diversity via meta-learning to construct
a series of "stacked" models that combine predictors into a single...

alexvespi:
Stacking Models for Nearly Optimal Link Prediction in Complex Networks
“Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy”
https://t.co/hLCtygSOFx https://t.co/VOhwtXIO4F

aaronclauset:
Excited to share a new preprint "Stacking models for nearly optimal link prediction in complex networks," led by @Amir_Ghasemian and @HomaHosseinmar1, with @aram_galstyan and @eairoldi: https://t.co/iCxftB6xjF Here’s a little summary: 1/7 https://t.co/k0VVdsV4ce

net_science:
Stacking Models for Nearly Optimal Link Prediction in Complex Networks. (arXiv:1909.07578v1 [https://t.co/E3LUKJpMju]) https://t.co/T1CFh8xujP

BrundageBot:
Stacking Models for Nearly Optimal Link Prediction in Complex Networks. Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset https://t.co/tFGrLqHQwv

arxiv_cs_LG:
Stacking Models for Nearly Optimal Link Prediction in Complex Networks. Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset https://t.co/WOKu8OXKCf

This page is a companion for our paper on optimal link prediction, written by Amir Ghasemian, Homa Hosseinmardi, Aram Galstyan, Edoardo M. Airoldi, and Aaron Clauset. (arXiv:1909.07578)

Stargazers: 1

Subscribers: 2

Forks: 0

Open Issues: 0


Sample Sizes : None.

Authors: 5

Total Words: 19905

Unique Words: 3619

We propose a novel approach to the problem of multilevel clustering, which
aims to simultaneously partition data in each group and discover grouping
patterns among groups in a potentially large hierarchically structured corpus
of data. Our method involves a joint optimization formulation over several
spaces of discrete probability measures, which are endowed with Wasserstein
distance metrics. We propose several variants of this problem, which admit fast
optimization algorithms, by exploiting the connection to the problem of finding
Wasserstein barycenters. Consistency properties are established for the
estimates of both local and global clusters. Finally, the experimental results
with both synthetic and real data are presented to demonstrate the flexibility
and scalability of the proposed approach.
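
To give a feel for the Wasserstein distance between discrete measures that the abstract relies on, here is a sketch of the simplest special case only: the 1-Wasserstein distance between two one-dimensional empirical measures with the same number of equally weighted atoms. It is not the multilevel clustering method, and the function name `wasserstein_1d` is an assumption.

```python
def wasserstein_1d(xs, ys):
    """1-Wasserstein distance between two 1-D empirical measures.

    Assumes equal sample sizes and uniform weights; in that case the optimal
    transport plan matches the sorted samples in order.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(x - y) for x, y in pairs) / len(xs)


print(wasserstein_1d([0.0, 1.0], [1.0, 2.0]))  # 1.0: every atom moves by 1
print(wasserstein_1d([0.0, 2.0], [1.0, 1.0]))  # 1.0
```

General discrete measures in higher dimensions require solving an optimal transport problem rather than sorting, so this shortcut does not carry over directly.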

StatsPapers:
On Efficient Multilevel Clustering via Wasserstein Distances. https://t.co/F4vZehw8CR

Sample Sizes : None.

Authors: 7

Total Words: 13997

Unique Words: 2823

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

*Tracking 192,929 papers.*
