Machine learning methods such as convolutional neural networks (CNNs) are
becoming an integral part of scientific research in many disciplines, spatial
vector data often fail to be analyzed using these powerful learning methods
because of its irregularities. With the aid of graph Fourier transform and
convolution theorem, it is possible to convert the convolution as a point-wise
product in Fourier domain and construct a learning architecture of CNN on graph
for the analysis task of irregular spatial data. In this study, we used the
classification task of building patterns as a case study to test this method,
and experiments showed that this method has achieved outstanding results in
identifying regular and irregular patterns, and has significantly improved in
comparing with other methods.

more |
pdf
| html
arxivml:
"Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns with a Graph Convolu…
https://t.co/XwMdLOb5RC

nmfeeds:
[O] https://t.co/PhJx9JIFPv Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns ...

Memoirs:
Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns with a Graph Convolutional Neural Network. https://t.co/StsSt7Bv5c

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 2872

Unqiue Words: 1081

Many problems that appear in biomedical decision making, such as diagnosing
disease and predicting response to treatment, can be expressed as binary
classification problems. The costs of false positives and false negatives vary
across application domains and receiver operating characteristic (ROC) curves
provide a visual representation of this trade-off. Nonparametric estimators for
the ROC curve, such as a weighted support vector machine (SVM), are desirable
because they are robust to model misspecification. While weighted SVMs have
great potential for estimating ROC curves, their theoretical properties were
heretofore underdeveloped. We propose a method for constructing confidence
bands for the SVM ROC curve and provide the theoretical justification for the
SVM ROC curve by showing that the risk function of the estimated decision rule
is uniformly consistent across the weight parameter. We demonstrate the
proposed confidence band method and the superior sensitivity and specificity of
the weighted SVM compared to commonly used...

more |
pdf
| html
arxiv_org:
Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines. https://t.co/DrZTs3WR4R https://t.co/g61WrLS9cA

HubBucket:
RT @arxiv_org: Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines. https://t.co/DrZTs3WR4R https://t…

None.

None.

Sample Sizes : None.

Authors: 8

Total Words: 11320

Unqiue Words: 2363

The quantification problem consists of determining the prevalence of a given
label in a target population. However, one often has access to the labels in a
sample from the training population but not in the target population. A common
assumption in this situation is that of prior probability shift, that is, once
the labels are known, the distribution of the features is the same in the
training and target populations. In this paper, we derive a new lower bound for
the risk of the quantification problem under the prior shift assumption.
Complementing this lower bound, we present a new approximately minimax class of
estimators, ratio estimators, which generalize several previous proposals in
the literature. Using a weaker version of the prior shift assumption, which can
be tested, we show that ratio estimators can be used to build confidence
intervals for the quantification problem. We also extend the ratio estimator so
that it can: (i) incorporate labels from the target population, when they are
available and (ii) estimate how the...

more |
pdf
| html
None.

arxiv_org:
Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/FIF7eHto8V

HubBucket:
RT @arxiv_org: Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/F…

udmrzn:
RT @arxiv_org: Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/F…

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 11908

Unqiue Words: 2193

We present an approximate Bayesian inference approach for estimating the
intensity of a inhomogeneous Poisson process, where the intensity function is
modelled using a Gaussian process (GP) prior via a sigmoid link function.
Augmenting the model using a latent marked Poisson process and P\'olya--Gamma
random variables we obtain a representation of the likelihood which is
conjugate to the GP prior. We approximate the posterior using a free--form mean
field approximation together with the framework of sparse GPs. Furthermore, as
alternative approximation we suggest a sparse Laplace approximation of the
posterior, for which an efficient expectation--maximisation algorithm is
derived to find the posterior's mode. Results of both algorithms compare well
with exact inference obtained by a Markov Chain Monte Carlo sampler and
standard variational Gauss approach, while being one order of magnitude faster.

more |
pdf
| html
hiropon_matsu:
"Efficient Bayesian Inference of Sigmoidal Gaussian Cox Processes" https://t.co/MQDtFOAtzE

None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 11455

Unqiue Words: 2673

We study a stylized dynamic assortment planning problem during a selling
season of finite length $T$, by considering a nested multinomial logit model
with $M$ nests and $N$ items per nest. Our policy simultaneously learns
customers' choice behavior and makes dynamic decisions on assortments based on
the current knowledge. It achieves the regret at the order of
$\tilde{O}(\sqrt{MNT}+MN^2)$, where $M$ is the number of nests and $N$ is the
number of products in each nest. We further provide a lower bound result of
$\Omega(\sqrt{MT})$, which shows the optimality of the upper bound when $T>M$
and $N$ is small. However, the $N^2$ term in the upper bound is not ideal for
applications where $N$ is large as compared to $T$. To address this issue, we
further generalize our first policy by introducing a discretization technique,
which leads to a regret of $\tilde{O}(\sqrt{M}T^{2/3}+MNT^{1/3})$ with a
specific choice of discretization granularity. It improves the previous regret
bound whenever $N>T^{1/3}$. We provide numerical results to...

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 17873

Unqiue Words: 3153

Uplift modeling is aimed at estimating the incremental impact of an action on
an individual's behavior, which is useful in various application domains such
as targeted marketing (advertisement campaigns) and personalized medicine
(medical treatments). Conventional methods of uplift modeling require every
instance to be jointly equipped with two types of labels: the taken action and
its outcome. However, obtaining two labels for each instance at the same time
is difficult or expensive in many real-world problems. In this paper, we
propose a novel method of uplift modeling that is applicable to a more
practical setting where only one type of labels is available for each instance.
We show a generalization error bound for the proposed method and demonstrate
its effectiveness through experiments.

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 4

Total Words: 9308

Unqiue Words: 2424

Neural network ensembles at initialisation give rise to the trainability and
training speed of neural networks and thus support parameter choices at
initialisation. These insights rely so far on mean field approximations that
assume infinite layer width and study average squared signals. Thus,
information about the full output distribution gets lost. Therefore, we derive
the output distribution exactly (without mean field assumptions), for
fully-connected networks with Gaussian weights and biases. The layer-wise
transition of the signal distribution is guided by a linear integral operator,
whose kernel has a closed form solution in case of rectified linear units for
nonlinear activations. This enables us to analyze some of its spectral
properties, for instance, the shape of the stationary distribution for
different parameter choices and the dynamics of signal propagation.

more |
pdf
| html
None.

None.

Sample Sizes : None.

Authors: 2

Total Words: 4861

Unqiue Words: 1334

The paper deals with regression problems, in which the nonsmooth target is
assumed to switch between different operating modes. Specifically, piecewise
smooth (PWS) regression considers target functions switching deterministically
via a partition of the input space, while switching regression considers
arbitrary switching laws. The paper derives generalization error bounds in
these two settings by following the approach based on Rademacher complexities.
For PWS regression, our derivation involves a chaining argument and a
decomposition of the covering numbers of PWS classes in terms of the ones of
their component functions and the capacity of the classifier partitioning the
input space. This yields error bounds with a radical dependency on the number
of modes. For switching regression, the decomposition can be performed directly
at the level of the Rademacher complexities, which yields bounds with a linear
dependency on the number of modes. By using once more chaining and a
decomposition at the level of covering numbers, we show...

more |
pdf
| html
None.

None.

None.

Sample Sizes : None.

Authors: 1

Total Words: 11326

Unqiue Words: 2382

Predicting keywords performance, such as number of impressions, click-through
rate (CTR), conversion rate (CVR), revenue per click (RPC), and cost per click
(CPC), is critical for sponsored search in the online advertising industry. An
interesting phenomenon is that, despite the size of the overall data, the data
are very sparse at the individual unit level. To overcome the sparsity and
leverage hierarchical information across the data structure, we propose a
Dynamic Hierarchical Empirical Bayesian (DHEB) model that dynamically
determines the hierarchy through a data-driven process and provides
shrinkage-based estimations. Our method is also equipped with an efficient
empirical approach to derive inferences through the hierarchy. We evaluate the
proposed method in both simulated and real-world datasets and compare to
several competitive models. The results favor the proposed method among all
comparisons in terms of both accuracy and efficiency. In the end, we design a
two-phase system to serve prediction in real time.

more |
pdf
| html
arxiv_org:
Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co/OS9SzjzgSM

M157q_News_RSS:
Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. (arXiv:1809.02213v1 [https://t.co/eOmVsbWZjL
https://t.co/2J3mIjH6n2

arxivml:
"Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising",
Yuan Yuan, Xiaojing Dong,…
https://t.co/hTiANDWc4s

dan_marinazzo:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

EldarSilver:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

elasticjava:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

udmrzn:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

gaialive:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

morioka:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

bottom100x100:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

vnzloy:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

PerthMLGroup:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

esigma6:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

AssistedEvolve:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

11shubh_laabh11:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

festivalWon:
RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…

None.

None.

Sample Sizes : None.

Authors: 6

Total Words: 5638

Unqiue Words: 1582

Sparse regression such as Lasso has achieved great success in dealing with
high dimensional data for several decades. However, there are few methods
applicable to missing data, which often occurs in high dimensional data.
Recently, CoCoLasso was proposed to deal with high dimensional missing data,
but it still suffers from highly missing data. In this paper, we propose a
novel Lasso-type regression technique for Highly Missing data, called
`HMLasso'. We use the mean imputed covariance matrix, which is notorious in
general due to its estimation bias for missing data. However, we effectively
incorporate it into Lasso, by using a useful connection with the pairwise
covariance matrix. The resulting optimization problem can be seen as a weighted
modification of CoCoLasso with the missing ratios, and is quite effective for
highly missing data. To the best of our knowledge, this is the first method
that can efficiently deal with both high dimensional and highly missing data.
We show that the proposed method is beneficial with regards to...

more |
pdf
| html
arxivml:
"HMLasso: Lasso for High Dimensional and Highly Missing Data",
Masaaki Takada, Hironori Fujisawa, Takeichiro Nishik…
https://t.co/UQRkIuStTK

FerrumA:
Lasso regression for highly missing data: https://t.co/LPKqAKQr60 (How good, practical for large p and p>n?)

FerrumA:
LASSO regression for highly missing data: https://t.co/LPKqAKQr60 (experiments for p << n).

ComputerPapers:
HMLasso: Lasso for High Dimensional and Highly Missing Data. https://t.co/U55nlLKzrs

None.

None.

Sample Sizes : None.

Authors: 3

Total Words: 8323

Unqiue Words: 1961

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

*Tracking 72,995 papers.*

Sort results based on if they are interesting or reproducible.

Interesting

Reproducible