Top 10 Arxiv Papers Today in Machine Learning


0.0 Mikeys
#1. Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns with a Graph Convolutional Neural Network
Xiongfeng Yan, Tinghua Ai
Machine learning methods such as convolutional neural networks (CNNs) are becoming an integral part of scientific research in many disciplines, spatial vector data often fail to be analyzed using these powerful learning methods because of its irregularities. With the aid of graph Fourier transform and convolution theorem, it is possible to convert the convolution as a point-wise product in Fourier domain and construct a learning architecture of CNN on graph for the analysis task of irregular spatial data. In this study, we used the classification task of building patterns as a case study to test this method, and experiments showed that this method has achieved outstanding results in identifying regular and irregular patterns, and has significantly improved in comparing with other methods.
more | pdf | html
Figures
Tweets
arxivml: "Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns with a Graph Convolu… https://t.co/XwMdLOb5RC
nmfeeds: [O] https://t.co/PhJx9JIFPv Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns ...
Memoirs: Analysis of Irregular Spatial Data with Machine Learning: Classification of Building Patterns with a Graph Convolutional Neural Network. https://t.co/StsSt7Bv5c
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 2872
Unqiue Words: 1081

0.0 Mikeys
#2. Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines
Daniel J. Luckett, Eric B. Laber, Samer S. El-Kamary, Cheng Fan, Ravi Jhaveri, Charles M. Perou, Fatma M. Shebl, Michael R. Kosorok
Many problems that appear in biomedical decision making, such as diagnosing disease and predicting response to treatment, can be expressed as binary classification problems. The costs of false positives and false negatives vary across application domains and receiver operating characteristic (ROC) curves provide a visual representation of this trade-off. Nonparametric estimators for the ROC curve, such as a weighted support vector machine (SVM), are desirable because they are robust to model misspecification. While weighted SVMs have great potential for estimating ROC curves, their theoretical properties were heretofore underdeveloped. We propose a method for constructing confidence bands for the SVM ROC curve and provide the theoretical justification for the SVM ROC curve by showing that the risk function of the estimated decision rule is uniformly consistent across the weight parameter. We demonstrate the proposed confidence band method and the superior sensitivity and specificity of the weighted SVM compared to commonly used...
more | pdf | html
Figures
Tweets
arxiv_org: Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines. https://t.co/DrZTs3WR4R https://t.co/g61WrLS9cA
HubBucket: RT @arxiv_org: Receiver Operating Characteristic Curves and Confidence Bands for Support Vector Machines. https://t.co/DrZTs3WR4R https://t…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 8
Total Words: 11320
Unqiue Words: 2363

0.0 Mikeys
#3. Quantification under prior probability shift: the ratio estimator and its extensions
Afonso Fernandes Vaz, Rafael Izbicki, Rafael Bassi Stern
The quantification problem consists of determining the prevalence of a given label in a target population. However, one often has access to the labels in a sample from the training population but not in the target population. A common assumption in this situation is that of prior probability shift, that is, once the labels are known, the distribution of the features is the same in the training and target populations. In this paper, we derive a new lower bound for the risk of the quantification problem under the prior shift assumption. Complementing this lower bound, we present a new approximately minimax class of estimators, ratio estimators, which generalize several previous proposals in the literature. Using a weaker version of the prior shift assumption, which can be tested, we show that ratio estimators can be used to build confidence intervals for the quantification problem. We also extend the ratio estimator so that it can: (i) incorporate labels from the target population, when they are available and (ii) estimate how the...
more | pdf | html
Figures
None.
Tweets
arxiv_org: Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/FIF7eHto8V
HubBucket: RT @arxiv_org: Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/F…
udmrzn: RT @arxiv_org: Quantification under prior probability shift: the ratio estimator and its extensions. https://t.co/vCtSDqJ2XW https://t.co/F…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 11908
Unqiue Words: 2193

0.0 Mikeys
#4. Efficient Bayesian Inference of Sigmoidal Gaussian Cox Processes
Christian Donner, Manfred Opper
We present an approximate Bayesian inference approach for estimating the intensity of a inhomogeneous Poisson process, where the intensity function is modelled using a Gaussian process (GP) prior via a sigmoid link function. Augmenting the model using a latent marked Poisson process and P\'olya--Gamma random variables we obtain a representation of the likelihood which is conjugate to the GP prior. We approximate the posterior using a free--form mean field approximation together with the framework of sparse GPs. Furthermore, as alternative approximation we suggest a sparse Laplace approximation of the posterior, for which an efficient expectation--maximisation algorithm is derived to find the posterior's mode. Results of both algorithms compare well with exact inference obtained by a Markov Chain Monte Carlo sampler and standard variational Gauss approach, while being one order of magnitude faster.
more | pdf | html
Figures
Tweets
hiropon_matsu: "Efficient Bayesian Inference of Sigmoidal Gaussian Cox Processes" https://t.co/MQDtFOAtzE
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 11455
Unqiue Words: 2673

0.0 Mikeys
#5. Dynamic Assortment Selection under the Nested Logit Models
Xi Chen, Yining Wang, Yuan Zhou
We study a stylized dynamic assortment planning problem during a selling season of finite length $T$, by considering a nested multinomial logit model with $M$ nests and $N$ items per nest. Our policy simultaneously learns customers' choice behavior and makes dynamic decisions on assortments based on the current knowledge. It achieves the regret at the order of $\tilde{O}(\sqrt{MNT}+MN^2)$, where $M$ is the number of nests and $N$ is the number of products in each nest. We further provide a lower bound result of $\Omega(\sqrt{MT})$, which shows the optimality of the upper bound when $T>M$ and $N$ is small. However, the $N^2$ term in the upper bound is not ideal for applications where $N$ is large as compared to $T$. To address this issue, we further generalize our first policy by introducing a discretization technique, which leads to a regret of $\tilde{O}(\sqrt{M}T^{2/3}+MNT^{1/3})$ with a specific choice of discretization granularity. It improves the previous regret bound whenever $N>T^{1/3}$. We provide numerical results to...
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 17873
Unqiue Words: 3153

0.0 Mikeys
#6. Uplift Modeling from Separate Labels
Ikko Yamane, Florian Yger, Jamal Atif, Masashi Sugiyama
Uplift modeling is aimed at estimating the incremental impact of an action on an individual's behavior, which is useful in various application domains such as targeted marketing (advertisement campaigns) and personalized medicine (medical treatments). Conventional methods of uplift modeling require every instance to be jointly equipped with two types of labels: the taken action and its outcome. However, obtaining two labels for each instance at the same time is difficult or expensive in many real-world problems. In this paper, we propose a novel method of uplift modeling that is applicable to a more practical setting where only one type of labels is available for each instance. We show a generalization error bound for the proposed method and demonstrate its effectiveness through experiments.
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 9308
Unqiue Words: 2424

0.0 Mikeys
#7. Exact information propagation through fully-connected feed forward neural networks
Rebekka Burkholz, Alina Dubatovka
Neural network ensembles at initialisation give rise to the trainability and training speed of neural networks and thus support parameter choices at initialisation. These insights rely so far on mean field approximations that assume infinite layer width and study average squared signals. Thus, information about the full output distribution gets lost. Therefore, we derive the output distribution exactly (without mean field assumptions), for fully-connected networks with Gaussian weights and biases. The layer-wise transition of the signal distribution is guided by a linear integral operator, whose kernel has a closed form solution in case of rectified linear units for nonlinear activations. This enables us to analyze some of its spectral properties, for instance, the shape of the stationary distribution for different parameter choices and the dynamics of signal propagation.
more | pdf | html
Figures
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 4861
Unqiue Words: 1334

0.0 Mikeys
#8. Error Bounds for Piecewise Smooth and Switching Regression
Fabien Lauer
The paper deals with regression problems, in which the nonsmooth target is assumed to switch between different operating modes. Specifically, piecewise smooth (PWS) regression considers target functions switching deterministically via a partition of the input space, while switching regression considers arbitrary switching laws. The paper derives generalization error bounds in these two settings by following the approach based on Rademacher complexities. For PWS regression, our derivation involves a chaining argument and a decomposition of the covering numbers of PWS classes in terms of the ones of their component functions and the capacity of the classifier partitioning the input space. This yields error bounds with a radical dependency on the number of modes. For switching regression, the decomposition can be performed directly at the level of the Rademacher complexities, which yields bounds with a linear dependency on the number of modes. By using once more chaining and a decomposition at the level of covering numbers, we show...
more | pdf | html
Figures
None.
Tweets
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 11326
Unqiue Words: 2382

0.0 Mikeys
#9. Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising
Yuan Yuan, Xiaojing Dong, Chen Dong, Yiwen Sun, Zhenyu Yan, Abhishek Pani
Predicting keywords performance, such as number of impressions, click-through rate (CTR), conversion rate (CVR), revenue per click (RPC), and cost per click (CPC), is critical for sponsored search in the online advertising industry. An interesting phenomenon is that, despite the size of the overall data, the data are very sparse at the individual unit level. To overcome the sparsity and leverage hierarchical information across the data structure, we propose a Dynamic Hierarchical Empirical Bayesian (DHEB) model that dynamically determines the hierarchy through a data-driven process and provides shrinkage-based estimations. Our method is also equipped with an efficient empirical approach to derive inferences through the hierarchy. We evaluate the proposed method in both simulated and real-world datasets and compare to several competitive models. The results favor the proposed method among all comparisons in terms of both accuracy and efficiency. In the end, we design a two-phase system to serve prediction in real time.
more | pdf | html
Figures
Tweets
arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co/OS9SzjzgSM
M157q_News_RSS: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. (arXiv:1809.02213v1 [https://t.co/eOmVsbWZjL https://t.co/2J3mIjH6n2
arxivml: "Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising", Yuan Yuan, Xiaojing Dong,… https://t.co/hTiANDWc4s
dan_marinazzo: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
EldarSilver: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
elasticjava: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
udmrzn: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
gaialive: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
morioka: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
bottom100x100: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
vnzloy: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
PerthMLGroup: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
esigma6: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
AssistedEvolve: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
11shubh_laabh11: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
festivalWon: RT @arxiv_org: Dynamic Hierarchical Empirical Bayes: A Predictive Model Applied to Online Advertising. https://t.co/N3zvytNV94 https://t.co…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 5638
Unqiue Words: 1582

0.0 Mikeys
#10. HMLasso: Lasso for High Dimensional and Highly Missing Data
Masaaki Takada, Hironori Fujisawa, Takeichiro Nishikawa
Sparse regression such as Lasso has achieved great success in dealing with high dimensional data for several decades. However, there are few methods applicable to missing data, which often occurs in high dimensional data. Recently, CoCoLasso was proposed to deal with high dimensional missing data, but it still suffers from highly missing data. In this paper, we propose a novel Lasso-type regression technique for Highly Missing data, called `HMLasso'. We use the mean imputed covariance matrix, which is notorious in general due to its estimation bias for missing data. However, we effectively incorporate it into Lasso, by using a useful connection with the pairwise covariance matrix. The resulting optimization problem can be seen as a weighted modification of CoCoLasso with the missing ratios, and is quite effective for highly missing data. To the best of our knowledge, this is the first method that can efficiently deal with both high dimensional and highly missing data. We show that the proposed method is beneficial with regards to...
more | pdf | html
Figures
Tweets
arxivml: "HMLasso: Lasso for High Dimensional and Highly Missing Data", Masaaki Takada, Hironori Fujisawa, Takeichiro Nishik… https://t.co/UQRkIuStTK
FerrumA: Lasso regression for highly missing data: https://t.co/LPKqAKQr60 (How good, practical for large p and p>n?)
FerrumA: LASSO regression for highly missing data: https://t.co/LPKqAKQr60 (experiments for p << n).
ComputerPapers: HMLasso: Lasso for High Dimensional and Highly Missing Data. https://t.co/U55nlLKzrs
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 8323
Unqiue Words: 1961

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,995 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 72,995 papers.