Top 10 Arxiv Papers Today in Machine Learning


0.0 Mikeys
#1. Fourier Transform Approach to Machine Learning III: Fourier Classification
Soheil Mehrabkhani
We propose a Fourier-based learning algorithm for highly nonlinear multiclass classification. The algorithm is based on a smoothing technique to calculate the probability distribution of all classes. To obtain the probability distribution, the density distribution of each class is smoothed by a low-pass filter separately. The advantage of the Fourier representation is capturing the nonlinearities of the data distribution without defining any kernel function. Furthermore, contrary to the support vector machines, it makes a probabilistic explanation for the classification possible. Moreover, it can treat overlapped classes as well. Comparing to the logistic regression, it does not require feature engineering. In general, its computational performance is also very well for large data sets and in contrast to other algorithms, the typical overfitting problem does not happen at all. The capability of the algorithm is demonstrated for multiclass classification with overlapped classes and very high nonlinearity of the class distributions.
more | pdf | html
Figures
Tweets
arxivml: "Fourier Transform Approach to Machine Learning III: Fourier Classification", Soheil Mehrabkhani https://t.co/PJICyanoKx
Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
alexandralacruz: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
cygarde: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
StigmaEnder: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 2815
Unqiue Words: 916

0.0 Mikeys
#2. Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications
Qing Qu, Zhihui Zhu, Xiao Li, Manolis C. Tsakiris, John Wright, René Vidal
The problem of finding the sparsest vector (direction) in a low dimensional subspace can be considered as a homogeneous variant of the sparse recovery problem, which finds applications in robust subspace recovery, dictionary learning, sparse blind deconvolution, and many other problems in signal processing and machine learning. However, in contrast to the classical sparse recovery problem, the most natural formulation for finding the sparsest vector in a subspace is usually nonconvex. In this paper, we overview recent advances on global nonconvex optimization theory for solving this problem, ranging from geometric analysis of its optimization landscapes, to efficient optimization algorithms for solving the associated nonconvex optimization problem, to applications in machine intelligence, representation learning, and imaging sciences. Finally, we conclude this review by pointing out several interesting open problems for future research.
more | pdf | html
Figures
None.
Tweets
arxivml: "Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications", Qing Qu, Zhihui Zhu, Xiao Li, M… https://t.co/NI7zMWjft0
StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
IgorCarron: RT @StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
jackiefloyd: RT @StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#3. FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel
Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given image, the pseudo-label is only retained if the model produces a high-confidence prediction. The model is then trained to predict the pseudo-label when fed a strongly-augmented version of the same image. Despite its simplicity, we show that FixMatch achieves state-of-the-art performance across a variety of standard semi-supervised learning benchmarks, including 94.93% accuracy on CIFAR-10 with 250 labels and 88.61% accuracy with 40 -- just 4 labels per class. Since FixMatch bears many similarities to existing SSL methods that achieve worse performance, we carry out an extensive ablation study to tease apart the experimental factors that are...
more | pdf | html
Figures
None.
Tweets
D_Berthelot_ML: FixMatch: focusing on simplicity for semi-supervised learning and improving state of the art (CIFAR 94.9% with 250 labels, 88.6% with 40). https://t.co/QuP6oN7iCS Collaboration with Kihyuk Sohn, @chunliang_tw @ZizhaoZhang Nicholas Carlini @ekindogus @Han_Zhang_ @colinraffel https://t.co/BmeYvpEHzX
arxivml: "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence", Kihyuk Sohn, David Berthelot, Chu… https://t.co/bXM2Sjlwyq
phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https://t.co/mRpHGSoIv8 https://t.co/9htNCgAnlN
hereticreader: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence - https://t.co/9D1COPFWPY https://t.co/TszkmFaMKr
arxiv_cs_cv_pr: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, and Colin Raffel https://t.co/2yOM5S9jz8
StatsPapers: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. https://t.co/cfgsxX3XLx
FujitaAtsunori: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
inoichan: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
TeraBytesMemory: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 9
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#4. Mobility Inference on Long-Tailed Sparse Trajectory
Lei Shi
Analyzing the urban trajectory in cities has become an important topic in data mining. How can we model the human mobility consisting of stay and travel from the raw trajectory data? How can we infer such a mobility model from the single trajectory information? How can we further generalize the mobility inference to accommodate the real-world trajectory data that is sparsely sampled over time? In this paper, based on formal and rigid definitions of the stay/travel mobility, we propose a single trajectory inference algorithm that utilizes a generic long-tailed sparsity pattern in the large-scale trajectory data. The algorithm guarantees a 100\% precision in the stay/travel inference with a provable lower-bound in the recall. Furthermore, we introduce an encoder-decoder learning architecture that admits multiple trajectories as inputs. The architecture is optimized for the mobility inference problem through customized embedding and learning mechanism. Evaluations with three trajectory data sets of 40 million urban users validate...
more | pdf | html
Figures
None.
Tweets
arxivml: "Mobility Inference on Long-Tailed Sparse Trajectory", Lei Shi https://t.co/cFQlHR0F2M
StatsPapers: Mobility Inference on Long-Tailed Sparse Trajectory. https://t.co/iW63jLjHRX
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#5. Generate High-Resolution Adversarial Samples by Identifying Effective Features
Sizhe Chen, Peidong Zhang, Chengjin Sun, Jia Cai, Xiaolin Huang
As the prevalence of deep learning in computer vision, adversarial samples that weaken the neural networks emerge in large numbers, revealing their deep-rooted defects. Most adversarial attacks calculate an imperceptible perturbation in image space to fool the DNNs. In this strategy, the perturbation looks like noise and thus could be mitigated. Attacks in feature space produce semantic perturbation, but they could only deal with low resolution samples. The reason lies in the great number of coupled features to express a high-resolution image. In this paper, we propose Attack by Identifying Effective Features (AIEF), which learns different weights for features to attack. Effective features, those with great weights, influence the victim model much but distort the image little, and thus are more effective for attack. By attacking mostly on them, AIEF produces high resolution adversarial samples with acceptable distortions. We demonstrate the effectiveness of AIEF by attacking on different tasks with different generative models.
more | pdf | html
Figures
Tweets
arxivml: "Generate High-Resolution Adversarial Samples by Identifying Effective Features", Sizhe Chen, Peidong Zhang, Chengj… https://t.co/GGFDXbG93S
arxiv_cs_cv_pr: Generate High-Resolution Adversarial Samples by Identifying Effective Features. Sizhe Chen, Peidong Zhang, Chengjin Sun, Jia Cai, and Xiaolin Huang https://t.co/ya7ejWg32g
StatsPapers: Generate High-Resolution Adversarial Samples by Identifying Effective Features. https://t.co/UtaTvpEQ0V
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 5361
Unqiue Words: 1751

0.0 Mikeys
#6. batchboost: regularization for stabilizing training with resistance to underfitting & overfitting
Maciej A. Czyzewski
Overfitting & underfitting and stable training are an important challenges in machine learning. Current approaches for these issues are mixup, SamplePairing and BC learning. In our work, we state the hypothesis that mixing many images together can be more effective than just two. Batchboost pipeline has three stages: (a) pairing: method of selecting two samples. (b) mixing: how to create a new one from two samples. (c) feeding: combining mixed samples with new ones from dataset into batch (with ratio $\gamma$). Note that sample that appears in our batch propagates with subsequent iterations with less and less importance until the end of training. Pairing stage calculates the error per sample, sorts the samples and pairs with strategy: hardest with easiest one, than mixing stage merges two samples using mixup, $x_1 + (1-\lambda)x_2$. Finally, feeding stage combines new samples with mixed by ratio 1:1. Batchboost has 0.5-3% better accuracy than the current state-of-the-art mixup regularization on CIFAR-10 & Fashion-MNIST. Our method...
more | pdf | html
Figures
None.
Tweets
StatsPapers: batchboost: regularization for stabilizing training with resistance to underfitting & overfitting. https://t.co/hq6sEOuouo
arxivml: "batchboost: regularization for stabilizing training with resistance to underfitting & overfitting", Maciej A. Czyz… https://t.co/Zns2ZLjPxA
arxiv_cs_cv_pr: batchboost: regularization for stabilizing training with resistance to underfitting & overfitting. Maciej A. Czyzewski https://t.co/tAdif09ufj
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#7. Simple and Effective Graph Autoencoders with One-Hop Linear Models
Guillaume Salha, Romain Hennequin, Michalis Vazirgiannis
Graph autoencoders (AE) and variational autoencoders (VAE) recently emerged as powerful node embedding methods, with promising performances on challenging tasks such as link prediction and node clustering. Graph AE, VAE and most of their extensions rely on graph convolutional networks (GCN) encoders to learn vector space representations of nodes. In this paper, we propose to replace the GCN encoder by a significantly simpler linear model w.r.t. the direct neighborhood (one-hop) adjacency matrix of the graph. For the two aforementioned tasks, we show that this approach consistently reaches competitive performances w.r.t. GCN-based models for numerous real-world graphs, including all benchmark datasets commonly used to evaluate graph AE and VAE. We question the relevance of repeatedly using these datasets to compare complex graph AE and VAE. We also emphasize the effectiveness of the proposed encoding scheme, that appears as a simpler and faster alternative to GCN encoders for many real-world applications.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Simple and Effective Graph Autoencoders with One-Hop Linear Models. Guillaume Salha, Romain Hennequin, and Michalis Vazirgiannis https://t.co/gwZdV2H7OD
arxivml: "Simple and Effective Graph Autoencoders with One-Hop Linear Models", Guillaume Salha, Romain Hennequin, Michalis V… https://t.co/KwMKt2nKtD
StatsPapers: Simple and Effective Graph Autoencoders with One-Hop Linear Models. https://t.co/Flv14FtrEg
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#8. Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
Kyoung-Woon On, Eun-Sol Kim, Yu-Jung Heo, Byoung-Tak Zhang
Conventional sequential learning methods such as Recurrent Neural Networks (RNNs) focus on interactions between consecutive inputs, i.e. first-order Markovian dependency. However, most of sequential data, as seen with videos, have complex dependency structures that imply variable-length semantic flows and their compositions, and those are hard to be captured by conventional methods. Here, we propose Cut-Based Graph Learning Networks (CB-GLNs) for learning video data by discovering these complex structures of the video. The CB-GLNs represent video data as a graph, with nodes and edges corresponding to frames of the video and their dependencies respectively. The CB-GLNs find compositional dependencies of the data in multilevel graph forms via a parameterized kernel with graph-cut and a message passing framework. We evaluate the proposed method on the two different tasks for video understanding: Video theme classification (Youtube-8M dataset) and Video Question and Answering (TVQA dataset). The experimental results show that...
more | pdf | html
Figures
None.
Tweets
arxivml: "Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data", Kyoung-Woon On, E… https://t.co/fE9biROPCc
StatsPapers: Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data. https://t.co/MOiPKOokKC
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#9. Understanding the Limitations of Network Online Learning
Timothy LaRock, Timothy Sakharov, Sahely Bhadra, Tina Eliassi-Rad
Studies of networked phenomena, such as interactions in online social media, often rely on incomplete data, either because these phenomena are partially observed, or because the data is too large or expensive to acquire all at once. Analysis of incomplete data leads to skewed or misleading results. In this paper, we investigate limitations of learning to complete partially observed networks via node querying. Concretely, we study the following problem: given (i) a partially observed network, (ii) the ability to query nodes for their connections (e.g., by accessing an API), and (iii) a budget on the number of such queries, sequentially learn which nodes to query in order to maximally increase observability. We call this querying process Network Online Learning and present a family of algorithms called NOL*. These algorithms learn to choose which partially observed node to query next based on a parameterized model that is trained online through a process of exploration and exploitation. Extensive experiments on both synthetic and...
more | pdf | html
Figures
None.
Tweets
arxivml: "Understanding the Limitations of Network Online Learning", Timothy LaRock, Timothy Sakharov, Sahely Bhadra, Tina E… https://t.co/0GGEqjEsUC
StatsPapers: Understanding the Limitations of Network Online Learning. https://t.co/tZ3uypWayV
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

0.0 Mikeys
#10. Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning
Wilhelm Kirchgässner, Oliver Wallscheid, Joachim Böcker
Monitoring the magnet temperature in permanent magnet synchronous motors (PMSMs) for automotive applications is a challenging task for several decades now, as signal injection or sensor-based methods still prove unfeasible in a commercial context. Overheating results in severe motor deterioration and is thus of high concern for the machine's control strategy and its design. Lack of precise temperature estimations leads to lesser device utilization and higher material cost. In this work, several machine learning (ML) models are empirically evaluated on their estimation accuracy for the task of predicting latent high-dynamic magnet temperature profiles. The range of selected algorithms covers as diverse approaches as possible with ordinary and weighted least squares, support vector regression, $k$-nearest neighbors, randomized trees and neural networks. Having test bench data available, it is shown that ML approaches relying merely on collected data meet the estimation performance of classical thermal models built on thermodynamic...
more | pdf | html
Figures
Tweets
arxivml: "Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning", Wilhe… https://t.co/IaYmkBLmXv
arxiv_cs_LG: Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning. Wilhelm Kirchgässner, Oliver Wallscheid, and Joachim Böcker https://t.co/CERh88LPOj
Memoirs: Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning. https://t.co/i7m9wshKUy
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6429
Unqiue Words: 2498

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 256,574 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 256,574 papers.