### Top 10 Arxiv Papers Today in Machine Learning

##### #1. Fourier Transform Approach to Machine Learning III: Fourier Classification
###### Soheil Mehrabkhani
We propose a Fourier-based learning algorithm for highly nonlinear multiclass classification. The algorithm is based on a smoothing technique to calculate the probability distribution of all classes. To obtain the probability distribution, the density distribution of each class is smoothed by a low-pass filter separately. The advantage of the Fourier representation is capturing the nonlinearities of the data distribution without defining any kernel function. Furthermore, contrary to the support vector machines, it makes a probabilistic explanation for the classification possible. Moreover, it can treat overlapped classes as well. Comparing to the logistic regression, it does not require feature engineering. In general, its computational performance is also very well for large data sets and in contrast to other algorithms, the typical overfitting problem does not happen at all. The capability of the algorithm is demonstrated for multiclass classification with overlapped classes and very high nonlinearity of the class distributions.
more | pdf | html
###### Tweets
arxivml: "Fourier Transform Approach to Machine Learning III: Fourier Classification", Soheil Mehrabkhani https://t.co/PJICyanoKx
Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
alexandralacruz: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
cygarde: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
StigmaEnder: RT @Memoirs: Fourier Transform Approach to Machine Learning III: Fourier Classification. https://t.co/CBrD70EAv4
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 2815
Unqiue Words: 916

##### #2. Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications
###### Qing Qu, Zhihui Zhu, Xiao Li, Manolis C. Tsakiris, John Wright, René Vidal
The problem of finding the sparsest vector (direction) in a low dimensional subspace can be considered as a homogeneous variant of the sparse recovery problem, which finds applications in robust subspace recovery, dictionary learning, sparse blind deconvolution, and many other problems in signal processing and machine learning. However, in contrast to the classical sparse recovery problem, the most natural formulation for finding the sparsest vector in a subspace is usually nonconvex. In this paper, we overview recent advances on global nonconvex optimization theory for solving this problem, ranging from geometric analysis of its optimization landscapes, to efficient optimization algorithms for solving the associated nonconvex optimization problem, to applications in machine intelligence, representation learning, and imaging sciences. Finally, we conclude this review by pointing out several interesting open problems for future research.
more | pdf | html
None.
###### Tweets
arxivml: "Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications", Qing Qu, Zhihui Zhu, Xiao Li, M… https://t.co/NI7zMWjft0
StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
IgorCarron: RT @StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
jackiefloyd: RT @StatsPapers: Finding the Sparsest Vectors in a Subspace: Theory, Algorithms, and Applications. https://t.co/VyLo5gam4f
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

##### #3. FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
###### Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel
Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given image, the pseudo-label is only retained if the model produces a high-confidence prediction. The model is then trained to predict the pseudo-label when fed a strongly-augmented version of the same image. Despite its simplicity, we show that FixMatch achieves state-of-the-art performance across a variety of standard semi-supervised learning benchmarks, including 94.93% accuracy on CIFAR-10 with 250 labels and 88.61% accuracy with 40 -- just 4 labels per class. Since FixMatch bears many similarities to existing SSL methods that achieve worse performance, we carry out an extensive ablation study to tease apart the experimental factors that are...
more | pdf | html
None.
###### Tweets
D_Berthelot_ML: FixMatch: focusing on simplicity for semi-supervised learning and improving state of the art (CIFAR 94.9% with 250 labels, 88.6% with 40). https://t.co/QuP6oN7iCS Collaboration with Kihyuk Sohn, @chunliang_tw @ZizhaoZhang Nicholas Carlini @ekindogus @Han_Zhang_ @colinraffel https://t.co/BmeYvpEHzX
arxivml: "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence", Kihyuk Sohn, David Berthelot, Chu… https://t.co/bXM2Sjlwyq
phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https://t.co/mRpHGSoIv8 https://t.co/9htNCgAnlN
hereticreader: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence - https://t.co/9D1COPFWPY https://t.co/TszkmFaMKr
arxiv_cs_cv_pr: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, and Colin Raffel https://t.co/2yOM5S9jz8
StatsPapers: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence. https://t.co/cfgsxX3XLx
FujitaAtsunori: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
inoichan: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
TeraBytesMemory: RT @phalanxXxXxX: FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence paper: https://t.co/bJq2a2D0dG code: https…
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 9
Total Words: 0
Unqiue Words: 0

##### #4. Mobility Inference on Long-Tailed Sparse Trajectory
###### Lei Shi
Analyzing the urban trajectory in cities has become an important topic in data mining. How can we model the human mobility consisting of stay and travel from the raw trajectory data? How can we infer such a mobility model from the single trajectory information? How can we further generalize the mobility inference to accommodate the real-world trajectory data that is sparsely sampled over time? In this paper, based on formal and rigid definitions of the stay/travel mobility, we propose a single trajectory inference algorithm that utilizes a generic long-tailed sparsity pattern in the large-scale trajectory data. The algorithm guarantees a 100\% precision in the stay/travel inference with a provable lower-bound in the recall. Furthermore, we introduce an encoder-decoder learning architecture that admits multiple trajectories as inputs. The architecture is optimized for the mobility inference problem through customized embedding and learning mechanism. Evaluations with three trajectory data sets of 40 million urban users validate...
more | pdf | html
None.
###### Tweets
arxivml: "Mobility Inference on Long-Tailed Sparse Trajectory", Lei Shi https://t.co/cFQlHR0F2M
StatsPapers: Mobility Inference on Long-Tailed Sparse Trajectory. https://t.co/iW63jLjHRX
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

##### #5. Generate High-Resolution Adversarial Samples by Identifying Effective Features
###### Sizhe Chen, Peidong Zhang, Chengjin Sun, Jia Cai, Xiaolin Huang
As the prevalence of deep learning in computer vision, adversarial samples that weaken the neural networks emerge in large numbers, revealing their deep-rooted defects. Most adversarial attacks calculate an imperceptible perturbation in image space to fool the DNNs. In this strategy, the perturbation looks like noise and thus could be mitigated. Attacks in feature space produce semantic perturbation, but they could only deal with low resolution samples. The reason lies in the great number of coupled features to express a high-resolution image. In this paper, we propose Attack by Identifying Effective Features (AIEF), which learns different weights for features to attack. Effective features, those with great weights, influence the victim model much but distort the image little, and thus are more effective for attack. By attacking mostly on them, AIEF produces high resolution adversarial samples with acceptable distortions. We demonstrate the effectiveness of AIEF by attacking on different tasks with different generative models.
more | pdf | html
###### Tweets
arxivml: "Generate High-Resolution Adversarial Samples by Identifying Effective Features", Sizhe Chen, Peidong Zhang, Chengj… https://t.co/GGFDXbG93S
arxiv_cs_cv_pr: Generate High-Resolution Adversarial Samples by Identifying Effective Features. Sizhe Chen, Peidong Zhang, Chengjin Sun, Jia Cai, and Xiaolin Huang https://t.co/ya7ejWg32g
StatsPapers: Generate High-Resolution Adversarial Samples by Identifying Effective Features. https://t.co/UtaTvpEQ0V
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 5361
Unqiue Words: 1751

##### #6. batchboost: regularization for stabilizing training with resistance to underfitting & overfitting
###### Maciej A. Czyzewski
Overfitting & underfitting and stable training are an important challenges in machine learning. Current approaches for these issues are mixup, SamplePairing and BC learning. In our work, we state the hypothesis that mixing many images together can be more effective than just two. Batchboost pipeline has three stages: (a) pairing: method of selecting two samples. (b) mixing: how to create a new one from two samples. (c) feeding: combining mixed samples with new ones from dataset into batch (with ratio $\gamma$). Note that sample that appears in our batch propagates with subsequent iterations with less and less importance until the end of training. Pairing stage calculates the error per sample, sorts the samples and pairs with strategy: hardest with easiest one, than mixing stage merges two samples using mixup, $x_1 + (1-\lambda)x_2$. Finally, feeding stage combines new samples with mixed by ratio 1:1. Batchboost has 0.5-3% better accuracy than the current state-of-the-art mixup regularization on CIFAR-10 & Fashion-MNIST. Our method...
more | pdf | html
None.
###### Tweets
StatsPapers: batchboost: regularization for stabilizing training with resistance to underfitting &amp; overfitting. https://t.co/hq6sEOuouo
arxivml: "batchboost: regularization for stabilizing training with resistance to underfitting &amp; overfitting", Maciej A． Czyz… https://t.co/Zns2ZLjPxA
arxiv_cs_cv_pr: batchboost: regularization for stabilizing training with resistance to underfitting &amp; overfitting. Maciej A. Czyzewski https://t.co/tAdif09ufj
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

##### #7. Simple and Effective Graph Autoencoders with One-Hop Linear Models
###### Guillaume Salha, Romain Hennequin, Michalis Vazirgiannis
Graph autoencoders (AE) and variational autoencoders (VAE) recently emerged as powerful node embedding methods, with promising performances on challenging tasks such as link prediction and node clustering. Graph AE, VAE and most of their extensions rely on graph convolutional networks (GCN) encoders to learn vector space representations of nodes. In this paper, we propose to replace the GCN encoder by a significantly simpler linear model w.r.t. the direct neighborhood (one-hop) adjacency matrix of the graph. For the two aforementioned tasks, we show that this approach consistently reaches competitive performances w.r.t. GCN-based models for numerous real-world graphs, including all benchmark datasets commonly used to evaluate graph AE and VAE. We question the relevance of repeatedly using these datasets to compare complex graph AE and VAE. We also emphasize the effectiveness of the proposed encoding scheme, that appears as a simpler and faster alternative to GCN encoders for many real-world applications.
more | pdf | html
None.
###### Tweets
BrundageBot: Simple and Effective Graph Autoencoders with One-Hop Linear Models. Guillaume Salha, Romain Hennequin, and Michalis Vazirgiannis https://t.co/gwZdV2H7OD
arxivml: "Simple and Effective Graph Autoencoders with One-Hop Linear Models", Guillaume Salha, Romain Hennequin, Michalis V… https://t.co/KwMKt2nKtD
StatsPapers: Simple and Effective Graph Autoencoders with One-Hop Linear Models. https://t.co/Flv14FtrEg
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

##### #8. Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data
###### Kyoung-Woon On, Eun-Sol Kim, Yu-Jung Heo, Byoung-Tak Zhang
Conventional sequential learning methods such as Recurrent Neural Networks (RNNs) focus on interactions between consecutive inputs, i.e. first-order Markovian dependency. However, most of sequential data, as seen with videos, have complex dependency structures that imply variable-length semantic flows and their compositions, and those are hard to be captured by conventional methods. Here, we propose Cut-Based Graph Learning Networks (CB-GLNs) for learning video data by discovering these complex structures of the video. The CB-GLNs represent video data as a graph, with nodes and edges corresponding to frames of the video and their dependencies respectively. The CB-GLNs find compositional dependencies of the data in multilevel graph forms via a parameterized kernel with graph-cut and a message passing framework. We evaluate the proposed method on the two different tasks for video understanding: Video theme classification (Youtube-8M dataset) and Video Question and Answering (TVQA dataset). The experimental results show that...
more | pdf | html
None.
###### Tweets
arxivml: "Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data", Kyoung-Woon On, E… https://t.co/fE9biROPCc
StatsPapers: Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data. https://t.co/MOiPKOokKC
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #9. Understanding the Limitations of Network Online Learning
Studies of networked phenomena, such as interactions in online social media, often rely on incomplete data, either because these phenomena are partially observed, or because the data is too large or expensive to acquire all at once. Analysis of incomplete data leads to skewed or misleading results. In this paper, we investigate limitations of learning to complete partially observed networks via node querying. Concretely, we study the following problem: given (i) a partially observed network, (ii) the ability to query nodes for their connections (e.g., by accessing an API), and (iii) a budget on the number of such queries, sequentially learn which nodes to query in order to maximally increase observability. We call this querying process Network Online Learning and present a family of algorithms called NOL*. These algorithms learn to choose which partially observed node to query next based on a parameterized model that is trained online through a process of exploration and exploitation. Extensive experiments on both synthetic and...
more | pdf | html
None.
###### Tweets
arxivml: "Understanding the Limitations of Network Online Learning", Timothy LaRock, Timothy Sakharov, Sahely Bhadra, Tina E… https://t.co/0GGEqjEsUC
StatsPapers: Understanding the Limitations of Network Online Learning. https://t.co/tZ3uypWayV
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unqiue Words: 0

##### #10. Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning
###### Wilhelm Kirchgässner, Oliver Wallscheid, Joachim Böcker
Monitoring the magnet temperature in permanent magnet synchronous motors (PMSMs) for automotive applications is a challenging task for several decades now, as signal injection or sensor-based methods still prove unfeasible in a commercial context. Overheating results in severe motor deterioration and is thus of high concern for the machine's control strategy and its design. Lack of precise temperature estimations leads to lesser device utilization and higher material cost. In this work, several machine learning (ML) models are empirically evaluated on their estimation accuracy for the task of predicting latent high-dynamic magnet temperature profiles. The range of selected algorithms covers as diverse approaches as possible with ordinary and weighted least squares, support vector regression, $k$-nearest neighbors, randomized trees and neural networks. Having test bench data available, it is shown that ML approaches relying merely on collected data meet the estimation performance of classical thermal models built on thermodynamic...
more | pdf | html
###### Tweets
arxivml: "Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning", Wilhe… https://t.co/IaYmkBLmXv
arxiv_cs_LG: Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning. Wilhelm Kirchgässner, Oliver Wallscheid, and Joachim Böcker https://t.co/CERh88LPOj
Memoirs: Data-Driven Permanent Magnet Temperature Estimation in Synchronous Motors with Supervised Machine Learning. https://t.co/i7m9wshKUy
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6429
Unqiue Words: 2498

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 256,574 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 256,574 papers.