Top 10 Arxiv Papers Today in Machine Learning


2.152 Mikeys
#1. Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach
Kuruparan Shanmugalingam, Nisal Chandrasekara, Calvin Hindle, Gihan Fernando, Chanaka Gunawardhana
Comprehensive IT support teams in large scale organizations require more man power for handling engagement and requests of employees from different channels on a 24*7 basis. Automated email technical queries help desk is proposed to have instant real-time quick solutions and email categorisation. Email topic modelling with various machine learning, deep-learning approaches are compared with different features for a scalable, generalised solution along with sure-shot static rules. Email's title, body, attachment, OCR text, and some feature engineered custom features are given as input elements. XGBoost cascaded hierarchical models, Bi-LSTM model with word embeddings perform well showing 77.3 overall accuracy For the real world corporate email data set. By introducing the thresholding techniques, the overall automation system architecture provides 85.6 percentage of accuracy for real world corporate emails. Combination of quick fixes, static rules, ML categorization as a low cost inference solution reduces 81 percentage of the human...
more | pdf | html
Figures
None.
Tweets
SciFi: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach. https://t.co/ff0PqXssoa
arxiv_cscv: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/blWsMIJa19
arxiv_cscl: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/QESCR4jW72
arxiv_cscl: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/QESCR4BwYA
SantchiWeb: RT @arxiv_cscv: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/blWsMIJa19
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

2.096 Mikeys
#2. Training Robust Deep Neural Networks via Adversarial Noise Propagation
Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, Junfeng He
Deep neural networks have been found vulnerable to noises like adversarial examples and corruption in practice. A number of adversarial defense methods have been developed, which indeed improve the model robustness towards adversarial examples in practice. However, only relying on training with the data mixed with noises, most of them still fail to defend the generalized types of noises. Motivated by the fact that hidden layers play a very important role in maintaining a robust model, this paper comes up with a simple yet powerful training algorithm named Adversarial Noise Propagation (ANP) that injects diversified noises into the hidden layers in a layer-wise manner. We show that ANP can be efficiently implemented by exploiting the nature of the popular backward-forward training style for deep models. To comprehensively understand the behaviors and contributions of hidden layers, we further explore the insights from hidden representation insensitivity and human vision perception alignment. Extensive experiments on MNIST,...
more | pdf | html
Figures
None.
Tweets
BrundageBot: Training Robust Deep Neural Networks via Adversarial Noise Propagation. Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, and Junfeng He https://t.co/B7U71vCwdK
arxiv_cscv: Training Robust Deep Neural Networks via Adversarial Noise Propagation https://t.co/ItVQ6WzIeo
arxiv_cs_cv_pr: Training Robust Deep Neural Networks via Adversarial Noise Propagation. Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, and Junfeng He https://t.co/RhCiYCYc8D
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unqiue Words: 0

2.062 Mikeys
#3. Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, Anil Jain
Deep neural networks (DNN) have achieved unprecedented success in numerous machine learning tasks in various domains. However, the existence of adversarial examples raises our concerns in adopting deep learning to safety-critical applications. As a result, we have witnessed increasing interests in studying attack and defense mechanisms for DNN models on different data types, such as images, graphs and text. Thus, it is necessary to provide a systematic and comprehensive overview of the main threats of attacks and the success of corresponding countermeasures. In this survey, we review the state of the art algorithms for generating adversarial examples and the countermeasures against adversarial examples, for three most popular data types, including images, graphs and text.
more | pdf | html
Figures
Tweets
arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
cynicalsecurity: H. Xu et al., “Adversarial Attacks and Defenses in Images, Graphs and Text: A Review” […the state of the art algorithms for generating adversarial examples and the countermeasures… for three most popular data types, including images, graphs and text…] https://t.co/3MPRioapUK
BrundageBot: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, and Anil Jain https://t.co/DCIDCEifVu
arxivml: "Adversarial Attacks and Defenses in Images, Graphs and Text: A Review", Han Xu, Yao Ma, Haochen Liu, Debayan Deb, … https://t.co/JjSrIwLlZG
arxiv_cs_LG: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, and Anil Jain https://t.co/8Tf5JQLjrc
StatsPapers: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/D6VYhUUfLA
jaialkdanel: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
udmrzn: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
SythonUK: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
puneethmishra: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
ovikrai: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
HanXu21003318: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 7
Total Words: 20285
Unqiue Words: 4480

2.053 Mikeys
#4. AdaFair: Cumulative Fairness Adaptive Boosting
Vasileios Iosifidis, Eirini Ntoutsi
The widespread use of ML-based decision making in domains with high societal impact such as recidivism, job hiring and loan credit has raised a lot of concerns regarding potential discrimination. In particular, in certain cases it has been observed that ML algorithms can provide different decisions based on sensitive attributes such as gender or race and therefore can lead to discrimination. Although, several fairness-aware ML approaches have been proposed, their focus has been largely on preserving the overall classification accuracy while improving fairness in predictions for both protected and non-protected groups (defined based on the sensitive attribute(s)). The overall accuracy however is not a good indicator of performance in case of class imbalance, as it is biased towards the majority class. As we will see in our experiments, many of the fairness-related datasets suffer from class imbalance and therefore, tackling fairness requires also tackling the imbalance problem. To this end, we propose AdaFair, a fairness-aware...
more | pdf | html
Figures
None.
Tweets
arxiv_cs_LG: AdaFair: Cumulative Fairness Adaptive Boosting. Vasileios Iosifidis and Eirini Ntoutsi https://t.co/vw0cJuTAe0
StatsPapers: AdaFair: Cumulative Fairness Adaptive Boosting. https://t.co/42AXvjfyx8
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.053 Mikeys
#5. Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation
Boris Chidlovskii
We address the problem of severe class imbalance in unsupervised domain adaptation, when the class spaces in source and target domains diverge considerably. Till recently, domain adaptation methods assumed the aligned class spaces, such that reducing distribution divergence makes the transfer between domains easier. Such an alignment assumption is invalidated in real world scenarios where some source classes are often under-represented or simply absent in the target domain. We revise the current approaches to class imbalance and propose a new one that uses latent codes in the adversarial domain adaptation framework. We show how the latent codes can be used to disentangle the silent structure of the target domain and to identify under-represented classes. We show how to learn the latent code reconstruction jointly with the domain invariant representation and use them to accurately estimate the target labels.
more | pdf | html
Figures
None.
Tweets
arxiv_cs_LG: Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation. Boris Chidlovskii https://t.co/t6DV1S119z
StatsPapers: Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation. https://t.co/2UJKb4P0vp
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unqiue Words: 0

2.053 Mikeys
#6. Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning
Jacob Deasy, Ari Ercole, Pietro Liò
Dynamic assessment of patient status (e.g. by an automated, continuously updated assessment of outcome) in the Intensive Care Unit (ICU) is of paramount importance for early alerting, decision support and resource allocation. Extraction and cleaning of expert-selected clinical variables discards information and protracts collaborative efforts to introduce machine learning in medicine. We present improved aggregation methods for a flexible deep learning architecture which learns a joint representation of patient chart, lab and output events. Our models outperform recent deep learning models for patient mortality classification using ICU timeseries, by embedding and aggregating all events with no pre-processing or variable selection. Our model achieves a strong performance of AUROC 0.87 at 48 hours on the MIMIC-III dataset while using 13,233 unique un-preprocessed variables in an interpretable manner via hourly softmax aggregation. This demonstrates how our method can be easily combined with existing electronic health record systems...
more | pdf | html
Figures
None.
Tweets
arxiv_cs_LG: Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning. Jacob Deasy, Ari Ercole, and Pietro Liò https://t.co/wGHIEQ8oSd
StatsPapers: Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning. https://t.co/HSaGP8Fob1
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

2.052 Mikeys
#7. Memory-Augmented Neural Networks for Machine Translation
Mark Collier, Joeran Beel
Memory-augmented neural networks (MANNs) have been shown to outperform other recurrent neural network architectures on a series of artificial sequence learning tasks, yet they have had limited application to real-world tasks. We evaluate direct application of Neural Turing Machines (NTM) and Differentiable Neural Computers (DNC) to machine translation. We further propose and evaluate two models which extend the attentional encoder-decoder with capabilities inspired by memory augmented neural networks. We evaluate our proposed models on IWSLT Vietnamese to English and ACL Romanian to English datasets. Our proposed models and the memory augmented neural networks perform similarly to the attentional encoder-decoder on the Vietnamese to English translation task while have a 0.3-1.9 lower BLEU score for the Romanian to English task. Interestingly, our analysis shows that despite being equipped with additional flexibility and being randomly initialized memory augmented neural networks learn an algorithm for machine translation almost...
more | pdf | html
Figures
None.
Tweets
BrundageBot: Memory-Augmented Neural Networks for Machine Translation. Mark Collier and Joeran Beel https://t.co/G63T4QYNbM
arxivml: "Memory-Augmented Neural Networks for Machine Translation", Mark Collier, Joeran Beel https://t.co/6QOKTq2liR
StatsPapers: Memory-Augmented Neural Networks for Machine Translation. https://t.co/M2YfcXcSJY
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDIytV
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
udmrzn: RT @arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
morioka: RT @arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unqiue Words: 0

2.04 Mikeys
#8. Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis
Minlong Peng, Qi Zhang, Xuanjing Huang
Cross-domain sentiment analysis is currently a hot topic in the research and engineering areas. One of the most popular frameworks in this field is the domain-invariant representation learning (DIRL) paradigm, which aims to learn a distribution-invariant feature representation across domains. However, in this work, we find out that applying DIRL may harm domain adaptation when the label distribution $\rm{P}(\rm{Y})$ changes across domains. To address this problem, we propose a modification to DIRL, obtaining a novel weighted domain-invariant representation learning (WDIRL) framework. We show that it is easy to transfer existing SOTA DIRL models to WDIRL. Empirical studies on extensive cross-domain sentiment analysis tasks verified our statements and showed the effectiveness of our proposed solution.
more | pdf | html
Figures
None.
Tweets
BrundageBot: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. Minlong Peng, Qi Zhang, and Xuanjing Huang https://t.co/NGa576uEXq
arxiv_in_review: #EMNLP2019 Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. (arXiv:1909.08167v1 [cs\.LG]) https://t.co/LNcpu3PvVI
arxivml: "Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis", Minlong Peng, Qi Zhang, Xua… https://t.co/TxqD2CHD91
StatsPapers: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. https://t.co/OVRkEs7tmV
arxiv_cscl: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis https://t.co/nhEwHmGl32
arxiv_cscl: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis https://t.co/nhEwHmGl32
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6305
Unqiue Words: 1971

2.036 Mikeys
#9. A Distributed Fair Machine Learning Framework with Private Demographic Data Protection
Hui Hu, Yijun Liu, Zhen Wang, Chao Lan
Fair machine learning has become a significant research topic with broad societal impact. However, most fair learning methods require direct access to personal demographic data, which is increasingly restricted to use for protecting user privacy (e.g. by the EU General Data Protection Regulation). In this paper, we propose a distributed fair learning framework for protecting the privacy of demographic data. We assume this data is privately held by a third party, which can communicate with the data center (responsible for model development) without revealing the demographic information. We propose a principled approach to design fair learning methods under this framework, exemplify four methods and show they consistently outperform their existing counterparts in both fairness and accuracy across three real-world data sets. We theoretically analyze the framework, and prove it can learn models with high fairness or high accuracy, with their trade-offs balanced by a threshold variable.
more | pdf | html
Figures
Tweets
arxiv_org: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. https://t.co/xdi7oNklUA https://t.co/ljWsJRNzuv
BrundageBot: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. Hui Hu, Yijun Liu, Zhen Wang, and Chao Lan https://t.co/PZQS0qjeww
arxivml: "A Distributed Fair Machine Learning Framework with Private Demographic Data Protection", Hui Hu, Yijun Liu, Zhen W… https://t.co/Y2lCognm0z
arxiv_cs_LG: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. Hui Hu, Yijun Liu, Zhen Wang, and Chao Lan https://t.co/hGsQBxfisi
Github
Repository: Distributed-Private-Fair-Learning
User: HuiHu1
Language: MATLAB
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7169
Unqiue Words: 2305

2.032 Mikeys
#10. Automobile Theft Detection by Clustering Owner Driver Data
Yong Goo Kang, Kyung Ho Park, Huy Kang Kim
As automobiles become intelligent, automobile theft methods are evolving intelligently. Therefore automobile theft detection has become a major research challenge. Data-mining, biometrics, and additional authentication methods have been proposed to address automobile theft, in previous studies. Among these methods, data-mining can be used to analyze driving characteristics and identify a driver comprehensively. However, it requires a labeled driving dataset to achieve high accuracy. It is impractical to use the actual automobile theft detection system because real theft driving data cannot be collected in advance. Hence, we propose a method to detect an automobile theft attempt using only owner driving data. We cluster the key features of the owner driving data using the k-means algorithm. After reconstructing the driving data into one of these clusters, theft is detected using an error from the original driving data. To validate the proposed models, we tested our actual driving data and obtained 99% accuracy from the best model....
more | pdf | html
Figures
None.
Tweets
StatsPapers: Automobile Theft Detection by Clustering Owner Driver Data. https://t.co/7RQYqx9n6H
udmrzn: RT @StatsPapers: Automobile Theft Detection by Clustering Owner Driver Data. https://t.co/7RQYqx9n6H
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unqiue Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 192,929 papers.

Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Categories
All
Astrophysics
Cosmology and Nongalactic Astrophysics
Earth and Planetary Astrophysics
Astrophysics of Galaxies
High Energy Astrophysical Phenomena
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Condensed Matter
Disordered Systems and Neural Networks
Mesoscale and Nanoscale Physics
Materials Science
Other Condensed Matter
Quantum Gases
Soft Condensed Matter
Statistical Mechanics
Strongly Correlated Electrons
Superconductivity
Computer Science
Artificial Intelligence
Hardware Architecture
Computational Complexity
Computational Engineering, Finance, and Science
Computational Geometry
Computation and Language
Cryptography and Security
Computer Vision and Pattern Recognition
Computers and Society
Databases
Distributed, Parallel, and Cluster Computing
Digital Libraries
Discrete Mathematics
Data Structures and Algorithms
Emerging Technologies
Formal Languages and Automata Theory
General Literature
Graphics
Computer Science and Game Theory
Human-Computer Interaction
Information Retrieval
Information Theory
Machine Learning
Logic in Computer Science
Multiagent Systems
Multimedia
Mathematical Software
Numerical Analysis
Neural and Evolutionary Computing
Networking and Internet Architecture
Other Computer Science
Operating Systems
Performance
Programming Languages
Robotics
Symbolic Computation
Sound
Software Engineering
Social and Information Networks
Systems and Control
Economics
Econometrics
General Economics
Theoretical Economics
Electrical Engineering and Systems Science
Audio and Speech Processing
Image and Video Processing
Signal Processing
General Relativity and Quantum Cosmology
General Relativity and Quantum Cosmology
High Energy Physics - Experiment
High Energy Physics - Experiment
High Energy Physics - Lattice
High Energy Physics - Lattice
High Energy Physics - Phenomenology
High Energy Physics - Phenomenology
High Energy Physics - Theory
High Energy Physics - Theory
Mathematics
Commutative Algebra
Algebraic Geometry
Analysis of PDEs
Algebraic Topology
Classical Analysis and ODEs
Combinatorics
Category Theory
Complex Variables
Differential Geometry
Dynamical Systems
Functional Analysis
General Mathematics
General Topology
Group Theory
Geometric Topology
History and Overview
Information Theory
K-Theory and Homology
Logic
Metric Geometry
Mathematical Physics
Numerical Analysis
Number Theory
Operator Algebras
Optimization and Control
Probability
Quantum Algebra
Rings and Algebras
Representation Theory
Symplectic Geometry
Spectral Theory
Statistics Theory
Mathematical Physics
Mathematical Physics
Nonlinear Sciences
Adaptation and Self-Organizing Systems
Chaotic Dynamics
Cellular Automata and Lattice Gases
Pattern Formation and Solitons
Exactly Solvable and Integrable Systems
Nuclear Experiment
Nuclear Experiment
Nuclear Theory
Nuclear Theory
Physics
Accelerator Physics
Atmospheric and Oceanic Physics
Applied Physics
Atomic and Molecular Clusters
Atomic Physics
Biological Physics
Chemical Physics
Classical Physics
Computational Physics
Data Analysis, Statistics and Probability
Physics Education
Fluid Dynamics
General Physics
Geophysics
History and Philosophy of Physics
Instrumentation and Detectors
Medical Physics
Optics
Plasma Physics
Popular Physics
Physics and Society
Space Physics
Quantitative Biology
Biomolecules
Cell Behavior
Genomics
Molecular Networks
Neurons and Cognition
Other Quantitative Biology
Populations and Evolution
Quantitative Methods
Subcellular Processes
Tissues and Organs
Quantitative Finance
Computational Finance
Economics
General Finance
Mathematical Finance
Portfolio Management
Pricing of Securities
Risk Management
Statistical Finance
Trading and Market Microstructure
Quantum Physics
Quantum Physics
Statistics
Applications
Computation
Methodology
Machine Learning
Other Statistics
Statistics Theory
Feedback
Online
Stats
Tracking 192,929 papers.