### Top 10 Arxiv Papers Today in Machine Learning

##### #1. Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach
###### Kuruparan Shanmugalingam, Nisal Chandrasekara, Calvin Hindle, Gihan Fernando, Chanaka Gunawardhana
Comprehensive IT support teams in large scale organizations require more man power for handling engagement and requests of employees from different channels on a 24*7 basis. An automated email help desk for technical queries is proposed to provide instant, real-time solutions and email categorisation. Email topic modelling with various machine learning and deep-learning approaches is compared using different features for a scalable, generalised solution along with sure-shot static rules. The email's title, body, attachment, OCR text, and some feature engineered custom features are given as input elements. XGBoost cascaded hierarchical models and a Bi-LSTM model with word embeddings perform well, showing 77.3 overall accuracy for the real world corporate email data set. By introducing thresholding techniques, the overall automation system architecture provides 85.6 percent accuracy for real world corporate emails. Combination of quick fixes, static rules, ML categorization as a low cost inference solution reduces 81 percentage of the human...
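The abstract mentions thresholding but, in the truncated text shown here, does not say how it is applied. One common reading is confidence thresholding: a predicted category is acted on automatically only when the classifier is confident enough, and everything else goes to a human agent. The sketch below illustrates that reading only; the function name `route_email`, the category labels, and the default threshold of 0.8 are hypothetical and not taken from the paper.

```python
from typing import Dict, Tuple


def route_email(class_probs: Dict[str, float], threshold: float = 0.8) -> Tuple[str, str]:
    """Decide whether an email is handled automatically or by a human.

    class_probs maps each category label to the classifier's probability.
    The threshold value is an illustrative assumption, not a figure from the paper.
    """
    # Pick the most probable category and its confidence.
    label, confidence = max(class_probs.items(), key=lambda kv: kv[1])
    # Automate only when the classifier is confident enough.
    if confidence >= threshold:
        return "automate", label
    return "human", label


if __name__ == "__main__":
    print(route_email({"password_reset": 0.93, "vpn_issue": 0.05, "other": 0.02}))
    print(route_email({"password_reset": 0.45, "vpn_issue": 0.40, "other": 0.15}))
```

Raising the threshold would typically trade coverage (fewer automated emails) for accuracy on the emails that are automated.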
###### Tweets
SciFi: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach. https://t.co/ff0PqXssoa
arxiv_cscv: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/blWsMIJa19
arxiv_cscl: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/QESCR4jW72
arxiv_cscl: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/QESCR4BwYA
SantchiWeb: RT @arxiv_cscv: Corporate IT-support Help-Desk Process Hybrid-Automation Solution with Machine Learning Approach https://t.co/blWsMIJa19
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unique Words: 0

##### #2. Training Robust Deep Neural Networks via Adversarial Noise Propagation
###### Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, Junfeng He
Deep neural networks have been found vulnerable to noises like adversarial examples and corruption in practice. A number of adversarial defense methods have been developed, which indeed improve the model robustness towards adversarial examples in practice. However, only relying on training with the data mixed with noises, most of them still fail to defend the generalized types of noises. Motivated by the fact that hidden layers play a very important role in maintaining a robust model, this paper comes up with a simple yet powerful training algorithm named Adversarial Noise Propagation (ANP) that injects diversified noises into the hidden layers in a layer-wise manner. We show that ANP can be efficiently implemented by exploiting the nature of the popular backward-forward training style for deep models. To comprehensively understand the behaviors and contributions of hidden layers, we further explore the insights from hidden representation insensitivity and human vision perception alignment. Extensive experiments on MNIST,...
###### Tweets
BrundageBot: Training Robust Deep Neural Networks via Adversarial Noise Propagation. Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, and Junfeng He https://t.co/B7U71vCwdK
arxiv_cscv: Training Robust Deep Neural Networks via Adversarial Noise Propagation https://t.co/ItVQ6WzIeo
arxiv_cs_cv_pr: Training Robust Deep Neural Networks via Adversarial Noise Propagation. Aishan Liu, Xianglong Liu, Chongzhi Zhang, Hang Yu, Qiang Liu, and Junfeng He https://t.co/RhCiYCYc8D
###### Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unique Words: 0

##### #3. Adversarial Attacks and Defenses in Images, Graphs and Text: A Review
###### Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, Anil Jain
Deep neural networks (DNN) have achieved unprecedented success in numerous machine learning tasks in various domains. However, the existence of adversarial examples raises our concerns in adopting deep learning to safety-critical applications. As a result, we have witnessed increasing interests in studying attack and defense mechanisms for DNN models on different data types, such as images, graphs and text. Thus, it is necessary to provide a systematic and comprehensive overview of the main threats of attacks and the success of corresponding countermeasures. In this survey, we review the state of the art algorithms for generating adversarial examples and the countermeasures against adversarial examples, for three most popular data types, including images, graphs and text.
###### Tweets
arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
cynicalsecurity: H. Xu et al., “Adversarial Attacks and Defenses in Images, Graphs and Text: A Review” […the state of the art algorithms for generating adversarial examples and the countermeasures… for three most popular data types, including images, graphs and text…] https://t.co/3MPRioapUK
BrundageBot: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, and Anil Jain https://t.co/DCIDCEifVu
arxivml: "Adversarial Attacks and Defenses in Images, Graphs and Text: A Review", Han Xu, Yao Ma, Haochen Liu, Debayan Deb, … https://t.co/JjSrIwLlZG
arxiv_cs_LG: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. Han Xu, Yao Ma, Haochen Liu, Debayan Deb, Hui Liu, Jiliang Tang, and Anil Jain https://t.co/8Tf5JQLjrc
StatsPapers: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/D6VYhUUfLA
jaialkdanel: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
udmrzn: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
SythonUK: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
puneethmishra: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
ovikrai: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
HanXu21003318: RT @arxiv_org: Adversarial Attacks and Defenses in Images, Graphs and Text: A Review. https://t.co/PjWKCcpaCU https://t.co/sZUhjhcVRU
###### Other stats
Sample Sizes : None.
Authors: 7
Total Words: 20285
Unique Words: 4480

##### #4. AdaFair: Cumulative Fairness Adaptive Boosting
###### Vasileios Iosifidis, Eirini Ntoutsi
The widespread use of ML-based decision making in domains with high societal impact such as recidivism, job hiring and loan credit has raised a lot of concerns regarding potential discrimination. In particular, in certain cases it has been observed that ML algorithms can provide different decisions based on sensitive attributes such as gender or race and therefore can lead to discrimination. Although several fairness-aware ML approaches have been proposed, their focus has been largely on preserving the overall classification accuracy while improving fairness in predictions for both protected and non-protected groups (defined based on the sensitive attribute(s)). The overall accuracy, however, is not a good indicator of performance in case of class imbalance, as it is biased towards the majority class. As we will see in our experiments, many of the fairness-related datasets suffer from class imbalance and therefore, tackling fairness requires also tackling the imbalance problem. To this end, we propose AdaFair, a fairness-aware...
###### Tweets
arxiv_cs_LG: AdaFair: Cumulative Fairness Adaptive Boosting. Vasileios Iosifidis and Eirini Ntoutsi https://t.co/vw0cJuTAe0
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

##### #5. Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation
###### Boris Chidlovskii
We address the problem of severe class imbalance in unsupervised domain adaptation, when the class spaces in source and target domains diverge considerably. Till recently, domain adaptation methods assumed the aligned class spaces, such that reducing distribution divergence makes the transfer between domains easier. Such an alignment assumption is invalidated in real world scenarios where some source classes are often under-represented or simply absent in the target domain. We revise the current approaches to class imbalance and propose a new one that uses latent codes in the adversarial domain adaptation framework. We show how the latent codes can be used to disentangle the silent structure of the target domain and to identify under-represented classes. We show how to learn the latent code reconstruction jointly with the domain invariant representation and use them to accurately estimate the target labels.
###### Tweets
arxiv_cs_LG: Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation. Boris Chidlovskii https://t.co/t6DV1S119z
StatsPapers: Using Latent Codes for Class Imbalance Problem in Unsupervised Domain Adaptation. https://t.co/2UJKb4P0vp
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unique Words: 0

##### #6. Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning
###### Jacob Deasy, Ari Ercole, Pietro Liò
Dynamic assessment of patient status (e.g. by an automated, continuously updated assessment of outcome) in the Intensive Care Unit (ICU) is of paramount importance for early alerting, decision support and resource allocation. Extraction and cleaning of expert-selected clinical variables discards information and protracts collaborative efforts to introduce machine learning in medicine. We present improved aggregation methods for a flexible deep learning architecture which learns a joint representation of patient chart, lab and output events. Our models outperform recent deep learning models for patient mortality classification using ICU timeseries, by embedding and aggregating all events with no pre-processing or variable selection. Our model achieves a strong performance of AUROC 0.87 at 48 hours on the MIMIC-III dataset while using 13,233 unique un-preprocessed variables in an interpretable manner via hourly softmax aggregation. This demonstrates how our method can be easily combined with existing electronic health record systems...
###### Tweets
arxiv_cs_LG: Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning. Jacob Deasy, Ari Ercole, and Pietro Liò https://t.co/wGHIEQ8oSd
StatsPapers: Impact of novel aggregation methods for flexible, time-sensitive EHR prediction without variable selection or cleaning. https://t.co/HSaGP8Fob1
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

##### #7. Memory-Augmented Neural Networks for Machine Translation
###### Mark Collier, Joeran Beel
Memory-augmented neural networks (MANNs) have been shown to outperform other recurrent neural network architectures on a series of artificial sequence learning tasks, yet they have had limited application to real-world tasks. We evaluate direct application of Neural Turing Machines (NTM) and Differentiable Neural Computers (DNC) to machine translation. We further propose and evaluate two models which extend the attentional encoder-decoder with capabilities inspired by memory augmented neural networks. We evaluate our proposed models on IWSLT Vietnamese to English and ACL Romanian to English datasets. Our proposed models and the memory augmented neural networks perform similarly to the attentional encoder-decoder on the Vietnamese to English translation task while having a 0.3-1.9 lower BLEU score for the Romanian to English task. Interestingly, our analysis shows that despite being equipped with additional flexibility and being randomly initialized, memory augmented neural networks learn an algorithm for machine translation almost...
###### Tweets
BrundageBot: Memory-Augmented Neural Networks for Machine Translation. Mark Collier and Joeran Beel https://t.co/G63T4QYNbM
arxivml: "Memory-Augmented Neural Networks for Machine Translation", Mark Collier, Joeran Beel https://t.co/6QOKTq2liR
StatsPapers: Memory-Augmented Neural Networks for Machine Translation. https://t.co/M2YfcXcSJY
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDIytV
arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
udmrzn: RT @arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
morioka: RT @arxiv_cscl: Memory-Augmented Neural Networks for Machine Translation https://t.co/NqJzXDqXCn
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

##### #8. Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis
###### Minlong Peng, Qi Zhang, Xuanjing Huang
Cross-domain sentiment analysis is currently a hot topic in the research and engineering areas. One of the most popular frameworks in this field is the domain-invariant representation learning (DIRL) paradigm, which aims to learn a distribution-invariant feature representation across domains. However, in this work, we find out that applying DIRL may harm domain adaptation when the label distribution $\rm{P}(\rm{Y})$ changes across domains. To address this problem, we propose a modification to DIRL, obtaining a novel weighted domain-invariant representation learning (WDIRL) framework. We show that it is easy to transfer existing SOTA DIRL models to WDIRL. Empirical studies on extensive cross-domain sentiment analysis tasks verified our statements and showed the effectiveness of our proposed solution.
###### Tweets
BrundageBot: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. Minlong Peng, Qi Zhang, and Xuanjing Huang https://t.co/NGa576uEXq
arxiv_in_review: #EMNLP2019 Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. (arXiv:1909.08167v1 [cs\.LG]) https://t.co/LNcpu3PvVI
arxivml: "Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis", Minlong Peng, Qi Zhang, Xua… https://t.co/TxqD2CHD91
StatsPapers: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis. https://t.co/OVRkEs7tmV
arxiv_cscl: Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis https://t.co/nhEwHmGl32
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6305
Unique Words: 1971

##### #9. A Distributed Fair Machine Learning Framework with Private Demographic Data Protection
###### Hui Hu, Yijun Liu, Zhen Wang, Chao Lan
Fair machine learning has become a significant research topic with broad societal impact. However, most fair learning methods require direct access to personal demographic data, which is increasingly restricted to use for protecting user privacy (e.g. by the EU General Data Protection Regulation). In this paper, we propose a distributed fair learning framework for protecting the privacy of demographic data. We assume this data is privately held by a third party, which can communicate with the data center (responsible for model development) without revealing the demographic information. We propose a principled approach to design fair learning methods under this framework, exemplify four methods and show they consistently outperform their existing counterparts in both fairness and accuracy across three real-world data sets. We theoretically analyze the framework, and prove it can learn models with high fairness or high accuracy, with their trade-offs balanced by a threshold variable.
###### Tweets
arxiv_org: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. https://t.co/xdi7oNklUA https://t.co/ljWsJRNzuv
BrundageBot: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. Hui Hu, Yijun Liu, Zhen Wang, and Chao Lan https://t.co/PZQS0qjeww
arxivml: "A Distributed Fair Machine Learning Framework with Private Demographic Data Protection", Hui Hu, Yijun Liu, Zhen W… https://t.co/Y2lCognm0z
arxiv_cs_LG: A Distributed Fair Machine Learning Framework with Private Demographic Data Protection. Hui Hu, Yijun Liu, Zhen Wang, and Chao Lan https://t.co/hGsQBxfisi
###### Github
Repository: Distributed-Private-Fair-Learning
User: HuiHu1
Language: MATLAB
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7169
Unique Words: 2305

##### #10. Automobile Theft Detection by Clustering Owner Driver Data
###### Yong Goo Kang, Kyung Ho Park, Huy Kang Kim
As automobiles become intelligent, automobile theft methods are evolving intelligently. Therefore automobile theft detection has become a major research challenge. Previous studies have proposed data mining, biometrics, and additional authentication methods to address automobile theft. Among these methods, data mining can be used to analyze driving characteristics and identify a driver comprehensively. However, it requires a labeled driving dataset to achieve high accuracy. This is impractical for an actual automobile theft detection system because real theft driving data cannot be collected in advance. Hence, we propose a method to detect an automobile theft attempt using only owner driving data. We cluster the key features of the owner driving data using the k-means algorithm. After reconstructing the driving data into one of these clusters, theft is detected using an error from the original driving data. To validate the proposed models, we tested our actual driving data and obtained 99% accuracy from the best model....
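The idea in this abstract (cluster owner-only data with k-means, map new data to a cluster, and flag a large reconstruction error) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the feature vectors, the use of Euclidean distance to the nearest centroid as the "error", and the quantile-based threshold are all assumptions, and every function name is hypothetical.

```python
import math
import random
from typing import List, Sequence

Point = Sequence[float]


def _dist(a: Point, b: Point) -> float:
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def fit_kmeans(data: List[Point], k: int, iters: int = 50, seed: int = 0) -> List[List[float]]:
    """Plain k-means on owner driving features; returns the k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(list(data), k)]
    for _ in range(iters):
        clusters: List[List[Point]] = [[] for _ in range(k)]
        for p in data:
            nearest = min(range(k), key=lambda i: _dist(p, centroids[i]))
            clusters[nearest].append(p)
        for i, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def reconstruction_error(p: Point, centroids: List[List[float]]) -> float:
    """Error after 'reconstructing' p as its nearest centroid (an assumption)."""
    return min(_dist(p, c) for c in centroids)


def fit_threshold(data: List[Point], centroids: List[List[float]], quantile: float = 0.99) -> float:
    """Choose the threshold from owner data only (assumed: a high error quantile)."""
    errors = sorted(reconstruction_error(p, centroids) for p in data)
    return errors[min(len(errors) - 1, int(quantile * len(errors)))]


def is_theft(p: Point, centroids: List[List[float]], threshold: float) -> bool:
    """Flag driving data whose error exceeds what was seen for the owner."""
    return reconstruction_error(p, centroids) > threshold
```

Because only owner data is needed to fit both the centroids and the threshold, no labeled theft data is required, which is the constraint the abstract highlights.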
###### Tweets
StatsPapers: Automobile Theft Detection by Clustering Owner Driver Data. https://t.co/7RQYqx9n6H
udmrzn: RT @StatsPapers: Automobile Theft Detection by Clustering Owner Driver Data. https://t.co/7RQYqx9n6H
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

Tracking 192,929 papers.
