Top 10 arXiv Papers Today in Computation and Language


2.104 Mikeys
#1. A Stable Variational Autoencoder for Text Modelling
Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, Rui Mao
Variational Autoencoder (VAE) is a powerful method for learning representations of high-dimensional data. However, VAEs can suffer from an issue known as latent variable collapse (or KL loss vanishing), where the posterior collapses to the prior and the model will ignore the latent codes in generative tasks. Such an issue is particularly prevalent when employing VAE-RNN architectures for text modelling (Bowman et al., 2016). In this paper, we present a simple architecture called holistic regularisation VAE (HR-VAE), which can effectively avoid latent variable collapse. Compared to existing VAE-RNN architectures, we show that our model can achieve a much more stable training process and can generate text with significantly better quality.
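For readers unfamiliar with the term, latent variable collapse can be read off the standard VAE training objective. The block below is the textbook evidence lower bound, not the HR-VAE objective, which the abstract does not spell out. The symbols $x$ (input text), $z$ (latent code), $\theta$ (decoder parameters), and $\phi$ (encoder parameters) follow common usage and are assumptions here.

```latex
\mathcal{L}(\theta, \phi; x)
  = \mathbb{E}_{q_\phi(z \mid x)}\left[\log p_\theta(x \mid z)\right]
  - \mathrm{KL}\left(q_\phi(z \mid x) \,\|\, p(z)\right)
```

Collapse corresponds to the KL term being driven to zero, so that $q_\phi(z \mid x) = p(z)$ for every $x$. The latent code then carries no information about the input, and a sufficiently strong decoder can model the text while ignoring $z$.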
Figures
Tweets
BrundageBot: A Stable Variational Autoencoder for Text Modelling. Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, and Rui Mao https://t.co/y3k1WwayHM
arxivml: "A Stable Variational Autoencoder for Text Modelling", Ruizhe Li, Xiao Li, Chenghua Lin, Matthew Collinson, Rui Mao https://t.co/xSnXZ9cmip
fpocket: "A Stable Variational Autoencoder for Text Modelling. (arXiv:1911.05343v1 [https://t.co/fkv1Ct3qsg])" https://t.co/QMjDRRL3EN
arxiv_cscl: A Stable Variational Autoencoder for Text Modelling https://t.co/3xUkMcOGmR
ryo_masumura: RT @arxiv_cscl: A Stable Variational Autoencoder for Text Modelling https://t.co/3xUkMcOGmR
Github

Code for the paper "A Stable Variational Autoencoder for Text Modelling"

Repository: HR-VAE
User: ruizheliUOA
Language: Python
Stargazers: 4
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 5
Total Words: 3397
Unique Words: 1206

2.064 Mikeys
#2. Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling
Timothee Mickus, Denis Paperno, Mathieu Constant
Defining words in a textual context is a useful task both for practical purposes and for gaining insight into distributed word representations. Building on the distributional hypothesis, we argue here that the most natural formalization of definition modeling is to treat it as a sequence-to-sequence task, rather than a word-to-sequence task: given an input sequence with a highlighted word, generate a contextually appropriate definition for it. We implement this approach in a Transformer-based sequence-to-sequence model. Our proposal allows contextualization and definition generation to be trained in an end-to-end fashion, which is a conceptual improvement over earlier work. We achieve state-of-the-art results both in contextual and non-contextual definition modeling.
Figures
None.
Tweets
BrundageBot: Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling. Timothee Mickus, Denis Paperno, and Mathieu Constant https://t.co/DHTg2GSbCC
fpocket: "Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling. (arXiv:1911.05715v1 [https://t.co/fkv1Ct3qsg])" https://t.co/Haypf7ZLvA
arxiv_cscl: Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling https://t.co/G2BJyxSARd
arxiv_cscl: Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling https://t.co/G2BJyxAZsD
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 0
Unique Words: 0

2.058 Mikeys
#3. Robustness to Capitalization Errors in Named Entity Recognition
Sravan Bodapati, Hyokun Yun, Yaser Al-Onaizan
Robustness to capitalization errors is a highly desirable characteristic of named entity recognizers, yet we find standard models for the task are surprisingly brittle to such noise. Existing methods to improve robustness to the noise completely discard given orthographic information, which significantly degrades their performance on well-formed text. We propose a simple alternative approach based on data augmentation, which allows the model to learn to utilize or ignore orthographic information depending on its usefulness in the context. It achieves competitive robustness to capitalization errors while making negligible compromise to its performance on well-formed text and significantly improving generalization power on noisy user-generated text. Our experiments clearly and consistently validate our claim across different types of machine learning models, languages, and dataset sizes.
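The data-augmentation idea in this abstract can be illustrated with a minimal sketch. The abstract does not say which casing variants the authors generate, so the choice below (original, all-lowercase, all-uppercase) and the function name `augment_capitalization` are assumptions made for illustration only.

```python
from typing import List, Tuple

# A sentence is a pair of parallel lists: tokens and their NER tags.
Sentence = Tuple[List[str], List[str]]


def augment_capitalization(sentence: Sentence) -> List[Sentence]:
    """Return casing variants of one tagged sentence.

    Assumption: the variants are the original, all-lowercase, and
    all-uppercase forms. The tags are copied unchanged, so the model sees
    the same labels with and without reliable capitalization cues.
    """
    tokens, tags = sentence
    variants = [
        list(tokens),
        [t.lower() for t in tokens],
        [t.upper() for t in tokens],
    ]
    seen = set()
    augmented: List[Sentence] = []
    for variant in variants:
        key = tuple(variant)
        if key in seen:  # skip variants identical to one already kept
            continue
        seen.add(key)
        augmented.append((variant, list(tags)))
    return augmented


example = (["Alice", "visited", "Paris"], ["B-PER", "O", "B-LOC"])
for tokens, tags in augment_capitalization(example):
    print(tokens, tags)
```

Training on the union of these variants is one way to let a tagger learn when casing is informative, rather than discarding casing altogether.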
Figures
None.
Tweets
BrundageBot: Robustness to Capitalization Errors in Named Entity Recognition. Sravan Bodapati, Hyokun Yun, and Yaser Al-Onaizan https://t.co/ct9TdrJDTr
arxivml: "Robustness to Capitalization Errors in Named Entity Recognition", Sravan Bodapati, Hyokun Yun, Yaser Al-Onaizan https://t.co/up7SuTV5JU
arxiv_cscl: Robustness to Capitalization Errors in Named Entity Recognition https://t.co/FH358FVEGO
RexDouglass: RT @arxiv_cscl: Robustness to Capitalization Errors in Named Entity Recognition https://t.co/FH358FVEGO
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 0
Unique Words: 0

2.057 Mikeys
#4. Adapting and evaluating a deep learning language model for clinical why-question answering
Andrew Wen, Mohamed Y. Elwazir, Sungrim Moon, Jungwei Fan
Objectives: To adapt and evaluate a deep learning language model for answering why-questions based on patient-specific clinical text. Materials and Methods: Bidirectional encoder representations from transformers (BERT) models were trained with varying data sources to perform SQuAD 2.0 style why-question answering (why-QA) on clinical notes. The evaluation focused on: 1) comparing the merits from different training data, 2) error analysis. Results: The best model achieved an accuracy of 0.707 (or 0.760 by partial match). Training toward customization for the clinical language increased accuracy by 6%. Discussion: The error analysis suggested that the model did not really perform deep reasoning and that clinical why-QA might warrant more sophisticated solutions. Conclusion: The BERT model achieved moderate accuracy in clinical why-QA and should benefit from the rapidly evolving technology. Despite the identified limitations, it could serve as a competent proxy for question-driven clinical information extraction.
Figures
None.
Tweets
BrundageBot: Adapting and evaluating a deep learning language model for clinical why-question answering. Andrew Wen, Mohamed Y. Elwazir, Sungrim Moon, and Jungwei Fan https://t.co/3Q1Wibn9Wu
arxiv_cscl: Adapting and evaluating a deep learning language model for clinical why-question answering https://t.co/4XkysKRQk3
Posiwise: RT @arxiv_cscl: Adapting and evaluating a deep learning language model for clinical why-question answering https://t.co/4XkysKRQk3
akdm_bot: RT @arxiv_cscl: Adapting and evaluating a deep learning language model for clinical why-question answering https://t.co/4XkysKRQk3
muktabh: RT @arxiv_cscl: Adapting and evaluating a deep learning language model for clinical why-question answering https://t.co/4XkysKRQk3
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 0
Unique Words: 0

2.051 Mikeys
#5. Improving Robustness of Task Oriented Dialog Systems
Arash Einolghozati, Sonal Gupta, Mrinal Mohit, Rushin Shah
Task oriented language understanding in dialog systems is often modeled using intents (task of a query) and slots (parameters for that task). Intent detection and slot tagging are, in turn, modeled using sentence classification and word tagging techniques respectively. Similar to adversarial attack problems with computer vision models discussed in existing literature, these intent-slot tagging models are often over-sensitive to small variations in input, predicting different and often incorrect labels when small changes are made to a query, thus reducing their accuracy and reliability. However, evaluating a model's robustness to these changes is harder for language since words are discrete and an automated change (e.g. adding 'noise') to a query sometimes changes the meaning and thus labels of a query. In this paper, we first describe how to create an adversarial test set to measure the robustness of these models. Furthermore, we introduce and adapt adversarial training methods as well as data augmentation using back-translation...
Figures
None.
Tweets
BrundageBot: Improving Robustness of Task Oriented Dialog Systems. Arash Einolghozati, Sonal Gupta, Mrinal Mohit, and Rushin Shah https://t.co/YsEdGfCBLR
arxivml: "Improving Robustness of Task Oriented Dialog Systems", Arash Einolghozati, Sonal Gupta, Mrinal Mohit, Rushin Shah https://t.co/yI2Vzcj1DC
arxiv_cscl: Improving Robustness of Task Oriented Dialog Systems https://t.co/xS7AMuQc5P
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 6230
Unique Words: 1975

2.045 Mikeys
#6. How to Evaluate Word Representations of Informal Domain?
Yekun Chai, Naomi Saphra, Adam Lopez
Diverse word representations have surged in most state-of-the-art natural language processing (NLP) applications. Nevertheless, how to efficiently evaluate such word embeddings in informal domains such as Twitter or forums remains an ongoing challenge due to the lack of sufficient evaluation datasets. We derived a large list of variant spelling pairs from UrbanDictionary with the automatic approaches of weakly-supervised pattern-based bootstrapping and self-training linear-chain conditional random field (CRF). With these extracted relation pairs we promote the odds of eliding the text normalization procedure of traditional NLP pipelines and directly adopting representations of non-standard words in the informal domain. Our code is available.
Figures
None.
Tweets
BrundageBot: How to Evaluate Word Representations of Informal Domain?. Yekun Chai, Naomi Saphra, and Adam Lopez https://t.co/i8Iubtesoc
arxivml: "How to Evaluate Word Representations of Informal Domain?", Yekun Chai, Naomi Saphra, Adam Lopez https://t.co/IdSQVbSaeZ
fpocket: "How to Evaluate Word Representations of Informal Domain?. (arXiv:1911.04669v1 [https://t.co/fkv1Ct3qsg])" https://t.co/xo4cimlUcC
arxiv_cs_LG: How to Evaluate Word Representations of Informal Domain?. Yekun Chai, Naomi Saphra, and Adam Lopez https://t.co/xMeEcE8SX8
arxiv_cscl: How to Evaluate Word Representations of Informal Domain? https://t.co/Q9M3c6XQxB
arxiv_cscl: How to Evaluate Word Representations of Informal Domain? https://t.co/Q9M3c7frp9
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 0
Unique Words: 0

2.035 Mikeys
#7. Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier
Kévin Cousot, Mehdi Mirzapour, Waleed Ragheb
This study focuses on the prediction of six missing semantic relations (such as is_a and has_part) between two given nodes in RezoJDM, a French lexical-semantic network. The output of this prediction is a set of pairs in which the first entries are semantic relations and the second entries are the probabilities of existence of such relations. Due to the statement of the problem, we choose the random forest (RF) predictor classifier approach to tackle this problem. We take for granted the existing semantic relations, for training/test dataset, gathered and validated by crowdsourcing. We describe how all of the mentioned ideas can be followed after using the node2vec approach in the feature extraction phase. We show how this approach can lead to acceptable results.
Figures
Tweets
arxivml: "Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier", Kévin Cousot… https://t.co/HwHoouMTr5
fpocket: "Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier. (arXiv:1911.04759v1 [https://t.co/fkv1Ct3qsg])" https://t.co/jTwuLqwVJJ
arxiv_cs_LG: Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier. Kévin Cousot, Mehdi Mirzapour, and Waleed Ragheb https://t.co/JaSp7VbIis
arxiv_cscl: Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier https://t.co/BzRmo2U8GI
arxiv_cscl: Prediction of Missing Semantic Relations in Lexical-Semantic Network using Random Forest Classifier https://t.co/BzRmo2CxPa
Github

A project/paper that uses Graph Embedding and Random Forest classifier for missing semantic relation prediction.

Repository: JDM_Graph_Embedding
User: mehdi-mirzapour
Language: Jupyter Notebook
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 2876
Unique Words: 1233

2.033 Mikeys
#8. LexiPers: An ontology based sentiment lexicon for Persian
Behnam Sabeti, Pedram Hosseini, Gholamreza Ghassem-Sani, Seyed Abolghasem Mirroshandel
Sentiment analysis refers to the use of natural language processing to identify and extract subjective information from textual resources. One approach for sentiment extraction is using a sentiment lexicon. A sentiment lexicon is a set of words associated with the sentiment orientation that they express. In this paper, we describe the process of generating a general purpose sentiment lexicon for Persian. A new graph-based method is introduced for seed selection and expansion based on an ontology. Sentiment lexicon generation is then mapped to a document classification problem. We used the K-nearest neighbors and nearest centroid methods for classification. These classifiers have been evaluated based on a set of hand labeled synsets. The final sentiment lexicon has been generated by the best classifier. The results show an acceptable performance in terms of accuracy and F-measure in the generated sentiment lexicon.
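As a rough illustration of the nearest centroid method named in this abstract, the sketch below classifies a feature vector by the closest class mean. It is a generic textbook version, not the authors' pipeline: the two-dimensional feature vectors, the labels `pos` and `neg`, and both function names are hypothetical, and the abstract does not describe how synsets are turned into features.

```python
from collections import defaultdict
from math import dist
from typing import Dict, List, Sequence


def fit_centroids(
    vectors: Sequence[Sequence[float]], labels: Sequence[str]
) -> Dict[str, List[float]]:
    """Compute the mean vector (centroid) of each class."""
    grouped: Dict[str, List[Sequence[float]]] = defaultdict(list)
    for vector, label in zip(vectors, labels):
        grouped[label].append(vector)
    return {
        label: [sum(column) / len(members) for column in zip(*members)]
        for label, members in grouped.items()
    }


def predict(centroids: Dict[str, List[float]], vector: Sequence[float]) -> str:
    """Return the label whose centroid is nearest in Euclidean distance."""
    return min(centroids, key=lambda label: dist(centroids[label], vector))


# Hypothetical toy features for hand-labeled items.
train_vectors = [[1.0, 0.0], [0.8, 0.2], [0.0, 1.0], [0.2, 0.8]]
train_labels = ["pos", "pos", "neg", "neg"]

centroids = fit_centroids(train_vectors, train_labels)
print(predict(centroids, [0.7, 0.3]))
```

`math.dist` requires Python 3.8 or later.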
Figures
None.
Tweets
arxivml: "LexiPers: An ontology based sentiment lexicon for Persian", Behnam Sabeti, Pedram Hosseini, Gholamreza Ghassem-San… https://t.co/4jjxKygqAe
arxiv_cscl: LexiPers: An ontology based sentiment lexicon for Persian https://t.co/wBsQytfnqV
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 4
Total Words: 0
Unique Words: 0

2.033 Mikeys
#9. Creating Auxiliary Representations from Charge Definitions for Criminal Charge Prediction
Liangyi Kang, Jie Liu, Lingqiao Liu, Qinfeng Shi, Dan Ye
Charge prediction, determining charges for criminal cases by analyzing the textual fact descriptions, is a promising technology in legal assistant systems. In practice, the fact descriptions could exhibit a significant intra-class variation due to factors like non-normative use of language, which makes the prediction task very challenging, especially for charge classes with too few samples to cover the expression variation. In this work, we explore using the charge definitions from criminal law to alleviate this issue. The key idea is that the expressions in a fact description should have corresponding formal terms in charge definitions, and those terms are shared across classes and could account for the diversity in the fact descriptions. Thus, we propose to create auxiliary fact representations from charge definitions to augment the fact description representation. The generated auxiliary representations are created through the interaction of fact description with the relevant charge definitions and terms in those definitions by...
Figures
None.
Tweets
arxivml: "Creating Auxiliary Representations from Charge Definitions for Criminal Charge Prediction", Liangyi Kang, Jie Liu,… https://t.co/F0iIocLEay
arxiv_cscl: Creating Auxiliary Representations from Charge Definitions for Criminal Charge Prediction https://t.co/0GdzQXlipv
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 5
Total Words: 0
Unique Words: 0

2.033 Mikeys
#10. Neural Duplicate Question Detection without Labeled Training Data
Andreas Rücklé, Nafise Sadat Moosavi, Iryna Gurevych
Supervised training of neural models for duplicate question detection in community Question Answering (cQA) requires large amounts of labeled question pairs, which can be costly to obtain. To minimize this cost, recent work has often used alternative methods, e.g., adversarial domain adaptation. In this work, we propose two novel methods (weak supervision using the title and body of a question, and the automatic generation of duplicate questions) and show that both can achieve improved performance even though they do not require any labeled data. We provide a comparison of popular training strategies and show that our proposed approaches are more effective in many cases because they can utilize larger amounts of data from the cQA forums. Finally, we show that weak supervision with question title and body information is also an effective method to train cQA answer selection models without direct answer supervision.
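One plausible reading of the title/body weak supervision described in this abstract is sketched below: a question's own title and body form a positive pair, and the title paired with another question's body forms a negative. The random negative sampling, the `title`/`body` field names, and the function `build_weak_pairs` are assumptions for illustration. The abstract does not state how the authors construct their training pairs.

```python
import random
from typing import Dict, List, Tuple

# (title, body, label) where label 1 = same question, 0 = different question.
Pair = Tuple[str, str, int]


def build_weak_pairs(questions: List[Dict[str, str]], seed: int = 0) -> List[Pair]:
    """Build weakly labeled (title, body) pairs without human annotation.

    Assumption: negatives are drawn uniformly at random from the bodies of
    other questions. A seeded generator keeps the output reproducible.
    """
    rng = random.Random(seed)
    pairs: List[Pair] = []
    for i, question in enumerate(questions):
        # Positive pair: the title summarizes its own body.
        pairs.append((question["title"], question["body"], 1))
        # Negative pair: the same title with a body from another question.
        other_indices = [j for j in range(len(questions)) if j != i]
        if other_indices:
            j = rng.choice(other_indices)
            pairs.append((question["title"], questions[j]["body"], 0))
    return pairs


forum_dump = [
    {"title": "How do I undo a git commit?", "body": "I committed too early and want to revert."},
    {"title": "Python list vs tuple", "body": "When should I use a tuple instead of a list?"},
    {"title": "Resize an image in CSS", "body": "My image overflows its container on mobile."},
]
for title, body, label in build_weak_pairs(forum_dump):
    print(label, "|", title, "|", body)
```

Because the pairs come straight from forum dumps, the amount of training data scales with the number of questions rather than with annotation effort, which matches the motivation stated in the abstract.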
Figures
None.
Tweets
arxivml: "Neural Duplicate Question Detection without Labeled Training Data", Andreas Rücklé, Nafise Sadat Moosavi, Iryna Gu… https://t.co/h8SJveVc1L
arxiv_cscl: Neural Duplicate Question Detection without Labeled Training Data https://t.co/c8ua1pooTN
Github
None.
Youtube
None.
Other stats
Sample Sizes: None.
Authors: 3
Total Words: 0
Unique Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 222,102 papers.
