Top 10 arXiv Papers Today in Computation and Language


2.296 Mikeys
#1. TWEETQA: A Social Media Focused Question Answering Dataset
Wenhan Xiong, Jiawei Wu, Hong Wang, Vivek Kulkarni, Mo Yu, Shiyu Chang, Xiaoxiao Guo, William Yang Wang
With social media becoming increasingly popular on which lots of news and real-time events are reported, developing automated question answering systems is critical to the effectiveness of many applications that rely on real-time knowledge. While previous datasets have concentrated on question answering (QA) for formal text like news and Wikipedia, we present the first large-scale dataset for QA over social media data. To ensure that the tweets we collected are useful, we only gather tweets used by journalists to write news articles. We then ask human annotators to write questions and answers upon these tweets. Unlike other QA datasets like SQuAD in which the answers are extractive, we allow the answers to be abstractive. We show that two recently proposed neural models that perform well on formal texts are limited in their performance when applied to our dataset. In addition, even the fine-tuned BERT model is still lagging behind human performance with a large margin. Our results thus point to the need of improved QA systems targeting...
Figures
Tweets
BrundageBot: TWEETQA: A Social Media Focused Question Answering Dataset. Wenhan Xiong, Jiawei Wu, Hong Wang, Vivek Kulkarni, Mo Yu, Shiyu Chang, Xiaoxiao Guo, and William Yang Wang https://t.co/PDS4xYLjhm
arxiv_in_review: #acl2019nlp TWEETQA: A Social Media Focused Question Answering Dataset. (arXiv:1907.06292v1 [cs.CL]) https://t.co/CPyCpwYHgS
arxiv_cscl: TWEETQA: A Social Media Focused Question Answering Dataset https://t.co/najk28dV24
Github
Repository: mprc
User: shuohangwang
Language: Lua
Stargazers: 64
Subscribers: 12
Forks: 18
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 8
Total Words: 6255
Unique Words: 2213

2.128 Mikeys
#2. Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task
Alexandre Bérard, Ioan Calapodescu, Claude Roux
This paper describes the systems that we submitted to the WMT19 Machine Translation robustness task. This task aims to improve MT's robustness to noise found on social media, like informal language, spelling mistakes and other orthographic variations. The organizers provide parallel data extracted from a social media website in two language pairs: French-English and Japanese-English (in both translation directions). The goal is to obtain the best scores on unseen test sets from the same source, according to automatic metrics (BLEU) and human evaluation. We proposed one single and one ensemble system for each translation direction. Our ensemble models ranked first in all language pairs, according to BLEU evaluation. We discuss the pre-processing choices that we made, and present our solutions for robustness to noise and domain adaptation.
Figures
None.
Tweets
BrundageBot: Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task. Alexandre Bérard, Ioan Calapodescu, and Claude Roux https://t.co/plXsybLYLL
arxiv_in_review: #acl2019nlp Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task. (arXiv:1907.06488v1 [cs.CL]) https://t.co/T1yGFKsA4X
arxiv_cscl: Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task https://t.co/ex809d6nZ6
arxiv_cscl: Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task https://t.co/ex809cOMAw
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.126 Mikeys
#3. Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch
Liang Lu, Xiong Xiao, Zhuo Chen, Yifan Gong
We introduce the PyKaldi2 speech recognition toolkit, implemented on top of Kaldi and PyTorch. While similar toolkits built on the two are available, a key feature of PyKaldi2 is sequence training with criteria such as MMI, sMBR and MPE. In particular, we implemented the sequence training module with on-the-fly lattice generation during model training in order to simplify the training pipeline. To address the challenging acoustic environments in real applications, PyKaldi2 also supports on-the-fly noise and reverberation simulation to improve model robustness. With this feature, it is possible to backpropagate the gradients from the sequence-level loss to the front-end feature extraction module, which, hopefully, can foster more research in the direction of joint front-end and backend learning. We performed benchmark experiments on Librispeech, and show that PyKaldi2 can achieve reasonable recognition accuracy. The toolkit is released under the MIT license.
Figures
None.
Tweets
BrundageBot: Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch. Liang Lu, Xiong Xiao, Zhuo Chen, and Yifan Gong https://t.co/b1izS4AVSI
arxiv_cscl: Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch https://t.co/QqYaWLYzgB
Github

A Python wrapper for Kaldi

Repository: pykaldi
User: jzlianglu
Language: Python
Stargazers: 0
Subscribers: 1
Forks: 0
Open Issues: 0
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 3412
Unique Words: 1169

2.112 Mikeys
#4. Tackling Graphical NLP problems with Graph Recurrent Networks
Linfeng Song
How to properly model graphs is a long-existing and important problem in the NLP area, where several popular types of graphs are knowledge graphs, semantic graphs and dependency graphs. Compared with other data structures, such as sequences and trees, graphs are generally more powerful in representing complex correlations among entities. For example, a knowledge graph stores real-world entities (such as "Barack_Obama" and "U.S.") and their relations (such as "live_in" and "lead_by"). Properly encoding a knowledge graph is beneficial to user applications, such as question answering and knowledge discovery. Modeling graphs is also very challenging, probably because graphs usually contain massive and cyclic relations. Recent years have witnessed the success of deep learning, especially RNN-based models, on many NLP problems. Besides, RNNs and their variations have been extensively studied on several graph problems and showed preliminary successes. Despite the successes that have been achieved, RNN-based models suffer from several major...
Figures
Tweets
arxiv_cs_LG: Tackling Graphical NLP problems with Graph Recurrent Networks. Linfeng Song https://t.co/5ej82yZd4o
Memoirs: Tackling Graphical NLP problems with Graph Recurrent Networks. https://t.co/LARFpAtL8Z
arxiv_cscl: Tackling Graphical NLP problems with Graph Recurrent Networks https://t.co/bOcAuwm2zA
arxiv_cscl: Tackling Graphical NLP problems with Graph Recurrent Networks https://t.co/bOcAuwDDr8
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 35225
Unique Words: 6598

2.111 Mikeys
#5. A Simple BERT-Based Approach for Lexical Simplification
Jipeng Qiang, Yun Li, Yi Zhu, Yunhao Yuan
Lexical simplification (LS) aims to replace complex words in a given sentence with simpler alternatives of equivalent meaning. Recent unsupervised lexical simplification approaches rely only on the complex word itself, regardless of the given sentence, to generate candidate substitutions, which inevitably produces a large number of spurious candidates. We present a simple BERT-based LS approach that makes use of the pre-trained unsupervised deep bidirectional representations of BERT. We feed the given sentence, with the complex word masked, into the masked language model of BERT to generate candidate substitutions. Because the whole sentence is considered, the generated simpler alternatives more readily preserve the cohesion and coherence of the sentence. Experimental results show that our approach obtains an obvious improvement on a standard LS benchmark.
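The input-construction step described in the abstract above can be sketched as follows. This is only an illustration, not the authors' code: the function name `mask_complex_word` and the whitespace tokenization are assumptions, and the prediction step (running a masked language model over the result) is not shown. `[MASK]` is the mask token used by BERT vocabularies.

```python
def mask_complex_word(sentence: str, complex_word: str, mask_token: str = "[MASK]") -> str:
    """Return the sentence with the first occurrence of complex_word replaced by mask_token.

    Hypothetical helper: splits on whitespace, which is a simplification of
    real subword tokenization.
    """
    tokens = sentence.split()
    masked = []
    replaced = False
    for token in tokens:
        if not replaced and token == complex_word:
            masked.append(mask_token)  # hide the complex word from the model
            replaced = True
        else:
            masked.append(token)
    if not replaced:
        raise ValueError(f"{complex_word!r} does not occur in the sentence")
    return " ".join(masked)


masked_input = mask_complex_word("the cat perched on the mat", "perched")
```

The masked sentence keeps the full context around the hidden word, which is what lets a masked language model propose substitutions that fit the sentence rather than the word alone.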
Figures
None.
Tweets
SciFi: A Simple BERT-Based Approach for Lexical Simplification. https://t.co/6N2PXV50Gc
arxiv_cscl: A Simple BERT-Based Approach for Lexical Simplification https://t.co/Gd1MLUtw3I
arxiv_cscl: A Simple BERT-Based Approach for Lexical Simplification https://t.co/Gd1MLUbUF8
RexDouglass: RT @arxiv_cscl: A Simple BERT-Based Approach for Lexical Simplification https://t.co/Gd1MLUbUF8
sussenglish: RT @SciFi: A Simple BERT-Based Approach for Lexical Simplification. https://t.co/6N2PXV50Gc
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

2.08 Mikeys
#6. RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation
Blaž Škrlj, Andraž Repar, Senja Pollak
Keyword extraction is used for summarizing the content of a document and supports efficient document retrieval, and is as such an indispensable part of modern text-based systems. We explore how load centrality, a graph-theoretic measure applied to graphs derived from a given text, can be used to efficiently identify and rank keywords. Introducing meta vertices (aggregates of existing vertices) and systematic redundancy filters, the proposed method performs on par with the state of the art for the keyword extraction task on 14 diverse datasets. The proposed method is unsupervised, interpretable and can also be used for document visualization.
Figures
None.
Tweets
arxiv_cs_LG: RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation. Blaž Škrlj, Andraž Repar, and Senja Pollak https://t.co/7XG6VJ41V6
Memoirs: RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation. https://t.co/aPq9Q7j7HC
arxiv_cscl: RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation https://t.co/nlkNq3B28b
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.071 Mikeys
#7. GLOSS: Generative Latent Optimization of Sentence Representations
Sidak Pal Singh, Angela Fan, Michael Auli
We propose a method to learn unsupervised sentence representations in a non-compositional manner based on Generative Latent Optimization. Our approach does not impose any assumptions on how words are to be combined into a sentence representation. We discuss a simple Bag of Words model as well as a variant that models word positions. Both are trained to reconstruct the sentence based on a latent code and our model can be used to generate text. Experiments show large improvements over the related Paragraph Vectors. Compared to uSIF, we achieve a relative improvement of 5% when trained on the same data and our method performs competitively to Sent2vec while trained on 30 times less data.
Figures
None.
Tweets
BrundageBot: GLOSS: Generative Latent Optimization of Sentence Representations. Sidak Pal Singh, Angela Fan, and Michael Auli https://t.co/ag6gqzJZy0
arxiv_cscl: GLOSS: Generative Latent Optimization of Sentence Representations https://t.co/kqTd6ERpnK
RexDouglass: RT @arxiv_cscl: GLOSS: Generative Latent Optimization of Sentence Representations https://t.co/kqTd6ERpnK
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.053 Mikeys
#8. Relational Memory-based Knowledge Graph Embedding
Dai Quoc Nguyen, Tu Dinh Nguyen, Dinh Phung
Knowledge graph embedding models often suffer from a limitation of remembering existing triples to predict new triples. To overcome this issue, we introduce a novel embedding model, named R-MeN, that explores a relational memory network to model relationship triples. In R-MeN, we simply represent each triple as a sequence of 3 input vectors which recurrently interact with a relational memory. This memory network is constructed to incorporate new information using a self-attention mechanism over the memory and input vectors to return a corresponding output vector for every timestep. Consequently, we obtain 3 output vectors which are then multiplied element-wise into a single one; finally, we feed this vector to a linear neural layer to produce a scalar score for the triple. Experimental results show that our proposed R-MeN obtains state-of-the-art results on two well-known benchmark datasets, WN11 and FB13, for the triple classification task.
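The final scoring step described in the abstract above (element-wise product of the three output vectors, then a linear layer) can be sketched in plain Python. This is a minimal illustration, not the authors' implementation: the function name `score_triple`, the argument names, and the example values are assumptions, and the relational memory that would produce the three output vectors is not modeled here.

```python
from typing import Sequence


def score_triple(
    out_head: Sequence[float],
    out_relation: Sequence[float],
    out_tail: Sequence[float],
    weights: Sequence[float],
    bias: float = 0.0,
) -> float:
    """Combine three output vectors into one scalar score for a triple.

    Hypothetical sketch: the three inputs stand in for the per-timestep
    outputs of the memory network; weights and bias stand in for the
    parameters of the linear layer.
    """
    if not (len(out_head) == len(out_relation) == len(out_tail) == len(weights)):
        raise ValueError("all vectors must have the same length")
    # Element-wise product of the three output vectors.
    combined = [h * r * t for h, r, t in zip(out_head, out_relation, out_tail)]
    # Linear layer: dot product with the weight vector plus a bias.
    return sum(w * c for w, c in zip(weights, combined)) + bias


example_score = score_triple([1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [1.0, 1.0])  # 63.0
```

With unit weights and zero bias the score reduces to the sum of the element-wise products, which makes the example easy to check by hand (1·3·5 + 2·4·6 = 63).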
Figures
None.
Tweets
BrundageBot: Relational Memory-based Knowledge Graph Embedding. Dai Quoc Nguyen, Tu Dinh Nguyen, and Dinh Phung https://t.co/K2XN2PoiuI
arxiv_cscl: Relational Memory-based Knowledge Graph Embedding https://t.co/8l1rWnfAtm
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.053 Mikeys
#9. Cross-Lingual Transfer Learning for Question Answering
Chia-Hsuan Lee, Hung-Yi Lee
Deep learning based question answering (QA) on English documents has achieved success because a large number of English training examples are available. However, for most languages, training examples for high-quality QA models are not available. In this paper, we explore the problem of cross-lingual transfer learning for QA, where a source language task with plentiful annotations is utilized to improve the performance of a QA model on a target language task with limited available annotations. We examine two different approaches. A machine translation (MT) based approach translates the source language into the target language, or vice versa. Although the MT-based approach brings improvement, it assumes the availability of a sentence-level translation system. A GAN-based approach incorporates a language discriminator to learn language-universal feature representations, and consequently transfers knowledge from the source language. The GAN-based approach rivals the performance of the MT-based approach with fewer linguistic resources....
Figures
Tweets
BrundageBot: Cross-Lingual Transfer Learning for Question Answering. Chia-Hsuan Lee and Hung-Yi Lee https://t.co/s5s4FLpmXz
arxiv_cscl: Cross-Lingual Transfer Learning for Question Answering https://t.co/LQLn4Awx3z
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 7073
Unique Words: 2217

2.053 Mikeys
#10. Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation
Marcin Junczys-Dowmunt
This paper describes the Microsoft Translator submissions to the WMT19 news translation shared task for English-German. Our main focus is document-level neural machine translation with deep transformer models. We start with strong sentence-level baselines, trained on large-scale data created via data-filtering and noisy back-translation and find that back-translation seems to mainly help with translationese input. We explore fine-tuning techniques, deeper models and different ensembling strategies to counter these effects. Using document boundaries present in the authentic and synthetic parallel data, we create sequences of up to 1000 subword segments and train transformer translation models. We experiment with data augmentation techniques for the smaller authentic data with document-boundaries and for larger authentic data without boundaries. We further explore multi-task training for the incorporation of document-level source language monolingual data via the BERT-objective on the encoder and two-pass decoding for combinations...
Figures
None.
Tweets
BrundageBot: Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation. Marcin Junczys-Dowmunt https://t.co/wmGBGsWJUp
arxiv_cscl: Microsoft Translator at WMT 2019: Towards Large-Scale Document-Level Neural Machine Translation https://t.co/9stI5ICTsf
Github
None.
Youtube
None.
Other stats
Sample Sizes : None.
Authors: 1
Total Words: 0
Unique Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real time) based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter).

To see top papers, follow us on Twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 158,360 papers.
