Top 10 arXiv Papers Today in Computation and Language


2.195 Mikeys
#1. A Comparative Study on Transformer vs RNN in Speech Applications
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang
Sequence-to-sequence models have been widely used in end-to-end speech processing, for example, automatic speech recognition (ASR), speech translation (ST), and text-to-speech (TTS). This paper focuses on an emergent sequence-to-sequence model called Transformer, which achieves state-of-the-art performance in neural machine translation and other natural language processing applications. We undertook intensive studies in which we experimentally compared and analyzed Transformer and conventional recurrent neural networks (RNN) in a total of 15 ASR, one multilingual ASR, one ST, and two TTS benchmarks. Our experiments revealed various training tips and significant performance benefits obtained with Transformer for each task, including the surprising superiority of Transformer over RNN in 13/15 ASR benchmarks. We are preparing to release Kaldi-style reproducible recipes using open source and publicly available datasets for all the ASR, ST, and TTS tasks so that the community can reproduce and build on our results.
Tweets
BrundageBot: A Comparative Study on Transformer vs RNN in Speech Applications. Karita, Chen, Hayashi, Hori, Inaguma, Jiang, Someki, Soplin, Yamamoto, Wang, Watanabe, Yoshimura, and Zhang https://t.co/ZG2tuYIGSz
kari_tech: https://t.co/3QPtNiEk46 Our preprint "A Comparative Study on Transformer vs RNN in Speech Applications" (ASRU2019) is available! https://t.co/g6XFlxpYYB
ballforest: RT @kari_tech: https://t.co/3QPtNiEk46 Our preprint "A Comparative Study on Transformer vs RNN in Speech Applications" (ASRU2019) is availa…
kastnerkyle: RT @kari_tech: https://t.co/3QPtNiEk46 Our preprint "A Comparative Study on Transformer vs RNN in Speech Applications" (ASRU2019) is availa…
ymas0315: RT @kari_tech: https://t.co/3QPtNiEk46 Our preprint "A Comparative Study on Transformer vs RNN in Speech Applications" (ASRU2019) is availa…
r9y9: RT @kari_tech: https://t.co/3QPtNiEk46 Our preprint "A Comparative Study on Transformer vs RNN in Speech Applications" (ASRU2019) is availa…
chbalajitilak: RT @arxiv_cscl: A Comparative Study on Transformer vs RNN in Speech Applications https://t.co/KDiMBh0O1D
Other stats
Sample Sizes : None.
Authors: 13
Total Words: 0
Unique Words: 0

2.173 Mikeys
#2. Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering
Shiyue Zhang, Mohit Bansal
Text-based Question Generation (QG) aims at generating natural and relevant questions that can be answered by a given answer in some context. Existing QG models suffer from a "semantic drift" problem, i.e., the semantics of the model-generated question drifts away from the given context and answer. In this paper, we first propose two semantics-enhanced rewards obtained from downstream question paraphrasing and question answering tasks to regularize the QG model to generate semantically valid questions. Second, since the traditional evaluation metrics (e.g., BLEU) often fall short in evaluating the quality of generated questions, we propose a QA-based evaluation method which measures the QG model's ability to mimic human annotators in generating QA training data. Experiments show that our method achieves the new state-of-the-art performance w.r.t. traditional metrics, and also performs best on our QA-based evaluation metrics. Further, we investigate how to use our QG model to augment QA datasets and enable semi-supervised QA. We...
Tweets
BrundageBot: Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering. Shiyue Zhang and Mohit Bansal https://t.co/JAV8U4ybe5
arxivml: "Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering", Shiyue Zhang, Mohit Bans… https://t.co/XHCrUAdEIQ
arxiv_cs_LG: Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering. Shiyue Zhang and Mohit Bansal https://t.co/wVa2BULeLu
arxiv_cscl: Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering https://t.co/LQ3AvzwQWg
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 0
Unique Words: 0

2.13 Mikeys
#3. A Neural Approach to Irony Generation
Mengdi Zhu, Zhiwei Yu, Xiaojun Wan
Ironies can not only express stronger emotions but also show a sense of humor. With the development of social media, ironies are widely used in public. Although many prior studies have been conducted on irony detection, few focus on irony generation. The main challenges for irony generation are the lack of a large-scale irony dataset and difficulties in modeling the ironic pattern. In this work, we first systematically define irony generation as a style transfer task. To address the lack of data, we make use of Twitter and build a large-scale dataset. We also design a combination of rewards for reinforcement learning to control the generation of ironic sentences. Experimental results demonstrate the effectiveness of our model in terms of irony accuracy, sentiment preservation, and content preservation.
Tweets
BrundageBot: A Neural Approach to Irony Generation. Mengdi Zhu, Zhiwei Yu, and Xiaojun Wan https://t.co/jJLZFaBsYA
arxivml: "A Neural Approach to Irony Generation", Mengdi Zhu, Zhiwei Yu, Xiaojun Wan https://t.co/FMpI9oeB3Z
arxiv_cs_LG: A Neural Approach to Irony Generation. Mengdi Zhu, Zhiwei Yu, and Xiaojun Wan https://t.co/HKT304Qhdk
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 6534
Unique Words: 1844

2.13 Mikeys
#4. Scene Graph Parsing by Attention Graph
Martin Andrews, Yew Ken Chia, Sam Witteveen
Scene graph representations, which form a graph of visual object nodes together with their attributes and relations, have proved useful across a variety of vision and language applications. Recent work in the area has used Natural Language Processing dependency tree methods to automatically build scene graphs. In this work, we present an 'Attention Graph' mechanism that can be trained end-to-end, and produces a scene graph structure that can be lifted directly from the top layer of a standard Transformer model. The scene graphs generated by our model achieve an F-score similarity of 52.21% to ground-truth graphs on the evaluation set using the SPICE metric, surpassing the best previous approaches by 2.5%.
Tweets
BrundageBot: Scene Graph Parsing by Attention Graph. Martin Andrews, Yew Ken Chia, and Sam Witteveen https://t.co/NbIT7PCgvA
arxivml: "Scene Graph Parsing by Attention Graph", Martin Andrews, Yew Ken Chia, Sam Witteveen https://t.co/zDNwvOREMU
arxiv_cs_LG: Scene Graph Parsing by Attention Graph. Martin Andrews, Yew Ken Chia, and Sam Witteveen https://t.co/Qd8UOHHKpc
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.079 Mikeys
#5. Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases
Akshay Parekh, Ashish Anand, Amit Awekar
This work addresses two important questions pertinent to Relation Extraction (RE). First, what are all possible relations that could exist between any two given entity types? Second, how do we define an unambiguous taxonomical (is-a) hierarchy among the identified relations? To address the first question, we use three resources: Wikipedia Infobox, Wikidata, and DBpedia. This study focuses on relations between person, organization, and location entity types. We exploit Wikidata and DBpedia in a data-driven manner, and Wikipedia Infobox templates manually, to generate lists of relations. Further, to address the second question, we canonicalize, filter, and combine the identified relations from the three resources to construct a taxonomical hierarchy. This hierarchy contains 623 canonical relations, with the highest contribution from Wikipedia Infobox, followed by DBpedia and Wikidata. The generated relation list subsumes an average of 85% of relations from RE datasets when entity types are restricted.
Tweets
anand_ashish: How many relations among Person, Location and Organisations exists?How to estimate the number of such relations in a data driven manner?What relation exists among these relations?An attempt to answer the above questions with @babaakki25 @amitawekar in https://t.co/a7j8EfrmuA
RexDouglass: Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases https://t.co/VeQfQHzZd9
RexDouglass: RT @arxiv_cscl: Taxonomical hierarchy of canonicalized relations from multiple Knowledge Bases https://t.co/yjGHHczJj8
Other stats
Sample Sizes : None.
Authors: 3
Total Words: 0
Unique Words: 0

2.076 Mikeys
#6. Neural Machine Translation with 4-Bit Precision and Beyond
Alham Fikri Aji, Kenneth Heafield
Neural Machine Translation (NMT) is resource intensive. We design a quantization procedure to compress NMT models so that they better fit devices with limited hardware capability. We use logarithmic quantization, instead of the more commonly used fixed-point quantization, based on the empirical fact that the parameter distribution is not uniform. We find that biases do not take a lot of memory and show that biases can be left uncompressed to improve the overall quality without affecting the compression rate. We also propose to use an error-feedback mechanism during retraining, to preserve the compressed model as a stale gradient. We empirically show that NMT models based on Transformer or RNN architecture can be compressed up to 4-bit precision without any noticeable quality degradation. Models can be compressed up to binary precision, albeit with lower quality. The RNN architecture seems to be more robust towards compression than the Transformer.
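As a rough illustration of the logarithmic quantization idea in the abstract above (not the authors' procedure), the sketch below rounds each weight to a signed power of two times a shared scale. The function name `log_quantize`, the use of the largest absolute weight as the scale, and the layout of one sign bit plus exponent bits are assumptions made for this example; bias handling and the error-feedback retraining step are left out.

```python
import math


def log_quantize(weights, bits=4):
    """Round each weight to sign * scale * 2**exponent (hypothetical helper).

    Assumed layout: 1 sign bit, and the remaining bits index a
    non-positive integer exponent.
    """
    if bits < 2:
        raise ValueError("need one sign bit and at least one exponent bit")
    levels = 2 ** (bits - 1)      # exponent codes left after the sign bit
    min_exp = -(levels - 1)       # smallest representable exponent
    scale = max(abs(w) for w in weights)  # assumed scale: largest magnitude
    if scale == 0.0:
        return [0.0 for _ in weights]

    quantized = []
    for w in weights:
        if w == 0.0:
            # No exact zero in this scheme: fall back to the smallest magnitude.
            exp = min_exp
        else:
            # Round in the log domain, then clip to the representable range.
            exp = round(math.log2(abs(w) / scale))
            exp = max(min_exp, min(0, exp))
        sign = -1.0 if w < 0 else 1.0
        quantized.append(sign * scale * 2.0 ** exp)
    return quantized


print(log_quantize([1.0, 0.5, -0.26, 0.01], bits=4))
# prints [1.0, 0.5, -0.25, 0.0078125]
```

With 4 bits this toy scheme has 8 magnitude levels per sign, spaced densely near zero and sparsely near the scale, which is the property that makes a logarithmic grid attractive when the parameter distribution is not uniform.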
Tweets
BrundageBot: Neural Machine Translation with 4-Bit Precision and Beyond. Alham Fikri Aji and Kenneth Heafield https://t.co/OgBvnRq0Jd
arxiv_cs_LG: Neural Machine Translation with 4-Bit Precision and Beyond. Alham Fikri Aji and Kenneth Heafield https://t.co/HNhP4KVdMl
Other stats
Sample Sizes : None.
Authors: 2
Total Words: 3747
Unique Words: 1436

2.076 Mikeys
#7. Say What I Want: Towards the Dark Side of Neural Dialogue Models
Haochen Liu, Tyler Derr, Zitao Liu, Jiliang Tang
Neural dialogue models have been widely adopted in various chatbot applications because of their good performance in simulating and generalizing human conversations. However, there exists a dark side of these models -- due to the vulnerability of neural networks, a neural dialogue model can be manipulated by users to say what they want, which raises concerns about the security of practical chatbot services. In this work, we investigate whether we can craft inputs that lead a well-trained black-box neural dialogue model to generate targeted outputs. We formulate this as a reinforcement learning (RL) problem and train a Reverse Dialogue Generator which efficiently finds such inputs for targeted outputs. Experiments conducted on a representative neural dialogue model show that our proposed model is able to discover such desired inputs in a considerable portion of cases. Overall, our work reveals this weakness of neural dialogue models and may prompt further research into developing corresponding solutions to avoid it.
Tweets
BrundageBot: Say What I Want: Towards the Dark Side of Neural Dialogue Models. Haochen Liu, Tyler Derr, Zitao Liu, and Jiliang Tang https://t.co/YSpRnGMK9Q
arxiv_cs_LG: Say What I Want: Towards the Dark Side of Neural Dialogue Models. Haochen Liu, Tyler Derr, Zitao Liu, and Jiliang Tang https://t.co/v2aylzpuca
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

2.028 Mikeys
#8. Toward Automated Quest Generation in Text-Adventure Games
Prithviraj Ammanabrolu, William Broniec, Alex Mueller, Jeremy Paul, Mark O. Riedl
Interactive fictions, or text-adventures, are games in which a player interacts with a world entirely through textual descriptions and text actions. Text-adventure games are typically structured as puzzles or quests wherein the player must execute certain actions in a certain order to succeed. In this paper, we consider the problem of procedurally generating a quest, defined as a series of actions required to progress towards a goal, in a text-adventure game. Quest generation in text environments is challenging because quests must be semantically coherent. We present and evaluate two quest generation techniques: (1) Markov chains, and (2) a neural generative model. We specifically look at generating quests about cooking and train our models on recipe data. We evaluate our techniques with human participant studies looking at perceived creativity and coherence.
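To make the Markov chain option mentioned in the abstract above concrete, here is a minimal sketch of a first-order chain over quest actions. It is not the authors' system: the toy cooking quests, the `<start>`/`<end>` markers, and the helper names `train_chain` and `generate_quest` are all assumptions invented for this illustration.

```python
import random
from collections import defaultdict

START, END = "<start>", "<end>"  # assumed boundary markers


def train_chain(quests):
    """Record, for every action, the actions observed directly after it."""
    transitions = defaultdict(list)
    for quest in quests:
        steps = [START, *quest, END]
        for current, following in zip(steps, steps[1:]):
            transitions[current].append(following)
    return transitions


def generate_quest(transitions, rng, max_steps=10):
    """Sample one action sequence by walking the chain from START."""
    quest, current = [], START
    for _ in range(max_steps):
        # Repeated successors in the list make frequent transitions likelier.
        current = rng.choice(transitions[current])
        if current == END:
            break
        quest.append(current)
    return quest


# Toy training data, invented for this example.
toy_quests = [
    ["take flour", "mix batter", "bake cake"],
    ["take eggs", "mix batter", "fry pancake"],
]
chain = train_chain(toy_quests)
quest = generate_quest(chain, random.Random(0))
print(quest)
```

Because the chain only conditions on the previous action, it can produce a sequence such as taking eggs and then baking a cake, a recombination that appears in neither training quest. That is one way to see why semantic coherence is the hard part of quest generation.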
Tweets
BrundageBot: Toward Automated Quest Generation in Text-Adventure Games. Prithviraj Ammanabrolu, William Broniec, Alex Mueller, Jeremy Paul, and Mark O. Riedl https://t.co/n6u8BTmeLt
Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unique Words: 0

2.028 Mikeys
#9. Finding Generalizable Evidence by Learning to Convince Q&A Models
Ethan Perez, Siddharth Karamcheti, Rob Fergus, Jason Weston, Douwe Kiela, Kyunghyun Cho
We propose a system that finds the strongest supporting evidence for a given answer to a question, using passage-based question-answering (QA) as a testbed. We train evidence agents to select the passage sentences that most convince a pretrained QA model of a given answer, if the QA model received those sentences instead of the full passage. Rather than finding evidence that convinces one model alone, we find that agents select evidence that generalizes; agent-chosen evidence increases the plausibility of the supported answer, as judged by other QA models and humans. Given its general nature, this approach improves QA in a robust manner: using agent-selected evidence (i) humans can correctly answer questions with only ~20% of the full passage and (ii) QA models can generalize to longer passages and harder questions.
Tweets
BrundageBot: Finding Generalizable Evidence by Learning to Convince Q&A Models. Ethan Perez, Siddharth Karamcheti, Rob Fergus, Jason Weston, Douwe Kiela, and Kyunghyun Cho https://t.co/jKw6MjHicd
Other stats
Sample Sizes : None.
Authors: 6
Total Words: 0
Unique Words: 0

2.028 Mikeys
#10. Analyzing machine-learned representations: A natural language case study
Ishita Dasgupta, Demi Guo, Samuel J. Gershman, Noah D. Goodman
As modern deep networks become more complex, and get closer to human-like capabilities in certain domains, the question arises of how the representations and decision rules they learn compare to the ones in humans. In this work, we study representations of sentences in one such artificial system for natural language processing. We first present a diagnostic test dataset to examine the degree of abstract composable structure represented. Analyzing performance on these diagnostic tests indicates a lack of systematicity in the representations and decision rules, and reveals a set of heuristic strategies. We then investigate the effect of the training distribution on learning these heuristic strategies, and study changes in these representations with various augmentations to the training set. Our results reveal parallels to the analogous representations in people. We find that these systems can learn abstract rules and generalize them to new contexts under certain circumstances -- similar to human zero-shot reasoning. However, we also...
Tweets
BrundageBot: Analyzing machine-learned representations: A natural language case study. Ishita Dasgupta, Demi Guo, Samuel J. Gershman, and Noah D. Goodman https://t.co/WRJKLCOCAa
Other stats
Sample Sizes : None.
Authors: 4
Total Words: 0
Unique Words: 0

About

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored in real time based on how verifiable they are (as determined by their GitHub repos) and how interesting they are (based on Twitter activity).

To see top papers, follow us on Twitter: @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 189,566 papers.
