### Top 10 Arxiv Papers Today in Computation And Language

##### #1. Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms
###### Giancarlo D. Salton, Robert J. Ross, John D. Kelleher
Idioms pose problems to almost all Machine Translation systems. This type of language is very frequent in day-to-day language use and cannot be simply ignored. The recent interest in memory augmented models in the field of Language Modelling has aided the systems to achieve good results by bridging long-distance dependencies. In this paper we explore the use of such techniques into a Neural Machine Translation system to help in translation of idiomatic language.
more | pdf | html
None.
###### Tweets
arxivml: "Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms", Giancarlo… https://t.co/WXBLkdIBDA
arxiv_cscl: Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms https://t.co/OGb40a6UDD
arxiv_cscl: Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms https://t.co/OGb40a6UDD
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 4952
Unqiue Words: 1594

##### #2. Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences
###### Martha Lewis, Bob Coecke, Jules Hedges, Dimitri Kartsaklis, Dan Marsden
The ability to compose parts to form a more complex whole, and to analyze a whole as a combination of elements, is desirable across disciplines. This workshop bring together researchers applying compositional approaches to physics, NLP, cognitive science, and game theory. Within NLP, a long-standing aim is to represent how words can combine to form phrases and sentences. Within the framework of distributional semantics, words are represented as vectors in vector spaces. The categorical model of Coecke et al. [2010], inspired by quantum protocols, has provided a convincing account of compositionality in vector space models of NLP. There is furthermore a history of vector space models in cognitive science. Theories of categorization such as those developed by Nosofsky [1986] and Smith et al. [1988] utilise notions of distance between feature vectors. More recently G\"ardenfors [2004, 2014] has developed a model of concepts in which conceptual spaces provide geometric structures, and information is represented by points, vectors and...
more | pdf | html
None.
###### Tweets
arxivml: "Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences", Martha Lewis, … https://t.co/pFD1Y9ItOr
arxiv_cscl: Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences https://t.co/rk9FE05hc0
arxiv_cscl: Proceedings of the 2018 Workshop on Compositional Approaches in Physics, NLP, and Social Sciences https://t.co/rk9FE05hc0
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 5
Total Words: 0
Unqiue Words: 0

##### #3. The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task
###### Alberto Poncelas, Andy Way, Kepa Sarasola
In this paper we present the ADAPT system built for the Basque to English Low Resource MT Evaluation Campaign. Basque is a low-resourced, morphologically-rich language. This poses a challenge for Neural Machine Translation models which usually achieve better performance when trained with large sets of data. Accordingly, we used synthetic data to improve the translation quality produced by a model built using only authentic data. Our proposal uses back-translated data to: (a) create new sentences, so the system can be trained with more data; and (b) translate sentences that are close to the test set, so the model can be fine-tuned to the document to be translated.
more | pdf | html
###### Tweets
BrundageBot: The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task. Alberto Poncelas, Andy Way, and Kepa Sarasola https://t.co/HutJRoHzom
arxivml: "The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task", Alberto Poncelas, Andy Way, K… https://t.co/BOoKaP0ogc
arxiv_cscl: The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task https://t.co/5u94k7aALu
ComputerPapers: The ADAPT System Description for the IWSLT 2018 Basque to English Translation Task. https://t.co/xaqGG6rIkY
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 4210
Unqiue Words: 1468

##### #4. Improving Topic Models with Latent Feature Word Representations
###### Dat Quoc Nguyen, Richard Billingsley, Lan Du, Mark Johnson
Probabilistic topic models are widely used to discover latent topics in document collections, while latent feature vector representations of words have been used to obtain high performance in many NLP tasks. In this paper, we extend two different Dirichlet multinomial topic models by incorporating latent feature vector representations of words trained on very large corpora to improve the word-topic mapping learnt on a smaller corpus. Experimental results show that by using information from the external corpora, our new models produce significant improvements on topic coherence, document clustering and document classification tasks, especially on datasets with few or short documents.
more | pdf | html
###### Tweets
BrundageBot: Improving Topic Models with Latent Feature Word Representations. Dat Quoc Nguyen, Richard Billingsley, Lan Du, and Mark Johnson https://t.co/lRQ47WpJvk
arxivml: "Improving Topic Models with Latent Feature Word Representations", Dat Quoc Nguyen, Richard Billingsley, Lan Du, Ma… https://t.co/e53gGdeO0T
krokodama: 【作製中】φ(．．) Twitterの文章を↓でトピック抽出やりたいけどうまくいかないね 今週か来週中にはかんせいさせたいね だから三連休こうしていまも研究室に引きこもってるわけです Improving Topic Models with Latent Feature Word Representations https://t.co/1yLwPBVnxQ
Memoirs: Improving Topic Models with Latent Feature Word Representations. https://t.co/rp992qiWIE
arxiv_cscl: Improving Topic Models with Latent Feature Word Representations https://t.co/nEZQEFAX3K
arxiv_cscl: Improving Topic Models with Latent Feature Word Representations https://t.co/nEZQEFAX3K
RexDouglass: RT @arxiv_cscl: Improving Topic Models with Latent Feature Word Representations https://t.co/nEZQEFAX3K
johnmvore: RT @arxiv_cscl: Improving Topic Models with Latent Feature Word Representations https://t.co/nEZQEFAX3K
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 9949
Unqiue Words: 2806

##### #5. A Hierarchical Framework for Relation Extraction with Reinforcement Learning
###### Ryuichi Takanobu, Tianyang Zhang, Jiexi Liu, Minlie Huang
Most existing methods determine relation types only after all the entities have been recognized, thus the interaction between relation types and entity mentions is not fully modeled. This paper presents a novel paradigm to deal with relation extraction by regarding the related entities as the arguments of a relation. We apply a hierarchical reinforcement learning (HRL) framework in this paradigm to enhance the interaction between entity mentions and relation types. The whole extraction process is decomposed into a hierarchy of two-level RL policies for relation detection and entity extraction respectively, so that it is more feasible and natural to deal with overlapping relations. Our model was evaluated on public datasets collected via distant supervision, and results show that it gains better performance than existing methods and is more powerful for extracting overlapping relations.
more | pdf | html
###### Tweets
BrundageBot: A Hierarchical Framework for Relation Extraction with Reinforcement Learning. Ryuichi Takanobu, Tianyang Zhang, Jiexi Liu, and Minlie Huang https://t.co/knGwDt25N8
arxivml: "A Hierarchical Framework for Relation Extraction with Reinforcement Learning", Ryuichi Takanobu, Tianyang Zhang, J… https://t.co/gGI233FR68
_makoh_: "A Hierarchical Framework for Relation Extraction with Reinforcement Learning. (arXiv:1811.03925v1 [https://t.co/Elc9rIUsHa])" https://t.co/LR2nQ8pRXC #arxiv #feedly
arxiv_cscl: A Hierarchical Framework for Relation Extraction with Reinforcement Learning https://t.co/RIGTpQg9c9
arxiv_cscl: A Hierarchical Framework for Relation Extraction with Reinforcement Learning https://t.co/RIGTpPYykB
ComputerPapers: A Hierarchical Framework for Relation Extraction with Reinforcement Learning. https://t.co/Ra15XZqvfb
RexDouglass: RT @arxiv_cscl: A Hierarchical Framework for Relation Extraction with Reinforcement Learning https://t.co/RIGTpPYykB
puneethmishra: RT @arxiv_cscl: A Hierarchical Framework for Relation Extraction with Reinforcement Learning https://t.co/RIGTpPYykB
###### Github

Joint Relation Extraction with Hierarchical Reinforcement Learning

Repository: HRL-RE
User: truthless11
Language: Python
Stargazers: 1
Subscribers: 1
Forks: 0
Open Issues: 0
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 6457
Unqiue Words: 2059

##### #6. Bringing back simplicity and lightliness into neural image captioning
###### Jean-Benoit Delbrouck, Stéphane Dupont
Neural Image Captioning (NIC) or neural caption generation has attracted a lot of attention over the last few years. Describing an image with a natural language has been an emerging challenge in both fields of computer vision and language processing. Therefore a lot of research has focused on driving this task forward with new creative ideas. So far, the goal has been to maximize scores on automated metric and to do so, one has to come up with a plurality of new modules and techniques. Once these add up, the models become complex and resource-hungry. In this paper, we take a small step backwards in order to study an architecture with interesting trade-off between performance and computational complexity. To do so, we tackle every component of a neural captioning model and propose one or more solution that lightens the model overall. Our ideas are inspired by two related tasks: Multimodal and Monomodal Neural Machine Translation.
more | pdf | html
###### Tweets
BrundageBot: Bringing back simplicity and lightliness into neural image captioning. Jean-Benoit Delbrouck and Stéphane Dupont https://t.co/q7ChYEXE3p
arxiv_flying: #AAAI2019 Bringing back simplicity and lightliness into neural image captioning. (arXiv:1810.06245v1 [cs\.CL]) https://t.co/x5tU6BC1Jv
arxivml: "Bringing back simplicity and lightliness into neural image captioning", Jean-Benoit Delbrouck, Stéphane Dupont https://t.co/yjfdP3zjAO
arxiv_cscl: Bringing back simplicity and lightliness into neural image captioning https://t.co/VB560eoaTI
arxiv_cscl: Bringing back simplicity and lightliness into neural image captioning https://t.co/VB560e6zv8
ComputerPapers: Bringing back simplicity and lightliness into neural image captioning. https://t.co/MNRZIUA4vF
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 2
Total Words: 5450
Unqiue Words: 1962

##### #7. From direct tagging to Tagging with sentences compression
###### Peihui Chen
In essence, the two tagging methods (direct tagging and tagging with sentences compression) are to tag the information we need by using regular expression which basing on the inherent language patterns of the natural language. Though it has many advantages in extracting regular data, Direct tagging is not applicable to some situations. if the data we need extract is not regular and its surrounding words are regular is relatively regular, then we can use information compression to cut the information we do not need before we tagging the data we need. In this way we can increase the precision of the data while not undermine the recall of the data.
more | pdf | html
###### Tweets
arxivml: "From direct tagging to Tagging with sentences compression", Peihui Chen https://t.co/aaJXvuORSt
arxiv_cscl: From direct tagging to Tagging with sentences compression https://t.co/SRjeYjmvj0
ComputerPapers: From direct tagging to Tagging with sentences compression. https://t.co/oJvxhGIkwW
OlegBaskov: RT @arxiv_cscl: From direct tagging to Tagging with sentences compression https://t.co/SRjeYjmvj0
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 1
Total Words: 321
Unqiue Words: 175

##### #8. Text Assisted Insight Ranking Using Context-Aware Memory Network
###### Qi Zeng, Liangchen Luo, Wenhao Huang, Yang Tang
Extracting valuable facts or informative summaries from multi-dimensional tables, i.e. insight mining, is an important task in data analysis and business intelligence. However, ranking the importance of insights remains a challenging and unexplored task. The main challenge is that explicitly scoring an insight or giving it a rank requires a thorough understanding of the tables and costs a lot of manual efforts, which leads to the lack of available training data for the insight ranking problem. In this paper, we propose an insight ranking model that consists of two parts: A neural ranking model explores the data characteristics, such as the header semantics and the data statistical features, and a memory network model introduces table structure and context information into the ranking process. We also build a dataset with text assistance. Experimental results show that our approach largely improves the ranking precision as reported in multi evaluation metrics.
more | pdf | html
###### Tweets
arxiv_org: Text Assisted Insight Ranking Using Context-Aware Memory Network. https://t.co/7Q11BwB5c9 https://t.co/mXECUcU5hx
arxivml: "Text Assisted Insight Ranking Using Context-Aware Memory Network", Qi Zeng, Liangchen Luo, Wenhao Huang, Yang Tang https://t.co/pF6POF4PzL
arxiv_cscl: Text Assisted Insight Ranking Using Context-Aware Memory Network https://t.co/DWBp2wQ9iX
IntuitMachine: RT @arxiv_org: Text Assisted Insight Ranking Using Context-Aware Memory Network. https://t.co/7Q11BwB5c9 https://t.co/mXECUcU5hx
_bha1: RT @arxiv_org: Text Assisted Insight Ranking Using Context-Aware Memory Network. https://t.co/7Q11BwB5c9 https://t.co/mXECUcU5hx
mansoorfayyaz: RT @arxiv_org: Text Assisted Insight Ranking Using Context-Aware Memory Network. https://t.co/7Q11BwB5c9 https://t.co/mXECUcU5hx
shubh_300595: RT @arxiv_org: Text Assisted Insight Ranking Using Context-Aware Memory Network. https://t.co/7Q11BwB5c9 https://t.co/mXECUcU5hx
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 6398
Unqiue Words: 2031

##### #9. Dependency Grammar Induction with a Neural Variational Transition-based Parser
###### Bowen Li, Jianpeng Cheng, Yang Liu, Frank Keller
Dependency grammar induction is the task of learning dependency syntax without annotated training data. Traditional graph-based models with global inference achieve state-of-the-art results on this task but they require $O(n^3)$ run time. Transition-based models enable faster inference with $O(n)$ time complexity, but their performance still lags behind. In this work, we propose a neural transition-based parser for dependency grammar induction, whose inference procedure utilizes rich neural features with $O(n)$ time complexity. We train the parser with an integration of variational inference, posterior regularization and variance reduction techniques. The resulting framework outperforms previous unsupervised transition-based dependency parsers and achieves performance comparable to graph-based models, both on the English Penn Treebank and on the Universal Dependency Treebank. In an empirical comparison, we show that our approach substantially increases parsing speed over graph-based models.
more | pdf | html
None.
###### Tweets
BrundageBot: Dependency Grammar Induction with a Neural Variational Transition-based Parser. Bowen Li, Jianpeng Cheng, Yang Liu, and Frank Keller https://t.co/epTDUIbho7
arxivml: "Dependency Grammar Induction with a Neural Variational Transition-based Parser", Bowen Li, Jianpeng Cheng, Yang Li… https://t.co/Oxzb6UXByZ
arxiv_cscl: Dependency Grammar Induction with a Neural Variational Transition-based Parser https://t.co/AYaaBHN4iZ
arxiv_cscl: Dependency Grammar Induction with a Neural Variational Transition-based Parser https://t.co/AYaaBHvsUp
ComputerPapers: Dependency Grammar Induction with a Neural Variational Transition-based Parser. https://t.co/Jx7UuiHHQK
###### Github

Dependency Grammar Induction

Repository: VI-dependency-syntax
User: libowen2121
Language: None
Stargazers: 7
Subscribers: 1
Forks: 0
Open Issues: 0
None.
###### Other stats
Sample Sizes : None.
Authors: 4
Total Words: 7050
Unqiue Words: 2134

##### #10. Automatic Data Expansion for Customer-care Spoken Language Understanding
###### Shahab Jalalvand, Andrej Ljolje, Srinivas Bangalore
Spoken language understanding (SLU) systems are widely used in handling of customer-care calls.A traditional SLU system consists of an acoustic model (AM) and a language model (LM) that areused to decode the utterance and a natural language understanding (NLU) model that predicts theintent. While AM can be shared across different domains, LM and NLU models need to be trainedspecifically for every new task. However, preparing enough data to train these models is prohibitivelyexpensive. In this paper, we introduce an efficient method to expand the limited in-domain data. Theprocess starts with training a preliminary NLU model based on logistic regression on the in-domaindata. Since the features are based onn= 1,2-grams, we can detect the most informative n-gramsfor each intent class. Using these n-grams, we find the samples in the out-of-domain corpus that1) contain the desired n-gram and/or 2) have similar intent label. The ones which meet the firstconstraint are used to train a new LM model and the ones that meet both constraints...
more | pdf | html
###### Tweets
arxivml: "Automatic Data Expansion for Customer-care Spoken Language Understanding", Shahab Jalalvand, Andrej Ljolje, Sriniv… https://t.co/kFfJxza74V
arxiv_cscl: Automatic Data Expansion for Customer-care Spoken Language Understanding https://t.co/bT15AgbXIW
arxiv_cscl: Automatic Data Expansion for Customer-care Spoken Language Understanding https://t.co/bT15AgbXIW
arxiv_cscl: Automatic Data Expansion for Customer-care Spoken Language Understanding https://t.co/bT15AgbXIW
ComputerPapers: Automatic Data Expansion for Customer-care Spoken Language Understanding. https://t.co/FyHW6Ydb6y
None.
None.
###### Other stats
Sample Sizes : None.
Authors: 3
Total Words: 5197
Unqiue Words: 1634

Assert is a website where the best academic papers on arXiv (computer science, math, physics), bioRxiv (biology), BITSS (reproducibility), EarthArXiv (earth science), engrXiv (engineering), LawArXiv (law), PsyArXiv (psychology), SocArXiv (social science), and SportRxiv (sport research) bubble to the top each day.

Papers are scored (in real-time) based on how verifiable they are (as determined by their Github repos) and how interesting they are (based on Twitter).

To see top papers, follow us on twitter @assertpub_ (arXiv), @assert_pub (bioRxiv), and @assertpub_dev (everything else).

To see beautiful figures extracted from papers, follow us on Instagram.

Tracking 72,893 papers.

###### Search
Sort results based on if they are interesting or reproducible.
Interesting
Reproducible
Online
###### Stats
Tracking 72,893 papers.