Showing papers on "Natural language published in 2017"

PDF

Open Access

Proceedings Article•

ConceptNet 5.5: An Open Multilingual Graph of General Knowledge

[...]

Robert Speer, Joshua Chin¹, Catherine Havasi•Institutions (1)

12 Feb 2017

TL;DR: ConceptNet as mentioned in this paper is a knowledge graph that connects words and phrases of natural language with labeled edges to represent the general knowledge involved in understanding language, improving natural language applications by allowing the application to better understand the meanings behind the words people use.

...read moreread less

Abstract: Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges. Its knowledge is collected from many sources that include expert-created resources, crowd-sourcing, and games with a purpose. It is designed to represent the general knowledge involved in understanding language, improving natural language applications by allowing the application to better understand the meanings behind the words people use. When ConceptNet is combined with word embeddings acquired from distributional semantics (such as word2vec), it provides applications with understanding that they would not acquire from distributional semantics alone, nor from narrower resources such as WordNet or DBPedia. We demonstrate this with state-of-the-art results on intrinsic evaluations of word relatedness that translate into improvements on applications of word vectors, including solving SAT-style analogies.

...read moreread less

1,136 citations

Journal Article•DOI•

Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge

[...]

Oriol Vinyals¹, Alexander Toshev¹, Samy Bengio¹, Dumitru Erhan¹•Institutions (1)

Google¹

01 Apr 2017-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be used to generate natural sentences describing an image is presented.

...read moreread less

Abstract: Automatically describing the content of an image is a fundamental problem in artificial intelligence that connects computer vision and natural language processing. In this paper, we present a generative model based on a deep recurrent architecture that combines recent advances in computer vision and machine translation and that can be used to generate natural sentences describing an image. The model is trained to maximize the likelihood of the target description sentence given the training image. Experiments on several datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. Our model is often quite accurate, which we verify both qualitatively and quantitatively. Finally, given the recent surge of interest in this task, a competition was organized in 2015 using the newly released COCO dataset. We describe and analyze the various improvements we applied to our own baseline and show the resulting performance in the competition, which we won ex-aequo with a team from Microsoft Research.

...read moreread less

848 citations

Proceedings Article•DOI•

Deep Learning for Hate Speech Detection in Tweets

[...]

Pinkesh Badjatiya¹, Shashank Gupta¹, Manish Gupta², Vasudeva Varma¹•Institutions (2)

International Institute of Information Technology, Hyderabad¹, International Institute of Information Technology²

03 Apr 2017

TL;DR: These experiments on a benchmark dataset of 16K annotated tweets show that such deep learning methods outperform state-of-the-art char/word n-gram methods by ~18 F1 points.

...read moreread less

Abstract: Hate speech detection on Twitter is critical for applications like controversial event extraction, building AI chatterbots, content recommendation, and sentiment analysis. We define this task as being able to classify a tweet as racist, sexist or neither. The complexity of the natural language constructs makes this task very challenging. We perform extensive experiments with multiple deep learning architectures to learn semantic word embeddings to handle this complexity. Our experiments on a benchmark dataset of 16K annotated tweets show that such deep learning methods outperform state-of-the-art char/word n-gram methods by ~18 F1 points.

...read moreread less

706 citations

Proceedings Article•DOI•

Bilateral Multi-Perspective Matching for Natural Language Sentences

[...]

Zhiguo Wang¹, Wael Hamza², Radu Florian¹•Institutions (2)

IBM¹, Amazon.com²

13 Feb 2017

TL;DR: This article proposed a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework, which first encodes two sentences with a BiLSTM encoder and then matches the two encoded sentences in two directions.

...read moreread less

Abstract: Natural language sentence matching is a fundamental technology for a variety of tasks. Previous approaches either match sentences from a single direction or only apply single granular (word-by-word or sentence-by-sentence) matching. In this work, we propose a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework. Given two sentences $P$ and $Q$, our model first encodes them with a BiLSTM encoder. Next, we match the two encoded sentences in two directions $P \rightarrow Q$ and $P \leftarrow Q$. In each matching direction, each time step of one sentence is matched against all time-steps of the other sentence from multiple perspectives. Then, another BiLSTM layer is utilized to aggregate the matching results into a fix-length matching vector. Finally, based on the matching vector, the decision is made through a fully connected layer. We evaluate our model on three tasks: paraphrase identification, natural language inference and answer sentence selection. Experimental results on standard benchmark datasets show that our model achieves the state-of-the-art performance on all tasks.

...read moreread less

563 citations

Posted Content•

Toward Controlled Generation of Text

[...]

Zhiting Hu¹, Zichao Yang¹, Xiaodan Liang¹, Ruslan Salakhutdinov¹, Eric P. Xing¹ - Show less +1 more•Institutions (1)

Carnegie Mellon University¹

02 Mar 2017-arXiv: Learning

TL;DR: This article proposed a new neural generative model which combines variational auto-encoders and holistic attribute discriminators for effective imposition of semantic structures, which learns highly interpretable representations from even only word annotations, and produces realistic sentences with desired attributes.

...read moreread less

Abstract: Generic generation and manipulation of text is challenging and has limited success compared to recent deep generative modeling in visual domain. This paper aims at generating plausible natural language sentences, whose attributes are dynamically controlled by learning disentangled latent representations with designated semantics. We propose a new neural generative model which combines variational auto-encoders and holistic attribute discriminators for effective imposition of semantic structures. With differentiable approximation to discrete text samples, explicit constraints on independent attribute controls, and efficient collaborative learning of generator and discriminators, our model learns highly interpretable representations from even only word annotations, and produces realistic sentences with desired attributes. Quantitative evaluation validates the accuracy of sentence and attribute generation.

...read moreread less

536 citations

Posted Content•

A Survey of Machine Learning for Big Code and Naturalness

[...]

Miltiadis Allamanis¹, Earl T. Barr², Premkumar Devanbu³, Charles Sutton⁴•Institutions (4)

Microsoft¹, University College London², University of California, Davis³, University of Edinburgh⁴

18 Sep 2017-arXiv: Software Engineering

TL;DR: This article presents a taxonomy based on the underlying design principles of each model and uses it to navigate the literature and discuss cross-cutting and application-specific challenges and opportunities.

...read moreread less

Abstract: Research at the intersection of machine learning, programming languages, and software engineering has recently taken important steps in proposing learnable probabilistic models of source code that exploit code's abundance of patterns. In this article, we survey this work. We contrast programming languages against natural languages and discuss how these similarities and differences drive the design of probabilistic models. We present a taxonomy based on the underlying design principles of each model and use it to navigate the literature. Then, we review how researchers have adapted these models to application areas and discuss cross-cutting and application-specific challenges and opportunities.

...read moreread less

503 citations

Proceedings Article•DOI•

TALL: Temporal Activity Localization via Language Query

[...]

Jiyang Gao¹, Chen Sun², Zhenheng Yang¹, Ram Nevatia¹•Institutions (2)

University of Southern California¹, Google²

01 Oct 2017

TL;DR: A novel Cross-modal Temporal Regression Localizer (CTRL) is proposed to jointly model text query and video clips, output alignment scores and action boundary regression results for candidate clips, and Experimental results show that CTRL outperforms previous methods significantly on both datasets.

...read moreread less

Abstract: This paper focuses on temporal localization of actions in untrimmed videos. Existing methods typically train classifiers for a pre-defined list of actions and apply them in a sliding window fashion. However, activities in the wild consist of a wide combination of actors, actions and objects; it is difficult to design a proper activity list that meets users’ needs. We propose to localize activities by natural language queries. Temporal Activity Localization via Language (TALL) is challenging as it requires: (1) suitable design of text and video representations to allow cross-modal matching of actions and language queries; (2) ability to locate actions accurately given features from sliding windows of limited granularity. We propose a novel Cross-modal Temporal Regression Localizer (CTRL) to jointly model text query and video clips, output alignment scores and action boundary regression results for candidate clips. Lor evaluation, we adopt TaCoS dataset, and build a new dataset for this task on top of Charades by adding sentence temporal annotations, called Charades-STA. We also build complex sentence queries in Charades-STA for test. Experimental results show that CTRL outperforms previous methods significantly on both datasets.

...read moreread less

490 citations

Proceedings Article•DOI•

Visual Translation Embedding Network for Visual Relation Detection

[...]

Hanwang Zhang¹, Zawlin Kyaw², Shih-Fu Chang¹, Tat-Seng Chua²•Institutions (2)

Columbia University¹, National University of Singapore²

21 Jul 2017

TL;DR: Zhang et al. as discussed by the authors proposed a Visual Translation Embedding Network (VTransE) for visual relation detection, which places objects in a low-dimensional relation space where a relation can be modeled as a simple vector translation, i.e., subject + predicate.

...read moreread less

Abstract: Visual relations, such as person ride bike and bike next to car, offer a comprehensive scene understanding of an image, and have already shown their great utility in connecting computer vision and natural language. However, due to the challenging combinatorial complexity of modeling subject-predicate-object relation triplets, very little work has been done to localize and predict visual relations. Inspired by the recent advances in relational representation learning of knowledge bases and convolutional object detection networks, we propose a Visual Translation Embedding network (VTransE) for visual relation detection. VTransE places objects in a low-dimensional relation space where a relation can be modeled as a simple vector translation, i.e., subject + predicate ≈ object. We propose a novel feature extraction layer that enables object-relation knowledge transfer in a fully-convolutional fashion that supports training and inference in a single forward/backward pass. To the best of our knowledge, VTransE is the first end-toend relation detection network. We demonstrate the effectiveness of VTransE over other state-of-the-art methods on two large-scale datasets: Visual Relationship and Visual Genome. Note that even though VTransE is a purely visual model, it is still competitive to the Lu’s multi-modal model with language priors [27].

...read moreread less

484 citations

Posted Content•

Learning to Represent Programs with Graphs

[...]

Miltiadis Allamanis¹, Marc Brockschmidt¹, Mahmoud Khademi•Institutions (1)

Microsoft¹

01 Nov 2017-arXiv: Learning

TL;DR: In this article, a Gated Graph Neural Network (GNN) is used to predict the name of a variable given its usage, and to reason about selecting the correct variable that should be used at a given program location.

...read moreread less

Abstract: Learning tasks on source code (i.e., formal languages) have been considered recently, but most work has tried to transfer natural language methods and does not capitalize on the unique opportunities offered by code's known syntax. For example, long-range dependencies induced by using the same variable or function in distant locations are often not considered. We propose to use graphs to represent both the syntactic and semantic structure of code and use graph-based deep learning methods to learn to reason over program structures. In this work, we present how to construct graphs from source code and how to scale Gated Graph Neural Networks training to such large graphs. We evaluate our method on two tasks: VarNaming, in which a network attempts to predict the name of a variable given its usage, and VarMisuse, in which the network learns to reason about selecting the correct variable that should be used at a given program location. Our comparison to methods that use less structured program representations shows the advantages of modeling known structure, and suggests that our models learn to infer meaningful names and to solve the VarMisuse task in many cases. Additionally, our testing showed that VarMisuse identifies a number of bugs in mature open-source projects.

...read moreread less

478 citations

Proceedings Article•DOI•

Localizing Moments in Video with Natural Language

[...]

Lisa Anne Hendricks¹, Lisa Anne Hendricks², Oliver Wang², Eli Shechtman², Josef Sivic, Trevor Darrell¹, Bryan Russell² - Show less +3 more•Institutions (2)

Lawrence Berkeley National Laboratory¹, Adobe Systems²

01 Oct 2017

TL;DR: In this paper, a Moment Context Network (MCNCLN) is proposed to localize natural language queries in videos by integrating local and global video features over time, which can identify a specific temporal segment, or moment, from a video given a natural language text description.

...read moreread less

Abstract: We consider retrieving a specific temporal segment, or moment, from a video given a natural language text description. Methods designed to retrieve whole video clips with natural language determine what occurs in a video but not when. To address this issue, we propose the Moment Context Network (MCN) which effectively localizes natural language queries in videos by integrating local and global video features over time. A key obstacle to training our MCN model is that current video datasets do not include pairs of localized video segments and referring expressions, or text descriptions which uniquely identify a corresponding moment. Therefore, we collect the Distinct Describable Moments (DiDeMo) dataset which consists of over 10,000 unedited, personal videos in diverse visual settings with pairs of localized video segments and referring expressions. We demonstrate that MCN outperforms several baseline methods and believe that our initial results together with the release of DiDeMo will inspire further research on localizing video moments with natural language.

...read moreread less

469 citations

Posted Content•

Bilateral Multi-Perspective Matching for Natural Language Sentences

[...]

Zhiguo Wang¹, Wael Hamza², Radu Florian¹•Institutions (2)

IBM¹, Amazon.com²

13 Feb 2017-arXiv: Artificial Intelligence

TL;DR: This work proposes a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework that achieves the state-of-the-art performance on all tasks.

...read moreread less

Proceedings Article•

DeepFix: Fixing Common C Language Errors by Deep Learning

[...]

Rahul Gupta¹, Soham Pal¹, Aditya Kanade¹, Shirish Shevade¹•Institutions (1)

Indian Institute of Science¹

12 Feb 2017

TL;DR: DeepFix is a multi-layered sequence-to-sequence neural network with attention which is trained to predict erroneous program locations along with the required correct statements and could fix 1881 programs completely and 1338 programs partially.

...read moreread less

Abstract: The problem of automatically fixing programming errors is a very active research topic in software engineering. This is a challenging problem as fixing even a single error may require analysis of the entire program. In practice, a number of errors arise due to programmer's inexperience with the programming language or lack of attention to detail. We call these common programming errors. These are analogous to grammatical errors in natural languages. Compilers detect such errors, but their error messages are usually inaccurate. In this work, we present an end-to-end solution, called DeepFix, that can fix multiple such errors in a program without relying on any external tool to locate or fix them. At the heart of DeepFix is a multi-layered sequence-to-sequence neural network with attention which is trained to predict erroneous program locations along with the required correct statements. On a set of 6971 erroneous C programs written by students for 93 programming tasks, DeepFix could fix 1881 (27%) programs completely and 1338 (19%) programs partially.

...read moreread less

Proceedings Article•DOI•

Deep Learning for Hate Speech Detection in Tweets

[...]

Pinkesh Badjatiya¹, Shashank Gupta¹, Manish Gupta², Vasudeva Varma¹•Institutions (2)

International Institute of Information Technology, Hyderabad¹, International Institute of Information Technology²

01 Jun 2017-arXiv: Computation and Language

TL;DR: In this article, the authors perform extensive experiments with multiple deep learning architectures to learn semantic word embeddings to handle the complexity of the natural language constructs and achieve state-of-the-art performance on hate speech detection on Twitter.

...read moreread less

Proceedings Article•DOI•

A Syntactic Neural Model for General-Purpose Code Generation

[...]

Pengcheng Yin¹, Graham Neubig¹•Institutions (1)

Carnegie Mellon University¹

01 Jul 2017

TL;DR: This paper propose a neural architecture powered by a grammar model to explicitly capture the target syntax as prior knowledge, which achieves state-of-the-art results in code generation and semantic parsing.

...read moreread less

Abstract: We consider the problem of parsing natural language descriptions into source code written in a general-purpose programming language like Python. Existing data-driven methods treat this problem as a language generation task without considering the underlying syntax of the target programming language. Informed by previous work in semantic parsing, in this paper we propose a novel neural architecture powered by a grammar model to explicitly capture the target syntax as prior knowledge. Experiments find this an effective way to scale up to generation of complex programs from natural language descriptions, achieving state-of-the-art results that well outperform previous code generation and semantic parsing approaches.

...read moreread less

Journal Article•DOI•

Sentiment Analysis Is a Big Suitcase

[...]

Erik Cambria¹, Soujanya Poria¹, Alexander Gelbukh², Mike Thelwall³•Institutions (3)

Nanyang Technological University¹, Instituto Politécnico Nacional², University of Wolverhampton³

01 Nov 2017-IEEE Intelligent Systems

TL;DR: The authors argue that there are (at least) 15 NLP problems that need to be solved to achieve human-like performance in sentiment analysis, and address the composite nature of the problem via a three-layer structure inspired by the “jumping NLP curves” paradigm.

...read moreread less

Abstract: Although most works approach it as a simple categorization problem, sentiment analysis is actually a suitcase research problem that requires tackling many natural language processing (NLP) tasks The expression “sentiment analysis” itself is a big suitcase (like many others related to affective computing, such as emotion recognition or opinion mining) that all of us use to encapsulate our jumbled idea about how our minds convey emotions and opinions through natural language The authors address the composite nature of the problem via a three-layer structure inspired by the “jumping NLP curves” paradigm In particular, they argue that there are (at least) 15 NLP problems that need to be solved to achieve human-like performance in sentiment analysis

...read moreread less

Proceedings Article•

Breaking the Softmax Bottleneck: A High-Rank RNN Language Model

[...]

Zhilin Yang¹, Zihang Dai¹, Ruslan Salakhutdinov², William W. Cohen³•Institutions (3)

Carnegie Mellon University¹, Apple Inc.², Google³

10 Nov 2017

TL;DR: The authors formulate language modeling as a matrix factorization problem, and show that the expressiveness of softmax-based models is limited by a Softmax bottleneck, which further implies that in practice Softmax with distributed word embeddings does not have enough capacity to model natural language.

...read moreread less

Abstract: We formulate language modeling as a matrix factorization problem, and show that the expressiveness of Softmax-based models (including the majority of neural language models) is limited by a Softmax bottleneck. Given that natural language is highly context-dependent, this further implies that in practice Softmax with distributed word embeddings does not have enough capacity to model natural language. We propose a simple and effective method to address this issue, and improve the state-of-the-art perplexities on Penn Treebank and WikiText-2 to 47.69 and 40.68 respectively. The proposed method also excels on the large-scale 1B Word dataset, outperforming the baseline by over 5.6 points in perplexity.

...read moreread less

Posted Content•

SQLNet: Generating Structured Queries From Natural Language Without Reinforcement Learning

[...]

Xiaojun Xu, Chang Liu, Dawn Song

13 Nov 2017-arXiv: Computation and Language

TL;DR: A sketch-based approach where the sketch contains a dependency graph, so that one prediction can be done by taking into consideration only the previous predictions that it depends on, and it is shown that SQLNet can outperform the prior art by 9% to 13% on the WikiSQL task.

...read moreread less

Abstract: Synthesizing SQL queries from natural language is a long-standing open problem and has been attracting considerable interest recently. Toward solving the problem, the de facto approach is to employ a sequence-to-sequence-style model. Such an approach will necessarily require the SQL queries to be serialized. Since the same SQL query may have multiple equivalent serializations, training a sequence-to-sequence-style model is sensitive to the choice from one of them. This phenomenon is documented as the "order-matters" problem. Existing state-of-the-art approaches rely on reinforcement learning to reward the decoder when it generates any of the equivalent serializations. However, we observe that the improvement from reinforcement learning is limited. In this paper, we propose a novel approach, i.e., SQLNet, to fundamentally solve this problem by avoiding the sequence-to-sequence structure when the order does not matter. In particular, we employ a sketch-based approach where the sketch contains a dependency graph so that one prediction can be done by taking into consideration only the previous predictions that it depends on. In addition, we propose a sequence-to-set model as well as the column attention mechanism to synthesize the query based on the sketch. By combining all these novel techniques, we show that SQLNet can outperform the prior art by 9% to 13% on the WikiSQL task.

...read moreread less

Proceedings Article•

Adversarial ranking for language generation

[...]

Kevin Lin¹, Dianqi Li¹, Xiaodong He², Zhengyou Zhang², Ming-Ting Sun¹ - Show less +1 more•Institutions (2)

University of Washington¹, Microsoft²

04 Dec 2017

TL;DR: This paper proposes a novel generative adversarial network, RankGAN, for generating high-quality language descriptions by viewing a set of data samples collectively and evaluating their quality through relative ranking scores, which helps to make better assessment which in turn helps to learn a better generator.

...read moreread less

Abstract: Generative adversarial networks (GANs) have great successes on synthesizing data However, the existing GANs restrict the discriminator to be a binary classifier, and thus limit their learning capacity for tasks that need to synthesize output with rich structures such as natural language descriptions In this paper, we propose a novel generative adversarial network, RankGAN, for generating high-quality language descriptions Rather than training the discriminator to learn and assign absolute binary predicate for individual data sample, the proposed RankGAN is able to analyze and rank a collection of human-written and machine-written sentences by giving a reference group By viewing a set of data samples collectively and evaluating their quality through relative ranking scores, the discriminator is able to make better assessment which in turn helps to learn a better generator The proposed RankGAN is optimized through the policy gradient technique Experimental results on multiple public datasets clearly demonstrate the effectiveness of the proposed approach

...read moreread less

Proceedings Article•DOI•

Learning a Neural Semantic Parser from User Feedback

[...]

Srinivasan Iyer¹, Ioannis Konstas¹, Alvin Cheung¹, Jayant Krishnamurthy², Luke Zettlemoyer² - Show less +1 more•Institutions (2)

University of Washington¹, Allen Institute for Artificial Intelligence²

27 Apr 2017

TL;DR: An approach to rapidly and easily build natural language interfaces to databases for new domains, whose performance improves over time based on user feedback, and requires minimal intervention is presented.

...read moreread less

Abstract: We present an approach to rapidly and easily build natural language interfaces to databases for new domains, whose performance improves over time based on user feedback, and requires minimal intervention. To achieve this, we adapt neural sequence models to map utterances directly to SQL with its full expressivity, bypassing any intermediate meaning representations. These models are immediately deployed online to solicit feedback from real users to flag incorrect queries. Finally, the popularity of SQL facilitates gathering annotations for incorrect predictions using the crowd, which is directly used to improve our models. This complete feedback loop, without intermediate representations or database specific engineering, opens up new ways of building high quality semantic parsers. Experiments suggest that this approach can be deployed quickly for any new target domain, as we show by learning a semantic parser for an online academic database from scratch.

...read moreread less

Posted Content•

Localizing Moments in Video with Natural Language

[...]

Lisa Anne Hendricks¹, Lisa Anne Hendricks², Oliver Wang², Eli Shechtman², Josef Sivic, Trevor Darrell¹, Bryan Russell² - Show less +3 more•Institutions (2)

Lawrence Berkeley National Laboratory¹, Adobe Systems²

04 Aug 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: The Moment Context Network (MCN) is proposed which effectively localizes natural language queries in videos by integrating local and global video features over time and outperforms several baseline methods.

...read moreread less

Proceedings Article•DOI•

CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies

[...]

Daniel Zeman¹, Martin Popel¹, Milan Straka¹, Jan Hajič¹, Joakim Nivre², Filip Ginter³, Juhani Luotolahti³, Sampo Pyysalo⁴, Slav Petrov⁵, Martin Potthast⁶, Francis M. Tyers⁷, Elena Badmaeva⁸, Memduh Gökırmak⁹, Anna Nedoluzhko¹, Silvie Cinková¹, Jaroslava Hlaváčová¹, Václava Kettnerová¹, Zdenka Uresova¹, Jenna Kanerva³, Stina Ojala³, Anna Missilä³, Christopher D. Manning¹⁰, Sebastian Schuster¹⁰, Siva Reddy¹⁰, Dima Taji¹¹, Nizar Habash¹¹, Herman Leung¹², Marie-Catherine de Marneffe¹³, Manuela Sanguinetti¹⁴, Maria Simi¹⁵, Hiroshi Kanayama¹⁶, Valeria dePaiva¹⁷, Kira Droganova¹, Héctor Martínez Alonso¹⁸, Ça ugrÄ± Çöltekin¹⁹, Umut Sulubacak, Hans Uszkoreit²⁰, Vivien Macketanz²⁰, Aljoscha Burchardt²⁰, Kim Harris, Katrin Marheinecke, Georg Rehm²⁰, Tolga Kayadelen⁵, Mohammed Attia⁵, Ali Elkahky⁵, Zhuoran Yu⁵, Emily Pitler⁵, Saran Lertpradit⁵, Michael Mandl⁵, Jesse Kirchner⁵, Hector Fernandez Alcalde⁵, Jana Strnadová⁵, Esha Banerjee⁵, Ruli Manurung⁵, Antonio Stella⁵, Atsuko Shimada⁵, Sookyoung Kwak⁵, Gustavo Mendonça⁵, Tatiana Lando⁵, Rattima Nitisaroj⁵, Josie Li⁵ - Show less +57 more•Institutions (20)

01 Jan 2017

TL;DR: The task and evaluation methodology is defined, how the data sets were prepared, report and analyze the main results, and a brief categorization of the different approaches of the participating systems are provided.

...read moreread less

Abstract: The Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their learning systems on the same data sets. In 2017, the task was devoted to learning dependency parsers for a large number of languages, in a real-world setting without any gold-standard annotation on input. All test sets followed a unified annotation scheme, namely that of Universal Dependencies. In this paper, we define the task and evaluation methodology, describe how the data sets were prepared, report and analyze the main results, and provide a brief categorization of the different approaches of the participating systems.

...read moreread less

Posted Content•

Grounded Language Learning in a Simulated 3D World

[...]

Karl Moritz Hermann, Felix Hill, Simon Green, Fumin Wang, Ryan Faulkner, Hubert Soyer, David Szepesvari, Wojciech Marian Czarnecki, Max Jaderberg, Denis Teplyashin, Marcus Wainwright, Chris Apps, Demis Hassabis, Phil Blunsom - Show less +10 more

20 Jun 2017-arXiv: Computation and Language

TL;DR: An agent is presented that learns to interpret language in a simulated 3D environment where it is rewarded for the successful execution of written instructions and its comprehension of language extends beyond its prior experience, enabling it to apply familiar language to unfamiliar situations and to interpret entirely novel instructions.

...read moreread less

Abstract: We are increasingly surrounded by artificially intelligent technology that takes decisions and executes actions on our behalf. This creates a pressing need for general means to communicate with, instruct and guide artificial agents, with human language the most compelling means for such communication. To achieve this in a scalable fashion, agents must be able to relate language to the world and to actions; that is, their understanding of language must be grounded and embodied. However, learning grounded language is a notoriously challenging problem in artificial intelligence research. Here we present an agent that learns to interpret language in a simulated 3D environment where it is rewarded for the successful execution of written instructions. Trained via a combination of reinforcement and unsupervised learning, and beginning with minimal prior knowledge, the agent learns to relate linguistic symbols to emergent perceptual representations of its physical surroundings and to pertinent sequences of actions. The agent's comprehension of language extends beyond its prior experience, enabling it to apply familiar language to unfamiliar situations and to interpret entirely novel instructions. Moreover, the speed with which this agent learns new words increases as its semantic knowledge grows. This facility for generalising and bootstrapping semantic knowledge indicates the potential of the present approach for reconciling ambiguous natural language with the complexity of the physical world.

...read moreread less

Book Chapter•DOI•

Cross-Lingual Entity Alignment via Joint Attribute-Preserving Embedding

[...]

Zequn Sun¹, Wei Hu¹, Chengkai Li²•Institutions (2)

Nanjing University¹, University of Texas at Arlington²

21 Oct 2017

TL;DR: This paper propose a joint attribute-preserving embedding model for cross-lingual entity alignment, which jointly embeds the structures of two knowledge bases into a unified vector space and further refines it by leveraging attribute correlations in the knowledge bases.

...read moreread less

Abstract: Entity alignment is the task of finding entities in two knowledge bases (KBs) that represent the same real-world object. When facing KBs in different natural languages, conventional cross-lingual entity alignment methods rely on machine translation to eliminate the language barriers. These approaches often suffer from the uneven quality of translations between languages. While recent embedding-based techniques encode entities and relationships in KBs and do not need machine translation for cross-lingual entity alignment, a significant number of attributes remain largely unexplored. In this paper, we propose a joint attribute-preserving embedding model for cross-lingual entity alignment. It jointly embeds the structures of two KBs into a unified vector space and further refines it by leveraging attribute correlations in the KBs. Our experimental results on real-world datasets show that this approach significantly outperforms the state-of-the-art embedding approaches for cross-lingual entity alignment and could be complemented with methods based on machine translation.

...read moreread less

Posted Content•

Efficient Natural Language Response Suggestion for Smart Reply

[...]

Matthew L. Henderson, Rami Al-Rfou, Brian Strope, Yun-Hsuan Sung, László Lukács, Ruiqi Guo, Sanjiv Kumar, Balint Miklos, Ray Kurzweil - Show less +5 more

01 May 2017-arXiv: Computation and Language

TL;DR: A computationally efficient machine-learned method for natural language response suggestion using feed-forward neural networks using n-gram embedding features that achieves the same quality at a small fraction of the computational requirements and latency.

...read moreread less

Abstract: This paper presents a computationally efficient machine-learned method for natural language response suggestion. Feed-forward neural networks using n-gram embedding features encode messages into vectors which are optimized to give message-response pairs a high dot-product value. An optimized search finds response suggestions. The method is evaluated in a large-scale commercial e-mail application, Inbox by Gmail. Compared to a sequence-to-sequence approach, the new system achieves the same quality at a small fraction of the computational requirements and latency.

...read moreread less

Journal Article•DOI•

Visual question answering: A survey of methods and datasets

[...]

Qi Wu¹, Damien Teney¹, Peng Wang¹, Chunhua Shen¹, Anthony Dick¹, Anton van den Hengel¹ - Show less +2 more•Institutions (1)

University of Adelaide¹

01 Oct 2017-Computer Vision and Image Understanding

TL;DR: Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities as mentioned in this paper, which requires reasoning over visual elements of the image and general knowledge to infer the correct answer.

...read moreread less

DOI•

Disinventing and Reconstituting Languages

[...]

Daragh Hayes

19 Mar 2017

Proceedings Article•DOI•

Neural Network-based Question Answering over Knowledge Graphs on Word and Character Level

[...]

Denis Lukovnikov¹, Asja Fischer¹, Jens Lehmann¹, Sören Auer¹•Institutions (1)

University of Bonn¹

03 Apr 2017

TL;DR: This work trains a neural network for answering simple questions in an end-to-end manner, leaving all decisions to the model, which contains a nested word/character-level question encoder which allows to handle out-of-vocabulary and rare word problems while still being able to exploit word-level semantics.

...read moreread less

Abstract: Question Answering (QA) systems over Knowledge Graphs (KG) automatically answer natural language questions using facts contained in a knowledge graph. Simple questions, which can be answered by the extraction of a single fact, constitute a large part of questions asked on the web but still pose challenges to QA systems, especially when asked against a large knowledge resource. Existing QA systems usually rely on various components each specialised in solving different sub-tasks of the problem (such as segmentation, entity recognition, disambiguation, and relation classification etc.). In this work, we follow a quite different approach: We train a neural network for answering simple questions in an end-to-end manner, leaving all decisions to the model. It learns to rank subject-predicate pairs to enable the retrieval of relevant facts given a question. The network contains a nested word/character-level question encoder which allows to handle out-of-vocabulary and rare word problems while still being able to exploit word-level semantics. Our approach achieves results competitive with state-of-the-art end-to-end approaches that rely on an attention mechanism.

...read moreread less

Journal Article•DOI•

SQLizer: query synthesis from natural language

[...]

Navid Yaghmazadeh¹, Yuepeng Wang¹, Isil Dillig¹, Thomas Dillig¹•Institutions (1)

University of Texas at Austin¹

12 Oct 2017

TL;DR: This paper presents a new technique for automatically synthesizing SQL queries from natural language (NL) using a new NL-based program synthesis methodology that combines semantic parsing techniques from the NLP community with type-directed program synthesis and automated program repair.

...read moreread less

Abstract: This paper presents a new technique for automatically synthesizing SQL queries from natural language (NL). At the core of our technique is a new NL-based program synthesis methodology that combines semantic parsing techniques from the NLP community with type-directed program synthesis and automated program repair. Starting with a program sketch obtained using standard parsing techniques, our approach involves an iterative refinement loop that alternates between probabilistic type inhabitation and automated sketch repair. We use the proposed idea to build an end-to-end system called SQLIZER that can synthesize SQL queries from natural language. Our method is fully automated, works for any database without requiring additional customization, and does not require users to know the underlying database schema. We evaluate our approach on over 450 natural language queries concerning three different databases, namely MAS, IMDB, and YELP. Our experiments show that the desired query is ranked within the top 5 candidates in close to 90% of the cases and that SQLIZER outperforms NALIR, a state-of-the-art tool that won a best paper award at VLDB'14.

...read moreread less

Proceedings Article•DOI•

A Corpus of Natural Language for Visual Reasoning.

[...]

Alane Suhr¹, Michael Lewis², James Yeh, Yoav Artzi¹•Institutions (2)

Cornell University¹, University of Pittsburgh²

01 Jul 2017

TL;DR: A method of crowdsourcing linguistically-diverse data, and an analysis of the data demonstrates a broad set of linguistic phenomena, requiring visual and set-theoretic reasoning.

...read moreread less

Abstract: We present a new visual reasoning language dataset, containing 92,244 pairs of examples of natural statements grounded in synthetic images with 3,962 unique sentences. We describe a method of crowdsourcing linguistically-diverse data, and present an analysis of our data. The data demonstrates a broad set of linguistic phenomena, requiring visual and set-theoretic reasoning. We experiment with various models, and show the data presents a strong challenge for future research.

...read moreread less

Book Chapter•DOI•

LC-QuAD: A Corpus for Complex Question Answering over Knowledge Graphs

[...]

Priyansh Trivedi¹, Gaurav Maheshwari¹, Mohnish Dubey¹, Jens Lehmann¹•Institutions (1)

University of Bonn¹

21 Oct 2017

TL;DR: The Large-Scale Complex Question Answering Dataset (LC-QuAD) is provided, providing a dataset with 5000 questions and their corresponding SPARQL queries over the DBpedia dataset to assess the robustness and accuracy of the next generation of QA systems for knowledge graphs.

...read moreread less

Abstract: Being able to access knowledge bases in an intuitive way has been an active area of research over the past years. In particular, several question answering (QA) approaches which allow to query RDF datasets in natural language have been developed as they allow end users to access knowledge without needing to learn the schema of a knowledge base and learn a formal query language. To foster this research area, several training datasets have been created, e.g. in the QALD (Question Answering over Linked Data) initiative. However, existing datasets are insufficient in terms of size, variety or complexity to apply and evaluate a range of machine learning based QA approaches for learning complex SPARQL queries. With the provision of the Large-Scale Complex Question Answering Dataset (LC-QuAD), we close this gap by providing a dataset with 5000 questions and their corresponding SPARQL queries over the DBpedia dataset. In this article, we describe the dataset creation process and how we ensure a high variety of questions, which should enable to assess the robustness and accuracy of the next generation of QA systems for knowledge graphs.

...read moreread less

Collapse