
Showing papers by "Luke Zettlemoyer published in 2016"


Proceedings ArticleDOI
01 Aug 2016
TL;DR: This paper presents the first completely data-driven approach for generating high-level summaries of source code, which uses Long Short-Term Memory (LSTM) networks with attention to produce sentences that describe C# code snippets and SQL queries.
Abstract: High-quality source code is often paired with high-level summaries of the computation it performs, for example in code documentation or in descriptions posted in online forums. Such summaries are extremely useful for applications such as code search but are expensive to author manually, so they exist for only a small fraction of all code that is produced. In this paper, we present the first completely data-driven approach for generating high-level summaries of source code. Our model, CODE-NN, uses Long Short-Term Memory (LSTM) networks with attention to produce sentences that describe C# code snippets and SQL queries. CODE-NN is trained on a new corpus that is automatically collected from StackOverflow, which we release. Experiments demonstrate strong performance on two tasks: (1) code summarization, where we establish the first end-to-end learning results and outperform strong baselines, and (2) code retrieval, where our learned model improves the state of the art on a recently introduced C# benchmark by a large margin.
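
A minimal sketch of the kind of attention-based LSTM decoder the abstract describes, not the released CODE-NN implementation; the class name, layer sizes, and single-example (batch of one) teacher-forced loop are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionSummarizer(nn.Module):
    """Decodes a summary one word at a time, attending over code-token embeddings."""
    def __init__(self, code_vocab, text_vocab, dim=256):
        super().__init__()
        self.code_emb = nn.Embedding(code_vocab, dim)   # embeds code tokens
        self.text_emb = nn.Embedding(text_vocab, dim)   # embeds summary words
        self.cell = nn.LSTMCell(2 * dim, dim)           # input: previous word + attention context
        self.out = nn.Linear(dim, text_vocab)

    def forward(self, code_ids, summary_ids):
        keys = self.code_emb(code_ids)                  # (code_len, dim)
        h = torch.zeros(1, keys.size(-1))
        c = torch.zeros(1, keys.size(-1))
        logits = []
        for word_id in summary_ids:                     # teacher forcing during training
            attn = F.softmax(keys @ h.squeeze(0), dim=0)          # weights over code tokens
            context = (attn.unsqueeze(1) * keys).sum(0, keepdim=True)
            x = torch.cat([self.text_emb(word_id).unsqueeze(0), context], dim=-1)
            h, c = self.cell(x, (h, c))
            logits.append(self.out(h))
        return torch.stack(logits)                      # (summary_len, 1, text_vocab)
```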

612 citations


Proceedings ArticleDOI
01 Jun 2016
TL;DR: This article introduced situation recognition, the problem of producing a concise summary of the situation an image depicts including: (1) the main activity (e.g., clipping), (2) the participating actors, objects, substances, and locations, and most importantly (3) the roles these participants play in the activity (i.e., the man is clipping, the shears are his tool, the wool is being clipped from the sheep, and the clipping is in a field).
Abstract: This paper introduces situation recognition, the problem of producing a concise summary of the situation an image depicts including: (1) the main activity (e.g., clipping), (2) the participating actors, objects, substances, and locations (e.g., man, shears, sheep, wool, and field) and most importantly (3) the roles these participants play in the activity (e.g., the man is clipping, the shears are his tool, the wool is being clipped from the sheep, and the clipping is in a field). We use FrameNet, a verb and role lexicon developed by linguists, to define a large space of possible situations and collect a large-scale dataset containing over 500 activities, 1,700 roles, 11,000 objects, 125,000 images, and 200,000 unique situations. We also introduce structured prediction baselines and show that, in activity-centric images, situation-driven prediction of objects and activities outperforms independent object and activity recognition.
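
For concreteness, a toy sketch of the structured output a situation-recognition model predicts: an activity plus a mapping from semantic roles to the entities filling them. The role names are illustrative, not the exact FrameNet labels used in the dataset.

```python
from dataclasses import dataclass, field
from typing import Dict

@dataclass
class Situation:
    activity: str                                        # the main verb, e.g. "clipping"
    roles: Dict[str, str] = field(default_factory=dict)  # semantic role -> entity filling it

# The sheep-shearing example from the abstract, expressed as one structured target.
example = Situation(
    activity="clipping",
    roles={"agent": "man", "tool": "shears", "item": "wool",
           "source": "sheep", "place": "field"},
)
```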

272 citations


Proceedings ArticleDOI
01 Nov 2016
TL;DR: The neural checklist model is presented, a recurrent neural network that models global coherence by storing and updating an agenda of text strings which should be mentioned somewhere in the output, and demonstrates high coherence with greatly improved semantic coverage of the agenda.
Abstract: Recurrent neural networks can generate locally coherent text but often have difficulties representing what has already been generated and what still needs to be said – especially when constructing long texts. We present the neural checklist model, a recurrent neural network that models global coherence by storing and updating an agenda of text strings which should be mentioned somewhere in the output. The model generates output by dynamically adjusting the interpolation among a language model and a pair of attention models that encourage references to agenda items. Evaluations on cooking recipes and dialogue system responses demonstrate high coherence with greatly improved semantic coverage of the agenda.
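
A minimal sketch of the interpolation idea described above, not the published implementation: the next-word distribution is a gated mixture of a language-model distribution and two attention-derived distributions, one over still-unused agenda items and one over items already mentioned. Names and shapes are assumptions.

```python
import torch
import torch.nn.functional as F

def next_word_distribution(lm_logits, new_item_logits, used_item_logits, gate_logits):
    """Each *_logits tensor scores the same output vocabulary; gate_logits has three entries."""
    gate = F.softmax(gate_logits, dim=-1)           # learned preference among the three components
    p_lm = F.softmax(lm_logits, dim=-1)             # ordinary language-model distribution
    p_new = F.softmax(new_item_logits, dim=-1)      # attention over agenda items not yet mentioned
    p_used = F.softmax(used_item_logits, dim=-1)    # attention over items already checked off
    return gate[0] * p_lm + gate[1] * p_new + gate[2] * p_used
```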

265 citations


Proceedings ArticleDOI
01 Jun 2016
TL;DR: This work demonstrates that a state-of-the-art parser can be built using only a lexical tagging model and a deterministic grammar, with no explicit model of bi-lexical dependencies, and can recover long-range dependencies with high accuracy.
Abstract: We demonstrate that a state-of-the-art parser can be built using only a lexical tagging model and a deterministic grammar, with no explicit model of bi-lexical dependencies. Instead, all dependencies are implicitly encoded in an LSTM supertagger that assigns CCG lexical categories. The parser significantly outperforms all previously published CCG results, supports efficient and optimal A* decoding, and benefits substantially from semi-supervised tri-training. We give a detailed analysis, demonstrating that the parser can recover long-range dependencies with high accuracy and that the semi-supervised learning enables significant accuracy gains. By running the LSTM on a GPU, we are able to parse over 2,600 sentences per second while improving state-of-the-art accuracy by 1.1 F1 in domain and up to 4.5 F1 out of domain.
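
A toy sketch of the "LSTM supertagger plus deterministic grammar" idea, not the released parser: a hypothetical lstm_supertagger assigns one CCG category per word, and a greedy shift-reduce loop combines adjacent categories with plain application rules. The real parser also needs composition, type-raising, and A* search over alternative supertags.

```python
def _strip(cat):
    # Drop one pair of outer parentheses; adequate only for this toy notation.
    return cat[1:-1] if cat.startswith("(") and cat.endswith(")") else cat

def combine(left, right):
    """Forward application X/Y Y -> X, or backward application Y X\\Y -> X."""
    if "/" in left and left.rsplit("/", 1)[1] == right:
        return _strip(left.rsplit("/", 1)[0])
    if "\\" in right and right.rsplit("\\", 1)[1] == left:
        return _strip(right.rsplit("\\", 1)[0])
    return None

def greedy_parse(words, lstm_supertagger):
    cats = lstm_supertagger(words)   # e.g. ["NP", "(S\\NP)/NP", "NP"] for "cats eat mice"
    stack = []
    for cat in cats:
        # Repeatedly try to combine the new category with the top of the stack.
        while stack:
            merged = combine(stack[-1], cat)
            if merged is None:
                break
            stack.pop()
            cat = merged
        stack.append(cat)
    return stack                     # ["S"] iff the sentence parsed completely
```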

102 citations


Proceedings ArticleDOI
01 Nov 2016
TL;DR: This paper demonstrates that it is possible for a parser to improve its performance with a human in the loop, by posing simple questions to non-experts, and applies the approach to a CCG parser, converting uncertain attachment decisions into natural language questions about the arguments of verbs.
Abstract: This paper demonstrates that it is possible for a parser to improve its performance with a human in the loop, by posing simple questions to non-experts. For example, given the first sentence of this abstract, if the parser is uncertain about the subject of the verb “pose,” it could generate the question What would pose something? with candidate answers this paper and a parser. Any fluent speaker can answer this question, and the correct answer resolves the original uncertainty. We apply the approach to a CCG parser, converting uncertain attachment decisions into natural language questions about the arguments of verbs. Experiments show that crowd workers can answer these questions quickly, accurately and cheaply. Our human-in-the-loop parser improves on the state of the art with less than 2 questions per sentence on average, with a gain of 1.7 F1 on the 10% of sentences whose parses are changed.
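
An illustrative sketch of the human-in-the-loop flow, not the paper's system: when candidate parses disagree on a verb's argument, ask a templated question and keep only parses consistent with the answer. The parser API (n_best, disagreeing_argument, argument_of), ask_crowd, and the margin threshold are all hypothetical.

```python
def resolve_with_crowd(sentence, parser, ask_crowd, margin=0.1):
    parses = parser.n_best(sentence, n=10)           # scored candidate parses, best first
    best, runner_up = parses[0], parses[1]
    if best.score - runner_up.score > margin:
        return best                                   # confident enough, no question needed
    verb, candidates = best.disagreeing_argument(runner_up)
    question = f"What would {verb} something?"        # simple template, as in the abstract
    answer = ask_crowd(question, options=candidates)  # any fluent speaker can answer
    consistent = [p for p in parses if p.argument_of(verb) == answer]
    return consistent[0] if consistent else best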

40 citations


Proceedings ArticleDOI
01 Nov 2016
TL;DR: This paper introduced the first global recursive neural parsing model with optimality guarantees during decoding, which is trained with a new objective that encourages the parser to explore a tiny fraction of the search space.
Abstract: We introduce the first global recursive neural parsing model with optimality guarantees during decoding. To support global features, we give up dynamic programs and instead search directly in the space of all possible subtrees. Although this space is exponentially large in the sentence length, we show it is possible to learn an efficient A* parser. We augment existing parsing models, which have informative bounds on the outside score, with a global model that has loose bounds but only needs to model non-local phenomena. The global model is trained with a new objective that encourages the parser to explore a tiny fraction of the search space. The approach is applied to CCG parsing, improving state-of-the-art accuracy by 0.4 F1. The parser finds the optimal parse for 99.9% of held-out sentences, exploring on average only 190 subtrees.
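
A generic sketch of the A* principle the abstract relies on, not the actual parser: items are explored in order of inside score plus an upper bound on the outside score, so the first complete item popped is optimal whenever the bound never underestimates. The expand, outside_bound, and is_complete callbacks are hypothetical placeholders.

```python
import heapq

def astar(initial_items, expand, outside_bound, is_complete):
    # Heap entries: (negated inside-plus-bound, tie-breaker, inside score, item).
    heap = [(-(s + outside_bound(it)), i, s, it) for i, (s, it) in enumerate(initial_items)]
    heapq.heapify(heap)
    counter = len(heap)
    while heap:
        _, _, inside, item = heapq.heappop(heap)
        if is_complete(item):
            return item                               # optimal under an admissible bound
        for new_inside, new_item in expand(item, inside):
            heapq.heappush(heap, (-(new_inside + outside_bound(new_item)),
                                  counter, new_inside, new_item))
            counter += 1
    return None
```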

29 citations


Proceedings ArticleDOI
01 Nov 2016
TL;DR: This paper presents a rewriting method that uses a two-stage decoding process, which proposes new words from the target theme and scores the resulting stories according to a number of factors defining aspects of syntactic, semantic, and thematic coherence.
Abstract: Texts present coherent stories that have a particular theme or overall setting, for example science fiction or western. In this paper, we present a text generation method called rewriting that edits existing human-authored narratives to change their theme without changing the underlying story. We apply the approach to math word problems, where it might help students stay more engaged by quickly transforming all of their homework assignments to the theme of their favorite movie without changing the math concepts that are being taught. Our rewriting method uses a two-stage decoding process, which proposes new words from the target theme and scores the resulting stories according to a number of factors defining aspects of syntactic, semantic, and thematic coherence. Experiments demonstrate that the final stories typically represent the new theme well while still testing the original math concepts, outperforming a number of baselines. We also release a new dataset of human-authored rewrites of math word problems in several themes.
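
A minimal sketch of a two-stage decode in the spirit of the abstract, illustrative only: stage one proposes substitutions drawn from a target-theme lexicon, stage two rescores whole candidate stories with a weighted sum of coherence features. The theme_lexicon, feature functions, weights, and beam size are assumptions.

```python
def rewrite(story_tokens, theme_lexicon, features, weights, beam=5):
    candidates = [list(story_tokens)]
    for i, word in enumerate(story_tokens):
        proposals = theme_lexicon.get(word, [])      # stage 1: theme words that could replace `word`
        new_candidates = []
        for cand in candidates:
            new_candidates.append(cand)
            for repl in proposals:
                new_candidates.append(cand[:i] + [repl] + cand[i + 1:])
        # Stage 2: keep the rewrites that score best under the coherence model.
        scored = sorted(new_candidates,
                        key=lambda c: sum(w * f(c) for w, f in zip(weights, features)),
                        reverse=True)
        candidates = scored[:beam]
    return candidates[0]
```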

28 citations


Proceedings ArticleDOI
01 Aug 2016
TL;DR: An ILP is introduced that jointly models sentence- and discourse-level sentiment cues, factual evidence about entity factions, and global constraints based on social science theories such as homophily, social balance, and reciprocity, which allows for rich inference across groups of entities.
Abstract: We present a new approach for document-level sentiment inference, where the goal is to predict directed opinions (who feels positively or negatively towards whom) for all entities mentioned in a text. To encourage more complete and consistent predictions, we introduce an ILP that jointly models (1) sentence- and discourse-level sentiment cues, (2) factual evidence about entity factions, and (3) global constraints based on social science theories such as homophily, social balance, and reciprocity. Together, these cues allow for rich inference across groups of entities, including for example that CEOs and the companies they lead are likely to have similar sentiment towards others. We evaluate performance on new, densely labeled data that provides supervision for all pairs, complementing previous work that only labeled pairs mentioned in the same sentence. Experiments demonstrate that the global model outperforms sentence-level baselines, by providing more coherent predictions across sets of related entities.
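
A toy ILP in the spirit of the joint model, not the paper's formulation, written with the PuLP library. Binary variables mark "a feels positively toward b"; the objective follows local cue scores, and a hard reciprocity constraint stands in for the paper's softer social-science constraints. The entities and scores are made up.

```python
from pulp import LpProblem, LpVariable, LpMaximize, lpSum

entities = ["ceo", "company", "rival"]
cue_score = {("ceo", "company"): 0.9, ("company", "ceo"): 0.7,
             ("ceo", "rival"): -0.8, ("rival", "ceo"): -0.5,
             ("company", "rival"): -0.3, ("rival", "company"): -0.2}

prob = LpProblem("document_sentiment", LpMaximize)
pos = {pair: LpVariable(f"pos_{pair[0]}_{pair[1]}", cat="Binary") for pair in cue_score}

# Objective: agree with sentence- and discourse-level cues where possible.
prob += lpSum(score * pos[pair] for pair, score in cue_score.items())

# Reciprocity (hard here for simplicity): a->b is positive iff b->a is positive.
for a, b in cue_score:
    if a < b:
        prob += pos[(a, b)] == pos[(b, a)]

prob.solve()
print({pair: int(var.value()) for pair, var in pos.items()})
```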

28 citations


Posted Content
TL;DR: In this article, a novel tensor composition function is introduced that learns to share examples across role-noun combinations, and the training data is semantically augmented with automatically gathered web examples of rarely observed outputs.
Abstract: Semantic sparsity is a common challenge in structured visual classification problems; when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects play within the activity. For this problem, we find empirically that most object-role combinations are rare, and current state-of-the-art models significantly underperform in this sparse data regime. We avoid many such errors by (1) introducing a novel tensor composition function that learns to share examples across role-noun combinations and (2) semantically augmenting our training data with automatically gathered examples of rarely observed outputs using web data. When integrated within a complete CRF-based structured prediction model, the tensor-based approach outperforms existing state of the art by a relative improvement of 2.11% and 4.40% on top-5 verb and noun-role accuracy, respectively. Adding 5 million images with our semantic augmentation techniques gives further relative improvements of 6.23% and 9.57% on top-5 verb and noun-role accuracy.
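
An illustrative sketch of a tensor composition scorer, not the paper's exact model: a role embedding and a noun embedding are composed through a shared three-way tensor, so evidence for a noun in one role can inform related role-noun pairs. The dimensions, initialization, and scoring head are assumptions.

```python
import torch
import torch.nn as nn

class TensorComposition(nn.Module):
    def __init__(self, n_roles, n_nouns, dim=64):
        super().__init__()
        self.role_emb = nn.Embedding(n_roles, dim)
        self.noun_emb = nn.Embedding(n_nouns, dim)
        self.tensor = nn.Parameter(torch.randn(dim, dim, dim) * 0.01)  # shared composition tensor
        self.score = nn.Linear(dim, 1)

    def forward(self, role_ids, noun_ids):
        r = self.role_emb(role_ids)                        # (batch, dim)
        n = self.noun_emb(noun_ids)                        # (batch, dim)
        composed = torch.einsum("bi,ijk,bj->bk", r, self.tensor, n)
        return self.score(composed).squeeze(-1)            # one score per role-noun pair
```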

22 citations


Posted Content
TL;DR: This paper presents a text generation method called rewriting, which edits existing human-authored narratives to change their theme without changing the underlying story, and applies it to math word problems, where it might help students stay more engaged by quickly transforming all of their homework assignments to the theme of their favorite movie without changing the math concepts that are being taught.
Abstract: Texts present coherent stories that have a particular theme or overall setting, for example science fiction or western. In this paper, we present a text generation method called rewriting that edits existing human-authored narratives to change their theme without changing the underlying story. We apply the approach to math word problems, where it might help students stay more engaged by quickly transforming all of their homework assignments to the theme of their favorite movie without changing the math concepts that are being taught. Our rewriting method uses a two-stage decoding process, which proposes new words from the target theme and scores the resulting stories according to a number of factors defining aspects of syntactic, semantic, and thematic coherence. Experiments demonstrate that the final stories typically represent the new theme well while still testing the original math concepts, outperforming a number of baselines. We also release a new dataset of human-authored rewrites of math word problems in several themes.

11 citations


Posted Content
TL;DR: This article introduced the first global recursive neural parsing model with optimality guarantees during decoding, which is trained with a new objective that encourages the parser to explore a tiny fraction of the search space.
Abstract: We introduce the first global recursive neural parsing model with optimality guarantees during decoding. To support global features, we give up dynamic programs and instead search directly in the space of all possible subtrees. Although this space is exponentially large in the sentence length, we show it is possible to learn an efficient A* parser. We augment existing parsing models, which have informative bounds on the outside score, with a global model that has loose bounds but only needs to model non-local phenomena. The global model is trained with a new objective that encourages the parser to explore a tiny fraction of the search space. The approach is applied to CCG parsing, improving state-of-the-art accuracy by 0.4 F1. The parser finds the optimal parse for 99.9% of held-out sentences, exploring on average only 190 subtrees.