The NarrativeQA Reading Comprehension Challenge

doi:10.1162/TACL_A_00023

Open AccessJournal ArticleDOI

The NarrativeQA Reading Comprehension Challenge

Tomáš Kočiský, +6 more

- 28 May 2018 -

Transactions of the Association for Comp...

- Vol. 6, pp 317-328

TLDR

A new dataset and set of tasks in which the reader must answer questions about stories by reading entire books or movie scripts are presented, designed so that successfully answering their questions requires understanding the underlying narrative rather than relying on shallow pattern matching or salience.

Abstract:

Reading comprehension (RC)—in contrast to information retrieval—requires integrating information and reasoning about events, entities, and their relations across a full document. Question answering...

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Natural Questions: A Benchmark for Question Answering Research

Tom Kwiatkowski, +17 more

- 02 Aug 2019 -

Transactions of the Association for Comp...

TL;DR: The Natural Questions corpus, a question answering data set, is presented, introducing robust metrics for the purposes of evaluating question answering systems; demonstrating high human upper bounds on these metrics; and establishing baseline results using competitive methods drawn from related literature.

...read moreread less

Journal ArticleDOI

CoQA: A Conversational Question Answering Challenge

Siva Reddy, +2 more

- 29 May 2019 -

Transactions of the Association for Comp...

TL;DR: The CoQA dataset as mentioned in this paper contains 127k questions with answers, obtained from 8k conversations about text passages from seven diverse domains, and the answers are free-form text with their corresponding evidence highlighted in the passage.

...read moreread less

Proceedings ArticleDOI

QuAC: Question Answering in Context

Eunsol Choi, +9 more

TL;DR: QuAC introduces challenges not found in existing machine comprehension datasets: its questions are often more open-ended, unanswerable, or only meaningful within the dialog context, as it shows in a detailed qualitative evaluation.

...read moreread less

Proceedings ArticleDOI

UNIFIEDQA: Crossing Format Boundaries with a Single QA System

Daniel Khashabi, +6 more

TL;DR: This work uses the latest advances in language modeling to build a single pre-trained QA model, UNIFIEDQA, that performs well across 19 QA datasets spanning 4 diverse formats, and results in a new state of the art on 10 factoid and commonsense question answering datasets.

...read moreread less

Posted Content

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

Gautier Izacard, +1 more

- 02 Jul 2020 -

arXiv: Computation and Language

TL;DR: Interestingly, it is observed that the performance of this method significantly improves when increasing the number of retrieved passages, evidence that sequence-to-sequence models offers a flexible framework to efficiently aggregate and combine evidence from multiple passages.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Latent dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Proceedings ArticleDOI

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

...read moreread less

Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

Proceedings Article

ROUGE: A Package for Automatic Evaluation of Summaries

Chin-Yew Lin

TL;DR: Four different RouGE measures are introduced: ROUGE-N, ROUge-L, R OUGE-W, and ROUAGE-S included in the Rouge summarization evaluation package and their evaluations.

...read moreread less

Collapse

The NarrativeQA Reading Comprehension Challenge

Citations

Natural Questions: A Benchmark for Question Answering Research

CoQA: A Conversational Question Answering Challenge

QuAC: Question Answering in Context

UNIFIEDQA: Crossing Format Boundaries with a Single QA System

Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering

References

Latent dirichlet allocation

Glove: Global Vectors for Word Representation

Bleu: a Method for Automatic Evaluation of Machine Translation

Sequence to Sequence Learning with Neural Networks

ROUGE: A Package for Automatic Evaluation of Summaries

Related Papers (5)

SQuAD: 100,000+ Questions for Machine Comprehension of Text

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Know What You Don't Know: Unanswerable Questions for SQuAD

Teaching machines to read and comprehend

Trending Questions (2)