Learning to Ask: Neural Question Generation for Reading Comprehension
Xinya Du, Junru Shao, Claire Cardie
ACL 2017, Vol. 1, pp. 1342–1352
TL;DR
This paper proposes an attention-based sequence learning model for question generation from text passages in reading comprehension, which is trainable end-to-end via sequence-to-sequence learning and significantly outperforms the state-of-the-art rule-based system.
Abstract
We study automatic question generation for sentences from text passages in reading comprehension. We introduce an attention-based sequence learning model for the task and investigate the effect of encoding sentence- vs. paragraph-level information. In contrast to all previous work, our model does not rely on hand-crafted rules or a sophisticated NLP pipeline; it is instead trainable end-to-end via sequence-to-sequence learning. Automatic evaluation results show that our system significantly outperforms the state-of-the-art rule-based system. In human evaluations, questions generated by our system are also rated as being more natural (i.e., grammaticality, fluency) and as more difficult to answer (in terms of syntactic and lexical divergence from the original text and reasoning needed to answer).
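To make the architecture concrete, here is a minimal numpy sketch of the decoder side of such an attention-based sequence-to-sequence model: at each step it attends over the encoder states, mixes the resulting context into its hidden state, and greedily emits the next question token. All sizes, weights, and the dot-product attention scoring are illustrative assumptions (the paper builds on the additive attention of Bahdanau et al., sketched under References below); this is not the authors' implementation.

    import numpy as np

    rng = np.random.default_rng(0)
    V, d, T = 50, 16, 8                         # toy vocab size, hidden size, source length
    E = rng.normal(scale=0.1, size=(V, d))      # embedding table
    W_in, W_rec, W_ctx = (rng.normal(scale=0.1, size=(d, d)) for _ in range(3))
    W_vocab = rng.normal(scale=0.1, size=(d, V))
    H = rng.normal(size=(T, d))                 # stand-in for encoder states over the source sentence

    def softmax(x):
        e = np.exp(x - x.max())
        return e / e.sum()

    def decode_step(y_prev, s):
        alpha = softmax(H @ s)                  # attention weights over source positions (dot-product variant)
        c = alpha @ H                           # context vector: weighted sum of encoder states
        s = np.tanh(E[y_prev] @ W_in + s @ W_rec + c @ W_ctx)
        return s, softmax(s @ W_vocab)          # new state, distribution over next token

    y, s = 0, np.zeros(d)                       # token 0 plays the role of <sos>
    question = []
    for _ in range(10):                         # greedy decoding of a 10-token "question"
        s, p = decode_step(y, s)
        y = int(p.argmax())
        question.append(y)
    print(question)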
Citations
Journal Article
Natural Language Generation Using Deep Learning to Support MOOC Learners
Chenglu Li, Wanli Xing, et al.
TL;DR: Examines the extent to which deep-learning-based natural language generation (NLG) models can offer responses similar to human-generated responses to learners in MOOC forums, suggesting that the GPT-2 model can provide learners with emotional and community support comparable to that of humans through contextual replies.
Posted Content
Multi-Task Learning with Language Modeling for Question Generation
TL;DR: This paper proposes to incorporate an auxiliary language-modeling task to help question generation, in a hierarchical multi-task learning structure built on an attention-based pointer-generator model, and develops a joint-learning model.
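A hierarchical multi-task setup of this kind ultimately reduces to optimizing a weighted sum of the task losses. A minimal sketch, where the weighting scheme and the lambda value are assumptions rather than the paper's choices:

    # Hypothetical joint objective for multi-task QG + language modeling.
    def multitask_loss(loss_qg: float, loss_lm: float, lam: float = 0.5) -> float:
        """Question-generation loss plus a down-weighted auxiliary LM loss."""
        return loss_qg + lam * loss_lm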
Posted Content
Fluent Response Generation for Conversational Question Answering
TL;DR: This work proposes a method for situating QA responses within a SEQ2SEQ NLG approach to generate fluent grammatical answer responses while maintaining correctness, and develops Syntactic Transformations (STs) to produce question-specific candidate answer responses and rank them using a BERT-based classifier.
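The generate-then-rank step this describes can be sketched generically; `score` below is a hypothetical stand-in for the BERT-based classifier, not the paper's model:

    # Rank ST-generated candidate answer responses by a learned scorer.
    def rank_responses(candidates: list[str], score) -> list[str]:
        """Return candidate responses sorted best-scoring first."""
        return sorted(candidates, key=score, reverse=True)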
Proceedings Article
What's The Latest? A Question-driven News Chatbot
TL;DR: The algorithmic framework for an automatic news chatbot is described and the results of a usability study are presented that shows that news readers using the system successfully engage in multi-turn conversations about specific news stories.
Posted Content
Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations
TL;DR: To generate pragmatic questions, reinforcement learning is used to optimize an informativeness metric the authors propose, combined with a reward function designed to promote more specific questions; the resulting pragmatic questioner is shown to substantially improve the informativeness and specificity of generated questions over a baseline model.
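In spirit, optimizing a questioner against such a reward is a policy-gradient (REINFORCE-style) update. A minimal sketch, where the reward is a placeholder scalar rather than the paper's informativeness metric:

    import numpy as np

    def reinforce_surrogate(log_probs: np.ndarray, reward: float, baseline: float = 0.0) -> float:
        """Surrogate loss whose gradient is the REINFORCE estimator: minimizing it
        raises the log-probability of questions earning above-baseline reward."""
        return -(reward - baseline) * float(np.sum(log_probs))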
References
Journal Article
Long Short-Term Memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
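A minimal numpy sketch of one LSTM step; the additive update of the cell state c is the constant error carousel that lets gradients survive long lags. Gate layout and initialization here are generic assumptions, and the forget gate is a later addition to the 1997 architecture:

    import numpy as np

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def lstm_step(x, h, c, W, U, b):
        """One LSTM step; W, U, b stack the input (i), forget (f),
        output (o), and candidate (g) transforms."""
        i, f, o, g = np.split(W @ x + U @ h + b, 4)
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)   # additive cell update: the error carousel
        h = sigmoid(o) * np.tanh(c)
        return h, c

    d = 8
    rng = np.random.default_rng(1)
    W, U = rng.normal(scale=0.1, size=(4 * d, d)), rng.normal(scale=0.1, size=(4 * d, d))
    b, h, c = np.zeros(4 * d), np.zeros(d), np.zeros(d)
    for x in rng.normal(size=(1000, d)):               # state carried across a 1000-step sequence
        h, c = lstm_step(x, h, c, W, U, b)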
Proceedings Article
GloVe: Global Vectors for Word Representation
TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.
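The objective behind that description is a weighted least-squares fit of word-vector dot products to log co-occurrence counts. A sketch of the per-pair term, using the x_max = 100 and alpha = 0.75 defaults reported in the GloVe paper (everything else is a toy stand-in):

    import numpy as np

    def glove_pair_loss(w_i, w_j, b_i, b_j, x_ij, x_max=100.0, alpha=0.75):
        """Weighted least-squares term for one co-occurrence count x_ij."""
        f = min((x_ij / x_max) ** alpha, 1.0)                 # clipped weighting function
        return f * (w_i @ w_j + b_i + b_j - np.log(x_ij)) ** 2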
Proceedings Article
BLEU: a Method for Automatic Evaluation of Machine Translation
TL;DR: This paper proposes a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, correlates highly with human evaluation, and has little marginal cost per run.
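The two ingredients of the method are clipped n-gram precision and a brevity penalty. A compact single-reference, unsmoothed sketch (illustrative, not a drop-in replacement for a standard BLEU scorer):

    import math
    from collections import Counter

    def ngrams(tokens, n):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    def bleu(candidate, reference, max_n=4):
        precisions = []
        for n in range(1, max_n + 1):
            cand, ref = ngrams(candidate, n), ngrams(reference, n)
            overlap = sum(min(c, ref[g]) for g, c in cand.items())    # clipped counts
            precisions.append(overlap / max(sum(cand.values()), 1))
        if min(precisions) == 0:
            return 0.0
        bp = min(1.0, math.exp(1 - len(reference) / len(candidate)))  # brevity penalty
        return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)

    print(bleu("the quick brown fox jumps over the dog".split(),
               "the quick brown fox jumps over the lazy dog".split()))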
Proceedings Article
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of the basic encoder-decoder architecture, and it is proposed to extend it by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
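The (soft-)search it proposes is additive attention: the decoder scores every encoder state against its previous hidden state and takes a softmax-weighted sum as the context. A minimal sketch with arbitrary dimensions and initialization:

    import numpy as np

    rng = np.random.default_rng(0)
    d = 16
    Wa, Ua = rng.normal(scale=0.1, size=(d, d)), rng.normal(scale=0.1, size=(d, d))
    v = rng.normal(scale=0.1, size=d)

    def additive_attention(s_prev, H):
        """e_ij = v^T tanh(W s_{i-1} + U h_j); softmax over j; context c_i = sum_j a_ij h_j."""
        e = np.array([v @ np.tanh(Wa @ s_prev + Ua @ h) for h in H])  # alignment scores
        a = np.exp(e - e.max())
        a /= a.sum()                                                  # softmax weights
        return a @ H                                                  # context vector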
Proceedings Article
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
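That training criterion is ordinary conditional maximum likelihood, i.e., minimizing the per-token cross-entropy of the target sequence given the source. A generic sketch, not the paper's code:

    import numpy as np

    def seq_nll(step_probs, target_ids):
        """Negative conditional log-likelihood of a target sequence:
        -sum_t log p(y_t | y_<t, source), where step_probs[t] is the
        decoder's vocabulary distribution at step t."""
        return -sum(np.log(p[y]) for p, y in zip(step_probs, target_ids))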