Neural Approaches to Conversational AI

Open AccessPosted Content

Neural Approaches to Conversational AI

- 21 Sep 2018 -

TLDR

In this article, the authors present a survey of state-of-the-art neural approaches to conversational AI, and discuss the progress that has been made and challenges still being faced, using specific systems and models as case studies.

Abstract:

The present paper surveys neural approaches to conversational AI that have been developed in the last few years. We group conversational systems into three categories: (1) question answering agents, (2) task-oriented dialogue agents, and (3) chatbots. For each category, we present a review of state-of-the-art neural approaches, draw the connection between them and traditional approaches, and discuss the progress that has been made and challenges still being faced, using specific systems and models as case studies.

Citations

PDF

Open Access

More filters

Posted Content

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Yizhe Zhang, +8 more

- 01 Nov 2019 -

arXiv: Computation and Language

TL;DR: The authors presented a large, tunable neural conversational response generation model, DialoGPT (dialogue generative pre-trained transformer) trained on 147M conversation-like exchanges extracted from Reddit comment chains over a period spanning from 2005 through 2017.

...read moreread less

Posted Content

Towards a Human-like Open-Domain Chatbot

Daniel Adiwardana, +10 more

- 27 Jan 2020 -

arXiv: Computation and Language

TL;DR: Meena, a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations, is presented and a human evaluation metric called Sensibleness and Specificity Average (SSA) is proposed, which captures key elements of a human-like multi- turn conversation.

...read moreread less

Journal ArticleDOI

Deep Learning--based Text Classification: A Comprehensive Review

Shervin Minaee, +5 more

- 17 Apr 2021 -

ACM Computing Surveys

TL;DR: This paper provided a comprehensive review of more than 150 deep learning-based models for text classification developed in recent years, and discussed their technical contributions, similarities, and strengths, and provided a quantitative analysis of the performance of different deep learning models on popular benchmarks.

...read moreread less

Posted Content

Multi-Task Deep Neural Networks for Natural Language Understanding

Xiaodong Liu, +3 more

- 31 Jan 2019 -

arXiv: Computation and Language

TL;DR: A Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks that allows domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations.

...read moreread less

Proceedings ArticleDOI

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Yizhe Zhang, +8 more

TL;DR: It is shown that conversational systems that leverage DialoGPT generate more relevant, contentful and context-consistent responses than strong baseline systems.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning as mentioned in this paper is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts, and it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

...read moreread less

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Collapse

Neural Approaches to Conversational AI

Citations

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Towards a Human-like Open-Domain Chatbot

Deep Learning--based Text Classification: A Comprehensive Review

Multi-Task Deep Neural Networks for Natural Language Understanding

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

References

Long short-term memory

Attention is All you Need

Generative Adversarial Nets

Deep Learning

Reinforcement Learning: An Introduction

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Bleu: a Method for Automatic Evaluation of Machine Translation

A Diversity-Promoting Objective Function for Neural Conversation Models

Attention is All you Need

Neural Machine Translation by Jointly Learning to Align and Translate