Neural Approaches to Conversational AI

doi:10.1145/3209978.3210183

Open AccessProceedings ArticleDOI

Neural Approaches to Conversational AI

Jianfeng Gao, +2 more

- pp 1371-1374

Chats0

TLDR

This tutorial surveys neural approaches to conversational AI that were developed in the last few years, and presents a review of state-of-the-art neural approaches, drawing the connection between neural approaches and traditional symbolic approaches.

Abstract:

This tutorial surveys neural approaches to conversational AI that were developed in the last few years. We group conversational systems into three categories: (1) question answering agents, (2) task-oriented dialogue agents, and (3) social bots. For each category, we present a review of state-of-the-art neural approaches, draw the connection between neural approaches and traditional symbolic approaches, and discuss the progress we have made and challenges we are facing, using specific systems and models as case studies.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Proceedings ArticleDOI

Multi-Task Deep Neural Networks for Natural Language Understanding

Xiaodong Liu, +3 more

TL;DR: The authors proposed a multi-task deep neural network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks, which not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations to help adapt to new tasks and domains.

...read moreread less

Posted Content

Neural Approaches to Conversational AI

Jianfeng Gao, +2 more

- 21 Sep 2018 -

arXiv: Computation and Language

TL;DR: In this article, the authors present a survey of state-of-the-art neural approaches to conversational AI, and discuss the progress that has been made and challenges still being faced, using specific systems and models as case studies.

...read moreread less

Proceedings ArticleDOI

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Yizhe Zhang, +8 more

TL;DR: It is shown that conversational systems that leverage DialoGPT generate more relevant, contentful and context-consistent responses than strong baseline systems.

...read moreread less

Proceedings ArticleDOI

Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

Xin Wang, +7 more

TL;DR: In this paper, a reinforcement learning-based approach is proposed to enforce cross-modal grounding both locally and globally via reinforcement learning (RL), where a matching critic is used to provide an intrinsic reward to encourage global matching between instructions and trajectories.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Journal ArticleDOI

Generative Adversarial Nets

Ian Goodfellow, +7 more

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

Collapse

Neural Approaches to Conversational AI

Citations

Machine learning

Multi-Task Deep Neural Networks for Natural Language Understanding

Neural Approaches to Conversational AI

DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation

Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

References

Long short-term memory

Generative Adversarial Nets

Reinforcement Learning: An Introduction

Glove: Global Vectors for Word Representation

Distributed Representations of Words and Phrases and their Compositionality

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Attention is All you Need

Bleu: a Method for Automatic Evaluation of Machine Translation

Glove: Global Vectors for Word Representation

Adam: A Method for Stochastic Optimization