Neural Belief Tracker: Data-Driven Dialogue State Tracking
Nikola Mrkšić, Diarmuid Ó Séaghdha, Tsung-Hsien Wen, Blaise Thomson, Steve Young, et al.
Vol. 1, pp. 1777–1788
TLDR
This work proposes a novel Neural Belief Tracking (NBT) framework which overcomes past limitations, matching the performance of state-of-the-art models that rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.
Abstract
One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user’s goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users’ language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.
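The core idea of the abstract, composing pre-trained word vectors into an utterance representation and scoring candidate slot-value pairs against it, can be illustrated with a minimal sketch. This is not the authors' architecture: the vocabulary, one-hot "embeddings", sum composition, and cosine scoring below are stand-ins chosen only to keep the example deterministic and self-contained (the NBT learns its composition and scoring functions over real distributed embeddings).

```python
import numpy as np

# Toy stand-ins for pre-trained word vectors. The paper composes real
# distributed embeddings; one-hot vectors are used here only to keep
# the sketch deterministic and self-contained.
vocab = ["i", "want", "cheap", "food", "price", "expensive"]
word_vec = {w: np.eye(len(vocab))[i] for i, w in enumerate(vocab)}

def utterance_rep(utterance):
    """Compose word vectors into a fixed-size utterance representation.
    The NBT learns this composition; a plain sum is the simplest stand-in."""
    return np.sum([word_vec[w] for w in utterance.lower().split()], axis=0)

def candidate_rep(slot, value):
    """Represent a candidate slot-value pair (e.g. price=cheap) by the
    word vectors of its slot and value names."""
    return word_vec[slot] + word_vec[value]

def score(utterance, slot, value):
    """Cosine similarity stands in for the learned scoring network that
    decides whether the candidate goal was expressed in the utterance."""
    u, c = utterance_rep(utterance), candidate_rep(slot, value)
    return float(u @ c / (np.linalg.norm(u) * np.linalg.norm(c)))

s_match = score("i want cheap food", "price", "cheap")   # shares "cheap"
s_other = score("i want cheap food", "price", "expensive")
```

With these toy vectors, the candidate whose value word appears in the utterance (`price=cheap`) scores strictly higher than one that does not (`price=expensive`), which is the behaviour the learned tracker is trained to produce over real embeddings.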
Citations
Proceedings ArticleDOI
A Network-based End-to-End Trainable Task-oriented Dialogue System
Tsung-Hsien Wen, David Vandyke, Nikola Mrkšić, Milica Gasic, Lina Maria Rojas-Barahona, Pei-Hao Su, Stefan Ultes, Steve Young, et al.
TL;DR: The authors introduced a neural network-based, text-in text-out, end-to-end trainable goal-oriented dialogue system, along with a new way of collecting dialogue data based on a novel pipelined Wizard-of-Oz framework.
Posted Content
The Natural Language Decathlon: Multitask Learning as Question Answering
Posted Content
Neural Approaches to Conversational AI
TL;DR: In this article, the authors present a survey of state-of-the-art neural approaches to conversational AI, and discuss the progress that has been made and challenges still being faced, using specific systems and models as case studies.
Proceedings ArticleDOI
MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
Paweł Budzianowski, Tsung-Hsien Wen, Bo-Hsiang Tseng, Iñigo Casanueva, Stefan Ultes, Osman Ramadan, Milica Gasic, et al.
TL;DR: The Multi-Domain Wizard-of-Oz dataset (MultiWOZ) is introduced: a fully-labeled collection of human-human written conversations spanning multiple domains and topics. At 10k dialogues, it is at least an order of magnitude larger than all previous annotated task-oriented corpora.
References
Proceedings ArticleDOI
Convolutional Neural Networks for Sentence Classification
TL;DR: A simple CNN trained on top of pre-trained word vectors improves upon the state of the art on 4 out of 7 tasks, including sentiment analysis and question classification; a proposed variant allows the use of both task-specific and static vectors.
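The mechanism this reference applies, convolution over a sequence of word vectors followed by max-over-time pooling, can be sketched as follows. The dimensions, random filter values, and tanh nonlinearity are illustrative assumptions, not the paper's exact configuration:

```python
import numpy as np

# Sketch of a CNN for sentence classification: slide filters of width n
# over the sequence of word vectors, then max-pool over time. Values are
# random for illustration; real models use pre-trained embeddings and
# learned filters.
rng = np.random.default_rng(1)
d, n, num_filters = 8, 3, 4          # embedding dim, filter width, #filters
sentence = rng.normal(size=(6, d))   # 6 words, each a d-dim vector
filters = rng.normal(size=(num_filters, n * d))
bias = np.zeros(num_filters)

def conv_max_pool(x):
    """Apply each filter to every window of n consecutive word vectors,
    then take the max activation over time (one feature per filter)."""
    windows = np.stack([x[i:i + n].ravel() for i in range(len(x) - n + 1)])
    feats = np.tanh(windows @ filters.T + bias)   # (num_windows, num_filters)
    return feats.max(axis=0)                      # (num_filters,)

features = conv_max_pool(sentence)   # fixed-size feature vector
```

Max-over-time pooling is what lets variable-length sentences map to a fixed-size feature vector that a classifier can consume, regardless of sentence length.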
Proceedings Article
Understanding the difficulty of training deep feedforward neural networks
Xavier Glorot, Yoshua Bengio
TL;DR: The objective is to better understand why standard gradient descent from random initialization performs so poorly on deep neural networks, in order to explain recent relative successes and help design better algorithms in the future.
Journal Article
Natural Language Processing (Almost) from Scratch
TL;DR: A unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including part-of-speech tagging, chunking, named entity recognition, and semantic role labeling is proposed.
Proceedings ArticleDOI
A Convolutional Neural Network for Modelling Sentences
TL;DR: A convolutional architecture dubbed the Dynamic Convolutional Neural Network (DCNN) is described for the semantic modelling of sentences; it induces a feature graph over the sentence that explicitly captures short- and long-range relations.