Neural Belief Tracker: Data-Driven Dialogue State Tracking

doi:10.18653/V1/P17-1163

Open AccessProceedings ArticleDOI

Neural Belief Tracker: Data-Driven Dialogue State Tracking

- Vol. 1, pp 1777-1788

TLDR

This work proposes a novel Neural Belief Tracking (NBT) framework which overcomes past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.

Abstract:

One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user’s goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users’ language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Hyperlex: A large-scale evaluation of graded lexical entailment

Ivan Vulić, +4 more

- 01 Dec 2017 -

Computational Linguistics

TL;DR: HyperLex is introduced—a data set and evaluation resource that quantifies the extent of the semantic category membership, that is, type-of relation, also known as hyponymy–hypernymy or lexical entailment (LE) relation between 2,616 concept pairs.

...read moreread less

Posted Content

End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis

Lin Xu, +5 more

- 30 Jan 2019 -

arXiv: Computation and Language

TL;DR: An End-to-End Knowledge-routed Relational Dialogue System (KR-DS) that seamlessly incorporates rich medical knowledge graph into the topic transition in dialogue management, and makes it cooperative with natural language understanding and natural language generation is proposed.

...read moreread less

Posted Content

Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question Answering.

Li Zhou, +1 more

- 07 Nov 2019 -

arXiv: Computation and Language

TL;DR: This paper proposes to model multi-domain dialogue state tracking as a question answering problem, referred to as Dialogue State Tracking via Question Answering (DSTQA), and uses a dynamically-evolving knowledge graph to explicitly learn relationships between (domain, slot) pairs.

...read moreread less

Posted Content

Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Osman Ramadan, +2 more

- 17 Jul 2018 -

arXiv: Computation and Language

TL;DR: In this article, a novel approach is introduced that fully utilizes semantic similarity between dialogue utterances and the ontology terms, allowing the information to be shared across domains, and demonstrates great capability in handling multi-domain dialogues, simultaneously outperforming existing state-of-theart models in single-domain dialogue tracking tasks.

...read moreread less

Proceedings ArticleDOI

Dialog state tracking, a machine reading approach using Memory Network

Julien Perez, +1 more

TL;DR: In this paper, an end-to-end memory network, MemN2N, was proposed to solve the problem of dialog state tracking using the general paradigm of machine reading.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Journal Article

Visualizing Data using t-SNE

Laurens van der Maaten, +1 more

- 01 Jan 2008 -

Journal of Machine Learning Research

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.

...read moreread less

Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

...read moreread less

Collapse

Neural Belief Tracker: Data-Driven Dialogue State Tracking

Citations

Hyperlex: A large-scale evaluation of graded lexical entailment

End-to-End Knowledge-Routed Relational Dialogue System for Automatic Diagnosis

Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question Answering.

Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing

Dialog state tracking, a machine reading approach using Memory Network

References

Adam: A Method for Stochastic Optimization

Dropout: a simple way to prevent neural networks from overfitting

Glove: Global Vectors for Word Representation

Visualizing Data using t-SNE

Rectified Linear Units Improve Restricted Boltzmann Machines

Related Papers (5)

The Second Dialog State Tracking Challenge

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Adam: A Method for Stochastic Optimization

A Network-based End-to-End Trainable Task-oriented Dialogue System

POMDP-Based Statistical Spoken Dialog Systems: A Review