A Hierarchical Recurrent Encoder-Decoder for Generative Context-Aware Query Suggestion
Alessandro Sordoni, Yoshua Bengio, Hossein Vahabi, Christina Lioma, Jakob Grue Simonsen, Jian-Yun Nie
pp. 553–562
TL;DR
This work presents a novel hierarchical recurrent encoder-decoder architecture that makes it possible to account for sequences of previous queries of arbitrary lengths and is sensitive to the order of queries in the context while avoiding data sparsity.
Abstract
Users may strive to formulate an adequate textual query for their information need. Search engines assist users by presenting query suggestions. To preserve the original search intent, suggestions should be context-aware and account for the previous queries issued by the user. Achieving context awareness is challenging due to data sparsity. We present a novel hierarchical recurrent encoder-decoder architecture that makes it possible to account for sequences of previous queries of arbitrary lengths. As a result, our suggestions are sensitive to the order of queries in the context while avoiding data sparsity. Additionally, our model can provide suggestions for rare, or long-tail, queries. The produced suggestions are synthetic and are sampled one word at a time, using computationally cheap decoding techniques. This is in contrast to current synthetic suggestion models relying upon machine learning pipelines and hand-engineered feature sets. Results show that our model outperforms existing context-aware approaches in a next-query-prediction setting. In addition to query suggestion, our architecture is general enough to be used in a variety of other applications.
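The sketch below follows the description in the abstract: a word-level GRU encodes each query, a session-level GRU consumes the sequence of query encodings, and a decoder initialized from the session state samples the suggestion one word at a time. This is a minimal illustrative sketch in PyTorch, not the authors' implementation; all module names, dimensions, and the greedy decoding loop are assumptions.

```python
# Minimal hierarchical recurrent encoder-decoder (HRED) sketch.
import torch
import torch.nn as nn

class HRED(nn.Module):
    def __init__(self, vocab_size, emb_dim=64, query_dim=128, session_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.query_enc = nn.GRU(emb_dim, query_dim, batch_first=True)        # word level
        self.session_enc = nn.GRU(query_dim, session_dim, batch_first=True)  # query level
        self.dec_init = nn.Linear(session_dim, query_dim)
        self.decoder = nn.GRU(emb_dim, query_dim, batch_first=True)
        self.out = nn.Linear(query_dim, vocab_size)

    def encode_session(self, queries):
        # queries: list of LongTensors, each (1, n_words) -- one per past query
        reps = []
        for q in queries:
            _, h = self.query_enc(self.embed(q))   # h: (1, 1, query_dim)
            reps.append(h.squeeze(0))              # (1, query_dim)
        reps = torch.stack(reps, dim=1)            # (1, n_queries, query_dim)
        _, s = self.session_enc(reps)              # s: (1, 1, session_dim)
        return s

    def suggest(self, queries, bos, eos, max_len=10):
        # Sample the suggestion one word at a time (greedy decoding here;
        # the paper uses cheap decoding techniques such as beam search).
        s = self.encode_session(queries)
        h = torch.tanh(self.dec_init(s))           # init decoder from session state
        word, out = torch.tensor([[bos]]), []
        for _ in range(max_len):
            o, h = self.decoder(self.embed(word), h)
            word = self.out(o[:, -1]).argmax(-1, keepdim=True)
            if word.item() == eos:
                break
            out.append(word.item())
        return out
```

Because the session-level GRU reads the query encodings in order, the sampled suggestion depends on the sequence of previous queries rather than on exact query co-occurrence counts, which is what sidesteps the sparsity problem described above.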
Citations
Proceedings Article
Building End-to-End Dialogue Systems Using Generative Hierarchical Neural Network Models
TL;DR: The authors extend the hierarchical recurrent encoder-decoder neural network to the dialogue domain, and demonstrate that this model is competitive with state-of-the-art neural language models and backoff n-gram models.
Posted Content
A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues
Iulian Vlad Serban, Alessandro Sordoni, Ryan Lowe, Laurent Charlin, Joelle Pineau, Aaron Courville, Yoshua Bengio
TL;DR: A neural network-based generative architecture with latent stochastic variables that span a variable number of time steps; the model improves upon recently proposed architectures, and the latent variables facilitate the generation of long outputs while maintaining the context.
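The latent-variable step summarized above can be illustrated with a small sketch: a Gaussian latent variable, sampled with the standard reparameterization trick, conditions the decoder alongside the hierarchical context. Module names, shapes, and dimensions below are assumptions, not the paper's code.

```python
# Sketch of a Gaussian latent variable bridging context and decoder.
import torch
import torch.nn as nn

class LatentBridge(nn.Module):
    def __init__(self, ctx_dim=256, z_dim=64):
        super().__init__()
        self.mu = nn.Linear(ctx_dim, z_dim)
        self.log_var = nn.Linear(ctx_dim, z_dim)

    def forward(self, context):
        # context: (batch, ctx_dim) session/dialogue state from the hierarchy
        mu, log_var = self.mu(context), self.log_var(context)
        z = mu + torch.exp(0.5 * log_var) * torch.randn_like(mu)  # reparameterized sample
        # KL term against a standard normal prior, added to the training loss
        kl = -0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp(), dim=-1)
        return z, kl
```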
Proceedings ArticleDOI
Learning to Match using Local and Distributed Representations of Text for Web Search
TL;DR: This work proposes a novel document ranking model composed of two separate deep neural networks: one that matches the query and the document using a local representation, and another that matches them using learned distributed representations; matching with distributed representations complements matching with traditional local representations.
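A rough sketch of the two-pathway idea: one score from exact-term overlap (local evidence), one from similarity of learned embeddings (distributed evidence), summed at the end. The actual model uses convolutional sub-networks; the stand-ins below are deliberate simplifications, and every name is illustrative.

```python
# Toy local + distributed matching, heavily simplified.
import torch
import torch.nn as nn

class TwoPathwayRanker(nn.Module):
    def __init__(self, vocab_size, emb_dim=100):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.local_score = nn.Linear(1, 1)  # stand-in for the local sub-network

    def forward(self, query_ids, doc_ids):
        # query_ids, doc_ids: 1-D LongTensors of word ids.
        # Local evidence: count of exact query-term matches in the document.
        overlap = (query_ids.unsqueeze(1) == doc_ids.unsqueeze(0)).float().sum()
        local = self.local_score(overlap.view(1, 1))
        # Distributed evidence: cosine similarity of mean word embeddings.
        q = self.embed(query_ids).mean(0)
        d = self.embed(doc_ids).mean(0)
        distributed = torch.cosine_similarity(q, d, dim=0)
        return local.squeeze() + distributed
```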
Proceedings ArticleDOI
Personalizing Session-based Recommendations with Hierarchical Recurrent Neural Networks
TL;DR: A seamless way to personalize RNN models with cross-session information transfer is proposed, and a hierarchical RNN model that relays and evolves latent hidden states of the RNNs across user sessions is devised.
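The cross-session mechanism can be shown in a few lines: a user-level GRU carries state from one session to the next and initializes each new session-level GRU, which is what "relays and evolves" refers to. Dimensions, names, and the toy data below are assumptions for illustration.

```python
# Sketch: user-level state personalizes and relays across sessions.
import torch
import torch.nn as nn

session_rnn = nn.GRU(input_size=64, hidden_size=128, batch_first=True)
user_rnn = nn.GRUCell(input_size=128, hidden_size=128)

# Toy data: two sessions of item embeddings for one user.
user_click_sequences = [torch.randn(1, 5, 64), torch.randn(1, 3, 64)]

user_state = torch.zeros(1, 128)                  # evolves across sessions
for session_items in user_click_sequences:
    h0 = user_state.unsqueeze(0)                  # personalize session init
    _, h_last = session_rnn(session_items, h0)
    user_state = user_rnn(h_last.squeeze(0), user_state)  # relay state onward
```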
References
Journal ArticleDOI
Long short-term memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
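For reference, these are the now-standard LSTM cell updates, in which the gates control what enters and leaves the constant-error carousel c_t. Note that the forget gate f_t was added in later work (Gers et al.), not in the 1997 paper itself; the notation is assumed.

```latex
\begin{aligned}
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) && \text{(input gate)}\\
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) && \text{(forget gate, later addition)}\\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) && \text{(output gate)}\\
c_t &= f_t \odot c_{t-1} + i_t \odot \tanh(W_c x_t + U_c h_{t-1} + b_c)\\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
```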
Journal ArticleDOI
Learning representations by back-propagating errors
TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector; as a result, internal hidden units come to represent important features of the task domain.
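The weight adjustment described above is gradient descent on an error measure; a standard form, with squared error and learning rate \eta (notation assumed, not quoted from the paper), is:

```latex
E = \tfrac{1}{2}\sum_j (y_j - d_j)^2,
\qquad
w_{ij} \leftarrow w_{ij} - \eta\,\frac{\partial E}{\partial w_{ij}}
```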
Posted Content
Efficient Estimation of Word Representations in Vector Space
TL;DR: Two novel model architectures for computing continuous vector representations of words from very large data sets are proposed; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best-performing techniques based on different types of neural networks.
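One of the two architectures, skip-gram, predicts context words from a center word. The sketch below uses negative sampling as the training loss, a cheap approximation introduced in the follow-up work rather than in this paper; vocabulary size, window handling, and dimensions are toy assumptions.

```python
# Compact skip-gram objective with negative sampling.
import torch
import torch.nn as nn
import torch.nn.functional as F

vocab_size, dim = 1000, 50
in_vecs = nn.Embedding(vocab_size, dim)   # center-word vectors
out_vecs = nn.Embedding(vocab_size, dim)  # context-word vectors

def skipgram_loss(center, context, num_neg=5):
    # center, context: LongTensors of word ids, shape (batch,)
    v = in_vecs(center)                                # (batch, dim)
    u_pos = out_vecs(context)                          # (batch, dim)
    pos = F.logsigmoid((v * u_pos).sum(-1))            # pull true pairs together
    neg_ids = torch.randint(0, vocab_size, (center.size(0), num_neg))
    u_neg = out_vecs(neg_ids)                          # (batch, num_neg, dim)
    neg = F.logsigmoid(-(u_neg @ v.unsqueeze(-1)).squeeze(-1)).sum(-1)
    return -(pos + neg).mean()
```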
Proceedings ArticleDOI
Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation
Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
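The joint training objective named in that summary, written in standard sequence-to-sequence notation (assumed here, not quoted from the paper), maximizes the log probability of each target given its source, decomposed word by word:

```latex
\max_\theta \; \frac{1}{N}\sum_{n=1}^{N} \log p_\theta\!\left(y^{(n)} \mid x^{(n)}\right),
\qquad
\log p_\theta(y \mid x) = \sum_{t=1}^{T} \log p_\theta(y_t \mid y_{<t}, c)
```

where c is the encoder's fixed-length summary of the source sequence x.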
Proceedings Article
Sequence to Sequence Learning with Neural Networks
TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.
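The setup that summary describes fits in a few lines: a deep LSTM encodes the source into fixed-size state vectors, and a second deep LSTM decodes the target conditioned on that state. Sizes, layer counts, and the toy inputs below are illustrative assumptions.

```python
# Sketch: deep LSTM encoder to fixed-size state, deep LSTM decoder from it.
import torch
import torch.nn as nn

emb = nn.Embedding(1000, 256)
encoder = nn.LSTM(256, 512, num_layers=4, batch_first=True)
decoder = nn.LSTM(256, 512, num_layers=4, batch_first=True)
proj = nn.Linear(512, 1000)

src = torch.randint(0, 1000, (1, 7))      # toy source token ids
tgt_in = torch.randint(0, 1000, (1, 5))   # shifted target ids (teacher forcing)
_, (h, c) = encoder(emb(src))             # fixed-dimensional summary of the source
out, _ = decoder(emb(tgt_in), (h, c))     # decode conditioned on that summary
logits = proj(out)                        # (1, 5, vocab) next-word scores
```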