Efficient Natural Language Response Suggestion for Smart Reply

Open AccessPosted Content

Efficient Natural Language Response Suggestion for Smart Reply

Matthew L. Henderson, +8 more

- 01 May 2017 -

arXiv: Computation and Language

Chats0

TLDR

A computationally efficient machine-learned method for natural language response suggestion using feed-forward neural networks using n-gram embedding features that achieves the same quality at a small fraction of the computational requirements and latency.

Abstract:

This paper presents a computationally efficient machine-learned method for natural language response suggestion. Feed-forward neural networks using n-gram embedding features encode messages into vectors which are optimized to give message-response pairs a high dot-product value. An optimized search finds response suggestions. The method is evaluated in a large-scale commercial e-mail application, Inbox by Gmail. Compared to a sequence-to-sequence approach, the new system achieves the same quality at a small fraction of the computational requirements and latency.

Citations

PDF

Open Access

More filters

Posted Content

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

Rongsheng Zhang, +5 more

- 20 Sep 2020 -

arXiv: Computation and Language

TL;DR: Automatic and manual evaluation indicates that the proposed novel data augmentation method for training open-domain dialogue models by utilizing unpaired data can produce high-quality dialogue pairs with diverse contents, and can improve the performance of competitive baselines.

...read moreread less

Proceedings ArticleDOI

Few-Shot Learning with Siamese Networks and Label Tuning

Thomas Mueller, +2 more

TL;DR: This work shows that with proper pre-training, Siamese Networks that embed texts and labels offer a competitive alternative in text classification, and introduces label tuning, a simple and computationally efficient approach that allows to adapt the models in a few-shot setup by only changing the label embeddings.

...read moreread less

Patent

Removing personal information from text using a neural network

Frederick William Poe Heckel, +1 more

TL;DR: In this paper, a neural network is used to remove personal information from text (such as names, addresses, credit card numbers, or social security numbers), and replace the personal information with a label indicating the type or class of the removed information.

...read moreread less

Proceedings Article

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Tianyu Gao, +2 more

TL;DR: This paper proposed a contrastive learning objective to regularize pre-trained embeddings' anisotropic space to be more uniform, and better aligns positive pairs when supervised signals are available.

...read moreread less

Posted Content

Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

Jianmo Ni, +6 more

- 19 Aug 2021 -

arXiv: Computation and Language

TL;DR: This paper provided the first exploration of text-to-text transformers (T5) sentence embeddings for language processing tasks and achieved state-of-the-art performance on transfer tasks and semantic textual similarity.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Posted Content

Efficient Estimation of Word Representations in Vector Space

Tomas Mikolov, +3 more

- 16 Jan 2013 -

arXiv: Computation and Language

TL;DR: This paper proposed two novel model architectures for computing continuous vector representations of words from very large data sets, and the quality of these representations is measured in a word similarity task and the results are compared to the previously best performing techniques based on different types of neural networks.

...read moreread less

Posted Content

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 10 Sep 2014 -

arXiv: Computation and Language

TL;DR: This paper presents a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure, and finds that reversing the order of the words in all source sentences improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.

...read moreread less

Collapse

Efficient Natural Language Response Suggestion for Smart Reply

Citations

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

Few-Shot Learning with Siamese Networks and Label Tuning

Removing personal information from text using a neural network

SimCSE: Simple Contrastive Learning of Sentence Embeddings

Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

References

Long short-term memory

Glove: Global Vectors for Word Representation

Efficient Estimation of Word Representations in Vector Space

Sequence to Sequence Learning with Neural Networks

TensorFlow: a system for large-scale machine learning

Related Papers (5)

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Attention is All you Need

A large annotated corpus for learning natural language inference

Glove: Global Vectors for Word Representation

Distributed Representations of Words and Phrases and their Compositionality