Noise-Contrastive Estimation for Answer Selection with Deep Neural Networks

doi:10.1145/2983323.2983872

Proceedings Article

Jinfeng Rao, +2 more

- pp 1913-1916

TLDR

The Noise-Contrastive Estimation approach is extended with a triplet ranking loss function to exploit interactions in triplet inputs over the question paired with positive and negative examples and achieves state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.

Abstract:

We study answer selection for question answering: given a question and a set of candidate answer sentences, the goal is to identify the subset that contains the answer. Unlike previous work that treats this task as a straightforward pointwise classification problem, we model it as a ranking task and propose a pairwise ranking approach that can directly exploit existing pointwise neural network models as base components. We extend the Noise-Contrastive Estimation approach with a triplet ranking loss function to exploit interactions in triplet inputs over the question paired with positive and negative examples. Experiments on the TrecQA and WikiQA datasets show that our approach achieves state-of-the-art effectiveness without the need for external knowledge sources or feature engineering.
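To make the triplet idea in the abstract concrete, here is a minimal sketch of a max-margin ranking loss over (question, positive answer, negative answer) triplets. It assumes the common hinge form, max(0, margin - s(q, a+) + s(q, a-)); the abstract does not give the exact formulation, so the margin value, the function names, and the toy `overlap_score` scorer (a stand-in for a pointwise neural model) are all illustrative assumptions rather than the authors' implementation.

```python
from typing import Callable, Sequence


def triplet_hinge_loss(pos_score: float, neg_score: float, margin: float = 1.0) -> float:
    """Hinge loss for one triplet: zero once the positive answer
    outscores the negative answer by at least `margin` (assumed form)."""
    return max(0.0, margin - pos_score + neg_score)


def mean_triplet_loss(
    score: Callable[[str, str], float],
    question: str,
    positives: Sequence[str],
    negatives: Sequence[str],
    margin: float = 1.0,
) -> float:
    """Average the hinge loss over every (positive, negative) pairing
    for a single question. Returns 0.0 if no triplet can be formed."""
    losses = [
        triplet_hinge_loss(score(question, pos), score(question, neg), margin)
        for pos in positives
        for neg in negatives
    ]
    return sum(losses) / len(losses) if losses else 0.0


def overlap_score(question: str, answer: str) -> float:
    """Hypothetical pointwise scorer: fraction of question words that
    also appear in the answer. A real system would use a neural model."""
    q_words = set(question.lower().split())
    a_words = set(answer.lower().split())
    return len(q_words & a_words) / len(q_words) if q_words else 0.0


if __name__ == "__main__":
    loss = mean_triplet_loss(
        overlap_score,
        "who wrote hamlet",
        positives=["shakespeare wrote hamlet"],
        negatives=["paris is in france"],
    )
    print(f"mean triplet loss: {loss:.3f}")
```

The scorer is passed in as a plain function, which mirrors the abstract's point that an existing pointwise model can serve as the base component of the pairwise ranking setup: only the loss changes, not the model that scores each question-answer pair.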

Citations

Posted Content

Bilateral Multi-Perspective Matching for Natural Language Sentences

Zhiguo Wang, +2 more

- 13 Feb 2017 -

arXiv: Artificial Intelligence

TL;DR: This work proposes a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework that achieves the state-of-the-art performance on all tasks.

Journal Article

A Deep Look into neural ranking models for information retrieval

Jiafeng Guo, +8 more

- 01 Nov 2020 -

Information Processing and Management

TL;DR: A deep look into the neural ranking models from different dimensions is taken to analyze their underlying assumptions, major design principles, and learning strategies to obtain a comprehensive empirical understanding of the existing techniques.

Proceedings Article

Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting

J. Edward Hu, +6 more

TL;DR: The authors describe vectorized dynamic beam allocation, which extends work in lexically-constrained decoding to work with batching, leading to a five-fold improvement in throughput when working with positive constraints.

Proceedings Article

Polisis: Automated Analysis and Presentation of Privacy Policies Using Deep Learning

Hamza Harkous, +5 more

TL;DR: In this paper, the authors propose an automated framework for privacy policy analysis, called Polisis, which enables scalable, dynamic, and multi-dimensional queries on privacy policies, and demonstrate its modularity and utility with two robust applications that support structured and free-form querying.

Posted Content

Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information

Seonhoon Kim, +2 more

- 29 May 2018 -

arXiv: Computation and Language

TL;DR: The authors proposed a densely-connected co-attentive recurrent neural network (C-RNN), which uses concatenated information of attentive features as well as hidden features of all the preceding recurrent layers.

References

Proceedings Article

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global log-bilinear regression model that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

Proceedings Article

Signature Verification using a "Siamese" Time Delay Neural Network

Jane Bromley, +4 more

TL;DR: An algorithm for verification of signatures written on a pen-input tablet based on a novel, artificial neural network called a "Siamese" neural network, which consists of two identical sub-networks joined at their outputs.

Journal Article

Signature verification using a “siamese” time delay neural network

Jane Bromley, +10 more

- 01 Jan 1993 -

International Journal of Pattern Recogni...

TL;DR: In this article, a Siamese time delay neural network is used to measure the similarity between pairs of signatures, and the output of this half network is the feature vector for the input signature.

Journal Article

ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs

Wenpeng Yin, +3 more

- 23 Jun 2016 -

Transactions of the Association for Comp...

TL;DR: This paper proposes three attention schemes that integrate the mutual influence between sentences into CNNs, so that the representation of each sentence takes its counterpart into consideration, and achieves state-of-the-art performance on answer selection, paraphrase identification, and textual entailment.

Related Papers (5)

What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA

WikiQA: A Challenge Dataset for Open-Domain Question Answering

Learning to Rank Short Text Pairs with Convolutional Deep Neural Networks

Glove: Global Vectors for Word Representation

Multi-Perspective Sentence Similarity Modeling with Convolutional Neural Networks