A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

doi:10.1145/2661829.2661935

Proceedings ArticleDOI

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

Yelong Shen, +4 more

- pp 101-110

Chats0

TLDR

A new latent semantic model that incorporates a convolutional-pooling structure over word sequences to learn low-dimensional, semantic vector representations for search queries and Web documents is proposed.

Abstract:

In this paper, we propose a new latent semantic model that incorporates a convolutional-pooling structure over word sequences to learn low-dimensional, semantic vector representations for search queries and Web documents. In order to capture the rich contextual structures in a query or a document, we start with each word within a temporal context window in a word sequence to directly capture contextual features at the word n-gram level. Next, the salient word n-gram features in the word sequence are discovered by the model and are then aggregated to form a sentence-level feature vector. Finally, a non-linear transformation is applied to extract high-level semantic information to generate a continuous vector representation for the full text string. The proposed convolutional latent semantic model (CLSM) is trained on clickthrough data and is evaluated on a Web document ranking task using a large-scale, real-world data set. Results show that the proposed model effectively captures salient semantic information in queries and documents for the task while significantly outperforming previous state-of-the-art semantic models.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Hierarchical Attention Networks for Document Classification

Zichao Yang, +5 more

TL;DR: Experiments conducted on six large scale text classification tasks demonstrate that the proposed architecture outperform previous methods by a substantial margin.

...read moreread less

Proceedings Article

Character-level convolutional networks for text classification

Xiang Zhang, +2 more

TL;DR: In this paper, the use of character-level convolutional networks (ConvNets) for text classification has been explored and compared with traditional models such as bag of words, n-grams and their TFIDF variants.

...read moreread less

Journal ArticleDOI

Recent Trends in Deep Learning Based Natural Language Processing [Review Article]

Tom Young, +3 more

- 20 Jul 2018 -

IEEE Computational Intelligence Magazine

TL;DR: This paper reviews significant deep learning related models and methods that have been employed for numerous NLP tasks and provides a walk-through of their evolution.

...read moreread less

Proceedings Article

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

Bishan Yang, +4 more

TL;DR: It is found that embeddings learned from the bilinear objective are particularly good at capturing relational semantics and that the composition of relations is characterized by matrix multiplication.

...read moreread less

Proceedings ArticleDOI

Stacked Attention Networks for Image Question Answering

Zichao Yang, +4 more

TL;DR: In this paper, a stacked attention network (SAN) is proposed to learn to answer natural language questions from images by using semantic representation of a question as query to search for the regions in an image that are related to the answer.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Latent dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003 -

Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

...read moreread less

Proceedings Article

Latent Dirichlet Allocation

David M. Blei, +2 more

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).

...read moreread less

Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

Journal ArticleDOI

Indexing by Latent Semantic Analysis

Scott Deerwester, +4 more

- 01 Sep 1990 -

Journal of the Association for Informati...

TL;DR: A new method for automatic indexing and retrieval to take advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.

...read moreread less

Book

Learning Deep Architectures for AI

Yoshua Bengio

TL;DR: The motivations and principles regarding learning algorithms for deep architectures, in particular those exploiting as building blocks unsupervised learning of single-layer modelssuch as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks are discussed.

...read moreread less

Collapse

Related Papers (5)

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997 -

Neural Computation

arXiv: Computation and Language

A Latent Semantic Model with Convolutional-Pooling Structure for Information Retrieval

Citations

Hierarchical Attention Networks for Document Classification

Character-level convolutional networks for text classification

Recent Trends in Deep Learning Based Natural Language Processing [Review Article]

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

Stacked Attention Networks for Image Question Answering

References

Latent dirichlet allocation

Latent Dirichlet Allocation

Distributed Representations of Words and Phrases and their Compositionality

Indexing by Latent Semantic Analysis

Learning Deep Architectures for AI

Related Papers (5)

Glove: Global Vectors for Word Representation

Long short-term memory

Distributed Representations of Words and Phrases and their Compositionality

Convolutional Neural Networks for Sentence Classification

Efficient Estimation of Word Representations in Vector Space