Proceedings Article

Deep Learning for Information Retrieval

Hang Li, +1 more
pp. 1203–1206
TLDR
This tutorial aims at summarizing and introducing the results of recent research on deep learning for information retrieval, in order to stimulate and foster more significant research and development work on the topic in the future.
Abstract
Recent years have seen significant progress in information retrieval and natural language processing, with deep learning technologies successfully applied to almost all of their major tasks. The key to the success of deep learning is its capability of accurately learning distributed representations (vector representations or structured arrangements of them) of natural language expressions such as sentences, and of effectively utilizing those representations in the tasks. This tutorial aims at summarizing and introducing the results of recent research on deep learning for information retrieval, in order to stimulate and foster more significant research and development work on the topic in the future. The tutorial mainly consists of three parts. In the first part, we introduce the fundamental techniques of deep learning for natural language processing and information retrieval, such as word embedding, recurrent neural networks, and convolutional neural networks. In the second part, we explain how deep learning, particularly representation learning techniques, can be utilized in fundamental NLP and IR problems, including matching, translation, classification, and structured prediction. In the third part, we describe in detail how deep learning can be used in specific application tasks: search, question answering (from documents, databases, or knowledge bases), and image retrieval.
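To make the idea of learning distributed representations and using them for matching concrete, here is a deliberately minimal sketch: sentences are encoded by averaging word vectors (a toy stand-in for the RNN/CNN encoders the tutorial covers), and query-document relevance is scored by cosine similarity. The vocabulary and the random embedding values are illustrative assumptions, not the tutorial's actual models.

```python
import numpy as np

# Toy word embeddings: in practice these are learned; the random
# values here are an assumption purely for illustration.
rng = np.random.default_rng(0)
vocab = ["deep", "learning", "neural", "information", "retrieval", "search"]
emb = {w: rng.normal(size=8) for w in vocab}

def encode(text):
    """Average word vectors into a sentence vector (a deliberately
    simple stand-in for an RNN/CNN sentence encoder)."""
    vecs = [emb[w] for w in text.lower().split() if w in emb]
    return np.mean(vecs, axis=0)

def match_score(query, doc):
    """Cosine similarity between the two distributed representations."""
    q, d = encode(query), encode(doc)
    return float(q @ d / (np.linalg.norm(q) * np.linalg.norm(d)))

print(match_score("deep learning", "neural information retrieval"))
```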


Citations
Posted Content

Neural Models for Information Retrieval

TL;DR: This tutorial introduces the basic concepts and intuitions behind neural IR models and places them in the context of traditional retrieval models, covering fundamental concepts of IR and both neural and non-neural approaches to learning vector representations of text.
Posted Content

Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting

TL;DR: Topics covered include user response prediction, bid landscape forecasting, bidding algorithms, revenue optimization, statistical arbitrage, dynamic pricing, and ad fraud detection, making this an invaluable text for researchers and practitioners alike.
Journal Article

Best Match: New relevance search for PubMed.

TL;DR: This work presents Best Match, a new relevance search algorithm for PubMed that leverages the intelligence of users and cutting-edge machine-learning technology as an alternative to the traditional date sort order.
Journal Article

Adversarial Transfer Learning for Deep Learning Based Automatic Modulation Classification

TL;DR: An adversarial transfer learning architecture (ATLA), incorporating adversarial training and knowledge transfer in a unified way, is proposed; it substantially boosts the performance of the target model, which outperforms the existing parameter-transfer approach.
References
Journal Article

Long short-term memory

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
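The "constant error carousel" refers to the additive cell-state update, through which error signals can flow across many time steps without vanishing. Below is a minimal single-step LSTM cell in NumPy as a sketch; the gate layout is the now-standard variant with a forget gate (added after the original 1997 paper), and the random parameters are stand-ins for trained weights.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, b):
    """One LSTM step. The cell state c is updated additively (the
    'constant error carousel'), so error can flow across long time lags."""
    z = W @ np.concatenate([x, h_prev]) + b        # all gate pre-activations
    i, f, o, g = np.split(z, 4)
    i, f, o = sigmoid(i), sigmoid(f), sigmoid(o)   # input/forget/output gates
    c = f * c_prev + i * np.tanh(g)                # additive carousel update
    h = o * np.tanh(c)                             # gated output
    return h, c

n_in, n_hid = 4, 8
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(4 * n_hid, n_in + n_hid))
b = np.zeros(4 * n_hid)
h = c = np.zeros(n_hid)
for x in rng.normal(size=(1000, n_in)):  # 1000 steps, echoing the paper's lags
    h, c = lstm_step(x, h, c, W, b)
print(h[:3])
```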
Journal Article

Deep learning

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.
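As a sketch of the negative-sampling objective named in this summary: for one observed (word, context) pair, the loss rewards a high score for the true context and low scores for k sampled "negative" contexts, replacing the full hierarchical softmax with a handful of binary logistic terms. The toy vectors below are random stand-ins for learned embeddings, and the negatives are drawn at random rather than from the unigram^(3/4) distribution the paper uses.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def neg_sampling_loss(v_w, v_c, negatives):
    """Skip-gram negative-sampling loss for one (word, context) pair:
    -log sigma(v_c . v_w) - sum_k log sigma(-v_k . v_w)."""
    pos = -np.log(sigmoid(v_c @ v_w))              # push the true context up
    neg = -sum(np.log(sigmoid(-v_k @ v_w))         # push sampled negatives down
               for v_k in negatives)
    return pos + neg

rng = np.random.default_rng(0)
dim, k = 16, 5                               # k negative samples per positive pair
v_w = rng.normal(scale=0.1, size=dim)        # input vector of the centre word
v_c = rng.normal(scale=0.1, size=dim)        # output vector of the true context
negs = rng.normal(scale=0.1, size=(k, dim))  # stand-ins for sampled negatives
print(neg_sampling_loss(v_w, v_c, negs))
```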
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: This paper proposes two novel model architectures for computing continuous vector representations of words from very large data sets; the quality of these representations is measured in a word similarity task, and the results are compared to the previously best-performing techniques based on different types of neural networks.
Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend it by allowing the model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
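The (soft-)search described here is what is now called additive attention: for each target word, the decoder scores every encoder state, normalizes the scores with a softmax, and takes the weighted sum of encoder states as a context vector. The sketch below shows a single decoder step with random parameters standing in for trained ones; the dimensions are arbitrary assumptions.

```python
import numpy as np

def additive_attention(s, H, W_s, W_h, v):
    """Bahdanau-style attention: score each encoder state h_j against the
    decoder state s with e_j = v . tanh(W_s s + W_h h_j), softmax the
    scores, and return the weighted sum of encoder states (the context)."""
    e = np.array([v @ np.tanh(W_s @ s + W_h @ h_j) for h_j in H])
    a = np.exp(e - e.max())
    a /= a.sum()            # soft alignment: one weight per source position
    return a @ H, a         # context vector and the attention weights

rng = np.random.default_rng(0)
d_h, d_s, d_a, src_len = 8, 8, 8, 6
H = rng.normal(size=(src_len, d_h))  # encoder hidden states, one per source word
s = rng.normal(size=d_s)             # current decoder hidden state
W_s = rng.normal(scale=0.1, size=(d_a, d_s))
W_h = rng.normal(scale=0.1, size=(d_a, d_h))
v = rng.normal(scale=0.1, size=d_a)
context, weights = additive_attention(s, H, W_s, W_h, v)
print(weights.round(3), weights.sum())   # weights over source words, sum to 1
```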