Learning to Rank for Information Retrieval

doi:10.1561/1500000016

Journal Article

Tie-Yan Liu

- 01 Mar 2009 -

Foundations and Trends in Information Re...

- Vol. 3, Iss: 3, pp 225-331

TLDR

A statistical ranking theory is introduced, which can describe different learning-to-rank algorithms, and be used to analyze their query-level generalization abilities.

Abstract:

Learning to rank for Information Retrieval (IR) is the task of automatically constructing a ranking model from training data, such that the model can sort new objects according to their degrees of relevance, preference, or importance. Many IR problems are by nature ranking problems, and many IR technologies can potentially be enhanced by learning-to-rank techniques. The objective of this tutorial is to give an introduction to this research direction. Specifically, the existing learning-to-rank algorithms are reviewed and categorized into three approaches: the pointwise, pairwise, and listwise approaches. The advantages and disadvantages of each approach are analyzed, and the relationships between the loss functions used in these approaches and IR evaluation measures are discussed. Then empirical evaluations of typical learning-to-rank methods are shown, with the LETOR collection as a benchmark dataset; the results seem to suggest that the listwise approach is the most effective of the three. After that, a statistical ranking theory is introduced, which can describe different learning-to-rank algorithms and be used to analyze their query-level generalization abilities. At the end of the tutorial, we provide a summary and discuss potential future work on learning to rank.
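To make the three-way categorization in the abstract concrete, here is a minimal sketch of one representative loss per approach for a single query. The specific loss forms (squared error, pairwise logistic loss, and a softmax cross entropy in the style of top-one listwise methods) are illustrative assumptions, not formulas taken from the abstract, and the function names are hypothetical.

```python
import math

def pointwise_loss(scores, labels):
    """Pointwise: treat each document independently (mean squared error here)."""
    return sum((s - y) ** 2 for s, y in zip(scores, labels)) / len(scores)

def pairwise_loss(scores, labels):
    """Pairwise: mean logistic loss over pairs (i, j) where document i is more relevant than j."""
    losses = [
        math.log1p(math.exp(-(scores[i] - scores[j])))
        for i in range(len(scores))
        for j in range(len(scores))
        if labels[i] > labels[j]
    ]
    return sum(losses) / len(losses) if losses else 0.0

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def listwise_loss(scores, labels):
    """Listwise: cross entropy between softmax(labels) and softmax(scores) over the whole list."""
    p = softmax([float(y) for y in labels])
    q = softmax(scores)
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

# Hypothetical query with three documents; higher label means more relevant.
labels = [2, 1, 0]
good_scores = [3.0, 1.0, 0.0]   # ranks documents in the labeled order
bad_scores = [0.0, 1.0, 3.0]    # ranks documents in the reverse order
```

In this sketch, the pointwise loss looks at one document at a time, the pairwise loss only at relative order within pairs, and the listwise loss at the full score list for the query; a model that orders documents correctly should receive a lower pairwise and listwise loss than one that reverses the order.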

Citations

Book

Learning to Rank for Information Retrieval

Tie-Yan Liu

TL;DR: Three major approaches to learning to rank are introduced, i.e., the pointwise, pairwise, and listwise approaches; the relationship between the loss functions used in these approaches and the widely used IR evaluation measures is analyzed; and the performance of these approaches on the LETOR benchmark datasets is evaluated.

Journal Article

Survey on deep learning with class imbalance

Justin M. Johnson, +1 more

- 01 Mar 2019 -

Journal of Big Data

TL;DR: Examination of existing deep learning techniques for addressing class imbalanced data finds that research in this area is very limited, that most existing work focuses on computer vision tasks with convolutional neural networks, and that the effects of big data are rarely considered.

Proceedings Article

Relative attributes

Devi Parikh, +1 more

TL;DR: This work proposes a generative model over the joint space of attribute ranking outputs, and proposes a novel form of zero-shot learning in which the supervisor relates the unseen object category to previously seen objects via attributes (for example, ‘bears are furrier than giraffes’).

Posted Content

Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval

Lee Xiong, +7 more

- 01 Jul 2020 -

arXiv: Information Retrieval

TL;DR: Approximate nearest neighbor Negative Contrastive Estimation (ANCE) is presented, a training mechanism that constructs negatives from an Approximate Nearest Neighbor (ANN) index of the corpus, which is updated in parallel with the learning process to select more realistic negative training instances.

Book

Learning to Rank for Information Retrieval and Natural Language Processing

Hang Li

TL;DR: The author explains several example applications of learning to rank including web search, collaborative filtering, definition search, keyphrase extraction, query dependent summarization, and re-ranking in machine translation.

References

Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

Statistical learning theory

Vladimir Vapnik

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning processes, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.

Journal Article

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Yoav Freund, +1 more

TL;DR: The model studied can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting, and it is shown that the multiplicative weight-update Littlestone-Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases, but applicable to a considerably more general class of learning problems.

Proceedings Article

The PageRank Citation Ranking: Bringing Order to the Web

Lawrence Page, +3 more

TL;DR: This paper describes PageRank, a method for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.

Journal Article

Indexing by Latent Semantic Analysis

Scott Deerwester, +4 more

- 01 Sep 1990 -

Journal of the Association for Informati...

TL;DR: A new method for automatic indexing and retrieval is described that takes advantage of implicit higher-order structure in the association of terms with documents (“semantic structure”) in order to improve the detection of relevant documents on the basis of terms found in queries.

ACM Transactions on Information Systems

An efficient boosting algorithm for combining preferences

Yoav Freund, +3 more

- 01 Dec 2003 -

Journal of Machine Learning Research

Related Papers (5)

Optimizing search engines using clickthrough data

Learning to rank using gradient descent

Learning to rank: from pairwise approach to listwise approach

Cumulated gain-based evaluation of IR techniques

An efficient boosting algorithm for combining preferences