Adapting ranking SVM to document retrieval

doi:10.1145/1148170.1148205

Proceedings Article


Yunbo Cao, +5 more

- pp 186-193


TLDR

The paper modifies the hinge loss of conventional Ranking SVM for document retrieval and optimizes the modified loss with two methods: gradient descent and quadratic programming. Experimental results on two datasets show that the modified method can outperform conventional Ranking SVM and other existing methods for document retrieval.

Abstract:

The paper is concerned with applying learning to rank to document retrieval. Ranking SVM is a typical method of learning to rank. We point out that there are two factors one must consider when applying Ranking SVM, or more generally any "learning to rank" method, to document retrieval. First, correctly ranking documents at the top of the result list is crucial for an Information Retrieval system. One must conduct training in such a way that these top-ranked results are accurate. Second, the number of relevant documents can vary from query to query. One must avoid training a model biased toward queries with a large number of relevant documents. Previously, when existing methods, including Ranking SVM, were applied to document retrieval, neither of the two factors was taken into consideration. We show that it is possible to modify conventional Ranking SVM so that it is better suited to document retrieval. Specifically, we modify the "Hinge Loss" function in Ranking SVM to deal with the problems described above. We employ two methods to optimize the loss function: gradient descent and quadratic programming. Experimental results show that our method, referred to as Ranking SVM for IR, can outperform the conventional Ranking SVM and other existing methods for document retrieval on two datasets.
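The abstract does not spell out the modified loss, so the following is only a minimal sketch of one way a pairwise hinge loss could be reweighted to address the two factors and then minimized by (sub)gradient descent. The per-pair weight `tau` (larger for pairs that involve top-ranked documents), the per-query weight `mu` (larger for queries that contribute few pairs), the regularization constant `lam`, and all function names are illustrative assumptions, not the authors' notation or implementation.

```python
from collections import Counter


def query_weights(pair_query_ids):
    """Assumed normalization: each query's pairs are weighted by
    (largest pair count of any query) / (pair count of this query)."""
    counts = Counter(pair_query_ids)
    largest = max(counts.values())
    return {q: largest / n for q, n in counts.items()}


def weighted_hinge_loss(w, pairs, lam=0.01):
    """pairs: list of (diff, z, tau, mu), where diff = x_a - x_b and
    z = +1 if document a should rank above document b, else -1."""
    loss = lam * sum(wi * wi for wi in w)  # L2 regularization term
    for diff, z, tau, mu in pairs:
        margin = z * sum(wi * di for wi, di in zip(w, diff))
        loss += tau * mu * max(0.0, 1.0 - margin)  # weighted hinge
    return loss


def subgradient_step(w, pairs, lam=0.01, lr=0.1):
    """One subgradient descent step on weighted_hinge_loss."""
    grad = [2.0 * lam * wi for wi in w]
    for diff, z, tau, mu in pairs:
        margin = z * sum(wi * di for wi, di in zip(w, diff))
        if margin < 1.0:  # only margin-violating pairs contribute
            for j, dj in enumerate(diff):
                grad[j] -= tau * mu * z * dj
    return [wi - lr * gi for wi, gi in zip(w, grad)]


# Toy data: query q1 contributes three pairs, q2 only one.
mu = query_weights(["q1", "q1", "q1", "q2"])
pairs = [
    ([1.0, 0.0], 1, 2.0, mu["q1"]),  # tau=2.0: pair involving a top rank
    ([0.0, 1.0], 1, 1.0, mu["q2"]),  # q2 is up-weighted by mu
]
w0 = [0.0, 0.0]
w1 = subgradient_step(w0, pairs)
```

With weights like these, a violated pair at the top of the list costs more than one further down, and a query with few pairs is not drowned out by a query with many. The quadratic programming route mentioned in the abstract is not shown here.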



