
Optimizing ranking functions: a connectionist approach to adaptive information retrieval

TLDR
This dissertation examines the use of adaptive methods to automatically improve the performance of ranked text retrieval systems, and proposes and empirically validates general adaptive methods which improve the ability of a large class of retrieval systems to rank documents effectively.
Abstract
This dissertation examines the use of adaptive methods to automatically improve the performance of ranked text retrieval systems. The goal of a ranked retrieval system is to manage a large collection of text documents and to order documents for a user based on the estimated relevance of the documents to the user's information need (or query). The ordering enables the user to quickly find documents of interest. Ranked retrieval is a difficult problem because of the ambiguity of natural language, the large size of the collections, and the varying needs of users and varying characteristics of collections. We propose and empirically validate general adaptive methods which improve the ability of a large class of retrieval systems to rank documents effectively. Our main adaptive method is to numerically optimize free parameters in a retrieval system by minimizing a non-metric criterion function. The criterion measures how well the system is ranking documents relative to a target ordering, defined by a set of training queries which include the users' desired document orderings. Thus, the system learns parameter settings which better enable it to rank relevant documents before irrelevant ones. The non-metric approach is interesting because it is a general adaptive method, an alternative to supervised methods for training neural networks in domains in which rank order or prioritization is important. A second adaptive method is also examined, which is applicable to a restricted class of retrieval systems but which permits an analytic solution. The adaptive methods are applied to a number of problems in text retrieval to validate their utility and practical efficiency. The applications include: a dimensionality reduction of vector-based document representations to a vector space in which inter-document similarity more accurately predicts semantic association; the estimation of a similarity measure which better predicts the relevance of documents to queries; and the estimation of a high-performance neural network combination of multiple retrieval systems into a single overall system. The applications demonstrate that the approaches improve performance and adapt to varying retrieval environments. We also compare the methods to numerous alternative adaptive methods in the text retrieval literature, with very positive results.
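To make the main method concrete: a free parameter of a scoring function is tuned by gradient descent on a smoothed pairwise criterion that penalizes ranking an irrelevant document above a relevant one for the training queries. The sketch below is illustrative only; the sigmoid smoothing, the numerical gradient, and all function names are assumptions rather than the dissertation's exact criterion or implementation.

    import numpy as np

    def scores(theta, query_vec, doc_vecs):
        # Illustrative scoring function: documents scored by a
        # theta-weighted inner product with the query vector.
        return doc_vecs @ (theta * query_vec)

    def rank_criterion(theta, queries):
        # Smoothed fraction of misordered (relevant, irrelevant) pairs,
        # averaged over the training queries; smaller is better.
        total = 0.0
        for query_vec, doc_vecs, rel_idx, irr_idx in queries:
            s = scores(theta, query_vec, doc_vecs)
            diffs = s[rel_idx][:, None] - s[irr_idx][None, :]
            total += np.mean(1.0 / (1.0 + np.exp(diffs)))
        return total / len(queries)

    def optimize(theta, queries, lr=0.1, steps=200, eps=1e-4):
        # Plain gradient descent using central-difference gradients.
        for _ in range(steps):
            grad = np.zeros_like(theta)
            for i in range(len(theta)):
                e = np.zeros_like(theta)
                e[i] = eps
                grad[i] = (rank_criterion(theta + e, queries)
                           - rank_criterion(theta - e, queries)) / (2 * eps)
            theta = theta - lr * grad
        return theta
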


Citations
Proceedings ArticleDOI

Models for metasearch

TL;DR: The experimental results show that metasearch algorithms based on the Borda and Bayesian models usually outperform the best input system and are competitive with, and often outperform, existing metasearch strategies.
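For illustration, the Borda model this summary refers to can be read as a simple positional point count over the input rankings; the sketch below assumes that reading, and the function name and handling of unranked documents are illustrative, not the paper's implementation.

    def borda_fuse(rankings):
        # rankings: list of ranked document-ID lists, best first.
        # Each system awards (n - position) points to the documents it
        # returns; documents a system does not rank get no points from it.
        points = {}
        for ranking in rankings:
            n = len(ranking)
            for pos, doc in enumerate(ranking):
                points[doc] = points.get(doc, 0) + (n - pos)
        return sorted(points, key=points.get, reverse=True)

    # Example: fusing three partially overlapping result lists.
    fused = borda_fuse([["d1", "d2", "d3"], ["d2", "d1"], ["d3", "d2", "d4"]])
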
Proceedings ArticleDOI

Automatic combination of multiple ranked retrieval systems

TL;DR: This work proposes a method by which the relevance estimates made by different experts can be automatically combined to result in superior retrieval performance and applies the method to two expert combination tasks.
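A common way to combine the relevance estimates of several experts is to standardize each expert's scores per query and take a weighted sum, with the weights then tuned on training data (for example with the rank criterion sketched under the abstract above). The sketch below assumes that scheme; the normalization and names are illustrative, not the paper's exact method.

    import numpy as np

    def combine_experts(expert_scores, weights):
        # expert_scores: one array per expert giving that expert's
        # relevance estimates for the same candidate documents.
        # Per-expert standardization lets differently scaled experts mix.
        combined = np.zeros(len(expert_scores[0]), dtype=float)
        for s, w in zip(expert_scores, weights):
            s = np.asarray(s, dtype=float)
            spread = s.std() if s.std() > 0 else 1.0
            combined += w * (s - s.mean()) / spread
        return combined  # rank documents by this combined score
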
Proceedings ArticleDOI

Condorcet fusion for improved retrieval

TL;DR: A graph-theoretic analysis is applied to one of the two major classes of voting procedures from Social Choice Theory, the Condorcet procedure, and yields a sorting-based algorithm that performs very well on TREC data, often outperforming existing metasearch algorithms whether or not relevance scores and training data are available.
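The Condorcet procedure referenced here orders documents with a pairwise-majority comparator: A precedes B if more input systems rank A above B. A minimal sorting-based sketch under that reading follows; the comparator and the treatment of unranked documents are illustrative assumptions, not the paper's exact algorithm.

    from functools import cmp_to_key

    def condorcet_fuse(rankings):
        # rankings: list of ranked document-ID lists, best first.
        docs = {d for r in rankings for d in r}
        # Position of each document in each ranking; unranked documents
        # are treated as ranked below every ranked one (an assumption).
        positions = [{d: i for i, d in enumerate(r)} for r in rankings]

        def prefer(a, b):
            # Negative when a majority of systems rank a above b.
            votes = 0
            for pos in positions:
                ra, rb = pos.get(a, len(pos)), pos.get(b, len(pos))
                votes += -1 if ra < rb else (1 if rb < ra else 0)
            return votes

        return sorted(docs, key=cmp_to_key(prefer))
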
Journal ArticleDOI

Semantically enhanced Information Retrieval: An ontology-based approach

TL;DR: The major contribution of this work is an innovative, comprehensive semantic search model, which extends the classic IR model, addresses the challenges of the massive and heterogeneous Web environment, and integrates the benefits of both keyword and semantic-based search.
Proceedings ArticleDOI

FRank: a ranking method with fidelity loss

TL;DR: An algorithm named FRank is proposed, based on a generalized additive model, for the sake of minimizing the fidelity loss and learning an effective ranking function; the experimental results show that the proposed algorithm outperforms other learning-based ranking methods on both conventional IR problems and Web search.
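As a reference point, the fidelity loss that FRank minimizes compares the target and modeled probabilities that one document should be ranked above another; stated from memory (and worth checking against the paper), for a document pair (i, j):

    F_{ij} = 1 - \left( \sqrt{P^{*}_{ij}\,P_{ij}} + \sqrt{(1 - P^{*}_{ij})(1 - P_{ij})} \right),
    \qquad
    P_{ij} = \frac{\exp(o_{ij})}{1 + \exp(o_{ij})}, \quad o_{ij} = f(x_i) - f(x_j),

where P^{*}_{ij} is the target pairwise preference and f is the learned ranking function; unlike the cross-entropy loss used by RankNet, the loss for each pair is bounded and reaches zero when the modeled probability matches the target.
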
References
Book ChapterDOI

Learning internal representations by error propagation

TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.
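For reference, the generalized delta rule derived in this chapter updates each weight in proportion to an error signal propagated backward through the network; in the standard notation:

    \Delta w_{ji} = \eta\,\delta_j\,o_i,
    \qquad
    \delta_j =
    \begin{cases}
      (t_j - o_j)\,f'(\mathrm{net}_j) & \text{for output units,} \\
      f'(\mathrm{net}_j)\sum_k \delta_k w_{kj} & \text{for hidden units,}
    \end{cases}

where o_i is the output of unit i, net_j the net input to unit j, t_j the target output, f the unit's activation function, and \eta the learning rate.
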
Book

Introduction to Modern Information Retrieval
