Predicting query performance

doi:10.1145/564376.564429

Proceedings ArticleDOI

Predicting query performance

Steve Cronen-Townsend, +2 more

- pp 299-306

Chats0

TLDR

It is suggested that clarity scores measure the ambiguity of a query with respect to a collection of documents and show that they correlate positively with average precision in a variety of TREC test sets.

Abstract:

We develop a method for predicting query performance by computing the relative entropy between a query language model and the corresponding collection language model. The resulting clarity score measures the coherence of the language usage in documents whose models are likely to generate the query. We suggest that clarity scores measure the ambiguity of a query with respect to a collection of documents and show that they correlate positively with average precision in a variety of TREC test sets. Thus, the clarity score may be used to identify ineffective queries, on average, without relevance information. We develop an algorithm for automatically setting the clarity score threshold between predicted poorly-performing queries and acceptable queries and validate it using TREC data. In particular, we compare the automatic thresholds to optimum thresholds and also check how frequently results as good are achieved in sampling experiments that randomly assign queries to the two classes.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Novelty and diversity in information retrieval evaluation

Charles L. A. Clarke, +6 more

TL;DR: This paper develops a framework for evaluation that systematically rewards novelty and diversity into a specific evaluation measure, based on cumulative gain, and demonstrates the feasibility of this approach using a test collection based on the TREC question answering track.

...read moreread less

Journal ArticleDOI

Exploratory Search:Beyond the Query-Response Paradigm

Ryen W. White, +1 more

- 16 Feb 2009 -

Synthesis Lectures on Information Concep...

TL;DR: This lecture introduces exploratory search, relates it to relevant extant research, outline the features of exploratorySearch systems, discuss the evaluation of these systems, and suggest some future directions for supporting exploratorysearch.

...read moreread less

Proceedings ArticleDOI

Overview of TREC 2004

Ellen M. Voorhees

TL;DR: The thirteenth Text REtrieval Conference, TREC 2004, was held at the National Institute of Standards and Technology (NIST) November 16–19, 2004.

...read moreread less

Determining the informational, navigational and transactional intent of web queries

Danielle L. Booth, +2 more

TL;DR: A software application was developed that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users and showed that the automatic classification has an accuracy of 74%.

...read moreread less

Journal ArticleDOI

Determining the informational, navigational, and transactional intent of Web queries

Bernard J. Jansen, +2 more

- 01 May 2008 -

Information Processing and Management

TL;DR: In this article, the authors define and present a comprehensive classification of user intent for Web searching, which consists of three hierarchical levels of informational, navigational, and transactional intent, and then develop a software application that automatically classified queries using a web search engine log of over a million and a half queries submitted by several hundred thousand users.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Elements of information theory

Thomas M. Cover, +1 more

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

...read moreread less

Journal ArticleDOI

Pattern Classification and Scene Analysis.

Ulf Grenander, +2 more

- 01 Sep 1974 -

Journal of the American Statistical Asso...

Book

Pattern classification and scene analysis

Richard O. Duda, +1 more

TL;DR: In this article, a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition is provided, including Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprosessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.

...read moreread less

Book

Foundations of Statistical Natural Language Processing

Christopher D. Manning, +1 more

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.

...read moreread less

Book

Nonparametric Statistical Inference

Jean D. Gibbons, +1 more

TL;DR: Theoretical Bases for Calculating the ARE Examples of the Calculations of Efficacy and ARE Analysis of Count Data.

...read moreread less

Collapse

Predicting query performance

Citations

Novelty and diversity in information retrieval evaluation

Exploratory Search:Beyond the Query-Response Paradigm

Overview of TREC 2004

Determining the informational, navigational and transactional intent of web queries

Determining the informational, navigational, and transactional intent of Web queries

References

Elements of information theory

Pattern Classification and Scene Analysis.

Pattern classification and scene analysis

Foundations of Statistical Natural Language Processing

Nonparametric Statistical Inference

Related Papers (5)

Inferring Query Performance Using Pre-retrieval Predictors

Query performance prediction in web search environments

Relevance-Based Language Models

Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval

A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval