scispace - formally typeset
Proceedings ArticleDOI

Predicting query performance

Reads0
Chats0
TLDR
It is suggested that clarity scores measure the ambiguity of a query with respect to a collection of documents and show that they correlate positively with average precision in a variety of TREC test sets.
Abstract
We develop a method for predicting query performance by computing the relative entropy between a query language model and the corresponding collection language model. The resulting clarity score measures the coherence of the language usage in documents whose models are likely to generate the query. We suggest that clarity scores measure the ambiguity of a query with respect to a collection of documents and show that they correlate positively with average precision in a variety of TREC test sets. Thus, the clarity score may be used to identify ineffective queries, on average, without relevance information. We develop an algorithm for automatically setting the clarity score threshold between predicted poorly-performing queries and acceptable queries and validate it using TREC data. In particular, we compare the automatic thresholds to optimum thresholds and also check how frequently results as good are achieved in sampling experiments that randomly assign queries to the two classes.

read more

Citations
More filters
Proceedings ArticleDOI

Novelty and diversity in information retrieval evaluation

TL;DR: This paper develops a framework for evaluation that systematically rewards novelty and diversity into a specific evaluation measure, based on cumulative gain, and demonstrates the feasibility of this approach using a test collection based on the TREC question answering track.
Journal ArticleDOI

Exploratory Search:Beyond the Query-Response Paradigm

TL;DR: This lecture introduces exploratory search, relates it to relevant extant research, outline the features of exploratorySearch systems, discuss the evaluation of these systems, and suggest some future directions for supporting exploratorysearch.
Proceedings ArticleDOI

Overview of TREC 2004

TL;DR: The thirteenth Text REtrieval Conference, TREC 2004, was held at the National Institute of Standards and Technology (NIST) November 16–19, 2004.

Determining the informational, navigational and transactional intent of web queries

TL;DR: A software application was developed that automatically classified queries using a Web search engine log of over a million and a half queries submitted by several hundred thousand users and showed that the automatic classification has an accuracy of 74%.
Journal ArticleDOI

Determining the informational, navigational, and transactional intent of Web queries

TL;DR: In this article, the authors define and present a comprehensive classification of user intent for Web searching, which consists of three hierarchical levels of informational, navigational, and transactional intent, and then develop a software application that automatically classified queries using a web search engine log of over a million and a half queries submitted by several hundred thousand users.
References
More filters
Book

Elements of information theory

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
Book

Pattern classification and scene analysis

TL;DR: In this article, a unified, comprehensive and up-to-date treatment of both statistical and descriptive methods for pattern recognition is provided, including Bayesian decision theory, supervised and unsupervised learning, nonparametric techniques, discriminant analysis, clustering, preprosessing of pictorial data, spatial filtering, shape description techniques, perspective transformations, projective invariants, linguistic procedures, and artificial intelligence techniques for scene analysis.
Book

Foundations of Statistical Natural Language Processing

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.
Book

Nonparametric Statistical Inference

TL;DR: Theoretical Bases for Calculating the ARE Examples of the Calculations of Efficacy and ARE Analysis of Count Data.