European Conference on Information Retrieval
About: The European Conference on Information Retrieval is an academic conference. It publishes mainly in the areas of computer science and ranking (information retrieval). Over its lifetime, the conference has published 2,006 papers, which have received 37,931 citations.
Topics: Computer science, Ranking (information retrieval), Relevance (information retrieval), Query expansion, Task (project management)
Papers published on a yearly basis
Papers
21 Mar 2005
TL;DR: A probabilistic setting is used which allows us to obtain posterior distributions on these performance indicators, rather than point estimates, and is applied to the case where different methods are run on different datasets from the same source.
Abstract: We address the problems of 1/ assessing the confidence of the standard point estimates, precision, recall and F-score, and 2/ comparing the results, in terms of precision, recall and F-score, obtained using two different methods. To do so, we use a probabilistic setting which allows us to obtain posterior distributions on these performance indicators, rather than point estimates. This framework is applied to the case where different methods are run on different datasets from the same source, as well as the standard situation where competing results are obtained on the same data.
1,402 citations
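The listing does not spell out the paper's exact probabilistic setting. One common instantiation puts a Beta posterior on precision: with tp true positives and fp false positives, and a uniform Beta(1, 1) prior, the posterior is Beta(tp + 1, fp + 1). A minimal sketch, sampling the posterior rather than using a closed-form interval (the counts below are hypothetical):

```python
import random

def precision_posterior(tp, fp, draws=100_000, seed=0):
    """Posterior samples for precision under a uniform Beta(1, 1) prior.

    With tp true positives and fp false positives, the posterior over
    precision is Beta(tp + 1, fp + 1).
    """
    rng = random.Random(seed)
    return [rng.betavariate(tp + 1, fp + 1) for _ in range(draws)]

def credible_interval(samples, level=0.95):
    """Equal-tailed credible interval from posterior samples."""
    s = sorted(samples)
    lo = s[int((1 - level) / 2 * len(s))]
    hi = s[int((1 + level) / 2 * len(s)) - 1]
    return lo, hi

samples = precision_posterior(tp=45, fp=5)
lo, hi = credible_interval(samples)
# The point estimate is 45/50 = 0.90; the interval quantifies its uncertainty.
print(round(lo, 2), round(hi, 2))
```

The same construction applies to recall (with false negatives in place of false positives); comparing two methods amounts to comparing draws from their two posteriors.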
18 Apr 2011
TL;DR: This paper empirically compares the content of Twitter with a traditional news medium, the New York Times, using unsupervised topic modeling, and reports findings useful for downstream IR or DM applications.
Abstract: Twitter as a new form of social media can potentially contain much useful information, but content analysis on Twitter has not been well studied. In particular, it is not clear whether as an information source Twitter can be simply regarded as a faster news feed that covers mostly the same information as traditional news media. In this paper we empirically compare the content of Twitter with a traditional news medium, the New York Times, using unsupervised topic modeling. We use a Twitter-LDA model to discover topics from a representative sample of the entire Twitter. We then use text mining techniques to compare these Twitter topics with topics from the New York Times, taking into consideration topic categories and types. We also study the relation between the proportions of opinionated tweets and retweets and topic categories and types. Our comparisons show interesting and useful findings for downstream IR or DM applications.
1,193 citations
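The Twitter-LDA variant itself is not reproduced here. As an illustration of the underlying technique, a minimal collapsed Gibbs sampler for standard LDA can be sketched; the hyperparameters and the toy document interface are assumptions for the sketch:

```python
import random
from collections import defaultdict

def lda_gibbs(docs, n_topics=2, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Minimal collapsed Gibbs sampler for standard LDA (illustrative only).

    `docs` is a list of token lists; returns per-topic word counts,
    from which topic-word distributions can be read off.
    """
    rng = random.Random(seed)
    vocab = {w for d in docs for w in d}
    V = len(vocab)
    # z[d][i]: current topic assignment of word i in document d
    z = [[rng.randrange(n_topics) for _ in d] for d in docs]
    n_dk = [[0] * n_topics for _ in docs]               # topic counts per doc
    n_kw = [defaultdict(int) for _ in range(n_topics)]  # word counts per topic
    n_k = [0] * n_topics                                # total words per topic
    for d, doc in enumerate(docs):
        for i, w in enumerate(doc):
            k = z[d][i]
            n_dk[d][k] += 1; n_kw[k][w] += 1; n_k[k] += 1
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove this token's assignment, then resample it
                k = z[d][i]
                n_dk[d][k] -= 1; n_kw[k][w] -= 1; n_k[k] -= 1
                weights = [
                    (n_dk[d][t] + alpha) * (n_kw[t][w] + beta) / (n_k[t] + V * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                n_dk[d][k] += 1; n_kw[k][w] += 1; n_k[k] += 1
    return n_kw
```

Comparing two corpora, as the paper does, would then reduce to comparing the topic proportions each corpus assigns.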
28 Mar 2010
TL;DR: This poster discusses the characteristics needed in an information retrieval (IR) test collection to facilitate the evaluation of integrated search, i.e. search across a range of different sources with one search box and one ranked result list, and describes and analyses a new test collection built for this purpose.
Abstract: The poster discusses the characteristics needed in an information retrieval (IR) test collection to facilitate the evaluation of integrated search, i.e. search across a range of different sources but with one search box and one ranked result list, and describes and analyses a new test collection constructed for this purpose. The test collection consists of approx. 18,000 monographic records, 160,000 papers and journal articles in PDF and 275,000 abstracts with a varied set of metadata and vocabularies from the physics domain, 65 topics based on real work tasks and corresponding graded relevance assessments. The test collection may be used for systems- as well as user-oriented evaluation.
1,039 citations
02 Apr 2007
TL;DR: This work defines a general framework for inference in summarization and presents three algorithms: a greedy approximate method, a dynamic programming approach based on solutions to the knapsack problem, and an exact algorithm that uses an Integer Linear Programming formulation of the problem.
Abstract: In this work we study the theoretical and empirical properties of various global inference algorithms for multi-document summarization. We start by defining a general framework for inference in summarization. We then present three algorithms: The first is a greedy approximate method, the second a dynamic programming approach based on solutions to the knapsack problem, and the third is an exact algorithm that uses an Integer Linear Programming formulation of the problem. We empirically evaluate all three algorithms and show that, relative to the exact solution, the dynamic programming algorithm provides near optimal results with preferable scaling properties.
382 citations
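The knapsack-based dynamic program can be sketched as follows. This is a plain 0/1 knapsack over sentence scores and word counts, not necessarily the paper's exact formulation, and the scores are hypothetical stand-ins for a real relevance model:

```python
def knapsack_summary(sentences, budget):
    """Select sentences maximising total relevance under a word budget,
    via the classic 0/1 knapsack dynamic program.

    `sentences` is a list of (relevance_score, word_count) pairs.
    Returns (best_total_score, list_of_chosen_indices).
    """
    # best[b] = (total_score, chosen_indices) using at most b words
    best = [(0.0, [])] * (budget + 1)
    for i, (score, length) in enumerate(sentences):
        new_best = best[:]
        for b in range(length, budget + 1):
            # Reading from `best` (which excludes sentence i) keeps this 0/1
            cand = best[b - length][0] + score
            if cand > new_best[b][0]:
                new_best[b] = (cand, best[b - length][1] + [i])
        best = new_best
    return best[budget]

# Hypothetical (score, word_count) pairs with a 14-word budget:
print(knapsack_summary([(3.0, 10), (2.0, 9), (1.5, 4)], budget=14))
# → (4.5, [0, 2]): sentences 0 and 2 fit the budget with the highest total score
```

The greedy method from the abstract would instead repeatedly pick the sentence with the best score-per-word ratio that still fits, which is faster but not optimal.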
02 Apr 2007
TL;DR: This work formally evaluates and analyzes the methods on a query-query similarity task using 363,822 queries from a web search log, providing insights into the strengths and weaknesses of each method, including important tradeoffs between effectiveness and efficiency.
Abstract: Measuring the similarity between documents and queries has been extensively studied in information retrieval. However, there are a growing number of tasks that require computing the similarity between two very short segments of text. These tasks include query reformulation, sponsored search, and image retrieval. Standard text similarity measures perform poorly on such tasks because of data sparseness and the lack of context. In this work, we study this problem from an information retrieval perspective, focusing on text representations and similarity measures. We examine a range of similarity measures, including purely lexical measures, stemming, and language modeling-based measures. We formally evaluate and analyze the methods on a query-query similarity task using 363,822 queries from a web search log. Our analysis provides insights into the strengths and weaknesses of each method, including important tradeoffs between effectiveness and efficiency.
354 citations
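A purely lexical measure of the kind the study examines can be sketched as a Jaccard overlap of token sets. The example queries below are hypothetical and illustrate the data-sparseness failure the abstract mentions:

```python
def jaccard(a: str, b: str) -> float:
    """Jaccard overlap between the token sets of two short texts:
    |A ∩ B| / |A ∪ B| over lowercased whitespace tokens."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)

print(jaccard("apple ipod nano", "ipod nano 8gb"))  # → 0.5: shared surface terms
print(jaccard("flight tickets", "cheap airfare"))   # → 0.0: related, but no overlap
```

The second pair is semantically close yet scores zero, which is why the paper also considers stemming and language-modeling-based measures that bring in context beyond exact term matches.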