Home
/
Topics
/
Ranking (information retrieval)

Topic

Ranking (information retrieval)

About: Ranking (information retrieval) is a research topic. Over the lifetime, 21109 publications have been published within this topic receiving 435130 citations.

...read moreread less

Papers published on a yearly basis

2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•

ProFusion*: Intelligent Fusion from Multiple, Distributed Search Engines 1

[...]

Susan Gauch, Guijun Wang, Mario Gomez

01 Jan 1996-Journal of Universal Computer Science

TL;DR: ProFusion, a meta search engine, sends user queries to multiple underlying search engines in parallel, retrieves and merges the resulting URLs, and identifies and removes duplicates and creates one relevance-ranked list.

...read moreread less

Abstract: The explosive growth of the World Wide Web, and the resulting information overload, has led to a mini-explosion in World Wide Web search engines. This mini-explosion, in turn, led to the development of ProFusion, a meta search engine. Educators, like other users, do not have the time to evaluate multiple search engines to knowledgeably select the best for their uses. Nor do they have the time to submit each query to multiple search engines and wade through the resulting flood of good information, duplicated information, irrelevant information, and missing documents. ProFusion sends user queries to multiple underlying search engines in parallel, retrieves and merges the resulting URLs. It identifies and removes duplicates and creates one relevance-ranked list. If desired, the actual documents can be pre-fetched to remove yet more duplicates and broken links. ProFusion's performance has been compared to the individual search engines and other meta searchers, demonstrating its ability to retrieve more relevant information and present fewer duplicates pages. The system can automatically analyze queries to identify its topic(s) and, based on that analysis, select the most appropriate search engines for the query.

...read moreread less

174 citations

Journal Article•DOI•

Statistical Analysis of Bayes Optimal Subset Ranking

[...]

D. Cossock¹, Tong Zhang•Institutions (1)

Yahoo!¹

01 Nov 2008-IEEE Transactions on Information Theory

TL;DR: This work considers a formulation of the statistical ranking problem which it calls subset ranking, and focuses on the discounted cumulated gain (DCG) criterion that measures the quality of items near the top of the rank-list.

...read moreread less

Abstract: The ranking problem has become increasingly important in modern applications of statistical methods in automated decision making systems. In particular, we consider a formulation of the statistical ranking problem which we call subset ranking, and focus on the discounted cumulated gain (DCG) criterion that measures the quality of items near the top of the rank-list. Similar to error minimization for binary classification, direct optimization of natural ranking criteria such as DCG leads to a nonconvex optimization problems that can be NP-hard. Therefore, a computationally more tractable approach is needed. We present bounds that relate the approximate optimization of DCG to the approximate minimization of certain regression errors. These bounds justify the use of convex learning formulations for solving the subset ranking problem. The resulting estimation methods are not conventional, in that we focus on the estimation quality in the top-portion of the rank-list. We further investigate the asymptotic statistical behavior of these formulations. Under appropriate conditions, the consistency of the estimation schemes with respect to the DCG metric can be derived.

...read moreread less

174 citations

Proceedings Article•DOI•

A cluster-based resampling method for pseudo-relevance feedback

[...]

Kyung-Soon Lee¹, W. Bruce Croft², James Allan²•Institutions (2)

Chonbuk National University¹, University of Massachusetts Amherst²

20 Jul 2008

TL;DR: This paper presents a cluster-based resampling method to select better pseudo-relevant documents based on the relevance model, and shows higher relevance density than the baseline relevance model on all collections, resulting in better retrieval accuracy in pseudo-relevance feedback.

...read moreread less

Abstract: Typical pseudo-relevance feedback methods assume the top-retrieved documents are relevant and use these pseudo-relevant documents to expand terms. The initial retrieval set can, however, contain a great deal of noise. In this paper, we present a cluster-based resampling method to select better pseudo-relevant documents based on the relevance model. The main idea is to use document clusters to find dominant documents for the initial retrieval set, and to repeatedly feed the documents to emphasize the core topics of a query. Experimental results on large-scale web TREC collections show significant improvements over the relevance model. For justification of the resampling approach, we examine relevance density of feedback documents. A higher relevance density will result in greater retrieval accuracy, ultimately approaching true relevance feedback. The resampling approach shows higher relevance density than the baseline relevance model on all collections, resulting in better retrieval accuracy in pseudo-relevance feedback. This result indicates that the proposed method is effective for pseudo-relevance feedback.

...read moreread less

174 citations

Journal Article•DOI•

A new QoS ontology and its QoS-based ranking algorithm for Web services

[...]

Vuong Xuan Tran¹, Hidekazu Tsuji¹, Ryosuke Masuda¹•Institutions (1)

Tokai University¹

01 Sep 2009-Simulation Modelling Practice and Theory

TL;DR: This paper proposes a novel approach for designing and developing a QoS ontology and its QoS-based ranking algorithm for evaluating Web services and can be used in various applications in order to facilitate automatic and dynamic discovery and selection of Web services.

...read moreread less

173 citations

Patent•

Research mode for a knowledge base search and retrieval system

[...]

Kelly Wical¹•Institutions (1)

Oracle Corporation¹

12 Nov 1997

TL;DR: In this paper, the search and retrieval system includes point-of-view gists for documents to provide a synopsis for a corresponding document with a slant toward a specific topic.

...read moreread less

Abstract: A research mode in a search and retrieval system generates a research document that infers an answer to a query from multiple documents. The search and retrieval system includes point of view gists for documents to provide a synopsis for a corresponding document with a slant toward a topic. To generate a research document, the search and retrieval system processes a query to identify one or more topics related to the query, selects document themes relevant to the query, and then selects point of view gists, based on the document themes, that have a slant towards the topics related to the query. A knowledge base, which includes categories arranged hierarchically, is configured as a directed graph to links those categories having a lexical, semantic or usage association. Through use of the knowledge base, an expanded set of query terms are generated, and research documents are compiled that include point of view gists relevant to the expanded set of query terms. A content processing system, which identifies the themes for a document and classifies the document themes in categories of the knowledge base, is also disclosed.

...read moreread less

173 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
…
89
90
91
92
93
94
95
…
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

30,908

Papers

494,240

Citations

No. of papers in the topic in previous years
Year	Papers
2024	1
2023	3,112
2022	6,541
2021	1,105
2020	1,082
2019	1,168

Ranking (information retrieval)

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics