Home
/
Topics
/
Ranking (information retrieval)

Topic

Ranking (information retrieval)

About: Ranking (information retrieval) is a research topic. Over the lifetime, 21109 publications have been published within this topic receiving 435130 citations.

...read moreread less

Papers published on a yearly basis

2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Adapting to source properties in processing data integration queries

[...]

Zachary G. Ives¹, Alon Halevy², Daniel S. Weld²•Institutions (2)

University of Pennsylvania¹, University of Washington²

13 Jun 2004

TL;DR: This paper introduces a new technique, called adaptive data partitioning (ADP), which is based on the idea of dividing the source data into regions, each executed by different, complementary plans, and shows how this model can be applied in novel ways to correct for underestimated selectivity and cardinality values.

...read moreread less

Abstract: An effective query optimizer finds a query plan that exploits the characteristics of the source data. In data integration, little is known in advance about sources' properties, which necessitates the use of adaptive query processing techniques to adjust query processing on-the-fly. Prior work in adaptive query processing has focused on compensating for delays and adjusting for mis-estimated cardinality or selectivity values. In this paper, we present a generalized architecture for adaptive query processing and introduce a new technique, called adaptive data partitioning (ADP), which is based on the idea of dividing the source data into regions, each executed by different, complementary plans. We show how this model can be applied in novel ways to not only correct for underestimated selectivity and cardinality values, but also to discover and exploit order in the source data, and to detect and exploit source data that can be effectively pre-aggregated. We experimentally compare a number of alternative strategies and show that our approach is effective.

...read moreread less

116 citations

Patent•

Apparatus and method for searching and retrieving structured, semi-structured and unstructured content

[...]

Douglass Russell Judd, Bruce D. Karsh, Ram Subbaroyan, Troy Toman, Rahul Lahiri, Patrick Lok - Show less +2 more

14 May 2003

TL;DR: In this article, a search and retrieval system allows a user to search free text within sections of schema independent documents, which may include structured, semi-structured, and unstructured documents.

...read moreread less

Abstract: A search and retrieval permits a user to search free text within sections of schema independent documents. The documents, which may include structured, semi-structured, and unstructured documents, contain text organized into a plurality of sections, such as XML tags. The repository of documents is schema independent, such that the search system does not require pre-defined fields for the sections. To execute a search, the search system receives a query that specifies at least one section and at least one free text query construct for text within the section. In general, the free text query construct specifies at least one free text search condition. The search system identifies sections in the repository of documents as specified in the query, and evaluates the free text query construct for the text within sections to determine whether the free text search condition is met.

...read moreread less

115 citations

Journal Article•DOI•

A ranking of software engineering measures based on expert opinion

[...]

Ming Li¹, Carol Smidts¹•Institutions (1)

University of Maryland, College Park¹

01 Sep 2003-IEEE Transactions on Software Engineering

TL;DR: This research proposes a framework based on expert opinion elicitation, developed to select the software engineering measures which are the best software reliability indicators, based on the top 30 measures identified in an earlier study conducted by Lawrence Livermore National Laboratory.

...read moreread less

Abstract: This research proposes a framework based on expert opinion elicitation, developed to select the software engineering measures which are the best software reliability indicators. The current research is based on the top 30 measures identified in an earlier study conducted by Lawrence Livermore National Laboratory. A set of ranking criteria and their levels were identified. The score of each measure for each ranking criterion was elicited through expert opinion and then aggregated into a single score using multiattribute utility theory. The basic aggregation scheme selected was a linear additive scheme. A comprehensive sensitivity analysis was carried out. The sensitivity analysis included: variation of the ranking criteria levels, variation of the weights, variation of the aggregation schemes. The top-ranked measures were identified. Use of these measures in each software development phase can lead to a more reliable quantitative prediction of software reliability.

...read moreread less

115 citations

Proceedings Article•DOI•

Relevance-based Word Embedding

[...]

Hamed Zamani¹, W. Bruce Croft¹•Institutions (1)

University of Massachusetts Amherst¹

07 Aug 2017

TL;DR: Both query expansion experiments on four TREC collections and query classification experiments on the KDD Cup 2005 dataset suggest that the relevance-based word embedding models significantly outperform state-of-the-art proximity-based embedding model, such as word2vec and GloVe.

...read moreread less

Abstract: Learning a high-dimensional dense representation for vocabulary terms, also known as a word embedding, has recently attracted much attention in natural language processing and information retrieval tasks. The embedding vectors are typically learned based on term proximity in a large corpus. This means that the objective in well-known word embedding algorithms, e.g., word2vec, is to accurately predict adjacent word(s) for a given word or context. However, this objective is not necessarily equivalent to the goal of many information retrieval (IR) tasks. The primary objective in various IR tasks is to capture relevance instead of term proximity, syntactic, or even semantic similarity. This is the motivation for developing unsupervised relevance-based word embedding models that learn word representations based on query-document relevance information. In this paper, we propose two learning models with different objective functions; one learns a relevance distribution over the vocabulary set for each query, and the other classifies each term as belonging to the relevant or non-relevant class for each query. To train our models, we used over six million unique queries and the top ranked documents retrieved in response to each query, which are assumed to be relevant to the query. We extrinsically evaluate our learned word representation models using two IR tasks: query expansion and query classification. Both query expansion experiments on four TREC collections and query classification experiments on the KDD Cup 2005 dataset suggest that the relevance-based word embedding models significantly outperform state-of-the-art proximity-based embedding models, such as word2vec and GloVe.

...read moreread less

115 citations

Journal Article•DOI•

Academic Ranking of World Universities 2008

[...]

Alejandro Márquez Jiménez¹•Institutions (1)

National Autonomous University of Mexico¹

31 Dec 1969-Perfiles Educativos

TL;DR: In particular, the authors analyzes the impact of the Academic Ranking of World Universities (ARWU) on the desempeño of universitarias in the world.

...read moreread less

115 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
…
162
163
164
165
166
167
168
…
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

30,908

Papers

494,240

Citations

No. of papers in the topic in previous years
Year	Papers
2024	1
2023	3,112
2022	6,541
2021	1,105
2020	1,082
2019	1,168

Ranking (information retrieval)

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics