Topic

Document retrieval

About: Document retrieval is a research topic. Over its lifetime, 6,821 publications have been published within this topic, receiving 214,383 citations.

Papers published on a yearly basis

[Chart: number of papers per year, 1969 to 2023]

Papers

Patent•

User interface for document retrieval

IJsbrand Jan Aalbersberg¹•Institutions (1)

Philips¹

11 Jan 1995

TL;DR: A user interface for a full-text document retrieval computerized system comprises a display with a words window in which each query word is displayed by means of a distinctive representation uniquely associated with each displayed word.

Abstract: A user interface for a full-text document retrieval computerized system comprises a display with a words window in which each query word is displayed by means of a distinctive representation uniquely associated with each displayed word. In a subsequent results window, each document header or title or representation is accompanied by an indicator which employs the same distinctive representation to directly indicate to the user the relative contributions of the individual query words to each listed document. In a preferred embodiment, the distinctive representation is integrated with an associated weight first indicator in a words window, and in the results window the distinctive representations are also integrated with an associated weight second indicator. The distinctive representation can take several forms, such as by a different color or by means of hatching or shading or by displayed icons.

158 citations
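
The idea in the patent abstract above, showing how much each query word contributes to a listed document, can be sketched as follows. This is a minimal illustration only: the function name `term_contributions` is hypothetical, and plain term frequency is an assumed scoring scheme, since the abstract does not say how contributions are weighted.

```python
from collections import Counter


def term_contributions(query_words, document_text):
    """Return each query word's share of the document's total match count.

    Assumption: contribution = raw term frequency, normalised to sum to 1.
    The abstract does not specify the actual weighting.
    """
    counts = Counter(document_text.lower().split())
    raw = {word: counts[word.lower()] for word in query_words}
    total = sum(raw.values())
    if total == 0:
        # No query word occurs in the document.
        return {word: 0.0 for word in query_words}
    return {word: raw[word] / total for word in query_words}


shares = term_contributions(
    ["retrieval", "color"],
    "color display for retrieval of color documents",
)
```

A results window could then draw one bar segment per query word, sized by its share and rendered in that word's color or hatching.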

Proceedings Article•

TREC Genomics Track Overview

William R. Hersh¹, Ravi Teja Bhupatiraju¹•Institutions (1)

Oregon Health & Science University¹

01 Jan 2003

TL;DR: The first year of TREC Genomics Track featured two tasks: ad hoc retrieval and information extraction, which centered around the Gene Reference into Function (GeneRIF) resource of the National Library of Medicine.

Abstract: The first year of TREC Genomics Track featured two tasks: ad hoc retrieval and information extraction. Both tasks centered around the Gene Reference into Function (GeneRIF) resource of the National Library of Medicine, which was used as both pseudorelevance judgments for ad hoc document retrieval as well as target text for information extraction. The track attracted 29 groups who participated in one or both tasks.

157 citations

Journal Article•DOI•

Users, user interfaces, and objects: Envision, a digital library

Edward A. Fox¹, Deborah Hix¹, Lucy T. Nowell¹, Dennis J. Brueni¹, Durgesh Rao, William C. Wake¹, Lenwood S. Heath¹•Institutions (1)

Virginia Tech¹

01 Sep 1993-Journal of the Association for Information Science and Technology

TL;DR: Development of the Envision database, system software, and protocol for client-server communication builds upon work to identify and represent “objects” that will facilitate reuse and high-level communication of information from author to reader (user).

Abstract: Project Envision aims to build a “user-centered database from the computer science literature,” initially using the publications of the Association for Computing Machinery (ACM). Accordingly, we have interviewed potential users, as well as experts in library, information, and computer science, to understand their needs, to become aware of their perception of existing information systems, and to collect their recommendations. Design and formative usability evaluation of our interface have been based on those interviews, leading to innovative query formulation and search results screens that work well according to our usability testing. Our development of the Envision database, system software, and protocol for client-server communication builds upon work to identify and represent “objects” that will facilitate reuse and high-level communication of information from author to reader (user). All these efforts are leading not only to a usable prototype digital library but also to a set of nine principles for digital libraries, which we have tried to follow, covering issues of representation, architecture, and interfacing. © 1993 John Wiley & Sons, Inc.

157 citations

Journal Article•DOI•

The Retrieval Effects of Query Expansion on a Feedback Document Retrieval System

Alan F. Smeaton¹, C. J. van Rijsbergen¹•Institutions (1)

University College Dublin¹

01 Jan 1983-The Computer Journal

157 citations

Proceedings Article•

Natural Language Processing in Information Retrieval.

Thorsten Brants

01 Jan 2003

TL;DR: NLP needs to be optimized for IR in order to be effective and document retrieval is not an ideal application for NLP, at least given the current state-of-the-art in NLP.

Abstract: Many Natural Language Processing (NLP) techniques have been used in Information Retrieval. The results are not encouraging. Simple methods (stopwording, Porter-style stemming, etc.) usually yield significant improvements, while higher-level processing (chunking, parsing, word sense disambiguation, etc.) yields only very small improvements or even a decrease in accuracy. At the same time, higher-level methods increase the processing and storage cost dramatically. This makes them hard to use on large collections. We review NLP techniques and come to the conclusion that (a) NLP needs to be optimized for IR in order to be effective and (b) document retrieval is not an ideal application for NLP, at least given the current state-of-the-art in NLP. Other IR-related tasks, e.g., question answering and information extraction, seem to be better suited.

156 citations
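
The "simple methods" named in the abstract above (stopwording and stemming) can be illustrated with a short sketch. Everything here is an assumption made for illustration: the stopword list is a tiny hand-picked sample, and `strip_suffix` is a crude suffix stripper standing in for a real stemmer; it is not the Porter algorithm.

```python
# Tiny illustrative stopword list (assumption, not a standard list).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is"}

# Crude stand-in for stemming; this is NOT the Porter algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def strip_suffix(token):
    """Remove the first matching suffix, keeping a stem of at least 3 letters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Lowercase, trim punctuation, drop stopwords, then strip suffixes."""
    tokens = [t.strip(".,;:()").lower() for t in text.split()]
    return [strip_suffix(t) for t in tokens if t and t not in STOPWORDS]


terms = preprocess("Indexing the retrieved documents")
# terms == ["index", "retriev", "document"]
```

Applying the same `preprocess` step to both queries and documents lets variants such as "retrieved" and "retrieving" match on a shared stem.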

Performance metrics

Papers: 6,866
Citations: 224,605

No. of papers in the topic in previous years
Year	Papers
2023	9
2022	39
2021	107
2020	130
2019	144
2018	111
