Topic

Document retrieval

About: Document retrieval is a research topic. Over its lifetime, 6,821 publications have been published within this topic, receiving 214,383 citations.

Papers

Proceedings Article

The TREC Spoken Document Retrieval Track: A Success Story.

John S. Garofolo¹, Cedric G. P. Auzanne¹, Ellen M. Voorhees¹

National Institute of Standards and Technology¹

01 Jan 1999

TL;DR: The NIST Text REtrieval Conference (TREC) SDR Track has provided an infrastructure for the development and evaluation of SDR technology and a common forum for the exchange of knowledge between the speech recognition and information retrieval research communities.

Abstract: This paper describes work within the NIST Text REtrieval Conference (TREC) over the last three years in designing and implementing evaluations of Spoken Document Retrieval (SDR) technology within a broadcast news domain. SDR involves the search and retrieval of excerpts from spoken audio recordings using a combination of automatic speech recognition and information retrieval technologies. The TREC SDR Track has provided an infrastructure for the development and evaluation of SDR technology and a common forum for the exchange of knowledge between the speech recognition and information retrieval research communities. The SDR Track can be declared a success in that it has provided objective, demonstrable proof that this technology can be successfully applied to realistic audio collections using a combination of existing technologies and that it can be objectively evaluated. The design and implementation of each of the SDR evaluations are presented and the results are summarized. Plans for the 2000 TREC SDR Track are presented and thoughts about how the track might evolve are discussed.

97 citations

Efficient probabilistic inference for text retrieval

Howard R. Turtle, W. Bruce Croft

01 Jan 1991

TL;DR: An overview of a retrieval model based on probabilistic inference networks is given, and simplifications that allow networks to be built and evaluated efficiently, even with very large collections, are described.

Abstract: Probabilistic inference techniques have been shown to significantly improve retrieval performance when compared to conventional retrieval models, but their use can be prohibitively expensive for large collections. We give an overview of a retrieval model that is based on probabilistic inference networks and describe simplifications that allow networks to be built and evaluated efficiently, even with very large collections.

97 citations

Journal Article

Query expansion and MEDLINE

Padmini Srinivasan¹

Cornell University¹

01 Jul 1996-Information Processing and Management

TL;DR: This study recommends query expansion using retrieval feedback for adding MeSH search terms to a user's initial query.

Abstract: This paper evaluates the retrieval effectiveness of query expansion strategies on a MEDLINE test collection using Cornell University's SMART retrieval system. Three expansion strategies are tested on their ability to identify appropriate MeSH terms for user queries: expansion using an inter-field statistical thesaurus, expansion via retrieval feedback and expansion using a combined approach. These expansion strategies do not require prior relevance decisions. The study compares retrieval effectiveness using the original unexpanded and the alternative expanded user queries on a collection of 75 queries and 2334 MEDLINE citations. Retrieval effectiveness is assessed using eleven-point average precision scores (11-AvgP). The combination of expansion using the thesaurus followed by retrieval feedback gives the best improvement of 17% over a baseline performance of 0.5169 11-AvgP. However, this improvement is almost identical to that achieved by expansion via retrieval feedback (16.4%). Query expansion using the inter-field thesaurus gives a significant but lower performance improvement (9.9%) over the same baseline. This study recommends query expansion using retrieval feedback for adding MeSH search terms to a user's initial query.
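
For readers unfamiliar with the 11-AvgP measure used in the abstract above, the following is a minimal sketch of eleven-point interpolated average precision for a single query. It assumes the common definition (interpolated precision at recall levels 0.0, 0.1, ..., 1.0, then averaged). The function name and the input format are illustrative assumptions and are not taken from the paper or from the SMART system.

```python
def eleven_point_avg_precision(ranked_relevance, total_relevant):
    """Eleven-point interpolated average precision for one query.

    ranked_relevance: list of booleans in rank order (True = relevant).
    total_relevant:   number of relevant documents in the collection.
    Both names are hypothetical and chosen only for this sketch.
    """
    if total_relevant <= 0:
        raise ValueError("total_relevant must be positive")

    # Record (recall, precision) each time a relevant document is seen.
    points = []
    hits = 0
    for rank, is_relevant in enumerate(ranked_relevance, start=1):
        if is_relevant:
            hits += 1
            points.append((hits / total_relevant, hits / rank))

    # Interpolated precision at a recall level is the highest precision
    # observed at any recall greater than or equal to that level.
    interpolated = []
    for i in range(11):
        level = i / 10
        candidates = [p for r, p in points if r >= level - 1e-12]
        interpolated.append(max(candidates, default=0.0))

    return sum(interpolated) / 11


# Two relevant documents exist; they are retrieved at ranks 1 and 3.
score = eleven_point_avg_precision([True, False, True, False], 2)
print(round(score, 3))  # approx. 0.848
```

In an evaluation like the one described, such a per-query score would typically be averaged over all queries. If the reported 17% is read as a relative gain, a baseline of 0.5169 would correspond to an expanded-query score of roughly 0.605.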

97 citations

Journal Article

Content-Based 3-D Model Retrieval: A Survey

Yang Yu-bin¹, Lin Hui², Zhang Yao¹

Nanjing University¹, The Chinese University of Hong Kong²

01 Nov 2007

TL;DR: The system framework and some key techniques of content-based 3D model retrieval are identified and explained, including canonical coordinate normalization and preprocessing, feature extraction, similarity match, query representation and user interface, and performance evaluation.

Abstract: As the number of available 3D models grows, there is an increasing need to index and retrieve them according to their contents. This paper provides a survey of the up-to-date methods for content-based 3D model retrieval. First, the new challenges encountered in 3D model retrieval are discussed. Then, the system framework and some key techniques of content-based 3D model retrieval are identified and explained, including canonical coordinate normalization and preprocessing, feature extraction, similarity match, query representation and user interface, and performance evaluation. In particular, similarity measures using semantic clues and machine learning methods, as well as retrieval approaches using nonshape features, are given adequate recognition as improvements and complements for traditional shape-matching techniques. Typical 3D model retrieval systems and search engines are also listed and compared. Finally, future research directions are indicated, and an extensive bibliography is provided.

97 citations

Journal Article

The Creation of New Knowledge by Information Retrieval and Classification.

Roy Davies¹

University of Exeter¹

01 Apr 1989-Journal of Documentation

TL;DR: This paper reviews previous work on producing knowledge by information retrieval or classification and describes techniques by which hidden knowledge may be retrieved, e.g. serendipity in browsing, use of appropriate search strategies and, possibly in the future, methods based on Farradane's relational indexing or artificial intelligence.

Abstract: Knowledge can be created by drawing inferences from what is already known. Often some of the requisite information is lacking and has to be gathered by whatever research techniques are appropriate, e.g. experiments, surveys etc. Even if the information has all been published already, unless it is retrieved no inferences will be drawn from it and consequently there will exist some knowledge that is implicit in the literature and yet is not known by anyone. This ‘undiscovered public knowledge’, as it is termed by Swanson, may exist in the following forms: (i) a hidden refutation or qualification of a hypothesis; (ii) an undrawn conclusion from two or more premises; (iii) the cumulative evidence of weak, independent tests; (iv) solutions to analogous problems; (v) hidden correlations between factors. Methods of classification may also play a direct role in the creation of original knowledge. Novel solutions to problems may be discovered by generating different combinations of the basic features of the solutions, as is done in morphological analysis. Alternatively a natural classification may identify gaps in existing knowledge. This paper reviews previous work on producing knowledge by information retrieval or classification and describes techniques by which hidden knowledge may be retrieved, e.g. serendipity in browsing, use of appropriate search strategies and, possibly in the future, methods based on Farradane's relational indexing or artificial intelligence.

97 citations

Metrics

Papers: 6,866

Citations: 224,605

No. of papers in the topic in previous years
Year	Papers
2023	9
2022	39
2021	107
2020	130
2019	144
2018	111