Topic

Document retrieval

About: Document retrieval is a research topic. Over its lifetime, 6,821 publications have been published within this topic, receiving 214,383 citations.


Papers
Patent
17 Jul 1991
TL;DR: A methodology for retrieving textual data objects in a multiplicity of languages is disclosed, in which data objects are treated in the statistical domain by presuming that there is an underlying, latent semantic structure in the usage of words in each language under consideration.
Abstract: A methodology for retrieving textual data objects in a multiplicity of languages is disclosed. The data objects are treated in the statistical domain by presuming that there is an underlying, latent semantic structure in the usage of words in each language under consideration. Estimates to this latent structure are utilized to represent and retrieve objects. A user query is recouched in the new statistical domain and then processed in the computer system to extract the underlying meaning to respond to the query.

393 citations

Proceedings ArticleDOI
13 Apr 1996
TL;DR: Results indicate that novice IR system users, after minimal training, were able to use the baseline system reasonably effectively and availability and use of relevance feedback increased retrieval effectiveness.
Abstract: This study investigates the use and effectiveness of an advanced information retrieval (IR) system (INQUERY). 64 novice IR system users were studied in their use of a baseline version of INQUERY compared with one of three experimental versions, each offering a different level of interaction with a relevance feedback facility for automatic query reformulation. Results, in an information filtering task, indicate that: these subjects, after minimal training, were able to use the baseline system reasonably effectively; availability and use of relevance feedback increased retrieval effectiveness; and increased opportunity for user interaction with and control of relevance feedback made the interactions more efficient and usable while maintaining or increasing effective-

392 citations

Journal ArticleDOI
TL;DR: In this paper, the authors summarized the theory of psychological relevance proposed by Dan Sperber and Deirdre Wilson (1986) to explicate the relevance of speech utterances to hearers in everyday conversation.
Abstract: This article summarizes the theory of psychological relevance proposed by Dan Sperber and Deirdre Wilson (1986), to explicate the relevance of speech utterances to hearers in everyday conversation. The theory is then interpreted as the concept of relevance in information retrieval, and an extended example is presented. Implications of psychological relevance for research in information retrieval; evaluation of information retrieval systems; and the concepts of information, information need, and the information-seeking process are explored. Connections of the theory to ideas in bibliometrics are also suggested. © 1992 John Wiley & Sons, Inc.

390 citations

Patent
25 Sep 2001
TL;DR: An extension of an inverse inference search engine is disclosed that provides cross-language document retrieval; the information matrix used as input to the inverse inference engine is organized into rows of blocks corresponding to languages within a predetermined set of natural languages.
Abstract: An extension of an inverse inference search engine is disclosed which provides cross-language document retrieval, in which the information matrix used as input to the inverse inference engine is organized into rows of blocks corresponding to languages within a predetermined set of natural languages. The information matrix is further organized into two column-wise partitions. The first partition consists of blocks of entries representing fully translated documents, while the second partition is a matrix of blocks of entries representing documents for which translations are not available in all of the predetermined languages. Further, in the second partition, entries in blocks outside the main diagonal of blocks are zero. Another disclosed extension to the inverse inference document retrieval system supports automatic, knowledge-based training. This approach applies the idea of using a training set to the problem of searching databases where information is diluted or not reliable enough to allow the creation of robust semantic links. To address this situation, the disclosed system loads the left-hand partition of the input matrix for the inverse inference engine with information from reliable sources.

385 citations
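
The block layout described in the patent abstract above can be illustrated with a small sketch. This is not the patent's implementation: the function name `build_information_matrix`, the use of raw term counts as matrix entries, and the toy vocabulary are all assumptions made for illustration. The sketch only shows the structure the abstract states: one row block per language, a left column partition for fully translated documents, and a right partition whose off-diagonal blocks are zero.

```python
def build_information_matrix(vocab, translated_docs, monolingual_docs):
    """Build a term-by-document matrix with one row block per language.

    vocab: dict mapping language -> list of terms (hypothetical toy vocabulary).
    translated_docs: list of dicts mapping language -> token list; each dict
        is one document available in every language (left partition).
    monolingual_docs: dict mapping language -> list of token lists; documents
        available in a single language only (right partition).
    Entries are raw term counts, which is an assumption of this sketch.
    """
    languages = list(vocab)
    # Row blocks: all terms of the first language, then the second, and so on.
    row_labels = [(lang, term) for lang in languages for term in vocab[lang]]
    row_index = {label: i for i, label in enumerate(row_labels)}

    n_cols = len(translated_docs) + sum(
        len(monolingual_docs.get(lang, [])) for lang in languages
    )
    matrix = [[0] * n_cols for _ in row_labels]

    col = 0
    # Left partition: a translated document fills every language's row block.
    for doc in translated_docs:
        for lang in languages:
            for token in doc[lang]:
                key = (lang, token)
                if key in row_index:
                    matrix[row_index[key]][col] += 1
        col += 1

    # Right partition: a monolingual document fills only its own language's
    # row block, so blocks off the main diagonal stay zero.
    for lang in languages:
        for tokens in monolingual_docs.get(lang, []):
            for token in tokens:
                key = (lang, token)
                if key in row_index:
                    matrix[row_index[key]][col] += 1
            col += 1

    return row_labels, matrix


vocab = {"en": ["cat", "dog"], "fr": ["chat", "chien"]}
translated = [{"en": ["cat", "cat"], "fr": ["chat", "chat"]}]
monolingual = {"en": [["dog"]], "fr": [["chien", "chat"]]}

row_labels, matrix = build_information_matrix(vocab, translated, monolingual)
for label, row in zip(row_labels, matrix):
    print(label, row)
```

In this toy example, column 0 is the translated document and has entries in both the English and French row blocks, while columns 1 and 2 are monolingual and have entries in only one row block each.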

Journal ArticleDOI
TL;DR: It is argued here that the advanced information retrieval research community is missing an opportunity to design systems that are in better harmony with the actual preferences of many users—sophisticated systems that provide an optimal combination of searcher control and system retrieval power.
Abstract: Many users of online and other automated information systems want to take advantage of the speed and power of automated retrieval, while still controlling and directing the steps of the search themselves. They do not want the system to take over and carry out the search entirely for them. Yet the objective of much of current theory and experimentation in information retrieval systems and interfaces is to design systems in which the user has either no or only reactive involvement with the search process. It is argued here that the advanced information retrieval research community is missing an opportunity to design systems that are in better harmony with the actual preferences of many users—sophisticated systems that provide an optimal combination of searcher control and system retrieval power. The user may be provided effective means of directing the search if capabilities specific to the information retrieval process, that is, strategic behaviors normally associated with information searching, are incorporated into the interface. There are many questions concerning (1) the degree of user vs. system involvement in the search, and (2) the size, or chunking, of activities; that is, how much and what type of activity the user should be able to direct the system to do at once. These two dimensions are analyzed and a number of configurations of system capability that combine user and system control are presented and discussed. In the process, the concept of the information search stratagem is introduced, and particular attention is paid to the provision of strategic, as opposed to purely procedural capabilities for the searcher. Finally, certain types of user-system relationship are selected as deserving particular attention in future information retrieval system design, and arguments are made to support the recommendations.

383 citations


Network Information
Related Topics (5)
Web page: 50.3K papers, 975.1K citations, 81% related
Metadata: 43.9K papers, 642.7K citations, 79% related
Recommender system: 27.2K papers, 598K citations, 79% related
Ontology (information science): 57K papers, 869.1K citations, 78% related
Natural language: 31.1K papers, 806.8K citations, 77% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    9
2022    39
2021    107
2020    130
2019    144
2018    111