scispace - formally typeset
Search or ask a question
Topic

Document retrieval

About: Document retrieval is a research topic. Over the lifetime, 6821 publications have been published within this topic receiving 214383 citations.


Papers
More filters
Patent
15 Apr 2004
TL;DR: In this paper, a search query from an end-user is received and a set of non-enhanced documents are extracted from a base document and the query terms are associated with one or more respective references to the base document.
Abstract: Systems and methods for enhanced document retrieval are described. In one aspect, a search query from an end-user is received. Responsive to receiving the search query, search results are retrieved. The search results include an enhanced document and a set of non-enhanced documents. The enhanced document and the non-enhanced documents include term(s) of the search query. The enhanced document is derived from a base document. The base document was modified with metadata mined from one or more different documents. The metadata is associated with one or more respective references to the base document. The one or more different documents are independent of the base document.

75 citations

Proceedings ArticleDOI
01 Jul 1997
TL;DR: Q~T (QWXYUaa Interfacewith Light Tmnaladon8)k a prototype implementadon of a completecmas-langusge textretrieval 8ystemtbat takeaFkgli8hqude3snd pmduceaw811glo8a translations of Spani8hdocumenla.
Abstract: Q~T (QWXYUaa Interfacewith Light Tmnaladon8)k a prototype implementadon of a completecmas-langusge textretrieval 8ystemtbat takeaFkgli8hqude3snd pmduceaw811glo8a translations of Spani8hdocumenla.The ayatcmindexesthe Spmiah(kmmnts inspatda but-* the B@iahquayiIltoa Spaniab equivalent aet thraugh a novel combinadoo of lexical methodsandpardkkorpus diaambwatkm. Similarmcthodaare appliedtotherctwned documontto -a-transladm thatcanbeemmined bynon-spaniah weakemtogauwhc~ vanceofthe documenttotheorkinalmquuy. lkaystem integralwtraditional,ghaary-bad machinetramhdon technologywith informationmlriev81awoacka 8nddemomtmtfxhat mhtively simpletermaubatitutionand diaambigudkn approached can be viabk for cross-languagetext refrieval.

75 citations

Journal ArticleDOI
01 May 1995
TL;DR: In this paper, the authors consider division of documents into parts as a solution to the problem of the range of document sizes and show that, for databases with long documents, use of document parts can improve the quality of the information presented to the user.
Abstract: Management and retrieval of large volumes of text can be expensive in both space and time. Moreover, the range of document sizes in a large collection such as TREC presents difficulties for both the retrieval mechanism and the user. We consider division of documents into parts as a solution to the problem of the range of document sizes, and show that, for databases with long documents, use of document parts can improve the quality of the information presented to the user. We also describe the compressed text database system we use to manage the TREC collection; the compressed inverted files with which it is indexed; and the techniques we use to evaluate the TREC queries, both on whole documents and on document parts.

75 citations

Patent
01 Oct 1990
TL;DR: An apparatus for document browsing specifically for document retrieval systems is described in this article, which enables users to see multiple document pages on the same screen at the same time in a first mode and to see a bundle of pages on a screen in a second mode.
Abstract: An apparatus for document browsing, specifically for document retrieval systems. The browsing apparatus enables users to see multiple document pages on the same screen at the same time in a first mode and to see a bundle of pages on a screen in a second mode. The images shown on the screen are produced internally according to the user's commands. The pages may be flipped in either direction and selected pages may be marked for later printing instructions.

75 citations


Network Information
Related Topics (5)
Web page
50.3K papers, 975.1K citations
81% related
Metadata
43.9K papers, 642.7K citations
79% related
Recommender system
27.2K papers, 598K citations
79% related
Ontology (information science)
57K papers, 869.1K citations
78% related
Natural language
31.1K papers, 806.8K citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20239
202239
2021107
2020130
2019144
2018111