scispace - formally typeset
Search or ask a question
Topic

Document retrieval

About: Document retrieval is a research topic. Over the lifetime, 6821 publications have been published within this topic receiving 214383 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: An approach to the document-retrieval problem that aims to increase the efficiency and effectiveness of document- retrieval systems by exploiting the semantic contents of the documents is presented.
Abstract: An approach to the document-retrieval problem that aims to increase the efficiency and effectiveness of document-retrieval systems by exploiting the semantic contents of the documents is presented. The document retrieval problem is delineated, and conceptual document modeling basics and requirements are discussed. An experimental system, the Multimedia Office Server (Multos), which implements some of the document-model concepts described, is presented. >

80 citations

Proceedings Article
01 Jan 2001
TL;DR: It is shown how indexing quality may be evaluated in a machine learning setting and this methodology is applied to results of the Indexing Initiative at the National Library of Medicine.
Abstract: For computational purposes documents or other objects are most often represented by a collection of individual attributes that may be strings or numbers. Such attributes are often called features and success in solving a given problem can depend critically on the nature of the features selected to represent documents. Feature selection has received considerable attention in the machine learning literature. In the area of document retrieval we refer to feature selection as indexing. Indexing has not traditionally been evaluated by the same methods used in machine learning feature selection. Here we show how indexing quality may be evaluated in a machine learning setting and apply this methodology to results of the Indexing Initiative at the National Library of Medicine.

80 citations

Patent
12 Nov 1997
TL;DR: A calendar-based user interface for electronically stored documents is provided by virtue of the present invention as discussed by the authors, where each document has an associated time and an icon representing a particular document is presented at a location corresponding to the time of the document.
Abstract: A calendar-based user interface for electronically stored documents is provided by virtue of the present invention. Each document has an associated time. A calendar view is constructed including an icon representing a particular document. The icon is presented at a location corresponding to the time of the document. In one embodiment, the calendar view includes a month view with each date having a corresponding area for presentation of document icons for documents processed on that date.

80 citations

Book
01 Jan 2002
TL;DR: In this paper, the technologies of document retrieval, information extraction, and text categorization are discussed in a way which highlights commonalities in terms of both general principles and practical concerns.
Abstract: This text covers the technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and practical concerns. It assumes some mathematical background on the part of the reader, but the chapters typically begin with a non-mathematical account of the key issues. Current research topics are covered only to the extent that they are informing current applications; detailed coverage of longer term research and more theoretical treatments should be sought elsewhere. There are many pointers at the ends of the chapters that the reader can follow to explore the literature. However, the book does maintain a strong emphasis on evaluation in every chapter both in terms of methodology and the results of controlled experimentation.

80 citations

Journal ArticleDOI
TL;DR: This work improves the current Web information retrieval approach by raising the efficiency of information retrieval, enhancing the preciseness and mobility of information services, and enabling intelligent information services.

80 citations


Network Information
Related Topics (5)
Web page
50.3K papers, 975.1K citations
81% related
Metadata
43.9K papers, 642.7K citations
79% related
Recommender system
27.2K papers, 598K citations
79% related
Ontology (information science)
57K papers, 869.1K citations
78% related
Natural language
31.1K papers, 806.8K citations
77% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20239
202239
2021107
2020130
2019144
2018111