scispace - formally typeset
Search or ask a question

Showing papers on "Document retrieval published in 1972"



Journal ArticleDOI
TL;DR: Although seemingly meaningful clusters can be obtained, the results indicate that the effort involved in finding clusters and adding the clustered terms to queries is far too great to warrant their use in an operational system.

96 citations


Journal ArticleDOI
Gerard Salton1
TL;DR: A comparison was made of the performance in an automatic information retrieval environment of user queries and document abstracts available in natural language form in both English and French to indicate that the automatic indexing and retrieval techniques used appear equally effective in handling the query and document texts in both languages.

57 citations


Journal ArticleDOI
Gerard Salton1
TL;DR: A new dynamic document environment is outlined in which clustered files are searched and information is retrieved following an interactive user-controlled search process and methods are described for an automatic query modification based on user needs.
Abstract: The current role of computers in automatic document processing is briefly outlined, and some reasons are given why the early promise of library automation and of the mechanization of documentation processes has not been fulfilledA new dynamic document environment is then outlined in which clustered files are searched and information is retrieved following an interactive user-controlled search process Methods are described for an automatic query modification based on user needs, and for a continuous reorganization of the stored information as a function of earlier file processing and of normal collection growth The proposed procedures provide powerful tools for information retrieval and for the control of dynamic library collections in which new items are continually added and ones are retired

47 citations



Journal ArticleDOI
TL;DR: It is emphasized that one cannot conclude from these experiments that term clusters (or equivalently, keyword classifications or thesauruses) are not useful in retrieval, and more encouraging results might be obtained using one or more of the following strategies.

10 citations



01 Jan 1972
TL;DR: The results indicate that the two systems perform at approximately the same level of effectiveness, although estimated average total retrieval was found to be slightly greater for free-text searching than for descriptor searching at all levels of recall.
Abstract: : The study compares the retrieval effectiveness of two alternative input and search systems in terms of such measures as recall, fallout, precision, and total retrieval. One system operates using manually indexed document files searched by controlled vocabulary while the other employs full- text input using natural language searching. Both systems are applied to a common data base and hardware. Operational information needs were used in the form of request statements from actual users. From these statements of need, search queries were formulated for both systems and recall estimates calculated using a recall base that was pre-specified by the request origninator. The queries were processed and total retrieval, fallout and precision ratios were calculated for both systems. The results indicate that the two systems perform at approximately the same level of effectiveness, although estimated average total retrieval was found to be slightly greater for free-text searching than for descriptor searching at all levels of recall.

6 citations


Journal ArticleDOI
TL;DR: Es wird zur Diskussion gestellt, ein solches Verfahren für Dokumentauskunft als Alternative zu der Faksimile-Punktrasterübertragung of Mikrofilmbildern zu verwenden, die durch den Nachteil einer relativ großen benötigten Kanalgeschwindigkeit > 5 Mbitjs gekennzeich
Abstract: Nach einem Überblick über ein Rechnersystem Jiir Bildverkehr werden verschiedene zu übertragende Bildarten klassifiziert : alpha-numerischer Text, Symbolund Kurvengraphik sowie Grauwertbilder (Photos). Von einfachen Strukturen ausgehend, wird die Grundstruktur eines Sichtgerätetyps für die ersten drei Bildarten beschrieben, die vor allem durch einen SymbolGenerator mit mehreren, durch Steuerzeichen umschaltbaren Alphabeten gekennzeichnet ist ; zur Anpassung an den spezifischen Symbolbedarf der Bilder eines bestimmten Fachgebietes können die Alphabete vor Beginn der eigentlichen Bildübertragung durch den Rechner ausgewechselt werden. Ein solches Sichtgerät kann für mehrere Bilddienste eingesetzt werden: für den allgemeinen Dialog Mensch-Rechner mittels Text sowie Liniengraphik und ebenso für die Fernauswahl und -betrachtung von Dokumenten, die in zeichenweise codierter Form im Großspeicher einer Dokument-DVA gespeichert sind; die benötigte Kanalgeschwindigkeit < 10 kbit Is liegt im Bereich der neuen Datenwählnetze. Es wird zur Diskussion gestellt, ein solches Verfahren für Dokumentauskunft als Alternative zu der Faksimile-Punktrasterübertragung von Mikrofilmbildern zu verwenden, die durch den Nachteil einer relativ großen benötigten Kanalgeschwindigkeit > 5 Mbitjs gekennzeichnet ist. Es ist zweckmäßig, die für Speicherung und Übertragung benötigte zeichenweise codierte Darstellung von Text und Liniengraphiken der Dokumente in gleicher Form auch für den rechnerunterstützten Satz und Druck der Publikation zu verwenden. Als Voraussetzung hierzu sind Vereinbarungen über Darstellungsformate, neue Alphabete bzw. Codes, Behandlung von Grauwertbildern usw. auszuarbeiten.

3 citations


Journal ArticleDOI
TL;DR: Lack of modern resources has severely hampered the scholar in education, humanities, and social science disciplines and attributes the scientists' research advantage to readily available data bases arranged to permit rapid searching.
Abstract: With the expansion of every academic discipline and the burgeoning of published research, the problem of keeping informed has become overwhelming.1 Because individual scholars frequently express a need for computer assistance, especially for information storage and retrieval, computer systems have begun to meet the current needs. Societies, conferences, and workshops on computerassisted research have increased significantly in the last nine years.2 Condon views the computer as an effective tool for the modern researcher, for \"the computer seems really to present the scholar with the possibility of exercising control over a literature that has become almost burdensome to him.\"3 Lack of modern resources, however, has severely hampered the scholar in education, humanities, and social science disciplines. Fogel has pointed up the drastic disparity between the scientist and the humanist with respect to time and effort required for completing research studies, He attributes the scientists' research advantage to readily available data bases arranged to permit rapid searching.4 1 This article is based on John S. Edwards' doctoral dissertation, A Model Computer Assisted Information System in Music Education (University of Georgia, 1970). 2 Charles L. Ruttenberg, \"Report on Data Archives in Social Sciences,\" American Behavioral Scientists, Vol. 18 (April 1965), p. 333; Gerald Lefkoff, Papers from the Western Virginia University Conference on Computer Applications in Music, 1966 (Morgantown, West Virginia: West Virginia University Library, 1967); Harry B. Lincoln, \"The Computer Seminar at Binghamton: A Report,\" Notes, Vol. 22 (December 1966), p. 236.

3 citations


Journal ArticleDOI
TL;DR: This study proposes and investigates file ordering and retrieval techniques which will reduce the average cost of retrieving records which satisfy a query, and increase the rate of retrieval in the initial portion of the response period.

Journal Article
TL;DR: During the last decade the most outstanding advances in the computer field have been increased speeds of operation, larger auxiliary random access to these memories, and similar progress had to be made in the software arena to take advantage of this added new power.
Abstract: During the last decade the most outstanding advances in the computer field have been increased speeds of operation, larger auxiliary random access to these memories. Since these advances obviously are in the hardware domain, i.e., they are technological advances, similar progress had to be made in the software arena to take advantage of this added new power.



Journal Article
TL;DR: ZUCCHINI, a system currently under development to permit the user increased control over his data base, is described and certain aspects of user languages and data structures in information retrieval are discussed.
Abstract: This paper discusses briefly, certain aspects of user languages and data structures in information retrieval. ZUCCHINI, a system currently under development to permit the user increased control over his data base, is then described.