scispace - formally typeset
Search or ask a question

Showing papers on "Document layout analysis published in 1984"


Book
01 Jan 1984
TL;DR: This study sees that communication (feedback) about the queries of inquirers searching for a given document can be incorporated by a retrieval system in order to redescribe that document so that its description matches better those queries.
Abstract: The central problem in document retrieval is that the subject of a document may be described in many different ways and, similarly, different inquirers may express similar information needs by a variety of different queries. This variance makes it difficult to get the "right" documents into the hands of the "right" inquirers, for retrieving a document by means of its subject description depends on that subject description adequately matching an inquirer's query. Document descriptions comprise only one part of a retrieval system, and a "good" document description is one that describes the subject of a document in a way that will match the queries of inquirers who will find that document relevant to their information need. In this study, we see that communication (feedback) about the queries of inquirers searching for a given document can be incorporated by a retrieval system in order to redescribe that document so that its description matches better those queries. An adaptive (genetic) algorithm, responsible for such redescription, achieves two aims: first, it increases the probability of a document's subject description matching a query to which the document is relevant (equivalently, it increases the degree of association between a document and a relevant query); second, the algorithm decreases the probability of a document's subject description matching a query to which the document is not relevant (equivalently, it decreases the degree of association between a document and a non-relevant query). Simulation experiments demonstrate the success of adaptive subject redescription in achieving these aims. The simulation technique, itself, is novel: By establishing a set of queries, (to some of which a document is relevant, the rest of which it is not), and measuring the association between the document's description and each of these queries, we obtain estimates of system recall and fallout without building an actual document collection. The method of obtaining such "simulated queries" is described. The simulation technique may help provide a solution to the problem of predicting the performance of a large-scale retrieval system based on its operation in a smaller-scale experimental setting.

6 citations


Patent
19 Dec 1984
TL;DR: In this article, the information concerning the estimation of layout in the process of a layout plan by a dialog processing is obtained and displayed as a pair at an output device at the computer main body.
Abstract: PURPOSE: To improve the layout design processing efficiency by obtaining and displaying the information concerning the estimation of a layout in the process of a layout plan by a dialog processing. CONSTITUTION: First, position data 5, cost data 6 and object data 7 are fetched into a computer main body 2 in accordance with the procedure of a program 8 as initial data. After that, by noticing one or two objects out of layout objects in accordance with the procedure of the program 8, under the conditions of fixing others, the cost as the layout estimation value to the shifting of noticed objects is counted at a central arithmetic unit 4, and the shifting of the object and the changing quantity of the cost are obtained as a pair. After this, in the obtained result, the local shifting of the object of the layout and the change of the layout estimation value to it are outputted and displayed as a pair at an output device 3. The position data are renewed by a layout designer in accordance with the necessity. COPYRIGHT: (C)1986,JPO&Japio

5 citations


Patent
14 Aug 1984
TL;DR: In this article, an apparatus and process for displaying the layout of text in a text preparing apparatus is described, in order to save the space on the display during layout display, the characters, symbols etc. are converted into plural display elements in compressed form to enable the operator to identify the species of the printed characters.
Abstract: There is disclosed an apparatus and process for displaying the layout of text in a text preparing apparatus. In order to save the space on the display during layout display, the characters, symbols etc. are converted into plural display elements in compressed form to enable the operator to identify the species of the printed characters, symbols etc.

1 citations