scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Patent
27 Nov 2002
TL;DR: In this paper, a plurality of document definition information for identifying documents, and format control information for recognizing a character recorded on a document corresponding to each of the plurality of definition information are held beforehand.
Abstract: A plurality of document definition information for identifying documents, and format control information for recognizing a character recorded on a document corresponding to each of the plurality of document definition information are held beforehand, documents targeted for character recognition are identified as specific documents based on document images of the entered documents targeted for character recognition and the document definition information and, based on a result of the identification, character recognition is executed by using corresponding format control information. A document definition device adds a plane area of each of documents to be identified to the document definition. An OCR device checks the plane area on the document by using the document definition before check of a preprint accompanied by character recognition.

42 citations

Patent
23 Sep 2008
TL;DR: In this paper, the authors propose a system for determining a logical structure of a document, which stores a collection of models, each of which describes one or more possible logical structures.
Abstract: The invention relates to methods for determining a logical structure of a document. The system stores a collection of models, each of which describes one or more possible logical structures. At least one document hypothesis is generated for the whole document. For each document hypothesis, the system verifies the document hypothesis on each page, for example, by generating at least one block hypothesis for each block in the document based on the document hypothesis, selecting a best block hypothesis for each block, selecting the model that corresponds to a best document hypothesis the document hypothesis that has a best degree of correspondence with the selected best block hypotheses for the document, and forming a representation of the document based on the best document hypothesis described.

42 citations

Patent
Aravind Bala1, Andrew J. Mcglinchey1, James D. Jacoby1, Hsiao-Wuen Hon1, Saikat Sen1 
08 Jul 2004
TL;DR: In this paper, a text generator automatically generates a text document (235, 245) based on the actions of an author (201) on a user interface (205), which provides instruction or other information to a user.
Abstract: A text generator (200) automatically generating a text document (235, 245) based on the actions of an author (201) on a user interface (205). To generate the text document (235) the author (201) activates a recording component (210). The recording component (210) records the author’s actions on the user interface (205). Based on the recorded actions, a text generation component (230) searches a text database (220) and identifies an entry that matches the author’s recorded actions. This text is then combined to form a text document (235), which provides instruction or other information to a user. During the process of generating the text document (235, 245), the text can be edited using an editor (240) as desired, such as to enhance the comprehensibility of the document (235, 245).

42 citations

Patent
John C. Handley1
31 Mar 2005
TL;DR: In this article, a system for classifying a genre of an electronic document may include a network processor configured to parse the RTF document into lines of text ordered from top to bottom and left to right and assign tokens to each line of text based on content of the line and to line separators based on space between blocks of lines.
Abstract: A system for classifying a genre of an electronic document may include a network processor configured to receive an electronic document and convert the electronic document to rich text format (RTF). The processor may be configured to parse the RTF document into lines of text ordered from top to bottom and left to right and assign tokens to each line of text based on content of the line and to line separators based on space between blocks of lines. The network processor may be configured to sequence the tokens, parse the tokenized document with a number of pre-defined document grammars, determine a probability for each genre corresponding to the electronic document, and classify the electronic document as the genre with the highest probability.

42 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189