scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: This paper applies an adaptive regrouping strategy to Chinese documents whose complexity derives from the coexistence of horizontal and vertical textlines, and results are obtained using this approach.

17 citations

Proceedings ArticleDOI
TL;DR: The Document Description Framework as discussed by the authors is a model for declarative document layout and processing where documents are treated as functional programs and a canonical XML tree contains nodes describing layout instructions which will modify and combine their children component parts to build sections of the final presentation.
Abstract: Highly customised variable-data documents make automatic layout of the resulting publication hard. Architectures for defining and processing such documents can benefit if the repertoire of layout methods available can be extended smoothly and easily to accommodate new styles of customisation. The Document Description Framework incorporates a model for declarative document layout and processing where documents are treated as functional programs. A canonical XML tree contains nodes describing layout instructions which will modify and combine their children component parts to build sections of the final presentation. Leaf components such as images, vector graphic fragments and text blocks are 'rendered' to make consistent graphical atoms. These parts are then processed by layout agents, described and parameterised by their parent nodes, which can range from simple layouts like translations, flows, encapsulations and tables through to highly complex arrangements such as constraint-solution or pagination. The result then becomes a 'molecule' for processing at a higher level of the layout tree. A variable and reference mechanism is included for resolving rendering interdependency and supporting component reuse. Addition of new layout types involves definition of a new combinator node and attachment of a suitable agent.

17 citations

Patent
Yoshiaki Kurosawa1, Katsumi Kato1
20 Mar 2000
TL;DR: In this paper, a document image is inputted as image data from an image inputting section, and the layout of the document image was analyzed on the basis of the image data to obtain layout constituents.
Abstract: In a document image processing apparatus, a document image is inputted as image data from an image inputting section. In a layout analyzing section, the layout of the document image is analyzed on the basis of the image data to obtain layout constituents. In an image processing section, the document image is processed. The image processing section includes an editor for specifying a position to be edited in the document image and editing the document image, on the basis of position/size data on the layout constituents, in accordance with the operator's instructions from the operation data inputting section. In an image displaying section, the document image is displayed in cooperation with the image processing section.

17 citations

Patent
25 Jun 2014
TL;DR: In this article, an image of a form or document is captured and an automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document.
Abstract: Automatic classification of different types of documents is disclosed. An image of a form or document is captured. The document is assigned to one or more type definitions by identifying one or more objects within the image of the document. A matching model is selected via identification of the document image. In the case of multiple identifications, a profound analysis of the document type is performed—either automatically or manually. An automatic classifier may be trained with document samples of each of a plurality of document classes or document types where the types are known in advance or a system of classes may be formed automatically without a priori information about types of samples. An automatic classifier determines possible features and calculates a range of feature values and possible other feature parameters for each type or class of document. A decision tree, based on rules specified by a user, may be used for classifying documents. Processing, such as optical character recognition (OCR), may be used in the classification process.

17 citations

Proceedings ArticleDOI
11 Apr 2016
TL;DR: A very simple technique based on FAST key points that highlights that accurate text extraction could be achieved without complex approach and could also be easily improved to be more precise, robust and useful for more complex layout analysis.
Abstract: During past years, text extraction in document images has been widely studied in the general context of Document Image Analysis (DIA) and especially in the framework of layout analysis. Many existing techniques rely on complex processes based on preprocessing, image transforms or component/edges extraction and their analysis. At the same time, text extraction inside videos has received an increased interest and the use of corner or key points has been proven to be very effective. Because it is noteworthy to notice that very few studies were performed on the use of corner points for text extraction in document images, we propose in this paper to evaluate the possibilities associated with this kind of approach for DIA. To do that, we designed a very simple technique based on FAST key points. A first stage divide the image into blocks and the density of points inside each one is computed. The more dense ones are kept as text blocks. Then, connectivity of blocks is checked to group them and to obtain complete text blocks. This technique has been evaluated on different kind of images: different languages (Telugu, Arabic, French), handwritten as well as typewritten, skewed documents, images at different resolution and with different kind and amount of noises (deformations, ink dot, bleed through, acquisition (blur, resolution)), etc. Even with fixed parameters for all such kind of documents images, the precision and recall are close or higher to 90% which makes this basic method already effective. Consequently, even if the proposed approach does not propose a breakthrough from theoretical aspects, it highlights that accurate text extraction could be achieved without complex approach. Moreover, this approach could also be easily improved to be more precise, robust and useful for more complex layout analysis.

17 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189