scispace - formally typeset
Search or ask a question
Topic

Document layout analysis

About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.


Papers
More filters
Patent
01 Apr 2005
TL;DR: In this paper, a reading machine applies text-to-speech to a text file that corresponds to the selected section of the document, to read the selected sections of a document aloud to the user.
Abstract: Controlling a reading machine while reading a document to a user by receiving an image of a document, accessing a knowledge base that provides data that identifies sections in the document and processing user commands to select a section of the document. The reading machine applies text-to-speech to a text file that corresponds to the selected section of the document, to read the selected section of the document aloud to the user.

24 citations

Proceedings ArticleDOI
27 Mar 2000
TL;DR: A new method that forms large connected components by a smoothing algorithm and calculates the document skew by finding the orientation of the minimum-area bounding rectangle of one of several connected components is presented.
Abstract: Detection of document skew is an important step in document image analysis. The paper presents a new method for calculation of document skew. The method forms large connected components by a smoothing algorithm and calculates the document skew by finding the orientation of the minimum-area bounding rectangle of one of several connected components. Connection of text to non-text in the smoothing step does not degrade the performance of the method. The smoothing parameters are determined automatically and no manual adjustment is necessary. The method is not limited in the range of detectable skew angles and the achievable accuracy. Experimental results show the high performance of the algorithm in detecting document skew for a variety of documents with different levels of complexity.

24 citations

Proceedings ArticleDOI
19 Dec 2003
TL;DR: In this article, a novel image representation of compound document images, called SmartNails, is presented to overcome poor readability of text and recognizability of image features in low resolution thumbnails.
Abstract: In order to overcome poor readability of text and recognizability of image features in low resolution thumbnails, a novel image representation of compound document images - a SmartNail representation - is presented. SmartNails are replacements or supplements to traditional thumbnails for compound documents and contain cropped and scaled image and text segments. Image- and text-based analysis are merged to generate a layout for a particular display size with selected readable text and recognizable image regions. The analysis is efficiently performed by using information from document layout analysis and JPEG 2000 compressed file headers.

24 citations

Proceedings ArticleDOI
08 Feb 2015
TL;DR: A new dataset and a ground-truthing methodology for layout analysis of historical documents with complex layouts, targeting the simplicity and the efficiency of the layout ground truthing process on historical document images is proposed and developed.
Abstract: In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of historical documents with complex layouts. The dataset is based on a generic model for ground-truth presentation of the complex layout structure of historical documents. For the purpose of extracting uniformly the document contents, our model defines five types of regions of interest: page, text block, text line, decoration, and comment. Unconstrained polygons are used to outline the regions. A performance metric is proposed in order to evaluate various page segmentation methods based on this model. We have analysed four state-of-the-art ground-truthing tools: TRUVIZ, GEDI, WebGT, and Aletheia. From this analysis, we conceptualized and developed Divadia, a new tool that overcomes some of the drawbacks of these tools, targeting the simplicity and the efficiency of the layout ground truthing process on historical document images. With Divadia, we have created a new public dataset. This dataset contains 120 pages from three historical document image collections of different styles and is made freely available to the scientific community for historical document layout analysis research.

24 citations

Patent
Akio Nakajima1
19 Dec 1994
TL;DR: In this article, a digital copying machine is used to reproduce a document image and the document image is reproduced on a paper, where a user sets a document type, the setting by the user may have priority than the decision according to the document direction.
Abstract: In a digital copying machine, a document image is read and the document image is reproduced on a paper. If a document has a size which does not limit a direction of document on a platen For reading the document image, the document type of portrait or landscape is decided according to the direction of document on the platen. If a user sets a document type, the setting by the user may have priority than the decision according to the document direction. If a document has a size which can be set only along a specified direction for reading the document image, it is requested for a user to set the document direction (portrait document or a landscape document). Then, for example, a margin can be set suitably according to the document type.

24 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
82% related
Feature (computer vision)
128.2K papers, 1.7M citations
82% related
Object detection
46.1K papers, 1.3M citations
81% related
Image segmentation
79.6K papers, 1.8M citations
80% related
Convolutional neural network
74.7K papers, 2M citations
79% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20235
202219
202134
202019
201914
20189