Topic
Document layout analysis
About: Document layout analysis is a research topic. Over the lifetime, 1462 publications have been published within this topic receiving 34021 citations.
Papers published on a yearly basis
Papers
More filters
•
01 Apr 2005
TL;DR: In this paper, a reading machine applies text-to-speech to a text file that corresponds to the selected section of the document, to read the selected sections of a document aloud to the user.
Abstract: Controlling a reading machine while reading a document to a user by receiving an image of a document, accessing a knowledge base that provides data that identifies sections in the document and processing user commands to select a section of the document. The reading machine applies text-to-speech to a text file that corresponds to the selected section of the document, to read the selected section of the document aloud to the user.
24 citations
••
27 Mar 2000TL;DR: A new method that forms large connected components by a smoothing algorithm and calculates the document skew by finding the orientation of the minimum-area bounding rectangle of one of several connected components is presented.
Abstract: Detection of document skew is an important step in document image analysis. The paper presents a new method for calculation of document skew. The method forms large connected components by a smoothing algorithm and calculates the document skew by finding the orientation of the minimum-area bounding rectangle of one of several connected components. Connection of text to non-text in the smoothing step does not degrade the performance of the method. The smoothing parameters are determined automatically and no manual adjustment is necessary. The method is not limited in the range of detectable skew angles and the achievable accuracy. Experimental results show the high performance of the algorithm in detecting document skew for a variety of documents with different levels of complexity.
24 citations
••
19 Dec 2003TL;DR: In this article, a novel image representation of compound document images, called SmartNails, is presented to overcome poor readability of text and recognizability of image features in low resolution thumbnails.
Abstract: In order to overcome poor readability of text and recognizability of image features in low resolution thumbnails, a novel image representation of compound document images - a SmartNail representation - is presented. SmartNails are replacements or supplements to traditional thumbnails for compound documents and contain cropped and scaled image and text segments. Image- and text-based analysis are merged to generate a layout for a particular display size with selected readable text and recognizable image regions. The analysis is efficiently performed by using information from document layout analysis and JPEG 2000 compressed file headers.
24 citations
••
08 Feb 2015TL;DR: A new dataset and a ground-truthing methodology for layout analysis of historical documents with complex layouts, targeting the simplicity and the efficiency of the layout ground truthing process on historical document images is proposed and developed.
Abstract: In this paper, we propose a new dataset and a ground-truthing methodology for layout analysis of historical
documents with complex layouts. The dataset is based on a generic model for ground-truth presentation of
the complex layout structure of historical documents. For the purpose of extracting uniformly the document
contents, our model defines five types of regions of interest: page, text block, text line, decoration, and comment.
Unconstrained polygons are used to outline the regions. A performance metric is proposed in order to evaluate
various page segmentation methods based on this model. We have analysed four state-of-the-art ground-truthing
tools: TRUVIZ, GEDI, WebGT, and Aletheia. From this analysis, we conceptualized and developed Divadia, a
new tool that overcomes some of the drawbacks of these tools, targeting the simplicity and the efficiency of the
layout ground truthing process on historical document images. With Divadia, we have created a new public
dataset. This dataset contains 120 pages from three historical document image collections of different styles and
is made freely available to the scientific community for historical document layout analysis research.
24 citations
•
19 Dec 1994TL;DR: In this article, a digital copying machine is used to reproduce a document image and the document image is reproduced on a paper, where a user sets a document type, the setting by the user may have priority than the decision according to the document direction.
Abstract: In a digital copying machine, a document image is read and the document image is reproduced on a paper. If a document has a size which does not limit a direction of document on a platen For reading the document image, the document type of portrait or landscape is decided according to the direction of document on the platen. If a user sets a document type, the setting by the user may have priority than the decision according to the document direction. If a document has a size which can be set only along a specified direction for reading the document image, it is requested for a user to set the document direction (portrait document or a landscape document). Then, for example, a margin can be set suitably according to the document type.
24 citations