Proceedings ArticleDOI
A document skew detection method using run-length encoding and the Hough transform
S.C. Hinds, James L. Fisher, D.P. D'Amato +2 more
- pp 464-468
TLDR
By creating a burst image from the original document image, the processing time of the Hough transform can be reduced by a factor of as much as 7.4 for documents with gray-scale images, and interline spacing can be determined more accurately.
Abstract:
As part of the development of a document image analysis system, a method based on the Hough transform was devised for the detection of document skew and interline spacing, both necessary parameters for the automatic segmentation of text from graphics. Because the Hough transform is computationally expensive, the amount of data within a document image is reduced through the computation of its horizontal and vertical black runlengths. Histograms of these runlengths are used to determine whether the document is in portrait or landscape orientation. A gray-scale burst image is created from the black runlengths that are perpendicular to the text lines by placing the length of each run in the run's bottom-most pixel. By creating a burst image from the original document image, the processing time of the Hough transform can be reduced by a factor of as much as 7.4 for documents with gray-scale images. Because only small runlengths are input to the Hough transform, and because the accumulator array is incremented by the runlength associated with a pixel rather than by 1, the negative effects of noise, black margins, and figures are avoided. Consequently, interline spacing can be determined more accurately.
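The burst-image idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `burst_image` and `estimate_skew`, the `max_run` threshold, the candidate angle grid, and the sum-of-squares score used to pick the best angle are all assumptions made for illustration. The sketch assumes a binary image (nonzero = black) whose text lines run roughly horizontally, so the runs perpendicular to the text are vertical.

```python
import numpy as np

def burst_image(binary, max_run=25):
    """Store each vertical black run's length at the run's bottom-most pixel.

    Runs longer than max_run (an assumed threshold) are dropped, mimicking
    the abstract's "only small runlengths are input to the Hough transform".
    """
    h, w = binary.shape
    burst = np.zeros((h, w), dtype=np.int64)
    for x in range(w):
        run = 0
        for y in range(h):
            if binary[y, x]:
                run += 1
            else:
                if 0 < run <= max_run:
                    burst[y - 1, x] = run  # bottom-most pixel of the run
                run = 0
        if 0 < run <= max_run:             # run that touches the bottom edge
            burst[h - 1, x] = run
    return burst

def estimate_skew(burst, angles_deg):
    """Run-length-weighted Hough vote over candidate skew angles.

    For a line at angle a to the x-axis (image coordinates, y pointing down),
    rho = y*cos(a) - x*sin(a) is constant along the line. Each nonzero burst
    pixel adds its run length, not 1, to the accumulator. The angle whose
    rho profile is most concentrated (largest sum of squares, an assumed
    criterion) is returned.
    """
    ys, xs = np.nonzero(burst)
    weights = burst[ys, xs].astype(float)
    diag = int(np.ceil(np.hypot(*burst.shape)))
    best_angle, best_score = None, -1.0
    for a in angles_deg:
        t = np.deg2rad(a)
        rho = np.rint(ys * np.cos(t) - xs * np.sin(t)).astype(int) + diag
        acc = np.bincount(rho, weights=weights, minlength=2 * diag + 1)
        score = float(np.sum(acc ** 2))
        if score > best_score:
            best_angle, best_score = float(a), score
    return best_angle
```

Only the nonzero burst pixels vote, which is where the data reduction comes from: a run of n black pixels contributes one weighted vote instead of n unit votes. The spacing between peaks of the accumulator profile at the winning angle would then give an estimate of interline spacing.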
Citations
Book
Algorithms for image processing and computer vision
TL;DR: Algorithms for Image Processing and Computer Vision, 2nd Edition provides the tools to speed development of image processing applications.
Journal ArticleDOI
The document spectrum for page layout analysis
TL;DR: The document spectrum (or docstrum) as discussed by the authors is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, which yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Book
The document spectrum for page layout analysis
TL;DR: The document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Book
Page segmentation and classification
Theo Pavlidis, Jiangying Zhou +1 more
TL;DR: In this article, a class of techniques based on smeared run-length codes that divide a page into gray and nearly white parts is described; segmentation is then performed by finding connected components of either the gray elements or the white.
Journal ArticleDOI
Automated page orientation and skew angle detection for binary document images
TL;DR: A new and fast approach is advanced herein whereby skew angle detection takes advantage of information found using the page orientation algorithm, and it is indicated that detection accuracy can be improved by minimizing the effects of non-textual data.
References
Journal ArticleDOI
Document analysis system
TL;DR: The requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing, are outlined and several critical functions have been investigated and the technical approaches are discussed.
Journal ArticleDOI
A robust algorithm for text string separation from mixed text/graphics images
TL;DR: The development and implementation of an algorithm for automated text string separation that is relatively independent of changes in text font style and size and of string orientation are described; the algorithm showed superior performance compared to other techniques.
Journal ArticleDOI
Analysis of textual images using the Hough transform
TL;DR: Methods for handling several discretization problems that arise in mapping the rectangular image space to the (ρ, Θ) accumulator array are described.
Book ChapterDOI
Document analysis with an expert system
TL;DR: An algorithm is proposed for assigning labels to the blocks according to their location, extent, and relative position with respect to other (possibly already labeled) blocks.
Proceedings ArticleDOI
Document image understanding
TL;DR: This work examines several problems in document understanding tasks, from pixel processing issues to symbolic reasoning to global control … drawings, half-tone pictures, and icons.