scispace - formally typeset
Proceedings ArticleDOI

A document skew detection method using run-length encoding and the Hough transform

TLDR
By creating a burst image from the original document image, the processing time of the Hough transform can be reduced by a factor of as much as 7.4 for documents with gray-scale images and interline spacing can be determined more accurately.
Abstract
As part of the development of a document image analysis system, a method, based on the Hough transform, was devised for the detection of document skew and interline spacing-necessary parameters for the automatic segmentation of text from graphics. Because the Hough transform is computationally expensive, the amount of data within a document image is reduced through the computation of its horizontal and vertical black runlengths. Histograms of these runlengths are used to determine whether the document is in portrait or landscape orientation. A gray scale burst image is created from the black runlengths that are perpendicular to the text lines by placing the length of the run in the run's bottom-most pixel. By creating a burst image from the original document image, the processing time of the Hough transform can be reduced by a factor of as much as 7.4 for documents with gray-scale images. Because only small runlengths are input to the Hough transform and because the accumulator array is incremented by the runlength associated with a pixel rather than by a factor of 1, the negative effects of noise, black margins, and figures are avoided. Consequently, interline spacing can be determined more accurately. >

read more

Citations
More filters
Book

Algorithms for image processing and computer vision

TL;DR: Algorithms for Image Processing and Computer Vision, 2nd Edition provides the tools to speed development of image processing applications.
Journal ArticleDOI

The document spectrum for page layout analysis

TL;DR: The document spectrum (or docstrum) as discussed by the authors is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, which yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Book

The document spectrum for page layout analysis

TL;DR: The document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.
Book

Page segmentation and classification

TL;DR: In this article, a class of techniques based on smeared run length codes that divide a page into gray and nearly white parts are described, and then segmentation is performed by finding connected components either by the gray elements or of the white.
Journal ArticleDOI

Automated page orientation and skew angle detection for binary document images

TL;DR: A new and fast approach is advanced herein whereby skew angle detection takes advantage of information found using the page orientation algorithm, and it is indicated that detection accuracy can be improved by minimizing the effects of non-textual data.
References
More filters
Journal ArticleDOI

Document analysis system

TL;DR: The requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing, are outlined and several critical functions have been investigated and the technical approaches are discussed.
Journal ArticleDOI

A robust algorithm for text string separation from mixed text/graphics images

TL;DR: The development and implementation of an algorithm for automated text string separation that is relatively independent of changes in text font style and size and of string orientation are described and showed superior performance compared to other techniques.
Journal ArticleDOI

Analysis of textual images using the Hough transform

TL;DR: Methods for handling several discretization problems that arise in mapping the rectangular image space to the (ρ, Θ) accumulator array are described.
Book ChapterDOI

Document analysis with an expert system

TL;DR: An algorithm is proposed for assigning labels to the blocks according to their location, extent, and relative position with respect to other (possibly already labeled) blocks.
Proceedings ArticleDOI

Document image understanding

TL;DR: This work examines several rc nbproblems in document understanding tasks from pixel processing issues to symbolic xning to global control hdrawings, half-tone pictures, and icons.