scispace - formally typeset
Proceedings ArticleDOI

An algorithm for the skew normalization of document image

TLDR
An algorithm to normalize the skew of document images is proposed, which shows that when graphical elements are included in the documents in addition to printed characters, the accuracy deteriorates to 0.2 degrees.
Abstract
An algorithm to normalize the skew of document images is proposed. The skew angle is detected in two stages. In the first stage, connected regions in an image are extracted and some feature parameters are extracted for each region. In the second stage, the Hough transform is calculated for the parameters, and the angle which gives the minimum of the transform is estimated as the skew angle. In experiments using CCITT standard documents, a detection accuracy of less than 0.1 degrees is obtained for printed documents. When graphical elements are included in the documents in addition to printed characters, the accuracy deteriorates to 0.2 degrees . >

read more

Citations
More filters
Journal ArticleDOI

Segmentation methods for character recognition: from segmentation to document structure analysis

TL;DR: A pattern- oriented segmentation method for optical character recognition that leads to document structure analysis is presented, and an extended form of pattern-oriented segmentation, tabular form recognition, is considered.
Patent

Image processing system with image cropping and skew correction

TL;DR: In this article, a system and method is described for automatically determining in a scanned document image the presence of unwanted extraneous information caused by an extraneous device and scanner background information.
Journal ArticleDOI

A robust and fast skew detection algorithm for generic documents

TL;DR: A robust and fast skew detection algorithm based on hierarchical Hough transform that is capable of detecting the skew angle for various document images, including technical articles, postal labels, handwritten text, forms, drawings and bar codes is proposed.
Journal ArticleDOI

Automatic document processing: A survey

TL;DR: A basic model for document processing is described and many techniques, such as skew detection, Hough transform, Gabor filters, projection, crossing counts, form definition language, etc. which have been used in these approaches are discussed.
Proceedings ArticleDOI

Document skew detection based on local region complexity

TL;DR: A new method is proposed for detecting skew in document images which contain a mixture of text areas, photographs, figures, charts, and tables and it is proposed that skew is detected in local regions in which only text lines are expected.
References
More filters
Journal ArticleDOI

Analysis of textual images using the Hough transform

TL;DR: Methods for handling several discretization problems that arise in mapping the rectangular image space to the (ρ, Θ) accumulator array are described.
Journal ArticleDOI

International digital facsimile coding standards

R. Hunter, +1 more
TL;DR: The coding schemes in detail are described in detail and the factors which led to their choice are discussed, and the performance of the codes is assessed, particularly in relation to their compression efficiency and vulnerability to transmission errors.
Journal ArticleDOI

Combined symbol matching facsimile data compression system

TL;DR: A facsimile data compression system, called combined symbol matching (CSM), is presented that exceeds that obtained with the best run-length coding techniques by a factor of two or more and is comparable for graphics-predominate documents.
Journal ArticleDOI

A high‐speed rotation method for binary images based on coordinate operation of run data

TL;DR: This method can execute a high-speed rotation of the binary image based on coordinate data for the start and the end of the run.