The document spectrum for page layout analysis

Open AccessBook

The document spectrum for page layout analysis

Lawrence O'Gorman

- pp 214-225

Chats0

TLDR

The document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components, yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks.

Abstract:

Page layout analysis is a document processing technique used to determine the format of a page. This paper describes the document spectrum (or docstrum), which is a method for structural page layout analysis based on bottom-up, nearest-neighbor clustering of page components. The method yields an accurate measure of skew, within-line, and between-line spacings and locates text lines and text blocks. It is advantageous over many other methods in three main ways: independence from skew angle, independence from different text spacings, and the ability to process local regions of different text orientations within the same image. Results of the method shown for several different page formats and for randomly oriented subpages on the same image illustrate the versatility of the method. We also discuss the differences, advantages, and disadvantages of the docstrum with respect to other lay-out methods. >

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Twenty years of document image analysis in PAMI

George Nagy

- 01 Jan 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: The contributions to document image analysis of 99 papers published in the IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) are clustered, summarized, interpolated, interpreted, and evaluated.

...read moreread less

Journal ArticleDOI

An overview of character recognition focused on off-line handwriting

Nafiz Arica, +1 more

TL;DR: The historical evolution of CR systems is presented, the available CR techniques, with their superiorities and weaknesses, are reviewed and directions for future research are suggested.

...read moreread less

Proceedings ArticleDOI

Electronic marking and identification techniques to discourage document copying

Jack Brassil, +3 more

TL;DR: Three coding methods are proposed that discourage illicit distribution by embedding each document with a unique codeword, yet enable one to identify the sanctioned recipient of a document by examination of a recovered document.

...read moreread less

Journal ArticleDOI

Textfinder: an automatic system to detect and recognize text in images

V. Wu, +2 more

- 01 Jun 1999 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A robust system is proposed to automatically detect and extract text in images from different sources, including video, newspapers, advertisements, stock certificates, photographs, and checks.

...read moreread less

Journal ArticleDOI

Text line segmentation of historical documents: a survey

Laurence Likforman-Sulem, +2 more

- 04 Apr 2007 -

International Journal on Document Analys...

TL;DR: The objective of this paper is to present a survey of existing methods, developed during the last decade and dedicated to documents of historical interest.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Document analysis system

Kwan Y. Wong, +2 more

- 01 Nov 1982 -

Ibm Journal of Research and Development

TL;DR: The requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing, are outlined and several critical functions have been investigated and the technical approaches are discussed.

...read moreread less

Journal ArticleDOI

A prototype document image analysis system for technical journals

George Nagy, +2 more

- 01 Jul 1992 -

IEEE Computer

TL;DR: The document image acquisition process and the knowledge base that must be entered into the system to process a family of page images are described, and the process by which the X-Y tree data structure converts a 2-D page-segmentation problem into a series of 1-D string-parsing problems that can be tackled using conventional compiler tools.

...read moreread less

Hierarchical representation of optically scanned documents

George Nagy, +1 more

Proceedings ArticleDOI

A document skew detection method using run-length encoding and the Hough transform

S.C. Hinds, +2 more

TL;DR: By creating a burst image from the original document image, the processing time of the Hough transform can be reduced by a factor of as much as 7.4 for documents with gray-scale images and interline spacing can be determined more accurately.

...read moreread less

Journal ArticleDOI

Automated entry system for printed documents

T. Akiyama, +1 more

- 01 Oct 1990 -

Pattern Recognition

TL;DR: Recognition experiments with a prototype system for a variety of complex printed documents shows that the proposed system is capable of reading different types of printed documents at an accuracy rate of 94.8–97.2%.

...read moreread less

The document spectrum for page layout analysis

Citations

Twenty years of document image analysis in PAMI

An overview of character recognition focused on off-line handwriting

Electronic marking and identification techniques to discourage document copying

Textfinder: an automatic system to detect and recognize text in images

Text line segmentation of historical documents: a survey

References

Document analysis system

A prototype document image analysis system for technical journals

Hierarchical representation of optically scanned documents

A document skew detection method using run-length encoding and the Hough transform

Automated entry system for printed documents

Related Papers (5)

Page segmentation and classification utilising a bottom-up approach

Parameter-free geometric document layout analysis

A multiresolution approach for page segmentation

Page segmentation without rectangle assumption

Document layout structure extraction using bounding boxes of different entitles