scispace - formally typeset
Proceedings ArticleDOI

Content directed enhancement of degraded document images

Reads0
Chats0
TLDR
This paper presents a novel framework that learns optimal parameters, depending on the nature of the document image content for binarization and text/graphics segmentation, using EM algorithm.
Abstract
Most of the document pre-processing techniques are parameter dependent. In this paper, we present a novel framework that learns optimal parameters, depending on the nature of the document image content for binarization and text/graphics segmentation. The learning problem has been formulated as an optimization problem using EM algorithm to adaptively learn optimal parameters. Experimental results have established the effectiveness of our approach.

read more

Citations
More filters
Proceedings ArticleDOI

Newspaper Article Extraction Using Hierarchical Fixed Point Model

TL;DR: A novel learning based framework to extract articles from newspaper images using a Fixed-Point Model that uses contextual information and features of each block to learn the layout of newspaper images and attains a contraction mapping to assign a unique label to every block.
Proceedings ArticleDOI

A novel local enhancement technique for rebuilding Broken characters in a degraded Kannada script

TL;DR: A novel method to rebuild the broken characters are thinned and the endpoints of the lines are obtained and the line segments are effectively rebuilt so as to preserve the degraded character.
Proceedings ArticleDOI

Automatic Selection of Parameters for Document Image Enhancement Using Image Quality Assessment

TL;DR: A novel framework for automatic selection of optimal parameters for pre-processing algorithm by estimating the quality of the document image and compute parameters to maximize the expected recognition accuracy found in E-step.
Proceedings ArticleDOI

Text graphic separation in Indian newspapers

TL;DR: A novel framework for learning optimal parameters for text graphic separation in the presence of complex layouts of Indian newspaper is proposed.
References
More filters
Journal ArticleDOI

Adaptive document image binarization

TL;DR: A new method is presented for adaptive document image binarization, where the page is considered as a collection of subcomponents such as text, background and picture, which adapts and performs well in each case qualitatively and quantitatively.
Journal ArticleDOI

A robust algorithm for text string separation from mixed text/graphics images

TL;DR: The development and implementation of an algorithm for automated text string separation that is relatively independent of changes in text font style and size and of string orientation are described and showed superior performance compared to other techniques.
Journal ArticleDOI

Adaptive degraded document image binarization

TL;DR: The proposed method does not require any parameter tuning by the user and can deal with degradations which occur due to shadows, non-uniform illumination, low contrast, large signal-dependent noise, smear and strain.
Journal ArticleDOI

Block segmentation and text extraction in mixed text/image documents

TL;DR: It is shown that a constrained run length algorithm is well suited to partition most documents into areas of text lines, solid black lines, and rectangular ☐es enclosing graphics and halftone images.
Related Papers (5)