Content directed enhancement of degraded document images
TL;DR: This paper presents a novel framework that learns optimal parameters, depending on the nature of the document image content for binarization and text/graphics segmentation, using EM algorithm.
Abstract: Most of the document pre-processing techniques are parameter dependent. In this paper, we present a novel framework that learns optimal parameters, depending on the nature of the document image content for binarization and text/graphics segmentation. The learning problem has been formulated as an optimization problem using EM algorithm to adaptively learn optimal parameters. Experimental results have established the effectiveness of our approach.
...read more
Citations
12 citations
Cites methods from "Content directed enhancement of deg..."
...The gray scale image is first binarized using the method described in our earlier work [2]....
[...]
5 citations
5 citations
Cites methods from "Content directed enhancement of deg..."
...An EM based formulations for parameter optimization is presented in [8]....
[...]
3 citations
Cites background or methods from "Content directed enhancement of deg..."
...For a given gray-scale document image, it is binarized using the method described in [1]....
[...]
...The proposed framework is a modification of our earlier work [1]....
[...]
...This paper presents a modification of the earlier work on text graphic separation [1] that exploits the nature of the document image content for learning optimal parameters for binarization and effective text graphic separation....
[...]
...In contrast to our earlier work [1], where a fixed neighbourhood of size 350× 350 was used, we learn optimal neighbourhood size to improve the segmentation in newspaper images....
[...]
References
31,977 citations
Additional excerpts
...Global binarization techniques [11] are preferred in cases where there is a good separation between foreground and background....
[...]
1,902 citations
658 citations
"Content directed enhancement of deg..." refers methods in this paper
...The most commonly used approach for text/graphic separation in document images [4, 5] is based on connected component analysis....
[...]
548 citations
Additional excerpts
...However, in case of degradations like shadow, non-uniform illuminations, scratch, ink bleeds and other complex degradations, local binarization techniques [6, 10, 13, 18] have provided better results....
[...]
425 citations