scispace - formally typeset
Journal ArticleDOI

Robust Document Image Binarization Technique for Degraded Document Images

Reads0
Chats0
TLDR
A novel document image binarization technique that addresses issues ofSegmentation of text from badly degraded document images by using adaptive image contrast, a combination of the local image contrast and theLocal image gradient that is tolerant to text and background variation caused by different types of document degradations.
Abstract
Segmentation of text from badly degraded document images is a very challenging task due to the high inter/intra-variation between the document background and the foreground text of different document images. In this paper, we propose a novel document image binarization technique that addresses these issues by using adaptive image contrast. The adaptive image contrast is a combination of the local image contrast and the local image gradient that is tolerant to text and background variation caused by different types of document degradations. In the proposed technique, an adaptive contrast map is first constructed for an input degraded document image. The contrast map is then binarized and combined with Canny's edge map to identify the text stroke edge pixels. The document text is further segmented by a local threshold that is estimated based on the intensities of detected text stroke edge pixels within a local window. The proposed method is simple, robust, and involves minimum parameter tuning. It has been tested on three public datasets that are used in the recent document image binarization contest (DIBCO) 2009 & 2011 and handwritten-DIBCO 2010 and achieves accuracies of 93.5%, 87.8%, and 92.03%, respectively, that are significantly higher than or close to that of the best-performing methods reported in the three contests. Experiments on the Bickley diary dataset that consists of several challenging bad quality document images also show the superior performance of our proposed method, compared with other techniques.

read more

Citations
More filters
Proceedings ArticleDOI

ICDAR 2013 Document Image Binarization Contest (DIBCO 2013)

TL;DR: The contest details including the evaluation measures used as well as the performance of the 23 submitted methods are described along with a short description of each method.
Proceedings ArticleDOI

ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012)

TL;DR: This paper reports on the contest details including the evaluation measures used as well as the performance of the 24 submitted methods along with a short description of each method.
Journal ArticleDOI

A selectional auto-encoder approach for document image binarization

TL;DR: This paper discusses the use of convolutional auto-encoders devoted to learning an end-to-end map from an input image to its selectional output, in which activations indicate the likelihood of pixels to be either foreground or background.
Journal ArticleDOI

DeepOtsu: Document enhancement and binarization using iterative deep learning

TL;DR: The proposed method provides a new, clean version of the degraded image, one that is suitable for visualization and which shows promising results for binarization using Otsu’s global threshold, based on enhanced images learned iteratively by the neural network.
Journal ArticleDOI

Binarization of degraded document images based on hierarchical deep supervised network

TL;DR: A novel supervised-binarization method is proposed, in which a hierarchical deep supervised network (DSN) architecture is learned for the prediction of the text pixels at different feature levels, which achieves state-of-the-art results over widely used DIBCO datasets.
References
More filters
Journal ArticleDOI

A Computational Approach to Edge Detection

TL;DR: There is a natural uncertainty principle between detection and localization performance, which are the two main goals, and with this principle a single operator shape is derived which is optimal at any scale.
Journal ArticleDOI

Survey over image thresholding techniques and quantitative performance evaluation

TL;DR: 40 selected thresholding methods from various categories are compared in the context of nondestructive testing applications as well as for document images, and the thresholding algorithms that perform uniformly better over nonde- structive testing and document image applications are identified.
Journal ArticleDOI

Feature Detection with Automatic Scale Selection

TL;DR: It is shown how the proposed methodology applies to the problems of blob detection, junction detection, edge detection, ridge detection and local frequency estimation and how it can be used as a major mechanism in algorithms for automatic scale selection, which adapt the local scales of processing to the local image structure.
Journal ArticleDOI

Adaptive document image binarization

TL;DR: A new method is presented for adaptive document image binarization, where the page is considered as a collection of subcomponents such as text, background and picture, which adapts and performs well in each case qualitatively and quantitatively.