Text localization, enhancement and binarization in multimedia documents

doi:10.1109/ICPR.2002.1048482

Proceedings ArticleDOI

Text localization, enhancement and binarization in multimedia documents

- Vol. 2, pp 1037-1040

TLDR

An algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text is presented and the quality of the localized text is improved by robust multiple frame integration.

Abstract:

The systems currently available for content based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text. The quality of the localized text is improved by robust multiple frame integration. Anew technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR are presented.

Text localization, enhancement and binarization in multimedia documents

Citations

Extraction of handwritten text from carbon copy medical form images

Text detection and character recognition using fuzzy image processing

2DVTE: A two-directional videotext extractor for rapid and elaborate design

Parallel nonparametric binarization for degraded document images

Text Localization, Extraction and Inpainting in Color Images using Combined Structural and Textural Features

References

A threshold selection method from gray level histograms

IEEE transactions on pattern analysis and machine intelligence

An introduction to digital image processing

Goal-directed evaluation of binarization methods

Automatic text location in images and video frames

Related Papers (5)

A threshold selection method from gray level histograms

Adaptive document image binarization

An introduction to digital image processing

Text information extraction in images and video: a survey

Detecting text in natural scenes with stroke width transform