scispace - formally typeset
Proceedings ArticleDOI

Text localization, enhancement and binarization in multimedia documents

TLDR
An algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text is presented and the quality of the localized text is improved by robust multiple frame integration.
Abstract
The systems currently available for content based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text. The quality of the localized text is improved by robust multiple frame integration. Anew technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR are presented.

read more

Citations
More filters
Journal ArticleDOI

The deviation of a set of strings

TL;DR: It is shown how the set deviation can be efficiently used in well-known statistical algorithms to improve the computation of the set median of a set of strings, illustrating this concept with several examples, particularly in post-processing of texts extracted from video sequences.
Proceedings ArticleDOI

Gabor filters for degraded document image binarization

TL;DR: Experimental results conducted on DIBCO Datasets show that the proposed method is more appropriate for poor contrasted documents and ink-bleed through degradations.
Book ChapterDOI

A new approach for vehicle detection in congested traffic scenes based on strong shadow segmentation

TL;DR: To demonstrate robustness and accuracy of the proposed approach, impressive results of the method in real traffic images including high congestion, noise, clutter, snow, and rain containing cast shadows, bad illumination conditions and occlusions, taken from both outdoor highways and city roads are presented.
Proceedings ArticleDOI

Adaptative Smart-Binarization Method: For Images of Business Documents

TL;DR: This paper proposes in this paper a smart-binarization method of the images of business documents that offers a better reading of characters by the OCR and remains constant with the variation of the local window size through the use of integral images.
Proceedings ArticleDOI

Binarization of Textual Content in Video Frames

TL;DR: A binarization technique for textual content in video frames which can be applied in the resulting image of the text detection step aiming in an improved OCR performance is presented.
References
More filters

IEEE transactions on pattern analysis and machine intelligence

Ieee Xplore
TL;DR: This special issue aims at gathering the recent advances in learning with shared information methods and their applications in computer vision and multimedia analysis and addressing interesting real-world computer Vision and multimedia applications.
Journal ArticleDOI

Goal-directed evaluation of binarization methods

TL;DR: This paper presents a methodology for evaluation of low-level image analysis methods, using binarization (two-level thresholding) as an example, and defines the performance of the character recognition module as the objective measure.
Proceedings ArticleDOI

Automatic text location in images and video frames

TL;DR: Compared with some traditional text location methods, this method has the following advantages: 1) low computational cost; 2) robust to font size; and 3) high accuracy.