scispace - formally typeset
Proceedings ArticleDOI

Text localization, enhancement and binarization in multimedia documents

TLDR
An algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text is presented and the quality of the localized text is improved by robust multiple frame integration.
Abstract
The systems currently available for content based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text. The quality of the localized text is improved by robust multiple frame integration. Anew technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR are presented.

read more

Citations
More filters
Journal Article

An Edge Based Text Segmentation From Complex Images

TL;DR: An edge-based text segmentation algorithm is proposed, which is robust with respect to font sizes, styles, color/intensity, orientations and alignment of text, and can be used in a large variety of application fields, such as vehicle license detection and recognition, document retrieving and page segmentation etc.

A survey on comparision and performance analysis of text extraction techniques

TL;DR: A large number of techniques to address the problem of text information extraction are classified and reviewed, benchmark data and performance evaluation are discussed, and promising directions for future research are pointed out.
Proceedings ArticleDOI

Prewett Edge Detector Method for Content Extraction in Moving Pictures or Images

TL;DR: This work presents calculation for removing content in video, pictures using Prewett Edge Detection in terms of precision rate and recall rate and proposes a substitute method to discover and compute the contents that is uncommonly planned for being associated with shading pictures with complex foundation.

Video Text Segmentation Using Particle Filters

TL;DR: A probabilistic algorithm for segmenting and recognizing text embedded in video sequences based on adaptive thresholding using a Bayes filtering method that allows us to evaluate an text image segmentor on the basis of recognition result instead of visual segmentation result, which is directly relevant to the authors' character recognition task.

Contrast Enhanced Niblack Binarization of Document Images

TL;DR: A method of document image binarization that performs well on grayscale images with complex backgrounds, maintains good text extraction abilities and retains the graphic features that might be present in the image.
References
More filters

IEEE transactions on pattern analysis and machine intelligence

Ieee Xplore
TL;DR: This special issue aims at gathering the recent advances in learning with shared information methods and their applications in computer vision and multimedia analysis and addressing interesting real-world computer Vision and multimedia applications.
Journal ArticleDOI

Goal-directed evaluation of binarization methods

TL;DR: This paper presents a methodology for evaluation of low-level image analysis methods, using binarization (two-level thresholding) as an example, and defines the performance of the character recognition module as the objective measure.
Proceedings ArticleDOI

Automatic text location in images and video frames

TL;DR: Compared with some traditional text location methods, this method has the following advantages: 1) low computational cost; 2) robust to font size; and 3) high accuracy.