Proceedings Article

Text localization, enhancement and binarization in multimedia documents

TLDR
An algorithm is presented that localizes artificial text in images and videos using a measure of accumulated gradients and morphological post-processing; the quality of the localized text is improved by robust multiple-frame integration.
Abstract
The systems currently available for content-based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low-level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge in the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by keyword-based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post-processing to detect the text. The quality of the localized text is improved by robust multiple-frame integration. A new technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR system are presented.
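
The localization idea summarized in the abstract (accumulated gradients followed by morphological post-processing) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a grayscale frame given as a list of rows of integers, uses a simple forward horizontal difference as the gradient, and applies a horizontal morphological closing. All function names, the window size, the threshold, and the structuring-element size are hypothetical choices made for illustration.

```python
def horizontal_gradient(image):
    """Absolute forward difference along each row (last column is set to 0)."""
    return [
        [abs(row[x + 1] - row[x]) for x in range(len(row) - 1)] + [0]
        for row in image
    ]


def accumulate(grad, half_window):
    """Sum gradient magnitudes over a horizontal window centred on each pixel."""
    acc = []
    for row in grad:
        width = len(row)
        acc.append([
            sum(row[max(0, x - half_window):min(width, x + half_window + 1)])
            for x in range(width)
        ])
    return acc


def _sliding(mask, half, reduce_fn):
    """Apply reduce_fn (max or min) over a horizontal window clipped to the row."""
    out = []
    for row in mask:
        width = len(row)
        out.append([
            reduce_fn(row[max(0, x - half):min(width, x + half + 1)])
            for x in range(width)
        ])
    return out


def close_horizontal(mask, half):
    """Morphological closing: dilation (max) followed by erosion (min)."""
    dilated = _sliding(mask, half, max)
    return _sliding(dilated, half, min)


def localize_text_mask(image, half_window=2, threshold=500, close_half=1):
    """Return a binary mask marking pixels with high accumulated gradient.

    The default parameter values are illustrative assumptions only.
    """
    acc = accumulate(horizontal_gradient(image), half_window)
    mask = [[1 if v >= threshold else 0 for v in row] for row in acc]
    return close_horizontal(mask, close_half)
```

Text regions tend to contain many strong, closely spaced horizontal intensity transitions, so accumulating gradient magnitudes over a window responds strongly there, while the closing step bridges small gaps between neighbouring responses. A practical system would additionally need geometric filtering of the resulting regions, which this sketch omits.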


Citations
Proceedings Article

A Method of Synthesizing Handwritten Chinese Images for Data Augmentation

TL;DR: This work proposes a novel strategy, in the particular case of Chinese characters, to generate synthetic lines of text, given samples of the isolated characters, using the well-known CASIA database to train MDLSTM-RNN models and also in the creation of synthetic line images.
Book Chapter

Improving computer vision-based indoor wayfinding for blind persons with context information

TL;DR: This paper presents an effective and robust method of text extraction and recognition to improve computer vision-based indoor wayfinding and identifies text characters in the extracted regions by using the features of size, aspect ratio and nested edge boundaries.
Journal Article

Binarization and cleanup of handwritten text from carbon copy medical form images

TL;DR: This paper presents a methodology for separating handwritten foreground pixels from background pixels in carbon-copied medical forms, a vital step in automating emergency medical health surveillance systems.

Automatic detection and extraction of artificial text in video

TL;DR: An algorithm for detection and localisation of artificial text in video using a horizontal difference magnitude measure and morphological processing is presented, along with results for a 20-minute MPEG-1 encoded television programme.
Proceedings Article

Detection and extraction of the text in a video sequence

TL;DR: This article aims to build a system that extracts the text embedded in video, relying on a set of hypotheses and taking advantage of previous research in this field.