scispace - formally typeset
Proceedings ArticleDOI

Text localization, enhancement and binarization in multimedia documents

TLDR
An algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text is presented and the quality of the localized text is improved by robust multiple frame integration.
Abstract
The systems currently available for content based image and video retrieval work without semantic knowledge, i.e. they use image processing methods to extract low level features of the data. The similarity obtained by these approaches does not always correspond to the similarity a human user would expect. A way to include more semantic knowledge into the indexing process is to use the text included in the images and video sequences. It is rich in information but easy to use, e.g. by key word based queries. In this paper we present an algorithm to localize artificial text in images and videos using a measure of accumulated gradients and morphological post processing to detect the text. The quality of the localized text is improved by robust multiple frame integration. Anew technique for the binarization of the text boxes is proposed. Finally, detection and OCR results for a commercial OCR are presented.

read more

Citations
More filters
Proceedings ArticleDOI

Multiscale Edge-Based Text Extraction from Complex Images

TL;DR: A multiscale edge-based text extraction algorithm, which can automatically detect and extract text in complex images, and is robust with respect to the font size, style, color, orientation, and alignment of text.
Journal ArticleDOI

Handwritten Optical Character Recognition (OCR): A Comprehensive Systematic Literature Review (SLR)

TL;DR: This review article serves the purpose of presenting state of the art results and techniques on OCR and also provide research directions by highlighting research gaps.
Proceedings ArticleDOI

ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016)

TL;DR: The contest details including the evaluation measures used as well as the performance of the 12 submitted methods are described along with a brief description of each method.
Journal ArticleDOI

A selectional auto-encoder approach for document image binarization

TL;DR: This paper discusses the use of convolutional auto-encoders devoted to learning an end-to-end map from an input image to its selectional output, in which activations indicate the likelihood of pixels to be either foreground or background.
Book ChapterDOI

Action classification in soccer videos with long short-term memory recurrent neural networks

TL;DR: Experimental results show that the proposed approach for action classification in soccer videos outperforms classification methods of related works, and that the combination of the two features (BoW and dominant motion) leads to a classification rate of 92%.
References
More filters

IEEE transactions on pattern analysis and machine intelligence

Ieee Xplore
TL;DR: This special issue aims at gathering the recent advances in learning with shared information methods and their applications in computer vision and multimedia analysis and addressing interesting real-world computer Vision and multimedia applications.
Journal ArticleDOI

Goal-directed evaluation of binarization methods

TL;DR: This paper presents a methodology for evaluation of low-level image analysis methods, using binarization (two-level thresholding) as an example, and defines the performance of the character recognition module as the objective measure.
Proceedings ArticleDOI

Automatic text location in images and video frames

TL;DR: Compared with some traditional text location methods, this method has the following advantages: 1) low computational cost; 2) robust to font size; and 3) high accuracy.