Proceedings Article

Script recognition in images with complex backgrounds

TLDR
This paper presents an approach for discriminating between Latin and Ideographic script using a k-nearest neighbour classifier. Initial experimental results on a set of images containing text in different scripts indicate that the proposed solution performs well.
Abstract
The extraction of textual information from images and videos is an important task for automatic content-based indexing and retrieval. To extract text from images or videos coming from unknown international sources, it is necessary to know the script beforehand in order to employ suitable text segmentation and optical character recognition (OCR) methods. In this paper, we present an approach for discriminating between Latin and Ideographic script. The proposed approach proceeds as follows: first, the text present in an image is localized. Then, a set of low-level features is extracted from the localized text image. Finally, based on the extracted features, the decision about the type of the script is made using a k-nearest neighbour classifier. Initial experimental results for a set of images containing text of different scripts demonstrate the good performance of the proposed solution.
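The final step of the pipeline described above, the k-nearest neighbour decision, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the low-level features, the distance measure, or the value of k, so the 2-D feature vectors, the Euclidean distance, `k=3`, and the names `knn_classify` and `TRAINING_SET` are all assumptions made purely for illustration.

```python
import math
from collections import Counter


def knn_classify(query, training_set, k=3):
    """Return the majority label among the k training vectors nearest to query.

    Euclidean distance and k=3 are assumptions; the abstract names neither.
    """
    # Sort the labelled vectors by distance to the query and keep the k closest.
    nearest = sorted(training_set, key=lambda item: math.dist(query, item[0]))[:k]
    # Majority vote over the labels of those k neighbours.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical 2-D feature vectors with placeholder values (not from the paper).
TRAINING_SET = [
    ((0.20, 0.30), "latin"),
    ((0.25, 0.35), "latin"),
    ((0.30, 0.25), "latin"),
    ((0.80, 0.70), "ideographic"),
    ((0.75, 0.80), "ideographic"),
    ((0.85, 0.75), "ideographic"),
]

print(knn_classify((0.22, 0.31), TRAINING_SET))
print(knn_classify((0.79, 0.77), TRAINING_SET))
```

In a real system, the query vector would come from the feature extraction step applied to a localized text region, and an odd k avoids tied votes in the two-class case.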


Citations
Journal Article

Script Recognition—A Review

TL;DR: An overview of the different script identification methodologies is given under each of two broad categories: structure-based and visual-appearance-based techniques.
Journal Article

Word level multi-script identification

TL;DR: The combination of Gabor features with nearest neighbor or SVM classifier shows promising results; i.e., over 98% for bi-script and tri-script cases and above 89% for the eleven-script scenario.
Journal Article

Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network

TL;DR: A novel method for script identification is proposed that extracts local and global features using a CNN-LSTM framework and weights them dynamically; it achieves superior results in comparison to conventional methods.
Journal Article

Improving patch-based scene text script identification with ensembles of conjoined networks

TL;DR: In this paper, a patch-based classification method for script identification in the wild is presented. But this method does not address a key characteristic of scene text instances: their extremely variable aspect ratio.
Posted Content

Improving patch-based scene text script identification with ensembles of conjoined networks

TL;DR: A patch-based classification method for script identification in the wild, based on the use of ensembles of conjoined networks, is presented together with a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems.