Script recognition in images with complex backgrounds

doi:10.1109/ISSPIT.2005.1577163

Proceedings Article

- pp 589-594

TLDR

This paper presents an approach for discriminating between Latin and Ideographic script using a k-nearest neighbour classifier. Initial experimental results on a set of images containing text of different scripts demonstrate the good performance of the proposed solution.

Abstract:

The extraction of textual information from images and videos is an important task for automatic content-based indexing and retrieval purposes. To extract text from images or videos coming from unknown international sources, it is necessary to know the script beforehand in order to employ suitable text segmentation and optical character recognition (OCR) methods. In this paper, we present an approach for discriminating between Latin and Ideographic script. The proposed approach proceeds as follows: first, the text present in an image is localized. Then, a set of low-level features is extracted from the localized text image. Finally, based on the extracted features, the decision about the type of the script is made using a k-nearest neighbour classifier. Initial experimental results for a set of images containing text of different scripts demonstrate the good performance of the proposed solution.
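
The last two steps of the pipeline described above (feature extraction, then a k-nearest neighbour decision) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the low-level features, so `extract_features` uses hypothetical edge-density features, and the training vectors, the labels and the choice of `k = 3` are assumptions made for the example. Text localization is assumed to have been done already, so the input is a cropped grayscale patch.

```python
import math
from collections import Counter


def extract_features(patch):
    """Hypothetical low-level features for a grayscale text patch.

    `patch` is a list of rows of 0-255 integers. Returns the normalized
    density of horizontal and vertical intensity changes. These features
    are an assumption for illustration, not the ones used in the paper.
    """
    height, width = len(patch), len(patch[0])
    horizontal = sum(
        abs(row[x + 1] - row[x]) for row in patch for x in range(width - 1)
    )
    vertical = sum(
        abs(patch[y + 1][x] - patch[y][x])
        for y in range(height - 1)
        for x in range(width)
    )
    norm = height * width * 255
    return (horizontal / norm, vertical / norm)


def knn_classify(query, training_set, k=3):
    """Return the majority label among the k training vectors nearest to query.

    `training_set` is a list of (feature_vector, label) pairs; distance is
    Euclidean.
    """
    nearest = sorted(training_set, key=lambda item: math.dist(query, item[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Synthetic training data (invented values, for illustration only).
TRAINING = [
    ((0.10, 0.12), "latin"),
    ((0.12, 0.10), "latin"),
    ((0.11, 0.11), "latin"),
    ((0.30, 0.32), "ideographic"),
    ((0.33, 0.29), "ideographic"),
    ((0.31, 0.31), "ideographic"),
]

predicted = knn_classify((0.29, 0.30), TRAINING, k=3)
```

With two classes, an odd `k` avoids tied votes; in practice `k` and the feature set would be chosen on held-out data.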

Citations

Journal Article

Script Recognition—A Review

Debashis Ghosh, +2 more

- 01 Dec 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An overview of the different script identification methodologies under each of the two broad categories (structure-based and visual-appearance-based techniques) is given.

Journal Article

Word level multi-script identification

Peeta Basa Pati, +1 more

- 01 Jul 2008 -

Pattern Recognition Letters

TL;DR: The combination of Gabor features with nearest neighbor or SVM classifier shows promising results; i.e., over 98% for bi-script and tri-script cases and above 89% for the eleven-script scenario.

Journal Article

Script identification in natural scene image and video frames using an attention based Convolutional-LSTM network

Ankan Kumar Bhunia, +5 more

- 01 Jan 2019 -

Pattern Recognition

TL;DR: A novel method that involves extraction of local and global features using CNN-LSTM framework and weighting them dynamically for script identification is proposed and achieves superior results in comparison to conventional methods.

Journal Article

Improving patch-based scene text script identification with ensembles of conjoined networks

Lluis Gomez, +2 more

- 01 Jul 2017 -

Pattern Recognition

TL;DR: In this paper, a patch-based classification method for script identification in the wild is presented. But this method does not address a key characteristic of scene text instances: their extremely variable aspect ratio.

Posted Content

Improving patch-based scene text script identification with ensembles of conjoined networks

Lluis Gomez, +2 more

- 24 Feb 2016 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A patch-based classification method for script identification in the wild, based on the use of ensembles of conjoined networks, is presented, along with a new public benchmark dataset for the evaluation of multi-lingual scene text end-to-end reading systems.

References

Journal Article

Wavelet filter evaluation for image compression

John Villasenor, +2 more

- 01 Aug 1995 -

IEEE Transactions on Image Processing

TL;DR: This work has evaluated all possible reasonably short (less than 36 taps in the synthesis/analysis pair) minimum-order biorthogonal wavelet filter banks and selected the filters best suited to image compression.

Journal Article

A comprehensive method for multilingual video text detection, localization, and extraction

Michael R. Lyu, +2 more

- 01 Feb 2005 -

IEEE Transactions on Circuits and System...

TL;DR: A comprehensive, efficient video text detection, localization, and extraction method, which emphasizes the multilingual capability over the whole processing, and is also robust to various background complexities and text appearances.

Journal Article

Rotation invariant texture features and their use in automatic script identification

Tieniu Tan

- 01 Jul 1998 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Rotation invariant texture features are computed based on an extension of the popular multi-channel Gabor filtering technique, and their effectiveness is tested with 300 randomly rotated samples of 15 Brodatz textures to solve a practical but hitherto mostly overlooked problem in document image processing.

Proceedings Article

Video OCR for digital news archive

Toshio Sato, +3 more

TL;DR: This paper applies an interpolation filter, multi-frame integration and a combination of four filters to solve the problems of character recognition for videos: low resolution characters and extremely complex backgrounds.

Journal Article

Determination of the script and language content of document images

A.L. Spitz

- 01 Mar 1997 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work has developed techniques for distinguishing which language is represented in an image of text using a technique based on character shape codes, a representation of Latin text that is inexpensive to compute.
