Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals

doi:10.1109/TPAMI.2008.88

Journal Article

Ujjwal Bhattacharya, +1 more

01 Mar 2009

IEEE Transactions on Pattern Analysis an...

Vol. 31, Iss. 3, pp. 444-457

TLDR

Pioneering development of two databases for handwritten numerals of the two most popular Indian scripts; a multistage cascaded recognition scheme using wavelet-based multiresolution representations and multilayer perceptron classifiers; and application of that scheme to the recognition of mixed handwritten numerals of three Indian scripts: Devanagari, Bangla and English.

Abstract:

This article primarily concerns the problem of isolated handwritten numeral recognition for major Indian scripts. The principal contributions presented here are (a) pioneering development of two databases for handwritten numerals of the two most popular Indian scripts, (b) a multistage cascaded recognition scheme using wavelet-based multiresolution representations and multilayer perceptron classifiers, and (c) application of (b) to the recognition of mixed handwritten numerals of three Indian scripts: Devanagari, Bangla and English. The databases contain 22,556 and 23,392 isolated handwritten numeral samples of Devanagari and Bangla, respectively, collected from real-life situations, and they can be made available free of cost to researchers at other academic institutions. In the proposed scheme, a numeral is passed in a cascaded manner through three multilayer perceptron classifiers corresponding to three coarse-to-fine resolution levels. If the numeral is rejected even at the highest resolution, a further multilayer perceptron makes a final attempt to recognize it by combining the outputs of the three classifiers of the previous stages. The scheme has been extended to the situation in which the script of a document is not known a priori or the numerals written on a document belong to different scripts. Handwritten numerals in mixed scripts are frequently found in Indian postal mail and table-form documents.
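The control flow of the cascade described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the wavelet, the rejection criterion, or the network sizes, so the sketch assumes a Haar-style 2x2 averaging step for the multiresolution representation, a simple confidence threshold for rejection, and plain callables standing in for trained multilayer perceptrons. The names `haar_approximation`, `pyramid`, `accept`, and `cascade` are hypothetical.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Image = List[List[float]]
Scores = List[float]
Classifier = Callable[[Image], Scores]      # stand-in for a trained MLP
Combiner = Callable[[Scores], Scores]       # stand-in for the final MLP


def haar_approximation(image: Image) -> Image:
    """Halve the resolution by averaging 2x2 blocks.

    Assumption: this is the low-pass band of one Haar wavelet step up to a
    constant factor; the abstract does not say which wavelet is used.
    """
    rows, cols = len(image) // 2, len(image[0]) // 2
    return [
        [
            (image[2 * r][2 * c] + image[2 * r][2 * c + 1]
             + image[2 * r + 1][2 * c] + image[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def pyramid(image: Image, levels: int = 3) -> List[Image]:
    """Return `levels` smoothed representations, ordered coarse to fine."""
    reps = []
    current = image
    for _ in range(levels):
        current = haar_approximation(current)
        reps.append(current)
    return reps[::-1]


def accept(scores: Sequence[float], threshold: float) -> Optional[int]:
    """Return the winning class, or None to reject (assumed criterion)."""
    best = max(range(len(scores)), key=scores.__getitem__)
    return best if scores[best] >= threshold else None


def cascade(image: Image, stages: Sequence[Classifier], combiner: Combiner,
            threshold: float = 0.9) -> Tuple[int, int]:
    """Try each resolution level in turn; fall back to the combiner.

    Returns (predicted class, number of the stage that decided).
    """
    collected: Scores = []
    reps = pyramid(image, len(stages))
    for stage, (classify, rep) in enumerate(zip(stages, reps), start=1):
        scores = classify(rep)
        collected.extend(scores)
        label = accept(scores, threshold)
        if label is not None:
            return label, stage
    # Every resolution level rejected: the final stage combines all outputs.
    # Assumption: the final stage always answers and never rejects.
    final = combiner(collected)
    return max(range(len(final)), key=final.__getitem__), len(stages) + 1
```

In this sketch a confident classifier at a coarse level ends the cascade early, so the finer (and larger) representations are only evaluated for inputs that the coarser stages reject.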
