Handwritten Chinese Text Recognition by Integrating Multiple Contexts

doi:10.1109/TPAMI.2011.264

Journal ArticleDOI

Handwritten Chinese Text Recognition by Integrating Multiple Contexts

Qiu-Feng Wang, +2 more

- 01 Aug 2012 -

IEEE Transactions on Pattern Analysis an...

- Vol. 34, Iss: 8, pp 1469-1481

Chats0

TLDR

The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly, and are superior by far to the best results reported in the literature.

Abstract:

This paper presents an effective approach for the offline recognition of unconstrained handwritten Chinese texts. Under the general integrated segmentation-and-recognition framework with character oversegmentation, we investigate three important issues: candidate path evaluation, path search, and parameter estimation. For path evaluation, we combine multiple contexts (character recognition scores, geometric and linguistic contexts) from the Bayesian decision view, and convert the classifier outputs to posterior probabilities via confidence transformation. In path search, we use a refined beam search algorithm to improve the search efficiency and, meanwhile, use a candidate character augmentation strategy to improve the recognition accuracy. The combining weights of the path evaluation function are optimized by supervised learning using a Maximum Character Accuracy criterion. We evaluated the recognition performance on a Chinese handwriting database CASIA-HWDB, which contains nearly four million character samples of 7,356 classes and 5,091 pages of unconstrained handwritten texts. The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly. On a test set of 1,015 handwritten pages, the proposed approach achieved character-level accurate rate of 90.75 percent and correct rate of 91.39 percent, which are superior by far to the best results reported in the literature.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

ICDAR 2013 Robust Reading Competition

Dimosthenis Karatzas, +9 more

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.

...read moreread less

Proceedings ArticleDOI

ICDAR 2013 Chinese Handwriting Recognition Competition

Fei Yin, +3 more

TL;DR: This paper describes the Chinese handwriting recognition competition held at the 12th International Conference on Document Analysis and Recognition (ICDAR 2013), and reports the best results (correct rates) for classification on extracted features, offline character recognition, and online/offline handwritten text recognition.

...read moreread less

Journal ArticleDOI

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

Xu-Yao Zhang, +4 more

- 01 Apr 2018 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Wang et al. as mentioned in this paper proposed a framework by using the recurrent neural network (RNN) as both a discriminative model for recognizing Chinese characters and a generator model for drawing (generating) Chinese characters.

...read moreread less

Journal ArticleDOI

Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark

Xu-Yao Zhang, +3 more

- 01 Jan 2017 -

Pattern Recognition

TL;DR: In this article, a new adaptation layer is proposed to reduce the mismatch between training and test data on a particular source layer, and the adaptation process can be efficiently and effectively implemented in an unsupervised manner.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

SRILM – An Extensible Language Modeling Toolkit

Andreas Stolcke

TL;DR: The functionality of the SRILM toolkit is summarized and its design and implementation is discussed, highlighting ease of rapid prototyping, reusability, and combinability of tools.

...read moreread less

Journal ArticleDOI

Two decades of statistical language modeling: where do we go from here?

Roni Rosenfeld

TL;DR: A Bayesian approach to integration of linguistic theories with data is argued for inStatistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies.

...read moreread less

Journal ArticleDOI

Minimum classification error rate methods for speech recognition

Biing-Hwang Juang, +2 more

- 01 May 1997 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: The issue of speech recognizer training from a broad perspective with root in the classical Bayes decision theory is discussed, and the superiority of the minimum classification error (MCE) method over the distribution estimation method is shown by providing the results of several key speech recognition experiments.

...read moreread less

Journal ArticleDOI

Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error

Erik McDermott, +4 more

- 01 Jan 2007 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: This article reports significant gains in recognition performance and model compactness as a result of discriminative training based on MCE training applied to HMMs, in the context of three challenging large-vocabulary speech recognition tasks.

...read moreread less

Journal ArticleDOI

Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition

Fumitaka Kimura, +3 more

- 01 Jan 1987 -

IEEE Transactions on Pattern Analysis an...

TL;DR: Two types of modified quadratic disriminant functions (MQDF1, MQDF2) which are less sensitive to the estimation error of the covariance matrices are proposed.

...read moreread less

Collapse

Related Papers (5)

CASIA Online and Offline Chinese Handwriting Databases

Cheng-Lin Liu, +3 more

A Novel Connectionist System for Unconstrained Handwriting Recognition

Alex Graves, +5 more

- 01 May 2009 -

IEEE Transactions on Pattern Analysis an...

Handwritten Chinese Text Recognition by Integrating Multiple Contexts

Citations

ICDAR 2015 competition on Robust Reading

ICDAR 2013 Robust Reading Competition

ICDAR 2013 Chinese Handwriting Recognition Competition

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark

References

SRILM – An Extensible Language Modeling Toolkit

Two decades of statistical language modeling: where do we go from here?

Minimum classification error rate methods for speech recognition

Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error

Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition

Related Papers (5)

CASIA Online and Offline Chinese Handwriting Databases

A Novel Connectionist System for Unconstrained Handwriting Recognition

Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks

Deep Residual Learning for Image Recognition

Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition