scispace - formally typeset
Journal ArticleDOI

Handwritten Chinese Text Recognition by Integrating Multiple Contexts

Qiu-Feng Wang, +2 more
- 01 Aug 2012 - 
- Vol. 34, Iss: 8, pp 1469-1481
Reads0
Chats0
TLDR
The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly, and are superior by far to the best results reported in the literature.
Abstract
This paper presents an effective approach for the offline recognition of unconstrained handwritten Chinese texts. Under the general integrated segmentation-and-recognition framework with character oversegmentation, we investigate three important issues: candidate path evaluation, path search, and parameter estimation. For path evaluation, we combine multiple contexts (character recognition scores, geometric and linguistic contexts) from the Bayesian decision view, and convert the classifier outputs to posterior probabilities via confidence transformation. In path search, we use a refined beam search algorithm to improve the search efficiency and, meanwhile, use a candidate character augmentation strategy to improve the recognition accuracy. The combining weights of the path evaluation function are optimized by supervised learning using a Maximum Character Accuracy criterion. We evaluated the recognition performance on a Chinese handwriting database CASIA-HWDB, which contains nearly four million character samples of 7,356 classes and 5,091 pages of unconstrained handwritten texts. The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly. On a test set of 1,015 handwritten pages, the proposed approach achieved character-level accurate rate of 90.75 percent and correct rate of 91.39 percent, which are superior by far to the best results reported in the literature.

read more

Citations
More filters
Proceedings ArticleDOI

ICDAR 2013 Robust Reading Competition

TL;DR: The datasets and ground truth specification are described, the performance evaluation protocols used are details, and the final results are presented along with a brief summary of the participating methods.
Proceedings ArticleDOI

ICDAR 2013 Chinese Handwriting Recognition Competition

TL;DR: This paper describes the Chinese handwriting recognition competition held at the 12th International Conference on Document Analysis and Recognition (ICDAR 2013), and reports the best results (correct rates) for classification on extracted features, offline character recognition, and online/offline handwritten text recognition.
Journal ArticleDOI

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

TL;DR: Wang et al. as mentioned in this paper proposed a framework by using the recurrent neural network (RNN) as both a discriminative model for recognizing Chinese characters and a generator model for drawing (generating) Chinese characters.
Journal ArticleDOI

Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark

TL;DR: In this article, a new adaptation layer is proposed to reduce the mismatch between training and test data on a particular source layer, and the adaptation process can be efficiently and effectively implemented in an unsupervised manner.
References
More filters
Proceedings Article

SRILM – An Extensible Language Modeling Toolkit

TL;DR: The functionality of the SRILM toolkit is summarized and its design and implementation is discussed, highlighting ease of rapid prototyping, reusability, and combinability of tools.
Journal ArticleDOI

Two decades of statistical language modeling: where do we go from here?

TL;DR: A Bayesian approach to integration of linguistic theories with data is argued for inStatistical language models estimate the distribution of various natural language phenomena for the purpose of speech recognition and other language technologies.
Journal ArticleDOI

Minimum classification error rate methods for speech recognition

TL;DR: The issue of speech recognizer training from a broad perspective with root in the classical Bayes decision theory is discussed, and the superiority of the minimum classification error (MCE) method over the distribution estimation method is shown by providing the results of several key speech recognition experiments.
Journal ArticleDOI

Discriminative Training for Large-Vocabulary Speech Recognition Using Minimum Classification Error

TL;DR: This article reports significant gains in recognition performance and model compactness as a result of discriminative training based on MCE training applied to HMMs, in the context of three challenging large-vocabulary speech recognition tasks.
Journal ArticleDOI

Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition

TL;DR: Two types of modified quadratic disriminant functions (MQDF1, MQDF2) which are less sensitive to the estimation error of the covariance matrices are proposed.
Related Papers (5)