scispace - formally typeset
Open Access

Large vocabulary off-line handwritten word recognition

TLDR
Novel search strategies and a novel verification approach are introduced that allow us to achieve a 120 speedup and 10% accuracy improvement over a state-of-art baseline recognition system for a very-large vocabulary recognition task (80,000 words).
Abstract
Considerable progress has been made in handwriting recognition technology over the last few years. Thus far, handwriting recognition systems have been limited to small-scale and very constrained applications where the number of different words that a system can recognize is the key point for its performance. The capability of dealing with large vocabularies, however, opens up many more applications. In order to translate the gains made by research into large and very-large vocabulary handwriting recognition, it is necessary to further improve the computational efficiency and the accuracy of the current recognition strategies and algorithms. In this thesis we focus on efficient and accurate large vocabulary handwriting recognition. The main challenge is to speedup the recognition process and to improve the recognition accuracy. However, these two aspects are in mutual conflict. It is relatively easy to improve recognition speed while trading away some accuracy. But it is much harder to improve the recognition speed while preserving the accuracy. First, several strategies have been investigated for improving the performance of a baseline recognition system in terms of recognition speed to deal with large and very-large vocabularies. Next, we improve the performance in terms of recognition accuracy while preserving all the original characteristics of the baseline recognition system: omniwriter, unconstrained handwriting, and dynamic lexicons. The main contributions of this thesis are novel search strategies and a novel verification approach that allow us to achieve a 120 speedup and 10% accuracy improvement over a state-of-art baseline recognition system for a very-large vocabulary recognition task (80,000 words). The improvements in speed are obtained by the following techniques: lexical tree search, standard and constrained lexicon-driven level building algorithms, fast two-level decoding algorithm, and a distributed recognition scheme. The recognition accuracy is improved by post-processing the list of the candidate N-best-scoring word hypotheses generated by the baseline recognition system. The list also contains the segmentation of such word hypotheses into characters. A verification module based on a neural network classifier is used to generate a score for each segmented character and in the end, the scores from the baseline recognition system and the verification module are combined to optimize performance. A rejection mechanism is introduced over the combination of the baseline recognition system with the verification module to improve significantly the word recognition rate to about 95% while rejecting 30% of the word hypotheses.

read more

Citations
More filters
Journal Article

Lexicon directed algorithm for recognition of unconstrained handwritten words

TL;DR: Improvements made to a lexicon directed algorithm for recognition of unconstrained handwritten words (cursive, discrete, or mixed) such as those encountered in mail pieces, in order to achieve higher recognition accuracy and speed.
Proceedings ArticleDOI

Combining Multiple Feature Extraction Techniques for Handwritten Devnagari Character Recognition

TL;DR: In this paper, the authors used four feature extraction techniques namely, intersection, shadow feature, chain code histogram, and straight line fitting features for handwritten Devnagari characters recognition using weighted majority voting technique.
Journal ArticleDOI

Recognition and verification of unconstrained handwritten words

TL;DR: A novel approach for the verification of the word hypotheses generated by a large vocabulary, offline handwritten word recognition system that has improved the word recognition rate as well as the reliability of the recognition system, while not causing significant delays in the recognition process.
Journal ArticleDOI

Fine Classification & Recognition of Hand Written Devnagari Characters with Regular Expressions & Minimum Edit Distance Method

TL;DR: A use of regular expressions in character recognition problem scenarios in sequence analysis that are ideally suited for the application of regular expression algorithms.
Journal ArticleDOI

Off-line cursive script recognition: current advances, comparisons and remaining problems

TL;DR: This paper presents detailed review in the field of off-line cursive script recognition, where various methods are analyzed that have been proposed to realize the core of script recognition in a word recognition system.
References
More filters
Book

Neural Networks: A Comprehensive Foundation

Simon Haykin
TL;DR: Thorough, well-organized, and completely up to date, this book examines all the important aspects of this emerging technology, including the learning process, back-propagation learning, radial-basis function networks, self-organizing systems, modular networks, temporal processing and neurodynamics, and VLSI implementation of neural networks.
Book ChapterDOI

Neural Networks for Pattern Recognition

TL;DR: The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.
Book

Fundamentals of speech recognition

TL;DR: This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech.
Journal ArticleDOI

Statistical pattern recognition: a review

TL;DR: The objective of this review paper is to summarize and compare some of the well-known methods used in various stages of a pattern recognition system and identify research topics and applications which are at the forefront of this exciting and challenging field.
Journal ArticleDOI

The viterbi algorithm

TL;DR: This paper gives a tutorial exposition of the Viterbi algorithm and of how it is implemented and analyzed, and increasing use of the algorithm in a widening variety of areas is foreseen.
Related Papers (5)