Open Access
Handwriting Recognition of Whiteboard Notes
Marcus Liwicki,Horst Bunke +1 more
TLDR
A new system for processing on-line whiteboard notes and using an off-line HMM-recognizer, which has been developed in the context of previous work, to achieve a statistically significant increase of the recognition rate.Abstract:
This paper introduces a new system for processing on-line whiteboard notes. Notes written on a whiteboard is a new modality in handwriting recognition research that has received relatively little attention in the past. For the recognition we use an off-line HMM-recognizer, which has been developed in the context of our previous work. The recognizer is supplemented with methods for processing the on-line data and generating the images. The system consists of six main modules: on-line preprocessing, transformation to off-line data, off-line preprocessing, feature extraction, classification and post-processing. The recognition rate of the basic recognizer in a writer independent experiment is 59,5%. By applying state-of-the-art methods, such as optimizing the number of states and Gaussian components, and by including a language model we could achieve a statistically significant increase of the recognition rate to 64.3%.read more
Citations
More filters
Book
Supervised Sequence Labelling with Recurrent Neural Networks
TL;DR: A new type of output layer that allows recurrent networks to be trained directly for sequence labelling tasks where the alignment between the inputs and the labels is unknown, and an extension of the long short-term memory network architecture to multidimensional data, such as images and video sequences.
Book ChapterDOI
The AMI meeting corpus: a pre-announcement
Jean Carletta,Simone Ashby,Sebastien Bourban,Michael J. Flynn,Maël Guillemot,Thomas Hain,Jaroslav Kadlec,Vasilis Karaiskos,Wessel Kraaij,Melissa Kronenthal,Guillaume Lathoud,Mike Lincoln,Agnes Lisowska,Iain McCowan,Wilfried Post,Dennis Reidsma,Pierre Wellner +16 more
TL;DR: The AMI Meeting Corpus as mentioned in this paper is a multi-modal data set consisting of 100 hours of meeting recordings, which is being created in the context of a project that is developing meeting browsing technology and will eventually be released publicly.
The AMI meeting corpus
Iain McCowan,Jean Carletta,Wessel Kraaij,Simone Ashby,S. Bourban,Michael J. Flynn,Maël Guillemot,Thomas Hain,J. Kadlec,Vasilis Karaiskos,Melissa Kronenthal,Guillaume Lathoud,Mike Lincoln,Agnes Lisowska,Wilfried Post,Dennis Reidsma,Pierre Wellner +16 more
TL;DR: The corpus is being distributed using a web server designed to allow convenient browsing and download of multimedia content and associated annotations, as well as data collection, annotation and distribution.
Journal ArticleDOI
Markov models for offline handwriting recognition: a survey
Thomas Plötz,Gernot A. Fink +1 more
TL;DR: A comprehensive overview of the application of Markov models in the research field of offline handwriting recognition, covering both the widely used hidden Markov model and the less complex Markov-chain or n-gram models is provided.
Proceedings ArticleDOI
IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard
Marcus Liwicki,Horst Bunke +1 more
TL;DR: IAM-OnDB is a new large online handwritten sentences database that consists of text acquired via an electronic interface from a whiteboard and a recognizer for unconstrained English text that was trained and tested using this database.
References
More filters
Journal ArticleDOI
Maximum likelihood from incomplete data via the EM algorithm
Journal ArticleDOI
The viterbi algorithm
TL;DR: This paper gives a tutorial exposition of the Viterbi algorithm and of how it is implemented and analyzed, and increasing use of the algorithm in a widening variety of areas is foreseen.
Journal ArticleDOI
Online and off-line handwriting recognition: a comprehensive survey
TL;DR: The nature of handwritten language, how it is transduced into electronic data, and the basic concepts behind written language recognition algorithms are described.
Journal ArticleDOI
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
TL;DR: An important feature of the method is that arbitrary adaptation data can be used—no special enrolment sentences are needed and that as more data is used the adaptation performance improves.
Journal ArticleDOI
The IAM-database: an English sentence database for offline handwriting recognition
Urs-Viktor Marti,Horst Bunke +1 more
TL;DR: A database that consists of handwritten English sentences based on the Lancaster-Oslo/Bergen corpus, which is expected that the database would be particularly useful for recognition tasks where linguistic knowledge beyond the lexicon level is used.
Related Papers (5)
Online and off-line handwriting recognition: a comprehensive survey
IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard
Marcus Liwicki,Horst Bunke +1 more