Showing papers on "Handwriting recognition published in 2012"

PDF

Open Access

Journal Article•DOI•

The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]

[...]

Li Deng¹•Institutions (1)

18 Oct 2012-IEEE Signal Processing Magazine

TL;DR: “Best of the Web” presents the modified National Institute of Standards and Technology (MNIST) resources, consisting of a collection of handwritten digit images used extensively in optical character recognition and machine learning research.

...read moreread less

Abstract: In this issue, “Best of the Web” presents the modified National Institute of Standards and Technology (MNIST) resources, consisting of a collection of handwritten digit images used extensively in optical character recognition and machine learning research.

...read moreread less

1,626 citations

Journal Article•DOI•

Lexicon-free handwritten word spotting using character HMMs

[...]

Andreas Fischer¹, Andreas Keller¹, Volkmar Frinken¹, Horst Bunke¹•Institutions (1)

University of Bern¹

01 May 2012-Pattern Recognition Letters

TL;DR: For a multi-writer scenario on the IAM off-line database as well as for two single writer scenarios on historical data sets, it is shown that the proposed learning-based system outperforms a standard template matching method.

...read moreread less

293 citations

Journal Article•DOI•

A Novel Word Spotting Method Based on Recurrent Neural Networks

[...]

Volkmar Frinken¹, Andreas Fischer¹, R. Manmatha², Horst Bunke¹•Institutions (2)

University of Bern¹, University of Massachusetts Amherst²

01 Feb 2012-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A novel keyword spotting method for handwritten documents is described, derived from a neural network-based system for unconstrained handwriting recognition, that performs template-free spotting, i.e., it is not necessary for a keyword to appear in the training set.

...read moreread less

Abstract: Keyword spotting refers to the process of retrieving all instances of a given keyword from a document. In the present paper, a novel keyword spotting method for handwritten documents is described. It is derived from a neural network-based system for unconstrained handwriting recognition. As such it performs template-free spotting, i.e., it is not necessary for a keyword to appear in the training set. The keyword spotting is done using a modification of the CTC Token Passing algorithm in conjunction with a recurrent neural network. We demonstrate that the proposed systems outperform not only a classical dynamic time warping-based approach but also a modern keyword spotting system, based on hidden Markov models. Furthermore, we analyze the performance of the underlying neural networks when using them in a recognition task followed by keyword spotting on the produced transcription. We point out the advantages of keyword spotting when compared to classic text line recognition.

...read moreread less

283 citations

Journal Article•DOI•

An Accelerometer-Based Digital Pen With a Trajectory Recognition Algorithm for Handwritten Digit and Gesture Recognition

[...]

Jeen-Shing Wang¹, Fang-Chen Chuang¹•Institutions (1)

National Cheng Kung University¹

01 Jul 2012-IEEE Transactions on Industrial Electronics

TL;DR: The experimental results have successfully validated the effectiveness of the trajectory recognition algorithm for handwritten digit and gesture recognition using the proposed digital pen.

...read moreread less

Abstract: This paper presents an accelerometer-based digital pen for handwritten digit and gesture trajectory recognition applications. The digital pen consists of a triaxial accelerometer, a microcontroller, and an RF wireless transmission module for sensing and collecting accelerations of handwriting and gesture trajectories. The proposed trajectory recognition algorithm composes of the procedures of acceleration acquisition, signal preprocessing, feature generation, feature selection, and feature extraction. The algorithm is capable of translating time-series acceleration signals into important feature vectors. Users can use the pen to write digits or make hand gestures, and the accelerations of hand motions measured by the accelerometer are wirelessly transmitted to a computer for online trajectory recognition. The algorithm first extracts the time- and frequency-domain features from the acceleration signals and, then, further identifies the most important features by a hybrid method: kernel-based class separability for selecting significant features and linear discriminant analysis for reducing the dimension of features. The reduced features are sent to a trained probabilistic neural network for recognition. Our experimental results have successfully validated the effectiveness of the trajectory recognition algorithm for handwritten digit and gesture recognition using the proposed digital pen.

...read moreread less

164 citations

Journal Article•DOI•

Handwritten Chinese Text Recognition by Integrating Multiple Contexts

[...]

Qiu-Feng Wang, Fei Yin, Cheng-Lin Liu

01 Aug 2012-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly, and are superior by far to the best results reported in the literature.

...read moreread less

Abstract: This paper presents an effective approach for the offline recognition of unconstrained handwritten Chinese texts. Under the general integrated segmentation-and-recognition framework with character oversegmentation, we investigate three important issues: candidate path evaluation, path search, and parameter estimation. For path evaluation, we combine multiple contexts (character recognition scores, geometric and linguistic contexts) from the Bayesian decision view, and convert the classifier outputs to posterior probabilities via confidence transformation. In path search, we use a refined beam search algorithm to improve the search efficiency and, meanwhile, use a candidate character augmentation strategy to improve the recognition accuracy. The combining weights of the path evaluation function are optimized by supervised learning using a Maximum Character Accuracy criterion. We evaluated the recognition performance on a Chinese handwriting database CASIA-HWDB, which contains nearly four million character samples of 7,356 classes and 5,091 pages of unconstrained handwritten texts. The experimental results show that confidence transformation and combining multiple contexts improve the text line recognition performance significantly. On a test set of 1,015 handwritten pages, the proposed approach achieved character-level accurate rate of 90.75 percent and correct rate of 91.39 percent, which are superior by far to the best results reported in the literature.

...read moreread less

164 citations

Journal Article•DOI•

HMM-Based Lexicon-Driven and Lexicon-Free Word Recognition for Online Handwritten Indic Scripts

[...]

A. Bharath¹, Sriganesh Madhvanath•Institutions (1)

Hewlett-Packard¹

01 Apr 2012-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper proposes two different techniques for word recognition based on Hidden Markov Models (HMM): lexicon driven and lexicon free, which significantly outperforms either of them used in isolation on handwritten Devanagari word samples.

...read moreread less

Abstract: Research for recognizing online handwritten words in Indic scripts is at its early stages when compared to Latin and Oriental scripts In this paper, we address this problem specifically for two major Indic scripts-Devanagari and Tamil In contrast to previous approaches, the techniques we propose are largely data driven and script independent We propose two different techniques for word recognition based on Hidden Markov Models (HMM): lexicon driven and lexicon free The lexicon-driven technique models each word in the lexicon as a sequence of symbol HMMs according to a standard symbol writing order derived from the phonetic representation The lexicon-free technique uses a novel Bag-of-Symbols representation of the handwritten word that is independent of symbol order and allows rapid pruning of the lexicon On handwritten Devanagari word samples featuring both standard and nonstandard symbol writing orders, a combination of lexicon-driven and lexicon-free recognizers significantly outperforms either of them used in isolation In contrast, most Tamil word samples feature the standard symbol order, and the lexicon-driven recognizer outperforms the lexicon free one as well as their combination The best recognition accuracies obtained for 20,000 word lexicons are 8713 percent for Devanagari when the two recognizers are combined, and 918 percent for Tamil using the lexicon-driven technique

...read moreread less

107 citations

Proceedings Article•DOI•

Airwriting: Hands-Free Mobile Text Input by Spotting and Continuous Recognition of 3d-Space Handwriting with Inertial Sensors

[...]

Christoph Amma, Marcus Georgi, Tanja Schultz

18 Jun 2012

TL;DR: An input method which enables complex hands-free interaction through 3d handwriting recognition through Hidden Markov Models and a statistical language model is used to enhance recognition performance and restrict the search space.

...read moreread less

Abstract: We present an input method which enables complex hands-free interaction through 3d handwriting recognition. Users can write text in the air as if they were using an imaginary blackboard. Motion sensing is done wirelessly by accelerometers and gyroscopes which are attached to the back of the hand. We propose a two-stage approach for spotting and recognition of handwriting gestures. The spotting stage uses a Support Vector Machine to identify data segments which contain handwriting. The recognition stage uses Hidden Markov Models (HMM) to generate the text representation from the motion sensor data. Individual characters are modeled by HMMs and concatenated to word models. Our system can continuously recognize arbitrary sentences, based on a freely definable vocabulary with over 8000 words. A statistical language model is used to enhance recognition performance and restrict the search space. We report the results from a nine-user experiment on sentence recognition for person dependent and person independent setups on 3d-space handwriting data. For the person independent setup, a word error rate of 11% is achieved, for the person dependent setup 3% are achieved. We evaluate the spotting algorithm in a second experiment on a realistic dataset including everyday activities and achieve a sample based recall of 99\% and a precision of 25%. We show that additional filtering in the recognition stage can detect up to 99% of the false positive segments.

...read moreread less

105 citations

Proceedings Article•DOI•

QUWI: An Arabic and English Handwriting Dataset for Offline Writer Identification

[...]

Somaya Al Maadeed¹, Wael Ayouby¹, Abdelaali Hassaine¹, Jihad Mohamad Alja'am¹•Institutions (1)

Qatar University¹

18 Sep 2012

TL;DR: This paper presents a new offline dataset called the Qatar University Writer Identification dataset (QUWI), which consists of handwritten documents of 1017 volunteers and allows the dataset to be used for both text-dependent and text-independent writer identification tasks.

...read moreread less

Abstract: This paper presents a new offline dataset called the Qatar University Writer Identification dataset (QUWI). This dataset contains both Arabic and English handwritings and can be used to evaluate the performance of offline writer identification systems. It consists of handwritten documents of 1017 volunteers of different ages, nationalities, genders and education levels. The writers were asked to copy a specific text and to generate a random text, which allows the dataset to be used for both text-dependent and text-independent writer identification tasks. We describe the gathering and processing steps and define several evaluation tasks regarding the use of this dataset.

...read moreread less

102 citations

Journal Article•DOI•

A Model-Based Sequence Similarity with Application to Handwritten Word Spotting

[...]

Jose A. Rodriguez-Serrano¹, Florent Perronnin¹•Institutions (1)

Xerox¹

01 Nov 2012-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The proposed similarity outperforms the traditional DTW between the original sequences, and the model-based approach which uses ordinary continuous HMMs, and it is shown that this increase in accuracy can be traded against a significant reduction of the computational cost.

...read moreread less

Abstract: This paper proposes a novel similarity measure between vector sequences. We work in the framework of model-based approaches, where each sequence is first mapped to a Hidden Markov Model (HMM) and then a measure of similarity is computed between the HMMs. We propose to model sequences with semicontinuous HMMs (SC-HMMs). This is a particular type of HMM whose emission probabilities in each state are mixtures of shared Gaussians. This crucial constraint provides two major benefits. First, the a priori information contained in the common set of Gaussians leads to a more accurate estimate of the HMM parameters. Second, the computation of a similarity between two SC-HMMs can be simplified to a Dynamic Time Warping (DTW) between their mixture weight vectors, which significantly reduces the computational cost. Experiments are carried out on a handwritten word retrieval task in three different datasets-an in-house dataset of real handwritten letters, the George Washington dataset, and the IFN/ENIT dataset of Arabic handwritten words. These experiments show that the proposed similarity outperforms the traditional DTW between the original sequences, and the model-based approach which uses ordinary continuous HMMs. We also show that this increase in accuracy can be traded against a significant reduction of the computational cost.

...read moreread less

102 citations

Proceedings Article•DOI•

KHATT: Arabic Offline Handwritten Text Database

[...]

Sabri A. Mahmoud¹, Irfan Ahmad¹, Mohammad Alshayeb¹, Wasfi G. Al-Khatib¹, Mohammad Tanvir Parvez², Gernot A. Fink, Volker Märgner, Haikal El Abed - Show less +4 more•Institutions (2)

King Fahd University of Petroleum and Minerals¹, Qassim University²

18 Sep 2012

TL;DR: The comprehensive Arabic offline Handwritten Text database (KHATT) is reported after completion of the collection of 1000 handwritten forms written by 1000 writers from different countries, composed of an image database containing images of the written text at 200, 300, and 600 dpi resolutions, and a manually verified ground truth database that contains meta-data describing thewritten text at the page, paragraph, and line levels.

...read moreread less

Abstract: In this paper, we report our comprehensive Arabic offline Handwritten Text database (KHATT) after completion of the collection of 1000 handwritten forms written by 1000 writers from different countries. It is composed of an image database containing images of the written text at 200, 300, and 600 dpi resolutions, a manually verified ground truth database that contains meta-data describing the written text at the page, paragraph, and line levels. A formal verification procedure is implemented to align the handwritten text with its ground truth at the form, paragraph and line levels. Tools to extract paragraphs from pages and segment paragraphs into lines are developed. Preliminary experiments on Arabic handwritten text recognition are conducted using sample data from the database and the results are reported. The database will be made freely available to researchers world-wide for research in various handwritten-related problems such as text recognition, writer identification and verification, etc.

...read moreread less

88 citations

Book•DOI•

Guide to OCR for Arabic Scripts

[...]

Volker Mrgner, Haikal El Abed

04 Jul 2012

TL;DR: This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field and describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition.

...read moreread less

Abstract: This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Topics and features: contains contributions from the leading researchers in the field; with a Foreword by Professor Bente Maegaard of the University of Copenhagen; presents a detailed overview of Arabic character recognition technology, covering a range of different aspects of pre-processing and feature extraction; reviews a broad selection of varying approaches, including HMM-based methods and a recognition system based on multidimensional recurrent neural networks; examines the evaluation of Arabic script recognition systems, discussing data collection and annotation, benchmarking strategies, and handwriting recognition competitions; describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition.

...read moreread less

Journal Article•DOI•

On the influence of word representations for handwritten word spotting in historical documents

[...]

Josep Lladós¹, Marçal Rusiñol¹, Alicia Fornés¹, David Fernández¹, Anjan Dutta¹ - Show less +1 more•Institutions (1)

Autonomous University of Barcelona¹

26 Nov 2012-International Journal of Pattern Recognition and Artificial Intelligence

TL;DR: It is experimentally demonstrated that statistical representations generally give a better performance, however it cannot be neglected that large descriptors are difficult to be implemented in a retrieval scenario where word spotting requires the indexation of data with million word images.

...read moreread less

Abstract: Word spotting is the process of retrieving all instances of a queried keyword from a digital library of document images. In this paper we evaluate the performance of different word descriptors to assess the advantages and disadvantages of statistical and structural models in a framework of query-by-example word spotting in historical documents. We compare four word representation models, namely sequence alignment using DTW as a baseline reference, a bag of visual words approach as statistical model, a pseudo-structural model based on a Loci features representation, and a structural approach where words are represented by graphs. The four approaches have been tested with two collections of historical data: the George Washington database and the marriage records from the Barcelona Cathedral. We experimentally demonstrate that statistical representations generally give a better performance, however it cannot be neglected that large descriptors are difficult to be implemented in a retrieval scenario where word spotting requires the indexation of data with million word images.

...read moreread less

Proceedings Article•DOI•

The A2iA French handwriting recognition system at the Rimes-ICDAR2011 competition

[...]

Fares Menasri, Jérôme Louradour, Anne-Laure Bianne-Bernard, Christopher Kermorvant

22 Jan 2012

TL;DR: This paper describes the system for the recognition of French handwriting submitted by A2iA to the competition organized at ICDAR2011 using the Rimes database, which outperformed all previously proposed systems on these tasks.

...read moreread less

Abstract: This paper describes the system for the recognition of French handwriting submitted by A2iA to the competition organized at ICDAR2011 using the Rimes database. This system is composed of several recognizers based on three different recognition technologies, combined using a novel combination method. A framework multi-word recognition based on weighted finite state transducers is presented, using an explicit word segmentation, a combination of isolated word recognizers and a language model. The system was tested both for isolated word recognition and for multi-word line recognition and submitted to the RIMES-ICDAR2011 competition. This system outperformed all previously proposed systems on these tasks.

...read moreread less

Proceedings Article•DOI•

Vision-based handwriting recognition for unrestricted text input in mid-air

[...]

Alexander Schick, Daniel Morlock¹, Christoph Amma¹, Tanja Schultz¹, Rainer Stiefelhagen¹ - Show less +1 more•Institutions (1)

Karlsruhe Institute of Technology¹

22 Oct 2012

TL;DR: A vision-based system that recognizes handwriting in mid-air and provides an easy-to-use and accurate text input modality without placing restrictions on the users is proposed.

...read moreread less

Abstract: We propose a vision-based system that recognizes handwriting in mid-air. The system does not depend on sensors or markers attached to the users and allows unrestricted character and word input from any position. It is the result of combining handwriting recognition based on Hidden Markov Models with multi-camera 3D hand tracking. We evaluated the system for both quantitative and qualitative aspects. The system achieves recognition rates of 86.15% for character and 97.54% for small-vocabulary isolated word recognition. Limitations are due to slow and low-resolution cameras or physical strain. Overall, the proposed handwriting recognition system provides an easy-to-use and accurate text input modality without placing restrictions on the users.

...read moreread less

Journal Article•DOI•

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

[...]

J. Pradeep¹, E. Srinivasan¹, S. Himavathi•Institutions (1)

Pondicherry Engineering College¹

01 May 2012-International Journal of Engineering

TL;DR: This paper identifies the most suitable NN for the design of hand written English character recognition system using back propagation neural network, nearest neighbour network and radial basis function network to classify the characters.

...read moreread less

Abstract: Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten character recognition. This paper identifies the most suitable NN for the design of hand written English character recognition system. Different Neural Network (NN) topologies namely, back propagation neural network, nearest neighbour network and radial basis function network are built to classify the characters. All the NN based Recognition systems use the same training data set and are trained for the same target mean square error. Two hundred different character data sets for each of the 26 English characters are used to train the networks. The performance of the recognition systems is compared extensively using test data to draw the major conclusions of this paper

...read moreread less

Proceedings Article•DOI•

Script Independent Word Spotting in Offline Handwritten Documents Based on Hidden Markov Models

[...]

Safwan Wshah¹, Gaurav Kumar¹, Venu Govindaraju¹•Institutions (1)

University at Buffalo¹

18 Sep 2012

TL;DR: A novel script independent line based word spotting framework for offline handwritten documents based on Hidden Markov Models that outperforms the modern line based approach on the English, Arabic and Devanagari Datasets.

...read moreread less

Abstract: Keyword spotting aims to retrieve all instances of a given keyword from a document in any language. In this paper, we propose a novel script independent line based word spotting framework for offline handwritten documents based on Hidden Markov Models. The methodology simulates the keywords in model space as a sequence of character models and uses the filler models for better representation of background or non-keyword text. We propose a two stage spotting framework where the candidate keywords are further pruned using the character based background and lexicon based background model. The system deals with large vocabulary without the need for word or character segmentation. The system has been evaluated on many public dataset from several languages such as IAM for English, AMA for Arabic and LAW for Devanagari. The system outperforms the modern line based approach on the English, Arabic and Devanagari Datasets.

...read moreread less

Journal Article•DOI•

A synthesised word approach to word retrieval in handwritten documents

[...]

Yiqing Liang¹, Michael Fairhurst¹, Richard Guest¹•Institutions (1)

University of Kent¹

01 Dec 2012-Pattern Recognition

TL;DR: A novel method is described to overcome the training data problem using a character-based modelling approach and a word modelling technique enabling the retrieval of keywords that have not explicitly been seen in the training set.

...read moreread less

Journal Article•DOI•

Binary segmentation algorithm for English cursive handwriting recognition

[...]

Hong Lee¹, Brijesh Verma¹•Institutions (1)

Central Queensland University¹

01 Apr 2012-Pattern Recognition

TL;DR: This paper presents a novel Binary Segmentation Algorithm (BSA) that reduces the risks of the chain failure problems during validation and improves the segmentation accuracy.

...read moreread less

Proceedings Article•DOI•

Mode Detection in Online Handwritten Documents Using BLSTM Neural Networks

[...]

Emanuel Indermühle¹, Volkmar Frinken², Horst Bunke¹•Institutions (2)

University of Bern¹, Autonomous University of Barcelona²

18 Sep 2012

TL;DR: A new approach to mode detection is proposed that uses bidirectional long-short term memory (BLSTM) neural networks, a novel type of recursive neural network that has been successfully applied in speech and handwriting recognition.

...read moreread less

Abstract: Mode detection in online handwritten documents refers to the process of distinguishing different types of contents, such as text, formulas, diagrams, or tables, one from another. In this paper a new approach to mode detection is proposed that uses bidirectional long-short term memory (BLSTM) neural networks. The BLSTM neural network is a novel type of recursive neural network that has been successfully applied in speech and handwriting recognition. In this paper we show that it has the potential to significantly outperform traditional methods for mode detection, which are usually based on stroke classification. As a further advantage over previous approaches, the proposed system is trainable and does not rely on user-defined heuristics. Moreover, it can be easily adapted to new or additional types of modes by just providing the system with new training data.

...read moreread less

Proceedings Article•DOI•

Bag-of-Features Representations for Offline Handwriting Recognition Applied to Arabic Script

[...]

Leonard Rothacker, Szilárd Vajda, Gernot A. Fink

18 Sep 2012

TL;DR: This paper focuses on feature learning by estimating and applying a statistical bag-of-features model that is successfully used in image categorization and retrieval and the integration with a Hidden Markov Model (HMM) that is used for recognition.

...read moreread less

Abstract: Due to the great variabilities in human writing, unconstrained handwriting recognition is still considered an open research topic. Recent trends in computer vision, however, suggest that there is still potential for better recognition by improving feature representations. In this paper we focus on feature learning by estimating and applying a statistical bag-of-features model. These models are successfully used in image categorization and retrieval. The novelty here is the integration with a Hidden Markov Model (HMM) that we use for recognition. Our method is evaluated on the IFN/ENIT database consisting of images of handwritten Arabic town and village names.

...read moreread less

Proceedings Article•DOI•

Moment-Based Image Normalization for Handwritten Text Recognition

[...]

Michal Kozielski¹, Jens Forster¹, Hermann Ney¹•Institutions (1)

RWTH Aachen University¹

18 Sep 2012

TL;DR: The proposed normalization methods for handwriting recognition and moment-based normalization of images from digit recognition to the recognition of handwritten text provide robust estimates for text characteristics such as size and position of words within an image.

...read moreread less

Abstract: In this paper, we extend the concept of moment-based normalization of images from digit recognition to the recognition of handwritten text. Image moments provide robust estimates for text characteristics such as size and position of words within an image. For handwriting recognition the normalization procedure is applied to image slices independently. Additionally, a novel moment-based algorithm for line-thickness normalization is presented. The proposed normalization methods are evaluated on the RIMES database of French handwriting and the IAM database of English handwriting. For RIMES we achieve an improvement from 16.7% word error rate to 13.4% and for IAM from 46.6% to 40.4%.

...read moreread less

Proceedings Article•DOI•

Local Feature Based Online Mode Detection with Recurrent Neural Networks

[...]

Sebastian Otte, Dirk Krechel, Marcus Liwicki¹, Andreas Dengel¹•Institutions (1)

German Research Centre for Artificial Intelligence¹

18 Sep 2012

TL;DR: A novel approach for online mode detection, where the task is to classify ink traces into several categories, is proposed, where standard recurrent neural networks and the recently introduced long short-term memory networks are used.

...read moreread less

Abstract: In this paper we propose a novel approach for online mode detection, where the task is to classify ink traces into several categories. In contrast to previous approaches working on global features, we introduce a system completely relying on local features. For classification, standard recurrent neural networks (RNNs) and the recently introduced long short-term memory (LSTM) networks are used. Experiments are performed on the publicly available IAMonDo-database which serves as a benchmark data set for several researches. In the experiments we investigate several RNN structures and classification sub-tasks of different complexities. The final recognition rate on the complete test set is 98.47% in average, which is significantly higher than the 97% achieved with an MCS in previous work. Further interesting results on different subsets are also reported in this paper.

...read moreread less

Journal Article•DOI•

Development of comprehensive devnagari numeral and character database for offline handwritten character recognition

[...]

Vikas J. Dongre, Vijay H. Mankar

01 Jan 2012

TL;DR: The present work generated 5137 and 20305 isolated samples for numeral and character database, respectively, from 750 writers of all ages, sex, education, and profession, to facilitate research on handwriting recognition of Devnagari script through free access to the researchers.

...read moreread less

Abstract: In handwritten character recognition, benchmark database plays an important role in evaluating the performance of various algorithms and the results obtained by various researchers. In Devnagari script, there is lack of such official benchmark. This paper focuses on the generation of offline benchmark database for Devnagari handwritten numerals and characters. The present work generated 5137 and 20305 isolated samples for numeral and character database, respectively, from 750 writers of all ages, sex, education, and profession. The offline sample images are stored in TIFF image format as it occupies less memory. Also, the data is presented in binary level so that memory requirement is further reduced. It will facilitate research on handwriting recognition of Devnagari script through free access to the researchers.

...read moreread less

Book Chapter•DOI•

RWTH OCR: A Large Vocabulary Optical Character Recognition System for Arabic Scripts

[...]

Philippe Dreuw¹, David Rybach¹, Georg Heigold¹, Hermann Ney¹•Institutions (1)

RWTH Aachen University¹

01 Jan 2012

TL;DR: A novel large vocabulary OCR system, which implements a confidence- and margin-based discriminative training approach for model adaptation of an HMM-based recognition system to handle multiple fonts, different handwriting styles, and their variations.

...read moreread less

Abstract: We present a novel large vocabulary OCR system, which implements a confidence- and margin-based discriminative training approach for model adaptation of an HMM-based recognition system to handle multiple fonts, different handwriting styles, and their variations. Most current HMM approaches are HTK-based systems which are maximum likelihood (ML) trained and which try to adapt their models to different writing styles using writer adaptive training, unsupervised clustering, or additional writer-specific data. Here, discriminative training based on the maximum mutual information (MMI) and minimum phone error (MPE) criteria are used instead. For model adaptation during decoding, an unsupervised confidence-based discriminative training within a two-pass decoding process is proposed. Additionally, we use neural network-based features extracted by a hierarchical multi-layer perceptron (MLP) network either in a hybrid MLP/HMM approach or to discriminatively retrain a Gaussian HMM system in a tandem approach. The proposed framework and methods are evaluated for closed-vocabulary isolated handwritten word recognition on the IFN/ENIT-database Arabic handwriting database, where the word error rate is decreased by more than 50 % relative to an ML trained baseline system. Preliminary results for large vocabulary Arabic machine-printed text recognition tasks are presented on a novel publicly available newspaper database.

...read moreread less

Journal Article•DOI•

A paradigm for handwriting-based intelligent tutors

[...]

Lisa Anthony¹, Jie Yang¹, Kenneth R. Koedinger¹•Institutions (1)

Carnegie Mellon University¹

01 Nov 2012-International Journal of Human-computer Studies \/ International Journal of Man-machine Studies

TL;DR: This work can inform the design of future systems for students using pen and sketch input for math or other topics by motivating the use of context and pragmatics to decrease the impact of recognition errors and put user focus on the task at hand.

...read moreread less

Abstract: This paper presents the interaction design of, and demonstration of technical feasibility for, intelligent tutoring systems that can accept handwriting input from students. Handwriting and pen input offer several affordances for students that traditional typing-based interactions do not. To illustrate these affordances, we present evidence, from tutoring mathematics, that the ability to enter problem solutions via pen input enables students to record algebraic equations more quickly, more smoothly (fewer errors), and with increased transfer to non-computer-based tasks. Furthermore our evidence shows that students tend to like pen input for these types of problems more than typing. However, a clear downside to introducing handwriting input into intelligent tutors is that the recognition of such input is not reliable. In our work, we have found that handwriting input is more likely to be useful and reliable when context is considered, for example, the context of the problem being solved. We present an intelligent tutoring system for algebra equation solving via pen-based input that is able to use context to decrease recognition errors by 18% and to reduce recognition error recovery interactions to occur on one out of every four problems. We applied user-centered design principles to reduce the negative impact of recognition errors in the following ways: (1) though students handwrite their problem-solving process, they type their final answer to reduce ambiguity for tutoring purposes, and (2) in the small number of cases in which the system must involve the student in recognition error recovery, the interaction focuses on identifying the student's problem-solving error to keep the emphasis on tutoring. Many potential recognition errors can thus be ignored and distracting interactions are avoided. This work can inform the design of future systems for students using pen and sketch input for math or other topics by motivating the use of context and pragmatics to decrease the impact of recognition errors and put user focus on the task at hand.

...read moreread less

Book Chapter•DOI•

Online Arabic Databases and Applications

[...]

Houcine Boubaker¹, Abdelkarim Elbaati¹, Najiba Tagougui¹, Haikal El Abed², Monji Kherallah¹, Adel M. Alimi¹ - Show less +2 more•Institutions (2)

University of Sfax¹, Braunschweig University of Technology²

01 Jan 2012

TL;DR: The nature of the Arabic handwritten language and the basic concepts behind the recognition process are presented and an overview of online Arabic databases and applications presented in the literature is discussed in detail.

...read moreread less

Abstract: Large databases were developed for handwriting recognition in Latin script. In contrast, very few databases have been developed for Arabic script, and fewer have become publicly available. This paper describes a pilot study in which we present the nature of the Arabic handwritten language and the basic concepts behind the recognition process. An overview of online Arabic databases and applications presented in the literature is discussed in detail. We also present some related works using these databases.

...read moreread less

Proceedings Article•DOI•

ICFHR 2012 Competition on Recognition of On-Line Mathematical Expressions (CROHME 2012)

[...]

Harold Mouchère¹, Christian Viard-Gaudin¹, Dae Hwan Kim², Jin Hyung Kim², Utpal Garain - Show less +1 more•Institutions (2)

École Normale Supérieure¹, KAIST²

18 Sep 2012

TL;DR: The contest details including the evaluation measures used as well as the performance of the 7 submitted systems are described along with a short description of each system.

...read moreread less

Abstract: This paper presents an overview of the second Competition on Recognition of Online Handwritten Mathematical Expressions, CROHME 2012. The objective of the contest is to identify current advances in mathematical expression recognition using common evaluation performance measures and datasets. This paper describes the contest details including the evaluation measures used as well as the performance of the 7 submitted systems along with a short description of each system. Progress as compared to the 1st version of CROHME is also documented.

...read moreread less

Journal Article•DOI•

On Using Entropy for Enhancing Handwriting Preprocessing

[...]

Andreas Holzinger, Christof Stocker, Bernhard Peischl, Klaus Martin Simonic¹•Institutions (1)

Medical University of Graz¹

19 Nov 2012-Entropy

TL;DR: This paper evaluates the performance of entropy based slant- and skew-correction, and compares the results to other methods, and shows that the entropy-based slant correction method outperforms a window based approach with an average precision.

...read moreread less

Abstract: Handwriting is an important modality for Human-Computer Interaction. For medical professionals, handwriting is (still) the preferred natural method of documentation. Handwriting recognition has long been a primary research area in Computer Science. With the tremendous ubiquity of smartphones, along with the renaissance of the stylus, handwriting recognition has become a new impetus. However, recognition rates are still not 100% perfect, and researchers still are constantly improving handwriting algorithms. In this paper we evaluate the performance of entropy based slant- and skew-correction, and compare the results to other methods. We selected 3700 words of 23 writers out of the Unipen-ICROW-03 benchmark set, which we annotated with their associated error angles by hand. Our results show that the entropy-based slant correction method outperforms a window based approach with an average precision of ±6.02° for the entropy-based method, compared with the ±7.85° for the alternative. On the other hand, the entropy-based skew correction yields a lower average precision of ±2:86°, compared with the average precision of ±2.13° for the alternative LSM based approach.

...read moreread less

Proceedings Article•DOI•

An Unconstrained Benchmark Urdu Handwritten Sentence Database with Automatic Line Segmentation

[...]

Ahsen Raza¹, Imran Siddiqi², Ali Abidi¹, Fahim Arif¹•Institutions (2)

National University of Science and Technology¹, Bahria University²

18 Sep 2012

TL;DR: A novel off-line sentence database of Urdu handwritten documents along with a few preprocessing and text line segmentation procedures are presented and announced.

...read moreread less

Abstract: In this paper we present and announce a novel off-line sentence database of Urdu handwritten documents along with a few preprocessing and text line segmentation procedures. Despite an increased research interest in Urdu handwritten document analysis over the recent years, a standard benchmark dataset, which could be used in Urdu handwriting recognition tasks, has been missing. Based on our own developed and updated corpus named CENIP-UCCP (Center for Image Processing-Urdu Corpus Construction Project), we have developed an Urdu handwritten database. The corpus is a collection of a variety of Urdu texts that were used to generate forms. These forms were subsequently filled by native writers in their natural handwritings. Six categories of text were used to generate these forms with each category using approximately 66 forms. Up till now, the database comprises 400 digitized forms produced by 200 different writers. The database is completely labeled for content information as well as content detection and supports the evaluation of systems like Urdu handwriting recognition, line segmentation and writer identification. The database was also experimented with the proposed Urdu text line segmentation scheme rendering promising segmentation results.

...read moreread less

Proceedings Article•DOI•

Stroke Segmentation and Recognition from Bangla Online Handwritten Text

[...]

Nilanjana Bhattacharya¹, Umapada Pal²•Institutions (2)

Bose Institute¹, Indian Statistical Institute²

18 Sep 2012

TL;DR: This paper deals with recognition of online handwritten Bangla (Bengali) text with segmentation of text into strokes, and discovered some rules analyzing different joining patterns of Bangla characters.

...read moreread less

Abstract: This paper deals with recognition of online handwritten Bangla (Bengali) text. Here, at first, we segment cursive words into strokes. A stroke may represent a character or a part of a character. We selected a set of Bangla words written by different groups of people such that they contain all basic characters, all vowel and consonant modifiers and almost all types of possible joining among them. For segmentation of text into strokes, we discovered some rules analyzing different joining patterns of Bangla characters. Combination of online and offline information was used for segmentation. We achieved correct segmentation rate of 97.89% on the dataset. We manually analyzed different strokes to create a ground truth set of distinct stroke classes for result verification and we obtained 85 stroke classes. Directional features were used in SVM for recognition and we achieved correct stroke recognition rate of 97.68%.

...read moreread less

Collapse