Showing papers on "Intelligent word recognition published in 1997"

PDF

Open Access

Book•

Handbook of Character Recognition and Document Image Analysis

[...]

02 May 1997

TL;DR: Arabic character recognition, A. Amin automatic reading of braille documents, and Antonacopoulos techniques for improving OCR results.

...read moreread less

Abstract: Arabic character recognition, A. Amin automatic reading of braille documents, A. Antonacopoulos techniques for improving OCR results, A. Dengel offline handwritten word recognition using hidden Markov models, A. Kundu combinations of multiple classifier decisions for OCR, L. Lam and C.Y. Suen classification techniques - statistical pattern recognition, neural networks and their relations, J. Schurmann cursive handwriting recognition - contextual and context - free techniques, M. Shridhar and Kimura multilingual document recognition, L. Spitz information retrieval and OCR, K. Taghva technical drawing analysis - including vectorization, D. Dori and K. Tombre reading of music notation, N. Carter and D. Bainbridge benchmarking, T. Nartker et al automatic signature verification, S. Impedovo. (Part Contents).

...read moreread less

387 citations

Journal Article•DOI•

A lexicon driven approach to handwritten word recognition for real-time applications

[...]

Gyeonghwan Kim¹, Venu Govindaraju²•Institutions (2)

State University of New York System¹, University at Buffalo²

01 Apr 1997-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: Experimental results prove that the approach using the variable duration outperforms the method using fixed duration in terms of both accuracy and speed.

...read moreread less

Abstract: A fast method of handwritten word recognition suitable for real time applications is presented in this paper. Preprocessing, segmentation and feature extraction are implemented using a chain code representation of the word contour. Dynamic matching between characters of a lexicon entry and segment(s) of the input word image is used to rank the lexicon entries in order of best match. Variable duration for each character is defined and used during the matching. Experimental results prove that our approach using the variable duration outperforms the method using fixed duration in terms of both accuracy and speed. Speed of the entire recognition process is about 200 msec on a single SPARC-10 platform and the recognition accuracy is 96.8 percent are achieved for lexicon size of 10, on a database of postal words captured at 212 dpi.

...read moreread less

286 citations

Journal Article•DOI•

Handwritten word recognition with character and inter-character neural networks

[...]

Paul D. Gader¹, Magdi A. Mohamed¹, Jung-Hsien Chiang•Institutions (1)

University of Missouri¹

01 Feb 1997

TL;DR: An off-line handwritten word recognition system that assigns confidence that pairs of segments are compatible with character confidence assignments and that this confidence is integrated into the dynamic programming is described.

...read moreread less

Abstract: An off-line handwritten word recognition system is described. Images of handwritten words are matched to lexicons of candidate strings. A word image is segmented into primitives. The best match between sequences of unions of primitives and a lexicon string is found using dynamic programming. Neural networks assign match scores between characters and segments. Two particularly unique features are that neural networks assign confidence that pairs of segments are compatible with character confidence assignments and that this confidence is integrated into the dynamic programming. Experimental results are provided on data from the U.S. Postal Service.

...read moreread less

151 citations

Patent•

Recognition dictionary system structure and changeover method of speech recognition system for car navigation

[...]

Shinji Wakisaka¹, Kazuyoshi Ishiwatari¹, Kouji Ito¹, Tetsuji Toge¹, Makoto Tanaka¹ - Show less +1 more•Institutions (1)

Hitachi¹

13 Nov 1997-Journal of the Acoustical Society of America

TL;DR: In this paper, a dictionary change-over section for making a changeover between dictionaries to be subjected to speech recognition in accordance with dictionary changeover information, a first memory for storing a plurality of dictionaries, a second record for storing one dictionary made an object of recognition, and a speech recognition section for performing speech recognition processing, whereby speech recognition is performed while making the changeover, as required.

...read moreread less

Abstract: A speech recognition system realizing large-vocabulary speech recognition at a low cost without deteriorating the rate of recognition and a recognition speed performance is provided with a dictionary change-over section for making a change-over between dictionaries to be subjected to speech recognition in accordance with dictionary change-over information, a first memory for storing a plurality of dictionaries, a second memory for storing one dictionary made an object of recognition, and a speech recognition section for performing a speech recognition processing, whereby speech recognition is performed while making a change-over between dictionaries, as required. For example, in a car navigation speech recognition system, the change-over between dictionaries is made for each area in accordance with position information.

...read moreread less

83 citations

Patent•

Apparatus and method for OCR character and confidence determination using multiple OCR devices

[...]

Roger B. Bradford¹•Institutions (1)

Science Applications International Corporation¹

13 Jan 1997

TL;DR: In an optical character recognition (OCR) system an improved method and apparatus for recognizing the character and producing an indication of the confidence with which the character has been recognized as mentioned in this paper.

...read moreread less

Abstract: In an optical character recognition (OCR) system an improved method and apparatus for recognizing the character and producing an indication of the confidence with which the character has been recognized. The system employs a plurality of different OCR devices each of which outputs a indicated (or recognized) character along with the individual devices own determination of how confident it is in the indication. The OCR system uses that data output from each of the different OCR devices along with other attributes of the indicated character such as the relative accuracy of the particular OCR device indicating the character to choose the select character recognized by the system and to produce a combined confidence indication of how confident the system is in its recognition.

...read moreread less

80 citations

Journal Article•DOI•

On-line handwritten alphanumeric character recognition using dominant points in strokes

[...]

Xiaolin Li¹, Dit-Yan Yeung¹•Institutions (1)

Hong Kong University of Science and Technology¹

01 Jan 1997-Pattern Recognition

TL;DR: In this paper, an approach to on-line handwritten alphanumeric character recognition based on sequential handwriting signals is presented and the issue of reference (or template) set evolution is also addressed.

...read moreread less

78 citations

Patent•

Method of grouping handwritten word segments in handwritten document images

[...]

Tanveer Syeda-Mahmood¹•Institutions (1)

Xerox¹

29 Sep 1997

TL;DR: In this paper, a method and system of recognizing handwritten words in scanned documents is presented, wherein by processing a document containing handwriting, features for word localization are extracted from handwritten words contained in said document through basis points taken from a single curve of text lines.

...read moreread less

Abstract: A method and system of recognizing handwritten words in scanned documents, wherein by processing a document containing handwriting, features for word localization are extracted from handwritten words contained in said document through basis points taken from a single curve of text lines. The method is independent of page orientation, and does not assume that the individual lines of handwritten text are parallel, and the method does not require that word regions be aligned with text line orientation wherein intra-word statistics are derived from sample pages rather than using a fixed threshold. The method has applications in digital libraries, handwriting tokenization, document management and OCR systems.

...read moreread less

78 citations

Proceedings Article•DOI•

Font recognition and contextual processing for more accurate text recognition

[...]

Hongwei Shi¹, T. Pavlidis•Institutions (1)

State University of New York System¹

18 Aug 1997

TL;DR: Font recognition and contextual processing are developed as two components that enhance the recognition accuracy of a text recognition system presented in a previous paper.

...read moreread less

Abstract: Font recognition and contextual processing are developed as two components that enhance the recognition accuracy of a text recognition system presented in a previous paper ((H. Shi and T. Pavlidis, 1996). Font information is extracted from two sources: one is the global page properties, and the other is the graph matching result of recognized short words such as a, it and of etc. Contextual processing is done by first composing word candidates from the recognition results and then checking each candidate with a dictionary through a spelling checker. Positional binary trigrams and word affixes are used to prune the search for word candidates.

...read moreread less

64 citations

Patent•

Confusion matrix based method and system for correcting misrecognized words appearing in documents generated by an optical character recognition technique

[...]

Randy G. Goldberg¹•Institutions (1)

AT&T¹

11 Aug 1997

TL;DR: In this article, a method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition ("OCR") technique is presented.

...read moreread less

Abstract: A method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition ("OCR") technique. If an incorrect word is found in the electronic document, the present invention generates at least one reference word and selects the reference word that is the most likely correct replacement for the incorrect word. This selection is accomplished by performing a probabilistic determination that assigns to each reference word a replacement word recognition probability. The probabilistic determination is carried out on the basis of a pre-stored confusion matrix that stores a plurality of probability values. The confusion matrix is used to associate each character of recognized word in the electronic document with a corresponding character of a word in the original document on the basis of these probability values.

...read moreread less

63 citations

Proceedings Article•DOI•

On-line handwritten character pattern database sampled in a sequence of sentences without any writing instructions

[...]

Masaki Nakagawa¹, T. Higashiyama, Y. Yamanaka, Shin-ichi Sawada, L. Higashigawa, K. Akiyama - Show less +2 more•Institutions (1)

University of Tokyo¹

18 Aug 1997

TL;DR: A database of on-line handwritten character patterns sampled in a sequence of sentences without any instructions is presented, describing the characteristics of this database as well as several tools to collect patterns.

...read moreread less

Abstract: The paper presents a database of on-line handwritten character patterns sampled in a sequence of sentences without any instructions. The sentences according to which character patterns are collected have been picked up from newspaper to include 1227 frequently appearing character categories with the result that they are composed of about 10000 characters and include 1537 JIS 1st level character categories. The rest of the JIS 1st level 1808 categories have been added at the end of the above text and written one by one. The total text has been commonly employed for collecting script patterns from a number of people. Patterns offered were inspected and omissions and wrong patterns were rewritten. The authors collected data from 80 people and made the 12000/spl times/80 patterns available from February 1996. More patterns are being collected. The paper describes the characteristics of this database as well as several tools to collect patterns.

...read moreread less

50 citations

Patent•

Method and apparatus for performing an automatic correction of misrecognized words produced by an optical character recognition technique by using a Hidden Markov Model based algorithm

[...]

Randy G. Goldberg¹•Institutions (1)

AT&T¹

11 Aug 1997

TL;DR: In this paper, a method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition (OCR) technique is presented.

...read moreread less

Abstract: A method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition (“OCR”) technique. Each recognized word is generated by first producing, for each character position of the corresponding word in the original document, the N-best characters for occupying that character position. If an incorrect word is found in the electronic document, the present invention generates a plurality of reference words from which one is selected for replacing the incorrect word. This selected reference word is determined by the present invention to be the reference word that is the most likely correct replacement for the incorrect recognized word. This selection is accomplished by computing for each reference word a replacement word value. The reference word that is selected to replace the incorrect recognized word corresponds to the highest replacement word value.

...read moreread less

Journal Article•DOI•

A structural and relational approach to handwritten word recognition

[...]

Richard Buse¹, Zhi-Qiang Liu², Terry Caelli¹•Institutions (2)

University of Melbourne¹, City University of Hong Kong²

01 Oct 1997

TL;DR: A new off-line word recognition system that is able to recognize unconstrained handwritten words using grey-scale images based on structural and relational information in the handwritten word is presented.

...read moreread less

Abstract: In this paper, we present a new off-line word recognition system that is able to recognize unconstrained handwritten words using grey-scale images. This is based on structural and relational information in the handwritten word. We use Gabor filters to extract features from the words, and then use an evidence-based approach for word classification. A solution to the Gabor filter parameter estimation problem is given, enabling the Gabor filter to be automatically tuned to the word image properties. We also developed two new methods for correcting the slope of the handwritten words. Our experiments show that the proposed method achieves good recognition rates compared to standard classification methods.

...read moreread less

Book Chapter•DOI•

Handwritten word recognition using hidden markov model

[...]

Amlan Kundu

01 May 1997

Proceedings Article•DOI•

Machine and human recognition of segmented characters from handwritten words

[...]

Fumitaka Kimura¹, N. Kayahara¹, Yasuji Miyake¹, M. Shridhar¹•Institutions (1)

Mie University¹

18 Aug 1997

TL;DR: Experimental results show that when the characters are segmented from words and are randomly presented, the accuracy of the machine recognition is comparable with the average human recognition accuracy.

...read moreread less

Abstract: Handwritten character recognition by human readers, a statistical classifier, and a neural network is compared to know the required accuracy for handwritten word recognition. Sample characters extracted from postal address words on mail pieces collected by USPS were used to evaluate human and machine performance. Experimental results show that: 1) when the characters are segmented from words and are randomly presented, the accuracy of the machine recognition is comparable with the average human recognition accuracy, 2) the neural network employing the feature vector of size 64 outperforms the statistical classifier employing the same feature vector, and that 3) the statistical classifier employing the feature vector of size 400 achieves comparable recognition rate with the best human reader.

...read moreread less

Proceedings Article•DOI•

Recovery of temporal information of cursively handwritten words for on-line recognition

[...]

Horst Bunke, R. Ammann¹, Guido Kaufmann¹, Thien M. Ha¹, M. Schenkel², R. Seiler², F. Eggimann² - Show less +3 more•Institutions (2)

University of Bern¹, École Polytechnique Fédérale de Lausanne²

18 Aug 1997

TL;DR: A method for the recovery of the stroke order from static handwritten images is presented, tested by classifying the words of an off-line database with a state-of-the-art on-line recognition system.

...read moreread less

Abstract: On-line recognition differs from off-line recognition in that additional information about the drawing order of the strokes is available. This temporal information makes it easier to recognize handwritten texts with an on-line recognition system. In this paper we present a method for the recovery of the stroke order from static handwritten images. The algorithm was tested by classifying the words of an off-line database with a state-of-the-art on-line recognition system. On this database with 150 different words, written by four cooperative writers, a recognition rate of 97.4% was obtained.

...read moreread less

Journal Article•DOI•

Document retrieval tolerating character recognition errors—evaluation and application

[...]

Katsumi Marukawa¹, Tao Hu¹, Hiromichi Fujisawa¹, Yoshihiro Shima¹•Institutions (1)

Hitachi¹

01 Aug 1997-Pattern Recognition

TL;DR: Two methods of combining character recognition with techniques for retrieving Japanese documents are presented and it is shown how these methods can be applied to textual image retrieval.

...read moreread less

Patent•

Handwritten character recognition apparatus and method using a clustering algorithm

[...]

Hotta Yoshinobu¹, Naoi Satoshi¹, Misako Suwa¹•Institutions (1)

Fujitsu¹

18 Feb 1997

TL;DR: For a plurality of handwritten characters extracted from an input image, a character category for each character is first determined by a character recognition process as discussed by the authors, and according to a clustering process, similarity levels of character-forms among extracted characters are determined, and based on the determination result, the character category determination result from the first character classification process is modified.

...read moreread less

Abstract: For a plurality of handwritten characters extracted from an input image, a character category for each character is first determined by a character recognition process. Second, according to a clustering process, similarity levels of character-forms among extracted characters are determined, and based on the determination result, the character category determination result from the first character recognition process is modified.

...read moreread less

Proceedings Article•DOI•

Automatic prototype extraction for adaptive OCR

[...]

George Nagy¹, Yihong Xu¹•Institutions (1)

Rensselaer Polytechnic Institute¹

18 Aug 1997

TL;DR: A Bayesian method of isolating character bitmaps from paragraph-length samples of heavily degraded text images is demonstrated and is sufficiently robust to tolerate errors in transcripts obtained from multifont commercial OCR software.

...read moreread less

Abstract: A Bayesian method of isolating character bitmaps from paragraph-length samples of heavily degraded text images is demonstrated. The method requires a transcript of the text, but it is sufficiently robust to tolerate errors in transcripts obtained from multifont commercial OCR software. The resulting prototypes (labeled character images) are used to recognize additional text an the same document.

...read moreread less

Proceedings Article•

Learning and Application of Differential Grammars

[...]

David M. W. Powers

01 Jan 1997

TL;DR: This paper discusses models of confusion which may be used in the identification of confused words, shows how significant contexts may be identified and condensed into Differential Grammars, and compares the performance of the implementa t ion with two commercial checkers which purpor t to handle the confused word problem.

...read moreread less

Abstract: We examine the Differential Grammar , a representat ion designed to discr iminate which of a set of eonfusable al ternat ives is most likely in the context it occurs in. This approach is useful whereever uncer ta inty may exist about the ident i ty of a token or sequence of tokens, including in speech recognition, optical character recognition and machine t ransla t ion. In this paper our appl ica t ion is word processing: we discuss mul t ip le models of confusion which may be used in the identification of confused words, we show how significant contexts may be identified and condensed into Differential Grammars , and we contrast the performance of our implementa t ion with tha t of two commercial g r ammar checkers which purpor t to handle the confused word problem.

...read moreread less

Proceedings Article•DOI•

A study of moment functions and its use in Chinese character recognition

[...]

Simon Liao¹, Qin Lu¹•Institutions (1)

University of Winnipeg¹

18 Aug 1997

TL;DR: New moment features for Chinese character recognition are proposed that provide significant improvements in terms of Chinese character Recognition, especially for those characters that are very close in shapes.

...read moreread less

Abstract: Moment descriptors have been developed as features in pattern recognition since the moment method was first introduced. In this paper, new moment features for Chinese character recognition are proposed. These provide significant improvements in terms of Chinese character recognition, especially for those characters that are very close in shapes.

...read moreread less

Journal Article•

Chain Code Processing for Handwritten Word Recognition

[...]

Sriganesh Madhvanath, Eugene H. Kim, Venu Govindaraju

01 Jan 1997-IEEE Transactions on Pattern Analysis and Machine Intelligence

Book Chapter•DOI•

A Spatio-temporal Perceptron for On-Line Handwritten Character Recognition

[...]

Nasser Mozayyani¹, Gilles Vaucher¹•Institutions (1)

Supélec¹

08 Oct 1997

TL;DR: The objective of this work is the application of the spatio-temporal multilayer perceptron (ST-MLP) developed in the laboratory to the recognition of on-line handwritten characters.

...read moreread less

Abstract: The objective of this work is the application of the spatio-temporal multilayer perceptron (ST-MLP) developed in our laboratory to the recognition of on-line handwritten characters. The ST-MLP integrates a spatio-temporal data coding defined in the complex domain. Starting from the stroke of a character produced by a digitizing tablet, we conduct the recognition process in two steps. This procedure which is classic in this domain, consist of a preprocessing step and a recognition one. The first step (segmentation step), identifies some elementary (basic) lines, called primitives, from the stroke of the character. Then we utilise the ST-MLP to recognize the traced character from the primitives provided.

...read moreread less

Journal Article•DOI•

A language model based on semantically clustered words in a Chinese character recognition system

[...]

Hsi-Jian Lee¹, Cheng-Huang Tung¹•Institutions (1)

National Chiao Tung University¹

01 Aug 1997-Pattern Recognition

TL;DR: A new method for clustering the words in a dictionary into word groups so that a Chinese character recognition system can then use these groups in a language model to improve the recognition accuracy.

...read moreread less

Proceedings Article•DOI•

Contour-based image preprocessing for holistic handwritten word recognition

[...]

S. Madhvanath¹, Venu Govindaraju²•Institutions (2)

State University of New York System¹, University at Buffalo²

18 Aug 1997

TL;DR: The issues of determination of upper and lower contours of the word, determination of significant focal extrema on the contour, and determination of reference lines from contour representations of handwritten words are discussed.

...read moreread less

Abstract: The one-dimensional nature of contour representations presents interesting challenges for processing of images for handwritten word recognition. In this paper, we discuss the issues of determination of upper and lower contours of the word, determination of significant focal extrema on the contour, and determination of reference lines from contour representations of handwritten words.

...read moreread less

Patent•

Handwritten character entry method and device having two display areas

[...]

Satomi Sakai¹, Kengo Osawa¹•Institutions (1)

Fujitsu¹

16 Jan 1997

TL;DR: In an area of a tablet where a handwritten character is written for entry, the result of the recognition of handwritten characters is displayed by replacing the handwritten character written therein; at the same time, the recognition result is also displayed in a display field that can display more characters than can be shown in the handwritten characters entry area at one time as discussed by the authors.

...read moreread less

Abstract: In an area of a tablet where a handwritten character is written for entry, the result of the recognition of the handwritten character is displayed by replacing the handwritten character written therein; at the same time, the recognition result is also displayed in a display field that can display more characters than can be shown in the handwritten character entry area at one time

...read moreread less

Proceedings Article•DOI•

Off-line handwritten Chinese character recognition based on crossing line feature

[...]

Youbin Chen¹, Xiaoqing Ding, Youshou Wu•Institutions (1)

Tsinghua University¹

18 Aug 1997

TL;DR: A new method to extract crossing line features for off-line handwritten Chinese character recognition is proposed, in which the input pattern is nonlinearly normalized in order to compensate for shape variations.

...read moreread less

Abstract: A new method to extract crossing line features for off-line handwritten Chinese character recognition is proposed in this paper. Firstly, the input pattern is nonlinearly normalized in order to compensate for shape variations. Secondly, the normalized pattern is separated into four subpatterns according to the four kinds of elementary strokes. Thirdly, the four subpatterns are uniformly divided into M/spl times/M cells respectively. In every cell, the crossing lines are counted. Then a 4M/sup 2/-dimensional feature vector is generated. An off-line handwritten Chinese character recognition system is built based on this feature. Our experiments have demonstrated the effectiveness of the method proposed in this paper.

...read moreread less

Proceedings Article•DOI•

Offline handwritten Chinese character recognition via radical extraction and recognition

[...]

W.W.S. Ip, K.F.L. Chung¹, Daniel S. Yeung¹•Institutions (1)

Hong Kong Polytechnic University¹

18 Aug 1997

TL;DR: In this paper, a character decomposition approach based on deformable templates (DTs) has been used to extract radical sub-images from Chinese characters and feed the extracted radical images to an adopted structural based Chinese character recognizer whose outputs are then combined to produce the class label of the input character.

...read moreread less

Abstract: Despite the fact that Chinese characters are composed of radicals and that Chinese people usually formulate their knowledge of Chinese characters as a combination of radicals, very few studies have focused on a character decomposition approach to recognition, i.e., recognizing a character by first extracting and recognizing its radicals. Such an approach is adopted and the problem of how to extract radical sub-images from character images is particularly addressed. A radical extraction algorithm based on deformable templates (DTs) has been developed. The advantage of the character decomposition approach is demonstrated by feeding the extracted radical images to an adopted structural based Chinese character recognizer whose outputs are then combined to produce the class label of the input character. Simulation results show that the performance of the adopted Chinese character recognition system can be improved significantly when the character decomposition approach is used.

...read moreread less

Book Chapter•DOI•

Handwritten character recognition using neural networks

[...]

Thomas M Breuel

01 Jan 1997

Proceedings Article•DOI•

Recognition of handwritten Hindi numerals using structural descriptors

[...]

Ashraf Elnagar¹, F. Al-Kharousi, S. Harous•Institutions (1)

Sultan Qaboos University¹

12 Oct 1997

TL;DR: A method for the recognition of handwritten Hindi numerals is proposed based on structural descriptors of numeral shapes, which proves the tolerance of the proposed system to recognize a high variability ofnumeral shapes.

...read moreread less

Abstract: A method for the recognition of handwritten Hindi numerals is proposed based on structural descriptors of numeral shapes. The method consists of three major steps: 1) preprocessing, where a handwritten numeral is scanned, normalized, and then thinned; 2) a robust algorithm is developed to segment the scanned numeral image into stroke(s), based on feature points; and 3) identify cavity features. The output of this algorithm is a syntactic representation (that is one or more syntactic terms) of the scanned numeral. Finally, the syntactic representation is matched against a set of syntactic representation prototypes of handwritten numerals and the recognition result is reported. Early experimental results are encouraging and prove the tolerance of the proposed system to recognize a high variability of numeral shapes.

...read moreread less

Journal Article•DOI•

Image-based keyword recognition in oriental language document images

[...]

Jason Z. Zhu¹, Tao Hong², Jonathan J. Hull³•Institutions (3)

Microsoft¹, University at Buffalo², Ricoh³

01 Aug 1997-Pattern Recognition

TL;DR: Experimental results demonstrate the ability of the proposed algorithm to correctly recognize words in the presence of noise that could not be overcome by conventional character recognition or post-processing algorithms.

...read moreread less