scispace - formally typeset
Search or ask a question
Topic

Intelligent word recognition

About: Intelligent word recognition is a research topic. Over the lifetime, 2480 publications have been published within this topic receiving 45813 citations.


Papers
More filters
Patent
28 Dec 1995
TL;DR: In this article, an adaptive weighting handwriting recognition device and method compares information representing handwritten input with stored recognition data at least some of which has a weighting value associated therewith, which can be further processed with user editing instructions to modify and correct the candidate recognition information.
Abstract: An adaptive weighting handwriting recognition device and method compares information representing handwritten input with stored recognition data at least some of which has a weighting value associated therewith. The weighting values remains fixed during comparison of the handwritten input and stored recognition data and provide candidate recognition information, which can be further processed with user editing instructions to modify and correct the candidate recognition information. During user editing, the weighting values of the stored recognition data associated with the corrected candidate recognition information modified are varied to enhance the likelihood of correct future handwriting recognition.

12 citations

Proceedings ArticleDOI
Xujun Peng1, Huaigu Cao1, Krishna Subramanian1, Rohit Prasad1, Prem Natarajan1 
25 Aug 2013
TL;DR: A novel binarization method that is especially effective on historical documents with the following characteristics, which includes free-form cursive handwritten text with significant but consistent slant and a CRF-based framework which handles bleeds using a novel approach.
Abstract: We present a novel binarization method that is especially effective on historical documents with the following characteristics: (a) the documents contain free-form cursive handwritten text with significant but consistent slant, (b) scanning artifacts resulting in the text and background pixels not having uniform intensity even within the same page, and (c) pages containing significant amount of bleeds from the other side of the page. In order to tackle the problem of non-uniform text and background intensity, we use a thresholding algorithm that works equally well for regions of the page containing text and regions of the page containing no text. We then combine this algorithm with a CRF-based framework which handles bleeds using a novel approach to further improve the quality of binarization. We compare the proposed binarization algorithm against other popular binarization algorithms both qualitatively using examples and quantitatively using the word error rate (WER) metric from performing optical character recognition (OCR) on binarized text using the BBN Byblos Offline Handwritten text recognition (OHR) system.

12 citations

Proceedings ArticleDOI
16 Aug 1998
TL;DR: This method provides a solution to the crucial issue of assigning reliable cost to the edges of the segmentation graph in the popular over-segmentation followed by dynamic programming approach for word recognition.
Abstract: We present a method of combining multiple classifiers for optimizing word recognition. The proposed method combines the results of individual classifiers in such a way that the correct word is more likely to be hypothesized. This method provides a solution to the crucial issue of assigning reliable cost to the edges of the segmentation graph in the popular over-segmentation followed by dynamic programming approach for word recognition. Three combination functions are proposed and implemented. Experiments show that proposed method has a significant improvement on the word recognition accuracy.

12 citations

Proceedings ArticleDOI
01 Aug 1992
TL;DR: A robust algorithm for offline cursive script recognition that is not requiring explicit word training yet is able to recognize many handwriting styles and is being successfully tested on a database of handwritten words extracted from live mail with dictionary sizes of up to 300 words.
Abstract: A robust algorithm for offline cursive script recognition is described. The algorithm uses a generate-and-test paradigm to analyze cursive word images. The generate phase of the algorithm intelligently segments the word after analyzing certain structural features present in the word. The test phase determines the most likely character candidates among the segmentation points by using a recognition algorithm trained on generalized cursive letter shapes. In a sense, word recognition is done by sliding a variable sized window across the word looking for recognizable characters and strokes. The output of this system is a list of all plausible interpretations of the word. This list is then analyzed by a two-step contextual post- processor which first matches all of the interpretations to a supplied dictionary using a string matching algorithm. This eliminates the least likely interpretations. The remaining candidates are then analyzed for certain character spatial relationships (local reference line finder) to finally rank the dictionary. The system has the advantage of not requiring explicit word training yet is able to recognize many handwriting styles. This system is being successfully tested on a database of handwritten words extracted from live mail with dictionary sizes of up to 300 words. Planned extensions include developing a multilevel generate-and-test paradigm which can handle any type of handwritten word.

12 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
86% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
84% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Object detection
46.1K papers, 1.3M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202314
202241
20201
20192
20189
201751