scispace - formally typeset
Search or ask a question
Topic

Intelligent word recognition

About: Intelligent word recognition is a research topic. Over the lifetime, 2480 publications have been published within this topic receiving 45813 citations.


Papers
More filters
Patent
Randy G. Goldberg1
11 Aug 1997
TL;DR: In this article, a method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition ("OCR") technique is presented.
Abstract: A method and apparatus for correcting misrecognized words appearing in electronic documents that have been generated by scanning an original document in accordance with an optical character recognition ("OCR") technique. If an incorrect word is found in the electronic document, the present invention generates at least one reference word and selects the reference word that is the most likely correct replacement for the incorrect word. This selection is accomplished by performing a probabilistic determination that assigns to each reference word a replacement word recognition probability. The probabilistic determination is carried out on the basis of a pre-stored confusion matrix that stores a plurality of probability values. The confusion matrix is used to associate each character of recognized word in the electronic document with a corresponding character of a word in the original document on the basis of these probability values.

63 citations

Proceedings ArticleDOI
21 May 2015
TL;DR: The proposed OCR system was evaluated on the off-line handwritten Bangla numeral database CMATERdb 3.1, and achieved an excellent accuracy of 96:7% character recognition rate.
Abstract: Local Binary Pattern (LBP) is a simple yet robust texture descriptor that has been widely used in many computer vision applications including face recognition. In this paper, we exploit LBP for handwritten Bangla numeral recognition. We classify Bangla digits from their LBP histograms using K Nearest Neighbors (KNN) classifier. The performance of three different variations of LBP - the basic LBP, the uniform LBP and the simplified LBP was investigated. The proposed OCR system was evaluated on the off-line handwritten Bangla numeral database CMATERdb 3.1.1, and achieved an excellent accuracy of 96:7% character recognition rate.

63 citations

Patent
11 Sep 2000
TL;DR: A speech recognition apparatus includes a speech input device, a storage device that stores a recognition word indicating a pronunciation of a word to undergo speech recognition, and a speech recognition processing device that performs speech recognition by comparing audio data obtained through the voice input device and speech recognition data created in correspondence to the recognition word as discussed by the authors.
Abstract: A speech recognition apparatus includes: a speech input device; a storage device that stores a recognition word indicating a pronunciation of a word to undergo speech recognition; and a speech recognition processing device that performs speech recognition processing by comparing audio data obtained through the voice input device and speech recognition data created in correspondence to the recognition word, and the storage device stores both a first recognition word corresponding to a pronunciation of an entirety of the word to undergo speech recognition and a second recognition word corresponding to a pronunciation of only a starting portion of a predetermined length of the entirety of the word to undergo speech recognition as recognition words for the word to undergo speech recognition.

62 citations

Journal ArticleDOI
TL;DR: A segmentation-free approach to OCR is presented as part of a knowledge-based word interpretation model based on the recognition of subgraphs homeomorphic to previously defined prototypes of characters based on a variant of the notion of relative neighborhood used in computational perception.
Abstract: A segmentation-free approach to OCR is presented as part of a knowledge-based word interpretation model. It is based on the recognition of subgraphs homeomorphic to previously defined prototypes of characters. Gaps are identified as potential parts of characters by implementing a variant of the notion of relative neighborhood used in computational perception. Each subgraph of strokes that matches a previously defined character prototype is recognized anywhere in the word even if it corresponds to a broken character or to a character touching another one. The characters are detected in the order defined by the matching quality. Each subgraph that is recognized is introduced as a node in a directed net that compiles different alternatives of interpretation of the features in the feature graph. A path in the net represents a consistent succession of characters. A final search for the optimal path under certain criteria gives the best interpretation of the word features. Broken characters are recognized by looking for gaps between features that may be interpreted as part of a character. Touching characters are recognized because the matching allows nonmatched adjacent strokes. The recognition results for over 24,000 printed numeral characters belonging to a USPS database and on some hand-printed words confirmed the method's high robustness level. >

62 citations

Proceedings Article
01 Jan 1999
TL;DR: A new approach to penalize word hypotheses that are inconsistent with prosodic features such as duration and pitch is investigated, and the language model is modified to represent hidden events such as sentence boundaries and various forms of disfluency.
Abstract: We investigate a new approach for using speech prosody as a knowledge source for speech recognition. The idea is to penalize word hypotheses that are inconsistent with prosodic features such as duration and pitch. To model the interaction between words and prosody we modify the language model to represent hidden events such as sentence boundaries and various forms of disfluency, and combine with it decision trees that predict such events from prosodic features. N-best rescoring experiments on the Switchboard corpus show a small but consistent reduction of word error as a result of this modeling. We conclude with a preliminary analysis of the types of errors that are corrected by the prosodically informed model.

62 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
86% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
84% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Object detection
46.1K papers, 1.3M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202314
202241
20201
20192
20189
201751