scispace - formally typeset
Search or ask a question
Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.


Papers
More filters
Proceedings ArticleDOI
23 Mar 2011
TL;DR: This paper proposes a system for recognition of offline handwritten Malayalam vowels using Chain code and Image Centroid for the purpose of extracting features and a two layer feed forward network with scaled conjugate gradient for classification.
Abstract: Optical Character Recognition plays an important role in Digital Image Processing and Pattern Recognition. Even though ambient study had been performed on foreign languages like Chinese and Japanese, effort on Indian script is still immature. OCR in Malayalam language is more complex as it is enriched with largest number of characters among all Indian languages. The challenge of recognition of characters is even high in handwritten domain, due to the varying writing style of each individual. In this paper we propose a system for recognition of offline handwritten Malayalam vowels. The proposed method uses Chain code and Image Centroid for the purpose of extracting features and a two layer feed forward network with scaled conjugate gradient for classification.

30 citations

Book ChapterDOI
04 Nov 1998
TL;DR: A novel method for recognizing character strings, based on a lexical search approach, that character segmentation and character classification work as subfunctions of the search and improves the recognition accuracy by omitting useless candidates of character classification and changing the criterion of rejection dynamically.
Abstract: A novel method for recognizing character strings, based on a lexical search approach, is presented. In this method, a character string is recognized by searching for a sequence of segmented patterns that fits a string in a lexicon. A remarkable characteristic of this method is that character segmentation and character classification work as subfunctions of the search. The lexical search approach enables the parameters of character classifier to adapt to each segmented pattern. As a result, it improves the recognition accuracy by omitting useless candidates of character classification and by changing the criterion of rejection dynamically. Moreover, the processing time is drastically reduced by using minimum sets of categories for each segmented pattern. The validity of the developed method is shown by the experimental results using a lexicon including 44,700 character strings.

30 citations

Journal ArticleDOI
TL;DR: A comprehensive review of ACR techniques and evaluate the status of the ACR system development and an up to date bibliography is presented.
Abstract: An optical character recognition (OCR) system may provide a solution to the data entry problems, a bottleneck for the data processing industry. Therefore, OCR systems are being developed for almost all major languages and Arabic language is no exception to it. During the past three decades, considerable research and development works have been done towards the development of an efficient Arabic optical character recognition (ACR) system. In this paper we present a comprehensive review of ACR techniques and evaluate the status of the ACR system development and an up to date bibliography.

30 citations

Proceedings ArticleDOI
01 Sep 2001
TL;DR: The construction of a model for off-line word recognizers based on over-segmentation of the input image and recognition of segment combinations as characters in a given lexicon word is described.
Abstract: We describe the construction of a model for off-line word recognizers based on over-segmentation of the input image and recognition of segment combinations as characters in a given lexicon word. One such recognizer, the Word Model Recognizer (WMR), is used extensively. Based on the proposed model it was possible to improve the performance of WMR.

30 citations

Proceedings Article
01 Aug 2018
TL;DR: Past achievements and future horizons for Indigenous language transliteration, text prediction, spell-checking, approximate search, machine translation, speech recognition, speaker diarization, speech synthesis, optical character recognition, and computer-aided language learning are considered.
Abstract: In this article, we discuss which text, speech, and image technologies have been developed, and would be feasible to develop, for the approximately 60 Indigenous languages spoken in Canada. In particular, we concentrate on technologies that may be feasible to develop for most or all of these languages, not just those that may be feasible for the few most-resourced of these. We assess past achievements and consider future horizons for Indigenous language transliteration, text prediction, spell-checking, approximate search, machine translation, speech recognition, speaker diarization, speech synthesis, optical character recognition, and computer-aided language learning.

30 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
85% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Deep learning
79.8K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023186
2022425
2021333
2020448
2019430
2018357