scispace - formally typeset
Search or ask a question
Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.


Papers
More filters
Proceedings ArticleDOI
11 Dec 2000
TL;DR: Presents a learning-based approach for the construction of a license-plate recognition system that has shown the following performances on average: car detection rate 100%, segmentation rate 97.5%, and character recognition rate about 97.2%.
Abstract: Presents a learning-based approach for the construction of a license-plate recognition system. The system consists of three modules. They are, respectively, the car detection module, the license-plate segmentation module and the recognition module. The car detection module detects a car in a given image sequence obtained from a camera with a simple color-based approach. The segmentation module extracts the license plate in the detected car image using neural networks as filters for analyzing the color and texture properties of the license plate. The recognition module then reads the characters on the detected license plate with a support vector machine (SVM)-based character recognizer. The system has been tested with 1000 video sequences obtained from toll-gates, parking lots, etc., and has shown the following performances on average: car detection rate 100%, segmentation rate 97.5%, and character recognition rate about 97.2%.

222 citations

Journal ArticleDOI
TL;DR: Preprocessing, feature extraction and postprocessing techniques for commercial reading machines for optical character recognition (OCR) and problems related to handwritten and printed character recognition are pointed out.
Abstract: In order to highlight the interesting problems and actual results on the state of the art in optical character recognition (OCR), this paper describes and compares preprocessing, feature extraction and postprocessing techniques for commercial reading machines. Problems related to handwritten and printed character recognition are pointed out, and the functions and operations of the major components of an OCR system are described. Historical background on the development of character recognition is briefly given and the working of an optical scanner is explained. The specifications of several recognition systems that are commercially available are reported and compared.

221 citations

Proceedings ArticleDOI
31 Aug 2005
TL;DR: This paper describes the Arabic handwriting recognition competition held at ICDAR 2007, again uses the IFN/ENIT-database with Arabic handwritten Tunisian town names, and 8 groups with 14 systems are participating in the competition.
Abstract: This paper describes the Arabic handwriting recognition competition held at ICDAR 2007. This second competition (the first was at ICDAR 2005) again uses the IFN/ENIT-database with Arabic handwritten Tunisian town names. Today, more than 54 research groups from universities, research centers, and industry are working with this database worldwide. This year, 8 groups with 14 systems are participating in the competition. The systems were tested on known data and on two datasets which are unknown to the participants. The systems are compared on the most important characteristic, the recognition rate. Additionally, the relative speed of the different systems were compared. A short description of the participating groups, their systems, and the results achieved are finally presented.

220 citations

Journal ArticleDOI
TL;DR: This paper proposes a recognition method which is able to account for a variety of distortions due to eccentric handwriting, and tested its method on two worldwide standard databases of isolated numerals, namely CEDAR and NIST.
Abstract: This paper presents a new approach to off-line, handwritten numeral recognition. From the concept of perturbation due to writing habits and instruments, we propose a recognition method which is able to account for a variety of distortions due to eccentric handwriting. We tested our method on two worldwide standard databases of isolated numerals, namely CEDAR and NIST, and obtained 99.09 percent and 99.54 percent correct recognition rates at no-rejection level respectively. The latter result was obtained by testing on more than 170000 numerals.

219 citations

Journal ArticleDOI
TL;DR: A texture feature based thresholding algorithm that is appreciably better than those obtained by typical existing thresholding techniques for document images with poor contrast, strong noise, complex patterns, and/or variable modalities in gray-scale histograms is developed.
Abstract: Binarization has been difficult for document images with poor contrast, strong noise, complex patterns, and/or variable modalities in gray-scale histograms. We developed a texture feature based thresholding algorithm to address this problem. Our algorithm consists of three steps: 1) candidate thresholds are produced through iterative use of Otsu's algorithm (1978); 2) texture features associated with each candidate threshold are extracted from the run-length histogram of the accordingly binarized image; 3) the optimal threshold is selected so that desirable document texture features are preserved. Experiments with 9,000 machine printed address blocks from an unconstrained US mail stream demonstrated that over 99.6 percent of the images were successfully binarized by the new thresholding method, appreciably better than those obtained by typical existing thresholding techniques. Also, a system run with 500 troublesome mail address blocks showed that an 8.1 percent higher character recognition rate was achieved with our algorithm as compared with Otsu's algorithm.

218 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
85% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Deep learning
79.8K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023186
2022425
2021333
2020448
2019430
2018357