scispace - formally typeset
Search or ask a question
Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.


Papers
More filters
Proceedings ArticleDOI
08 May 2017
TL;DR: A method for analyzing floor plan images using wall segmentation, object detection, and optical character recognition, and fully convolutional networks (FCN) is introduced and applications in automatic 3D model building and interactive furniture fitting are shown.
Abstract: This paper introduces a method for analyzing floor plan images using wall segmentation, object detection, and optical character recognition. We introduce a challenging new real-estate floor plan dataset, R-FP, evaluate different wall segmentation methods, and propose fully convolutional networks (FCN) for this task. We explore architectures with different pixel-stride values and more compact ones with skipped pooling layers. An FCN-2s with a 2-pixel stride layer achieves state-of-the-art performance, obtaining a mean Intersection-over-Union score of 89.9% on R-FP, and 94.4% on the public CVC-FP data set. Using OCR and object detection, we estimate room sizes. Finally, we show applications in automatic 3D model building and interactive furniture fitting.

64 citations

Journal ArticleDOI
TL;DR: It is shown that in order to improve classification results obtained with single classifiers, it is necessary to combine several sources of information either at the level of feature extraction/description, or at the classification stage, orat both levels.
Abstract: In this paper, we present a review of the state of the art in the current classification techniques used in the optical character recognition of the Arabic script (AOCR). We consider multiple sources of information-based hybrid approaches and multiple classifiers. We show that in order to improve classification results obtained with single classifiers, it is necessary to combine several sources of information either at the level of feature extraction/description, or at the classification stage, or at both levels. We provide a qualitative comparison and discuss the strengths and weaknesses of these approaches.

64 citations

Patent
Aaron Michael Burry1, Peter Paul1
14 Dec 2010
TL;DR: In this article, an automated license plate recognition (ALPR) system and method using a human-in-the-loop based adaptive learning approach is presented. But the method is limited to the recognition of vehicles.
Abstract: An automated license plate recognition (ALPR) system and method using a human-in-the-loop based adaptive learning approach. One or more images with respect to an automotive vehicle can be segmented in order to determine a license plate of the automotive vehicle within a scene. An optical character recognition (OCR) engine loaded with an OCR algorithm can be further adapted to determine a character sequence of the license plate based on a training data set. A confidence level with respect to the images can be generated in order to route a low confidence image to an operator for obtaining a human interpreted image. The parameters with respect to the OCR algorithm can be adjusted based on the human interpreted image and the actual image of the license plate. A license plate design can be then incorporated into the OCR engine in order to automate the process of recognizing the license plate with respect to the automotive vehicle in a wide range of transportation related applications.

63 citations

Journal ArticleDOI
TL;DR: Handwritten Chinese characters can be recognized by first extracting the basic shapes (radicals) of which they are composed by using nonlinear active shape models and optimal parameters found using the chamfer distance transform and a dynamic tunneling algorithm.
Abstract: Handwritten Chinese characters can be recognized by first extracting the basic shapes (radicals) of which they are composed. Radicals are described by nonlinear active shape models and optimal parameters found using the chamfer distance transform and a dynamic tunneling algorithm. The radical recognition rate is 96.5 percent correct (writer-independent) on 280,000 characters containing 98 radical classes.

63 citations

Journal ArticleDOI
TL;DR: In this article, the authors proposed a novel benchmark composed of a dataset designed to focus specifically on the character segmentation step of the ALPR within an evaluation protocol, which is composed of 2,000 Brazilian license plates consisting of 14,000 alphanumeric symbols and their corresponding bounding box annotations.
Abstract: Automatic License Plate Recognition (ALPR) has been the focus of many researches in the past years. In general, ALPR is divided into the following problems: detection of on-track vehicles, license plates detection, segmention of license plate characters and optical character recognition (OCR). Even though commercial solutions are available for controlled acquisition conditions, e.g., the entrance of a parking lot, ALPR is still an open problem when dealing with data acquired from uncontrolled environments, such as roads and highways when relying only on imaging sensors. Due to the multiple orientations and scales of the license plates captured by the camera, a very challenging task of the ALPR is the License Plate Character Segmentation (LPCS) step, which effectiveness is required to be (near) optimal to achieve a high recognition rate by the OCR. To tackle the LPCS problem, this work proposes a novel benchmark composed of a dataset designed to focus specifically on the character segmentation step of the ALPR within an evaluation protocol. Furthermore, we propose the Jaccard-Centroid coefficient, a new evaluation measure more suitable than the Jaccard coefficient regarding the location of the bounding box within the ground-truth annotation. The dataset is composed of 2,000 Brazilian license plates consisting of 14,000 alphanumeric symbols and their corresponding bounding box annotations. We also present a new straightforward approach to perform LPCS efficiently. Finally, we provide an experimental evaluation for the dataset based on four LPCS approaches and demonstrate the importance of character segmentation for achieving an accurate OCR.

63 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
85% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Deep learning
79.8K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023186
2022425
2021333
2020448
2019430
2018357