scispace - formally typeset
Search or ask a question
Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.


Papers
More filters
Proceedings ArticleDOI
Wei Qi1, Lie Gu, Hao Jiang1, Xiang-Rong Chen, Hong-Jiang Zhang 
10 Sep 2000
TL;DR: Two advanced video browsers for home users are developed: intelligent highlight player and HTML-based video browser that perform automated categorization of news stories based on the texts obtained from close caption or video OCR process.
Abstract: We present a system developed for content-based broadcast news video browsing for home users. There are three main factors that distinguish our work from other similar ones. First, we have integrated the image and audio analysis results in identifying news segments. Second, we use the video OCR technology to detect text from frames, which provides a good source of textual information for story classification when transcripts and close captions are not available. Finally, natural language processing (NLP) technologies are used to perform automated categorization of news stories based on the texts obtained from close caption or video OCR process. Based on these video structure and content analysis technologies, we have developed two advanced video browsers for home users: intelligent highlight player and HTML-based video browser.

100 citations

Journal ArticleDOI
TL;DR: A method for the automatic localization of text embedded in complex images permits to detect the spatial position and the skew of the text lines which are present in the scene and to return a binary representation of each text line.

100 citations

Journal ArticleDOI
Shuichi Tsujimoto1, Haruo Asada1
01 Jul 1992
TL;DR: Experiments have proved that the proposed approaches to document analysis and document understanding are robust even for multicolumned and multiarticle documents containing graphics and photographs, and thatThe proposed character segmentation/recognition method is robust enough to cope with omnifont characters which frequently touch each other.
Abstract: The document image processes used in a recently developed text reading system are described. The system consists of three major components: document analysis, document understanding, and character segmentation/recognition. The document analysis component extracts lines of text from a page for recognition. The document understanding component extracts logical relationships between the document constituents. The character segmentation/recognition component extracts characters from a text line and recognizes them. Experiments on more than a hundred documents have proved that the proposed approaches to document analysis and document understanding are robust even for multicolumned and multiarticle documents containing graphics and photographs, and that the proposed character segmentation/recognition method is robust enough to cope with omnifont characters which frequently touch each other. >

100 citations

Journal ArticleDOI
TL;DR: By recovering a drawing order of a handwritten script, the temporal information can be recovered from a static 2D image and this method will be used as a bridge from the offline handwriting character recognition problem to the online one.
Abstract: Describes a method to recover a drawing order of a handwritten script from a static 2D image. The script should be written in a single stroke and may include double-traced lines. After the script is scanned in and preprocessed, we apply our recovery method which consists of two phases. In the first phase, we globally analyze the graph constructed from the skeletal image and label the graph by determining the types of each edge. In the second phase, we trace the graph from the start vertex to the end vertex using the labeling information. This method does not enumerate the possible cases, for example, by solving the traveling salesman problem and, therefore, does not cause a combinatorial explosion even if the script is very complex. By recovering a drawing order of a handwritten script, the temporal information can be recovered from a static 2D image. Hence, this method will be used as a bridge from the offline handwriting character recognition problem to the online one.

99 citations

Patent
01 Apr 2005
TL;DR: In this article, a portable reading machine that operates in several modes and performs image preprocessing to prior to optical character recognition is presented, where the reading machine receives a low resolution image and a high resolution image of a scene and processing the low-resolution image to recognize a user-initiated gesture using a gesturing item.
Abstract: A portable reading machine that operates in several modes and performs image preprocessing to prior to optical character recognition. The portable reading machine receives a low resolution image and a high resolution image of a scene and processing the low resolution image to recognize a user-initiated gesture using a gesturing item that indicates a command from the user to the reading machine and the high resolution image to recognize text in the image of the scene, according to the command from the user to the machine.

98 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
85% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Deep learning
79.8K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023186
2022425
2021333
2020448
2019430
2018357