scispace - formally typeset
Search or ask a question
Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.


Papers
More filters
Proceedings ArticleDOI
Richard G. Casey1, Stephen K. Boyer1, P. Healey1, Alex Miller1, B. Oudot1, K. Zilles1 
20 Oct 1993
TL;DR: A prototype system for encoding chemical structure diagrams from scanned printed documents is described, and the final coded output interfaces to conventional chemistry software for database storage and retrieval, publishing, and modeling.
Abstract: A prototype system for encoding chemical structure diagrams from scanned printed documents is described. The system distinguishes a structure diagram from other printed material on a page image using size and spacing characteristics. It distinguishes line graphics from symbols in an intermediate vectorization stage. Line information is mapped into a connection diagram that represents atomic bonds. Atomic symbols are identified by means of chemical drawing conventions and optical character recognition. The final coded output interfaces to conventional chemistry software for database storage and retrieval, publishing, and modeling. >

40 citations

Proceedings ArticleDOI
01 Jan 2002
TL;DR: A very general, theoretically optimal model is applied to the problem of OCR word correction, practical methods for parameter estimation are introduced, and performance on real data is evaluated.
Abstract: In this paper, we take a pattern recognition approach to correcting errors in text generated from printed documents using optical character recognition (OCR). We apply a very general, theoretically optimal model to the problem of OCR word correction, introduce practical methods for parameter estimation, and evaluate performance on real data.

40 citations

Patent
Dar-Shyang Lee1, Lee-Feng Chien1, Aries Hsieh1, Pin Ting1, Kin Wong1 
06 Oct 2010
TL;DR: In this article, an image is displayed on a touch screen and a user's underline gesture on the displayed image is detected and a text region including the text is identified in the surrounding region and cropped from the image.
Abstract: An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.

40 citations

Posted Content
TL;DR: It is demonstrated that state-of-the-art optical character recognition (OCR) based on deep learning is vulnerable to adversarial images.
Abstract: We demonstrate that state-of-the-art optical character recognition (OCR) based on deep learning is vulnerable to adversarial images. Minor modifications to images of printed text, which do not change the meaning of the text to a human reader, cause the OCR system to "recognize" a different text where certain words chosen by the adversary are replaced by their semantic opposites. This completely changes the meaning of the output produced by the OCR system and by the NLP applications that use OCR for preprocessing their inputs.

40 citations

Proceedings Article
27 Jul 2011
TL;DR: A re-scoring strategy is proposed that makes it feasible to capture more long-distance dependencies in the natural language and a hill climbing method (iterative decoding) is proposed to search over islands of confusability in the word lattice.
Abstract: A re-scoring strategy is proposed that makes it feasible to capture more long-distance dependencies in the natural language. Two pass strategies have become popular in a number of recognition tasks such as ASR (automatic speech recognition), MT (machine translation) and OCR (optical character recognition). The first pass typically applies a weak language model (n-grams) to a lattice and the second pass applies a stronger language model to N best lists. The stronger language model is intended to capture more long-distance dependencies. The proposed method uses RNN-LM (recurrent neural network language model), which is a long span LM, to re-score word lattices in the second pass. A hill climbing method (iterative decoding) is proposed to search over islands of confusability in the word lattice. An evaluation based on Broadcast News shows speedups of 20 over basic N best re-scoring, and word error rate reduction of 8% (relative) on a highly competitive setup.

40 citations


Network Information
Related Topics (5)
Feature extraction
111.8K papers, 2.1M citations
87% related
Feature (computer vision)
128.2K papers, 1.7M citations
85% related
Image segmentation
79.6K papers, 1.8M citations
85% related
Convolutional neural network
74.7K papers, 2M citations
84% related
Deep learning
79.8K papers, 2.1M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023186
2022425
2021333
2020448
2019430
2018357