Topic

Optical character recognition

About: Optical character recognition is a research topic. Over its lifetime, 7,342 publications have been published on this topic, receiving 158,193 citations. The topic is also known as: OCR & optical character reader.


Papers
Journal ArticleDOI
01 Jul 1992
TL;DR: An intelligent forms processing system (IFPS) is presented that automatically indexes form documents for storage in and retrieval from a document library and captures information from scanned form images using intelligent character recognition (ICR).
Abstract: This paper describes an intelligent forms processing system (IFPS) which provides capabilities for automatically indexing form documents for storage in and retrieval from a document library and for capturing information from scanned form images using intelligent character recognition (ICR). The system also provides capabilities for efficiently storing form images. IFPS consists of five major processing components:
(1) An interactive document analysis stage that analyzes a blank form in order to define a model of each type of form accepted by the system; the parameters of each model are stored in a form library.
(2) A form recognition module that collects features of an input form in order to match it against one represented in the form library; the primary features used in this step are the pattern of lines defining data areas on the form.
(3) A data extraction component that registers the selected model to the input form, locates data added to the form in fields of interest, and removes the data image to a separate image area. A simple mask defining the center of the data region suffices to initiate the extraction process; search routines are invoked to track data that extends beyond the masks. Other special processing detects lines that intersect the data image and deletes those lines with minimum distortion to the rest of the image.
(4) An ICR unit that converts the extracted image data to symbol code for input to a database or other conventional processing systems. Three types of ICR logic have been implemented to accommodate monospaced typing, proportionally spaced machine text, and handprinted alphanumerics.
(5) A forms dropout module that removes the fixed part of a form and retains only the filled-in data for storage. The stored data can later be combined with the fixed form to reconstruct the original form. This provides extremely efficient storage of form images, making it possible to store a very large number of forms in the system.
IFPS is implemented as part of a larger image management system called Image and Records Management (IRM). It is being applied to forms data management in several state government applications.

102 citations
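
The forms-dropout step described in the abstract above (component 5) amounts to subtracting the fixed form background from the filled-in scan. Below is a minimal Python sketch of that general idea, assuming binarizable grayscale images that have already been registered to each other; the function name and threshold are illustrative, not taken from the paper.

```python
# Sketch of the forms-dropout idea (not the IFPS implementation):
# keep only the pixels added by the writer, drop the fixed form background.
import numpy as np

def forms_dropout(blank_form: np.ndarray, filled_form: np.ndarray,
                  threshold: int = 128) -> np.ndarray:
    """Return a binary image containing only the filled-in data.

    Both inputs are assumed to be grayscale uint8 arrays of the same shape,
    already registered (aligned) to each other.
    """
    blank_ink = blank_form < threshold    # ink of the fixed form (lines, labels)
    filled_ink = filled_form < threshold  # fixed form plus user-entered data
    # Data pixels are ink in the filled scan that is absent from the blank template.
    data_only = filled_ink & ~blank_ink
    return (data_only * 255).astype(np.uint8)

# The original can later be approximated by overlaying the stored data image
# back onto the blank form template, as the abstract describes.
```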

Journal ArticleDOI
TL;DR: A new technique is proposed that removes perspective distortion and recovers the fronto-parallel view of text from a single image; the recovery is carried out using character stroke boundaries and tip points extracted with multiple fuzzy sets and morphological operators.

102 citations
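
Once the pose of a distorted text block has been estimated, recovering the fronto-parallel view is typically a planar homography warp. The sketch below shows that generic rectification step in Python with OpenCV, assuming the four corners of the text region are already known; it does not reproduce the paper's fuzzy-set and morphological estimation of stroke boundaries and tip points.

```python
# Generic fronto-parallel rectification via a homography (assumed corner input).
import cv2
import numpy as np

def rectify_text_region(image: np.ndarray, corners: np.ndarray,
                        out_w: int = 800, out_h: int = 200) -> np.ndarray:
    """Warp a perspectively distorted text region to a fronto-parallel view.

    `corners` is a (4, 2) array ordered top-left, top-right, bottom-right,
    bottom-left, in image coordinates.
    """
    target = np.float32([[0, 0], [out_w - 1, 0],
                         [out_w - 1, out_h - 1], [0, out_h - 1]])
    H = cv2.getPerspectiveTransform(np.float32(corners), target)
    return cv2.warpPerspective(image, H, (out_w, out_h))
```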

Journal Article
TL;DR: This paper presents several approaches that enable large-scale information retrieval in the TELLTALE system and compares different query methods, such as tf.idf and incremental similarity, against the original centroid-subtraction technique.
Abstract: Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches that enable large-scale information retrieval for the TELLTALE system. TELLTALE is an information retrieval environment that provides full-text search for text corpora that may be garbled by OCR (optical character recognition) or transmission errors, and that may contain multiple languages. It can find similar documents against a 1 kB query from 1 GB of text data in 45 seconds. This remarkable performance is achieved by integrating new data structures and gamma compression into the TELLTALE framework. This paper also compares several different types of query methods such as tf.idf and incremental similarity to the original technique of centroid subtraction. The new similarity techniques give better performance but less accuracy.

101 citations
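
As a point of reference for the query methods compared in the abstract, the sketch below shows a plain tf.idf ranking with cosine similarity using scikit-learn. It is a generic illustration, not TELLTALE's n-gram or centroid-subtraction machinery, and the toy corpus and query are invented for the example.

```python
# Generic tf.idf ranking with cosine similarity (illustrative corpus and query).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

corpus = [
    "optical character recognition of scanned forms",
    "full text retrieval over noisy OCR output",
    "gamma compression for inverted index postings",
]
query = "retrieval of OCR text"

vectorizer = TfidfVectorizer()
doc_vectors = vectorizer.fit_transform(corpus)   # one tf.idf vector per document
query_vector = vectorizer.transform([query])

# Rank documents by cosine similarity to the query.
scores = cosine_similarity(query_vector, doc_vectors).ravel()
for idx in scores.argsort()[::-1]:
    print(f"{scores[idx]:.3f}  {corpus[idx]}")
```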

Journal ArticleDOI
TL;DR: A neural network approach is introduced to perform high-accuracy recognition of multi-size and multi-font characters; a novel centroid-dithering training process combined with a low noise-sensitivity normalization procedure is used to achieve these results.
Abstract: Optical character recognition (OCR) refers to a process whereby printed documents are transformed into ASCII files for the purpose of compact storage, editing, fast retrieval, and other file manipulations through the use of a computer. The recognition stage of an OCR process is made difficult by added noise, image distortion, and the various character typefaces, sizes, and fonts that a document may have. In this study a neural network approach is introduced to perform high accuracy recognition on multi-size and multi-font characters; a novel centroid-dithering training process with a low noise-sensitivity normalization procedure is used to achieve high accuracy results. The study consists of two parts. The first part focuses on single size and single font characters, and a two-layered neural network is trained to recognize the full set of 94 ASCII character images in 12-pt Courier font. The second part trades accuracy for additional font and size capability, and a larger two-layered neural network is trained to recognize the full set of 94 ASCII character images for all point sizes from 8 to 32 and for 12 commonly used fonts. The performance of these two networks is evaluated based on a database of more than one million character images from the testing data set.

101 citations
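
The centroid-dithering idea in the abstract above can be read as centering each character image on its ink centroid and then generating training copies with small centroid jitters. The sketch below is one interpretation of that augmentation in Python with NumPy; the function names, shift range, and copy count are assumptions, not the paper's exact procedure.

```python
# One possible reading of centroid normalization plus centroid-dithering
# augmentation for binary character images (not the paper's exact procedure).
import numpy as np

def center_on_centroid(img: np.ndarray) -> np.ndarray:
    """Shift a binary character image so its ink centroid sits at the center."""
    ys, xs = np.nonzero(img)
    if ys.size == 0:
        return img  # blank image: nothing to center
    dy = int(round(img.shape[0] / 2 - ys.mean()))
    dx = int(round(img.shape[1] / 2 - xs.mean()))
    return np.roll(np.roll(img, dy, axis=0), dx, axis=1)

def centroid_dither(img: np.ndarray, max_shift: int = 1, copies: int = 4,
                    rng=None) -> list:
    """Generate training copies with the centroid jittered by +/- max_shift pixels."""
    rng = rng or np.random.default_rng()
    centered = center_on_centroid(img)
    out = [centered]
    for _ in range(copies):
        dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
        out.append(np.roll(np.roll(centered, dy, axis=0), dx, axis=1))
    return out
```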

Proceedings ArticleDOI
04 Feb 2013
TL;DR: A generic Optical Character Recognition system for Arabic script languages called Nabocr is presented; it is initially trained to recognize both Urdu Nastaleeq and Arabic Naskh fonts, but it can be trained by users for other Arabic script languages.
Abstract: In this paper, we present a generic Optical Character Recognition system for Arabic script languages called Nabocr. Nabocr uses OCR approaches specific to Arabic script recognition. Performing recognition on Arabic script text is relatively more difficult than on Latin text because Arabic script is cursive and context sensitive. Moreover, Arabic script has different writing styles that vary in complexity. Nabocr is initially trained to recognize both Urdu Nastaleeq and Arabic Naskh fonts; however, it can be trained by users for other Arabic script languages. We have evaluated our system's performance for both Urdu and Arabic. In order to evaluate Urdu recognition, we have generated a dataset of Urdu text called UPTI (Urdu Printed Text Image Database) designed to measure different aspects of a recognition system. The performance of our system on clean Urdu text is 91%; on clean Arabic text it is 86%. Moreover, we have compared our system against Tesseract's newly released Arabic recognition, and the performance of both systems on clean images is almost the same.

101 citations
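
Accuracy figures like the 91% and 86% quoted above are usually character-level scores derived from an edit distance between the recognized text and a ground-truth transcription. The sketch below shows that standard computation in Python; it is a generic metric, not code from the Nabocr paper.

```python
# Character accuracy as 1 - CER, with CER = edit distance / reference length.
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between reference and hypothesis strings."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,         # deletion
                            curr[j - 1] + 1,     # insertion
                            prev[j - 1] + cost)) # substitution
        prev = curr
    return prev[-1]

def char_accuracy(ref: str, hyp: str) -> float:
    """Character accuracy, clipped at zero for very noisy output."""
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - edit_distance(ref, hyp) / len(ref))

print(char_accuracy("optical character", "optical charocter"))  # ~0.94
```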


Network Information
Related Topics (5)
Feature extraction: 111.8K papers, 2.1M citations (87% related)
Feature (computer vision): 128.2K papers, 1.7M citations (85% related)
Image segmentation: 79.6K papers, 1.8M citations (85% related)
Convolutional neural network: 74.7K papers, 2M citations (84% related)
Deep learning: 79.8K papers, 2.1M citations (83% related)
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    186
2022    425
2021    333
2020    448
2019    430
2018    357