Journal Article

Intelligent forms processing

R. G. Casey, +1 more
02 Aug 1990, Vol. 29, Iss. 3, pp. 435-450
Abstract
The automatic reading of optically scanned forms consists of two major components: extraction of the data image from the form and interpretation of the image as coded alphanumerics. The second component is also known as optical character recognition, or OCR. We have implemented a method for entry of a wide variety of forms that contain machine-printed data and that are often produced in business environments. The function, called Intelligent Forms Processing (IFP), accepts conventional forms that call for information to be printed in designated blank areas, but in which the information may exceed boundaries due to poor registration during printing. The human eye easily accommodates data that impinge on form boundaries or on background text; however, giving a machine the same powers of discrimination poses a technical challenge. The IFP system uses a setup phase to create a model of each form that is to be read. Scanned forms containing data are compared against the matching form model. Special algorithms are employed to extract data fields while removing background printing (e.g., form lines) that intersects the data. The extracted data images are interpreted by an OCR process that reads typical monospace fonts. New fonts may be added easily in a separate design mode. If the data are alphabetic, a lexicon may be assembled to define the possible entries.
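
The abstract does not describe the extraction algorithms themselves, so the following is only a minimal sketch of one generic way to remove a form line while keeping data strokes that cross it. It is not the IFP method. The function name `remove_horizontal_lines` and the `min_run` parameter are hypothetical, the image is assumed to be a binary raster stored as nested lists (1 = black), and form lines are assumed to be horizontal and at most two pixels thick.

```python
from typing import List

Image = List[List[int]]  # 1 = black pixel, 0 = white pixel


def remove_horizontal_lines(img: Image, min_run: int) -> Image:
    """Return a copy of img with long horizontal black runs erased.

    A run of at least min_run black pixels is treated as a form line
    (an assumption for this sketch). A pixel on such a run is kept only
    if it has black neighbours both directly above and directly below,
    which suggests that a data stroke crosses the line at that column.
    """
    height = len(img)
    width = len(img[0]) if height else 0
    out = [row[:] for row in img]  # leave the input untouched

    for y in range(height):
        x = 0
        while x < width:
            if img[y][x] == 0:
                x += 1
                continue
            # Measure the horizontal run of black pixels starting at x.
            start = x
            while x < width and img[y][x] == 1:
                x += 1
            if x - start < min_run:
                continue  # too short to be a form line; keep as data
            for cx in range(start, x):
                above = y > 0 and img[y - 1][cx] == 1
                below = y + 1 < height and img[y + 1][cx] == 1
                if not (above and below):
                    out[y][cx] = 0  # line pixel with no crossing stroke
    return out


# A vertical stroke (column 3) crossing a form line (row 2).
field = [
    [0, 0, 0, 1, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 1, 1],
    [0, 0, 0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]
cleaned = remove_horizontal_lines(field, min_run=6)
```

In this toy field the line in row 2 is erased except at column 3, where the vertical stroke passes through it, so the stroke stays connected. A real system would also need to handle skew, thicker lines, and strokes that touch the line from one side only.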


Citations
Patent

Financial transaction processing systems and methods

TL;DR: An optically scanned image (34, 208) is generated of at least a portion of a document containing visual data, in a particular format, representing information related to the financial transaction.
Patent

Advanced data capture architecture data processing system and method for scanned images of document forms

TL;DR: An advanced data capture architecture is proposed that enables the free definition and redefinition of the format of document forms without requiring any reprogramming of the data processors that capture and use the data on the completed forms.
Patent

Payment identification code and payment system using the same

TL;DR: A method is proposed for effecting electronic payment that safeguards banking and account information while using existing payment systems. The method comprises generating a system routing number and a payment identification code (PIC) relating to the beneficiary's account information, and distributing payment identification codes to the existing payment system.
Proceedings Article

A retargetable table reader

TL;DR: Describes the architecture of a system for reading machine-printed documents in known, predefined tabular-data layout styles, algorithms for identifying and segmenting records with known layout, and the integration of these algorithms with a graphical user interface (GUI) for defining new layouts.
Journal Article

Integrating knowledge sources in Devanagari text recognition system

TL;DR: The reading process has been widely studied, and there is general agreement among researchers that knowledge in different forms and at different levels plays a vital role. This is the underlying philosophy of the Devanagari document recognition system described in this work.
References
Journal Article

A processor-based OCR system

TL;DR: A previously developed classification technique, based on decision trees, has been extended in order to improve reading accuracy in an environment of considerable character variation, including the possibility that documents in the same font style may be produced using quite different print technologies.
Journal Article

The extraction of line-structured data from engineering drawings

TL;DR: Progress towards generating a representation of the drawing as a set of line segments and interpreted characters is described, within an overall strategy for planning the sampling of the image and the application of analysis algorithms.
Journal Article

Experience gained in implementing ImagePlus

TL;DR: The experience gained from identifying, selecting, and preparing several areas within IBM for an ImagePlus system is discussed, along with the role of ImagePlus as an application enabler.