Author

Richard G. Casey

Other affiliations: Cisco Systems, Inc.
Bio: Richard G. Casey is an academic researcher from IBM. The author has contributed to research in topics: Tree (data structure) & Character (mathematics). The author has an h-index of 16 and has co-authored 26 publications receiving 1896 citations. Previous affiliations of Richard G. Casey include Cisco Systems, Inc.

Papers
Journal ArticleDOI
TL;DR: The requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing, are outlined, and several critical functions and their technical approaches are discussed.
Abstract: This paper outlines the requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing. Several critical functions have been investigated and the technical approaches are discussed. The first is the segmentation and classification of digitized printed documents into regions of text and images. A nonlinear, run-length smoothing algorithm has been used for this purpose. By using the regular features of text lines, a linear adaptive classification scheme discriminates text regions from others. The second technique studied is an adaptive approach to the recognition of the hundreds of font styles and sizes that can occur on printed documents. A preclassifier is constructed during the input process and used to speed up a well-known pattern-matching method for clustering characters from an arbitrary print source into a small sample of prototypes. Experimental results are included.
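
The run-length smoothing step can be sketched in a few lines. This is a minimal illustration of the idea, assuming binary images with 1 = black; the threshold values and function names are illustrative, since suitable thresholds depend on scan resolution.

```python
import numpy as np

def rlsa_1d(line, c):
    """Fill white gaps (runs of 0s) of length <= c that lie
    between two black pixels (1s)."""
    out = line.copy()
    last_black = None
    for i, px in enumerate(line):
        if px:
            if last_black is not None and 0 < i - last_black - 1 <= c:
                out[last_black + 1:i] = 1   # close the gap
            last_black = i
    return out

def rlsa(img, c_h=300, c_v=500):
    """Nonlinear run-length smoothing: smear rows and columns
    independently, then AND the two smeared images so that only
    regions dense in both directions survive as blocks."""
    horiz = np.array([rlsa_1d(row, c_h) for row in img])
    vert = np.array([rlsa_1d(col, c_v) for col in img.T]).T
    return horiz & vert
```

Connected components of the resulting blocks would then be classified as text or image regions using regular features of text lines, as described above.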

718 citations

Journal ArticleDOI
TL;DR: It is shown that a constrained run-length algorithm is well suited to partitioning most documents into areas of text lines, solid black lines, and rectangular boxes enclosing graphics and halftone images.

428 citations

Journal ArticleDOI
Richard G. Casey
TL;DR: Comparison experiments showed that error rates were reduced by integral factors if the patterns were normalized before scanning for recognition, and second-order moments of the pattern are convenient properties to use in specifying the transformation.
Abstract: Handprinted characters can be made more uniform in appearance than the as-written version if an appropriate linear transformation is performed on each input pattern. The transformation can be implemented electronically by programming a flying-spot raster-scanner to scan at specified angles rather than only along specified axes. Alternatively, curve-follower normalization can be achieved by transforming the coordinate waveforms in a linear combining network. Second-order moments of the pattern are convenient properties to use in specifying the transformation. By mapping the original pattern into one having a scalar moment matrix all linear pattern variations can be removed. Comparison experiments with three sets of handprinted numerals showed that error rates were reduced by integral factors if the patterns were normalized before scanning for recognition.
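
The normalization described here amounts to a whitening transform on the pattern coordinates: after centering, the pattern is mapped so that its second-order moment matrix becomes a scalar matrix. A minimal sketch, assuming the pattern is given as an (N, 2) array of black-pixel coordinates; the function name is hypothetical.

```python
import numpy as np

def moment_normalize(points):
    """Linearly transform a pattern so its second-order central
    moment matrix becomes the identity (a scalar matrix), removing
    linear variations such as slant, rotation, and unequal scaling."""
    centered = points - points.mean(axis=0)      # remove translation
    cov = centered.T @ centered / len(centered)  # second-order moments
    vals, vecs = np.linalg.eigh(cov)             # assumes a nondegenerate pattern
    transform = vecs @ np.diag(1.0 / np.sqrt(vals)) @ vecs.T
    return centered @ transform.T
```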

130 citations

Journal ArticleDOI
TL;DR: An important equivalence between operations with functional relations and operations with analogous Boolean functions is demonstrated; it is computationally helpful in exploring the properties of a given set of functional relations, as well as in partitioning a data set into subfiles for efficient implementation.
Abstract: The notion of a functional relation among the attributes of a data set can be fruitfully applied in the structuring of an information system. These relations are meaningful both to the user of the system in his semantic understanding of the data, and to the designer in implementing the system. An important equivalence between operations with functional relations and operations with analogous Boolean functions is demonstrated in this paper. The equivalence is computationally helpful in exploring the properties of a given set of functional relations, as well as in the task of partitioning a data set into subfiles for efficient implementation.
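
The equivalence can be made concrete by testing whether a candidate functional dependency follows from a given set: each FD X -> Y maps to the Boolean implication (AND of X) -> (AND of Y), and implication between sets of FDs coincides with implication between the corresponding formulas. A brute-force sketch over truth assignments, exponential in the number of attributes and for illustration only:

```python
from itertools import product

def fd_implies(fds, candidate, attrs):
    """Decide FD implication via the Boolean correspondence:
    `candidate` follows from `fds` iff every 0/1 assignment
    satisfying all FD formulas also satisfies the candidate's."""
    def holds(fd, assign):
        lhs, rhs = fd  # X -> Y reads: all of X true implies all of Y true
        return not all(assign[a] for a in lhs) or all(assign[a] for a in rhs)
    for bits in product([0, 1], repeat=len(attrs)):
        assign = dict(zip(attrs, bits))
        if all(holds(fd, assign) for fd in fds) and not holds(candidate, assign):
            return False
    return True

# A -> B and B -> C together imply A -> C:
print(fd_implies([({'A'}, {'B'}), ({'B'}, {'C'})],
                 ({'A'}, {'C'}), ['A', 'B', 'C']))  # True
```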

127 citations

Patent
Richard G. Casey, David R. Ferguson
02 Oct 1989
TL;DR: A computer-implemented method for extracting character data from printed forms is proposed: an image consisting only of the lines in the master form is built and displayed, and masks are defined on it, each corresponding to a field where data would be located in a filled-in form.
Abstract: A computer-implemented method operable with conventional OCR scanning equipment and software, extracts character data from printed forms. A blank master form is scanned and its digital image stored. Clusters of ON bits of the master form image are first recognized as part of a line and then connected to form lines. All of the lines in the master form image are then identified by row and column start position and column end position, thereby creating a master-form-description. The resulting image, which consists only of lines in the master form, can then be displayed. Regions or masks in the displayed image of master form lines are then created, each mask corresponding to a field where data would be located in a filled-in form. Each data mask is spaced from nearby lines by a predetermined data margin, referred to as D. A filled-in or data form is then scanned and lines are also recognized and identified in a similar manner to create a data-form-description. The data-form-description is compared with the master-form-description by computing the horizontal and vertical offsets and skew of the two forms relative to one another. The created data masks, whose orientation with respect to the master form has been previously determined, are then transposed into the data form image using the computed values of horizontal and vertical offsets and skew. In this manner, the data masks are correctly located on the data form so that the actual data values in the data form reside within the corresponding data masks. Routines are then implemented for detecting extraneous data intruding into the data masks and for growing the masks, i.e. enlarging the masks to capture data which may extend beyond the perimeter of the masks. Thus, the data masks are adaptive in that they are grown if data does not lie entirely within the perimeter of the masks. During the mask growth routine, lines which are part of the background form are detected and removed by line removal algorithms. Following the removal of extraneous data from the masks, the growth of the masks to capture data, and any subsequent line removal, the remaining data from the masks is extracted and transferred to a new file. The new file then contains only data comprising characters of the data values in the desired regions, which can then be operated on by conventional OCR software to identify the specific character values.
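
The mask-transposition step can be sketched as mapping a mask's corners through the measured offsets and skew and taking their bounding box. A minimal illustration under a small-angle rotation model; the coordinate conventions and names are hypothetical.

```python
import math

def transpose_mask(mask, dx, dy, skew_rad):
    """Map an axis-aligned mask (top, left, bottom, right) from
    master-form coordinates into data-form coordinates using the
    computed offsets and skew; return the bounding box of the
    four transformed corners."""
    top, left, bottom, right = mask
    cos_t, sin_t = math.cos(skew_rad), math.sin(skew_rad)
    corners = [(r, c) for r in (top, bottom) for c in (left, right)]
    mapped = [(r * cos_t - c * sin_t + dy, r * sin_t + c * cos_t + dx)
              for r, c in corners]
    rows, cols = [p[0] for p in mapped], [p[1] for p in mapped]
    return (min(rows), min(cols), max(rows), max(cols))
```

The mask-growth routine described above would then enlarge this rectangle whenever connected data pixels cross its perimeter.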

110 citations


Cited by
Journal ArticleDOI
TL;DR: Various types of moments have been used to recognize image patterns in a number of applications, and some fundamental questions are addressed, such as image-representation ability, noise sensitivity, and information redundancy.
Abstract: Various types of moments have been used to recognize image patterns in a number of applications. A number of moments are evaluated and some fundamental questions are addressed, such as image-representation ability, noise sensitivity, and information redundancy. Moments considered include regular moments, Legendre moments, Zernike moments, pseudo-Zernike moments, rotational moments, and complex moments. Properties of these moments are examined in detail and the interrelationships among them are discussed. Both theoretical and experimental results are presented.
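
Two of the moment families evaluated, regular (geometric) moments and their translation-invariant central counterparts, follow directly from the definitions. A minimal sketch for a 2-D intensity array; names are illustrative.

```python
import numpy as np

def geometric_moment(img, p, q):
    """Regular moment m_pq = sum over pixels of x^p * y^q * f(x, y)."""
    y, x = np.indices(img.shape)
    return float((x**p * y**q * img).sum())

def central_moment(img, p, q):
    """Central moment mu_pq: the same sum taken about the centroid,
    which makes it invariant to translation."""
    m00 = geometric_moment(img, 0, 0)
    xbar = geometric_moment(img, 1, 0) / m00
    ybar = geometric_moment(img, 0, 1) / m00
    y, x = np.indices(img.shape)
    return float(((x - xbar)**p * (y - ybar)**q * img).sum())
```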

1,522 citations

Journal ArticleDOI
TL;DR: This work discusses in detail the design decisions that led to the grid file, presents simulation results of its behavior, and compares it to other multikey access file structures.
Abstract: Traditional file structures that provide multikey access to records, for example, inverted files, are extensions of file structures originally designed for single-key access. They manifest various deficiencies in particular for multikey access to highly dynamic files. We study the dynamic aspects of file structures that treat all keys symmetrically, that is, file structures which avoid the distinction between primary and secondary keys. We start from a bitmap approach and treat the problem of file design as one of data compression of a large sparse matrix. This leads to the notions of a grid partition of the search space and of a grid directory, which are the keys to a dynamic file structure called the grid file. This file system adapts gracefully to its contents under insertions and deletions, and thus achieves an upper bound of two disk accesses for single record retrieval; it also handles range queries and partially specified queries efficiently. We discuss in detail the design decisions that led to the grid file, present simulation results of its behavior, and compare it to other multikey access file structures.
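
The lookup path through a grid directory can be sketched with an in-memory toy: one sorted scale per key dimension locates a grid cell, and the cell indexes a bucket, mirroring the directory-probe-plus-bucket-probe pattern behind the two-disk-access bound. Bucket splitting, scale refinement, and disk layout are omitted; all names are illustrative.

```python
from bisect import bisect_right

class GridFile:
    """Toy grid file: symmetric multikey access via a grid directory."""

    def __init__(self, scales):
        self.scales = scales   # scales[d]: sorted split points for dimension d
        self.directory = {}    # grid-cell index tuple -> bucket (list of records)

    def _cell(self, key):
        # One interval search per dimension locates the grid cell.
        return tuple(bisect_right(s, k) for s, k in zip(self.scales, key))

    def insert(self, key, record):
        self.directory.setdefault(self._cell(key), []).append((key, record))

    def lookup(self, key):
        # One directory probe, one bucket probe.
        return [r for k, r in self.directory.get(self._cell(key), []) if k == key]
```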

1,222 citations

Journal ArticleDOI
TL;DR: A new approach to shape recognition is presented, based on a virtually infinite family of binary features (queries) of the image data designed to accommodate prior information about shape invariance and regularity; a comparison with artificial neural network methods is also presented.
Abstract: We explore a new approach to shape recognition based on a virtually infinite family of binary features (queries) of the image data, designed to accommodate prior information about shape invariance and regularity. Each query corresponds to a spatial arrangement of several local topographic codes (or tags), which are in themselves too primitive and common to be informative about shape. All the discriminating power derives from relative angles and distances among the tags. The important attributes of the queries are a natural partial ordering corresponding to increasing structure and complexity; semi-invariance, meaning that most shapes of a given class will answer the same way to two queries that are successive in the ordering; and stability, since the queries are not based on distinguished points and substructures. No classifier based on the full feature set can be evaluated, and it is impossible to determine a priori which arrangements are informative. Our approach is to select informative features and build tree classifiers at the same time by inductive learning. In effect, each tree provides an approximation to the full posterior where the features chosen depend on the branch that is traversed. Due to the number and nature of the queries, standard decision tree construction based on a fixed-length feature vector is not feasible. Instead we entertain only a small random sample of queries at each node, constrain their complexity to increase with tree depth, and grow multiple trees. The terminal nodes are labeled by estimates of the corresponding posterior distribution over shape classes. An image is classified by sending it down every tree and aggregating the resulting distributions. The method is applied to classifying handwritten digits and synthetic linear and nonlinear deformations of three hundred LaTeX symbols. State-of-the-art error rates are achieved on the National Institute of Standards and Technology database of digits. The principal goal of the experiments on LaTeX symbols is to analyze invariance, generalization error and related issues, and a comparison with artificial neural network methods is presented in this context.
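
The classification rule, sending the image down every tree and aggregating the leaf posteriors, can be sketched as follows. The node structure and the representation of queries as callables are hypothetical simplifications of the paper's scheme.

```python
import numpy as np

class QueryNode:
    """Internal nodes hold a binary query (a callable on the image);
    terminal nodes hold an estimated posterior over shape classes."""

    def __init__(self, query=None, left=None, right=None, posterior=None):
        self.query, self.left, self.right = query, left, right
        self.posterior = posterior

    def descend(self, image):
        if self.posterior is not None:   # terminal node
            return self.posterior
        branch = self.right if self.query(image) else self.left
        return branch.descend(image)

def classify(trees, image):
    """Average the posteriors returned by every tree, then take the argmax."""
    return int(np.argmax(np.mean([t.descend(image) for t in trees], axis=0)))
```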

1,214 citations

Patent
Todd A. Cass
30 Jul 1996
TL;DR: In this article, a reference-based mark extraction technique is proposed in which the second document image serves as a reference image and substantially the entirety of the first document image is compared with substantially the entirety of the second image.
Abstract: A processor is provided with first and second document images. The first image represents an instance of a reference document to which instance a mark has been added. The second image is selected from among a collection of document images and represents the reference document without the mark. The processor automatically extracts from the first document image a set of pixels representing the mark. This is done by performing a reference-based mark extraction technique in which the second document image serves as a reference image and in which substantially the entirety of the first document image is compared with substantially the entirety of the second document image. Also, the processor is provided with information about a set of active elements of the reference document. The reference document has at least one such active element and each active element is associated with at least one action. The processor interprets the extracted set of pixels representing the mark by determining whether the mark indicates any of the active elements of the reference document. If the mark indicates an active element, the processor facilitates the action with which the indicated active element is associated.
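
Once the two images are registered, the extraction itself reduces to a pixelwise difference: keep what is ON in the marked instance but OFF in the reference. A minimal sketch assuming registered binary arrays; offset and skew correction would precede this step in practice.

```python
import numpy as np

def extract_mark(instance, reference):
    """Reference-based mark extraction on registered boolean images:
    keep pixels set in the marked instance but not in the reference."""
    return instance & ~reference
```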

1,099 citations

Journal ArticleDOI
TL;DR: It is shown that logic provides a convenient formalism for studying classical database problems, including the representation and manipulation of deduced facts and incomplete information.
Abstract: The purpose of this paper is to show that logic provides a convenient formalism for studying classical database problems. There are two main parts to the paper, devoted respectively to conventional databases and deductive databases. In the first part, we focus on query languages, integrity modeling and maintenance, query optimization, and data dependencies. The second part deals mainly with the representation and manipulation of deduced facts and incomplete information.
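
The deductive side can be given a concrete flavor with the classic ancestor rules evaluated bottom-up to a fixpoint, in the Datalog style such surveys build on; the relation names are illustrative.

```python
def transitive_closure(parent):
    """Naive bottom-up evaluation of the deductive rules
        ancestor(X, Y) <- parent(X, Y)
        ancestor(X, Z) <- parent(X, Y), ancestor(Y, Z)
    iterating until no new facts are deduced."""
    ancestor = set(parent)
    while True:
        new = {(x, z) for (x, y) in parent
                      for (y2, z) in ancestor if y == y2}
        if new <= ancestor:
            return ancestor
        ancestor |= new

facts = {("ann", "bob"), ("bob", "cid")}
print(transitive_closure(facts))  # adds the deduced fact ('ann', 'cid')
```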

769 citations