Home
/
Topics
/
Optical character recognition

Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Patent•

Image enhancement system

[...]

Donald F. Willis, John E. Brooks¹, Hosni Adra, Hsieh S. Hou•Institutions (1)

Bell & Howell¹

14 Dec 1992

TL;DR: In this article, a process and system for processing a digitally stored image on a digital computer is described, which scans and digitizes an image, separate text from non-text components, enhances and deskews the image, compresses the resulting image file, and stores the enhanced, deskewed, and compressed file for later transmission, optical character recognition, or high quality printing or viewing of the image.

...read moreread less

Abstract: This specification discloses a process and system for processing a digitally stored image on a digital computer. The system scans and digitizes an image, separate text from non-text components, enhances and deskews the image, compresses the resulting image file, and stores the enhanced, deskewed, and compressed file for later transmission, optical character recognition, or high quality printing or viewing of the image.

...read moreread less

62 citations

Proceedings Article•DOI•

On how to describe shapes of Devanagari characters and use them for recognition

[...]

Veena Bansal¹, R.M.K. Sinha•Institutions (1)

Indian Institute of Technology Kanpur¹

20 Sep 1999

TL;DR: A schema for the description of shapes of Devanagari characters and its application in their recognition is presented, which exploits certain features of the script in both reducing the search space and creating a reference with respect to which correspondence could be established, during the matching process.

...read moreread less

Abstract: The paper presents a schema for the description of shapes of Devanagari characters and its application in their recognition. It exploits certain features of the script in both reducing the search space and creating a reference with respect to which correspondence could be established, during the matching process. The description prototypes are constructed using the real-life script after segmentation so that the aberrations introduced during the inevitable process of segmentation get accounted for in the description. This has been tested on printed Devanagari text with a success of approximately 70% without any post-processing and 88% correct recognition with the help of a word dictionary.

...read moreread less

62 citations

Patent•

Optical character recognition neural network system for machine-printed characters

[...]

Roger S. Gaborski¹, Louis James Beato¹, Lori L. Barski¹, Hin-Leong Tan¹, Andrew M Assad¹, Dawn Lorraine Dutton¹ - Show less +2 more•Institutions (1)

Eastman Kodak Company¹

02 Feb 1990

TL;DR: In this paper, the output of the neural network is processed by an optical character recognition post-processor, which corrects erroneous symbol identifications made by the network and identifies special symbols and symbol cases not identifiable by the neural networks following character normalization.

...read moreread less

Abstract: Character images which are to be sent to a neural network trained to recognize a predetermined set of symbols are first processed by an optical character recognition pre-processor which normalizes the character images. The output of the neural network is processed by an optical character recognition post-processor. The post-processor corrects erroneous symbol identifications made by the neural network. The post-processor identifies special symbols and symbol cases not identifiable by the neural network following character normalization. For characters identified by the neural network with low scores, the post-processor attempts to find and separate adjacent characters which are kerned and characters which are touching. The touching characters are separated in one of nine successively initiated processes depending upon the geometric parameters of the image. When all else fails, the post-processor selects either the second or third highest scoring symbol identified by the neural network based upon the likelihood of the second or third highest scoring symbol being confused with the highest scoring symbol.

...read moreread less

62 citations

Proceedings Article•DOI•

Improved document image segmentation algorithm using multiresolution morphology

[...]

Syed Saqib Bukhari¹, Faisal Shafait, Thomas M. Breuel¹•Institutions (1)

Kaiserslautern University of Technology¹

24 Jan 2011

TL;DR: Modifications to the text/non-text segmentation algorithm presented by Bloomberg are described which result in significant improvements and achieved better segmentation accuracy than the original algorithm for UW-III, UNLV, ICDAR 2009 page segmentation competition test images and circuit diagram datasets.

...read moreread less

Abstract: Page segmentation into text and non-text elements is an essential preprocessing step before optical character recognition (OCR) operation. In case of poor segmentation, an OCR classification engine produces garbage characters due to the presence of non-text elements. This paper describes modifications to the text/non-text segmentation algorithm presented by Bloomberg,1 which is also available in his open-source Leptonica library.2The modifications result in significant improvements and achieved better segmentation accuracy than the original algorithm for UW-III, UNLV, ICDAR 2009 page segmentation competition test images and circuit diagram datasets.

...read moreread less

62 citations

Journal Article•DOI•

An MLP-SVM combination architecture for offline handwritten digit recognition

[...]

A. Bellili¹, M. Gilloux, Patrick Gallinari¹•Institutions (1)

Pierre-and-Marie-Curie University¹

01 Jul 2003-International Journal on Document Analysis and Recognition

TL;DR: An original hybrid MLP-SVM method for unconstrained handwritten digits recognition, based on the idea that the correct digit class almost systematically belongs to the two maximum MLP outputs and that some pairs of digit classes constitute the majority of MLP substitutions (errors).

...read moreread less

Abstract: This paper presents an original hybrid MLP-SVM method for unconstrained handwritten digits recognition. Specialized Support Vector Machines (SVMs) are introduced to improve significantly the multilayer perceptron (MLP) performance in local areas around the separating surfaces between each pair of digit classes, in the input pattern space. This hybrid architecture is based on the idea that the correct digit class almost systematically belongs to the two maximum MLP outputs and that some pairs of digit classes constitute the majority of MLP substitutions (errors). Specialized local SVMs are introduced to detect the correct class among these two classification hypotheses. The hybrid MLP-SVM recognizer achieves a recognition rate of $98.01\%$ , for real mail zipcode digits recognition task. By introducing a rejection mechanism based on the distances provided by the local SVMs, the error/reject trade-off performance of our recognition system is better than several classifiers reported in recent research.

...read moreread less

62 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
…
82
83
84
85
86
87
88
…
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

7,941

Papers

180,323

Citations

No. of papers in the topic in previous years
Year	Papers
2023	186
2022	425
2021	333
2020	448
2019	430
2018	357

Optical character recognition

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics