Topic

Optical character recognition

About: Optical character recognition is a research topic. Over its lifetime, 7,342 publications have been published within this topic, receiving 158,193 citations. The topic is also known as OCR and optical character reader.

Papers published on a yearly basis (chart covering 1969 to 2023)

Papers

Proceedings Article

Table image segmentation

K. Zuyev

18 Aug 1997

TL;DR: The proposed approach introduces the concept of a table grid, which can serve advanced methods of table structure analysis by providing a layer of terminal symbols for the table that syntactic methods can use.

Abstract: An algorithm for table image segmentation, part of a complete document recognition system, is presented. The proposed approach introduces the concept of a table grid, which can serve advanced methods of table structure analysis. It provides a layer of terminal symbols for the table, which is used by syntactic methods. Grid detection, which is performed through analysis of the projection profile of connected components, is discussed in detail. Simple rules for the analysis of table structure cover the majority of real-life tables. The system is implemented, tested, and is now extensively used in the FineReader OCR product.
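
As a rough illustration of the projection-profile idea mentioned in this abstract, the following minimal sketch projects connected-component bounding boxes onto one axis and reports empty runs as candidate grid separators. The abstract does not give the actual algorithm, so the box format, the function names `projection_profile` and `find_gaps`, and the `min_gap` parameter are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical box format: (x0, y0, x1, y1), with x1 and y1 exclusive.
Box = Tuple[int, int, int, int]


def projection_profile(boxes: List[Box], length: int, axis: int) -> List[int]:
    """Count, for each position along the axis, how many boxes cover it.

    axis=0 projects onto the x-axis (columns), axis=1 onto the y-axis (rows).
    """
    profile = [0] * length
    for box in boxes:
        start, end = (box[0], box[2]) if axis == 0 else (box[1], box[3])
        for pos in range(max(start, 0), min(end, length)):
            profile[pos] += 1
    return profile


def find_gaps(profile: List[int], min_gap: int) -> List[Tuple[int, int]]:
    """Return (start, end) runs of zeros at least min_gap long; end is exclusive."""
    gaps = []
    run_start = None
    # A non-zero sentinel closes a zero run that reaches the end of the profile.
    for pos, value in enumerate(profile + [1]):
        if value == 0:
            if run_start is None:
                run_start = pos
        elif run_start is not None:
            if pos - run_start >= min_gap:
                gaps.append((run_start, pos))
            run_start = None
    return gaps


# Four components laid out as a 2 x 2 table on a 30 x 15 pixel image.
cells = [(0, 0, 10, 5), (20, 0, 30, 5), (0, 10, 10, 15), (20, 10, 30, 15)]
column_gaps = find_gaps(projection_profile(cells, 30, axis=0), min_gap=3)
row_gaps = find_gaps(projection_profile(cells, 15, axis=1), min_gap=3)
print(column_gaps, row_gaps)
```

In this toy layout the only empty runs are the gutter between the two columns and the gutter between the two rows. A real system would presumably also need to handle ruling lines, skew, and cells that span several rows or columns.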

44 citations

Journal Article

The past, present, and future of neural networks for signal processing

Jenq-Nen Hwang¹, Sun-Yan Kung, M. Niranjan, Jose C. Principe•Institutions (1)

University of Washington¹

01 Jan 1997-IEEE Signal Processing Magazine

TL;DR: The article reviews the fundamentals of neural networks and reports recent progress; topics covered include dynamic modeling, model-based neural networks, statistical learning, eigenstructure-based processing, active learning, and generalization capability.

Abstract: The article provides a review of the fundamentals of neural networks and reports recent progress. Topics covered include dynamic modeling, model-based neural networks, statistical learning, eigenstructure-based processing, active learning, and generalization capability. Current and potential applications of neural networks are also described in detail. Those applications include optical character recognition, speech recognition and synthesis, automobile and aircraft control, image analysis and neural vision, and several medical applications. Essentially, neural networks have become a very effective tool in signal processing, particularly in various recognition tasks.

44 citations

Proceedings Article

Combining multiple thresholding binarization values to improve OCR output

William B. Lund¹, Douglas J. Kennard¹, Eric K. Ringger¹•Institutions (1)

Brigham Young University¹

04 Feb 2013

TL;DR: This novel approach combines the OCR outputs from multiple thresholded images by aligning the text output and producing a lattice of word alternatives from which a lattice word error rate (LWER) is calculated.

Abstract: For noisy, historical documents, a high optical character recognition (OCR) word error rate (WER) can render the OCR text unusable. Since image binarization is often the method used to identify foreground pixels, a body of research seeks to improve image-wide binarization directly. Instead of relying on any one imperfect binarization technique, our method incorporates information from multiple simple thresholding binarizations of the same image to improve text output. Using a new corpus of 19th-century newspaper grayscale images for which the text transcription is known, we observe WERs of 13.8% and higher using current binarization techniques and a state-of-the-art OCR engine. Our novel approach combines the OCR outputs from multiple thresholded images by aligning the text output and producing a lattice of word alternatives from which a lattice word error rate (LWER) is calculated. Our results show an LWER of 7.6% when aligning two threshold images and an LWER of 6.8% when aligning five. From the word lattice we commit to one hypothesis by applying the methods of Lund et al. (2011), achieving an improvement over the original OCR output and an 8.41% WER result on this data set.
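
To make the two metrics in this abstract concrete, here is a minimal sketch of a word error rate and a simplified lattice word error rate. It is not the authors' alignment procedure: it assumes the OCR outputs are already aligned position by position, models the lattice as a plain list of sets of alternatives, and counts a position as correct when any alternative matches the reference. The names `edit_distance`, `wer`, and `lattice_wer` are illustrative.

```python
from typing import Callable, List, Sequence, Set


def edit_distance(ref: Sequence, hyp: Sequence, match: Callable) -> int:
    """Levenshtein distance between ref and hyp with a pluggable match test."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if match(r, h) else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # match or substitution
        prev = cur
    return prev[-1]


def wer(ref: List[str], hyp: List[str]) -> float:
    """Word error rate of one hypothesis; ref is assumed to be non-empty."""
    return edit_distance(ref, hyp, lambda r, h: r == h) / len(ref)


def lattice_wer(ref: List[str], lattice: List[Set[str]]) -> float:
    """Simplified lattice WER: a slot is correct if any alternative matches."""
    return edit_distance(ref, lattice, lambda r, alts: r in alts) / len(ref)


reference = "the quick brown fox".split()
# Hypothetical OCR outputs from two different binarization thresholds.
output_a = "the qu1ck brown f0x".split()
output_b = "tne quick brown f0x".split()

# Assumes equal-length, already aligned outputs; real outputs need alignment.
lattice = [set(words) for words in zip(output_a, output_b)]

print(wer(reference, output_a), wer(reference, output_b))
print(lattice_wer(reference, lattice))
```

Each single output gets two of the four words wrong, while the lattice contains the correct word in all but one slot, which mirrors why an LWER can be lower than the WER of any individual thresholded image.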

43 citations

Journal Article

High level document analysis guided by geometric aspects

Andreas Dengel¹, Gerhard Barth¹•Institutions (1)

University of Stuttgart¹

01 Dec 1988-International Journal of Pattern Recognition and Artificial Intelligence

TL;DR: This article proposes an approach to identify the layout of a document page by dividing it recursively into nested rectangular areas and uses it as a basis for a document layout model, which is able to control an automatic interpretation mechanism for deriving a high-level representation of the contents of a document.

Abstract: The realization of the paper-free office seems to be more difficult than expected. Therefore, good paper-computer interfaces are necessary to transform paper documents into an electronic form, which allows the use of a filing and retrieval system. An electronic document page is an optically scanned and digitized representation of a printed page. Document analysis is the problem of interpreting and labeling the constituents of the document. Although there are very reliable optical character recognition (OCR) methods, the process could be very inefficient. To prune the search space and to become more efficient, some search-supporting methods have to be developed. This article proposes an approach to identify the layout of a document page by dividing it recursively into nested rectangular areas. The procedure is used as a basis for a document layout model, which is able to control an automatic interpretation mechanism for deriving a high-level representation of the contents of a document. We have implemented our method in Common Lisp on a Symbolics 3640 workstation and have run it for a large population of office documents. The results obtained have been very encouraging and have convincingly confirmed the soundness of our approach.
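
The recursive division into nested rectangles could be sketched along the following lines. The abstract does not say how the split positions are chosen, so this example assumes a projection-profile split at the widest whitespace gap (a technique often called a recursive X-Y cut) on a small binary image; the function names, the region format, and the `min_gap` parameter are hypothetical.

```python
from typing import List, Optional, Tuple

# Hypothetical region format: (top, left, bottom, right), bottom/right exclusive.
Region = Tuple[int, int, int, int]


def widest_gap(profile: List[int], min_gap: int) -> Optional[Tuple[int, int]]:
    """Return the widest zero run (start, end) that is followed by ink, or None."""
    best = None
    run_start = None
    for pos, value in enumerate(profile):
        if value == 0:
            if run_start is None:
                run_start = pos
        elif run_start is not None:
            width = pos - run_start
            if width >= min_gap and (best is None or width > best[1] - best[0]):
                best = (run_start, pos)
            run_start = None
    return best


def xy_cut(image: List[List[int]], region: Region, min_gap: int = 2) -> List[Region]:
    """Recursively split a region at its widest whitespace gap; return leaf blocks."""
    top, left, bottom, right = region
    rows = [sum(image[y][left:right]) for y in range(top, bottom)]
    cols = [sum(image[y][x] for y in range(top, bottom)) for x in range(left, right)]
    ys = [i for i, v in enumerate(rows) if v]
    xs = [i for i, v in enumerate(cols) if v]
    if not ys:
        return []  # the region contains no ink
    # Shrink the region to the bounding box of its ink before looking for gaps.
    rows, cols = rows[ys[0]:ys[-1] + 1], cols[xs[0]:xs[-1] + 1]
    top, bottom = top + ys[0], top + ys[-1] + 1
    left, right = left + xs[0], left + xs[-1] + 1
    row_gap, col_gap = widest_gap(rows, min_gap), widest_gap(cols, min_gap)
    if row_gap is None and col_gap is None:
        return [(top, left, bottom, right)]
    cut_rows = col_gap is None or (
        row_gap is not None and row_gap[1] - row_gap[0] >= col_gap[1] - col_gap[0])
    if cut_rows:
        first = (top, left, top + row_gap[0], right)
        second = (top + row_gap[1], left, bottom, right)
    else:
        first = (top, left, bottom, left + col_gap[0])
        second = (top, left + col_gap[1], bottom, right)
    return xy_cut(image, first, min_gap) + xy_cut(image, second, min_gap)


# Toy page: two blocks side by side above one full-width block (1 = ink).
page = [
    [1, 1, 1, 0, 0, 0, 1, 1, 1, 1],
    [1, 1, 1, 0, 0, 0, 1, 1, 1, 1],
    [0] * 10,
    [0] * 10,
    [0] * 10,
    [0] * 10,
    [1] * 10,
    [1] * 10,
]
blocks = xy_cut(page, (0, 0, len(page), len(page[0])))
print(blocks)
```

This sketch returns only the leaf rectangles. Keeping the intermediate regions as tree nodes instead would give the nested structure that a layout model like the one described could then label.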

43 citations

Journal Article

Chinese Character CAPTCHA Recognition and performance estimation via deep neural network

Dazhen Lin¹,², Fan Lin², Yanping Lv², Feipeng Cai², Donglin Cao¹,²•Institutions (2)

Minjiang University¹, Xiamen University²

02 May 2018-Neurocomputing

TL;DR: This paper proposes a convolutional neural network (CNN) based approach to learn the stroke, radical, and character features of Chinese characters, and shows that the network structure is superior to LeNet-5 in this task.

43 citations

Network Information

Performance Metrics

Papers: 7,941
Citations: 180,323

No. of papers in the topic in previous years
Year	Papers
2023	186
2022	425
2021	333
2020	448
2019	430
2018	357