Optical character recognition

About: Optical character recognition (OCR) is a research topic. Over its lifetime, 7,342 publications have been published within this topic, receiving 158,193 citations. The topic is also known as OCR and optical character reader.

[Chart: papers published on a yearly basis, 1969 to 2023]

Papers

Patent

One-screen reconciliation of business document image data, optical character recognition extracted data, and enterprise resource planning data

Jean-Jacques Berard, Nicolas Perotin

03 Oct 2007

TL;DR: A business document is scanned to create a document image; a set of data is extracted from the image via optical character recognition (OCR) and compared with data in a business information management or enterprise resource planning (ERP) system.

Abstract: Systems and methods of reconciling data from an imaged document. In one embodiment, a business document is scanned to create a business document image. A set of extracted data is extracted from the business document image via optical character recognition (OCR). The set of OCR extracted data is then compared with data in a business information management or enterprise resource planning (ERP) system. A set of ERP data is retrieved from the ERP system that relates to the set of OCR extracted data. The retrieved ERP data is then assigned to the set of OCR extracted data to create a set of assigned data. The business document image is then displayed in a business document image pane, the set of OCR extracted data is displayed in the OCR data pane, and the retrieved ERP data is displayed in the ERP data pane. The set of assigned data is validated, and the ERP system is updated with the set of validated, assigned data. In other embodiments, data is extracted from text files without using OCR.

42 citations

Proceedings Article

Software tools and test data for research and testing of page-reading OCR systems

Thomas A. Nartker¹, Stephen V. Rice¹, Steven E. Lumos¹

University of Nevada, Las Vegas¹

17 Jan 2005

TL;DR: The UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms.

Abstract: We announce the availability of the UNLV/ISRI Analytic Tools for OCR Evaluation together with a large and diverse collection of scanned document images with the associated ground-truth text. This combination of tools and test data will allow anyone to conduct a meaningful test comparing the performance of competing page-reading algorithms. The value of this collection of software tools and test data is enhanced by knowledge of the past performance of several systems using exactly these tools and this data. These performance comparisons were published in previous ISRI Test Reports and are also provided. Another value is that the tools can be used to test the character accuracy of any page-reading OCR system for any language included in the Unicode standard. The paper concludes with a summary of the programs, test data, and documentation that are available and gives the URL where they can be located.

42 citations
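
The abstract above mentions testing the character accuracy of a page-reading OCR system against ground-truth text. A minimal sketch of one common way to compute such a figure follows, assuming accuracy is defined as (n - errors) / n, where n is the number of ground-truth characters and errors is the Levenshtein edit distance between ground truth and OCR output. The function names `edit_distance` and `character_accuracy` are hypothetical and this is not the UNLV/ISRI tools' code.

```python
def edit_distance(truth: str, ocr: str) -> int:
    """Levenshtein distance: the minimum number of character insertions,
    deletions, and substitutions needed to turn `ocr` into `truth`."""
    # prev[j] holds the distance between the processed prefix of `truth`
    # and the first j characters of `ocr`.
    prev = list(range(len(ocr) + 1))
    for i, t in enumerate(truth, start=1):
        curr = [i]
        for j, o in enumerate(ocr, start=1):
            cost = 0 if t == o else 1
            curr.append(min(prev[j] + 1,          # character missing from OCR output
                            curr[j - 1] + 1,      # spurious character in OCR output
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def character_accuracy(truth: str, ocr: str) -> float:
    """Assumed definition: (n - errors) / n, with n = len(truth).
    The value can be negative when the OCR output is very poor."""
    if not truth:
        raise ValueError("ground-truth text must not be empty")
    errors = edit_distance(truth, ocr)
    return (len(truth) - errors) / len(truth)


if __name__ == "__main__":
    # One substituted character out of four.
    print(character_accuracy("abcd", "abxd"))
```

Because Python strings are sequences of Unicode code points, the same sketch applies to text in any script, although it counts code points and not user-perceived characters.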

Patent

Apparatus, systems and methods for presenting text identified in a video image

Dale Mountain

18 Jan 2012

TL;DR: A complete video frame associated with a presented video image of a media content event is received, a region of text is found in the frame, an optical character recognition (OCR) algorithm is used to translate the found text, and the translated text is presented.

Abstract: Systems and methods are operable to present text identified in a presented video image of a media content event. An exemplary embodiment receives a complete video frame that is associated with a presented video image of a video content event, wherein the presented video image includes a region of text; finds the text in the complete video frame; uses an optical character recognition (OCR) algorithm to translate the found text; and presents the translated text. The translated text may be presented on a display concurrently with the video image that is presented on the display. Alternatively, or additionally, the translated text may be presented as audible speech emitted from at least one speaker.

42 citations

Proceedings Article

Application of artificial neural network model for optical character recognition

N. Mani¹, Bala Srinivasan

Monash University, Clayton campus¹

12 Oct 1997

TL;DR: This paper examines a simple pattern-recognition system using an artificial neural network to simulate character recognition, and uses the backpropagation method for learning in the neural network.

Abstract: Many artificial neural network models (ANNs) have been proposed to mimic the human brain in solving problems involving human-like intelligence. An application of an artificial neural network approach for optical character recognition (OCR) is discussed in this paper. We examine a simple pattern-recognition system using an artificial neural network to simulate character recognition. A simple feedforward neural network model has been trained with different sets of noisy data. The backpropagation method is used for learning in the neural network.

42 citations
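
The abstract above describes a simple feedforward network trained by backpropagation on noisy character data. The sketch below illustrates that general idea only. The 3x3 glyphs, the single hidden layer of six sigmoid units, the learning rate, the one-pixel-flip noise model, and all names (`TinyNet`, `add_noise`, `train`) are assumptions made for illustration and are not taken from the paper.

```python
import math
import random

# Hypothetical 3x3 binary glyphs in row-major order (not from the paper).
GLYPHS = {
    "T": [1, 1, 1, 0, 1, 0, 0, 1, 0],
    "L": [1, 0, 0, 1, 0, 0, 1, 1, 1],
    "X": [1, 0, 1, 0, 1, 0, 1, 0, 1],
}
LABELS = sorted(GLYPHS)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def add_noise(pixels, rng, flips=1):
    """Return a copy of the glyph with `flips` randomly chosen pixels inverted."""
    noisy = list(pixels)
    for idx in rng.sample(range(len(noisy)), flips):
        noisy[idx] = 1 - noisy[idx]
    return noisy


def one_hot(label):
    return [1.0 if name == label else 0.0 for name in LABELS]


class TinyNet:
    """One hidden layer, sigmoid units; the last weight in each row is a bias."""

    def __init__(self, n_in, n_hidden, n_out, rng):
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
                   for _ in range(n_out)]

    def forward(self, x):
        xb = list(x) + [1.0]
        hidden = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in self.w1]
        hb = hidden + [1.0]
        out = [sigmoid(sum(w * v for w, v in zip(row, hb))) for row in self.w2]
        return hidden, out

    def train_step(self, x, target, lr=0.5):
        """One stochastic gradient step on the squared error (backpropagation)."""
        hidden, out = self.forward(x)
        # Error signal at the output layer.
        d_out = [(o - t) * o * (1 - o) for o, t in zip(out, target)]
        # Error signal at the hidden layer, computed before w2 is updated.
        d_hid = [h * (1 - h) * sum(d_out[k] * self.w2[k][j] for k in range(len(out)))
                 for j, h in enumerate(hidden)]
        xb, hb = list(x) + [1.0], hidden + [1.0]
        for k, row in enumerate(self.w2):
            for j in range(len(row)):
                row[j] -= lr * d_out[k] * hb[j]
        for j, row in enumerate(self.w1):
            for i in range(len(row)):
                row[i] -= lr * d_hid[j] * xb[i]


def predict(net, pixels):
    _, out = net.forward(pixels)
    return LABELS[out.index(max(out))]


def mean_loss(net):
    """Mean squared-error loss over the clean glyphs."""
    total = 0.0
    for label, pixels in GLYPHS.items():
        _, out = net.forward(pixels)
        total += 0.5 * sum((o - t) ** 2 for o, t in zip(out, one_hot(label)))
    return total / len(GLYPHS)


def train(epochs=2000, seed=0):
    rng = random.Random(seed)
    net = TinyNet(9, 6, len(LABELS), rng)
    loss_before = mean_loss(net)
    for _ in range(epochs):
        for label in LABELS:
            net.train_step(add_noise(GLYPHS[label], rng), one_hot(label))
    return net, loss_before, mean_loss(net)


if __name__ == "__main__":
    net, before, after = train()
    print(f"loss before: {before:.4f}, after: {after:.4f}")
    print({label: predict(net, GLYPHS[label]) for label in LABELS})
```

Training on freshly corrupted copies of each glyph at every step is one simple way to expose the network to noisy data; the paper may well have used a different noise model or fixed noisy training sets.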

Book Chapter

Natural Language Watermarking Using Semantic Substitution for Chinese Text

Chiang Yuei-Lin, Chang Lu-Ping, Hsieh Wen-Tai, Chen Wen-Chih

20 Oct 2003

TL;DR: This study attempts to develop a method for embedding a watermark in text that is as successful as the frequency-domain methods have been for image and audio.

Abstract: Numerous schemes have been designed for watermarking multimedia content. Many of these schemes are vulnerable to watermark erasing attacks. Naturally, such methods are ineffective on text unless the text is represented as a bitmap image, but in that case, the watermark can be erased easily by using Optical Character Recognition (OCR) to change the representation of the text from a bitmap to ASCII or EBCDIC. This study attempts to develop a method for embedding a watermark in text that is as successful as the frequency-domain methods have been for image and audio. The novel method embeds the watermark in the original text, creating ciphertext, which preserves the meaning of the original text via various semantic replacements.

42 citations

Performance

Metrics

Papers: 7,941
Citations: 180,323

No. of papers in the topic in previous years
Year	Papers
2023	186
2022	425
2021	333
2020	448
2019	430
2018	357
