Optical character recognition

About: Optical character recognition is a research topic. Over its lifetime, 7342 publications have been published within this topic, receiving 158193 citations. The topic is also known as OCR and optical character reader.

[Chart: papers published on a yearly basis, 1969 to 2023]

Papers

Journal Article

A real time marking inspection scheme for semiconductor industries

R. Nagarajan¹, Sazali Yaacob¹, Paulraj Pandian¹, M. Karthigayan¹, Shamsudin H. M. Amin², Marzuki Khalid²

University College of Engineering¹, Universiti Teknologi Malaysia²

24 Sep 2007 - The International Journal of Advanced Manufacturing Technology

TL;DR: A real time industrial machine vision system incorporating optical character recognition (OCR) is employed to inspect markings on integrated circuit (IC) chips to identify print errors such as illegible characters, missing characters and upside down printing.

Abstract: In this paper, a real time industrial machine vision system incorporating optical character recognition (OCR) is employed to inspect markings on integrated circuit (IC) chips. This inspection is carried out while the ICs are coming out from the manufacturing line. A TSSOP-DGG type of IC package from Texas Instruments is used in the investigation. The IC chip markings are laser printed. This inspection system tests whether the laser printed marking on IC chips is proper. The inspection has to identify print errors such as illegible characters, missing characters and upside down printing. The vision inspection of the printed markings on the IC chip is carried out in three phases, namely, image preprocessing, feature extraction and classification. The MATLAB platform and its toolboxes are used for designing the inspection processing technique. Speed of the marking inspection is mostly dependent on the effectiveness of the feature extraction technique. The performances of four feature extraction techniques are compared in terms of their respective speed. The feature extracted data are used in a neural network for classifying the marking errors. A suggestion to optimize the number of input neurons of the neural network for a fast classification is also presented.

35 citations

Proceedings Article

Offline recognition of large vocabulary cursive handwritten text

Alessandro Vinciarelli, Samy Bengio, Horst Bunke¹

University of Bern¹

03 Aug 2003

TL;DR: This paper presents a system for the offline recognition of cursive handwritten lines of text based on continuous density HMMs and Statistical Language Models, which shows a recognition rate of ~85% with a lexicon containing 50'000 words.

Abstract: This paper presents a system for the offline recognition of cursive handwritten lines of text. The system is based on continuous density HMMs and Statistical Language Models. The system recognizes data produced by a single writer. No a-priori knowledge is used about the content of the text to be recognized. Changes in the experimental setup with respect to the recognition of single words are highlighted. The results show a recognition rate of ~85% with a lexicon containing 50'000 words. The experiments were performed over a publicly available database.

34 citations

Proceedings Article

Text segmentation and recognition in complex background based on Markov random field

Datong Chen, J.-M. Odobez, Hervé Bourlard

11 Aug 2002

TL;DR: By varying the number of Gaussians, multiple hypotheses are provided to an OCR system and the final result is selected from the set of outputs, leading to an improvement in the system's performance.

Abstract: In this paper we propose a method to segment and recognize text embedded in video and images. We model the gray level distribution in the text images as a mixture of Gaussians, and then assign each pixel to one of the Gaussian layers. The assignment is based on a prior over the contextual information, which is modeled by a Markov random field (MRF) with online estimated coefficients. Each layer is then processed through a connected component analysis module and forwarded to the OCR system as one segmentation hypothesis. By varying the number of Gaussians, multiple hypotheses are provided to an OCR system and the final result is selected from the set of outputs, leading to an improvement in the system's performance.
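The layer-assignment idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it fits a one-dimensional Gaussian mixture to gray levels with plain EM, assigns each pixel to its most likely component, and repeats this for several component counts to obtain several segmentation hypotheses. The MRF contextual prior, the connected component analysis, and the OCR stage are all omitted, and the function names `fit_gmm_1d`, `assign_layers`, and `segmentation_hypotheses` are hypothetical.

```python
import math

def fit_gmm_1d(values, k, iterations=50):
    """Fit a k-component 1-D Gaussian mixture to gray levels with plain EM."""
    lo, hi = min(values), max(values)
    # Deterministic initialisation: means spread evenly over the gray range.
    means = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    variances = [((hi - lo) / k) ** 2 + 1.0] * k
    weights = [1.0 / k] * k
    for _ in range(iterations):
        # E-step: responsibility of each component for each gray level.
        resp = []
        for v in values:
            dens = [w * math.exp(-(v - m) ** 2 / (2 * s)) / math.sqrt(2 * math.pi * s)
                    for w, m, s in zip(weights, means, variances)]
            total = sum(dens) or 1e-300
            resp.append([d / total for d in dens])
        # M-step: re-estimate weight, mean and variance of each component.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-9:
                continue  # component received no mass; keep its old parameters
            weights[j] = nj / len(values)
            means[j] = sum(r[j] * v for r, v in zip(resp, values)) / nj
            variances[j] = sum(r[j] * (v - means[j]) ** 2
                               for r, v in zip(resp, values)) / nj + 1.0  # variance floor
    return weights, means, variances

def assign_layers(values, k):
    """Label each gray level with its most likely component (no MRF prior)."""
    weights, means, variances = fit_gmm_1d(values, k)
    labels = []
    for v in values:
        scores = [math.log(w + 1e-300) - 0.5 * math.log(s) - (v - m) ** 2 / (2 * s)
                  for w, m, s in zip(weights, means, variances)]
        labels.append(scores.index(max(scores)))
    return labels, means

def segmentation_hypotheses(values, component_counts=(2, 3)):
    """One (labels, means) hypothesis per number of Gaussians."""
    return [assign_layers(values, k) for k in component_counts]
```

In a fuller pipeline each label value would define one binary layer, and each layer would be passed on as a separate hypothesis; here `values` is simply a flat list of gray levels.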

34 citations

Journal Article

A Document Image Retrieval System

Konstantinos Zagoris¹, Kavallieratou Ergina², Nikos Papamarkos¹

Democritus University of Thrace¹, University of the Aegean²

01 Sep 2010 - Engineering Applications of Artificial Intelligence

TL;DR: A system that locates words in document image archives by bypassing character recognition and using word images as queries. It uses document image processing techniques to extract powerful features for describing the word images.

34 citations

Journal Article

Recognition of Handwritten Arabic Characters using Histograms of Oriented Gradient (HOG)

Noor A. Jebril¹, Hussein Al-Zoubi², Qasem Abu Al-Haija¹

King Faisal University¹, Yarmouk University²

16 Jun 2018 - Pattern Recognition and Image Analysis

TL;DR: Experimental results showed a great success of the recognition method compared to the state of the art techniques, where it could achieve very high recognition rates exceeding 99%.

Abstract: Optical Character Recognition (OCR) is the process of recognizing printed or handwritten text on paper documents. This paper proposes an OCR system for Arabic characters. In addition to the preprocessing phase, the proposed recognition system consists mainly of three phases. In the first phase, we employ word segmentation to extract characters. In the second phase, Histograms of Oriented Gradient (HOG) are used for feature extraction. The final phase employs a Support Vector Machine (SVM) for classifying characters. We have applied the proposed method to the recognition of Jordanian city, town, and village names as a case study, in addition to many other words that offer character shapes not covered by the names of Jordanian cities. The set has been carefully selected to include every Arabic character in all four of its forms. To this end, we have built our own dataset consisting of more than 43,000 handwritten Arabic words (30,000 used in the training stage and 13,000 used in the testing stage). Experimental results showed a great success of our recognition method compared to the state of the art techniques, where we could achieve very high recognition rates exceeding 99%.
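As a rough illustration of the feature extraction phase mentioned above, the sketch below computes a magnitude-weighted histogram of unsigned gradient orientations over a single cell of a grayscale image. It is a simplified stand-in and not the paper's implementation: a full HOG descriptor also tiles the image into many cells and normalises them over overlapping blocks, and the SVM classification phase is omitted here. The function name `hog_cell_histogram`, the choice of 9 bins, and the list-of-lists image format are assumptions made for the example.

```python
import math

def hog_cell_histogram(image, bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations for one cell.

    `image` is a list of rows of gray levels (hypothetical input format).
    """
    height, width = len(image), len(image[0])
    hist = [0.0] * bins
    bin_width = 180.0 / bins
    # Central differences on interior pixels only; border pixels are skipped.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = image[y][x + 1] - image[y][x - 1]
            gy = image[y + 1][x] - image[y - 1][x]
            magnitude = math.hypot(gx, gy)
            if magnitude == 0:
                continue
            # Unsigned orientation folded into [0, 180) degrees.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            hist[int(angle // bin_width) % bins] += magnitude
    # L2-normalise so the feature is insensitive to overall contrast.
    norm = math.sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist
```

Feature vectors of this kind, concatenated over the cells of each segmented character image, are what a classifier such as an SVM would then be trained on.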

34 citations

Performance metrics

Papers: 7,941

Citations: 180,323

No. of papers in the topic in previous years
Year	Papers
2023	186
2022	425
2021	333
2020	448
2019	430
2018	357
