Home
/
Topics
/
Optical character recognition

Topic

Optical character recognition

About: Optical character recognition is a research topic. Over the lifetime, 7342 publications have been published within this topic receiving 158193 citations. The topic is also known as: OCR & optical character reader.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

N-tuple features for OCR revisited

[...]

D.-M. Jung, Mukkai S. Krishnamoorthy¹, George Nagy¹, A. Shapira¹•Institutions (1)

Rensselaer Polytechnic Institute¹

01 Jul 1996-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work proves that the problem of finding a distinguishing n-tuple is NP-complete, by examining a natural subproblem with binary strings called the missing configuration problem, and exhibits a practical search algorithm for generating a collection of n-tuples with low class-conditional correlation and with specified design parameters n, p, and q.

...read moreread less

Abstract: N-tuple features for optical character recognition have received only scattered attention since the 1960s. Our main purpose is to show that advances in computer technology and computer science compel renewed interest. N-tuple features are useful for printed character classification because they indicate the presence or absence of a given rigid configuration of n black and white pixels in a pattern. Desirable n-tuples fit each pattern of a specified (positive) training set of characters in at least p different shift positions, and fail to fit each pattern of a specified (negative) training set by at least n-q pixels in each shift position. We prove that the problem of finding a distinguishing n-tuple is NP-complete, by examining a natural subproblem with binary strings called the missing configuration problem. The NP-completeness result notwithstanding, distinguishing n-tuples are found automatically in a few seconds on contemporary workstations. We exhibit a practical search algorithm for generating, from a small training set, a collection of n-tuples with low class-conditional correlation and with specified design parameters n, p, and q. The generator, which is available on the Internet, is empirically shown to be effective through a comparison with a benchmark generator. We show experimentally that the design parameters provide a useful tradeoff between distinguishing power and generation time, and also between the conditional probabilities for the positive and negative classes. We explore the feature probabilities obtainable for various dichotomies, and show that the design parameters control the feature probabilities.

...read moreread less

36 citations

Proceedings Article•DOI•

Real time text detection and recognition on hand held objects to assist blind people

[...]

Samruddhi Deshpande¹, Revati Shriram¹•Institutions (1)

MKSSS's Cummins College of Engineering for Women¹

01 Sep 2016

TL;DR: This paper presents camera based system which will help blind person for reading text patterns printed on hand held objects and the framework to assist visually impaired persons to read text patterns and convert it into the audio output.

...read moreread less

Abstract: This paper presents camera based system which will help blind person for reading text patterns printed on hand held objects. This is the framework to assist visually impaired persons to read text patterns and convert it into the audio output. To obtain the object from the background and extract the text pattern from that object, the system first proposes the method that will capture the image from the camera and object region is detected. The text which are maximally stable are detected using Maximally Stable External Regions (MSER) feature. A novel algorithm is evaluated on variety of scenes. The detected text is compared with the template and converted into the speech output. The text patterns are localized and binarized using Optical Character Recognition (OCR). The recognized text is converted to an audio output. The speech output is given to the blind user. Experimental results shows the analysis of MSER and OCR for different text patterns. MSER shows that it is robust algorithm for the text detection. Therefore, this paper deals with analysis of detection and recognition of different text patterns on different objects.

...read moreread less

36 citations

Journal Article•DOI•

On the recognition of Devanagari ancient handwritten characters using SIFT and Gabor features

[...]

Sonika Rani Narang¹, Manish Kumar Jindal², Shruti Ahuja¹, Munish Kumar³•Institutions (3)

DAV College, Chandigarh¹, Panjab University, Chandigarh², Punjab Technical University³

01 Nov 2020

TL;DR: Improved recognition results for Devanagari ancient characters have been presented using the scale-invariant feature transform (SIFT) and Gabor filter feature extraction techniques and poly-SVM classifier.

...read moreread less

Abstract: Recognition of Devanagari ancient handwritten character is an important task for resourceful contents' exploitation of the priceless information contained in them. There are numerous Devanagari ancient handwritten documents from fifteenth to the nineteenth century. This paper presents an optical character recognition system for the recognition of Devanagari ancient manuscripts. In this paper, improved recognition results for Devanagari ancient characters have been presented using the scale-invariant feature transform (SIFT) and Gabor filter feature extraction techniques. Support vector machine (SVM) classifier is used for the classification task in this work. For experimental results, a database consisting of 5484 samples of Devanagari characters was collected from various ancient manuscripts placed in libraries and museums. SIFT- and Gabor filter-based features are used to extract the properties of the handwritten Devanagari ancient characters for recognition. Principle component analysis is used to reduce the length of the feature vector for reducing training time of the model and to improve recognition accuracy. Recognition accuracy of 91.39% has been achieved using the proposed system based on tenfold cross-validation technique and poly-SVM classifier.

...read moreread less

36 citations

Journal Article•DOI•

Identification of text-only areas in mixed-type documents

[...]

C. Strouthopoulos¹, Nikos Papamarkos¹, Christodoulos Chamzas¹•Institutions (1)

Democritus University of Thrace¹

01 Aug 1997-Engineering Applications of Artificial Intelligence

TL;DR: The proposed segmentation method belongs to the bottom-up categories, and is more robust than other techniques, and can identify text regions in difficult cases such as skewed documents, non-rectangular text regions, or text included in drawings or halftone regions.

...read moreread less

36 citations

Proceedings Article•DOI•

Hand printed Arabic character recognition system

[...]

Adnan Amin¹, H.B. Al-Sadoun•Institutions (1)

University of New South Wales¹

09 Oct 1994

TL;DR: The paper proposes a structural technique for automatic recognition of hand printed Arabic characters that is more efficient for large and complex sets such as Arabic characters; not expensive for feature extraction; and its execution time does not depend on either the font or the size of the characters.

...read moreread less

Abstract: The paper proposes a structural technique for automatic recognition of hand printed Arabic characters. The advantages of this technique are: more efficient for large and complex sets such as Arabic characters; not expensive for feature extraction; and its execution time does not depend on either the font or the size of the characters. The algorithm was implemented on a microcomputer and tested by 10 different users. The recognition rate obtained was about 90%.

...read moreread less

36 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
…
155
156
157
158
159
160
161
…
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

7,941

Papers

180,323

Citations

No. of papers in the topic in previous years
Year	Papers
2023	186
2022	425
2021	333
2020	448
2019	430
2018	357

Optical character recognition

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics