Author

Palaiahnakote Shivakumara

Bio: Palaiahnakote Shivakumara is an academic researcher from Information Technology University. The author has contributed to research in the topics of Pixel and Feature extraction. The author has an h-index of 32 and has co-authored 215 publications receiving 3,377 citations. Previous affiliations of Palaiahnakote Shivakumara include the National University of Singapore and the University of Malaya.


Papers
Proceedings ArticleDOI
01 Dec 2013
TL;DR: This paper introduces a new dataset called StreetViewText-Perspective, which contains texts in street images with a great variety of viewpoints; the proposed recognition method significantly outperforms the state of the art on perspective texts of arbitrary orientations.
Abstract: This paper presents an approach to text recognition in natural scene images. Unlike most existing works which assume that texts are horizontal and frontal parallel to the image plane, our method is able to recognize perspective texts of arbitrary orientations. For individual character recognition, we adopt a bag-of-key points approach, in which Scale Invariant Feature Transform (SIFT) descriptors are extracted densely and quantized using a pre-trained vocabulary. Following [1, 2], the context information is utilized through lexicons. We formulate word recognition as finding the optimal alignment between the set of characters and the list of lexicon words. Furthermore, we introduce a new dataset called StreetViewText-Perspective, which contains texts in street images with a great variety of viewpoints. Experimental results on public datasets and the proposed dataset show that our method significantly outperforms the state-of-the-art on perspective texts of arbitrary orientations.
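As a sketch of the bag-of-keypoints representation described in the abstract, the snippet below quantizes densely extracted SIFT descriptors against a pre-trained visual vocabulary and pools them into a normalized histogram per character. The array shapes, the random stand-in data, and the hard nearest-word assignment are illustrative assumptions; the authors' trained vocabulary and character classifier are not reproduced here.

```python
import numpy as np

def bag_of_keypoints(descriptors: np.ndarray, vocabulary: np.ndarray) -> np.ndarray:
    """Quantize 128-D SIFT descriptors against a visual vocabulary and
    return a normalized word-count histogram (the character feature)."""
    # Squared distances via ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b
    d2 = ((descriptors ** 2).sum(1)[:, None]
          + (vocabulary ** 2).sum(1)[None, :]
          - 2.0 * descriptors @ vocabulary.T)
    words = d2.argmin(axis=1)  # hard assignment to the nearest visual word
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / max(hist.sum(), 1.0)  # L1-normalize

# Hypothetical usage: 500 dense descriptors against a 1024-word vocabulary.
rng = np.random.default_rng(0)
feature = bag_of_keypoints(rng.normal(size=(500, 128)), rng.normal(size=(1024, 128)))
print(feature.shape)  # (1024,)
```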

378 citations

Journal ArticleDOI
TL;DR: Experimental results show that the proposed method is able to handle graphics text and scene text of both horizontal and nonhorizontal orientation.
Abstract: In this paper, we propose a method based on the Laplacian in the frequency domain for video text detection. Unlike many other approaches which assume that text is horizontally-oriented, our method is able to handle text of arbitrary orientation. The input image is first filtered with Fourier-Laplacian. K-means clustering is then used to identify candidate text regions based on the maximum difference. The skeleton of each connected component helps to separate the different text strings from each other. Finally, text string straightness and edge density are used for false positive elimination. Experimental results show that the proposed method is able to handle graphics text and scene text of both horizontal and nonhorizontal orientation.
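A minimal sketch of the frequency-domain pipeline the abstract describes: the frame is Laplacian-filtered via the Fourier identity F{∇²f} = −4π²(u² + v²)·F{f}, a per-pixel maximum-difference map is computed over a sliding window, and a two-cluster k-means separates candidate text pixels. The window size and the plain scalar two-means are illustrative assumptions, not the paper's exact settings.

```python
import numpy as np
from scipy import ndimage

def fourier_laplacian(img: np.ndarray) -> np.ndarray:
    """Apply the Laplacian in the frequency domain."""
    u = np.fft.fftfreq(img.shape[0])[:, None]
    v = np.fft.fftfreq(img.shape[1])[None, :]
    return np.real(np.fft.ifft2(-4 * np.pi**2 * (u**2 + v**2) * np.fft.fft2(img)))

def max_difference(x: np.ndarray, win: int = 5) -> np.ndarray:
    """Max minus min of the filtered response in a win x win neighborhood."""
    return ndimage.maximum_filter(x, win) - ndimage.minimum_filter(x, win)

def two_means_mask(values: np.ndarray, iters: int = 20) -> np.ndarray:
    """k-means with k=2 on a scalar map; True marks the higher-mean ('text') cluster."""
    c_lo, c_hi = values.min(), values.max()
    for _ in range(iters):
        mask = np.abs(values - c_hi) < np.abs(values - c_lo)
        c_lo, c_hi = values[~mask].mean(), values[mask].mean()
    return mask

# Hypothetical usage on a synthetic grayscale frame.
frame = np.random.default_rng(2).random((120, 160))
text_mask = two_means_mask(max_difference(fourier_laplacian(frame)))
```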

278 citations

Journal ArticleDOI
TL;DR: An efficient approach for face image feature extraction, namely the (2D)^2LDA method, is presented; it achieves good recognition accuracy despite using fewer coefficients.
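Since only the TL;DR is available for this paper, here is a minimal numpy sketch of a two-directional 2D LDA ((2D)^2LDA-style) feature extractor: discriminant projections are learned along both the row and column directions of the face image matrices, and each image is projected from both sides down to a small q × d coefficient matrix. The dimensions, the ridge term, and the eigen-solver are illustrative assumptions rather than the paper's exact formulation.

```python
import numpy as np

def lda_directions(images: np.ndarray, labels: np.ndarray, dim: int) -> np.ndarray:
    """Top-`dim` generalized eigenvectors of the column-direction
    between-class vs. within-class image scatter matrices."""
    global_mean = images.mean(axis=0)
    n_cols = images.shape[2]
    Sb = np.zeros((n_cols, n_cols))
    Sw = np.zeros((n_cols, n_cols))
    for c in np.unique(labels):
        Ac = images[labels == c]
        mc = Ac.mean(axis=0)
        d = mc - global_mean
        Sb += len(Ac) * d.T @ d          # between-class scatter
        for A in Ac:
            e = A - mc
            Sw += e.T @ e                # within-class scatter
    # Generalized eigenproblem Sb w = lambda Sw w, with a small ridge for stability.
    vals, vecs = np.linalg.eig(np.linalg.solve(Sw + 1e-6 * np.eye(n_cols), Sb))
    order = np.argsort(vals.real)[::-1]
    return vecs.real[:, order[:dim]]

def two_dir_2dlda(images: np.ndarray, labels: np.ndarray, q: int = 4, d: int = 4) -> np.ndarray:
    W = lda_directions(images, labels, d)                     # column direction (n x d)
    Z = lda_directions(images.transpose(0, 2, 1), labels, q)  # row direction (m x q)
    return np.array([Z.T @ A @ W for A in images])            # q x d coefficients per image

# Hypothetical usage: 20 face images of size 32x28 from 4 classes.
rng = np.random.default_rng(1)
X, y = rng.normal(size=(20, 32, 28)), np.repeat(np.arange(4), 5)
print(two_dir_2dlda(X, y).shape)  # (20, 4, 4)
```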

156 citations

Journal ArticleDOI
TL;DR: A new enhancement method based on the product of Laplacian and Sobel operations is presented to enhance text pixels in videos, together with a Bayesian classifier that does not assume an a priori probability for the input frame but estimates it from three probable matrices.
Abstract: Multioriented text detection in video frames is not as easy as detection of captions, graphics, or overlaid texts, which usually appear in the horizontal direction and have high contrast compared to their background. Multioriented text generally refers to scene text, which makes text detection more challenging and interesting due to the unfavorable characteristics of scene text. Therefore, conventional text detection methods may not give good results for multioriented scene text detection. Hence, in this paper, we present a new enhancement method that uses the product of Laplacian and Sobel operations to enhance text pixels in videos. To classify true text pixels, we propose a Bayesian classifier that does not assume an a priori probability for the input frame but estimates it based on three probable matrices. Three different ways of clustering are performed on the output of the enhancement method to obtain the three probable matrices. Text candidates are obtained by intersecting the output of the Bayesian classifier with the Canny edge map of the input frame. A boundary growing method, based on the concept of nearest neighbors, is introduced to traverse the multioriented scene text lines using text candidates. The robustness of the method has been tested on a variety of datasets, including our own dataset (nonhorizontal and horizontal text data) and two publicly available datasets, namely the video frames of Hua and the complex scene text data of the ICDAR 2003 competition (camera images). Experimental results show that the performance of the proposed method is encouraging compared with that of existing methods in terms of recall, precision, F-measure, and computational time.
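A hedged sketch of the enhancement step using OpenCV: the absolute Laplacian response is multiplied by the Sobel gradient magnitude so that pixels responding strongly to both operators (typical of text strokes) are boosted, and candidates are then intersected with the Canny edge map as the abstract describes. The kernel sizes, the 0.5 threshold, and the synthetic frame are illustrative assumptions; the Bayesian classification stage is not reproduced.

```python
import cv2
import numpy as np

def laplacian_sobel_product(gray: np.ndarray) -> np.ndarray:
    """Boost pixels that respond strongly to both the Laplacian and Sobel operators."""
    lap = np.abs(cv2.Laplacian(gray, cv2.CV_64F, ksize=3))
    gx = cv2.Sobel(gray, cv2.CV_64F, 1, 0, ksize=3)
    gy = cv2.Sobel(gray, cv2.CV_64F, 0, 1, ksize=3)
    prod = lap * np.hypot(gx, gy)
    return prod / prod.max() if prod.max() > 0 else prod  # scale to [0, 1]

# Hypothetical usage on a synthetic grayscale frame; text candidates are the
# strong enhancement responses intersected with the frame's Canny edge map.
frame = np.random.randint(0, 256, (120, 160), dtype=np.uint8)
enhanced = laplacian_sobel_product(frame)
edges = cv2.Canny(frame, 100, 200)
candidates = (enhanced > 0.5) & (edges > 0)
```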

114 citations

Proceedings ArticleDOI
26 Jul 2009
TL;DR: The proposed text detection method employs empirical rules based on geometrical properties to eliminate false positives and outperforms three existing methods in terms of detection and false-positive rates.
Abstract: In this paper, we propose an efficient text detection method based on the Laplacian operator. The maximum gradient difference value is computed for each pixel in the Laplacian-filtered image. K-means is then used to classify all the pixels into two clusters: text and non-text. For each candidate text region, the corresponding region in the Sobel edge map of the input image undergoes projection profile analysis to determine the boundary of the text blocks. Finally, we employ empirical rules to eliminate false positives based on geometrical properties. Experimental results show that the proposed method is able to detect text of different fonts, contrast and backgrounds. Moreover, it outperforms three existing methods in terms of detection and false positive rates.
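The projection-profile step in the abstract can be sketched compactly: within a candidate region of the Sobel edge map, row and column edge-density profiles are thresholded to find the text block boundaries. The threshold fraction and the synthetic edge map below are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def profile_bounds(edge_region: np.ndarray, frac: float = 0.1):
    """Return (top, bottom, left, right) where the row/column edge-pixel
    counts of a binary edge map exceed `frac` of their maximum."""
    rows = edge_region.sum(axis=1).astype(float)
    cols = edge_region.sum(axis=0).astype(float)
    r = np.where(rows > frac * rows.max())[0]
    c = np.where(cols > frac * cols.max())[0]
    return r[0], r[-1], c[0], c[-1]

# Hypothetical usage on a synthetic Sobel edge-map crop with a dense block.
region = np.zeros((40, 100), dtype=np.uint8)
region[12:28, 20:80] = np.random.randint(0, 2, (16, 60))
print(profile_bounds(region))  # approximately (12, 27, 20, 79)
```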

97 citations


Cited by
Reference EntryDOI
15 Oct 2004

2,118 citations

Journal ArticleDOI
TL;DR: The Rotation Region Proposal Networks are designed to generate inclined proposals with text orientation angle information that are adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation.
Abstract: This paper introduces a novel rotation-based framework for arbitrary-oriented text detection in natural scene images. We present the Rotation Region Proposal Networks, which are designed to generate inclined proposals with text orientation angle information. The angle information is then adapted for bounding box regression to make the proposals more accurately fit into the text region in terms of the orientation. The Rotation Region-of-Interest pooling layer is proposed to project arbitrary-oriented proposals to a feature map for a text region classifier. The whole framework is built upon a region-proposal-based architecture, which ensures the computational efficiency of the arbitrary-oriented text detection compared with previous text detection systems. We conduct experiments using the rotation-based framework on three real-world scene text detection datasets and demonstrate its superiority in terms of effectiveness and efficiency over previous approaches.
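A compact sketch of the angle-aware box parameterization that rotation region proposal approaches like this one typically use: each rotated box (cx, cy, w, h, θ) is regressed relative to a rotated anchor, with the angle offset as a fifth target. The exact normalization below follows the common rotated-anchor convention and is an assumption, not necessarily this paper's implementation.

```python
import numpy as np

def encode(box: np.ndarray, anchor: np.ndarray) -> np.ndarray:
    """Regression targets of a rotated ground-truth box w.r.t. a rotated anchor."""
    (x, y, w, h, t), (xa, ya, wa, ha, ta) = box, anchor
    return np.array([(x - xa) / wa, (y - ya) / ha,
                     np.log(w / wa), np.log(h / ha),
                     (t - ta) % (2 * np.pi)])

def decode(deltas: np.ndarray, anchor: np.ndarray) -> np.ndarray:
    """Invert `encode`: recover a rotated box from predicted deltas."""
    tx, ty, tw, th, tt = deltas
    xa, ya, wa, ha, ta = anchor
    return np.array([tx * wa + xa, ty * ha + ya,
                     np.exp(tw) * wa, np.exp(th) * ha,
                     (ta + tt) % (2 * np.pi)])

# Round-trip check with a hypothetical ground-truth box and anchor.
gt = np.array([50.0, 40.0, 120.0, 30.0, np.pi / 6])
anchor = np.array([48.0, 44.0, 100.0, 28.0, 0.0])
print(np.allclose(decode(encode(gt, anchor), anchor), gt))  # True
```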

1,002 citations

Proceedings ArticleDOI
16 Jun 2012
TL;DR: A system which detects texts of arbitrary orientations in natural images using a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts; a new dataset and evaluation protocol are proposed to better evaluate the algorithm and compare it with competing algorithms.
Abstract: With the increasing popularity of practical vision systems and smart phones, text detection in natural scenes becomes a critical yet challenging task. Most existing methods have focused on detecting horizontal or near-horizontal texts. In this paper, we propose a system which detects texts of arbitrary orientations in natural images. Our algorithm is equipped with a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate our algorithm and compare it with other competing algorithms, we generate a new dataset, which includes various texts in diverse real-world scenarios; we also propose a protocol for performance evaluation. Experiments on benchmark datasets and the proposed dataset demonstrate that our algorithm compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on texts of arbitrary orientations in complex natural scenes.

750 citations