Home
/
Authors
/
S.. Kumar

Author

S.. Kumar

Bio: S.. Kumar is an academic researcher from IBM. The author has contributed to research in topics: Image segmentation & Histogram matching. The author has an hindex of 2, co-authored 2 publications receiving 156 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Text Extraction and Document Image Segmentation Using Matched Wavelets and MRF Model

[...]

S.. Kumar¹, Rajat Gupta, Nitin Khanna², Santanu Chaudhury³, Shashank Joshi³ - Show less +1 more•Institutions (3)

IBM¹, Purdue University², Indian Institute of Technology Delhi³

01 Aug 2007-IEEE Transactions on Image Processing

TL;DR: A clustering-based technique has been devised for estimating globally matched wavelet filters using a collection of groundtruth images and a text extraction scheme for the segmentation of document images into text, background, and picture components is extended.

...read moreread less

Abstract: In this paper, we have proposed a novel scheme for the extraction of textual areas of an image using globally matched wavelet filters. A clustering-based technique has been devised for estimating globally matched wavelet filters using a collection of groundtruth images. We have extended our text extraction scheme for the segmentation of document images into text, background, and picture components (which include graphics and continuous tone images). Multiple, two-class Fisher classifiers have been used for this purpose. We also exploit contextual information by using a Markov random field formulation-based pixel labeling scheme for refinement of the segmentation results. Experimental results have established effectiveness of our approach.

...read moreread less

159 citations

Proceedings Article•DOI•

Enhancement of Old Manuscript Images

[...]

Ajay Gupta¹, S.. Kumar², R. D. Gupta³, Santanu Chaudhury¹, Shiv Dutt Joshi¹ - Show less +1 more•Institutions (3)

Indian Institute of Technology Delhi¹, IBM², Cypress Semiconductor³

23 Sep 2007

TL;DR: A segmentation based histogram matching scheme for enhancing small portions of the text in these manuscripts that have degraded with time and are not readable is proposed.

...read moreread less

Abstract: In this paper we address the issue of enhancement in the quality of scanned images of old manuscripts. Small portions of the text in these manuscripts have degraded with time and are not readable. We propose a segmentation based histogram matching scheme for enhancing these degraded text regions. To automatically identify the degraded text we use a matched wavelet based text extraction algorithm followed by MRF(Markov Random Field) post processing. Additionally we do background clearing to improve the quality of results. This method does not require any a priori information about the font, font size, background texture or geometric transformation. We have tested our method on a variety of manuscript images. The results show proposed method to be a robust, versatile and effective tool for enhancement of manuscript images.

...read moreread less

8 citations

Cited by

PDF

Open Access

More filters

Proceedings Article•

Character recognition in natural images

[...]

Teofilo de Campos¹, Bodla Rakesh Babu², Manik Varma³•Institutions (3)

Xerox¹, International Institute of Information Technology, Hyderabad², Microsoft³

01 Feb 2009

TL;DR: It is demonstrated that the performance of the proposed method can be far superior to that of commercial OCR systems, and can benefit from synthetically generated training data obviating the need for expensive data collection and annotation.

...read moreread less

Abstract: This paper tackles the problem of recognizing characters in images of natural scenes. In particular, we focus on recognizing characters in situations that would traditionally not be handled well by OCR techniques. We present an annotated database of images containing English and Kannada characters. The database comprises of images of street scenes taken in Bangalore, India using a standard camera. The problem is addressed in an object cateogorization framework based on a bag-of-visual-words representation. We assess the performance of various features based on nearest neighbour and SVM classification. It is demonstrated that the performance of the proposed method, using as few as 15 training images, can be far superior to that of commercial OCR systems. Furthermore, the method can benefit from synthetically generated training data obviating the need for expensive data collection and annotation.

...read moreread less

520 citations

Journal Article•DOI•

Text String Detection From Natural Scenes by Structure-Based Partition and Grouping

[...]

Chucai Yi¹, Yingli Tian²•Institutions (2)

City University of New York¹, IBM²

01 Sep 2011-IEEE Transactions on Image Processing

TL;DR: A new framework to detect text strings with arbitrary orientations in complex natural scene images with outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation.

...read moreread less

Abstract: Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from a complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) image partition to find text character candidates based on local gradient features and color uniformity of character components and 2) character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in nonhorizontal orientations.

...read moreread less

355 citations

Journal Article•DOI•

Robust Document Image Binarization Technique for Degraded Document Images

[...]

Bolan Su¹, Shijian Lu², Chew Lim Tan¹•Institutions (2)

National University of Singapore¹, Institute for Infocomm Research Singapore²

01 Apr 2013-IEEE Transactions on Image Processing

TL;DR: A novel document image binarization technique that addresses issues ofSegmentation of text from badly degraded document images by using adaptive image contrast, a combination of the local image contrast and theLocal image gradient that is tolerant to text and background variation caused by different types of document degradations.

...read moreread less

Abstract: Segmentation of text from badly degraded document images is a very challenging task due to the high inter/intra-variation between the document background and the foreground text of different document images. In this paper, we propose a novel document image binarization technique that addresses these issues by using adaptive image contrast. The adaptive image contrast is a combination of the local image contrast and the local image gradient that is tolerant to text and background variation caused by different types of document degradations. In the proposed technique, an adaptive contrast map is first constructed for an input degraded document image. The contrast map is then binarized and combined with Canny's edge map to identify the text stroke edge pixels. The document text is further segmented by a local threshold that is estimated based on the intensities of detected text stroke edge pixels within a local window. The proposed method is simple, robust, and involves minimum parameter tuning. It has been tested on three public datasets that are used in the recent document image binarization contest (DIBCO) 2009 & 2011 and handwritten-DIBCO 2010 and achieves accuracies of 93.5%, 87.8%, and 92.03%, respectively, that are significantly higher than or close to that of the best-performing methods reported in the three contests. Experiments on the Bickley diary dataset that consists of several challenging bad quality document images also show the superior performance of our proposed method, compared with other techniques.

...read moreread less

255 citations

Journal Article•DOI•

Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification

[...]

Chucai Yi¹, Yingli Tian¹•Institutions (1)

City University of New York¹

01 Sep 2012-IEEE Transactions on Image Processing

TL;DR: The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons and demonstrates that the framework outperforms state-of-the-art localization algorithms.

...read moreread less

Abstract: In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.

...read moreread less

135 citations

Journal Article•DOI•

Text detection in images using sparse representation with discriminative dictionaries

[...]

Ming Zhao¹, Shutao Li¹, James T. Kwok²•Institutions (2)

Hunan University¹, Hong Kong University of Science and Technology²

01 Dec 2010-Image and Vision Computing

TL;DR: A classification-based algorithm for text detection using a sparse representation with discriminative dictionaries that can effectively detect texts of various sizes, fonts and colors from images and videos.

...read moreread less

106 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34

Collapse