Home
/
Authors
/
Gaurav Harit

Author

Gaurav Harit

Other affiliations: Indian Institutes of Technology, Indian Institute of Technology Delhi, Indian Institute of Technology Kharagpur

Bio: Gaurav Harit is an academic researcher from Indian Institute of Technology, Jodhpur. The author has contributed to research in topics: Character (mathematics) & Image segmentation. The author has an hindex of 13, co-authored 73 publications receiving 523 citations. Previous affiliations of Gaurav Harit include Indian Institutes of Technology & Indian Institute of Technology Delhi.

Papers published on a yearly basis

2023
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Skeletonizing character images using a modified medial axis-based strategy

[...]

Soumen Bag¹, Gaurav Harit²•Institutions (2)

Indian Institute of Technology Kharagpur¹, Indian Institutes of Technology²

01 Nov 2011-International Journal of Pattern Recognition and Artificial Intelligence

TL;DR: A thinning methodology applicable to character images that is novel in terms of its ability to adapt to local character shape while constructing the thinned skeleton while obtaining less spurious branches compared to other thinning methods.

...read moreread less

Abstract: In this paper we propose a thinning methodology applicable to character images. It is novel in terms of its ability to adapt to local character shape while constructing the thinned skeleton. Our method does not produce many of the distortions in the character shapes which normally result from the use of existing thinning algorithms. The proposed thinning methodology is based on the medial axis of the character. The skeleton has a width of one pixel. As a by-product of our thinning approach, the skeleton also gets segmented into strokes in vector form. Hence further stroke segmentation is not required. We have conducted experiments with printed and handwritten characters in several scripts such as English, Bengali, Hindi, Kannada and Tamil. We obtain less spurious branches compared to other thinning methods. Our method does not use any kind of post processing.

...read moreread less

8 citations

Journal Article•DOI•

Topographic Feature Extraction for Bengali and Hindi Character Images

[...]

Soumen Bag, Gaurav Harit

14 Jul 2011-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this paper, the authors developed topographic features of strokes visible with respect to views from different directions (e.g. North, South, East, and West) for optical character recognition (OCR).

...read moreread less

Abstract: Feature selection and extraction plays an important role in different classification based problems such as face recognition, signature verification, optical character recognition (OCR) etc. The performance of OCR highly depends on the proper selection and extraction of feature set. In this paper, we present novel features based on the topography of a character as visible from different viewing directions on a 2D plane. By topography of a character we mean the structural features of the strokes and their spatial relations. In this work we develop topographic features of strokes visible with respect to views from different directions (e.g. North, South, East, and West). We consider three types of topographic features: closed region, convexity of strokes, and straight line strokes. These features are represented as a shape-based graph which acts as an invariant feature set for discriminating very similar type characters efficiently. We have tested the proposed method on printed and handwritten Bengali and Hindi character images. Initial results demonstrate the efficacy of our approach.

...read moreread less

7 citations

Journal Article•DOI•

Clustering in video data: Dealing with heterogeneous semantics of features

[...]

Gaurav Harit¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

01 May 2006-Pattern Recognition

TL;DR: This work proposes a novel clustering strategy, tailored towards the specific requirements of clustering in video data, that takes care of many of the problems with traditional clustering schemes applied to the heterogeneous feature space of video.

...read moreread less

7 citations

Book Chapter•DOI•

Detection of structural concavities in character images--a writer-independent approach

[...]

Soumen Bag¹, Partha Bhowmick¹, Gaurav Harit•Institutions (1)

Indian Institute of Technology Kharagpur¹

12 Jan 2012

TL;DR: A novel technique for detection of concave regions as a structural information of character images by analyzing the sequence of discrete turns taken to describe the character stroke, which has the added advantage of detecting same concave areas of a particular character written by different individuals.

...read moreread less

Abstract: In this paper, we present a novel technique for detection of concave regions as a structural information of character images. The problem difficulty lies in reporting all concavities irrespective of the viewing direction on the 2D plane. In our approach, we detect concave regions by analyzing the sequence of discrete turns taken to describe the character stroke; hence, it becomes view-invariant. The proposed method has the added advantage of detecting same concave regions of a particular character written by different individuals. We have tested our method on printed and handwritten Bangla and Hindi isolated character images. Initial results demonstrate the efficacy of our approach.

...read moreread less

7 citations

Proceedings Article•DOI•

Word image based latent semantic indexing for conceptual querying in document image databases

[...]

Subhashis Banerjee¹, Gaurav Harit¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

23 Sep 2007

TL;DR: It is shown through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents.

...read moreread less

Abstract: In this paper we present an application of latent semantic analysis (LSA) for indexing and retrieval of document images with text The query is specified as a set of word images and the documents which best match with the query representation in the the latent semantic space are retrieved We show through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents

...read moreread less

7 citations

1
2
…
3
4
5
6
7
8
9
…
10
11
12
13
14
15

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

Document Analysis and Recognition

[...]

Takahiro Watanabe

25 Mar 1999-IEICE Transactions on Information and Systems

TL;DR: This paper addresses current topics about document image understanding from a technical point of view as a survey and proposes methods/approaches for recognition of various kinds of documents.

...read moreread less

Abstract: The subject about document image understanding is to extract and classify individual data meaningfully from paper-based documents. Until today, many methods/approaches have been proposed with regard to recognition of various kinds of documents, various technical problems for extensions of OCR, and requirements for practical usages. Of course, though the technical research issues in the early stage are looked upon as complementary attacks for the traditional OCR which is dependent on character recognition techniques, the application ranges or related issues are widely investigated or should be established progressively. This paper addresses current topics about document image understanding from a technical point of view as a survey. key words: document model, top-down, bottom-up, layout structure, logical structure, document types, layout recognition

...read moreread less

222 citations

Journal Article•DOI•

ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMCCAP)

[...]

Newton Lee

01 Apr 2007

TL;DR: Call for papers for Special Issue of ACM Transactions on Multimedia Computing, Communications and Applications on Interactive Digital Television.

...read moreread less

Abstract: Call for papers for Special Issue of ACM Transactions on Multimedia Computing, Communications and Applications on Interactive Digital Television

...read moreread less

201 citations

Journal Article•DOI•

Object Level Grouping for Video Shots

[...]

Josef Sivic¹, Frederik Schaffalitzky¹, Andrew Zisserman¹•Institutions (1)

University of Oxford¹

10 Apr 2006-International Journal of Computer Vision

TL;DR: A method for automatically obtaining object representations suitable for retrieval from generic video shots that includes associating regions within a single shot to represent a deforming object and an affine factorization method that copes with motion degeneracy.

...read moreread less

Abstract: We describe a method for automatically obtaining object representations suitable for retrieval from generic video shots. The object representation consists of an association of frame regions. These regions provide exemplars of the object's possible visual appearances. Two ideas are developed: (i) associating regions within a single shot to represent a deforming object; (ii) associating regions from the multiple visual aspects of a 3D object, thereby implicitly representing 3D structure. For the association we exploit temporal continuity (tracking) and wide baseline matching of affine covariant regions. In the implementation there are three areas of novelty: First, we describe a method to repair short gaps in tracks. Second, we show how to join tracks across occlusions (where many tracks terminate simultaneously). Third, we develop an affine factorization method that copes with motion degeneracy. We obtain tracks that last throughout the shot, without requiring a 3D reconstruction. The factorization method is used to associate tracks into object-level groups, with common motion. The outcome is that separate parts of an object that are not simultaneously visible (such as the front and back of a car, or the front and side of a face) are associated together. In turn this enables object-level matching and recognition throughout a video. We illustrate the method on the feature film "Groundhog Day." Examples are given for the retrieval of deforming objects (heads, walking people) and rigid objects (vehicles, locations).

...read moreread less

162 citations

Journal Article•DOI•

Offline Recognition of Devanagari Script: A Survey

[...]

R. Jayadevan¹, Satish R. Kolhe, Pradeep M. Patil², Umapada Pal•Institutions (2)

Pune Institute of Computer Technology¹, Vishwakarma Institute of Technology²

01 Nov 2011

TL;DR: In this paper, the state of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in various sections of the paper.

...read moreread less

Abstract: In India, more than 300 million people use Devanagari script for documentation. There has been a significant improvement in the research related to the recognition of printed as well as handwritten Devanagari text in the past few years. State of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in this paper. All feature-extraction techniques as well as training, classification and matching techniques useful for the recognition are discussed in various sections of the paper. An attempt is made to address the most important results reported so far and it is also tried to highlight the beneficial directions of the research till date. Moreover, the paper also contains a comprehensive bibliography of many selected papers appeared in reputed journals and conference proceedings as an aid for the researchers working in the field of Devanagari OCR.

...read moreread less

159 citations

Proceedings Article•DOI•

Table Detection Using Deep Learning

[...]

Azka Gilani, Shah Rukh Qasim¹, Imran Malik¹, Faisal Shafait¹•Institutions (1)

University of the Sciences¹

01 Nov 2017

TL;DR: The proposed method works with high precision on document images with varying layouts that include documents, research papers, and magazines and beats Tesseract's state of the art table detection system by a significant margin.

...read moreread less

Abstract: Table detection is a crucial step in many document analysis applications as tables are used for presenting essential information to the reader in a structured manner. It is a hard problem due to varying layouts and encodings of the tables. Researchers have proposed numerous techniques for table detection based on layout analysis of documents. Most of these techniques fail to generalize because they rely on hand engineered features which are not robust to layout variations. In this paper, we have presented a deep learning based method for table detection. In the proposed method, document images are first pre-processed. These images are then fed to a Region Proposal Network followed by a fully connected neural network for table detection. The proposed method works with high precision on document images with varying layouts that include documents, research papers, and magazines. We have done our evaluations on publicly available UNLV dataset where it beats Tesseract's state of the art table detection system by a significant margin.

...read moreread less

159 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110

Collapse