Home
/
Authors
/
Santanu Chaudhury

Author

Santanu Chaudhury

Other affiliations: Central Electronics Engineering Research Institute, Indian Institute of Technology Delhi, Indian Statistical Institute ...read more

Bio: Santanu Chaudhury is an academic researcher from Indian Institute of Technology, Jodhpur. The author has contributed to research in topics: Ontology (information science) & Image segmentation. The author has an hindex of 28, co-authored 380 publications receiving 3691 citations. Previous affiliations of Santanu Chaudhury include Central Electronics Engineering Research Institute & Indian Institute of Technology Delhi.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1988

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Echocardiogram view classification with appearance and spatial distributions

[...]

Ronak Gupta¹, Santanu Chaudhury¹, Navneeth Subramanian², Satish Govind•Institutions (2)

Indian Institute of Technology Delhi¹, General Electric²

16 Apr 2015

TL;DR: An approach for view classification, Spatial Pyramid Histogram of Words which successfully models the appearance and shape distributions of object class and shows a classification accuracy of 98.3% on an exhaustive database of 703 ultrasound images.

...read moreread less

Abstract: When imaging the heart, using a 2D ultrasound probe, different views can manifest depending on the location and angulations of the probe. Some of these views have been labeled as standard views, due to the presentation and ease of assessment of key cardiac structures in them. We present an approach for automatic recognition and classification of these standard views, as a potential enabler for automated measurements or detection of noise — all without a human in the loop. We present an approach for view classification, Spatial Pyramid Histogram of Words which successfully models the appearance and shape distributions of object class. We demonstrate the effectiveness of this technique for the task of discrimination between the B-mode Parasternal Long Axis (PLAX) and the Short Axis (SAX) echocardiograms. For this task, our method shows a classification accuracy of 98.3% on an exhaustive database of 703 ultrasound images.

...read moreread less

Book Chapter•DOI•

Info-Graphics Retrieval: A Multi-kernel Distance Based Hashing Scheme

[...]

Ritu Garg¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

19 Dec 2016

TL;DR: This paper presents a multi-modal document image retrieval framework by learning an optimal fusion of information from text and info-graphics regions and demonstrates the evaluation of the proposed concept on documents collected from various sources.

...read moreread less

Abstract: Information retrieval research has shown significant improvement and provided techniques that retrieve documents in image or text form. However, retrieval of multi-modal documents has been given very less attention. We aim to build a system for retrieval of documents with embedded information graphics (Info-graphics). Info-graphics are images of bar charts and line graphs appearing with textual components in magazines, newspapers, and journals. In this paper, we present multi-modal document image retrieval framework by learning an optimal fusion of information from text and info-graphics regions. The evaluation of the proposed concept is demonstrated on documents collected from various sources such as magazines and journals.

...read moreread less

Proceedings Article•DOI•

Sparse representation based classifier to assess video quality

[...]

Manoj Sharma¹, Santanu Chaudhury¹, Brejesh Lall¹•Institutions (1)

Indian Institute of Technology Delhi¹

01 Dec 2015

TL;DR: This work identified the fact that correct frame can be represented precisely in terms of dictionary atoms but while representing a distorted frame, the error drastically increases with increase in distortion thus it can easily classify the frames as correct and distorted based on error score calculated by sparse representation framework.

...read moreread less

Abstract: This paper describes a sparse representation based approach to learn a classifier for assessing the video quality without a reference. First we calculate the natural scene statistics (NSS) based spatial features of each frame/image and then learn a dictionary by K-SVD algorithm from NSS features of correct frames. In this work we identified the fact that correct frame can be represented precisely in terms of dictionary atoms but while representing a distorted frame, the error drastically increases with increase in distortion thus we can easily classify the frames as correct and distorted based on error score calculated by sparse representation framework. This framework has been validated on two datasets and we observe improved accuracies as compared to state-of-art algorithms.

...read moreread less

Journal Article•DOI•

An Ontology Representation Language for Multimedia Event Applications

[...]

Nisha Pahal¹, Brejesh Lall¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

16 Mar 2021-Journal of Web Engineering

TL;DR: In this paper, a new Multimedia Web Ontology Language (E-MOWL) is presented to handle events with media depictions, where the temporal, spatial and entity aspects that are implicitly linked to an event are represented through this language to model the context of events.

...read moreread less

Abstract: This paper presents formalization of a new Multimedia Web Ontology Language (E-MOWL) to handle events with media depictions. The temporal, spatial and entity aspects that are implicitly linked to an event are represented through this language to model the context of events. The already existing Multimedia Web Ontology Language (MOWL) can be leveraged for perceptual modelling of a domain, where the concepts manifest into media patterns in the multimedia document and helps in semantic processing of the contents. The language E-MOWL provides a rich method for representing knowledge corresponding to a specific domain wherein the context specifies the intended meaning of each element of the domain of discourse; an element in different context may correspond to different functional role. The context information associated with an event ties the audiovisual data with event related aspects. All these aspects when considered altogether provide the evidence and contribute towards recognizing an event from multimedia documents. The language also enables reasoning with the uncertainty associated with the events and is organized in the form of Bayesian Network (BN). The media items that are semantically relevant can be assimilated together on the basis of their association with events. We have demonstrated the efficacy of our approach by utilizing an ontology for the entertainment category in news domain to offer an application \textit{news aggregation} and event-based book recommendations.

...read moreread less

Book Chapter•DOI•

Incremental Learning of Non-stationary Temporal Causal Networks for Telecommunication Domain

[...]

Ram Mohan, Santanu Chaudhury¹, Brejesh Lall¹•Institutions (1)

Indian Institutes of Technology¹

05 Dec 2017

TL;DR: A novel framework is applied on a telecommunication operator’s data and the framework detects the concept drift related to changes in revenue associated with data usage and the incremental causal network learning algorithm updates the knowledge accordingly.

...read moreread less

Abstract: In today’s competitive telecommunication industry understanding the causes that influence the revenue is of importance. In a continuously evolving business environment, the causes that influence the revenue keeps changing. To understand and quantify the effect of different factors we model it as a non-stationary temporal causal network. To handle the massive volume of data, we propose a novel framework as part of which we define rules to identify the concept drift and propose an incremental algorithm for learning non-stationary temporal causal structure from streaming data. We apply the framework on a telecommunication operator’s data and the framework detects the concept drift related to changes in revenue associated with data usage and the incremental causal network learning algorithm updates the knowledge accordingly.

...read moreread less

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
…
73
74
75
76
77
78

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Data clustering: a review

[...]

Anil K. Jain¹, M. N. Murty², Patrick J. Flynn³•Institutions (3)

Michigan State University¹, Indian Institute of Science², Ohio State University³

01 Sep 1999-ACM Computing Surveys

TL;DR: An overview of pattern clustering methods from a statistical pattern recognition perspective is presented, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners.

...read moreread less

Abstract: Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur. This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners. We present a taxonomy of clustering techniques, and identify cross-cutting themes and recent advances. We also describe some important applications of clustering algorithms such as image segmentation, object recognition, and information retrieval.

...read moreread less

14,054 citations

Journal Article•

“Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告

[...]

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Computer vision : a modern approach = 计算机视觉 : 一种现代的方法

[...]

David Forsyth, Jean Ponce

01 Jan 2004

TL;DR: Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance and describes numerous important application areas such as image based rendering and digital libraries.

...read moreread less

Abstract: From the Publisher: The accessible presentation of this book gives both a general view of the entire computer vision enterprise and also offers sufficient detail to be able to build useful applications. Users learn techniques that have proven to be useful by first-hand experience and a wide range of mathematical methods. A CD-ROM with every copy of the text contains source code for programming practice, color images, and illustrative movies. Comprehensive and up-to-date, this book includes essential topics that either reflect practical significance or are of theoretical importance. Topics are discussed in substantial and increasing depth. Application surveys describe numerous important application areas such as image based rendering and digital libraries. Many important algorithms broken down and illustrated in pseudo code. Appropriate for use by engineers as a comprehensive reference to the computer vision enterprise.

...read moreread less

3,627 citations

Journal Article•DOI•

Online and off-line handwriting recognition: a comprehensive survey

[...]

Réjean Plamondon¹, Sargur N. Srihari²•Institutions (2)

École Normale Supérieure¹, University at Buffalo²

01 Jan 2000-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The nature of handwritten language, how it is transduced into electronic data, and the basic concepts behind written language recognition algorithms are described.

...read moreread less

Abstract: Handwriting has continued to persist as a means of communication and recording information in day-to-day life even with the introduction of new technologies. Given its ubiquity in human transactions, machine recognition of handwriting has practical significance, as in reading handwritten notes in a PDA, in postal addresses on envelopes, in amounts in bank checks, in handwritten fields in forms, etc. This overview describes the nature of handwritten language, how it is transduced into electronic data, and the basic concepts behind written language recognition algorithms. Both the online case (which pertains to the availability of trajectory data during writing) and the off-line case (which pertains to scanned images) are considered. Algorithms for preprocessing, character and word recognition, and performance with practical systems are indicated. Other fields of application, like signature verification, writer authentification, handwriting learning tools are also considered.

...read moreread less

2,653 citations

Reference Entry•DOI•

IEEE Transactions on Pattern Analysis and Machine Intelligence

[...]

King-Sun Fu

15 Oct 2004

2,118 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse