Author

Poojan Oza

Bio: Poojan Oza is an academic researcher from Johns Hopkins University. The author has contributed to research in topics including convolutional neural networks and computer science. The author has an h-index of 10 and has co-authored 26 publications receiving 393 citations. Previous affiliations of Poojan Oza include Indian Institute of Technology Gandhinagar.

Papers
Book ChapterDOI
27 Sep 2021
TL;DR: The authors propose a gated axial-attention model that extends existing transformer-based architectures by introducing an additional control mechanism in the self-attention module.
Abstract: Over the past decade, deep convolutional neural networks have been widely adopted for medical image segmentation and have been shown to achieve adequate performance. However, due to inherent inductive biases present in convolutional architectures, they lack understanding of long-range dependencies in the image. Recently proposed transformer-based architectures that leverage the self-attention mechanism encode long-range dependencies and learn representations that are highly expressive. This motivates us to explore transformer-based solutions and study the feasibility of using transformer-based network architectures for medical image segmentation tasks. The majority of existing transformer-based network architectures proposed for vision applications require large-scale datasets to train properly. However, compared to the datasets for vision applications, in medical imaging the number of data samples is relatively low, making it difficult to efficiently train transformers for medical imaging applications. To this end, we propose a gated axial-attention model which extends the existing architectures by introducing an additional control mechanism in the self-attention module. Furthermore, to train the model effectively on medical images, we propose a Local-Global training strategy (LoGo) which further improves the performance. Specifically, we operate on the whole image and on patches to learn global and local features, respectively. The proposed Medical Transformer (MedT) is evaluated on three different medical image segmentation datasets and is shown to achieve better performance than convolutional and other related transformer-based architectures. Code: https://github.com/jeya-maria-jose/Medical-Transformer
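A minimal, hedged sketch of the gated axial-attention idea described above: 1D (axial) self-attention whose relative positional terms are scaled by learnable gates, so that positional encodings learned from scarce data can be down-weighted. The tensor shapes, module names, and gating placement are my own assumptions for illustration, not the reference implementation from the MedT repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedAxialAttention1D(nn.Module):
    """Self-attention along a single axis with gated relative positional terms."""
    def __init__(self, dim, heads=4, axis_len=64):
        super().__init__()
        assert dim % heads == 0
        self.heads, self.dh = heads, dim // heads
        self.qkv = nn.Linear(dim, dim * 3, bias=False)
        # relative positional embeddings for the query, key and value terms
        self.rel = nn.Parameter(torch.randn(3, axis_len, self.dh) * 0.02)
        # learnable gates controlling how much each positional term contributes
        self.gates = nn.Parameter(torch.ones(3))
        self.out = nn.Linear(dim, dim)

    def forward(self, x):                      # x: (batch, length, dim), length <= axis_len
        b, n, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # split heads: (batch, heads, length, dh)
        q, k, v = (t.view(b, n, self.heads, self.dh).transpose(1, 2) for t in (q, k, v))
        rq, rk, rv = self.rel[:, :n]            # (length, dh) each
        # content-content logits plus gated content-position logits
        logits = q @ k.transpose(-1, -2)
        logits = logits + self.gates[0] * torch.einsum('bhnd,md->bhnm', q, rq)
        logits = logits + self.gates[1] * torch.einsum('bhnd,md->bhnm', k, rk)
        attn = F.softmax(logits / self.dh ** 0.5, dim=-1)
        out = attn @ v + self.gates[2] * torch.einsum('bhnm,md->bhnd', attn, rv)
        return self.out(out.transpose(1, 2).reshape(b, n, -1))
```

In a full model this block would be applied once along the height axis and once along the width axis of a feature map; the LoGo strategy from the abstract trains one branch on the whole image and another on patches, which is not shown here.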

464 citations

Proceedings ArticleDOI
15 Jun 2019
TL;DR: In this paper, the authors proposed an open-set recognition algorithm using class-conditioned auto-encoders with novel training and testing methodologies, where the training procedure is divided into two sub-tasks: 1. closed-set classification and 2. open-set identification.
Abstract: Models trained for classification often assume that all testing classes are known while training. As a result, when presented with an unknown class during testing, such a closed-set assumption forces the model to classify it as one of the known classes. However, in a real-world scenario, classification models are likely to encounter such examples. Hence, identifying those examples as unknown becomes critical to model performance. A potential solution to this problem lies in a class of learning problems known as open-set recognition. It refers to the problem of identifying unknown classes during testing while maintaining performance on the known classes. In this paper, we propose an open-set recognition algorithm using class-conditioned auto-encoders with novel training and testing methodologies. In this method, the training procedure is divided into two sub-tasks: 1. closed-set classification and 2. open-set identification (i.e. identifying a class as known or unknown). The encoder learns the first task following the closed-set classification training pipeline, whereas the decoder learns the second task by reconstructing conditioned on class identity. Furthermore, we model reconstruction errors using the Extreme Value Theory of statistical modeling to find the threshold for identifying known/unknown class samples. Experiments performed on multiple image classification datasets show that the proposed method performs significantly better than state-of-the-art methods. The source code is available at: github.com/otkupjnoz/c2ae.
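A hedged, simplified sketch of the two ingredients named in the abstract above: a decoder conditioned on class identity (here via a learned per-class embedding, one possible conditioning choice) and an extreme-value (Weibull) model fitted to the tail of matched-condition reconstruction errors to obtain an unknown-detection threshold. Names, layer sizes, and the exact EVT fitting procedure are illustrative assumptions, not the authors' released code (github.com/otkupjnoz/c2ae).

```python
import torch
import torch.nn as nn
import numpy as np
from scipy.stats import weibull_min

class ConditionedDecoder(nn.Module):
    """Decoder that reconstructs a latent code conditioned on a class identity."""
    def __init__(self, latent_dim, num_classes, out_dim):
        super().__init__()
        self.cls_embed = nn.Embedding(num_classes, latent_dim)
        self.net = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                                 nn.Linear(256, out_dim))

    def forward(self, z, labels):
        # modulate the latent code by the (possibly mismatched) class embedding
        return self.net(z * self.cls_embed(labels))

def fit_evt_threshold(match_errors, tail_frac=0.1, quantile=0.95):
    """Fit a Weibull model to the largest matched-condition reconstruction
    errors on training data and return a decision threshold."""
    errs = np.sort(np.asarray(match_errors))
    tail = errs[int((1 - tail_frac) * len(errs)):]          # upper tail only
    shape, loc, scale = weibull_min.fit(tail, floc=tail.min() - 1e-6)
    return weibull_min.ppf(quantile, shape, loc=loc, scale=scale)
```

At test time, a sample whose best (matched-condition) reconstruction error exceeds this threshold would be flagged as unknown; otherwise the encoder's closed-set prediction is kept.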

147 citations

Posted Content
TL;DR: An open-set recognition algorithm using class-conditioned auto-encoders with novel training and testing methodologies is proposed; experiments show that the proposed method performs significantly better than state-of-the-art methods.
Abstract: Models trained for classification often assume that all testing classes are known while training. As a result, when presented with an unknown class during testing, such a closed-set assumption forces the model to classify it as one of the known classes. However, in a real-world scenario, classification models are likely to encounter such examples. Hence, identifying those examples as unknown becomes critical to model performance. A potential solution to this problem lies in a class of learning problems known as open-set recognition. It refers to the problem of identifying unknown classes during testing while maintaining performance on the known classes. In this paper, we propose an open-set recognition algorithm using class-conditioned auto-encoders with a novel training and testing methodology. In contrast to previous methods, the training procedure is divided into two sub-tasks: 1. closed-set classification and 2. open-set identification (i.e. identifying a class as known or unknown). The encoder learns the first task following the closed-set classification training pipeline, whereas the decoder learns the second task by reconstructing conditioned on class identity. Furthermore, we model reconstruction errors using the Extreme Value Theory of statistical modeling to find the threshold for identifying known/unknown class samples. Experiments performed on multiple image classification datasets show that the proposed method performs significantly better than the state of the art.

131 citations

Journal ArticleDOI
TL;DR: OC-CNN as discussed by the authors uses zero-centered Gaussian noise in the latent space as the pseudo-negative class and trains the network using the cross-entropy loss to learn a good representation as well as the decision boundary for the given class.
Abstract: We present a novel convolutional neural network (CNN) based approach for one-class classification. The idea is to use zero-centered Gaussian noise in the latent space as the pseudo-negative class and train the network using the cross-entropy loss to learn a good representation as well as the decision boundary for the given class. A key feature of the proposed approach is that any pre-trained CNN can be used as the base network for one-class classification. The proposed one-class CNN is evaluated on the UMDAA-02 Face, Abnormality-1001, and FounderType-200 datasets. These datasets are related to a variety of one-class application problems such as user authentication, abnormality detection, and novelty detection. Extensive experiments demonstrate that the proposed method achieves significant improvements over recent state-of-the-art methods. The source code is available at: github.com/otkupjnoz/oc-cnn.
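A hedged sketch of the one-class training loop described above: a pre-trained CNN provides features for the single known class, zero-centered Gaussian noise in the same feature space serves as the pseudo-negative class, and a small classifier is trained with cross-entropy on the two. The backbone choice, layer sizes, and noise standard deviation are assumptions for illustration, not the released code (github.com/otkupjnoz/oc-cnn).

```python
import torch
import torch.nn as nn
from torchvision import models

backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
backbone.fc = nn.Identity()          # expose the 512-d features of the base CNN
backbone.eval()                      # kept frozen in this sketch

classifier = nn.Sequential(nn.Linear(512, 128), nn.ReLU(), nn.Linear(128, 2))
opt = torch.optim.Adam(classifier.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()
sigma = 0.1                          # std of the pseudo-negative Gaussian (assumed)

def train_step(images):
    """One update on a batch of images from the single known class."""
    with torch.no_grad():
        feats = backbone(images)                      # positive features
    noise = sigma * torch.randn_like(feats)           # pseudo-negative samples
    x = torch.cat([feats, noise], dim=0)
    y = torch.cat([torch.ones(len(feats)), torch.zeros(len(noise))]).long()
    loss = criterion(classifier(x), y)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```

At test time, the classifier's score for the positive class would act as the one-class decision function.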

120 citations

Proceedings ArticleDOI
01 Jun 2021
TL;DR: MeGA-CDA as mentioned in this paper employs category-wise discriminators to ensure category-aware feature alignment for learning domain-invariant discriminative features, and generates memory-guided category-specific attention maps which are then used to route the features appropriately to the corresponding category discriminator.
Abstract: Existing approaches for unsupervised domain adaptive object detection perform feature alignment via adversarial training. While these methods achieve reasonable improvements in performance, they typically perform category-agnostic domain alignment, thereby resulting in negative transfer of features. To overcome this issue, in this work, we attempt to incorporate category information into the domain adaptation process by proposing Memory Guided Attention for Category-Aware Domain Adaptation (MeGA-CDA). The proposed method consists of employing category-wise discriminators to ensure category-aware feature alignment for learning domain-invariant discriminative features. However, since the category information is not available for the target samples, we propose to generate memory-guided category-specific attention maps which are then used to route the features appropriately to the corresponding category discriminator. The proposed method is evaluated on several benchmark datasets and is shown to outperform existing approaches.
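A hedged, simplified sketch of the routing idea in the abstract above: category prototypes stored in a memory produce per-category attention maps, which weight the backbone features before each category-wise domain discriminator. The cosine-similarity attention, discriminator heads, and shapes are illustrative assumptions (and the gradient-reversal layer used for adversarial training is omitted); this is not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MemoryGuidedRouting(nn.Module):
    """Route features to category-wise domain discriminators via memory attention."""
    def __init__(self, feat_dim=256, num_categories=8):
        super().__init__()
        self.memory = nn.Parameter(torch.randn(num_categories, feat_dim))
        # one small domain discriminator (source vs. target) per category
        self.discriminators = nn.ModuleList(
            nn.Sequential(nn.Conv2d(feat_dim, 64, 1), nn.ReLU(),
                          nn.Conv2d(64, 1, 1))
            for _ in range(num_categories))

    def forward(self, feats):                              # feats: (B, C, H, W)
        b, c, h, w = feats.shape
        flat = F.normalize(feats.flatten(2), dim=1)        # (B, C, HW)
        mem = F.normalize(self.memory, dim=1)              # (K, C)
        # cosine similarity of every spatial location to every category slot
        attn = torch.einsum('kc,bcn->bkn', mem, flat)      # (B, K, HW)
        attn = attn.softmax(dim=1).view(b, -1, 1, h, w)    # soft routing weights
        # each discriminator sees features weighted by its category's attention map
        return [d(feats * attn[:, k]) for k, d in enumerate(self.discriminators)]
```

The per-category domain logits returned here would feed an adversarial domain-classification loss, so alignment happens within each category rather than across all of them at once.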

103 citations


Cited by
Journal ArticleDOI
TL;DR: This paper provides a comprehensive survey of existing open set recognition techniques, covering related definitions, model representations, datasets, evaluation criteria, and algorithm comparisons, and highlights the limitations of existing approaches while pointing out promising subsequent research directions.
Abstract: In real-world recognition/classification tasks, limited by various objective factors, it is usually difficult to collect training samples to exhaust all classes when training a recognizer or classifier. A more realistic scenario is open set recognition (OSR), where incomplete knowledge of the world exists at training time, and unknown classes can be submitted to an algorithm during testing, requiring the classifiers to not only accurately classify the seen classes, but also effectively deal with unseen ones. This paper provides a comprehensive survey of existing open set recognition techniques covering various aspects ranging from related definitions, representations of models, datasets, evaluation criteria, and algorithm comparisons. Furthermore, we briefly analyze the relationships between OSR and its related tasks including zero-shot, one-shot (few-shot) recognition/learning techniques, classification with reject option, and so forth. Additionally, we also review the open world recognition which can be seen as a natural extension of OSR. Importantly, we highlight the limitations of existing approaches and point out some promising subsequent research directions in this field.

492 citations

Proceedings ArticleDOI
01 Jan 2019
TL;DR: OCGAN as discussed by the authors uses a de-noising auto-encoder network to explicitly constrain the latent space to exclusively represent the given class and uses a gradient-descent based sampling technique to generate potential out-of-class examples.
Abstract: We present a novel model called OCGAN for the classical problem of one-class novelty detection, where, given a set of examples from a particular class, the goal is to determine if a query example is from the same class. Our solution is based on learning latent representations of in-class examples using a de-noising auto-encoder network. The key contribution of our work is our proposal to explicitly constrain the latent space to exclusively represent the given class. In order to accomplish this goal, firstly, we force the latent space to have bounded support by introducing a tanh activation in the encoder's output layer. Secondly, using a discriminator in the latent space that is trained adversarially, we ensure that encoded representations of in-class examples resemble uniform random samples drawn from the same bounded space. Thirdly, using a second adversarial discriminator in the input space, we ensure all randomly drawn latent samples generate examples that look real. Finally, we introduce a gradient-descent based sampling technique that explores points in the latent space that generate potential out-of-class examples, which are fed back to the network to further train it to generate in-class examples from those points. The effectiveness of the proposed method is measured across four publicly available datasets using two one-class novelty detection protocols where we achieve state-of-the-art results.
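A hedged sketch of two pieces of the OCGAN recipe described above: an encoder whose output is bounded with tanh so the latent space has compact support, and a gradient-based search that starts from random latent samples and moves toward points whose decodings look least in-class, to be fed back as hard negatives. The classifier, step size, and step count are illustrative assumptions, not the paper's exact procedure; the two adversarial discriminators are omitted.

```python
import torch
import torch.nn as nn

class BoundedEncoder(nn.Module):
    """Encoder with tanh output so latent codes lie in a bounded support."""
    def __init__(self, in_dim, latent_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 256), nn.ReLU(),
                                 nn.Linear(256, latent_dim))

    def forward(self, x):
        return torch.tanh(self.net(x))   # latent support bounded in (-1, 1)

def mine_informative_negatives(decoder, classifier, latent_dim,
                               n=16, steps=5, lr=0.1):
    """Search the bounded latent space for points that decode to images the
    in-class classifier scores lowest, to use as hard negative examples."""
    z = torch.empty(n, latent_dim).uniform_(-1, 1).requires_grad_(True)
    for _ in range(steps):
        score = classifier(decoder(z)).sum()   # high score = looks in-class
        grad, = torch.autograd.grad(score, z)
        # step against the gradient to make the decodings look less in-class
        z = (z - lr * grad).clamp(-1, 1).detach().requires_grad_(True)
    return decoder(z.detach())                 # candidate out-of-class images
```

Training on these mined samples pushes the generator to produce in-class images from every point of the bounded latent space, which is the core constraint the paper argues for.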

460 citations

Posted Content
TL;DR: State-of-the-art techniques in action recognition and prediction are surveyed, covering existing models, popular algorithms, technical difficulties, popular action databases, evaluation protocols, and promising future directions.
Abstract: Derived from rapid advances in computer vision and machine learning, video analysis tasks have been moving from inferring the present state to predicting the future state. Vision-based action recognition and prediction from videos are such tasks, where action recognition infers human actions (present state) based upon complete action executions, and action prediction predicts human actions (future state) based upon incomplete action executions. These two tasks have recently become particularly prevalent topics because of their explosively emerging real-world applications, such as visual surveillance, autonomous driving vehicles, entertainment, and video retrieval. Many attempts have been made over the last few decades to build robust and effective frameworks for action recognition and prediction. In this paper, we survey state-of-the-art techniques in action recognition and prediction. Existing models, popular algorithms, technical difficulties, popular action databases, evaluation protocols, and promising future directions are also provided with systematic discussions.

351 citations

Journal ArticleDOI
TL;DR: This review aims to identify the common underlying principles and the assumptions that are often made implicitly by various methods in deep learning, and draws connections between classic “shallow” and novel deep approaches and shows how this relation might cross-fertilize or extend both directions.
Abstract: Deep learning approaches to anomaly detection have recently improved the state of the art in detection performance on complex datasets such as large collections of images or text. These results have sparked a renewed interest in the anomaly detection problem and led to the introduction of a great variety of new methods. With the emergence of numerous such methods, including approaches based on generative models, one-class classification, and reconstruction, there is a growing need to bring methods of this field into a systematic and unified perspective. In this review we aim to identify the common underlying principles as well as the assumptions that are often made implicitly by various methods. In particular, we draw connections between classic 'shallow' and novel deep approaches and show how this relation might cross-fertilize or extend both directions. We further provide an empirical assessment of major existing methods that is enriched by the use of recent explainability techniques, and present specific worked-through examples together with practical advice. Finally, we outline critical open challenges and identify specific paths for future research in anomaly detection.

310 citations