Home
/
Authors
/
Debesh Jha

Author

Debesh Jha

Bio: Debesh Jha is an academic researcher from Simula Research Laboratory. The author has contributed to research in topics: Computer science & Segmentation. The author has an hindex of 13, co-authored 59 publications receiving 703 citations. Previous affiliations of Debesh Jha include Chosun University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016

Papers

PDF

Open Access

More filters

Posted Content•

Kvasir-SEG: A Segmented Polyp Dataset

[...]

Debesh Jha, Pia H. Smedsrud, Michael Riegler, Pål Halvorsen, Thomas de Lange¹, Dag Johansen, Håvard D. Johansen - Show less +3 more•Institutions (1)

University of Oslo¹

16 Nov 2019-arXiv: Image and Video Processing

TL;DR: Kvasir-SEG as mentioned in this paper is an open-access dataset of gastrointestinal polyp images and corresponding segmentation masks, manually annotated by a medical doctor and then verified by an experienced gastroenterologist.

...read moreread less

Abstract: Pixel-wise image segmentation is a highly demanding task in medical-image analysis. In practice, it is difficult to find annotated medical images with corresponding segmentation masks. In this paper, we present Kvasir-SEG: an open-access dataset of gastrointestinal polyp images and corresponding segmentation masks, manually annotated by a medical doctor and then verified by an experienced gastroenterologist. Moreover, we also generated the bounding boxes of the polyp regions with the help of segmentation masks. We demonstrate the use of our dataset with a traditional segmentation approach and a modern deep-learning based Convolutional Neural Network (CNN) approach. The dataset will be of value for researchers to reproduce results and compare methods. By adding segmentation masks to the Kvasir dataset, which only provide frame-wise annotations, we enable multimedia and computer vision researchers to contribute in the field of polyp segmentation and automatic analysis of colonoscopy images.

...read moreread less

306 citations

Proceedings Article•DOI•

DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation

[...]

Debesh Jha, Michael Riegler, Dag Johansen, Pål Halvorsen¹, Håvard D. Johansen - Show less +1 more•Institutions (1)

Metropolitan University¹

28 Jul 2020

TL;DR: Encouraging results show that DoubleU-Net can be used as a strong baseline for both medical image segmentation and cross-dataset evaluation testing to measure the generalizability of Deep Learning (DL) models.

...read moreread less

Abstract: Semantic image segmentation is the process of labeling each pixel of an image with its corresponding class. An encoder-decoder based approach, like U-Net and its variants, is a popular strategy for solving medical image segmentation tasks. To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other. The first U-Net uses a pre-trained VGG-19 as the encoder, which has already learned features from ImageNet and can be transferred to another task easily. To capture more semantic information efficiently, we added another U-Net at the bottom. We also adopt Atrous Spatial Pyramid Pooling (ASPP) to capture contextual information within the network. We have evaluated DoubleU-Net using four medical segmentation datasets, covering various imaging modalities such as colonoscopy, dermoscopy, and microscopy. Experiments on the MICCAI 2015 segmentation challenge, the CVC-ClinicDB, the 2018 Data Science Bowl challenge, and the Lesion boundary segmentation datasets demonstrate that the DoubleU-Net outperforms U-Net and the baseline models. Moreover, DoubleU-Net produces more accurate segmentation masks, especially in the case of the CVC-ClinicDB and MICCAI 2015 segmentation challenge datasets, which have challenging images such as smaller and flat polyps. These results show the improvement over the existing U-Net model. The encouraging results, produced on various medical image segmentation datasets, show that DoubleU-Net can be used as a strong baseline for both medical image segmentation and cross-dataset evaluation testing to measure the generalizability of Deep Learning (DL) models.

...read moreread less

305 citations

Posted Content•

ResUNet++: An Advanced Architecture for Medical Image Segmentation

[...]

Debesh Jha, Pia H. Smedsrud, Michael Riegler¹, Dag Johansen, Thomas de Lange¹, Pål Halvorsen², Håvard D. Johansen - Show less +3 more•Institutions (2)

University of Oslo¹, Metropolitan University²

16 Nov 2019-arXiv: Image and Video Processing

TL;DR: ResUNet++ is proposed, which is an improved ResUNet architecture for colonoscopic image segmentation, which significantly outperforms U-Net and Res UNet, two key state-of-the-art deep learning architectures, by achieving high evaluation scores.

...read moreread less

Abstract: Accurate computer-aided polyp detection and segmentation during colonoscopy examinations can help endoscopists resect abnormal tissue and thereby decrease chances of polyps growing into cancer. Towards developing a fully automated model for pixel-wise polyp segmentation, we propose ResUNet++, which is an improved ResUNet architecture for colonoscopic image segmentation. Our experimental evaluations show that the suggested architecture produces good segmentation results on publicly available datasets. Furthermore, ResUNet++ significantly outperforms U-Net and ResUNet, two key state-of-the-art deep learning architectures, by achieving high evaluation scores with a dice coefficient of 81.33%, and a mean Intersection over Union (mIoU) of 79.27% for the Kvasir-SEG dataset and a dice coefficient of 79.55%, and a mIoU of 79.62% with CVC-612 dataset.

...read moreread less

273 citations

Book Chapter•DOI•

Kvasir-SEG: A Segmented Polyp Dataset

[...]

Debesh Jha, Pia H. Smedsrud, Michael Riegler, Pål Halvorsen, Thomas de Lange¹, Dag Johansen, Håvard D. Johansen - Show less +3 more•Institutions (1)

University of Oslo¹

05 Jan 2020

TL;DR: This paper presents Kvasir-SEG: an open-access dataset of gastrointestinal polyp images and corresponding segmentation masks, manually annotated by a medical doctor and then verified by an experienced gastroenterologist, and demonstrates the use of the dataset with a traditional segmentation approach and a modern deep-learning based Convolutional Neural Network approach.

...read moreread less

270 citations

Proceedings Article•DOI•

ResUNet++: An Advanced Architecture for Medical Image Segmentation

[...]

Debesh Jha, Pia H. Smedsrud, Michael Riegler¹, Dag Johansen, Thomas de Lange¹, Pål Halvorsen², Håvard D. Johansen - Show less +3 more•Institutions (2)

University of Oslo¹, Metropolitan University²

01 Dec 2019

TL;DR: Wang et al. as mentioned in this paper proposed an improved ResUNet architecture for colonoscopic image segmentation, which achieved a dice coefficient of 81.33% and a mean intersection over union (mIoU) of 79.27% for the Kvasir-SEG dataset.

...read moreread less

258 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17

Collapse

Cited by

PDF

Open Access

More filters

CA : A Cancer Journal for Clinicians

[...]

Patrizia Agostinis, Kristian Berg, Keith A. Cengel, Thomas H. Foster, Albert W. Girotti, Sandra O. Gollnick, Stephen M. Hahn, Michael R. Hamblin, Asta Juzeniene, David Kessel, Mladen Korbelik, Johan Moan, Pawel Mroz, Dominika Nowis, Jacques Piette, Brian C. Wilson, Jakub Golab - Show less +13 more

01 Jan 2011

4,646 citations

Posted Content•

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions

[...]

Wenhai Wang¹, Enze Xie², Xiang Li³, Deng-Ping Fan⁴, Kaitao Song⁵, Ding Liang⁶, Tong Lu¹, Ping Luo², Ling Shao⁷ - Show less +5 more•Institutions (7)

Nanjing University¹, University of Hong Kong², New York University Abu Dhabi³, Nankai University⁴, Nanjing University of Science and Technology⁵, SenseTime⁶, Seoul National University⁷

24 Feb 2021-arXiv: Computer Vision and Pattern Recognition

TL;DR: Huang et al. as discussed by the authors proposed Pyramid Vision Transformer (PVT), which is a simple backbone network useful for many dense prediction tasks without convolutions, and achieved state-of-the-art performance on the COCO dataset.

...read moreread less

Abstract: Although using convolutional neural networks (CNNs) as backbones achieves great successes in computer vision, this work investigates a simple backbone network useful for many dense prediction tasks without convolutions. Unlike the recently-proposed Transformer model (e.g., ViT) that is specially designed for image classification, we propose Pyramid Vision Transformer~(PVT), which overcomes the difficulties of porting Transformer to various dense prediction tasks. PVT has several merits compared to prior arts. (1) Different from ViT that typically has low-resolution outputs and high computational and memory cost, PVT can be not only trained on dense partitions of the image to achieve high output resolution, which is important for dense predictions but also using a progressive shrinking pyramid to reduce computations of large feature maps. (2) PVT inherits the advantages from both CNN and Transformer, making it a unified backbone in various vision tasks without convolutions by simply replacing CNN backbones. (3) We validate PVT by conducting extensive experiments, showing that it boosts the performance of many downstream tasks, e.g., object detection, semantic, and instance segmentation. For example, with a comparable number of parameters, RetinaNet+PVT achieves 40.4 AP on the COCO dataset, surpassing RetinNet+ResNet50 (36.3 AP) by 4.1 absolute AP. We hope PVT could serve as an alternative and useful backbone for pixel-level predictions and facilitate future researches. Code is available at this https URL.

...read moreread less

845 citations

Journal Article•DOI•

U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications

[...]

Nahian Siddique¹, Sidike Paheding², Colin Elkin¹, Vijay Devabhaktuni¹•Institutions (2)

Purdue University¹, Michigan Technological University²

03 Jun 2021-IEEE Access

TL;DR: A narrative literature review examines the numerous developments and breakthroughs in the U-net architecture and provides observations on recent trends, and discusses the many innovations that have advanced in deep learning and how these tools facilitate U-nets.

...read moreread less

Abstract: U-net is an image segmentation technique developed primarily for image segmentation tasks. These traits provide U-net with a high utility within the medical imaging community and have resulted in extensive adoption of U-net as the primary tool for segmentation tasks in medical imaging. The success of U-net is evident in its widespread use in nearly all major image modalities, from CT scans and MRI to X-rays and microscopy. Furthermore, while U-net is largely a segmentation tool, there have been instances of the use of U-net in other applications. Given that U-net’s potential is still increasing, this narrative literature review examines the numerous developments and breakthroughs in the U-net architecture and provides observations on recent trends. We also discuss the many innovations that have advanced in deep learning and discuss how these tools facilitate U-net. In addition, we review the different image modalities and application areas that have been enhanced by U-net.

...read moreread less

425 citations

Book Chapter•DOI•

TransFuse: Fusing Transformers and CNNs for Medical Image Segmentation

[...]

Yundong Zhang, Huiye Liu¹, Qiang Hu•Institutions (1)

Georgia Institute of Technology¹

27 Sep 2021

TL;DR: TransFuse as discussed by the authors combines Transformers and CNNs in a parallel style, where both global dependency and low-level spatial details can be efficiently captured in a much shallower manner.

...read moreread less

Abstract: Medical image segmentation - the prerequisite of numerous clinical needs - has been significantly prospered by recent advances in convolutional neural networks (CNNs). However, it exhibits general limitations on modeling explicit long-range relation, and existing cures, resorting to building deep encoders along with aggressive downsampling operations, leads to redundant deepened networks and loss of localized details. Hence, the segmentation task awaits a better solution to improve the efficiency of modeling global contexts while maintaining a strong grasp of low-level details. In this paper, we propose a novel parallel-in-branch architecture, TransFuse, to address this challenge. TransFuse combines Transformers and CNNs in a parallel style, where both global dependency and low-level spatial details can be efficiently captured in a much shallower manner. Besides, a novel fusion technique - BiFusion module is created to efficiently fuse the multi-level features from both branches. Extensive experiments demonstrate that TransFuse achieves the newest state-of-the-art results on both 2D and 3D medical image sets including polyp, skin lesion, hip, and prostate segmentation, with significant parameter decrease and inference speed improvement.

...read moreread less

365 citations

Journal Article•DOI•

Convolutional neural networks for classification of Alzheimer's disease: Overview and reproducible evaluation.

[...]

Junhao Wen¹, Elina Thibeau-Sutre¹, Mauricio Diaz-Melo¹, Jorge Samper-González¹, Alexandre Routier¹, Simona Bottani¹, Didier Dormont¹, Stanley Durrleman¹, Ninon Burgos¹, Olivier Colliot¹ - Show less +6 more•Institutions (1)

University of Paris¹

01 May 2020-Medical Image Analysis

TL;DR: The open-source framework for classification of AD using CNN and T1-weighted MRI is extended and found that more than half of the surveyed papers may have suffered from data leakage and thus reported biased performance.

...read moreread less

346 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse