Author

Wouter Bulten

Bio: Wouter Bulten is an academic researcher from Radboud University Nijmegen. The author has contributed to research in topics: Autoencoder & Prostate cancer. The author has an h-index of 10 and has co-authored 19 publications receiving 670 citations.

Papers
Journal ArticleDOI
TL;DR: An automated deep-learning system to grade prostate biopsies following the Gleason grading standard achieved a performance similar to pathologists for Gleason grading and could potentially contribute to prostate cancer diagnosis.
Abstract:
Background: The Gleason score is the strongest correlating predictor of recurrence for prostate cancer, but has substantial inter-observer variability, limiting its usefulness for individual patients. Specialised urological pathologists have greater concordance; however, such expertise is not widely available. Prostate cancer diagnostics could thus benefit from robust, reproducible Gleason grading. We aimed to investigate the potential of deep learning to perform automated Gleason grading of prostate biopsies.
Methods: In this retrospective study, we developed a deep-learning system to grade prostate biopsies following the Gleason grading standard. The system was developed using randomly selected biopsies, sampled by the biopsy Gleason score, from patients at the Radboud University Medical Center (pathology report dated between Jan 1, 2012, and Dec 31, 2017). A semi-automatic labelling technique was used to circumvent the need for manual annotations by pathologists, using pathologists' reports as the reference standard during training. The system was developed to delineate individual glands, assign Gleason growth patterns, and determine the biopsy-level grade. For validation of the method, a consensus reference standard was set by three expert urological pathologists on an independent test set of 550 biopsies. Of these 550, 100 were used in an observer experiment in which the system, 13 pathologists, and two pathologists in training were compared with respect to the reference standard. The system was also evaluated on an external test dataset of 886 cores, which contained 245 cores from a different centre that were independently graded by two pathologists.
Findings: We collected 5759 biopsies from 1243 patients. The developed system achieved a high agreement with the reference standard (quadratic Cohen's kappa 0·918, 95% CI 0·891–0·941) and scored highly at clinical decision thresholds: benign versus malignant (area under the curve 0·990, 95% CI 0·982–0·996), grade group of 2 or more (0·978, 0·966–0·988), and grade group of 3 or more (0·974, 0·962–0·984). In an observer experiment, the deep-learning system scored higher (kappa 0·854) than the panel (median kappa 0·819), outperforming 10 of 15 pathologist observers. On the external test dataset, the system obtained a high agreement with the reference standard set independently by two pathologists (quadratic Cohen's kappa 0·723 and 0·707) and within inter-observer variability (kappa 0·71).
Interpretation: Our automated deep-learning system achieved a performance similar to pathologists for Gleason grading and could potentially contribute to prostate cancer diagnosis. The system could potentially assist pathologists by screening biopsies, providing second opinions on grade group, and presenting quantitative measurements of volume percentages.
Funding: Dutch Cancer Society.

400 citations
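As an aside for readers reproducing the headline numbers: both the quadratically weighted kappa and the clinical-threshold AUCs reported above are standard scikit-learn computations. The sketch below uses hypothetical label arrays; only the metric calls mirror what the paper reports.

```python
# Hypothetical grade groups (0 = benign, 1-5 = ISUP grade group) for a few biopsies.
from sklearn.metrics import cohen_kappa_score, roc_auc_score

reference = [0, 1, 2, 3, 4, 5, 2, 1]   # consensus reference standard
predicted = [0, 1, 2, 3, 5, 5, 2, 2]   # deep-learning system's output

# Agreement metric reported in the paper: quadratically weighted Cohen's kappa.
kappa = cohen_kappa_score(reference, predicted, weights="quadratic")
print(f"quadratic Cohen's kappa: {kappa:.3f}")

# Clinical decision thresholds reduce grading to a binary task; with a
# per-biopsy malignancy score, the AUC follows directly.
malignancy_score = [0.02, 0.61, 0.88, 0.95, 0.99, 0.97, 0.83, 0.55]  # hypothetical
is_malignant = [int(g > 0) for g in reference]
print(f"benign vs malignant AUC: {roc_auc_score(is_malignant, malignancy_score):.3f}")
```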

Journal ArticleDOI
TL;DR: In this article, the authors compared stain color augmentation and normalization techniques and quantified their effect on CNN classification performance using a heterogeneous dataset of hematoxylin and eosin histopathology images from 4 organs and 9 pathology laboratories.

362 citations
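For context, the stain color augmentation this paper evaluates typically works by perturbing images in the stain space obtained via color deconvolution. A minimal sketch, assuming scikit-image's HED conversion; the perturbation ranges are illustrative, not the paper's settings:

```python
import numpy as np
from skimage.color import rgb2hed, hed2rgb

def augment_he_stain(rgb, alpha_range=0.05, beta_range=0.01, rng=None):
    """Randomly scale and shift each stain channel of an RGB H&E image."""
    rng = rng or np.random.default_rng()
    hed = rgb2hed(rgb)  # RGB -> stain space via color deconvolution
    for c in range(3):
        alpha = rng.uniform(1 - alpha_range, 1 + alpha_range)  # per-channel scale
        beta = rng.uniform(-beta_range, beta_range)            # per-channel shift
        hed[..., c] = hed[..., c] * alpha + beta
    return np.clip(hed2rgb(hed), 0, 1)  # back to RGB, clipped to a valid range
```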

Journal ArticleDOI
TL;DR: In this paper, a semi-automatic labeling technique was used to circumvent the need for full manual annotation by pathologists and the developed system achieved a high agreement with the reference standard.
Abstract: The Gleason score is the most important prognostic marker for prostate cancer patients but suffers from significant inter-observer variability. We developed a fully automated deep learning system to grade prostate biopsies. The system was developed using 5834 biopsies from 1243 patients. A semi-automatic labeling technique was used to circumvent the need for full manual annotation by pathologists. The developed system achieved a high agreement with the reference standard. In a separate observer experiment, the deep learning system outperformed 10 out of 15 pathologists. The system has the potential to improve prostate cancer prognostics by acting as a first or second reader.

176 citations
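The biopsy-level grade referred to above is conventionally reported as an ISUP grade group derived from the primary and secondary Gleason growth patterns. The standard mapping is sketched here for reference; this is the general convention, not code from the paper:

```python
def grade_group(primary: int, secondary: int) -> int:
    """Map a Gleason score (primary + secondary pattern) to ISUP grade group 1-5."""
    total = primary + secondary
    if total <= 6:
        return 1                          # Gleason <=6
    if total == 7:
        return 2 if primary == 3 else 3   # 3+4 -> GG2, 4+3 -> GG3
    if total == 8:
        return 4                          # 4+4, 3+5, 5+3
    return 5                              # Gleason 9-10

assert grade_group(3, 4) == 2 and grade_group(4, 3) == 3
```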

Journal ArticleDOI
TL;DR: The PANDA challenge described in this paper, the largest histopathology competition to date, was joined by 1,290 developers and was organized to catalyze the development of reproducible AI algorithms for Gleason grading using 10,616 digitized prostate biopsies.
Abstract: Artificial intelligence (AI) has shown promise for diagnosing prostate cancer in biopsies. However, results have been limited to individual studies, lacking validation in multinational settings. Competitions have been shown to be accelerators for medical imaging innovations, but their impact is hindered by lack of reproducibility and independent validation. With this in mind, we organized the PANDA challenge (the largest histopathology competition to date, joined by 1,290 developers) to catalyze development of reproducible AI algorithms for Gleason grading using 10,616 digitized prostate biopsies. We validated that a diverse set of submitted algorithms reached pathologist-level performance on independent cross-continental cohorts, fully blinded to the algorithm developers. On United States and European external validation sets, the algorithms achieved agreements of 0.862 (quadratically weighted κ, 95% confidence interval (CI), 0.840-0.884) and 0.868 (95% CI, 0.835-0.900) with expert uropathologists. Successful generalization across different patient populations, laboratories and reference standards, achieved by a variety of algorithmic approaches, warrants evaluating AI-based Gleason grading in prospective clinical trials.

121 citations
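The challenge metrics are quoted with 95% confidence intervals alongside the quadratically weighted kappa; a nonparametric bootstrap over cases is one common way to obtain such intervals. A sketch with hypothetical inputs, assuming scikit-learn and NumPy:

```python
import numpy as np
from sklearn.metrics import cohen_kappa_score

def kappa_with_ci(reference, predicted, n_boot=2000, seed=0):
    """Point estimate and 95% bootstrap CI for quadratically weighted kappa."""
    reference = np.asarray(reference)
    predicted = np.asarray(predicted)
    rng = np.random.default_rng(seed)
    point = cohen_kappa_score(reference, predicted, weights="quadratic")
    stats = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(reference), len(reference))  # resample cases
        stats.append(cohen_kappa_score(reference[idx], predicted[idx],
                                       weights="quadratic"))
    # nanpercentile guards against degenerate resamples containing a single class.
    low, high = np.nanpercentile(stats, [2.5, 97.5])
    return point, (low, high)

point, (low, high) = kappa_with_ci([1, 2, 3, 4, 5, 2, 3], [1, 2, 3, 5, 5, 2, 2])
print(f"kappa {point:.3f} (95% CI {low:.3f}-{high:.3f})")
```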

Journal ArticleDOI
TL;DR: In this paper, a U-Net was trained to segment epithelial structures in IHC using a subset of the IHC slides that were preprocessed with color deconvolution.
Abstract: Given the importance of gland morphology in grading prostate cancer (PCa), automatically differentiating between epithelium and other tissues is an important prerequisite for the development of automated methods for detecting PCa. We propose a new deep learning method to segment epithelial tissue in digitised hematoxylin and eosin (H&E) stained prostatectomy slides using immunohistochemistry (IHC) as reference standard. Compared to manual outlining on H&E slides, IHC yields a more precise and objective ground truth, especially in areas with high-grade PCa. 102 tissue sections were stained with H&E and subsequently restained with P63 and CK8/18 IHC markers to highlight epithelial structures, after which each pair was co-registered. First, we trained a U-Net to segment epithelial structures in IHC using a subset of the IHC slides that were preprocessed with color deconvolution. Second, this network was applied to the remaining slides to create the reference standard used to train a second U-Net on H&E. Our system accurately segmented both intact glands and individual tumour epithelial cells. The generalisation capacity of our system is shown using an independent external dataset from a different centre. We envision this segmentation as the first part of a fully automated prostate cancer grading pipeline.

88 citations
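The color deconvolution step mentioned above has a standard open-source implementation: scikit-image's Ruifrok-Johnston H-DAB separation. Using the DAB channel as a crude epithelium cue, as below, is an assumption of this sketch rather than the authors' exact pipeline:

```python
from skimage import io
from skimage.color import rgb2hed

ihc = io.imread("ihc_tile.png")          # hypothetical IHC image tile
hed = rgb2hed(ihc)                       # separate hematoxylin / eosin / DAB
dab = hed[..., 2]                        # DAB channel carries the IHC staining
mask = dab > dab.mean() + 2 * dab.std()  # crude threshold as a starting point
```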


Cited by
Journal ArticleDOI
TL;DR: A deep convolutional neural network trained on whole-slide images from The Cancer Genome Atlas accurately and automatically classifies them into LUAD, LUSC or normal lung tissue, and also predicts the ten most commonly mutated genes in LUAD.
Abstract: Visual inspection of histopathology slides is one of the main methods used by pathologists to assess the stage, type and subtype of lung tumors. Adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) are the most prevalent subtypes of lung cancer, and their distinction requires visual inspection by an experienced pathologist. In this study, we trained a deep convolutional neural network (inception v3) on whole-slide images obtained from The Cancer Genome Atlas to accurately and automatically classify them into LUAD, LUSC or normal lung tissue. The performance of our method is comparable to that of pathologists, with an average area under the curve (AUC) of 0.97. Our model was validated on independent datasets of frozen tissues, formalin-fixed paraffin-embedded tissues and biopsies. Furthermore, we trained the network to predict the ten most commonly mutated genes in LUAD. We found that six of them (STK11, EGFR, FAT1, SETBP1, KRAS and TP53) can be predicted from pathology images, with AUCs from 0.733 to 0.856 as measured on a held-out population. These findings suggest that deep-learning models can assist pathologists in the detection of cancer subtype or gene mutations. Our approach can be applied to any cancer type, and the code is available at https://github.com/ncoudray/DeepPATH.

1,682 citations
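For orientation, the transfer-learning setup described (an ImageNet-pretrained Inception v3 with a three-class head for LUAD, LUSC and normal tissue) can be sketched with torchvision; tile extraction and the training loop are omitted, and the torchvision usage here is an assumption, not the authors' code:

```python
import torch
import torch.nn as nn
from torchvision import models

model = models.inception_v3(weights=models.Inception_V3_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 3)                      # LUAD / LUSC / normal
model.AuxLogits.fc = nn.Linear(model.AuxLogits.fc.in_features, 3)  # auxiliary head too

x = torch.randn(4, 3, 299, 299)   # Inception v3 expects 299x299 tiles
model.eval()
with torch.no_grad():
    logits = model(x)             # shape: (4, 3)
```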

Posted Content
TL;DR: WILDS, a benchmark of in-the-wild distribution shifts spanning diverse data modalities and applications, is presented in the hope of encouraging the development of general-purpose methods that are anchored to real-world distribution shifts and work well across different applications and problem settings.
Abstract: Distribution shifts, where the training distribution differs from the test distribution, can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity, these real-world distribution shifts are under-represented in the datasets widely used in the ML community today. To address this gap, we present WILDS, a curated collection of 8 benchmark datasets that reflect a diverse range of distribution shifts which naturally arise in real-world applications, such as shifts across hospitals for tumor identification; across camera traps for wildlife monitoring; and across time and location in satellite imaging and poverty mapping. On each dataset, we show that standard training results in substantially lower out-of-distribution than in-distribution performance, and that this gap remains even with models trained by existing methods for handling distribution shifts. This underscores the need for new training methods that produce models which are more robust to the types of distribution shifts that arise in practice. To facilitate method development, we provide an open-source package that automates dataset loading, contains default model architectures and hyperparameters, and standardizes evaluations. Code and leaderboards are available at https://wilds.stanford.edu.

579 citations
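The open-source package the abstract mentions is published as the wilds Python package; its documented loading pattern, shown here for the Camelyon17 tumor-identification shift (hospital as domain), looks roughly as follows:

```python
from wilds import get_dataset
from wilds.common.data_loaders import get_train_loader
import torchvision.transforms as transforms

dataset = get_dataset(dataset="camelyon17", download=True)
train_data = dataset.get_subset("train", transform=transforms.ToTensor())
train_loader = get_train_loader("standard", train_data, batch_size=16)

for x, y, metadata in train_loader:   # metadata carries the domain (hospital)
    break
```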

Proceedings Article
01 Jan 2007
TL;DR: In this paper, the Gaussian Process Latent Variable Model (GPLVM) is used to reconstruct a topological connectivity graph from a signal strength sequence, which can be used to perform efficient WiFi SLAM.
Abstract: WiFi localization, the task of determining the physical location of a mobile device from wireless signal strengths, has been shown to be an accurate method of indoor and outdoor localization and a powerful building block for location-aware applications. However, most localization techniques require a training set of signal strength readings labeled against a ground truth location map, which is prohibitive to collect and maintain as maps grow large. In this paper we propose a novel technique for solving the WiFi SLAM problem using the Gaussian Process Latent Variable Model (GPLVM) to determine the latent-space locations of unlabeled signal strength data. We show how GPLVM, in combination with an appropriate motion dynamics model, can be used to reconstruct a topological connectivity graph from a signal strength sequence which, in combination with the learned Gaussian Process signal strength model, can be used to perform efficient localization.

488 citations
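A bare-bones GPLVM fit in the spirit of this paper (recovering 2-D latent locations from unlabeled signal-strength vectors) can be sketched with the GPy library; GPy is an assumption here, and the paper's motion-dynamics model is not included:

```python
import numpy as np
import GPy

Y = np.random.randn(200, 12)              # hypothetical: 200 scans x 12 access points
model = GPy.models.GPLVM(Y, input_dim=2)  # learn 2-D latent positions
model.optimize(messages=False, max_iters=200)
latent_positions = model.X                # (200, 2) estimated locations, up to warping
```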

Journal ArticleDOI
TL;DR: Experimental analysis on 6,523 chest X-rays from different institutions demonstrated the effectiveness of the proposed approach, with an average COVID-19 detection time of approximately 2.5 seconds and an average accuracy of 0.97.

412 citations