scispace - formally typeset
Open AccessJournal ArticleDOI

Unleashing the potential of digital pathology data by training computer-aided diagnosis models without human annotations

TLDR
In this paper , the authors proposed and evaluated an approach to eliminate the need for manual annotations to train computer-aided diagnosis tools in digital pathology, which includes two components, to automatically extract semantically meaningful concepts from diagnostic reports and use them as weak labels to train convolutional neural networks (CNNs) for histopathology diagnosis.
Abstract
The digitalization of clinical workflows and the increasing performance of deep learning algorithms are paving the way towards new methods for tackling cancer diagnosis. However, the availability of medical specialists to annotate digitized images and free-text diagnostic reports does not scale with the need for large datasets required to train robust computer-aided diagnosis methods that can target the high variability of clinical cases and data produced. This work proposes and evaluates an approach to eliminate the need for manual annotations to train computer-aided diagnosis tools in digital pathology. The approach includes two components, to automatically extract semantically meaningful concepts from diagnostic reports and use them as weak labels to train convolutional neural networks (CNNs) for histopathology diagnosis. The approach is trained (through 10-fold cross-validation) on 3'769 clinical images and reports, provided by two hospitals and tested on over 11'000 images from private and publicly available datasets. The CNN, trained with automatically generated labels, is compared with the same architecture trained with manual labels. Results show that combining text analysis and end-to-end deep neural networks allows building computer-aided diagnosis tools that reach solid performance (micro-accuracy = 0.908 at image-level) based only on existing clinical data without the need for manual annotations.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Data-driven color augmentation for H&E stained images in computational pathology

TL;DR: In this paper , a Data-Driven Color Augmentation (DDCA) method was proposed to improve the efficiency of color augmentation methods by increasing the reliability of the samples used for training computational pathology models.
Journal ArticleDOI

Empowering digital pathology applications through explainable knowledge extraction tools

TL;DR: This article proposed an unsupervised knowledge extraction system combining a rule-based expert system with pre-trained Machine Learning (ML) models, namely the Semantic Knowledge Extractor Tool (SKET).
Journal ArticleDOI

Attention-based Interpretable Regression of Gene Expression in Histology

TL;DR: It is shown that interpretability can reveal connections between the microscopic appearance of cancer tissue and its gene expression profiling, and can help characterise how gene expression shapes tissue morphology in the pathology unit.
Book ChapterDOI

Attention-Based Interpretable Regression of Gene Expression in Histology

Anett Kasza
TL;DR: In this paper , the authors show that interpretability of deep learning can reveal connections between the microscopic appearance of cancer tissue and its gene expression profiling, which can be further used to uncover highly non-trivial patterns which are otherwise imperceptible to the human eye.
Journal ArticleDOI

Interpretable classification of pathology whole-slide images using attention based context-aware graph convolutional neural network

TL;DR: Zhang et al. as mentioned in this paper proposed an interpretable classification model named bidirectional attention-based multiple instance learning graph convolutional network (ABMIL-GCN), which hierarchically aggregates context-aware features of instances into a global representation in a topology fashion to predict the slide labels and localize the region of lymph node metastasis in WSIs.
References
More filters
Proceedings Article

Attention is All you Need

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.
Journal ArticleDOI

Interrater reliability: the kappa statistic

Marry L. McHugh
- 15 Oct 2012 - 
TL;DR: While the kappa is one of the most commonly used statistics to test interrater reliability, it has limitations and levels for both kappa and percent agreement that should be demanded in healthcare studies are suggested.
Journal ArticleDOI

Occurrence of the potent mutagens 2- nitrobenzanthrone and 3-nitrobenzanthrone in fine airborne particles

TL;DR: In the present study, 2-NBA, 3-NBA and selected PAHs and Nitro-PAHs were determined in fine particle samples collected in a bus station and an outdoor site, showing low cancer risk incidence and incremental lifetime cancer risk (ILCR) calculated for both places.
Related Papers (5)