A Dataset for Breast Cancer Histopathological Image Classification

doi:10.1109/TBME.2015.2496264

Home
/
Papers
/
A Dataset for Breast Cancer Histopathological Image Classification

Journal Article•DOI•

A Dataset for Breast Cancer Histopathological Image Classification

Fabio Alexandre Spanhol¹, Luiz S. Oliveira¹, Caroline Petitjean², Laurent Heutte²•Institutions (2)

Federal University of Paraná¹, University of Rouen²

01 Jul 2016-IEEE Transactions on Biomedical Engineering (IEEE)-Vol. 63, Iss: 7, pp 1455-1462

TL;DR: A dataset of 7909 breast cancer histopathology images acquired on 82 patients, which is now publicly available from http://web.ufpr.br/vri/breast-cancer-database, aimed at automated classification of these images in two classes, which would be a valuable computer-aided diagnosis tool for the clinician.

read less

Abstract: Today, medical image analysis papers require solid experiments to prove the usefulness of proposed methods. However, experiments are often performed on data selected by the researchers, which may come from different institutions, scanners, and populations. Different evaluation measures may be used, making it difficult to compare the methods. In this paper, we introduce a dataset of 7909 breast cancer histopathology images acquired on 82 patients, which is now publicly available from http://web.inf.ufpr.br/vri/breast-cancer-database . The dataset includes both benign and malignant images. The task associated with this dataset is the automated classification of these images in two classes, which would be a valuable computer-aided diagnosis tool for the clinician. In order to assess the difficulty of this task, we show some preliminary results obtained with state-of-the-art image classification systems. The accuracy ranges from 80% to 85%, showing room for improvement is left. By providing this dataset and a standardized evaluation protocol to the scientific community, we hope to gather researchers in both the medical and the machine learning field to advance toward this clinical application.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

A survey of the recent architectures of deep convolutional neural networks

[...]

Asifullah Khan¹, Anabia Sohail¹, Umme Zahoora¹, Aqsa Saeed Qureshi¹•Institutions (1)

Pakistan Institute of Engineering and Applied Sciences¹

01 Dec 2020-Artificial Intelligence Review

TL;DR: Deep Convolutional Neural Networks (CNNs) as mentioned in this paper are a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing.

...read moreread less

Abstract: Deep Convolutional Neural Network (CNN) is a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing. Some of the exciting application areas of CNN include Image Classification and Segmentation, Object Detection, Video Processing, Natural Language Processing, and Speech Recognition. The powerful learning ability of deep CNN is primarily due to the use of multiple feature extraction stages that can automatically learn representations from the data. The availability of a large amount of data and improvement in the hardware technology has accelerated the research in CNNs, and recently interesting deep CNN architectures have been reported. Several inspiring ideas to bring advancements in CNNs have been explored, such as the use of different activation and loss functions, parameter optimization, regularization, and architectural innovations. However, the significant improvement in the representational capacity of the deep CNN is achieved through architectural innovations. Notably, the ideas of exploiting spatial and channel information, depth and width of architecture, and multi-path information processing have gained substantial attention. Similarly, the idea of using a block of layers as a structural unit is also gaining popularity. This survey thus focuses on the intrinsic taxonomy present in the recently reported deep CNN architectures and, consequently, classifies the recent innovations in CNN architectures into seven different categories. These seven categories are based on spatial exploitation, depth, multi-path, width, feature-map exploitation, channel boosting, and attention. Additionally, the elementary understanding of CNN components, current challenges, and applications of CNN are also provided.

...read moreread less

1,328 citations

Proceedings Article•DOI•

Breast cancer histopathological image classification using Convolutional Neural Networks

[...]

Fabio Alexandre Spanhol¹, Luiz S. Oliveira², Caroline Petitjean³, Laurent Heutte³•Institutions (3)

Federal University of Technology - Paraná¹, Federal University of Paraná², University of Rouen³

24 Jul 2016

TL;DR: This method aims to allow using the high-resolution histopathological images from BreaKHis as input to existing CNN, avoiding adaptations of the model that can lead to a more complex and computationally costly architecture.

...read moreread less

Abstract: The performance of most conventional classification systems relies on appropriate data representation and much of the efforts are dedicated to feature engineering, a difficult and time-consuming process that uses prior expert domain knowledge of the data to create useful features. On the other hand, deep learning can extract and organize the discriminative information from the data, not requiring the design of feature extractors by a domain expert. Convolutional Neural Networks (CNNs) are a particular type of deep, feedforward network that have gained attention from research community and industry, achieving empirical successes in tasks such as speech recognition, signal processing, object recognition, natural language processing and transfer learning. In this paper, we conduct some preliminary experiments using the deep learning approach to classify breast cancer histopathological images from BreaKHis, a publicly dataset available at http://web.inf.ufpr.br/vri/breast-cancer-database. We propose a method based on the extraction of image patches for training the CNN and the combination of these patches for final classification. This method aims to allow using the high-resolution histopathological images from BreaKHis as input to existing CNN, avoiding adaptations of the model that can lead to a more complex and computationally costly architecture. The CNN performance is better when compared to previously reported results obtained by other machine learning models trained with hand-crafted textural descriptors. Finally, we also investigate the combination of different CNNs using simple fusion rules, achieving some improvement in recognition rates.

...read moreread less

720 citations

Cites background or methods or result from "A Dataset for Breast Cancer Histopa..."

...[11] introduced a dataset composed of 7,909 breast histopathological images acquired on 82 patients....
[...]
...A set of comprehensive experiments on the BreaKHis dataset proposed in [11] shows that the CNN achieves better results than the best results obtained by the other machine learning models trained with textural...
[...]
...The performance at image level is not reported in [11]....
[...]
...Based on the results presented in [11], it is undeniable...
[...]
...However, on the histopathological images assessed, LeNet classification performance were considerably inferior to our previous results reported in [11], achieving about 72% of accuracy....
[...]

Journal Article•DOI•

A novel deep learning based framework for the detection and classification of breast cancer using transfer learning

[...]

Sana Ullah Khan¹, Naveed Islam¹, Zahoor Jan¹, Ikram Ud Din², Joel J. P. C. Rodrigues³, Joel J. P. C. Rodrigues⁴ - Show less +2 more•Institutions (4)

Islamia College University¹, University of Haripur², Inatel³, Federal University of Piauí⁴

01 Jul 2019-Pattern Recognition Letters

TL;DR: A novel deep learning framework for the detection and classification of breast cancer in breast cytology images using the concept of transfer learning is proposed and it has been observed that the proposed framework outclass all the other deep learning architectures in terms of accuracy in detection and classified of breast tumor in cytological images.

...read moreread less

471 citations

Journal Article•DOI•

Breast Cancer Multi-classification from Histopathological Images with Structured Deep Learning Model.

[...]

Zhongyi Han¹, Benzheng Wei¹, Yuanjie Zheng², Yilong Yin³, Kejian Li¹, Shuo Li⁴ - Show less +2 more•Institutions (4)

Shandong University of Traditional Chinese Medicine¹, Shandong Normal University², Shandong University³, University of Western Ontario⁴

23 Jun 2017-Scientific Reports

TL;DR: The structured deep learning model used in this study has achieved remarkable performance on a large-scale dataset, which demonstrates the strength of the method in providing an efficient tool for breast cancer multi-classification in clinical settings.

...read moreread less

Abstract: Automated breast cancer multi-classification from histopathological images plays a key role in computer-aided breast cancer diagnosis or prognosis. Breast cancer multi-classification is to identify subordinate classes of breast cancer (Ductal carcinoma, Fibroadenoma, Lobular carcinoma, etc.). However, breast cancer multi-classification from histopathological images faces two main challenges from: (1) the great difficulties in breast cancer multi-classification methods contrasting with the classification of binary classes (benign and malignant), and (2) the subtle differences in multiple classes due to the broad variability of high-resolution image appearances, high coherency of cancerous cells, and extensive inhomogeneity of color distribution. Therefore, automated breast cancer multi-classification from histopathological images is of great clinical significance yet has never been explored. Existing works in literature only focus on the binary classification but do not support further breast cancer quantitative assessment. In this study, we propose a breast cancer multi-classification method using a newly proposed deep learning model. The structured deep learning model has achieved remarkable performance (average 93.2% accuracy) on a large-scale dataset, which demonstrates the strength of our method in providing an efficient tool for breast cancer multi-classification in clinical settings.

...read moreread less

425 citations

Posted Content•

A Survey of Deep Active Learning

[...]

Pengzhen Ren, Yun Xiao, Xiaojun Chang, Po-Yao Huang, Zhihui Li, Xiaojiang Chen, Xin Wang - Show less +3 more

30 Aug 2020-arXiv: Learning

TL;DR: A formal classification method for the existing work in deep active learning is provided, along with a comprehensive and systematic overview, to investigate whether AL can be used to reduce the cost of sample annotation while retaining the powerful learning capabilities of DL.

...read moreread less

Abstract: Active learning (AL) attempts to maximize the performance gain of the model by marking the fewest samples. Deep learning (DL) is greedy for data and requires a large amount of data supply to optimize massive parameters, so that the model learns how to extract high-quality features. In recent years, due to the rapid development of internet technology, we are in an era of information torrents and we have massive amounts of data. In this way, DL has aroused strong interest of researchers and has been rapidly developed. Compared with DL, researchers have relatively low interest in AL. This is mainly because before the rise of DL, traditional machine learning requires relatively few labeled samples. Therefore, early AL is difficult to reflect the value it deserves. Although DL has made breakthroughs in various fields, most of this success is due to the publicity of the large number of existing annotation datasets. However, the acquisition of a large number of high-quality annotated datasets consumes a lot of manpower, which is not allowed in some fields that require high expertise, especially in the fields of speech recognition, information extraction, medical images, etc. Therefore, AL has gradually received due attention. A natural idea is whether AL can be used to reduce the cost of sample annotations, while retaining the powerful learning capabilities of DL. Therefore, deep active learning (DAL) has emerged. Although the related research has been quite abundant, it lacks a comprehensive survey of DAL. This article is to fill this gap, we provide a formal classification method for the existing work, and a comprehensive and systematic overview. In addition, we also analyzed and summarized the development of DAL from the perspective of application. Finally, we discussed the confusion and problems in DAL, and gave some possible development directions for DAL.

...read moreread less

372 citations

Cites background from "A Dataset for Breast Cancer Histopa..."

...[30, 55, 66, 186] [123, 187, 206] [36, 181, 205] Erie County [66], EEG [9], BreaKHis [210], SVEB, SVDB [186] Biomedical...
[...]

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187

Collapse

References

PDF

Open Access

More filters

Journal Article•DOI•

Random Forests

[...]

Leo Breiman¹•Institutions (1)

University of California, Berkeley¹

01 Oct 2001

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.

...read moreread less

Abstract: Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, aaa, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.

...read moreread less

79,257 citations

"A Dataset for Breast Cancer Histopa..." refers background in this paper

...The principle behind ensemble methods is that a group of weak learners (in this case the decision trees) can come together to form a strong learner [32]....
[...]

Journal Article•

Scikit-learn: Machine Learning in Python

[...]

Fabian Pedregosa, Gaël Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel¹, Peter Prettenhofer², Ron Weiss³, Vincent Dubourg, Jake Vanderplas⁴, Alexandre Passos⁵, David Cournapeau, Matthieu Brucher⁶, Matthieu Perrot, Edouard Duchesnay - Show less +12 more•Institutions (6)

Kobe University¹, Bauhaus University, Weimar², Google³, University of Washington⁴, University of Massachusetts Amherst⁵, Total S.A.⁶

01 Feb 2011-Journal of Machine Learning Research

TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.

...read moreread less

Abstract: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems. This package focuses on bringing machine learning to non-specialists using a general-purpose high-level language. Emphasis is put on ease of use, performance, documentation, and API consistency. It has minimal dependencies and is distributed under the simplified BSD license, encouraging its use in both academic and commercial settings. Source code, binaries, and documentation can be downloaded from http://scikit-learn.sourceforge.net.

...read moreread less

47,974 citations

"A Dataset for Breast Cancer Histopa..." refers methods in this paper

...All the experiments were carried out using scikit-learn, an opensource machine learning library in Python [33]....
[...]

Journal Article•DOI•

Support-Vector Networks

[...]

Corinna Cortes¹, Vladimir Vapnik¹•Institutions (1)

Bell Labs¹

15 Sep 1995-Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

Abstract: The support-vector network is a new learning machine for two-group classification problems. The machine conceptually implements the following idea: input vectors are non-linearly mapped to a very high-dimension feature space. In this feature space a linear decision surface is constructed. Special properties of the decision surface ensures high generalization ability of the learning machine. The idea behind the support-vector network was previously implemented for the restricted case where the training data can be separated without errors. We here extend this result to non-separable training data. High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated. We also compare the performance of the support-vector network to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

37,861 citations

"A Dataset for Breast Cancer Histopa..." refers background in this paper

...Differently from other linear discriminant functions, it provides the optimal hyperplane that separates two classes [31]....
[...]

Posted Content•

Scikit-learn: Machine Learning in Python

[...]

Fabian Pedregosa¹, Gaël Varoquaux¹, Alexandre Gramfort¹, Vincent Michel¹, Bertrand Thirion¹, Olivier Grisel, Mathieu Blondel, Andreas Müller², Joel Nothman, Gilles Louppe², Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake Vanderplas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, Edouard Duchesnay - Show less +15 more•Institutions (2)

French Institute for Research in Computer Science and Automation¹, University of Liège²

02 Jan 2012-arXiv: Learning

TL;DR: Scikit-learn as mentioned in this paper is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems.

...read moreread less

28,898 citations

Journal Article•DOI•

Textural Features for Image Classification

[...]

Robert M. Haralick, K. Shanmugam¹, Its'hak Dinstein²•Institutions (2)

Wichita State University¹, University of Kansas²

01 Nov 1973

TL;DR: These results indicate that the easily computable textural features based on gray-tone spatial dependancies probably have a general applicability for a wide variety of image-classification applications.

...read moreread less

Abstract: Texture is one of the important characteristics used in identifying objects or regions of interest in an image, whether the image be a photomicrograph, an aerial photograph, or a satellite image. This paper describes some easily computable textural features based on gray-tone spatial dependancies, and illustrates their application in category-identification tasks of three different kinds of image data: photomicrographs of five kinds of sandstones, 1:20 000 panchromatic aerial photographs of eight land-use categories, and Earth Resources Technology Satellite (ERTS) multispecial imagery containing seven land-use categories. We use two kinds of decision rules: one for which the decision regions are convex polyhedra (a piecewise linear decision rule), and one for which the decision regions are rectangular parallelpipeds (a min-max decision rule). In each experiment the data set was divided into two parts, a training set and a test set. Test set identification accuracy is 89 percent for the photomicrographs, 82 percent for the aerial photographic imagery, and 83 percent for the satellite imagery. These results indicate that the easily computable textural features probably have a general applicability for a wide variety of image-classification applications.

...read moreread less

20,442 citations

"A Dataset for Breast Cancer Histopa..." refers background or methods in this paper

...GLCM are widely used to characterize texture images....
[...]
...Note, however, that the 200× magnification factor also shows high potential, with the best results over GLCM and PFTAS, higher than those obtained with the 40× level....
[...]
...On the GLCM, 13 Haralick parameters are computed [20]: angular second moment, contrast, correlation, sum of squares, variance, inverse difference moment, sum average, sum variance, sum entropy, entropy, difference variance, difference entropy, information measures of correlation 1, and information measures of correlation 2....
[...]
...These include the textural descriptors most commonly found in the literature, such as local binary patterns (LBP) [17], completed LBP (CLBP) [18], local phase quantization (LPQ) [19], gray-level co-occurrence matrix (GLCM) [20], threshold adjacency statistics (TAS) [21], and one keypoint descriptor, named ORB [22]....
[...]
...In our experiments, four adjacency directions 0◦, 45◦, 90◦, 135◦, and eight gray levels are used to compute the GLCM....
[...]