| Literature DB >> 28570557 |
Teresa Araújo1,2, Guilherme Aresta1,2, Eduardo Castro1, José Rouco2, Paulo Aguiar3,4, Catarina Eloy5,6, António Polónia5,6, Aurélio Campilho1,2.
Abstract
Breast cancer is one of the main causes of cancer death worldwide. The diagnosis of biopsy tissue with hematoxylin and eosin stained images is non-trivial and specialists often disagree on the final diagnosis. Computer-aided Diagnosis systems contribute to reduce the cost and increase the efficiency of this process. Conventional classification approaches rely on feature extraction methods designed for a specific problem based on field-knowledge. To overcome the many difficulties of the feature-based approaches, deep learning methods are becoming important alternatives. A method for the classification of hematoxylin and eosin stained breast biopsy images using Convolutional Neural Networks (CNNs) is proposed. Images are classified in four classes, normal tissue, benign lesion, in situ carcinoma and invasive carcinoma, and in two classes, carcinoma and non-carcinoma. The architecture of the network is designed to retrieve information at different scales, including both nuclei and overall tissue organization. This design allows the extension of the proposed system to whole-slide histology images. The features extracted by the CNN are also used for training a Support Vector Machine classifier. Accuracies of 77.8% for four class and 83.3% for carcinoma/non-carcinoma are achieved. The sensitivity of our method for cancer cases is 95.6%.Entities:
Mesh:
Year: 2017 PMID: 28570557 PMCID: PMC5453426 DOI: 10.1371/journal.pone.0177544
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Examples of microscopy image patches from the used dataset [3].
Nuclei and cytoplasm appear purple and pinkish, respectively, due to the hematoxylin and eosin staining. A normal tissue; B benign abnormality; C malignant carcinoma in situ; D malignant invasive carcinoma.
Fig 2Histology image normalization.
A and C original images; B and D images after normalization.
Proposed Convolutional Neural Network architecture.
Left side notes show the histological association with the network layers: A—edges; B—nuclei; C—nuclei organization; D—structure and tissue organization.
| Layer type | Maps & Neurons | Filter size | Effective receptive field | Effective receptive field ( | |||
|---|---|---|---|---|---|---|---|
| 0 | Input | 3M × 512 × 512N | 1 × 1 | 0.4 × 0.4 | |||
| 1 | Convolutional | 16M × 510 × 510N | 3 × 3 | 3 × 3 | 1 × 1 | ||
| 2 | Max-pooling | 16M × 170 × 170N | 3 × 3 | 5 × 5 | 2 × 2 | ||
| 3 | Convolutional | 32M × 168 × 168N | 3 × 3 | 11 × 11 | 4.6 × 4.6 | ||
| 4 | Max-pooling | 32M × 84 × 84N | 2 × 2 | 14 × 14 | 5.9 × 5.9 | ||
| 5 | Convolutional | 64M × 84 × 84N | 3 × 3 | 26 × 26 | 11 × 11 | ||
| 6 | Max-pooling | 64M × 42 × 42N | 2 × 2 | 32 × 32 | 13 × 13 | ||
| 7 | Convolutional | 64M × 42 × 42N | 3 × 3 | 56 × 56 | 24 × 24 | ||
| 8 | Max-pooling | 64M × 14 × 14N | 3 × 3 | 80 × 80 | 34 × 34 | ||
| 9 | Convolutional | 32M × 12 × 12N | 3 × 3 | 152 × 152 | 63.8 × 63.8 | ||
| 10 | Max-pooling | 32M × 12 × 12N | 3 × 3 | 224 × 224 | 94.1 × 94.1 | ||
| 11 | Fully-connected | 256N | 512 × 512 | 215 × 215 | |||
| 12 | Fully-connected | 128N | 512 × 512 | 215 × 215 | |||
| 13 | Fully-connected | 4N | 512 × 512 | 215 × 215 | |||
Fig 3Convolutional Neural Network architecture, according to Table 1.
The original image has 512 × 512 pixels and 3 RGB channels. Orange and purple squares represent the convolutional and max-pooling kernels, respectively.
Number of images (and patches) used for performance evaluation.
A total of 36 images and 512 patches are considered.
| Dataset | non-carcinoma | carcinoma | ||
|---|---|---|---|---|
| Normal | Benign | Invasive | ||
| 10 (120) | 10 (120) | |||
| 5 (60) | 5 (60) | 5 (60) | 5 (60) | |
| 8 (96) | 8 (96) | |||
| 4 (48) | 4 (48) | 4 (48) | 4 (48) | |
| 18 (216) | 18 (216) | |||
| 9 (108) | 9 (108) | 9 (108) | 9 (108) | |
Patch-wise accuracy (%) (2 and 4 classes).
| Classifier | No classes | Initial | Extended | Overall |
|---|---|---|---|---|
| 4 | 72.5 | 59.4 | 66.7 | |
| 2 | 80.4 | 74.0 | 77.6 | |
| 4 | 72.9 | 55.2 | 65.0 | |
| 2 | 82.9 | 69.3 | 76.9 |
Patch-wise sensitivity (%) (2 and 4 classes).
| Dataset | Classifier | non-carcinoma | carcinoma | ||
|---|---|---|---|---|---|
| Normal | Benign | Invasive | |||
| 69.2 | 91.7 | ||||
| 61.7 | 56.7 | 83.3 | 88.3 | ||
| 76.7 | 89.2 | ||||
| 65.0 | 61.7 | 76.7 | 88.3 | ||
| 81.3 | 66.7 | ||||
| 50 | 72.9 | 58.3 | 56.3 | ||
| 82.3 | 56.3 | ||||
| 54.2 | 66.7 | 43.8 | 56.3 | ||
| 74.5 | 80.6 | ||||
| 56.4 | 63.9 | 72.2 | 74.1 | ||
| 79.2 | 74.5 | ||||
| 60.2 | 63.9 | 62.0 | 74.1 | ||
Image-wise accuracy (%) using different voting rules (2 and 4 classes).
| Classif. | Vote | 4 Classes | 2 Classes | ||||
|---|---|---|---|---|---|---|---|
| Init. | Exten. | Overall | Init. | Exten. | Overall | ||
| Maj. | 80.0 | 75.0 | 77.8 | 80.0 | 81.3 | 80.6 | |
| Max. | 80.0 | 62.5 | 72.2 | 80.0 | 75.0 | 77.8 | |
| Sum | 80.0 | 68.8 | 75.0 | 80.0 | 75.0 | 77.8 | |
| Maj. | 85.0 | 68.8 | 77.8 | 90.0 | 75.0 | 83.3 | |
| Max. | 80.0 | 62.5 | 72.2 | 80.0 | 75.0 | 77.8 | |
| Sum | 85.0 | 68.8 | 77.8 | 90.0 | 75.0 | 83.3 | |
Image-wise sensitivity (%) using majority voting (2 and 4 classes).
| Dataset | Classifier | non-carcinoma | carcinoma | ||
|---|---|---|---|---|---|
| Normal | Benign | Invasive | |||
| 70 | 90 | ||||
| 80 | 40 | 100 | 100 | ||
| 80 | 100 | ||||
| 80 | 60 | 100 | 100 | ||
| 50 | 100 | ||||
| 75 | 75 | 75 | 75 | ||
| 50 | 90 | ||||
| 75 | 75 | 50 | 75 | ||
| 61.1 | 94.4 | ||||
| 77.8 | 55.6 | 88.9 | 88.9 | ||
| 66.7 | 95.6 | ||||
| 77.8 | 66.7 | 77.8 | 88.9 | ||
Fig 4Activation examples for the first (A, B) and second (C) layers of the Convolutional Neural Network.
Different structures with diagnostic relevance are analyzed.
Fig 52D projection of the training patches and their activations on different layers of the CNN using t-SNE [29].
A training patches; B last convolutional layer; C second fully-connected layer. Diamond shapes represent test images.