| Literature DB >> 34367687 |
Kamran Kowsari1,2,3, Rasoul Sali1, Lubaina Ehsan4, William Adorno1, Asad Ali5, Sean Moore4, Beatrice Amadi6, Paul Kelly6,7, Sana Syed4,5,8, Donald Brown1,8.
Abstract
Image classification is central to the big data revolution in medicine. Improved information processing methods for diagnosis and classification of digital medical images have shown to be successful via deep learning approaches. As this field is explored, there are limitations to the performance of traditional supervised classifiers. This paper outlines an approach that is different from the current medical image classification tasks that view the issue as multi-class classification. We performed a hierarchical classification using our Hierarchical Medical Image classification (HMIC) approach. HMIC uses stacks of deep learning models to give particular comprehension at each level of the clinical picture hierarchy. For testing our performance, we use biopsy of the small bowel images that contain three categories in the parent level (Celiac Disease, Environmental Enteropathy, and histologically normal controls). For the child level, Celiac Disease Severity is classified into 4 classes (I, IIIa, IIIb, and IIIC).Entities:
Keywords: deep Learning; hierarchical classification; hierarchical medical image classification; medical imaging
Year: 2020 PMID: 34367687 PMCID: PMC8346231 DOI: 10.3390/info11060318
Source DB: PubMed Journal: Information (Basel) ISSN: 2078-2489
Figure 1.HMIC: Hierarchical Medical Image Classification.
Population results of biopsies dataset.
| Total Population | Pakistan | Zambia | US | ||
|---|---|---|---|---|---|
| Data | 150 | EE (n = 10) | EE (n = 16) | Celiac (n = 63) | Normal (n = 61) |
| Biopsy Images | 461 | 29 | 19 | 239 | 174 |
| Age, median (IQR), months | 37.5 (19.0 to 121.5) | 22.2 (20.8 to 23.4) | 16.5 (9.5 to 21.0) | 130.0 (85.0 to 176.0) | 25.0 (16.5 to 41.0) |
| Gender, n (%) | M = 77 (%51.3) | M = 5 (%50) | M = 10 (%62.5) | M = 29 (%46) | M = 33 (%54) |
| LAZ/ HAZ, median (IQR) | −0.6 (−1.9 to 0.4) | −2.8 (−3.6 to −2.3) | −3.1 (−4.1 to −2.2) | −0.3 (−0.8 to 0.7) | −0.2 (−1.3 to 0.5) |
Dataset used for Hierarchical Medical Image Classification (HMIC).
| Data | Train | Test | Total | ||||
|---|---|---|---|---|---|---|---|
| Normal | 22,676 | 9717 | 32,393 | ||||
| Environmental Enteropathy | 20,516 | 8792 | 29,308 | ||||
| Parent | Child | Parent | Child | Parent | Child | ||
| I | 21,140 | 4988 | 9058 | 2137 | 30,198 | 7125 | |
| IIIa | 4790 | 2052 | 6842 | ||||
| IIIb | 5684 | 2436 | 8120 | ||||
| IIIc | 5678 | 2433 | 8111 | ||||
Figure 2.Pipeline of patching and applying an autoencoder to find useful patches for the training model. The biopsy images are very large, so we need to divide into smaller patches to be used in the machine learning model. As you can see in the image, many of these patches are empty. After using an autoencoder, we can apply a clustering algorithm to discard useless patches (green patches contain useful information, while red patches do not).
Figure 3.Example autoencoder architecture with K-means applied on the bottle-neck layer feature vector to cluster useful and not useful patches.
Figure 4.Some samples of clustering results—cluster 1 includes patches with useful information and cluster 2 includes patches without useful information (mostly created from background parts of WSIs).
Figure 5.Color Balancing samples for the three classes.
Figure 6.Stain normalization results when using the method proposed by Vahadane et al. [42]. Images in the first row represent the source images. The source images are normalized images to the stain appearance of the target image in second row [1].
Figure 7.Structure of Convolutional Neural Net using multiple 2D feature detectors and 2D max-pooling.
Result of parent level classifications for normal, environmental enteropathy, and Celiac disease.
| Precision | Recall | F1-Score | |
|---|---|---|---|
| Normal | 89.97 ± 0.59 | 89.35 ± 0.61 | 89.66 ± 0.60 |
| Environmental Enteropathy | 94.02 ± 0.49 | 97.30 ± 0.33 | 95.63 ± 0.42 |
| Celiac Disease | 91.12 ± 0.32 | 88.71 ± 0.35 | 89.90 ± 1.27 |
Results of HMIC with comparison with our baseline.
| Model | Precision | Recall | F1-Score | |
|---|---|---|---|---|
| Baseline | CNN | 76.76 ± 0.49 | 80.18 ± 0.47 | 78.43 ± 0.48 |
| Multilayer perceptron | 76.19 ± 0.50 | 79.40 ± 0.47 | 77.76 ± 0.49 | |
| Deep CNN | 82.95 ± 0.44 | 87.28 ± 0.39 | 85.06 ± 0.42 | |
| HMIC | Non Whole slide | 84.13 ± 0.37 | 93.56 ± 0.29 | 88.61 ± 0.37 |
| Whole slide |
Results per-classed of HMIC with comparison with our baseline.
| Model | Precision | Recall | F1-Score | |||
|---|---|---|---|---|---|---|
| Baseline | CNN | Normal | 87.83 ± 0.57 | 90.77 ± 0.65 | 89.28 ± 0.61 | |
| Environmental Enteropathy | 90.93 ± 0.61 | 82.48 ± 0.79 | 86.50 ± 0.71 | |||
| Celiac Disease | I | 68.37 ± 1.98 | 68.62 ± 1.96 | 68.50 ± 1.96 | ||
| IIIa | 56.26 ± 1.01 | 56.26 ± 2.21 | 59.29 ± 1.95 | |||
| IIIb | 65.28 ± 0.97 | 98.28 ± 2.01 | 66.64 ± 1.87 | |||
| IIIc | 62.66 ± 1.99 | 66.83 ± 1.99 | 64.68 ± 2.02 | |||
| Multilayer perceptron | Normal | 87.97 ± 0.76 | 81.87 ± 0.76 | 84.81 ± 0.71 | ||
| Environmental Enteropathy | 87.25 ± 0.69 | 90.18 ± 0.62 | 88.69 ± 0.66 | |||
| Celiac Disease | I | 57.92 ± 2.07 | 60.74 ± 2.07 | 59.30 ± 2.09 | ||
| IIIa | 62.58 ± 2.09 | 62.18 ± 2.09 | 60.89 ± 2.11 | |||
| IIIb | 65.00 ± 1.89 | 66.09 ± 1.87 | 65.56 ± 1.88 | |||
| IIIc | 67.97 ± 1.85 | 74.85 ± 1.72 | 71.24 ± 1.78 | |||
| DCNN | Normal | 95.14 ± 0.42 | 94.91 ± 0.43 | 95.14 ± 0.42 | ||
| Environmental Enteropathy | 92.22 ± 0.55 | 90.62 ± 0.60 | 91.52 ± 0.58 | |||
| Celiac Disease | I | 75.41 ± 1.82 | 72.63 ± 1.89 | 73.99 ± 1.85 | ||
| IIIa | 70.81 ± 1.92 | 72.47 ± 1.93 | 71.63 ± 1.79 | |||
| IIIb | 81.08 ± 0.81 | 74.67 ± 1.84 | 77.74 ± 1.65 | |||
| IIIc | 75.07 ± 1.83 | 76.37 ± 1.81 | 75.71 ± 1.81 | |||
| HMIC | Non Whole Slide | Normal | 89.97 ± 0.59 | 89.35 ± 0.61 | 89.66 ± 0.61 | |
| Environmental Enteropathy | 94.02 ± 0.49 | 97.30 ± 0.33 | 95.63 ± 0.33 | |||
| Celiac Disease | I | 83.25 ± 1.58 | 80.91 ± 1.66 | 82.06 ± 1.62 | ||
| IIIa | 80.34 ± 1.62 | 80.46 ± 1.71 | 80.40 ± 1.57 | |||
| IIIb | 85.35 ± 1.49 | 81.77 ± 1.67 | 83.52 ± 1.47 | |||
| IIIc | 85.54 ± 1.49 | 82.71 ± 1.60 | 84.10 ± 1.55 | |||
| Whole Slide | Normal | 90.64 ± 0.57 | 90.06 ± 0.57 | 90.35 ± 0.58 | ||
| Environmental Enteropathy | 94.08 ± 0.49 | 97.33 ± 0.42 | 98.68 ± 0.42 | |||
| Celiac Disease | I | 88.73 ± 1.34 | 85.07 ± 1.51 | 86.86 ± 1.43 | ||
| IIIa | 81.19 ± 1.65 | 81.19 ± 1.65 | 82.44 ± 1.51 | |||
| IIIb | 90.51 ± 1.24 | 90.48 ± 1.27 | 90.49 ± 1.16 | |||
| IIIc | 89.26 ± 1.31 | 90.18 ± 1.26 | 89.72 ± 1.28 | |||
Figure 8.Grad-CAM results for showing feature importance.