| Literature DB >> 35673821 |
Raluca Maria Bungărdean1, Mircea Sebastian Şerbănescu, Costin Teodor Streba, Maria Crişan.
Abstract
Establishing basal cell carcinoma (BCC) subtype is sometimes challenging for pathologists. Deep-learning (DL) algorithms are an emerging approach in image classification due to their performance, accompanied by a new concept - transfer learning, which implies replacing the final layers of a trained network and retraining it for a new task, while keeping the weights from the imported layers. A DL convolution-based software, capable of classifying 10 subtypes of BCC, was designed. Transfer learning from three general-purpose image classification networks (AlexNet, GoogLeNet, and ResNet-18) was used. Three pathologists independently labeled 2249 patches. Ninety percent of data was used for training and 10% for testing on 100 independent training sequences. Each of the resulted networks independently labeled the whole dataset. Mean and standard deviation (SD) accuracy (ACC) [%]∕sensitivity (SN) [%]∕specificity (SP) [%]∕area under the curve (AUC) for all the networks was 82.53±2.63∕72.52±3.63∕97.94±0.3/0.99. The software was validated on another 50-image dataset, and its results are comparable with the result of three pathologists in terms of agreement. All networks had similar classification accuracies, which demonstrated that they reached a maximum classification rate on the dataset. The software shows promising results, and with further development can be successfully used on histological images, assisting pathologists' diagnosis and teaching.Entities:
Mesh:
Year: 2021 PMID: 35673821 PMCID: PMC9289702 DOI: 10.47162/RJME.62.4.14
Source DB: PubMed Journal: Rom J Morphol Embryol ISSN: 1220-0522 Impact factor: 0.833
State of the art results on BCC image classification/segmentation. The default target is classification unless otherwise specified
|
|
|
|
|
|
|
van Zon |
U-Net for segmentation Proprietary for classification |
171 WSI |
Segmentation & classification Two classes: ▪ BCC; ▪ Normal. |
Dice score 0.66 AUC 90% |
|
Campanella |
ResNet34 |
9962 WSI |
Two classes: ▪ BCC; ▪ Normal. |
AUC 98% |
|
Jiang |
GoogleNet inception v3 |
6610 images |
Two classes: ▪ BCC; ▪ Normal. |
The ‘cascade’ framework SN 93% SP 91% |
|
The ‘segmentation’ framework SN 97% SP 94% AUC 98% | ||||
|
Cruz-Roa |
Proprietary |
1417 image patches |
Two classes: ▪ BCC; ▪ Normal. |
ACC 90% Precision 0.876 SN 86% SP 92% Balanced ACC 89% |
|
Santilli |
Proprietary |
190 iKnife scans |
Two classes: ▪ BCC; ▪ Normal. |
ACC 96.62% SN 100% SP 95% |
|
Kimeswenger |
VGG11 |
820 WSI |
Segmentation Two classes: ▪ BCC; ▪ Normal. |
SN 95% SP 95% |
|
Putten |
Proprietary ResNet-based |
|
Two classes: ▪ BCC; ▪ Normal. |
SN 96% SP 89% |
|
Arevalo |
Proprietary |
1417 images |
Segmentation Two classes: ▪ BCC; ▪ Normal. |
AUC 0.981 |
|
Esteva |
Inception v3 |
129 450 dermatoscopy images |
Three classes |
ACC 72% |
|
Nine classes |
ACC 55% | |||
|
Hosny |
AlexNet |
100 015 dermatoscopy images |
Seven classes |
ACC 98.70% SN 95.60% SP 99.27% Precision 95.06% |
|
Jinnai |
VGG-16 |
5846 clinical images |
Seven classes: ▪ Malignant melanoma; ▪ BCC; ▪ Nevus; ▪ Seborrheic keratosis; ▪ Senile lentigo; ▪ Hematoma; ▪ Hemangioma. |
ACC 86.2% |
|
Two classes: ▪ Malignant; ▪ Normal. |
ACC 91.5% SN 83.3% SP 94.5% | |||
|
Thomas |
ResNet50 |
290 WSI |
Four classes: ▪ BCC; ▪ Squamous cell carcinoma; ▪ Intraepidermal carcinoma; ▪ Normal. |
ACC 93.6% |
|
Two classes: ▪ Malignant; ▪ Normal. |
ACC 97.9% |
ACC: Accuracy; AUC: Area under the curve; BCC: Basal cell carcinoma; ISIC: International Skin Imaging Collaboration; SN: Sensitivity; SP: Specificity; WSI: Whole-slide imaging
Figure 1Dataset samples: (a) HR infiltrating and sclerosing; (b) HR micronodular; (c) HR superficial multifocal; (d) LR with adnexal differentiation; (e) LR nodular; (f) LR nodular adenoid variant; (g) LR nodular keratotic variant; (h) LR nodular nodulocystic variant; (i) LR pigmented; (j) LR superficial unifocal. HE staining: (a–j) ×200. HE: Hematoxylin–Eosin; HR: High risk; LR: Low risk
Network ACC [%] performance assessment. Kolmogorov–Smirnov with p-level, Lilliefors p, Shapiro–Wilk W with p-level
|
|
|
|
|
|
ACC [%] |
82.16±2.60 |
83.03±2.67 |
82.42±2.61 |
|
Kolmogorov–Smirnov |
0.12 |
0.10 |
0.09 |
|
|
<0.01 |
<0.01 |
0.03 |
|
Lilliefors |
<0.01 |
0.01 |
0.04 |
|
Shapiro–Wilk |
0.97 |
0.98 |
0.99 |
|
|
0.02 |
0.16 |
0.36 |
ACC: Accuracy
Statistical assessment of mean ACC. Post-hoc Tukey’s test p-value
|
|
|
|
|
AlexNet |
0.05 |
0.77 |
|
GoogLeNet |
|
0.23 |
ACC: Accuracy
Network SN [%] performance assessment
|
|
|
|
|
|
SN [%] |
72.74±3.74 |
72.90±3.29 |
71.93±3.88 |
SN: Sensitivity
Statistical assessment of mean SN. Post-hoc Tukey’s test p-value
|
|
|
|
|
AlexNet |
0.89 |
0.25 |
|
GoogLeNet |
|
0.14 |
SN: Sensitivity
Network SP [%] performance assessment
|
|
|
|
|
|
SP [%] |
97.90±0.30 |
98.01±0.31 |
97.93±0.31 |
SP: Specificity
Statistical assessment of mean SP. Post-hoc Tukey’s test p-value
|
|
|
|
|
AlexNet |
0.03 |
0.76 |
|
GoogLeNet |
|
0.15 |
SP: Specificity
Network AUC performance assessment
|
|
|
|
|
|
AUC |
0.9946±0.0031 |
0.9956±0.0025 |
0.9931±0.0045 |
AUC: Area under the curve
Statistical assessment of mean AUC. Post-hoc Tukey’s test p-value
|
|
|
|
|
AlexNet |
0.08 |
0.01 |
|
GoogLeNet |
|
0.001 |
AUC: Area under the curve
Confusion matrix of the best performing model of AlexNet
|
|
| ||||||||||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
381 |
1 |
0 |
0 |
1 |
0 |
0 |
1 |
0 |
0 |
|
|
1 |
309 |
0 |
3 |
3 |
0 |
0 |
0 |
0 |
0 | |
|
|
1 |
0 |
22 |
0 |
0 |
0 |
0 |
0 |
0 |
2 | |
|
|
0 |
1 |
0 |
75 |
1 |
1 |
0 |
0 |
0 |
0 | |
|
|
0 |
3 |
0 |
0 |
502 |
0 |
0 |
1 |
2 |
0 | |
|
|
0 |
3 |
0 |
3 |
5 |
204 |
0 |
2 |
1 |
0 | |
|
|
2 |
1 |
0 |
0 |
2 |
0 |
95 |
0 |
1 |
0 | |
|
|
0 |
1 |
0 |
0 |
10 |
2 |
0 |
110 |
3 |
0 | |
|
|
1 |
7 |
0 |
0 |
18 |
4 |
0 |
0 |
355 |
1 | |
|
|
0 |
0 |
1 |
0 |
0 |
0 |
0 |
0 |
1 |
125 | |
Class codes are 1: HR infiltrating and sclerosing, 2: HR micronodular, 3: HR superficial multifocal, 4: LR adnexal, 5: LR nodular, 6: LR nodular adenoid, 7: LR nodular keratotic, 8: LR nodular nodulocystic, 9: LR pigmented, 10: LR superficial unifocal. HR: High risk; LR: Low risk
Confusion matrix of the best performing model of GoogLeNet
|
|
| ||||||||||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
383 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
1 |
0 |
|
|
7 |
305 |
0 |
3 |
0 |
0 |
1 |
0 |
0 |
0 | |
|
|
0 |
1 |
13 |
0 |
1 |
0 |
0 |
0 |
0 |
10 | |
|
|
0 |
4 |
0 |
73 |
0 |
1 |
0 |
0 |
0 |
0 | |
|
|
2 |
3 |
0 |
0 |
496 |
0 |
0 |
1 |
5 |
1 | |
|
|
0 |
0 |
0 |
1 |
5 |
208 |
0 |
0 |
4 |
0 | |
|
|
3 |
1 |
0 |
0 |
1 |
1 |
95 |
0 |
0 |
0 | |
|
|
1 |
0 |
0 |
0 |
8 |
5 |
1 |
110 |
1 |
0 | |
|
|
2 |
5 |
0 |
1 |
9 |
10 |
0 |
2 |
234 |
3 | |
|
|
0 |
1 |
0 |
0 |
4 |
0 |
0 |
0 |
1 |
121 | |
Class codes are 1: HR infiltrating and sclerosing, 2: HR micronodular, 3: HR superficial multifocal, 4: LR adnexal, 5: LR nodular, 6: LR nodular adenoid, 7: LR nodular keratotic, 8: LR nodular nodulocystic, 9: LR pigmented, 10: LR superficial unifocal. HR: High risk; LR: Low risk.
Confusion matrix of the best performing model of ResNet-18
|
|
| ||||||||||
|
|
|
|
|
|
|
|
|
|
| ||
|
|
|
382 |
0 |
0 |
1 |
1 |
0 |
0 |
0 |
0 |
0 |
|
|
4 |
310 |
0 |
0 |
1 |
0 |
1 |
0 |
0 |
0 | |
|
|
0 |
0 |
14 |
0 |
0 |
0 |
1 |
1 |
1 |
8 | |
|
|
3 |
0 |
0 |
74 |
1 |
0 |
0 |
0 |
0 |
0 | |
|
|
0 |
2 |
0 |
0 |
501 |
0 |
0 |
3 |
2 |
0 | |
|
|
0 |
0 |
0 |
1 |
2 |
211 |
0 |
1 |
3 |
0 | |
|
|
2 |
1 |
0 |
0 |
2 |
0 |
95 |
1 |
0 |
0 | |
|
|
0 |
0 |
0 |
0 |
6 |
1 |
0 |
118 |
1 |
0 | |
|
|
2 |
1 |
0 |
0 |
6 |
4 |
1 |
0 |
351 |
1 | |
|
|
1 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
126 | |
Class codes are 1: HR infiltrating and sclerosing, 2: HR micronodular, 3: HR superficial multifocal, 4: LR adnexal, 5: LR nodular, 6: LR nodular adenoid, 7: LR nodular keratotic, 8: LR nodular nodulocystic, 9: LR pigmented, 10: LR superficial unifocal. HR: High risk; LR: Low risk
Confusion matrices of the best performing model of each architecture on HR and LR class prediction
|
|
|
|
| |||
|
|
|
|
|
|
| |
|
|
715 |
10 |
709 |
16 |
710 |
15 |
|
|
20 |
1504 |
22 |
1502 |
12 |
1512 |
|
|
|
|
| |||
HR: High risk; LR: Low risk
Figure 2Standalone application. Running the “Live” mode of a new image
Clinical evaluation assessment: Cohen’s kappa (κ) coefficient. P1–P3 represent the three panel pathologists
|
|
|
|
|
|
|
|
|
P1 |
1 |
0.7570 |
0.6815 |
0.7004 |
0.7098 |
0.6197 |
|
P2 |
|
1 |
0.7481 |
0.7331 |
0.7421 |
0.6817 |
|
P3 |
|
|
1 |
0.7214 |
0.6703 |
0.6068 |
|
AlexNet |
|
|
|
1 |
0.8289 |
0.6837 |
|
GoogLeNet |
|
|
|
|
1 |
0.7475 |
|
ResNet-18 |
|
|
|
|
|
1 |