| Literature DB >> 33937279 |
Chen-Yu Zhu1, Yu-Kun Wang1, Hai-Peng Chen2, Kun-Lun Gao2, Chang Shu1, Jun-Cheng Wang1, Li-Feng Yan2, Yi-Guang Yang3, Feng-Ying Xie3, Jie Liu1.
Abstract
Background: Numerous studies have attempted to apply artificial intelligence (AI) in the dermatological field, mainly on the classification and segmentation of various dermatoses. However, researches under real clinical settings are scarce.Entities:
Keywords: artificial intelligence; convolutional neural networks; deep learning; dermatology; dermoscopy; skin diseases; skin imaging
Year: 2021 PMID: 33937279 PMCID: PMC8085301 DOI: 10.3389/fmed.2021.626369
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Dataset overview.
| Numbers included in study ( | 13,603 | 2,538 | 200 | 200 |
| Mean age (years), mean ± SD (range) | - | 57.38 ± 16.81 | - | 43.55 ± 19.68 |
| Female, | - | 1,532 (60.36) | - | 123 (61.50) |
| Lichen planus, | 804 (5.91) | 126 (4.96) | - | - |
| Rosacea, | 597 (4.39) | 80 (3.15) | 25 (12.5) | 25 (12.5) |
| Viral warts, | 1,110(8.16) | 298 (11.74) | 25 (12.5) | 25 (12.5) |
| Acne vulgaris, | 2,023(14.87) | 277 (10.91) | - | - |
| Keloid and hypertrophic scar, | 438 (3.22) | 96 (3.78) | - | - |
| Eczema/dermatitis, | 2,440 (17.94) | 419 (16.51) | 25 (12.5) | 25 (12.5) |
| Dermatofibroma, | 343 (2.52) | 116 (4.57) | - | - |
| Seborrheic dermatitis, | 767 (5.64) | 124 (4.89) | 25 (12.5) | 25 (12.5) |
| Seborrheic keratosis, | 553 (4.07) | 143 (5.63) | 25 (12.5) | 25 (12.5) |
| Melanocytic nevus, | 1,214 (8.92) | 345 (13.59) | 25 (12.5) | 25 (12.5) |
| Hemangioma, | 200 (14.70) | 61 (2.40) | - | - |
| Psoriasis, | 1,707 (12.55) | 234 (9.22) | 25 (12.5) | 25 (12.5) |
| Port wine stain, | 920 (6.76) | 112 (4.41) | - | - |
| Basal cell carcinoma, | 487 (3.58) | 107 (4.22) | 25 (12.5) | 25 (12.5) |
Figure 1Original efficientnet-b4 architecture.
Figure 2Our modified (19) EFFICIENTNET-b4 architecture. Data flow is from left to right: a dermoscopic image is put into the network and finally transformed into a probability distribution over clinical classes of skin disease using our modified EfficientNet-b4 architecture pretrained on the ImageNet dataset and fine-tuned on our own dataset of 13,603 dermoscopic images in 14 categories.
Figure 3Saliency maps for 14 example images from validation set. Saliency maps for example images from each of the 14 disease classes of the validation set reveal the pixels that most influence a CNN's prediction. Saliency maps show the pixel gradients with respect to the CNN's loss function.
Figure 4Sensitivity and specificity of our model. As a result, our model had an overall sensitivity of 93.38 ± 0.08% and specificity 94.85 ± 0.05%.
The classification accuracy, sensitivity and specificity of the proposed CNN model according to disease category.
| Overall | 0.948 | 0.934 | 0.950 |
| Lichen planus | 0.969 | 0.873 | 0.975 |
| Rosacea | 0.959 | 0.920 | 0.961 |
| Viral warts | 0.932 | 0.944 | 0.930 |
| Acne vulgaris | 0.935 | 0.974 | 0.927 |
| Keloid and hypertrophic scar | 0.969 | 0.934 | 0.970 |
| Eczema/dermatitis | 0.877 | 0.924 | 0.866 |
| Dermatofibroma | 0.987 | 1.000 | 0.987 |
| Seborrheic dermatitis | 0.926 | 0.811 | 0.933 |
| Seborrheic keratosis | 0.953 | 0.858 | 0.957 |
| Melanocytic nevus | 0.960 | 0.961 | 0.960 |
| Hemangioma | 0.984 | 0.975 | 0.984 |
| Psoriasis | 0.886 | 0.920 | 0.882 |
| Port wine stain | 0.963 | 0.989 | 0.961 |
| Basal cell carcinoma | 0.979 | 0.971 | 0.979 |
Figure 5Disease classification performance of the proposed model. We fix a threshold probability t such that the prediction y for any image is y = P ≥ t, and the curve is drawn by sweeping t in the interval 0–1. The AUC is the CNN's measure of performance, with a maximum value of 1. Our model achieves 0.985 AUC in 14-way classification. (A) The full view of the ROC curve of the proposed model. (B) The local enlarged image of the ROC curve between abscissa 0~0.2.
Figure 6Confusion matrix of the classification result. Element (i, j) of each confusion matrix represents the empirical probability of predicting class i given that the ground truth was class j, with i and j referencing classes from Table 1. Light red means low percentage and deep red represents high percentage.
Figure 7The ROC curves and the AUC value of our model, Inception-v3, ResNet-101 and the original EfficientNet-b4. Our model outperforms other reported methods for this skin disease diagnosis problem. (A) The full view of the ROC curve of the proposed model. (B) The local enlarged image of the ROC curve between abscissa 0~0.2.
The detailed overall sensitivity, specificity and accuracy of our and other 3 CNN models.
| Sensitivity | 0.934 | 0.882 | 0.919 | 0.890 |
| Specificity | 0.950 | 0.875 | 0.935 | 0.895 |
| Accuracy | 0.948 | 0.875 | 0.934 | 0.895 |
Comparison between doctors and our CNN model in an 8-class task, the better outcomes of our CNN model are colored orange.
| Sensitivity | 79.70 | 62.04 | 74.96 | 46.53 | 59.43 | 77.23 | 67.66 | 80.54 |
| Specificity | 96.39 | 98.48 | 92.42 | 93.70 | 94.28 | 94.68 | 97.32 | 96.73 |
| Accuracy | 94.31 | 93.93 | 90.24 | 87.81 | 89.92 | 92.50 | 93.61 | 94.71 |
| Sensitivity | 92.00 | 92.00 | 84.00 | 48.00 | 76.00 | 96.00 | 92.00 | 88.00 |
| Specificity | 92.57 | 95.43 | 88.57 | 91.43 | 96.00 | 94.86 | 94.29 | 99.43 |
| Accuracy | 92.50 | 95.00 | 88.00 | 86.00 | 93.50 | 95.00 | 94.00 | 98.00 |
Kappa coefficients (95% confidence interval) of the included four CNN models and dermatologists (as standard) on the eight-class task.
| Rosacea | 0.683 (0.609~0.757) | 0.712 (0.640~0.783) | 0.304 (0.194~0.414) | 0.646 (0.567~0.726) | 0.691 (0.616~0.765) |
| Viral warts | 0.533 (0.445~0.621) | 0.793 (0.731~0.854) | 0.759 (0.693~0.825) | 0.684 (0.609~0.759) | 0.761 (0.695~0.827) |
| Eczema/dermatitis | 0.757 (0.694~0.820) | 0.570 (0.484~0.655) | 0.507 (0.416~0.598) | 0.558 (0.471~0.645) | 0.585 (0.500~0.670) |
| Seborrheic dermatitis | 0.607 (0.526~0.689) | 0.381 (0.280~0.483) | 0.337 (0.233~0.440) | 0.352 (0.247~0.457) | 0.380 (0.277~0.484) |
| Seborrheic keratosis | 0.410 (0.311~0.509) | 0.708 (0.635~0.780) | 0.576 (0.489~0.663) | 0.576 (0.489~0.663) | 0.682 (0.606~0.758) |
| Melanocytic nevus | 0.683 (0.609~0.758) | 0.799 (0.738~0.860) | 0.786 (0.723~0.849) | 0.759 (0.693~0.825) | 0.719 (0.648~0.790) |
| Psoriasis | 0.675 (0.600~0.749) | 0.759 (0.693~0.825) | 0.744 (0.676~0.813) | 0.621 (0.539~0.703) | 0.701 (0.627~0.774) |
| Basal cell carcinoma | 0.738 (0.671~0.805) | 0.905 (0.863~0.948) | 0.875 (0.826~0.924) | 0.725 (0.654~0.796) | 0.853 (0.800~0.905) |
Figure 8t-SNE visualization of the last hidden layer representations in the CNN for four disease classes. Here we show the CNN's internal representation of eight disease classes by applying t-SNE, a method for visualizing high-dimensional data, to the last hidden layer representation (1792-D vector) in the CNN. Colored point clouds represent the different disease categories, showing how the algorithm clusters the diseases.