| Literature DB >> 34215767 |
Jin Bu1, Yu Lin2, Li-Qiong Qing3, Gang Hu4, Pei Jiang5, Hai-Feng Hu6, Er-Xia Shen7,8.
Abstract
With the development of artificial intelligence, technique improvement of the classification of skin disease is addressed. However, few study concerned on the current classification system of International Classification of Diseases, Tenth Revision (ICD)-10 on Diseases of the skin and subcutaneous tissue, which is now globally used for classification of skin disease. This study was aimed to develop a new taxonomy of skin disease based on cytology and pathology, and test its predictive effect on skin disease compared to ICD-10. A new taxonomy (Taxonomy 2) containing 6 levels (Project 2-4) was developed based on skin cytology and pathology, and represents individual diseases arranged in a tree structure with three root nodes representing: (1) Keratinogenic diseases, (2) Melanogenic diseases, and (3) Diseases related to non-keratinocytes and non-melanocytes. The predictive effects of the new taxonomy including accuracy, precision, recall, F1, and Kappa were compared with those of ICD-10 on Diseases of the skin and subcutaneous tissue (Taxonomy 1, Project 1) by Deep Residual Learning method. For each project, 2/3 of the images were included as training group, and the rest 1/3 of the images acted as test group according to the category (class) as the stratification variable. Both train and test groups in the Projects (2 and 3) from Taxonomy 2 had higher F1 and Kappa scores without statistical significance on the prediction of skin disease than the corresponding groups in the Project 1 from Taxonomy 1, however both train and test groups in Project 4 had a statistically significantly higher F1-score than the corresponding groups in Project 1 (P = 0.025 and 0.005, respectively). The results showed that the new taxonomy developed based on cytology and pathology has an overall better performance on predictive effect of skin disease than the ICD-10 on Diseases of the skin and subcutaneous tissue. The level 5 (Project 4) of Taxonomy 2 is better on extension to unknown data of diagnosis system assisted by AI compared to current used classification system from ICD-10, and may have the potential application value in clinic of dermatology.Entities:
Year: 2021 PMID: 34215767 PMCID: PMC8253798 DOI: 10.1038/s41598-021-92848-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Investigations focusing on skin disease classification using artificial intelligence techniques.
| Author | Year | Skin diseases | Imaging type | Method | Accuracy |
|---|---|---|---|---|---|
| Binder et al.[ | 1998 | Pigmented skin lesions | Microscopy images | Computerized image analysis and an artificial neural network | The sensitivity and specificity of the computerized system were 90% and 74%, respectively |
| Verma et al.[ | 2019 | Erythemato-squamous diseases | Dermatology database including macroscopic image; histopathological attribute; family history | An ensemble data mining based on 5 different data mining techniques, including Classification and Regression Trees, Support Vector Machines, Decision Tree, Random Forest and Gradient Boosting Decision Tree | The accurate rate was 98.64% |
| Sharma et al.[ | 2013 | Erythemato-squamous diseases | Dermatology data | An ensemble data mining based on 2 different data mining techniques including Support Vector Machine and Artificial Neural Network | 99.25% and 98.99% at training and testing stages respectively |
| Moradi and Mahdavi-Amiri[ | 2019 | Erythemato-squamous diseases | Two benchmark dermoscopic datasets and one digital image dataset | A kernel sparse representation based method | The method used by Moradi and Mahdavi-Amiri achieved the highest sensitivity as compared to the state-of-the-art methods on the PH2 dataset |
| Yap et al.[ | 2018 | Melanoma | Dermatoscopic image, macroscopic image and patient metadata | Convolutional neural networks | The multimodal classifier outperforms a baseline classifier that only uses a single macroscopic image in both binary melanoma detection (AUC 0.866 vs 0.784) and in multiclass classification (mAP 0.729 vs 0.598) |
| Chang and Chen[ | 2009 | Six major skin diseases | Skin disorder database including clinical and histopathological attributes | Decision tree of data mining combining with neural network classification methods | The neural network model and the sensitivity analysis combining with decision tree model have the highest accuracy (92.62%) and the least accuracy (80.33%) in prediction |
| Esteva et al.[ | 2017 | Melanoma and skin cancers | Macroscopic images and dermoscopy images | Convolutional neural networks | The convolutional neural networks achieve performance on par with all tested experts across both tasks, demonstrating an artificial intelligence capable of classifying skin cancer with a level of competence comparable to dermatologists |
| This study | 2020 | Dermatology and venereology | Macroscopic images | Recurrent neural network | The new taxonomy is overall better on prediction of skin disease than the ICD-10 on Diseases of the skin and subcutaneous tissue |
Figure 1Methodological approach for skin diseases.
Figure 2The first three levels contained by the taxonomy 2.
Comparison of the identified results of projects by different categories.
| Project | Groups | Accuracy (%) | ave_PPV_ | ave_TPR_ | ave_F1 | Kappa |
|---|---|---|---|---|---|---|
| Project_1 | Train | 99.52 | 98.01 ± 2.97 | 90.77 ± 13.40 | 93.63 ± 7.99 | 95.62 |
| Test | 97.73 | 88.51 ± 10.74 | 70.99 ± 26.44 | 74.43 ± 21.89 | 79.08 | |
| Project_2 | Train | 98.56 | 98.12 ± 1.05 | 97.41 ± 2.73 | 97.76 ± 1.90 | 95.52 |
| Test | 95.69 | 94.28 ± 3.22 | 92.17 ± 8.37 | 93.17 ± 5.85 | 86.35 | |
| Project_3 | Train | 99.15 | 98.09 ± 2.91 | 96.22 ± 4.01 | 97.12 ± 3.06 | 95.04 |
| Test | 96.59 | 86.42 ± 7.20 | 83.56 ± 9.69 | 84.81 ± 7.63 | 80.42 | |
| Project_4 | Train | 99.90 | 99.46 ± 1.58 | 97.97 ± 4.39 | 98.67 ± 2.63 | 99.13 |
| Test | 99.45 | 96.22 ± 5.27 | 89.81 ± 15.95 | 91.88 ± 9.59 | 95.10 | |
| Project_1 VS Project_2 P value | Train | 0.031 | 0.755 | 0.921 | 0.374 | |
| Test | 0.008 | 0.481 | 0.093 | 0.138 | ||
| Project_1 VS Project_3 P value | Train | 0.400 | 1.000 | 0.792 | 0.471 | |
| Test | 0.127 | 0.727 | 0.647 | 0.471 | ||
| Project_1 VS Project_4 P value | Train | 0.188 | 0.177 | 0.058 | 0.025* | |
| Test | 0.008 | 0.089 | 0.016* | 0.005** |
PPV positive predictive value, TPR true positive rate.
Effects of AI prediction between train and test groups among various diseases in Project 1 (%).
| Classes | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Train group | Sensitivity and recall | 96.00 | 57.14 | 89.47 | 96.97 | 89.74 | 94.59 | 75.00 | 100.00 | 100.00 | 99.57 | 100.00 |
| Specificity | 100.00 | 100.00 | 100.00 | 99.57 | 99.86 | 99.86 | 100.00 | 100.00 | 99.44 | 96.30 | 100.00 | |
| PPV and precision | 100.00 | 100.00 | 100.00 | 95.52 | 97.22 | 97.22 | 100.00 | 100.00 | 90.48 | 97.66 | 100.00 | |
| NPV | 99.72 | 99.60 | 99.73 | 99.71 | 99.45 | 99.72 | 99.60 | 100.00 | 100.00 | 99.31 | 100.00 | |
| F1 | 97.96 | 72.73 | 94.44 | 96.24 | 93.33 | 95.89 | 85.71 | 100.00 | 95.00 | 98.60 | 100.00 | |
| Test group | Sensitivity and recall | 76.92 | 25.00 | 90.00 | 76.47 | 80.95 | 70.00 | 14.29 | 85.71 | 85.71 | 95.83 | 80.00 |
| Specificity | 97.59 | 100.00 | 99.74 | 99.18 | 99.47 | 99.74 | 100.00 | 100.00 | 98.15 | 84.38 | 99.49 | |
| PPV and precision | 68.97 | 100.00 | 90.00 | 89.66 | 89.47 | 93.33 | 100.00 | 100.00 | 72.00 | 90.20 | 80.00 | |
| NPV | 98.38 | 99.25 | 99.74 | 97.84 | 98.95 | 98.44 | 98.50 | 99.75 | 99.20 | 93.10 | 99.49 | |
| F1 | 72.73 | 40.00 | 90.00 | 82.54 | 85.00 | 80.00 | 25.00 | 92.31 | 78.26 | 92.93 | 80.00 |
1: Other congenital malformations of skin; 2: Gonococcal infection; 3. Other dermatitis; 4. Unspecified viral infection characterized by skin and mucous membrane lesions; 5. Pemphigus; 6. Congenital ichthyosis; 7. Herpes viral [herpes simplex] infections; 8. Scabies; 9. Other predominantly sexually transmitted diseases, not elsewhere classified; 10. Lichen planus; 11. Pediculosis and phthiriasis; PPV positive predictive value, NPV negative predictive value.
Effects of AI prediction between train and test groups among various diseases in Project 2 (%).
| Classes | Infectious diseases | Inflammatory diseases | |
|---|---|---|---|
| Train group | Sensitivity and recall | 95.48 | 99.34 |
| Specificity | 99.34 | 95.48 | |
| PPV/precision | 97.37 | 98.86 | |
| NPV | 98.86 | 97.37 | |
| F1 | 96.42 | 99.10 | |
| Test group | Sensitivity/recall | 86.25 | 98.09 |
| Specificity | 98.09 | 86.25 | |
| PPV/precision | 92.00 | 96.55 | |
| NPV | 96.55 | 92.00 | |
| F1 | 89.03 | 97.31 |
PPV positive predictive value, NPV negative predictive value.
Effects of AI prediction between train and test groups among various diseases in Project 3(%).
| Group | Classes | Virus | Parasite | Bacteria | Dermatitis |
|---|---|---|---|---|---|
| Train group | Sensitivity/recall | 92.42 | 100.00 | 93.10 | 99.34 |
| Specificity | 99.43 | 100.00 | 100.00 | 94.16 | |
| PPV/precision | 93.85 | 100.00 | 100.00 | 98.53 | |
| NPV | 99.28 | 100.00 | 99.44 | 97.32 | |
| F1 | 93.13 | 100.00 | 96.43 | 98.94 | |
| Test group | Sensitivity/recall | 85.29 | 75.00 | 77.42 | 96.51 |
| Specificity | 97.79 | 99.47 | 98.90 | 83.95 | |
| PPV/precision | 78.38 | 85.71 | 85.71 | 95.90 | |
| NPV | 98.61 | 98.95 | 98.10 | 86.08 | |
| F1 | 81.69 | 80.00 | 81.36 | 96.20 |
PPV positive predictive value, NPV negative predictive value.
Effects of AI prediction between train and test groups among various diseases in Project 4 (%).
| Classes | 1 | 2 | 3 | 4 | 5 | 56 | 7 | 8 | 9 | 10 | 11 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Train group | Sensitivity/ recall | 100.00 | 100.00 | 99.78 | 100.00 | 100.00 | 100.00 | 100.00 | 97.44 | 85.71 | 94.74 | 100.00 |
| Specificity | 100.00 | 100.00 | 98.99 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 99.86 | 100.00 | |
| PPV/precision | 100.00 | 100.00 | 99.35 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 94.74 | 100.00 | |
| NPV | 100.00 | 100.00 | 99.66 | 100.00 | 100.00 | 100.00 | 100.00 | 99.86 | 99.87 | 99.86 | 100.00 | |
| F1 | 100.00 | 100.00 | 99.57 | 100.00 | 100.00 | 100.00 | 100.00 | 98.70 | 92.31 | 94.74 | 100.00 | |
| Test group | Sensitivity /recall | 84.62 | 100.00 | 99.58 | 100.00 | 100.00 | 97.06 | 90.00 | 95.24 | 50.00 | 100.00 | 71.43 |
| Specificity | 100.00 | 99.75 | 96.25 | 99.74 | 100.00 | 100.00 | 100.00 | 99.21 | 100.00 | 99.74 | 100.00 | |
| PPV/ precision | 100.00 | 87.50 | 97.55 | 95.45 | 100.00 | 100.00 | 100.00 | 86.96 | 100.00 | 90.91 | 100.00 | |
| NPV | 98.94 | 100.00 | 99.35 | 100.00 | 100.00 | 99.73 | 99.74 | 99.73 | 99.50 | 100.00 | 99.49 | |
| F1 | 91.67 | 93.33 | 98.56 | 97.67 | 100.00 | 98.51 | 94.74 | 90.91 | 66.67 | 95.24 | 83.33 |
1: Pediculosis and phthiriasis; 2: herpes simple genital; 3. lichen planus; 4: condilomas acuminados; 5. Ichthyosis; 6. viral exanthems; 7: pediculosis pubis; 8. pemphigus ; 9: Gonorrhea; 10: eczema; 11: sarna noruega; PPV positive predictive value, NPV negative predictive value.