| Literature DB >> 35061692 |
Young Jae Kim1, Jung-Im Na2, Seung Seog Han3,4, Chong Hyun Won1, Mi Woo Lee1, Jung-Won Shin2, Chang-Hun Huh2, Sung Eun Chang1.
Abstract
BACKGROUND: Although deep neural networks have shown promising results in the diagnosis of skin cancer, a prospective evaluation in a real-world setting could confirm these results. This study aimed to evaluate whether an algorithm (http://b2019.modelderm.com) improves the accuracy of nondermatologists in diagnosing skin neoplasms.Entities:
Mesh:
Year: 2022 PMID: 35061692 PMCID: PMC8782525 DOI: 10.1371/journal.pone.0260895
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Study flowchart.
Dataset and demographic information.
| AI Group | Control Group | |
|---|---|---|
| No. of Cases | 144 | 141 |
| Age (mean ± SD) | 57.0 ± 17.7 | 61.0 ± 15.3 |
| Males (%) | 62 (43.1%) | 52 (36.9%) |
| Onset | 6.9 ± 11.6 | 5.8 ± 9.3 |
| Family history of skin cancer (+) | 4 (2.8%) | 5 (3.5%) |
| Tenderness (+) | 16 (11.1%) | 13 (9.2%) |
| Consistency (range 1–4) | 2.5 ± 0.9 | 2.6 ± 1.0 |
| Suspicion | ||
| by Patients (%) | 79 (57.2%) | 74 (54.0%) |
| by Physicians (%) | 47 (32.6%) | 48 (34.0%) |
| Location | ||
| Head and neck | 56 (38.9%) | 65 (46.1%) |
| Trunk | 42 (29.2%) | 32 (22.7%) |
| Arm | 15 (10.4%) | 17 (12.1%) |
| Leg | 30 (20.8%) | 27 (19.1%) |
| Method of the diagnosis | ||
| Pathologic diagnosis | 139 (96.5%) | 131 (92.9%) |
| Clinical diagnosis | 5 (3.5%) | 10 (7.1%) |
| Malignancy | 23 (16.0%) | 29 (20.6%) |
| Angiosarcoma | 1 | 1 |
| Basal cell carcinoma | 7 | 18 |
| Squamous cell carcinoma | 6 | 5 |
| Squamous cell carcinoma in situ | 7 | 2 |
| Keratoacanthoma | 1 | 0 |
| Melanoma | 0 | 1 |
| Metastasis | 1 | 1 |
| Mycosis fungoides | 0 | 1 |
| Benign (%) | 121 (84.0%) | 112 (79.4%) |
* Onset were available in 93.3% of cases (266 cases).
** The consistency was annotated as follows: 1 = hard, 2 = renitent, 3 = normal, and 4 = soft.
*** The details of the benign conditions are listed in the S1 Table.
Fig 2Top accuracies for diagnosing exact diseases.
The physicians of the AI group (n = 144) referred to the three predictions of the algorithm’s diagnoses and the malignancy score before modifying their first impressions. The physicians of the Control group (n = 141) just reviewed the photographs once again. The P-values of top accuracies between before and after assistance of the trainees are annotated.
Summaries of the sensitivity and specificity.
| Sensitivity | Specificity | ||||||
|---|---|---|---|---|---|---|---|
| Before | after | P value | before | after | P value | ||
| AI Group | Top-1 of Trainees | 78.3% (18/23) | 73.9% (17/23) | 0.7656 | 88.4% (107/121) | 94.2% (114/121) | 0.0572 |
| Top-2 of Trainees | 87.0% (20/23) | 91.3% (21/23) | 0.7728 | 66.9% (81/121) | 76.0% (92/121) | 0.0289 | |
| Top-3 of Trainees | 95.7% (22/23) | 91.3% (21/23) | 0.7728 | 62.0% (75/121) | 73.6% (89/121) | 0.0085 | |
| Top-1 of Attending Dermatologists | 82.6% (19/23) | - | 91.7% (111/121) | - | |||
| Top-2 of Attending Dermatologists | 95.7% (22/23) | - | 82.6% (100/121) | - | |||
| Top-3 of Attending Dermatologists | 95.7% (22/23) | - | 79.3% (96/121) | - | |||
| Patients | 56.5% (13/23) | - | 42.6% (49/115) | - | |||
| Top-1 of the algorithm | 52.2% (12/23) | - | 93.4% (113/121) | - | |||
| Top-2 of the algorithm | 69.6% (16/23) | - | 78.5% (95/121) | - | |||
| Top-3 of the algorithm | 78.3% (18/23) | - | 66.1% (80/121) | - | |||
| Risk “High” of the algorithm | 82.6% (19/23) | - | 70.2% (85/121) | - | |||
| Risk “Medium” of the algorithm | 95.7% (22/23) | - | 60.3% (73/121) | - | |||
| Control | Top-1 of Trainees | 65.5% (19/29) | 65.5% (19/29) | 1.0000 | 81.3% (91/112) | 86.6% (97/112) | 0.0915 |
| Top-2 of Trainees | 93.1% (27/29) | 93.1% (27/29) | N/A | 51.8% (58/112) | 57.1% (64/112) | 0.0411 | |
| Top-3 of Trainees | 93.1% (27/29) | 93.1% (27/29) | N/A | 49.1% (55/112) | 53.6% (60/112) | 0.1096 | |
| Top-1 of Attending Dermatologists | 79.3% (23/29) | - | 90.2% (101/112) | - | |||
| Top-2 of Attending Dermatologists | 86.2% (25/29) | - | 82.1% (92/112) | - | |||
| Top-3 of Attending Dermatologists | 86.2% (25/29) | - | 79.5% (89/112) | - | |||
| Patients | 48.1% (13/27) | - | 44.5% (49/110) | - | |||
N/A: exact p-values with zeros could be computed.
The number of differential diagnoses by the trainees increased from 1.9 ± 0.5 to 2.2 ± 0.6 (P < .001).