| Literature DB >> 31617929 |
Michael Phillips1,2, Helen Marsden3, Wayne Jaffe4, Rubeta N Matin5, Gorav N Wali5, Jack Greenhalgh3, Emily McGrath6, Rob James6, Evmorfia Ladoyanni7, Anthony Bewley8,9, Giuseppe Argenziano10, Ioulios Palamaras11.
Abstract
Importance: A high proportion of suspicious pigmented skin lesions referred for investigation are benign. Techniques to improve the accuracy of melanoma diagnoses throughout the patient pathway are needed to reduce the pressure on secondary care and pathology services. Objective: To determine the accuracy of an artificial intelligence algorithm in identifying melanoma in dermoscopic images of lesions taken with smartphone and digital single-lens reflex (DSLR) cameras. Design, Setting, and Participants: This prospective, multicenter, single-arm, masked diagnostic trial took place in dermatology and plastic surgery clinics in 7 UK hospitals. Dermoscopic images of suspicious and control skin lesions from 514 patients with at least 1 suspicious pigmented skin lesion scheduled for biopsy were captured on 3 different cameras. Data were collected from January 2017 to July 2018. Clinicians and the Deep Ensemble for Recognition of Malignancy, a deterministic artificial intelligence algorithm trained to identify melanoma in dermoscopic images of pigmented skin lesions using deep learning techniques, assessed the likelihood of melanoma. Initial data analysis was conducted in September 2018; further analysis was conducted from February 2019 to August 2019. Interventions: Clinician and algorithmic assessment of melanoma. Main Outcomes and Measures: Area under the receiver operating characteristic curve (AUROC), sensitivity, and specificity of the algorithmic and specialist assessment, determined using histopathology diagnosis as the criterion standard.Entities:
Mesh:
Year: 2019 PMID: 31617929 PMCID: PMC6806667 DOI: 10.1001/jamanetworkopen.2019.13436
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Characteristics of 501 Patients
| Characteristic | No. (%) |
|---|---|
| Age, mean (SD), y | 52.1 (18.6) |
| Sex | |
| Male | 222 (44.3) |
| Female | 279 (55.7) |
| White race | |
| No | 16 (3.20) |
| Yes | 484 (96.8) |
| Missing | 1 (0.1) |
| No. of nevi | |
| ≤10 | 173 (35.4) |
| 11-50 | 232 (47.4) |
| >50 | 84 (17.2) |
| Missing | 8 (1.6) |
| Fitzpatrick skin type | |
| Type I, highly sensitive | 61 (12.4) |
| Type II, very sun sensitive | 172 (34.9) |
| Type III, sun-sensitive skin | 184 (37.3) |
| Type IV, minimally sun sensitive | 62 (12.6) |
| Type V, sun-insensitive skin | 10 (2.0) |
| Type VI, sun insensitive, never burns | 4 (0.8) |
| Missing | 8 (1.6) |
| Hair color | |
| Blond | 110 (22.8) |
| Red | 39 (8.1) |
| Brown | 298 (61.8) |
| Black | 35 (7.3) |
| Missing | 19 (3.8) |
| Freckles | |
| No | 214 (43.4) |
| Yes | 279 (56.6) |
| Missing | 8 (1.6) |
| History of skin cancer | |
| No history | 352 (71.3) |
| Family nonmelanoma | 30 (6.1) |
| Family melanoma | 38 (7.7) |
| Patient nonmelanoma | 34 (6.9) |
| Patient melanoma | 40 (8.1) |
| Missing | 7 (1.4) |
Assessment by Histopathology
| Assessment Method | Probability of Melanoma | Sensitivity, %/Specificity, % | No. (%) | |||
|---|---|---|---|---|---|---|
| Other Lesion Type | Dysplasia | Melanoma | Total | |||
| Clinician | Unlikely | 93/27 | 80 (64.5) | 35 (28.2) | 9 (7.3) | 124 (100) |
| Equivocal | 68/83 | 155 (57.6) | 82 (30.5) | 32 (11.9) | 269 (100) | |
| Likely | 29/97 | 35 (32.4) | 25 (23.2) | 48 (44.4) | 108 (100) | |
| Highly likely | 0/100 | 8 (16.0) | 6 (12.0) | 36 (72.0) | 50 (100) | |
| Total | NA | 278 (50.5) | 148 (26.8) | 125 (22.7) | 551 (100) | |
| Algorithm with iPhone 6s image | Unlikely | 100/0 | 19 (76.0) | 6 (24.0) | 0 | 25 (100) |
| Equivocal | 79/87 | 157 (60.2) | 87 (33.3) | 17 (6.5) | 261 (100) | |
| Likely | 37/98 | 24 (35.8) | 10 (14.9) | 33 (49.3) | 67 (100) | |
| Highly likely | 0/100 | 5 (13.9) | 2 (5.5) | 29 (80.6) | 36 (100) | |
| Total | NA | 205 (52.7) | 105 (30.0) | 79 (20.3) | 389 (100) | |
| Algorithm with Galaxy S6 image | Unlikely | 92/50 | 102 (65.4) | 48 (30.8) | 6 (3.8) | 156 (100) |
| Equivocal | 71/84 | 72 (60.5) | 25 (37.9) | 3 (8.6) | 202 (100) | |
| Likely | 40/98 | 25 (37.9) | 17 (25.7) | 24 (36.4) | 66 (100) | |
| Highly likely | 0/100 | 3 (8.6) | 2 (5.7) | 30 (85.7) | 35 (100) | |
| Total | NA | 202 (53.7) | 98 (26.1) | 76 (20.2) | 376 (100) | |
| Algorithm with DSLR image | Unlikely | 86/58 | 81 (60.5) | 46 (34.3) | 7 (5.2) | 134 (100) |
| Equivocal | 75/89 | 44 (59.5) | 24 (32.4) | 6 (8.1) | 74 (100) | |
| Likely | 41/99 | 12 (30.8) | 10 (25.6) | 17 (43.6) | 39 (100) | |
| Highly likely | 0/100 | 3 (12.5) | 0 | 21 (87.5) | 24 (100) | |
| Total | NA | 140 (51.7) | 80 (29.5) | 51 (18.8) | 271 (100) | |
Abbreviations: DSLR, digital single-lens reflex; NA, not applicable.
Sensitivity/specificity shows the accuracy of the level of likelihood for each threshold for detecting melanoma.
Figure 1. Receiver Operating Characteristic Curves for Clinical and Trained Algorithm Assessment of Biopsied Lesions
A, Area under receiver operator characteristic curve, 0.779. B, Area under receiver operator characteristic curve, 0.902. C, Area under receiver operator characteristic curve, 0.858. D, Area under receiver operator characteristic curve, 0.869. DSLR indicates digital single-lens reflex.
Figure 2. Receiver Operating Characteristic Curves for Clinical and Trained Algorithm Assessment of All Lesions
A, Area under receiver operator characteristic curve, 0.909. B, Area under receiver operator characteristic curve, 0.959. C, Area under receiver operator characteristic curve, 0.938. D, Area under receiver operator characteristic curve, 0.918. DSLR indicates digital single-lens reflex.
Comparison of Algorithm With Clinicians
| Method | No. of Images | % | NNB | |||||
|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | FPR | FNR | PPV | NPV | |||
|
| ||||||||
| Clinician | 1582 | 100 | 69.90 | 30.13 | 0.00 | 20.30 | 100.00 | 4.92 |
| Algorithm with iPhone 6s image | 1110 | 100 | 64.80 | 35.18 | 0.00 | 17.90 | 100.00 | 5.58 |
| 94.90 | 78.10 | 21.87 | 5.06 | 25.00 | 99.50 | 4.00 | ||
| Algorithm with Galaxy S6 image | 1078 | 100 | 51.20 | 48.83 | 0.00 | 13.40 | 100.00 | 7.44 |
| 94.90 | 75.60 | 24.42 | 5.13 | 22.80 | 99.50 | 4.39 | ||
| Algorithm with DSLR image | 809 | 100 | 30.00 | 70.03 | 0.00 | 9.48 | 100.00 | 10.55 |
| 92.80 | 45.50 | 54.52 | 7.25 | 11.10 | 98.80 | 9.02 | ||
|
| ||||||||
| Clinician | 553 | 100 | NA | NA | NA | NA | NA | NA |
| Algorithm with iPhone 6s image | 388 | 100 | 31.30 | 68.71 | 0.00 | 27.10 | 100 | NA |
| 95.00 | 50.60 | 49.35 | 5.06 | 32.90 | 98.00 | NA | ||
| Algorithm with Galaxy S6 image | 376 | 100 | 18.40 | 81.55 | 0.00 | 23.60 | 100 | NA |
| 95.00 | 46.90 | 53.07 | 5.13 | 31.10 | 97.00 | NA | ||
| Algorithm with DSLR image | 271 | 100 | 3.15 | 96.85 | 0.00 | 19.90 | 100 | NA |
| 93.00 | 27.60 | 72.38 | 7.25 | 23.60 | 94.00 | NA | ||
Abbreviations: DSLR, digital single-lens reflex; FNR, false-negative rate; FPR, false-positive rate; NA, not applicable; NNB, number needed to biopsy (to identify 1 case of melanoma); NPV, negative predictive value; PPV, positive predictive value.