| Literature DB >> 32915161 |
Titus J Brinker1, Roman C Maron1, Jochen S Utikal2,3, Achim Hekler1, Axel Hauschild4, Elke Sattler5, Wiebke Sondermann6, Sebastian Haferkamp7, Bastian Schilling8, Markus V Heppt9, Philipp Jansen6, Markus Reinholz5, Cindy Franklin10, Laurenz Schmitt11, Daniela Hartmann5, Eva Krieghoff-Henning1, Max Schmitt1, Michael Weichenthal4, Christof von Kalle12, Stefan Fröhling13.
Abstract
BACKGROUND: Early detection of melanoma can be lifesaving but this remains a challenge. Recent diagnostic studies have revealed the superiority of artificial intelligence (AI) in classifying dermoscopic images of melanoma and nevi, concluding that these algorithms should assist a dermatologist's diagnoses.Entities:
Keywords: artificial intelligence; deep learning; dermatology; diagnosis; machine learning; melanoma; neural network; nevi; skin neoplasm
Mesh:
Year: 2020 PMID: 32915161 PMCID: PMC7519424 DOI: 10.2196/18091
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Overall mean performance of dermatologists without artificial intelligence (AI) support (-AI) compared to that of dermatologists with AI support (+AI) and AI on its own. Performance is split into 3 categories, that is, sensitivity, specificity, and accuracy.
| Performance | AI | Dermatologist -AI | Dermatologist +AI |
| Sensitivity (95% CI) | 84.7% (81.9%-87.6%) | 59.4% (53.3%-65.5%) | 74.6% (69.9%-79.3%) |
| Specificity (95% CI) | 79.1% (74.8%-83.4%) | 70.6% (62.3%-78.9%) | 72.4% (66.2%-78.6%) |
| Accuracy (95% CI) | 81.9% (79.7%-84.2%) | 65.0% (62.3%-67.6%) | 73.6% (70.9%-76.3%) |
Figure 1Combined and individual dermatologists’ performance without and with artificial intelligence (AI) support. Every dot represents a single participant. A line between 2 dots connects the participants' metric without AI support to the corresponding metric with AI support. Highlighted dots represent the participants combined. Boxes indicate 25th and 75th percentile while the horizontal line within shows the median (50th percentile). Whiskers indicate the data range (1.5*IQR) where points beyond are considered as outliers.
Figure 2Combined and individual dermatologists’ diagnostic accuracy without and with artificial intelligence (AI) support. Diagnostic accuracy is measured using sensitivity and specificity. Arrows represent the change in the diagnostic accuracy from without AI support to with AI support. Highlighted arrows represent the participants combined. In addition, the black curve denotes the mean receiver operating characteristic curve of the classifier. ROC: receiver operating characteristic; CNN: convolutional neural network; AUC: area under the curve.
Distribution of the correct and incorrect predictions by classifier and dermatologists without artificial intelligence (AI) support and switching in response to AI support. Percentages displayed below show the amount of times a switch did or did not occur for dermatologists when answering part II of the survey.
| Groupings | Proportion, n (%)a | 95% CI | |
|
| 7 (8) | 5.5%-10.4% | |
|
| Dermatologist switched, n=7 | 0 (1) | 0%-2.2% |
|
| Dermatologist stayed, n=7 | 7 (99) | 97.8%-100% |
|
| 25 (27) | 24.0%-30.1% | |
|
| Dermatologist switched, n=25 | 11 (46) | 33.1%-58.4% |
|
| Dermatologist stayed, n=25 | 14 (54) | 41.6%-66.9% |
|
| 10 (10) | 8.4%-11.8% | |
|
| Dermatologist switched, n=10 | 4 (39) | 23.2%-55.6% |
|
| Dermatologist stayed, n=10 | 6 (61) | 44.4%-76.9% |
|
| 52 (55) | 52.4%-57.1% | |
|
| Dermatologist switched, n=52 | 0 (0) | 0%-0.5% |
|
| Dermatologist stayed, n=52 | 52 (100) | 99.5%-100% |
aAs the mean of all the participants was taken and every participant ended up rating a varying amount of images due to the quality control step, the reported absolute values are approximations.
Confidence distribution of the classifier and dermatologists for part I and part II.
| Groupings | Confidence (95% CI) | |
|
| ||
|
| AIa | 35.8% (27.6%-44.1%) |
|
| Dermatologist part I | 62.3% (54.6%-69.9%) |
|
| Dermatologist part II | 69.8% (62.4%-77.3%) |
|
| ||
|
| AI | 65.4% (60.7%-70.2%) |
|
| Dermatologist part I | 60.8% (53.3%-68.3%) |
|
| Dermatologist part II | 46.0% (33.6%-58.5%) |
|
| ||
|
| AI | 32.9% (28.1%-37.6%) |
|
| Dermatologist part I | 63.2% (56.3%-70.2%) |
|
| Dermatologist part II | 47.9% (36.4%-59.4%) |
|
| ||
|
| AI | 74.5% (72.0%-76.9%) |
|
| Dermatologist part I | 67.9% (61.5%-74.3%) |
|
| Dermatologist part II | 80.3% (76.2%-84.3%) |
aAI: artificial intelligence.