| Literature DB >> 35664971 |
T C Petrie1, C Larson1, M Heath1, R Samatham1, A Davis1, E G Berry1, S A Leachman1.
Abstract
Background: Many classifiers have been developed that can distinguish different types of skin lesions (e.g., benign nevi, melanoma) with varying degrees of success.1-5 However, even successfully trained classifiers may perform poorly on images that include artefacts. While problems created by hair and ink markings have been published, quantitative measurements of blur, colour and lighting variations on classification accuracy has not yet been reported to our knowledge.Entities:
Year: 2021 PMID: 35664971 PMCID: PMC9060017 DOI: 10.1002/ski2.19
Source DB: PubMed Journal: Skin Health Dis ISSN: 2690-442X
FIGURE 1Each row represents the effect of injecting a particular artefact. The composite image at the left shows the effect of adding the artefact at test levels to an example image. ROC curves show sensitivity (y axis) against the false positive rate (x axis). Legends show the parameter values under test. ROC columns compare melanoma versus other and biopsy versus follow tasks for the Wide ResNet (W) and Inception (I) CNNs. CNN, convolutional neural network; ROC, receiver operating characteristic
Results of artefacts on AUC and specificity
| Diagnostic task | Management task | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Wide ResNet | Inception v3 | Wide ResNet | Inception v3 | ||||||
| Parameter | AUC | Spec. | AUC | Spec. | AUC | Spec. | AUC | Spec. | |
| Blur | 1.00 | 0.91 | 0.83 | 0.90 | 0.81 | 0.98 | 0.95 | 0.98 | 0.95 |
| 2.17 |
|
|
|
|
|
|
|
| |
| 3.33 |
|
|
|
|
|
|
|
| |
| 4.50 |
|
|
|
|
|
|
|
| |
| Motion | 1.00 | 0.92 | 0.84 | 0.91 | 0.82 | 0.99 | 0.96 | 0.99 | 0.96 |
| 2.50 | 0.89 |
| 0.88 |
| 0.98 | 0.95 | 0.98 | 0.94 | |
| 4.00 |
|
|
|
| 0.97 | 0.93 |
|
| |
| 5.50 |
|
|
|
|
|
|
|
| |
| More red | 18.5 | 0.91 | 0.84 | 0.91 | 0.82 | 0.98 | 0.96 | 0.98 | 0.95 |
| 37.0 | 0.90 | 0.81 | 0.89 | 0.79 |
| 0.93 |
| 0.92 | |
| 55.5 | 0.88 |
|
|
|
|
|
|
| |
| 74.0 |
|
|
|
|
|
|
|
| |
| More blue | −18.5 | 0.90 | 0.82 | 0.90 | 0.81 | 0.98 | 0.94 | 0.98 | 0.94 |
| −37.0 |
|
|
|
|
|
|
|
| |
| −55.5 |
|
|
|
|
|
|
|
| |
| −74.0 |
|
|
|
|
|
|
|
| |
| Brightness | 0.60 |
|
|
|
|
|
|
|
|
| 0.80 | 0.91 | 0.82 | 0.90 | 0.80 | 0.98 | 0.95 | 0.98 | 0.95 | |
| 1.20 | 0.90 | 0.82 | 0.90 | 0.82 | 0.98 | 0.94 | 0.98 | 0.95 | |
| 1.40 |
|
|
|
|
|
|
|
| |
| Saturation | 0.75 | 0.91 | 0.84 | 0.90 | 0.82 | 0.99 | 0.96 | 0.98 | 0.96 |
| 1.25 | 0.92 | 0.85 | 0.91 | 0.83 | 0.99 | 0.97 | 0.99 | 0.96 | |
| 1.50 | 0.92 | 0.84 | 0.91 | 0.82 | 0.99 | 0.96 | 0.98 | 0.96 | |
| 1.75 | 0.91 | 0.83 | 0.90 | 0.81 | 0.98 | 0.96 | 0.98 | 0.95 | |
| Control | 0.0 | 0.92 | 0.85 | 0.91 | 0.83 | 0.99 | 0.97 | 0.99 | 0.96 |
Note: Columns contain the results from two CNNs for each task. Specificity is calculated at sensitivity = 0.84 for the diagnostic task and 0.89 for the management task. Values in bold represent a difference greater than between an Expert and Average dermatologist.
Abbreviations: CNN, convolutional neural network; AUC, area under the curve.