| Literature DB >> 33327315 |
Jérôme Allyn1,2, Nicolas Allou1,2, Charles Vidal1, Amélie Renou1, Cyril Ferdynus2,3,4.
Abstract
Deep learning algorithms have shown excellent performances in the field of medical image recognition, and practical applications have been made in several medical domains. Little is known about the feasibility and impact of an undetectable adversarial attacks, which can disrupt an algorithm by modifying a single pixel of the image to be interpreted. The aim of the study was to test the feasibility and impact of an adversarial attack on the accuracy of a deep learning-based dermatoscopic image recognition system.First, the pre-trained convolutional neural network DenseNet-201 was trained to classify images from the training set into 7 categories. Second, an adversarial neural network was trained to generate undetectable perturbations on images from the test set, to classifying all perturbed images as melanocytic nevi. The perturbed images were classified using the model generated in the first step. This study used the HAM-10000 dataset, an open source image database containing 10,015 dermatoscopic images, which was split into a training set and a test set. The accuracy of the generated classification model was evaluated using images from the test set. The accuracy of the model with and without perturbed images was compared. The ability of 2 observers to detect image perturbations was evaluated, and the inter observer agreement was calculated.The overall accuracy of the classification model dropped from 84% (confidence interval (CI) 95%: 82-86) for unperturbed images to 67% (CI 95%: 65-69) for perturbed images (Mc Nemar test, P < .0001). The fooling ratio reached 100% for all categories of skin lesions. Sensitivity and specificity of the combined observers calculated on a random sample of 50 images were 58.3% (CI 95%: 45.9-70.8) and 42.5% (CI 95%: 27.2-57.8), respectively. The kappa agreement coefficient between the 2 observers was negative at -0.22 (CI 95%: -0.49--0.04).Adversarial attacks on medical image databases can distort interpretation by image recognition algorithms, are easy to make and undetectable by humans. It seems essential to improve our understanding of deep learning-based image recognition systems and to upgrade their security before putting them to practical and daily use.Entities:
Mesh:
Year: 2020 PMID: 33327315 PMCID: PMC7738012 DOI: 10.1097/MD.0000000000023568
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1Adversarial neural network architecture (following Poursaeed et al.).[
Figure 2Distribution of the 7 types of skin lesion in the training and test sets.
Precision, recall, and fooling ratios of the model for images from the test set, overall and for each category of skin lesion.
| Unperturbed images (n = 2003) | Perturbed images (n = 2003) | ||||
| Precision | Recall | Precision | Recall | Fooling Ratio | |
| Actinic keratoses | 70% | 65% | 0% | 0% | 100% |
| Basal cell carcinoma | 72% | 73% | 0% | 0% | 100% |
| Benign keratosis-like lesions | 67% | 72% | 0% | 0% | 100% |
| Dermatofibroma | 90% | 39% | 0% | 0% | 100% |
| Melanocytic nevi | 91% | 94% | 67% | 100% | NA |
| Melanoma | 66% | 58% | 0% | 0% | 100% |
| Vascular skin lesions | 91% | 75% | 0% | 0% | 100% |
| Overall | – | – | – | – | 100% |
NA = not applicable.
Figure 3Examples of skin lesion images from the HAM-10000 dataset before (upper line) and after (lower line) perturbation by the adversarial neural network (resolution of 256x256 pixels).