| Literature DB >> 35161713 |
Alina Elena Baia1, Giulio Biondi2, Valentina Franzoni2,3, Alfredo Milani2, Valentina Poggioni2.
Abstract
Deep learning approaches for facial Emotion Recognition (ER) obtain high accuracy on basic models, e.g., Ekman's models, in the specific domain of facial emotional expressions. Thus, facial tracking of users' emotions could be easily used against the right to privacy or for manipulative purposes. As recent studies have shown that deep learning models are susceptible to adversarial examples (images intentionally modified to fool a machine learning classifier) we propose to use them to preserve users' privacy against ER. In this paper, we present a technique for generating Emotion Adversarial Attacks (EAAs). EAAs are performed applying well-known image filters inspired from Instagram, and a multi-objective evolutionary algorithm is used to determine the per-image best filters attacking combination. Experimental results on the well-known AffectNet dataset of facial expressions show that our approach successfully attacks emotion classifiers to protect user privacy. On the other hand, the quality of the images from the human perception point of view is maintained. Several experiments with different sequences of filters are run and show that the Attack Success Rate is very high, above 90% for every test.Entities:
Keywords: adversarial machine learning; emotion recognition; evolutionary algorithm; privacy protection
Mesh:
Year: 2022 PMID: 35161713 PMCID: PMC8840139 DOI: 10.3390/s22030967
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Confusion matrix from the results of the attack with three filters.
Figure 2Confusion matrix from the results of the attack with four filters.
Figure 3Confusion matrix from the results of the attack with five filters.
Examples of adversarial samples: the first column reports the original image and original classification. Columns 2–4 show the adversarial images with their classification. We can notice how the adversarial attack changes the automated emotion recognition without disrupting the image appearance.
| Original | 3 filters | 4 filters | 5 filters |
|---|---|---|---|
|
|
|
|
|
|
| fear | fear | fear |
|
|
|
|
|
|
| contempt | contempt | disgust |
|
|
|
|
|
|
| sadness | sadness | sadness |
|
|
|
|
|
|
| contempt | contempt | contempt |
Figure 4SSIM values distributions for the attacking images produced by sequences of 3, 4 and 5 filters.