| Literature DB >> 33210257 |
Daniel J Carragher1, Peter J B Hancock2.
Abstract
In response to the COVID-19 pandemic, many governments around the world now recommend, or require, that their citizens cover the lower half of their face in public. Consequently, many people now wear surgical face masks in public. We investigated whether surgical face masks affected the performance of human observers, and a state-of-the-art face recognition system, on tasks of perceptual face matching. Participants judged whether two simultaneously presented face photographs showed the same person or two different people. We superimposed images of surgical masks over the faces, creating three different mask conditions: control (no masks), mixed (one face wearing a mask), and masked (both faces wearing masks). We found that surgical face masks have a large detrimental effect on human face matching performance, and that the degree of impairment is the same regardless of whether one or both faces in each pair are masked. Surprisingly, this impairment is similar in size for both familiar and unfamiliar faces. When matching masked faces, human observers are biased to reject unfamiliar faces as "mismatches" and to accept familiar faces as "matches". Finally, the face recognition system showed very high classification accuracy for control and masked stimuli, even though it had not been trained to recognise masked faces. However, accuracy fell markedly when one face was masked and the other was not. Our findings demonstrate that surgical face masks impair the ability of humans, and naïve face recognition systems, to perform perceptual face matching tasks. Identification decisions for masked faces should be treated with caution.Entities:
Keywords: Deep neural network; Face recognition; Familiarity; Identity verification; Signal detection theory
Mesh:
Year: 2020 PMID: 33210257 PMCID: PMC7673975 DOI: 10.1186/s41235-020-00258-x
Source DB: PubMed Journal: Cogn Res Princ Implic ISSN: 2365-7464
Fig. 1Examples of the control, mixed and masked conditions for match and mismatch trials of the GFMT [
reproduced and adapted with permission from the copyright holder]
Descriptive statistics [mean(SD)] for measures of human performance (d′, c) on the GFMT and the SFFMT
| Control | Mixed | Masked | Control | Mixed | Masked | |
|---|---|---|---|---|---|---|
| GFMT | 2.16 (0.94) | 1.15 (0.70) | 1.31 (0.54) | − 0.05 (0.41) | 0.16 (0.39) | 0.23 (0.41) |
| SFFMT | ||||||
| Unfamiliar | 1.16 (0.58) | 0.60 (0.52) | 0.56 (0.47) | 0.01 (0.36) | 0.04 (0.53) | 0.13 (0.57) |
| Familiar | 2.74 (0.87) | 1.80 (0.77) | 1.75 (0.89) | − 0.18 (0.35) | − 0.57 (0.36) | − 0.34 (0.42) |
Separate ANOVA and post hoc analyses for measures of human performance (d′, c) on the GFMT
| ANOVA | ||||||||
|---|---|---|---|---|---|---|---|---|
| 95% CI | 95% CI | |||||||
| Control-Mixed | 6.51 | 0.65, 1.39 | < .001* | 1.21 | − 2.58 | − 0.41, − 0.02 | .033* | − 0.53 |
| Control-Masked | 5.43 | 0.48, 1.23 | < .001* | 1.08 | − 3.36 | − 0.48, − 0.08 | .003* | − 0.68 |
| Mixed-Masked | − 0.99 | − 0.55, 0.23 | .972 | − 0.26 | − 0.76 | − 0.28, 0.14 | .999 | − 0.17 |
*Identifies statistically significant t tests
Fig. 2a Sensitivity (d′) and b response bias (criterion c) on the GFMT and the SFFMT (plotted separately for familiar and unfamiliar faces). Positive criterion c values indicate a conservative response bias (inclined to say ‘mismatch’), while negative values indicate a liberal bias. All error bars show the standard error of the mean (SEM)
One sample t tests comparing the response bias in each mask condition with 0, reported separately for the GFMT and the SFMT
| Familiar | Unfamiliar | |||||||
|---|---|---|---|---|---|---|---|---|
| 95% CI | 95% CI | |||||||
| GFMT | ||||||||
| Control | − 0.90 | − 0.17, 0.06 | .375 | − 0.12 | ||||
| Mixed | 2.76 | 0.04, 0.28 | .009* | 0.42 | ||||
| Masked | 3.63 | 0.10, 0.36 | < .001* | 0.56 | ||||
| SFFMT | ||||||||
| Control | − 3.85 | − 0.28, − 0.09 | < .001* | − 0.53 | 0.11 | − 0.09, 0.11 | .916 | 0.02 |
| Mixed | − 10.58 | − 0.68, − 0.46 | < .001* | − 1.61 | 0.48 | − 0.13, 0.20 | .637 | 0.07 |
| Masked | − 5.22 | − 0.47, − 0.21 | < .001* | − 0.81 | 1.45 | − 0.05, 0.31 | .154 | 0.22 |
*Identifies statistically significant t tests
Separate repeated-measures ANOVAs for measures of human performance (d′, c) on the SFMT
| Familiarity | ||
| Mask condition | ||
| Interaction |
Simple main effects (SME) analyses for the effect of mask condition on face familiarity, for measures of human performance (d′, c) on the SFFMT
| Familiar faces | Unfamiliar faces | |||||||
|---|---|---|---|---|---|---|---|---|
| SME | ||||||||
| 95% CI | 95% CI | |||||||
| Control-Mixed | 5.39 | 0.53, 1.35 | < .001* | 1.13 | 5.19 | 0.31, 0.82 | < .001* | 1.02 |
| Control-Masked | 5.65 | 0.57, 1.40 | < .001* | 1.13 | 5.50 | 0.34, 0.86 | < .001* | 1.12 |
| Mixed-Masked | 0.28 | − 0.39, 0.49 | .999 | 0.06 | 0.32 | − 0.23, 0.31 | .999 | 0.08 |
*Identifies statistically significant t tests
Fig. 3Classification accuracy of the DNN, shown as the percentage correct of the 20 trials in each condition, plotted separately for the match and mismatch trials of the GFMT and the SFFMT
Descriptive statistics [means(SD)] for the similarity ratings given by the DNN for the match and mismatch trials of the GFMT and the SFFMT
| Match | Mismatch | |||||
|---|---|---|---|---|---|---|
| Control | Mixed | Masked | Control | Mixed | Masked | |
| GFMT | 83.35 (5.85) | 52.95 (9.11) | 76.75 (7.03) | 9.85 (8.25) | 9.30 (9.21) | 15.20 (12.25) |
| SFFMT | ||||||
| Unfamiliar | 73.75 (6.90) | 41.35 (8.11) | 59.80 (10.68) | 21.20 (8.72) | 12.65 (8.46) | 19.80 (13.70) |
| Familiar | 75.45 (9.06) | 44.85 (11.98) | 58.60 (11.58) | 17.80 (9.31) | 8.65 (8.62) | 17.95 (9.98) |
Separate item-analysis ANOVAs and post hoc analyses for the similarity ratings given by the DNN for match and mismatch trials of the GFMT and the SFFMT
| Match pairs | Mismatch pairs | |||||||
|---|---|---|---|---|---|---|---|---|
| GFMT | ||||||||
| Mask condition | ||||||||
| 95% CI | 95% CI | |||||||
| Control-Mixed | 12.90 | 24.73, 36.07 | < .001* | 3.97 | ||||
| Control-Masked | 2.80 | 0.93, 12.27 | .021* | 1.02 | ||||
| Mixed-Masked | − 10.10 | − 29.47, − 18.13 | < .001* | − 2.93 | ||||
*Identifies statistically significant t tests
Fig. 4Similarity ratings from the DNN, plotted separately for the match and mismatch trials of the GFMT and the SFFMT. Error bars show SEM