| Literature DB >> 36100851 |
Salman Seyedi1, Zifan Jiang2,3, Allan Levey4, Gari D Clifford2,3.
Abstract
BACKGROUND: The expanding usage of complex machine learning methods such as deep learning has led to an explosion in human activity recognition, particularly applied to health. However, complex models which handle private and sometimes protected data, raise concerns about the potential leak of identifiable data. In this work, we focus on the case of a deep network model trained on images of individual faces.Entities:
Keywords: Data leakage; Deep neural networks; Eye-tracking; Facial features
Mesh:
Year: 2022 PMID: 36100851 PMCID: PMC9469631 DOI: 10.1186/s12938-022-01035-1
Source DB: PubMed Journal: Biomed Eng Online ISSN: 1475-925X Impact factor: 3.903
Fig. 1Eye gaze: illustration of the main part of the target model, which is the focus of the attack: FCs refer to different fully connected layers, while CNNs are convolutional neural network parts. After face and eye detection with regression tree, the left-eye and the right-eye are fed into CNN-E, which is CNN for eyes (shared weights) and a separate CNN, where face crop is the input (CNN-F). The photograph of the face is a modified from a publicly available image [28] under the Unsplash License [29]
Fig. 2ROC curve (SVM on labeling video recordings): the dash-lines correspond to the validation set, while the solid lines are for the test set. The area under the curve for all sets and labels has been shown in the legend. While the blue and green are for the data set with instance labeling, the orange and red indicate values for the data set with person labeling
Fig. 3PR curve (SVM on labeling video recordings): the dash lines correspond to the validation set, while the solid lines are for the test set. Average precision scores are also provided in the legend as AP. While the blue and green are for the data set with instance labeling, the orange and red indicate values for the data set with person labeling
Performance metrics for SVM in validation and test data with both instance and person label
| Validinstance | Validperson | Testinstance | Testperson | |
|---|---|---|---|---|
| Accuracy | 0.85 | 0.81 | 0.79 | 0.77 |
| F1-score | 0.82 | 0.80 | 0.80 | 0.79 |
| AUROC | 0.92 | 0.90 | 0.88 | 0.86 |
| AP (average precision) | 0.92 | 0.91 | 0.87 | 0.91 |
Threshold have been adjusted to achieve the best performance on the validation set in each model (0.68 for instance and 0.8 for person model).
Performance for people with multiple recordings
| Train | Valid | Test | |
|---|---|---|---|
| Totala | 19 | 14 | 21 |
| 11 | 4 | 11 | |
| 16 | 8 | 12 | |
| 1 − | 0.996 | 0.21 | 0.44 |
aThe total number of recordings in each set that belongs to a person who has another recording present in the training set of the target model (eye-tracking model)
bOnly provided for the sake of completeness and is the number of picked recordings as inside (predicted label 1), despite being trained on them with labeling as outside (label 0)
cThe number of these recordings that had been picked as inside (predicted label 1) in model trained on person labels
Validation loss (binary cross-entropy) scores for different inputs for the frame labeling network (instance, person)
| 2 outputa | +2 gradb | +5 gradc | +2 grad + loss + labeld | +3 grad + loss + labele | |
|---|---|---|---|---|---|
| lossInstance | 0.7 | 0.61 | 0.61 | 0.57 | 0.57 |
| lossPerson | 0.7 | 0.61 | 0.62 | 0.59 | 0.58 |
For reference, the baseline (binary cross-entropy of random guesses for a balanced data) would be .
aTakes the two last outputs (the target model output and the layer just before it) of the target model as the input
bTakes the two last outputs and also the two gradients before the last gradient of the target model
cTakes all the “+2 grad” and also the last gradient of three different sections of the target model (boundary, face, eyes)
dTakes the two last layers outputs and also the two before the last gradient and label and loss of target model
eTakes the two last layers outputs and also the three last gradients and label and loss of target model
Data distribution
| Train | Valid | Test | |
|---|---|---|---|
| Number of records | |||
| Total | 242 | 170 | 198 |
| | 140 | 73 | 93 |
| | 159 | 87 | 114 |
| Number of frames | |||
| Total | 83477 | 66755 | 73857 |
| | 46515 | 30072 | 35999 |
| | 50654 | 32950 | 41456 |
aThe labels are set to 1 if the recording was used in the target network’s training.
bThe labels are set to 1 if at least one recording of the person was used in the target network’s training