| Literature DB >> 36161186 |
Zhongzheng Fu1, Boning Zhang1, Xinrun He1, Yixuan Li1, Haoyuan Wang1, Jian Huang1.
Abstract
In emotion recognition based on physiological signals, collecting enough labeled data of a single subject for training is time-consuming and expensive. The physiological signals' individual differences and the inherent noise will significantly affect emotion recognition accuracy. To overcome the difference in subject physiological signals, we propose a joint probability domain adaptation with the bi-projection matrix algorithm (JPDA-BPM). The bi-projection matrix method fully considers the source and target domain's different feature distributions. It can better project the source and target domains into the feature space, thereby increasing the algorithm's performance. We propose a substructure-based joint probability domain adaptation algorithm (SSJPDA) to overcome physiological signals' noise effect. This method can avoid the shortcomings that the domain level matching is too rough and the sample level matching is susceptible to noise. In order to verify the effectiveness of the proposed transfer learning algorithm in emotion recognition based on physiological signals, we verified it on the database for emotion analysis using physiological signals (DEAP dataset). The experimental results show that the average recognition accuracy of the proposed SSJPDA-BPM algorithm in the multimodal fusion physiological data from the DEAP dataset is 63.6 and 64.4% in valence and arousal, respectively. Compared with joint probability domain adaptation (JPDA), the performance of valence and arousal recognition accuracy increased by 17.6 and 13.4%, respectively.Entities:
Keywords: domain adaptation; emotion recognition; individual difference; multimodal fusion; physiological signal; transfer learning
Year: 2022 PMID: 36161186 PMCID: PMC9493208 DOI: 10.3389/fnins.2022.1000716
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
The features used in Experiment 1 and Experiment 2.
| Signal | Feature | Description | Dimension |
| EEG | Differential Entropy (DE) | DE in different bands: Delta (1–4 Hz), Theta (4–8 Hz), Alpha (8–13 Hz), Beta (13–30 Hz), and Gamma (30–48 Hz) | 32 channels × 5 features |
| PPG | Time Domain | Mean value, maximum value, minimum value, standard deviation and root mean square value of heart rate interval. | 1 channel × 8 features |
| Frequency domain | Power spectral density of bands 0.1–1.5 Hz and 1.5–3 Hz. | ||
| GSR | Time Domain | Mean, standard deviation | 1 channel × 7 features |
| Frequency domain | Power spectral density of bands 0.4–0.8 Hz, 0.8–1.2 Hz, 1.2–1.6 Hz, 1.6–2.0 Hz, and 2.0–2.4 Hz. | ||
| RES | Time Domain | Mean, maximum, minimum, standard deviation, and root mean square value of respiratory interval. Respiratory rate (times / second) | 1 channel × 8 features |
| Frequency domain | Power spectral density of bands 0.1–1.5 Hz and 1.5–3.0 Hz. |
Substructural Joint Probability Distribution Adaptation (SSJPDA)
Substructural Joint Probability Distribution Adaptation with Bi-Projection Metrix (SSJPDA-BPM)
The average accuracy (ACC_100%) and F1-measure in different algorithms with single-mode and multi-mode data in valence and arousal classification.
| Method | Modality | Valence | Arousal | ||
| ACC | F1-measure | ACC | F1-measure | ||
| JPDA | EEG | 0.529 | 0.563 | 0.549 | 0.615 |
| PPG | 0.561 | 0.603 | 0.551 | 0.589 | |
| GSR | 0.537 | 0.578 | 0.567 | 0.619 | |
| RES | 0.531 | 0.574 | 0.509 | 0.567 | |
| ALL | 0.541 | 0.576 | 0.568 | 0.626 | |
| JPDA-BPM | EEG | 0.536 | 0.605 | 0.525 | 0.624 |
| PPG | 0.536 | 0.63 | 0.551 | 0.582 | |
| GSR | 0.553 | 0.446 | 0.537 | 0.57 | |
| RES | 0.517 | 0.555 | 0.537 | 0.613 | |
| ALL | 0.533 | 0.615 | 0.573 | 0.613 | |
| SSJPDA | EEG | 0.604 | 0.617 | 0.614 | 0.645 |
| PPG | 0.588 | 0.537 | 0.633 | 0.634 | |
| GSR | 0.605 | 0.596 | 0.613 | 0.618 | |
| RES | 0.614 | 0.643 | 0.619 | 0.614 | |
| ALL | 0.617 | 0.627 | 0.635 | 0.643 | |
| SSJPDA-BPM | EEG | 0.621 | 0.645 | 0.629 | 0.655 |
| PPG | 0.62 | 0.619 |
| 0.652 | |
| GSR | 0.608 | 0.581 | 0.62 | 0.65 | |
| RES | 0.595 | 0.601 | 0.636 | 0.653 | |
| ALL |
|
| 0.644 |
| |
The numbers in bold indicate the highest value of the experimental results.
FIGURE 1The source and target domain’s prediction samples are projected to two-dimensional visualization through t-SNE in multimodal data experiments with different algorithms. (A) Shows valence classification representations, and (B) shows arousal classification representations, where (I) is JPDA algorithm, (II) is JPDA (BPM) algorithm, (III) is SSJPDA algorithm, (IV) is SSJPDA(BPM) algorithm.
The average accuracy and F1-measure of different algorithms in valence and arousal classification.
| Method | Valence | Arousal | ||
| ACC | F1 | ACC | F1 | |
| 1NN | 0.484 | 0.529 | 0.504 | 0.555 |
| TCA | 0.49 | 0.533 | 0.521 | 0.583 |
| JDA | 0.496 | 0.535 | 0.515 | 0.578 |
| BDA | 0.519 | 0.56 | 0.538 | 0.572 |
| JPDA | 0.541 | 0.576 | 0.568 | 0.626 |
| JPDA-BPM | 0.533 | 0.615 | 0.573 | 0.613 |
| SSJPDA | 0.617 | 0.627 | 0.635 | 0.643 |
| SSJPDA-BPM |
|
|
|
|
The numbers in bold indicate the highest value of the experimental results.
FIGURE 2The recognition accuracy of each algorithm in Experiment 2 in 32 subjects was ranked in descending order. (A) Shows the recognition accuracy of valence in different algorithms of 32 subjects, and (B) shows the recognition accuracy of arousal in different algorithms of 32 subjects.