| Literature DB >> 35069270 |
Lei Jiang1, Panote Siriaraya1, Dongeun Choi2, Noriaki Kuwahara1.
Abstract
Objective: Numerous communication support systems based on reminiscence therapy have been developed. However, when using communication support systems, the emotional assessment of older people is generally conducted using verbal feedback or questionnaires. The purpose of this study is to investigate the feasibility of using Electroencephalography (EEG) signals for automatic emotion recognition during RT for older people. Participants: Eleven older people (mean 71.25, SD 4.66) and seven young people (mean 22.4, SD 1.51) participated in the experiment.Entities:
Keywords: Bi-LSTM; EEG signals; emotion recognition; older people; reminiscence therapy
Year: 2022 PMID: 35069270 PMCID: PMC8777059 DOI: 10.3389/fphys.2021.823013
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
Emotion recognition based on different stimulus materials summary.
| Signals | Subjects | Stimuli | Algorithm | Emotion model | Highest | References |
| EEG | 32 | DEAP: | Multimodal + Naive Bayes classifier | PAD | F1-score |
|
| EEG | 14 | Audio (6 s) | SVM | PAD | Accu. |
|
| EEG | 43 | Film clip | K-means + multimodal + | Valence | Accu. |
|
| EEG | 15 | SEED: Film clip (4 min) | MFBSE-EWT + ARF classifier | Negative, | 94.4% |
|
| EEG | 20 | Music | SVM | Favored/non | Accu. |
|
| ECG | 23 (26.6) | DREAMER | SM-SSA + RC (IP/CEC) + | PAD | Accu. |
|
PER, peripheral physiological signals; MCA, multimedia content analysis; PAD, valence + arousal + dominance dimension; MFBSE-EWT, MHMS (multivariate Hilbert marginal spectrum) + FBSE (Fourier-Bessel series expansion) + EWT (empirical wavelet transform); ARF, the sparse autoencoder based random forest classifier.
Emotion recognition systems applied to older people summary.
| Signals (source/stimuli) | Subjects | Algorithm | Emotion | Highest | References | |
| Contact | Non-contact | |||||
| Expression | 778 | Viola-Jones-Haar + Gabor + SVM | Neutral | 95.24% 90.32% |
| |
| Speech (USOMS-e database) | 87 | Pretrained CNN | Valence | 57.8% |
| |
| Speech, Expression | 5 | Bi-LSTM | Negative | Speech: 91% |
| |
| EEG | 8 | Trained neural network | Pleasant | 82.61% |
| |
| SpO2, PR | 31 | Statistical feature | Happy; Sad; Angry | SVM:72.86% |
| |
| EMG, ECG, BVP, GSR, RSP, SKT | 30 | LSTM | Amused | EMG: |
| |
SpO2, oxygen saturation; PR, pulse rate; HCI, Human-computer interaction; IoT, Internet of Things; about the highest accuracy in the table unless otherwise noted, all are Accuracy.
Experiment summary.
| Photo conversation stimuli | |
| Number of photos | 36 |
| Photo content | Showa era, landscape, food and festival |
| Photo Conversation | 1 min |
| Experiment information | |
| Number of participants | 11 (O) and 7 (Y) |
| Number of males | 6 (O) and 5 (Y) |
| Number of females | 5 (O) and 2 (Y) |
| Age of participants | O: 66–82 (M = 71.25, SD = 4.66); |
| Rating scales | Arousal, Valence, Stress |
| Rating values | −4–4, −4–4, 1–7 |
| Recorded signals | 8-channel 256 Hz EEG |
The old people (O) from Silver Human Resources Center, Kyoto, Japan; The young people (Y) from Kyoto Institute of Technology, Kyoto, Japan.
FIGURE 1Two participants during the experiment.
FIGURE 2Detail of the experiment procedure.
FIGURE 3Distribution of the conversation emotions on the VA model.
Distribution of the conversation emotions on PS values.
| Classify | Conditions | Total |
| Positive | Pleasure value > 0 and Stress value ≤ 1 | 126 |
| Negative | Pleasure value < 0 or Stress value > 2 | 69 |
FIGURE 4The Emotion classification of EEG raw data based on Bi-LSTM.
Data arrays for subjects.
| Array name | Array shape | Array contents |
| Dataset 1 | 195 × 8 × 15,000 | Trials × Channels × Data |
| Labels | 195 × 2 | Trials × Label (positive and negative) |
FIGURE 5Details of the LSTM and Bi-LSTM models.
Comparison of accuracy results between datasets and parameters of models.
| Dataset | Channels | Models | Optimizer | Accuracy (%) (Epoch) | ||
| Ir = 0.01 | Ir = 0.001 | Ir = 0.0001 | ||||
| 1 | F3, F4, F7, F8, T7, T8, P3, P4 | LSTM | SGD | 98.3%(10) |
| 73.3%(120) |
| RMSprop | – | – | 81.0%(30) | |||
| Adam | – | – | 84.8%(40) | |||
| 1 | F3, F4, F7, F8, T7, T8, P3, P4 | Bi-LSTM | SGD | 98.5%(35) |
| 86.7%(180) |
| RMSprop | – | – | 81.5%(30) | |||
| Adam | – | – | 85.6%(20) | |||
| 2 | F7, F8, T7, T8 | Bi-LSTM | SGD | – | 93.5%(40) | 86.4%(160) |
| RMSprop | – | – |
| |||
| Adam | – | – | 87.1%(30) | |||
| 3 | F3, F4, F7, F8 | Bi-LSTM | SGD | 64.1 (10) | 65.1%(30) | 64.6%(100) |
| RMSprop | – | – |
| |||
| Adam | – | – | 66.1%(15) | |||
“–” represents the loss value of validation set is greater than 1 or accuracy equal to 100% with very few epochs (overfitting). The underlined values, represent the optimal parameters chosen for the models and the corresponding accuracy of emotion recognition.
Comparisons with prior work in emotion recognition.
| Network model | Physiological signals | Subjects (dataset) | Accuracy (%) |
| SVM ( | EEG | 15 (SEED) | 90.40% |
| MFBSE-EWT + ARF ( | EEG | 15 (SEED) | 94.4% |
| CNN-LSTM ( | EEG | 14 subjects | 97.42% |
| SM-SSA + RCs (IP/CEC) + SVM/KNN ( | EEG, ECG | 23 (DREAMER) | 92.38% |
| LSTM ( | EEG, ECG | 5 subjects | 95.00% |
| Bi-LSTM (proposed method) | EEG | 11 subjects | 95.80% |
MFBSE-EWT, MHMS (multivariate Hilbert marginal spectrum) + FBSE (Fourier-Bessel series expansion) + EWT (empirical wavelet transform); ARF, the sparse autoencoder based random forest classifier; SM-SSA, sliding mode singular spectrum analysis; RCs, reconstructed components; IP, information potential; CEC, centered correntropy.