| Literature DB >> 36034935 |
Zhichao Han1, Hongli Chang2, Xiaoyan Zhou1, Jihao Wang1, Lili Wang1, Yongbin Shao1.
Abstract
Objectve: Emotional brain-computer interface can recognize or regulate human emotions for workload detection and auxiliary diagnosis of mental illness. However, the existing EEG emotion recognition is carried out step by step in feature engineering and classification, resulting in high engineering complexity and limiting practical applications in traditional EEG emotion recognition tasks. We propose an end-to-end neural network, i.e., E2ENNet.Entities:
Keywords: depthwise separable convolution; electroencephalogram (EEG); emotional brain-computer interface; long short-term memory; neurocognitive
Year: 2022 PMID: 36034935 PMCID: PMC9413837 DOI: 10.3389/fncom.2022.942979
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
Figure 1Traditional framework of EEG emotion recognition.
Figure 2The structure diagram of end-to-end neural network on EEG-based emotion recognition.
Detailed parameters of E2ENNet model.
|
|
|
|
|
|
|---|---|---|---|---|
| 1 | Input | (C,T) | ||
| Reshape | (C,T,1) | |||
| Conv2D | (1,64) | (C,T, | Linear | |
| BatchNorm | (C,T, | |||
| 2 | DepthwiseConv2D | (C,1) | (1,T,2 × | Linear |
| Batchnorm | (1,T,2 × | |||
| Activation | (1,T,2 × | Elu | ||
| AveragePool2D | (1,4) | (1,T/4,2 × | ||
| Dropout | (1,T/4,2 × | |||
| 3 | SeparableConv2D | (1,16) | (1,T/4, | Linear |
| Batchnorm | (1,T/4, | |||
| Activation | (1,T/4, | Elu | ||
| AveragePool2D | (1,8) | (1,T/32, | ||
| Dropout | (1,T/32, | |||
| 4 | Reshape | ( | ||
| LSTM | 64 | 64 | ||
| Reshape | (64,1) | |||
| LSTM | 32 | 32 | ||
| Classifier | Dense | N | Softmax |
Block1-4 represents the 2D convolution block, depthwise Convolution block, depthwise separable convolution block and LSTM block, respectively.
Details of three different datasets.
|
|
|
|
|---|---|---|
| DEAP | 32 | Two-category: High/Low Valance,High/Low Arousals |
| DREAMER | 14 | Two-category: High/Low Valance,Arousal,Dominance |
| MPED | 62 | Seven-category: joy,fun,neutrality,sadness, |
Figure 3Two-category classification comparison experiment on DEAP dataset.
Figure 4Two-category classification comparison experiment on DREAMER dataset.
Figure 5Seven-category classification experiment on MPED dataset.
Experiments on DEAP, DREAMER and MPED datasets of using different input data of E2ENNet.
|
|
|
|
|
|---|---|---|---|
| PSD | 77.60 ± 6.05% | 88.84 ± 9.18% | 37.84 ± 7.90% |
| DE | 80.26 ± 5.86% | 96.81 ± 2.18% | 40.50 ± 7.88% |
| Raw data+PSD+DE | 77.54 ± 8.80% | 89.50 ± 9.04% | 38.54 ± 8.68% |
|
|
|
|
|
Bold values represent the highest accuracy.
Ablation experiments of E2ENNet on DEAP, DREAMER and MPED datasets.
|
|
|
|
|
|---|---|---|---|
| E2ENNet(no conv)a | 63.46 ± 8.05% | 82.78 ± 7.34% | 32.18 ± 9.29% |
| E2ENNet(no LSTM)b | 94.89 ± 6.68% | 97.38% ± 1.86% | 40.03 ± 7.34% |
|
|
|
|
|
E2ENNet(no conv)a denotes the E2ENNet without all convolution blocks, only retain LSTM block; E2ENNet(no LSTM)b denotes the E2ENNet without LSTM block, only retain convolution block. E2ENNetc demotes the complete E2ENNet model, containing all convolution and LSTM blocks. Bold values represent the highest accuracy.
Figure 6Experiments on DEAP, DREAMER, and MPED dataset based on different LSTM layers in E2ENNet model.
The time cost of different models on DEAP dataset.
|
|
|
|
|
|---|---|---|---|
| SVM |
|
| 75.14% |
| DBN | / | 35s | 85.42% |
| CapsNet | 181s | 16s | 95.33% |
| E2ENNet(no conv) | 252s | 8s | 63.46% |
| E2ENNet(no LSTM) | 51s |
| 94.89% |
|
| 72s |
|
|
Bold values represent the lowest time cost and the highest accuracy.