| Literature DB >> 35808434 |
Delin Ouyang1, Yufei Yuan1, Guofa Li1, Zizheng Guo2.
Abstract
Various lengths of time window have been used in feature extraction for electroencephalogram (EEG) signal processing in previous studies. However, the effect of time window length on feature extraction for the downstream tasks such as emotion recognition has not been well examined. To this end, we investigate the effect of different time window (TW) lengths on human emotion recognition to find the optimal TW length for extracting electroencephalogram (EEG) emotion signals. Both power spectral density (PSD) features and differential entropy (DE) features are used to evaluate the effectiveness of different TW lengths based on the SJTU emotion EEG dataset (SEED). Different lengths of TW are then processed with an EEG feature-processing approach, namely experiment-level batch normalization (ELBN). The processed features are used to perform emotion recognition tasks in the six classifiers, the results of which are then compared with the results without ELBN. The recognition accuracies indicate that a 2-s TW length has the best performance on emotion recognition and is the most suitable to be used in EEG feature extraction for emotion recognition. The deployment of ELBN in the 2-s TW can further improve the emotion recognition performances by 21.63% and 5.04% when using an SVM based on PSD and DE features, respectively. These results provide a solid reference for the selection of TW length in analyzing EEG signals for applications in intelligent systems.Entities:
Keywords: brain–computer interaction; electroencephalogram (EEG); emotion recognition; experiment-level batch normalization; time window length
Mesh:
Year: 2022 PMID: 35808434 PMCID: PMC9269830 DOI: 10.3390/s22134939
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
The selected film clips used to elicit target emotions in the SEED dataset.
| No. | Emotion | Film Clip Sources | #Clips |
|---|---|---|---|
| 1 | negative | Tangshan Earthquake | 2 |
| 2 | negative | Back to 1942 | 3 |
| 3 | positive | Lost in Thailand | 2 |
| 4 | positive | Flirting Scholar | 1 |
| 5 | positive | Just Another Pandora’s Box | 2 |
| 6 | neutral | World Heritage in China | 5 |
Figure 1The protocol of the experiments in the SEED dataset [29].
Figure 2The EEG topo map of the 58 channels.
The detailed information of TW lengths in each trial. N is the number of calculated features in each trial for frequency band of a selected channel.
| TW Length (s) | Number of TWs | Epochs Contained | Feature Format |
|---|---|---|---|
| 180 | 1 | 180 | 58 × 5 × 1 |
| 90 | 2 | 90 | 58 × 5 × 2 |
| 60 | 3 | 60 | 58 × 5 × 3 |
| 30 | 6 | 30 | 58 × 5 × 6 |
| 20 | 9 | 20 | 58 × 5 × 9 |
| 10 | 18 | 10 | 58 × 5 × 18 |
| 5 | 36 | 5 | 58 × 5 × 36 |
| 4 | 45 | 4 | 58 × 5 × 45 |
| 3 | 60 | 3 | 58 × 5 × 60 |
| 2 | 90 | 2 | 58 × 5 × 90 |
| 1 | 180 | 1 | 58 × 5 × 180 |
Parameter settings of the examined classifiers in sklearn. Those parameters that are not listed in this table are set as the default values, while “\” represents that all the used parameters are the default values.
| Classifier | Parameter Setting |
|---|---|
| KNN | n_neighbors = 5, p = 2, metric = ‘minkowski’ |
| LR | solver = ‘liblinear’, random_state = 10 |
| SVM | random_state = 10 |
| GNB | \ |
| MLP | solver = ‘lbfgs’, alpha = 1e−5, hidden_layer_sizes = (100, 3), random_state = 1, max_iter = 1e5 |
| Bagging | base_estimator = lr, n_estimators = 500, max_samples = 1.0, max_features = 1.0, |
Figure 3The protocol of ELBN.
Details of the 10 sets used for training and testing. Please note that the 15 trials are numbered from 0–14.
| Set # | The Trials Used for Training | The Trials Used for Testing |
|---|---|---|
| 1 | [0, 1, 2, 4, 5, 6, 9, 10, 11, 12, 13, 14] | [3, 7, 8] |
| 2 | [0, 1, 2, 3, 6, 7, 8, 9, 10, 12, 13, 14] | [4, 5, 11] |
| 3 | [1, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14] | [0, 2, 12] |
| 4 | [0, 1, 3, 4, 5, 6, 7, 8, 9, 10, 11, 14] | [2, 12, 13] |
| 5 | [0, 1, 2, 3, 4, 5, 7, 9, 10, 11, 13, 14] | [6, 8, 12] |
| 6 | [0, 1, 2, 5, 6, 7, 9, 10, 11, 12, 13, 14] | [3, 4, 8] |
| 7 | [0, 1, 2, 3, 4, 6, 7, 8, 9, 11, 12, 13] | [5, 10, 14] |
| 8 | [0, 2, 3, 4, 5, 7, 9, 10, 11, 12, 13, 14] | [1, 6, 8] |
| 9 | [0, 3, 4, 5, 6, 7, 8, 10, 11, 12, 13, 14] | [1, 2, 9] |
| 10 | [0, 1, 2, 3, 4, 5, 6, 8, 10, 11, 12, 13] | [7, 9, 14] |
The recognition results (mean accuracy and standard deviation) of PSD features extracted from separate frequency bands when using different classifiers under different TW lengths without ELBN. The highest and second-highest accuracies of each classifier are highlighted in bold red and bold blue, respectively.
| TW Length (s) | KNN | LR | SVM | GNB | MLP | Bagging |
|---|---|---|---|---|---|---|
| 180 | 54.22(3.18) | 66.81(5.81) | 49.48(6.32) | 39.63(5.42) | 62.59(5.52) | 66.30(5.72) |
| 90 | 55.33(2.64) |
| 51.78(4.96) |
|
| 68.07(6.37) |
| 60 |
|
|
|
| 66.67(5.17) |
|
| 30 |
| 67.41(7.53) |
| 39.56(6.75) |
| 68.15(6.37) |
| 20 |
| 67.63(6.50) |
| 39.56(6.94) | 66.52(4.63) | 67.78(5.51) |
| 10 | 56.15(2.77) | 66.37(6.34) | 52.74(4.08) | 39.41(6.99) | 63.63(5.32) |
|
| 5 | 56.07(2.86) | 65.70(6.12) | 52.81(4.01) | 39.48(6.98) | 65.26(4.99) | 67.63(5.40) |
| 4 | 56.00(2.84) | 65.41(6.10) | 52.81(4.01) | 39.56(6.98) | 66.22(4.99) | 67.26(5.28) |
| 3 | 56.00(2.84) | 65.63(5.93) | 52.81(4.01) | 39.56(6.98) | 64.96(5.10) | 67.19(5.52) |
| 2 | 56.00(2.84) | 65.63(5.15) | 52.81(4.01) | 39.56(6.98) | 61.85(7.59) | 66.96(5.76) |
| 1 | 56.00(2.84) | 65.78(4.70) | 52.81(4.01) | 39.56(6.98) | 65.04(4.34) | 67.19(6.04) |
The recognition results (mean accuracy and standard deviation) of DE features extracted from separate frequency bands when using different classifiers under different TW lengths without ELBN. The highest and second-highest accuracies of each classifier are highlighted in bold red and bold blue, respectively.
| TW Length (s) | KNN | LR | SVM | GNB | MLP | Bagging |
|---|---|---|---|---|---|---|
| 180 | 65.78(4.81) | 77.19(7.45) | 70.00(4.09) | 48.30(2.91) | 66.30(8.91) | 76.74(6.35) |
| 90 |
| 76.81(7.62) | 71.48(5.90) | 48.96(2.93) | 72.67(5.35) | 77.41(7.11) |
| 60 |
| 76.22(7.65) | 71.56(6.23) | 48.96(2.82) |
| 77.70(7.11) |
| 30 | 66.22(5.06) | 76.81(7.25) | 71.93(6.21) | 48.96(2.87) | 74.81(6.76) | 78.00(6.40) |
| 20 | 65.85(5.04) |
| 72.07(6.25) | 48.96(2.87) |
| 78.30(6.52) |
| 10 | 66.15(4.91) |
|
|
| 72.07(8.17) | 78.30(5.73) |
| 5 | 66.30(4.85) |
|
|
| 73.48(7.52) | 78.52(5.59) |
| 4 | 66.30(4.85) | 78.30(5.41) |
|
| 74.67(5.79) | 78.74(5.72) |
| 3 | 66.30(4.85) | 78.15(5.04) |
|
| 75.11(8.21) | 78.74(6.09) |
| 2 | 66.30(4.85) | 77.70(5.08) |
|
| 68.59(5.12) |
|
| 1 | 66.30(4.85) | 77.19(5.69) |
|
| 73.11(7.96) |
|
The recognition results (mean accuracy and standard deviation) of PSD features extracted from separate frequency bands when using different classifiers under different TW lengths with ELBN. The highest and second-highest accuracies of each classifier are highlighted in bold red and bold blue, respectively.
| TW Length (s) | KNN | LR | SVM | GNB | MLP | Bagging |
|---|---|---|---|---|---|---|
| 180 | 64.89(4.06) | 73.33(6.87) | 72.59(3.94) | 56.44(5.94) | 72.37(4.76) | 74.37(5.55) |
| 90 | 65.85(3.80) | 77.93(6.56) | 73.41(3.86) | 59.78(6.01) | 73.04(4.58) | 78.37(6.56) |
| 60 | 67.48(3.55) | 77.04(6.93) | 73.70(4.64) |
|
| 77.78(6.20) |
| 30 | 67.56(2.94) | 77.48(5.94) |
| 60.37(6.01) | 71.26(7.97) | 77.78(5.26) |
| 20 | 67.26(3.06) | 77.85(6.09) |
|
|
| 78.67(5.13) |
| 10 | 67.63(3.16) | 79.04(6.03) |
| 60.30(6.02) | 69.93(5.19) | 78.67(5.52) |
| 5 | 67.70(3.11) | 79.04(6.29) |
|
| 73.41(6.56) |
|
| 4 |
| 79.04(6.17) |
|
| 73.26(4.25) | 79.33(5.76) |
| 3 |
|
|
|
| 72.96(6.96) |
|
| 2 |
|
|
|
| 70.59(7.61) | 79.33(5.51) |
| 1 |
|
|
|
|
|
|
The recognition results (mean accuracy and standard deviation) of DE features extracted from separate frequency bands when using different classifiers under different TW lengths with ELBN. The highest and second-highest accuracies of each classifier are highlighted in bold red and bold blue, respectively.
| TW Length (s) | KNN | LR | SVM | GNB | MLP | Bagging |
|---|---|---|---|---|---|---|
| 180 | 70.67(5.63) | 75.78(8.55) | 75.70(5.47) | 67.48(6.22) | 73.33(7.50) | 77.70(7.21) |
| 90 |
| 79.70(8.02) | 77.41(6.01) | 69.26(6.98) | 75.70(6.82) | 80.89(7.79) |
| 60 | 72.37(5.93) | 79.04(8.29) |
| 69.41(6.84) | 75.78(7.16) | 80.22(7.65) |
| 30 | 73.19(5.47) | 81.56(7.10) | 77.26(4.94) | 69.41(6.98) | 73.70(7.28) | 81.48(7.37) |
| 20 | 72.96(5.31) | 81.56(7.74) | 77.48(5.32) |
| 76.15(7.84) | 81.26(7.73) |
| 10 | 73.26(5.41) | 82.37(6.50) | 77.56(5.13) |
| 76.22(6.72) | 82.15(6.64) |
| 5 | 73.41(5.31) | 82.59(6.34) | 77.56(5.13) | 69.41(6.74) |
|
|
| 4 | 73.41(5.22) | 82.44(6.60) |
| 69.41(6.74) | 77.04(7.40) |
|
| 3 | 73.41(5.22) |
|
| 69.41(6.74) | 76.00(6.50) |
|
| 2 |
|
|
| 69.41(6.74) |
| 82.22(6.55) |
| 1 | 73.33(5.16) | 82.52(7.13) |
| 69.41(6.74) | 76.30(6.44) | 82.15(6.81) |
The recognition results (mean accuracy and standard deviation) of DE features with different TW lengths. The red color highlights the results with statistical significance (p ≤ 0.05).
| Feature | Frequency | Processing | Fp1 | Fpz | Fp2 | AF3 | AF4 | C3 | C1 | Cz | C2 | C4 | C6 | P1 | Pz | P2 | P4 | P8 | PO7 | PO3 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PSD | Delta | Without ELBN |
|
|
|
|
|
|
| .712 | .066 |
|
| .301 | .086 | .109 | .085 |
|
| .156 |
| Theta |
|
|
|
| .325 |
| .235 | .798 | .513 | .129 |
|
|
|
|
|
|
|
| ||
| Alpha | .582 | .632 | .671 | .367 | .767 | .334 | .482 | .994 | .673 | .623 | .172 | .484 | .439 | .484 | .449 |
| .149 | .406 | ||
| Beta | .437 | .252 | .497 | .398 | .539 |
|
| .814 | .082 |
|
| .326 | .216 | .368 | .092 |
|
|
| ||
| Gamma | .218 |
| .246 | .064 | .457 |
|
| .454 |
|
|
|
|
|
|
|
|
|
| ||
| Delta | With |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Theta |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Alpha |
|
|
|
| .278 |
|
| .515 |
|
|
|
|
|
|
|
|
|
| ||
| Beta | .076 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Gamma |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| DE | Delta | Without ELBN |
|
| .349 | .567 | .266 | .577 | .442 | .756 | .965 | .827 | .770 |
|
| .079 | .628 | .386 | .141 | .109 |
| Theta |
|
| .894 |
| .218 |
|
| .098 | .071 | .509 | .061 | .431 |
|
| .211 | .059 |
|
| ||
| Alpha |
|
|
|
| .061 | .352 | .728 | .143 |
|
|
|
|
|
|
|
|
| .068 | ||
| Beta |
|
| .056 | .476 | .494 |
|
|
|
|
|
| .114 |
|
| .063 | .466 |
|
| ||
| Gamma | .202 | .243 |
|
|
| .126 |
|
|
|
| .120 |
| .054 |
|
|
| .086 |
| ||
| Delta | With |
|
|
| .053 |
|
|
|
| .212 | .061 | .186 |
|
|
|
|
|
|
| |
| Theta |
|
| .095 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Alpha |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Beta |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||
| Gamma |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 4Online emotion recognition results when using PSD features. The legends are described as “ground truth emotion _ online predicted emotion” (e.g., positive _ positive means that the ground truth emotion and online predicted emotion are both positive, while positive _ negative means that the ground truth emotion is positive, but the online predicted emotion is negative.). The solid lines represent the median values of probabilities for online predicted results and the boundaries of the shadow areas illustrate the 25th percentile and 75th percentile values: (a) online recognition of positive emotion samples; (b) online recognition of neutral emotion samples; (c) online recognition of negative emotion samples.
Figure 5Online emotion recognition results when using DE features: (a) online recognition of positive emotion samples; (b) online recognition of neutral emotion samples; (c) online recognition of negative emotion samples.