| Literature DB >> 30959760 |
Dong-Wei Chen1, Rui Miao2, Wei-Qi Yang3, Yong Liang4, Hao-Heng Chen5, Lan Huang6, Chun-Jian Deng7, Na Han8.
Abstract
Feature extraction of electroencephalography (EEG) signals plays a significant role in the wearable computing field. Due to the practical applications of EEG emotion calculation, researchers often use edge calculation to reduce data transmission times, however, as EEG involves a large amount of data, determining how to effectively extract features and reduce the amount of calculation is still the focus of abundant research. Researchers have proposed many EEG feature extraction methods. However, these methods have problems such as high time complexity and insufficient precision. The main purpose of this paper is to introduce an innovative method for obtaining reliable distinguishing features from EEG signals. This feature extraction method combines differential entropy with Linear Discriminant Analysis (LDA) that can be applied in feature extraction of emotional EEG signals. We use a three-category sentiment EEG dataset to conduct experiments. The experimental results show that the proposed feature extraction method can significantly improve the performance of the EEG classification: Compared with the result of the original dataset, the average accuracy increases by 68%, which is 7% higher than the result obtained when only using differential entropy in feature extraction. The total execution time shows that the proposed method has a lower time complexity.Entities:
Keywords: differential entropy; electroencephalography; emotion recognition; feature extraction; linear discriminant analysis
Mesh:
Year: 2019 PMID: 30959760 PMCID: PMC6479375 DOI: 10.3390/s19071631
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Flow chart of algorithm architecture.
Predictive performance of five methods using the original dataset.
| Method | RF | SVM | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Band | Delta | Theta | Alpha | Beta | Gamma | Combined | Delta | Theta | Alpha | Beta | Gamma | Combined |
| Accuracy | 0.385 ± 0.019 | 0.378 ± 0.029 | 0.365 ± 0.033 | 0.474 ± 0.030 | 0.491 ± 0.017 | 0.479 ± 0.038 | 0.310 ± 0.025 | 0.318 ± 0.028 | 0.330 ± 0.016 | 0.431 ± 0.021 | 0.315 ± 0.025 | 0.375 ± 0.028 |
| Precision | 0.383 ± 0.014 | 0.382 ± 0.030 | 0.378 ± 0.046 | 0.480 ± 0.039 | 0.487 ± 0.019 | 0.487 ± 0.032 | 0.311 ± 0.027 | 0.321 ± 0.028 | 0.333 ± 0.015 | 0.440 ± 0.026 | 0.324 ± 0.031 | 0.392 ± 0.024 |
| Recall | 0.391 ± 0.020 | 0.385 ± 0.031 | 0.373 ± 0.034 | 0.481 ± 0.030 | 0.496 ± 0.021 | 0.486 ± 0.035 | 0.311 ± 0.026 | 0.319 ± 0.027 | 0.333 ± 0.018 | 0.434 ± 0.022 | 0.320 ± 0.024 | 0.378 ± 0.031 |
| F1 score | 0.379 ± 0.016 | 0.373 ± 0.029 | 0.362 ± 0.035 | 0.465 ± 0.029 | 0.473 ± 0.012 | 0.474 ± 0.041 | 0.308 ± 0.024 | 0.315 ± 0.028 | 0.328 ± 0.015 | 0.429 ± 0.021 | 0.308 ± 0.025 | 0.372 ± 0.026 |
| Kappa coef. | 0.084 ± 0.028 | 0.074 ± 0.045 | 0.058 ± 0.052 | 0.219 ± 0.049 | 0.242 ± 0.029 | 0.227 ± 0.051 | 0.033 ± 0.038 | 0.020 ± 0.040 | 0.006 ± 0.025 | 0.322 ± 0.035 | 0.019 ± 0.036 | 0.069 ± 0.042 |
RF denotes the random forest method, and SVM denotes the support vector machine method. Kappa coef. is Cohen’s kappa coefficient. Each data field shows performance evaluation indices average ± std of 200 random splits of EEG samples.
Predictive performance with five methods using the original dataset and Linear Discriminant Analysis (LDA).
| Method | RF | SVM | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Band | Delta | Theta | Alpha | Beta | Gamma | Combined | Delta | Theta | Alpha | Beta | Gamma | Combined |
| Accuracy | 0.342 ± 0.039 | 0.298 ± 0.043 | 0.322 ± 0.012 | 0.393 ± 0.035 | 0.414 ± 0.015 | 0.537 ± 0.028 | 0.310 ± 0.025 | 0.318 ± 0.028 | 0.330 ± 0.016 | 0.451 ± 0.020 | 0.475 ± 0.025 | 0.475 ± 0.028 |
| Precision | 0.353 ± 0.049 | 0.306 ± 0.050 | 0.329 ± 0.026 | 0.374 ± 0.044 | 0.381 ± 0.056 | 0.510 ± 0.040 | 0.311 ± 0.027 | 0.321 ± 0.028 | 0.333 ± 0.015 | 0.449 ± 0.026 | 0.464 ± 0.031 | 0.492 ± 0.024 |
| Recall | 0.350 ± 0.043 | 0.304 ± 0.044 | 0.335 ± 0.018 | 0.386 ± 0.028 | 0.394 ± 0.021 | 0.530 ± 0.028 | 0.311 ± 0.026 | 0.319 ± 0.027 | 0.333 ± 0.018 | 0.434 ± 0.022 | 0.466 ± 0.024 | 0.478 ± 0.031 |
| F1 score | 0.337 ± 0.036 | 0.290 ± 0.047 | 0.305 ± 0.015 | 0.385 ± 0.037 | 0.372 ± 0.023 | 0.523 ± 0.034 | 0.308 ± 0.024 | 0.315 ± 0.028 | 0.328 ± 0.015 | 0.429 ± 0.021 | 0.458 ± 0.025 | 0.472 ± 0.026 |
| Kappa coef | 0.024 ± 0.063 | 0.043 ± 0.063 | 0.002 ± 0.024 | 0.204 ± 0.041 | 0.227 ± 0.027 | 0.329 ± 0.039 | 0.033 ± 0.038 | 0.020 ± 0.040 | 0.006 ± 0.025 | 0.362 ± 0.035 | 0.319 ± 0.036 | 0.269 ± 0.042 |
RF denotes the random forest method, and SVM denotes the support vector machine method. Kappa coef. is Cohen’s kappa coefficient. Each data field shows performance evaluation indices average ± std of 200 random splits of EEG samples.
Predictive performance with five methods using the differential entropy dataset.
| Method | RF | SVM | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Band | Delta | Theta | Alpha | Beta | Gamma | Combined | Delta | Theta | Alpha | Beta | Gamma | Combined |
| Accuracy | 0.525 ± 0.025 | 0.527 ± 0.053 | 0.513 ± 0.044 | 0.621 ± 0.044 | 0.627 ± 0.041 | 0.704 ± 0.026 | 0.568 ± 0.012 | 0.663 ± 0.034 | 0.601 ± 0.015 | 0.694 ± 0.036 | 0.681 ± 0.017 | 0.770 ± 0.030 |
| Precision | 0.530 ± 0.021 | 0.537 ± 0.055 | 0.529 ± 0.045 | 0.615 ± 0.046 | 0.620 ± 0.047 | 0.700 ± 0.029 | 0.568±0.014 | 0.667 ± 0.034 | 0.602 ± 0.015 | 0.692 ± 0.031 | 0.680 ± 0.025 | 0.768 ± 0.033 |
| Recall | 0.531 ± 0.028 | 0.532 ± 0.055 | 0.522 ± 0.039 | 0.624 ± 0.046 | 0.627 ± 0.047 | 0.704 ± 0.029 | 0.567 ± 0.013 | 0.666 ± 0.034 | 0.602 ± 0.016 | 0.694 ± 0.037 | 0.681 ± 0.024 | 0.770 ± 0.032 |
| F1 score | 0.525 ± 0.025 | 0.527 ± 0.053 | 0.509 ± 0.047 | 0.614 ± 0.046 | 0.617 ± 0.044 | 0.699 ± 0.029 | 0.565 ± 0.011 | 0.663 ± 0.035 | 0.599 ± 0.016 | 0.690 ± 0.038 | 0.677 ± 0.022 | 0.767 ± 0.032 |
| Kappa coef | 0.290 ± 0.036 | 0.294 ± 0.082 | 0.277 ± 0.063 | 0.433 ± 0.065 | 0.440 ± 0.063 | 0.556 ± 0.040 | 0.351 ± 0.018 | 0.496 ± 0.051 | 0.401 ± 0.023 | 0.541 ± 0.052 | 0.521 ± 0.027 | 0.654 ± 0.046 |
RF denotes the random forest method, and SVM denotes the support vector machine method. Kappa coef. is Cohen’s kappa coefficient. Each data field shows performance evaluation indices average ± std of 200 random splits of EEG samples.
Predictive performance with five methods using the differential entropy dataset and LDA.
| Method | RF | SVM | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Band | Delta | Theta | Alpha | Beta | Gamma | Combined | Delta | Theta | Alpha | Beta | Gamma | Combined |
| Accuracy | 0.467 ± 0.017 | 0.484 ± 0.057 | 0.462 ± 0.049 | 0.556 ± 0.033 | 0.561 ± 0.035 | 0.582 ± 0.025 | 0.568 ± 0.012 | 0.663 ± 0.034 | 0.600 ± 0.015 | 0.714 ± 0.036 | 0.741 ± 0.017 | 0.825 ± 0.032 |
| Precision | 0.483 ± 0.020 | 0.517 ± 0.067 | 0.481 ± 0.048 | 0.570 ± 0.024 | 0.560 ± 0.035 | 0.590 ± 0.037 | 0.568 ± 0.014 | 0.667 ± 0.034 | 0.601 ± 0.016 | 0.709 ± 0.031 | 0.690 ± 0.025 | 0.801 ± 0.034 |
| Recall | 0.475 ± 0.015 | 0.496 ± 0.051 | 0.474 ± 0.047 | 0.568 ± 0.025 | 0.566 ± 0.031 | 0.593 ± 0.028 | 0.567 ± 0.013 | 0.666 ± 0.034 | 0.601 ± 0.017 | 0.711 ± 0.037 | 0.716 ± 0.024 | 0.802 ± 0.031 |
| F1 score | 0.453 ± 0.023 | 0.477 ± 0.065 | 0.454 ± 0.049 | 0.552 ± 0.033 | 0.555 ± 0.035 | 0.573 ± 0.031 | 0.565 ± 0.011 | 0.663 ± 0.035 | 0.598 ± 0.016 | 0.705 ± 0.038 | 0.677 ± 0.022 | 0.799 ± 0.033 |
| Kappa coef | 0.208 ± 0.023 | 0.238 ± 0.077 | 0.205 ± 0.071 | 0.343 ± 0.042 | 0.345 ± 0.051 | 0.378 ± 0.039 | 0.351 ± 0.018 | 0.496 ± 0.051 | 0.400 ± 0.023 | 0.566 ± 0.052 | 0.521 ± 0.027 | 0.698 ± 0.049 |
RF denotes the random forest method, and SVM denotes the support vector machine method. Kappa coef. is Cohen’s kappa coefficient. Each data field shows performance evaluation indices average ± std of 200 random splits of EEG samples.
Figure 2Accuracy in original dataset with five methods.
Figure 3Accuracy in original dataset based on LDA with five methods.
Figure 4Accuracy in differential entropy dataset with five methods.
Figure 5Accuracy in differential entropy dataset based on LDA with five methods.
Figure 6Confusion matrix in original dataset with RF.
Figure 7Confusion matrix in original dataset based on LDA with RF.
Figure 8Confusion matrix in differential entropy dataset with LR.
Figure 9Confusion matrix in differential entropy dataset based on LDA with SVM.
Complexity in four experiments with different methods.
| Experiment | Method | Delta | Theta | Alpha | Beta | Gamma | Combined |
|---|---|---|---|---|---|---|---|
| Prediction performance in original dataset | kNN | 5.215 ± 0.091 | 5.337 ± 0.349 | 6.077 ± 0.408 | 5.997 ± 0.347 | 6.205 ± 1.003 | 25.970 ± 0.069 |
| LR | 111.323 ± 4.970 | 93.115 ± 7.211 | 97.788 ± 3.705 | 63.628 ± 4.623 | 51.596 ± 8.941 | 98.428 ± 6.571 | |
| MLP | 41.407 ± 14.262 | 50.474 ± 2.854 | 51.971 ± 1.282 | 50.837 ± 2.304 | 50.153 ± 2.195 | 105.568 ± 14.086 | |
| RF | 4.052 ± 0.115 | 4.497 ± 0.533 | 4.572 ± 0.280 | 4.982 ± 0.185 | 4.501 ± 0.386 | 9.544 ± 0.271 | |
| SVM | 19.429 ± 0.645 | 18.244 ± 1.475 | 19.388 ± 1.219 | 18.135 ± 0.993 | 17.517 ± 0.980 |
| |
| Prediction performance in original dataset based on LDA | kNN | 2.356 ± 0.043 | 2.310 ± 0.040 | 2.341 ± 0.068 | 2.329 ± 0.038 | 2.357 ± 0.030 | 15.060 ± 0.100 |
| LR | 3.480 ± 0.142 | 3.162 ± 0.094 | 3.209 ± 0.078 | 3.320 ± 0.169 | 3.165 ± 0.078 | 16.151 ± 0.203 | |
| MLP | 2.726 ± 0.080 | 2.647 ± 0.052 | 2.680 ± 0.076 | 2.697 ± 0.054 | 2.686 ± 0.041 | 15.408 ± 0.127 | |
| RF | 2.845 ± 0.089 | 2.793 ± 0.062 | 2.836 ± 0.096 | 2.831 ± 0.065 | 2.815 ± 0.036 | 15.534 ± 0.153 | |
| SVM | 2.850 ± 0.075 | 2.539 ± 0.045 | 2.533 ± 0.076 | 2.477 ± 0.048 | 2.483 ± 0.024 |
| |
| Prediction performance in differential entropy dataset | kNN | 1.871 ± 0.007 | 1.876 ± 0.022 | 1.908 ± 0.011 | 1.889 ± 0.005 | 1.882 ± 0.012 | 9.370 ± 0.022 |
| LR | 5.445 ± 0.310 | 7.340 ± 0.405 | 9.168 ± 0.986 | 10.321 ± 0.426 | 9.916 ± 0.558 | 26.932 ± 0.908 | |
| MLP | 6.524 ± 2.782 | 5.348 ± 1.719 | 4.159 ± 0.719 | 5.115 ± 1.640 | 4.865 ± 0.863 | 22.145 ± 3.354 | |
| RF | 2.243 ± 0.036 | 2.254 ± 0.019 | 2.269 ± 0.022 | 2.206 ± 0.018 | 2.192 ± 0.023 | 4.908 ± 0.140 | |
| SVM | 5.502 ± 0.019 | 5.266 ± 0.034 | 5.309 ± 0.059 | 4.125 ± 0.204 | 3.950 ± 0.213 |
| |
| Prediction performance in differential entropy dataset based on LDA | kNN | 0.838 ± 0.013 | 0.826 ± 0.027 | 0.823 ± 0.025 | 0.839 ± 0.011 | 0.841 ± 0.020 | 4.202 ± 0.079 |
| LR | 1.001 ± 0.023 | 1.081 ± 0.032 | 1.134 ± 0.031 | 1.117 ± 0.018 | 1.097 ± 0.026 | 4.516 ± 0.100 | |
| MLP | 1.187 ± 0.023 | 1.162 ± 0.036 | 1.168 ± 0.038 | 1.187 ± 0.023 | 1.184 ± 0.015 | 4.668 ± 0.111 | |
| RF | 1.301 ± 0.019 | 1.302 ± 0.028 | 1.295 ± 0.027 | 1.309 ± 0.013 | 1.301 ± 0.029 | 4.819 ± 0.143 | |
| SVM | 0.966 ± 0.014 | 0.957 ± 0.029 | 0.973 ± 0.032 | 0.933 ± 0.022 | 0.928 ± 0.017 |
|
LDA denotes the linear discriminant analysis method. kNN denotes the k-nearest neighbor method, and LR denotes the logistic method. RF denotes the random forest method, and SVM denotes the support vector machine method. Each data field shows the consumed time average ± std of 200 random splits of EEG samples.