| Literature DB >> 33841089 |
Aimei Dong1, Zhigang Li1, Qiuyu Zheng1.
Abstract
EEG signal classification has been a research hotspot recently. The combination of EEG signal classification with machine learning technology is very popular. Traditional machine leaning methods for EEG signal classification assume that the EEG signals are drawn from the same distribution. However, the assumption is not always satisfied with the practical applications. In practical applications, the training dataset and the testing dataset are from different but related domains. How to make best use of the training dataset knowledge to improve the testing dataset is critical for these circumstances. In this paper, a novel method combining the non-negative matrix factorization technology and the transfer learning (NMF-TL) is proposed for EEG signal classification. Specifically, the shared subspace is extracted from the testing dataset and training dataset using non-negative matrix factorization firstly and then the shared subspace and the original feature space are combined to obtain the final EEG signal classification results. On the one hand, the non-negative matrix factorization can assure to obtain essential information between the testing and the training dataset; on the other hand, the combination of shared subspace and the original feature space can fully use all the signals including the testing and the training dataset. Extensive experiments on Bonn EEG confirmed the effectiveness of the proposed method.Entities:
Keywords: EEG signal; classification; non-negative factorization; shared hidden subspace; transfer learning
Year: 2021 PMID: 33841089 PMCID: PMC8024531 DOI: 10.3389/fnins.2021.647393
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
FIGURE 1Principle of the proposed transferred SVM based on non-negative matrix factorization.
The description of the low-dimensional shared hidden subspace learning.
| 1. |
| 2. |
| 3. |
| 4. |
| 4-1:update ( |
| 4-2:update (( |
| 4-3:update (( |
| 4-4: |
| textbfUntil: |
| ∥( |
| 5. |
FIGURE 2Description of the EEG data.
FIGURE 3Typical EEG signals in groups (A–E).
The description of the 8 groups of datasets.
| 1# | A(75),E(75) | A(25),E(25) | |
| 2# | A(75),B(75),E(75) | A(25),B(25),E(25) | |
| 3# | A(75),E(75) | A(25),C(25) | |
| 4# | A(75),E(75) | A(25),D(25) | |
| 5# | B(75),E(75) | B(25),C(25) | |
| 6# | B(75),E(75) | B(25),D (25) | |
| 7# | A(75),B(75),E(75) | A(25),B(25),C(25) | |
| 8# | A(75),B(75),E(75) | A(25),B(25),D(25) | |
Classification accuracy comparison of 8 classifiers on datasets based on WPD feature extraction.
| SVM | 0.9150 | 0.6733 | 0.6842 | 0.6987 | 0.9550 | 0.9650 | 0.6433 | 0.6667 |
| LDA | 0.9150 | 0.8600 | 0.8350 | 0.8450 | 0.7850 | 0.8050 | 0.8367 | 0.8300 |
| DT | 0.8950 | 0.7933 | 0.8500 | 0.8300 | 0.9500 | 0.9300 | 0.7300 | 0.7267 |
| NB | 0.8700 | 0.7799 | 0.5800 | 0.5600 | 0.7600 | 0.7450 | 0.5799 | 0.6167 |
| KNN | 0.9150 | 0.8533 | 0.7650 | 0.8050 | 0.9600 | 0.9500 | 0.7500 | 0.7467 |
| MTLF | 0.9600 | 0.8800 | 0.6950 | 0.7000 | 0.9000 | 0.8850 | 0.7433 | 0.7564 |
| LMPROJ | 0.8700 | 0.7767 | 0.7950 | 0.8750 | 0.8000 | 0.9200 | 0.6800 | 0.6700 |
| NMF-TL | 0.9700 | 0.9800 | 0.9500 | 0.9500 | 0.9700 | 0.9750 | 0.9699 | 0.9467 |
Classification accuracy comparison of 8 classifiers on datasets based on KPCA feature extraction.
| SVM | 0.9300 | 0.8300 | 0.5700 | 0.5645 | 0.7500 | 0.7700 | 0.5933 | 0.6267 |
| LDA | 0.9050 | 0.5467 | 0.8900 | 0.9530 | 0.9150 | 0.9150 | 0.6467 | 0.6733 |
| DT | 0.9800 | 0.8533 | 0.8950 | 0.9725 | 0.8400 | 0.8650 | 0.7767 | 0.8700 |
| NB | 0.8950 | 0.8149 | 0.6300 | 0.7900 | 0.7900 | 0.7550 | 0.6367 | 0.6700 |
| KNN | 0.9400 | 0.7767 | 0.8450 | 0.8950 | 0.8850 | 0.9050 | 0.7400 | 0.7467 |
| MTLF | 0.9350 | 0.9400 | 0.7750 | 0.8500 | 0.7650 | 0.8150 | 0.8199 | 0.8400 |
| LMPROJ | 0.9550 | 0.9233 | 0.7717 | 0.7700 | 0.8900 | 0.8400 | 0.8633 | 0.8700 |
| NMF-TL | 0.9870 | 0.9500 | 0.9650 | 0.9800 | 0.9800 | 0.9600 | 0.9600 | 0.9767 |
Classification accuracy comparison of 8 classifiers on datasets based on SIFT feature extraction.
| SVM | 0.9800 | 0.6908 | 0.5600 | 0.5800 | 0.7150 | 0.7700 | 0.7187 | 0.7033 |
| LDA | 0.9900 | 0.8900 | 0.5050 | 0.5650 | 0.6300 | 0.6650 | 0.5700 | 0.6100 |
| DT | 0.9764 | 0.9300 | 0.6500 | 0.7200 | 0.5600 | 0.6450 | 0.6467 | 0.7067 |
| NB | 0.9450 | 0.9367 | 0.5800 | 0.5800 | 0.5650 | 0.6400 | 0.6499 | 0.6233 |
| KNN | 0.9864 | 0.9367 | 0.5100 | 0.5650 | 0.5100 | 0.5400 | 0.6100 | 0.6333 |
| MTLF | 0.9850 | 0.9833 | 0.5125 | 0.5800 | 0.8450 | 0.8400 | 0.6634 | 0.7067 |
| LMPROJ | 0.9800 | 0.7933 | 0.6000 | 0.8750 | 0.8700 | 0.8750 | 0.6700 | 0.6750 |
| NMF-TL | 0.9950 | 0.9933 | 0.9700 | 0.9650 | 0.9700 | 0.9650 | 0.9467 | 0.9500 |
FIGURE 4Classification accuracy comparison of 8 classifiers on datasets based on WPD feature extraction.
F1_score of 8 classifiers on datasets based on WPD feature extraction.
| SVM | 0.9364 | 0.8033 | 0.9784 | 0.9685 | 0.9562 | 0.9587 | 0.6877 | 0.6954 |
| LDA | 0.9600 | 0.9100 | 0.9000 | 0.9000 | 0.9700 | 0.9700 | 0.9100 | 0.9100 |
| DT | 0.9000 | 0.7600 | 0.8700 | 0.8700 | 0.9500 | 0.9500 | 0.7600 | 0.7600 |
| NB | 0.8659 | 0.8050 | 0.2092 | 0.3303 | 0.8022 | 0.7922 | 0.8050 | 0.7850 |
| KNN | 0.9600 | 0.8550 | 0.9000 | 0.9700 | 0.9900 | 0.9800 | 0.8450 | 0.8450 |
| MTLF | 0.9559 | 0.8850 | 0.5578 | 0.5994 | 0.8888 | 0.8885 | 0.8700 | 0.8700 |
| LMPROJ | 0.8678 | 0.8535 | 0.8109 | 0.8725 | 0.8260 | 0.9126 | 0.8025 | 0.7684 |
| NMF-TL | 0.9987 | 0.9850 | 0.9800 | 0.9700 | 0.9800 | 0.9800 | 0.9850 | 0.9800 |
F1_score of 8 classifiers on datasets based on PCA feature extraction.
| SVM | 0.9014 | 0.8840 | 0.9572 | 0.9685 | 0.6503 | 0.6462 | 0.5972 | 0.6211 |
| LDA | 0.9100 | 0.5100 | 0.9000 | 0.9100 | 0.8700 | 0.8700 | 0.5300 | 0.5300 |
| DT | 0.9890 | 0.8450 | 0.9767 | 0.9823 | 0.8500 | 0.8500 | 0.8100 | 0.8200 |
| NB | 0.7930 | 0.8150 | 0.4302 | 0.7403 | 0.7759 | 0.7562 | 0.8152 | 0.4850 |
| KNN | 0.9800 | 0.7700 | 0.9300 | 0.9400 | 0.9100 | 0.9000 | 0.7700 | 0.7700 |
| MTLF | 0.9341 | 0.9200 | 0.7046 | 0.8131 | 0.8374 | 0.8274 | 0.9200 | 0.9200 |
| LMPROJ | 0.9566 | 0.9434 | 0.7806 | 0.8066 | 0.8984 | 0.8376 | 0.9037 | 0.9078 |
| NMF-TL | 0.9800 | 0.9450 | 0.9700 | 0.9980 | 0.9800 | 0.9400 | 0.9700 | 0.9700 |
F1_score of 8 classifiers on datasets based on SIFT feature extraction.
| SVM | 0.9796 | 0.8264 | 0.9796 | 0.9765 | 0.8042 | 0.8218 | 0.7239 | 0.7470 |
| LDA | 0.9898 | 0.8500 | 0.9672 | 0.9801 | 0.9600 | 0.9765 | 0.8500 | 0.8500 |
| DT | 0.9987 | 0.9050 | 0.9253 | 0.9645 | 0.9900 | 0.9900 | 0.9050 | 0.9050 |
| NB | 0.9667 | 0.8850 | 0.4248 | 0.4247 | 0.6927 | 0.6972 | 0.8850 | 0.8900 |
| KNN | 0.9845 | 0.9150 | 0.9632 | 0.9754 | 0.9847 | 0.9667 | 0.9100 | 0.9100 |
| MTLF | 0.9900 | 0.9800 | 0.2720 | 0.3995 | 0.6766 | 0.8626 | 0.9391 | 0.9066 |
| LMPROJ | 0.9649 | 0.8661 | 0.3307 | 0.8871 | 0.8803 | 0.8812 | 0.8816 | 0.8016 |
| NMF-TL | 0.9920 | 0.9950 | 0.9987 | 0.9600 | 0.9800 | 0.9900 | 0.9700 | 0.9950 |
Recall of 8 classifiers on datasets based on WPD feature extraction.
| SVM | 0.9600 | 0.9980 | 0.9700 | 0.9600 | 0.9700 | 0.9700 | 0.5100 | 0.5100 |
| LDA | 0.8700 | 0.7600 | 0.7700 | 0.7900 | 0.6000 | 0.6400 | 0.6900 | 0.6700 |
| DT | 0.8900 | 0.8600 | 0.8300 | 0.7900 | 0.9500 | 0.9100 | 0.6700 | 0.6600 |
| NB | 0.8400 | 0.8581 | 0.1200 | 0.2200 | 0.9600 | 0.9600 | 0.6521 | 0.6674 |
| KNN | 0.8700 | 0.8500 | 0.6300 | 0.6400 | 0.9300 | 0.9200 | 0.5600 | 0.5500 |
| MTLF | 0.9700 | 0.9393 | 0.5150 | 0.5300 | 0.9100 | 0.9050 | 0.7815 | 0.7757 |
| LMPROJ | 0.8600 | 0.9750 | 0.8700 | 0.8600 | 0.9200 | 0.8400 | 0.9750 | 0.9750 |
| NMF-TL | 0.9400 | 0.9700 | 0.9200 | 0.9300 | 0.9600 | 0.9700 | 0.9400 | 0.8800 |
Recall of 8 classifiers on datasets based on SITF feature extraction.
| SVM | 0.9800 | 0.8350 | 0.8650 | 0.9650 | 0.9750 | 0.9850 | 0.6300 | 0.6650 |
| LDA | 0.9800 | 0.9700 | 0.0400 | 0.1300 | 0.2600 | 0.3300 | 0.0400 | 0.1300 |
| DT | 0.9900 | 0.9800 | 0.3000 | 0.4400 | 0.1300 | 0.3000 | 0.1300 | 0.3100 |
| NB | 0.9875 | 0.9548 | 0.4100 | 0.4100 | 0.9750 | 0.9800 | 0.6618 | 0.6594 |
| KNN | 0.9900 | 0.9870 | 0.0400 | 0.1300 | 0.0400 | 0.0800 | 0.0400 | 0.0800 |
| MTLF | 0.9850 | 0.9847 | 0.2391 | 0.3000 | 0.9750 | 0.9635 | 0.6588 | 0.7049 |
| LMPROJ | 0.9700 | 0.9550 | 0.2000 | 0.9575 | 0.9200 | 0.9125 | 0.9525 | 0.9675 |
| NMF-TL | 0.9987 | 0.9900 | 0.9400 | 0.9700 | 0.9600 | 0.9400 | 0.9000 | 0.8600 |
Recall of 8 classifiers on datasets based on PCA feature extraction.
| SVM | 0.9550 | 0.9650 | 0.9200 | 0.9400 | 0.5600 | 0.5600 | 0.4600 | 0.4600 |
| LDA | 0.9000 | 0.6200 | 0.8800 | 0.9600 | 0.9600 | 0.9600 | 0.8800 | 0.9600 |
| DT | 0.9600 | 0.8700 | 0.7900 | 0.9800 | 0.8300 | 0.8800 | 0.7100 | 0.9700 |
| NB | 0.7700 | 0.8643 | 0.2800 | 0.7500 | 0.7100 | 0.7100 | 0.6809 | 0.8060 |
| KNN | 0.9800 | 0.7700 | 0.9300 | 0.9400 | 0.9100 | 0.9000 | 0.7700 | 0.7700 |
| MTLF | 0.9200 | 0.9898 | 0.7000 | 0.7000 | 0.9500 | 0.9525 | 0.7898 | 0.8534 |
| LMPROJ | 0.9600 | 0.9450 | 0.9500 | 0.9550 | 0.9700 | 0.9200 | 0.9600 | 0.9600 |
| NMF-TL | 0.9700 | 0.9600 | 0.9900 | 0.9600 | 0.9800 | 0.9800 | 0.9300 | 0.9700 |
Friedman values for 8 different methods on datasets based on WPD feature extraction.
| SVM | 1.420 | 37.81 | 36.01 | 36.64 | 0.4300 | 1.200 | 38.14 | 27.41 |
| LDA | 7.128 | 11.42 | 1.48 | 3.528 | 9.148 | 3.020 | 3.045 | 12.23 |
| DT | 0.7375 | 3.788 | 0.6325 | 1.418 | 0.5150 | 0.8775 | 0.2700 | 1.408 |
| NB | 1.818 | 6.735 | 25.40 | 17.75 | 9.423 | 11.36 | 28.11 | 20.25 |
| KNN | 2.260 | 2.368 | 3.562 | 1.470 | 9.910 | 1.628 | 13.25 | 3.648 |
| MTLF | 0.1725 | 0.4475 | 12.60 | 10.80 | 3.980 | 5.590 | 12.16 | 10.86 |
| LMPROJ | 2.250 | 14.74 | 1.788 | 0.3450 | 5.288 | 3.245 | 28.73 | 30.51 |
| NMF-TL | 0.8150 | 0.3700 | 2.853 | 1.178 | 1.255 | 0.3225 | 1.238 | 1.023 |
FIGURE 7Nemenyi test chart of 8 different methods on datasets based on WPD feature extraction.
FIGURE 9Nemenyi test chart of 8 different methods on datasets based on KPCA feature extraction.
Friedman values for 8 different methods on datasets based on SIFT feature extraction.
| SVM | 0.430 | 35.27 | 36.06 | 36.64 | 27.27 | 11.71 | 15.88 | 9.250 |
| LDA | 16.50 | 16.00 | 14.52 | 13.55 | 18.67 | 21.72 | 18.12 | 17.78 |
| DT | 8.593 | 23.98 | 7.800 | 12.89 | 8.280 | 15.47 | 14.95 | 30.32 |
| NB | 0.1075 | 1.283 | 23.04 | 25.43 | 31.86 | 25.96 | 25.13 | 33.13 |
| KNN | 16.80 | 35.85 | 17.12 | 27.64 | 16.65 | 31.67 | 15.76 | 28.85 |
| LMPROJ | 1.00 | 14.00 | 26.10 | 4.310 | 1.010 | 0.9300 | 21.73 | 38.74 |
| MTLF | 0.4125 | 0.085 | 35.11 | 28.40 | 8.670 | 8.648 | 33.47 | 26.43 |
| NMF-TL | 0.4300 | 0.2850 | 6.195 | 0.7825 | 0.5150 | 3.300 | 3.865 | 2.028 |
Friedman values for 8 different methods on datasets based on KPCA feature extraction.
| SVM | 14.75 | 37.46 | 28.83 | 7.850 | 9.773 | 33.44 | 8.365 | 27.28 |
| LDA | 1.565 | 0.9150 | 2.570 | 4.422 | 2.220 | 0.6975 | 0.7675 | 2.165 |
| DT | 1.223 | 1.010 | 2.085 | 2.727 | 2.143 | 2.800 | 2.433 | 4.095 |
| NB | 2.213 | 21.05 | 5.057 | 21.55 | 2.513 | 1.095 | 13.15 | 2.580 |
| KNN | 1.165 | 0.3075 | 3.162 | 0.8975 | 1.445 | 0.3050 | 0.2450 | 4.475 |
| LMPROJ | 0.5450 | 0.4900 | 11.59 | 8.845 | 2.665 | 5.875 | 4.033 | 35.63 |
| MTLF | 0.1725 | 0.3825 | 12.73 | 7.200 | 7.773 | 5.955 | 11.04 | 5.973 |
| NMF-TL | 0.3625 | 0.7600 | 0.2575 | 0.6225 | 0.8150 | 2.307 | 0.2150 | 2.545 |