| Literature DB >> 32226230 |
Haider Raza1, Dheeraj Rathee2, Shang-Ming Zhou3, Hubert Cecotti4, Girijesh Prasad2.
Abstract
The non-stationary nature of electroencephalography (EEG) signals makes an EEG-based brain-computer interface (BCI) a dynamic system, thus improving its performance is a challenging task. In addition, it is well-known that due to non-stationarity based covariate shifts, the input data distributions of EEG-based BCI systems change during inter- and intra-session transitions, which poses great difficulty for developments of online adaptive data-driven systems. Ensemble learning approaches have been used previously to tackle this challenge. However, passive scheme based implementation leads to poor efficiency while increasing high computational cost. This paper presents a novel integration of covariate shift estimation and unsupervised adaptive ensemble learning (CSE-UAEL) to tackle non-stationarity in motor-imagery (MI) related EEG classification. The proposed method first employs an exponentially weighted moving average model to detect the covariate shifts in the common spatial pattern features extracted from MI related brain responses. Then, a classifier ensemble was created and updated over time to account for changes in streaming input data distribution wherein new classifiers are added to the ensemble in accordance with estimated shifts. Furthermore, using two publicly available BCI-related EEG datasets, the proposed method was extensively compared with the state-of-the-art single-classifier based passive scheme, single-classifier based active scheme and ensemble based passive schemes. The experimental results show that the proposed active scheme based ensemble learning algorithm significantly enhances the BCI performance in MI classifications.Entities:
Keywords: BCI, Brain-computer-interface; Brain-computer interface (BCI); CS, Covariate shift; CSA, Covariate shift adaptation; CSE, Covariate shift estimation; CSE-UAEL, CSE-based unsupervised adaptive ensemble learning; CSP, Common spatial pattern; CSV, Covariate shift validation; CSW, Covariate shift warning; Covariate shift; DWEC, Dynamically weighted ensemble classification; EEG, Electroencephalography; ERD, Synchronization; ERS, Desynchronization; EWMA, exponential weighted moving average; Electroencephalogram (EEG); Ensemble learning; FB, Frequency band; FBCSP, Filter bank common spatial pattern; KNN, K-nearest-neighbors; LDA, Linear discriminant analysis; MI, Motor imagery; NSL, Non-stationary learning; Non-stationary learning; PCA, Principal component analysis; PWKNN, Probabilistic weighted K-nearest neighbour; RSM, Random subspace method; SSL, Semi-supervised learning
Year: 2019 PMID: 32226230 PMCID: PMC7086459 DOI: 10.1016/j.neucom.2018.04.087
Source DB: PubMed Journal: Neurocomputing ISSN: 0925-2312 Impact factor: 5.719
Fig. 1Covariate shift (CS) between the training (Tr) and test (Ts) distributions of subject A07 in dataset-2A. (a) illustrates the CS in the mu (μ) band and (b) shows the CS in the beta (β) band.
Algorithm 1Covariate Shift Estimation (CSE) [13].
Algorithm 2CSE-UAEL.
Algorithm 3PWKNN.
Fig. 2Block diagram of the signal processing and machine learning pipeline implemented in the study. The system consists of two phases. During the training phase, the features were extracted in the temporal and spatial domains from the raw EEG signals, followed by the estimation of covariate shift parameter (i.e. λ and L, smoothing constant and control limit multiplier, respectively) and a classifier is trained on the labeled examples (i.e. X). In the evaluation phase, a similar signal processing method is applied initially and CSP features were monitored by the CSE and adaptation block. In the CSA block, the CSE procedure identifies the CSs and initiates adaptation by adding the k classifier f to the ensemble E, where k counts the number of identified CSs during the evaluation phase. Finally, the k classifier outputs from E are combined to predict the class label.
Results for CSE procedure in dataset-2A AND dataset-2B on BCI-competition-IV.
| CSE for 2A | CSE for 2B | ||||||
|---|---|---|---|---|---|---|---|
| Subject | CSW | CSV | Subject | CSW | CSV | ||
| 0.50 | 12 | 6 | 0.28 | 14 | 10 | ||
| 0.55 | 15 | 8 | 0.17 | 18 | 13 | ||
| 0.60 | 7 | 6 | 0.60 | 19 | 12 | ||
| 0.61 | 10 | 3 | 0.20 | 11 | 6 | ||
| 0.72 | 13 | 8 | 0.10 | 12 | 8 | ||
| 0.54 | 12 | 6 | 0.33 | 22 | 12 | ||
| 0.57 | 11 | 4 | 0.30 | 17 | 11 | ||
| 0.50 | 11 | 5 | 0.21 | 27 | 14 | ||
| 0.70 | 6 | 4 | 0.45 | 18 | 7 | ||
| 0.58 | 10.77 | 5.55 | 0.29 | 17.55 | 10.33 | ||
Fig. 3The plot showed the effect of lambda (λ) on the performance of CSE at CSV stage. The average CSs identified for all the nine subjects were presented for dataset-2A.
Classification accuracy in (%) for dataset-2A in both passive and active schemes. C-1: a combination of PWKNN-PWKNN classifiers; C-2: a combination of inductive-inductive classifiers (i.e. LDA-LDA); and C-3: a combination of inductive-transductive classifiers (i.e. PWKNN-LDA).
| Subjects | Single classifier | |||||
|---|---|---|---|---|---|---|
| Passive scheme | Active scheme | |||||
| C-1 | C-2 | C-3 | C-1 | C-2 | C-3 | |
| A01 | 58.33 | 87.50 | 90.28 | 58.33 | 91.67 | 88.89 |
| A02 | 54.17 | 58.33 | 64.58 | 54.17 | 63.19 | 63.89 |
| A03 | 54.17 | 95.83 | 94.44 | 54.17 | 91.67 | 95.14 |
| A04 | 51.39 | 67.36 | 69.44 | 51.39 | 69.44 | 69.44 |
| A05 | 66.67 | 69.44 | 71.53 | 65.28 | 70.14 | 74.31 |
| A06 | 47.22 | 65.28 | 66.67 | 49.31 | 68.06 | 65.97 |
| A07 | 53.47 | 77.08 | 72.92 | 53.47 | 72.92 | 72.92 |
| A08 | 45.83 | 86.81 | 91.67 | 45.83 | 91.67 | 92.36 |
| A09 | 43.06 | 88.89 | 88.19 | 41.67 | 88.89 | 88.19 |
| Mean | 52.70 | 77.39 | 78.86 | 52.62 | 78.63 | 79.01 |
| Std | 7.10 | 12.93 | 12.01 | 6.86 | 12.01 | 12.09 |
Classification accuracy in (%) for dataset-2B in both passive and active schemes. C-1: a combination of PWKNN-PWKNN classifiers; C-2: a combination of inductive-inductive classifiers (i.e. LDA-LDA); and C-3: a combination of inductive-transductive classifiers (i.e. PWKNN-LDA).
| Subjects | Single classifier | |||||
|---|---|---|---|---|---|---|
| Passive scheme | Active scheme | |||||
| C-1 | C-2 | C-3 | C-1 | C-2 | C-3 | |
| B01 | 50.31 | 70.31 | 74.06 | 51.25 | 66.56 | 75.63 |
| B02 | 51.35 | 50.31 | 50.31 | 52.81 | 51.15 | 51.15 |
| B03 | 48.13 | 46.88 | 51.88 | 48.13 | 50.31 | 51.88 |
| B04 | 50.00 | 90.00 | 92.50 | 49.06 | 89.06 | 92.50 |
| B05 | 54.38 | 80.31 | 78.13 | 55.94 | 74.38 | 72.50 |
| B06 | 50.63 | 67.50 | 78.13 | 50.94 | 68.75 | 78.75 |
| B07 | 55.63 | 68.75 | 68.13 | 54.06 | 70.63 | 68.75 |
| B08 | 53.75 | 59.69 | 73.75 | 53.75 | 62.50 | 73.75 |
| B09 | 51.88 | 66.88 | 71.25 | 51.88 | 69.06 | 71.56 |
| Mean | 51.78 | 66.74 | 70.90 | 51.98 | 66.93 | 70.72 |
| Std | 2.39 | 13.49 | 13.15 | 2.47 | 11.79 | 12.84 |
Classification accuracy in (%) for dataset-2B. C-1: a combination of PWKNN-PWKNN classifiers; C-2: a combination of inductive-inductive classifiers (i.e. LDA-LDA); and C-3: a combination of inductive-transductive classifiers (i.e. PWKNN-LDA).
| Subjects | Baseline methods | Proposed methods (CSE-UAEL) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Passive scheme | Active scheme | ||||||||||
| BAG | AB | TB | RUSB | RSM | C-1 | C-2 | C-3 | C-1 | C-2 | C-3 | |
| B01 | 69.69 | 67.50 | 66.25 | 53.13 | 51.56 | 50.31 | 65.31 | 77.81 | 51.25 | 64.69 | 78.13 |
| B02 | 52.60 | 52.50 | 55.00 | 50.83 | 49.79 | 51.35 | 50.31 | 54.27 | 52.81 | 51.15 | 54.69 |
| B03 | 50.63 | 50.00 | 51.56 | 50.00 | 50.00 | 48.13 | 47.50 | 52.50 | 48.13 | 49.38 | 53.13 |
| B04 | 76.25 | 74.38 | 81.56 | 87.81 | 52.19 | 50.00 | 89.69 | 94.38 | 49.06 | 90.63 | 94.38 |
| B05 | 67.50 | 68.75 | 72.81 | 71.56 | 53.13 | 54.38 | 73.44 | 85.63 | 55.94 | 71.56 | 85.31 |
| B06 | 56.88 | 56.56 | 59.69 | 71.56 | 51.88 | 50.63 | 69.38 | 80.00 | 50.94 | 68.75 | 80.31 |
| B07 | 58.13 | 54.38 | 50.00 | 53.75 | 50.00 | 55.63 | 70.00 | 71.56 | 54.06 | 70.94 | 72.81 |
| B08 | 56.88 | 58.75 | 59.06 | 50.94 | 53.13 | 53.75 | 60.00 | 77.81 | 53.75 | 64.69 | 78.75 |
| B09 | 55.31 | 60.94 | 62.81 | 57.19 | 49.69 | 51.88 | 70.31 | 74.38 | 51.88 | 69.06 | 74.38 |
| Mean | 60.43 | 60.42 | 62.08 | 60.75 | 51.26 | 51.78 | 66.22 | 74.26 | 51.98 | 66.76 | 74.65 |
| Std | 8.66 | 8.22 | 10.21 | 13.21 | 1.42 | 2.39 | 12.68 | 13.57 | 2.47 | 12.11 | 13.36 |
Comparison of CSE-UAEL Algorithm using p-values on dataset-2A. The p-value denotes the Wilcoxon signed-rank test: *p < 0.01, ⋆p < 0.05.
| Single classifier | Ensemble | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Passive | Active | Baseline methods | CSE-UAEL (Passive) | ||||||||
| C-3 | C-3 | BAG | AB | TB | RUSB | RSM | C1 | C2 | C3 | ||
| CSE-UAEL | C-1 | 0.0039* | 0.0039* | 0.0156⋆ | 0.0078* | 0.0078* | 0.0156⋆ | 0.0273⋆ | 1 | 0.0039* | 0.0039* |
| (Active) | C-2 | 0.1016 | 0.1484 | 0.0781 | 0.0447⋆ | 0.0447⋆ | 0.0469⋆ | 0.0078* | 0.0039* | 0.0781 | 0.0408⋆ |
| C-3 | 0.0234⋆ | 0.0234⋆ | 0.0195⋆ | 0.0078* | 0.0078* | 0.0039* | 0.0039* | 0.0039* | 0.1562 | 0.1562 | |
Classification accuracy in (%) for dataset-2A. C-1: a combination of PWKNN-PWKNN classifiers; C-2: a combination of inductive-inductive classifiers (i.e. LDA-LDA); and C-3:performance a combination of inductive-transductive classifiers (i.e. PWKNN-LDA).
| Subjects | Baseline methods | Proposed methods (CSE-UAEL) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Passive scheme | Active scheme | ||||||||||
| BAG | AB | TB | RUSB | RSM | C-1 | C-2 | C-3 | C-1 | C-2 | C-3 | |
| A01 | 86.81 | 71.53 | 81.94 | 84.72 | 84.72 | 58.33 | 88.89 | 91.67 | 58.33 | 87.50 | 91.67 |
| A02 | 47.92 | 50.69 | 50.69 | 52.08 | 59.03 | 54.17 | 59.03 | 63.89 | 54.17 | 60.42 | 63.89 |
| A03 | 90.97 | 71.53 | 90.28 | 90.28 | 90.97 | 54.17 | 96.53 | 94.44 | 54.17 | 95.83 | 94.44 |
| A04 | 66.67 | 65.28 | 68.06 | 67.36 | 67.36 | 51.39 | 68.06 | 70.80 | 51.39 | 66.67 | 72.22 |
| A05 | 65.97 | 70.83 | 70.83 | 65.97 | 54.86 | 65.28 | 73.61 | 77.78 | 65.97 | 72.22 | 77.08 |
| A06 | 63.89 | 63.19 | 63.19 | 64.58 | 44.44 | 49.31 | 66.67 | 73.61 | 45.83 | 64.58 | 75.69 |
| A07 | 74.31 | 75.00 | 74.31 | 72.92 | 70.83 | 53.47 | 80.56 | 72.92 | 53.47 | 74.31 | 73.61 |
| A08 | 72.92 | 90.97 | 88.19 | 90.28 | 85.42 | 45.83 | 89.58 | 93.75 | 45.83 | 88.89 | 94.44 |
| A09 | 91.67 | 84.72 | 88.89 | 87.50 | 87.50 | 41.67 | 88.89 | 88.89 | 41.67 | 89.58 | 90.28 |
| Mean | 73.46 | 71.53 | 75.15 | 75.08 | 71.68 | 52.62 | 79.09 | 80.86 | 52.31 | 77.78 | 81.48 |
| Std | 14.42 | 11.76 | 13.44 | 13.67 | 16.53 | 6.86 | 12.83 | 11.44 | 7.32 | 12.87 | 11.33 |
Comparison of CSE-UAEL Algorithm using p-values on dataset-2B. The p-value denotes the Wilcoxon signed-rank test: *p < 0.01, ⋆p < 0.05.
| Single classifier | Ensemble | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Passive | Active | Baseline methods | CSE-UAEL (Passive) | ||||||||
| C-3 | C-3 | BAG | AB | TB | RUSB | RSM | C1 | C2 | C3 | ||
| CSE-UAEL | C-1 | 0.0078* | 0.0078* | 0.0078* | 0.0078* | 0.0195⋆ | 0.1641 | 0.4961 | 0.75 | 0.0195⋆ | 0.0039* |
| (Active) | C-2 | 0.0447⋆ | 0.0391⋆ | 0.0742 | 0.0486⋆ | 0.1641 | 0.0781 | 0.0078* | 0.0078* | 0.5234 | 0.0039* |
| C-3 | 0.0039* | 0.0039* | 0.0039* | 0.0039* | 0.0078* | 0.0039* | 0.0039* | 0.0039* | 0.0039* | 0.0425⋆ | |
Classification accuracy in (%) Comparison with the state-of-the-art method in dataset-2A.
| CSP | CCSP | FBCSP | OSSFN-FBCSP | RQNN | CSE-UAEL (Active) (C-3) |
|---|---|---|---|---|---|
| 73.46 | 79.78 | 76.31 | 76.31 | 66.59 | 81.48 |
Symbols and notations.
| Symbols and notations | Description |
|---|---|
| Input vector | |
| Output label | |
| Training dataset including input data | |
| Test dataset including input data | |
| Temporary variable to store data in testing phase | |
| Number of training samples in training data | |
| Number of training samples in testing data | |
| Input dimensionality | |
| Probability distribution of input | |
| Probability of | |
| Mu frequency band [8–12] Hz | |
| Beta frequency band [14–30] Hz | |
| Set of labels for Class 1 and Class 2 | |
| Class 1 and Class 2 | |
| Real number | |
| lambda was a smoothing constant in covariate shift estimation | |
| EWMA statistics | |
| Ensemble of classifiers | |
| Classifier | |
| Counter for the number of classifier in ensemble | |
| A radial basis function (RBF) kernel | |
| Number of samples from starting of the testing phase to the current sample | |
| Γ | Threshold |
| ∪ | Union operation |
| Total number of points | |
| Volume | |
| EEG signal | |
| Number of channels in EEG dataset | |
| Number of samples per trial in EEG dataset | |
| CSP projection matrix | |
| spatially filtered signal | |
| Big-O notation |