| Literature DB >> 34975434 |
Chao He1, Jialu Liu1, Yuesheng Zhu2, Wencai Du3.
Abstract
Classification of electroencephalogram (EEG) is a key approach to measure the rhythmic oscillations of neural activity, which is one of the core technologies of brain-computer interface systems (BCIs). However, extraction of the features from non-linear and non-stationary EEG signals is still a challenging task in current algorithms. With the development of artificial intelligence, various advanced algorithms have been proposed for signal classification in recent years. Among them, deep neural networks (DNNs) have become the most attractive type of method due to their end-to-end structure and powerful ability of automatic feature extraction. However, it is difficult to collect large-scale datasets in practical applications of BCIs, which may lead to overfitting or weak generalizability of the classifier. To address these issues, a promising technique has been proposed to improve the performance of the decoding model based on data augmentation (DA). In this article, we investigate recent studies and development of various DA strategies for EEG classification based on DNNs. The review consists of three parts: what kind of paradigms of EEG-based on BCIs are used, what types of DA methods are adopted to improve the DNN models, and what kind of accuracy can be obtained. Our survey summarizes the current practices and performance outcomes that aim to promote or guide the deployment of DA to EEG classification in future research and development.Entities:
Keywords: EEG; brain-computer interface; classification; data augmentation; deep neural networks
Year: 2021 PMID: 34975434 PMCID: PMC8718399 DOI: 10.3389/fnhum.2021.765525
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
FIGURE 1The framework of EEG classification using DA strategy. Different color represents different classs.
Collection criteria for inclusion or exclusion.
| Inclusion criteria | Exclusion criteria |
| • Published within the last 5 years | • Research for invasive EEG, electrocorticography (ECoG), magnetoencephalography (MEG), source imaging, fMRI, and so on, or joint studies with EEG |
| • A focus on non-invasive EEG signals | • No specific description of DA |
| • A specific explanation of how to apply DA to EEG signal | |
| • At least one DNN is included for the classifier |
FIGURE 2The search method for identifying relevant studies.
FIGURE 3The structure of the autoencoder. Blue represents input layers and green represents output layers.
FIGURE 4The structure of variational autoencoder. Blue represents input layers and green represents output layers.
FIGURE 5The structure of generative adversarial networks.
FIGURE 6A taxonomy of data augmentation in EEG decoding.
Summary of data augmentation for EEG decoding based on DNNs.
| EEG paradigms | Channel of EEG | Subjects | DA methods | Input form of DA | Classifier | Improvement of accuracy after DA | Datasets | References |
| Driving detection | 30 | 27 | EMD | Time-series data | Graph CNN | 72–95% | Private dataset (27 subjects) |
|
| SD | 23 | 23 | WGAN | Time-series data | CNN | 72.11–95.89% | CHB-MIT ( |
|
| ER | 14 | 18 | GAN | Time-series data | DNN | NA to 98.4% | Private dataset (18 subjects) |
|
| ER | 15/32 | NA | NI | Differential entropy | SVM/ResNet | 40.8–45.4% | SEED ( |
|
| ER | 15/32 | 15 | cWGAN/sVAE | Power spectral density/Differential entropy | SVM/DNN | 44.9–90.8% | SEED/DEAP ( |
|
| ER | 32 | 15 | NI | Time-series data | 3D-CNN | 79.11–88.49% 79.12–87.44% | DEAP ( |
|
| ER | NA | NA | CycleGAN | Time-series data | CNN | Average improvement of 3.7%∼8% | FER2013/SFEW/JAFFE datasets |
|
| ERP | 31 | 37 | Paired trial | Time-series data | DNN | Average improvement of 20∼30% | ERP datasets from |
|
| P300 | 32 | 44 | Borderline-SMOTE | Time-series data | SVM/CNN | Average improvement of 5∼15% | Private dataset (44 subjects) |
|
| MI | 14 | 1 | Conditional DCGAN | Time-frequency representation | CNN | 78–83% | BCIC II-dataset III ( |
|
| MI | 22/44 | 9/NA | NI | Spectral image | CP-MixedNet | Average improvement of 1.1∼4.5% | BCIC IV-dataset 2a ( |
|
| MI | 14 | 1/5 | EMD | Time-series data | CNN/WNN | 77.9–82.9% 88.0–90.1% | BCIC II-dataset III/Five subject’s experiments |
|
| MI | 14/3 | 1/9 | GT | Time-frequency representation | CNN | NA | BCIC II-dataset III/BCIC IV-dataset 2b ( |
|
| MI | 3 | 9 | Feature transform | Time-series data | CNN | Average improvement of 5∼10% | BCIC IV-dataset 2b |
|
| MI | 3/3 | 4/9 | DCGAN | Spectral image | CNN | 74.5–83.2% 80.6–93.2% | BCIC IV-dataset 1 ( |
|
| MI | 22 | 9 | NI/GT | Time-series data | LSTM | NA | BCIC IV-dataset 2a |
|
| MI | 22/3 | 9/9 | SW | Time-series data | RM classifier | NA to 80.4%/82.39% | BCIC IV-dataset 2a/BCIC IV-dataset 2b |
|
| MI | 62 | 14 | Conditional DCGAN | Spectral image | CNN | Average improvement of 3.22∼5.45% | Private dataset (14 subjects) |
|
| MI | 22/3 | 9/9 | Feature transform | Time-series data | HS-CNN | 85.6–87.6% | BCIC IV-dataset 2a/BCIC IV-dataset 2b |
|
| MI | 22/60 | 9/3 | Extended CSSP | Feature matrix | FLDA | Average improvement of 0.3∼31.2% | BCIC IV-dataset 2a/BCIC III -dataset IIIa ( |
|
| RSVP | 256 | 10 | WGAN | Time-series data | EEGNet | NA | BCIT X2 dataset ( |
|
| Sleep stage classification | 2/2 | 20/25 | Oversampling model | Time-series data | BLSTM | Average improvement of 0.1∼2.0% | SA dataset/Sleep-EDF database ( |
|
| SSVEP | 32 | 8 | Randomly average | Time-series data | RNN | Average improvement of 3∼13% | Private dataset (8 subjects) |
|
| MW detection | 4 | 8 | NI | Time-series data | DBN | NA | Private dataset (8 subjects) |
|
| MW detection | 64 | 15 | NI | Spectral image | Multi-frame classifier | NA | Private dataset (8 subjects) | |
| MW detection | 11 | 7 | NI | Spectral image | SAE | 34.2–75% | Private dataset (7 subjects) |
|
| MW detection | 64 | 22 | NI | Spectral image | RNN + CNN | NA to 93% | Private dataset (22 subjects) |
|
| MI | 3 | 5 | NI | Time-series data | CNN | NA | BCIC IV-dataset 2b |
|
| MW detection | 1 | 30 | GAN | Spectral image | Boast | 90–95% | NA |
|
| MI | 1 | 1 | GAN | Spectral image | Distance measurement | NA | NA |
|
| ER | 32 | 32 | GAN | Spectral image | CWGAN | Average improvement of 3∼20% | SEED |
|
| MI | 3 | 1 | GAN | Spectral image | CNN | 77–79% | BCIC IV-dataset 2b |
|
| ER | 14 | 18 | GAN | Time-series data | GANA | 97–98% | CHB-MIT |
|
| MI | 3 | 9 | GAN | Time-series data | CNN + LSTM | NA to 76% | BCIC IV-dataset 2b |
|
| RSVP | 256 | 10 | GAN | Time-series data | CNN | Average improvement of 0.7∼2% | BCIT X2 RSVEP Dataset ( |
|
| SD | 100 | 18 | SW | Spatio-temporal signal | CNN | NA to 97% | Clinic dataset (100 subjects) |
|
| SD | 100 | 10 | SW | Time-series data | CNN | NA | University of Bonn Dataset ( |
|
| SD | 16 | 2 | SW | Spectral image | CNN | NA | Clinic dataset |
|
| SSC | 4 | NA | SW | Time-series data | CNN | 82.9–85.7% | Sleep-EDF database |
|
| SD | 18 | 29 | SW | Time-series data | CNN | NA to 93% | Clinic dataset |
|
| MI | 3 | 9 | SW | Spectral image | CNN | NA to 84% | BCIC IV-dataset 2b |
|
| SD | 18 | 24 | SW | Spatio-temporal signal | LSTM | 70–78% | CHB-MIT |
|
| SD | NA | 2 | SW | Time-series data | CNN | NA | Clinic dataset |
|
| RSVP | 64 | 12 | Oversampling model | Time-series data | CNN | 83.99–86.96% | Private dataset (12 subjects) |
|
| Movement of eye | 2 | NA | Oversampling model | Feature matrix | MLP | NA to 82% | MAHNOB HCI-Tagging database ( |
|
| SSC | 20 | 62 | Oversampling model | Time-series data | CNN | Average improvement of 1.7∼20% | Sleep-EDF |
|
| SSC | 20 | 62 | Oversampling model | Time-frequency representation | Mixer networks | 89.3–90.1% | NA |
|
| SSC | 14 | 121 | Oversampling model | Spectral image | CNN | NA | Private dataset (121 subjects) |
|
| SD | 23 | 23 | Oversampling model | Spatio-temporal signal | CNN + LSTM | NA | CHB-MIT |
|
| SSC | NA | NA | Oversampling model | Spectral image | CNN | NA to 99% | Private dataset |
|
| SSC | 16 | NA | Feature transform | Time-series data | CNN | 72–75% | Private dataset |
|
| ER | 32 | 32 | GT | Wavelet feature | SAE | NA to 68.75% | DEAP |
|
| ER | 32 | 22 | GT | Feature matrix | NN | 40.8–45.4% | DEAP |
|
| SSC | 19 | 155 | Repeat sampling | Entropy feature | CNN | NA to 81.4% | Clinic dataset |
|
| ER | 32 | 32 | GT | Wavelet feature | CNN | Average improvement of 2.2∼5% | DEAP |
|
| MI | 3/22 | 9/9 | NI | Time-series data | Inception NN | Average improvement of 3% | BCIC IV-dataset 2b BCIC IV-dataset 2a |
|
Abbreviations: BCIC, BCI Competition; BLSTM, Bidirectional Long Short-Term Memory network; CSSP, Common Spectral Spatial Patterns; DBN, Deep Belief Networks; EMD, Empirical Model Decomposition; ER, Emotion recognition; FLDA, Fisher Linear Discriminant Analysis; HS-CNN, A CNN with Hybrid Convolution Scale; MW, Mental Workload; MLP, Multi-Layer Perception; LeNet, Deep neural network proposed by