| Literature DB >> 35387250 |
Mahsa Pourhosein Kalashami1, Mir Mohsen Pedram1, Hossein Sadr2.
Abstract
Emotion recognition is a challenging problem in Brain-Computer Interaction (BCI). Electroencephalogram (EEG) gives unique information about brain activities that are created due to emotional stimuli. This is one of the most substantial advantages of brain signals in comparison to facial expression, tone of voice, or speech in emotion recognition tasks. However, the lack of EEG data and high dimensional EEG recordings lead to difficulties in building effective classifiers with high accuracy. In this study, data augmentation and feature extraction techniques are proposed to solve the lack of data problem and high dimensionality of data, respectively. In this study, the proposed method is based on deep generative models and a data augmentation strategy called Conditional Wasserstein GAN (CWGAN), which is applied to the extracted features to regenerate additional EEG features. DEAP dataset is used to evaluate the effectiveness of the proposed method. Finally, a standard support vector machine and a deep neural network with different tunes were implemented to build effective models. Experimental results show that using the additional augmented data enhances the performance of EEG-based emotion recognition models. Furthermore, the mean accuracy of classification after data augmentation is increased 6.5% for valence and 3.0% for arousal, respectively.Entities:
Mesh:
Year: 2022 PMID: 35387250 PMCID: PMC8979741 DOI: 10.1155/2022/7028517
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1A 2-dimensional arousal-valence model.
Dataset description.
| Row index | Data | Value |
|---|---|---|
| 1 | Number of participants | 32 |
| 2 | Stimulus | Music video |
| 3 | Number of videos | 40 |
| 4 | Duration of each video | 60 second |
| 5 | EEG recorded data | 32 channels |
| 6 | Physiological data | 8 channels |
| 7 | EEG data point for each participant from 32 channel | 32 × 8064 |
| 8 | Labeling method | Done by participants after watching the music video |
| 9 | Labeling technique | SAM |
| 10 | Label's scale | 0 to 9 |
| 11 | Labels | Arousal, valence, liking, dominance |
Figure 2The flowchart of the proposed system.
Figure 3Tree representation of DEAP dataset.
Figure 4Rearranged DEAP dataset.
Extracted features.
| Index | Features type | Channel | Frequency (Hz) | Features | No. of features |
|---|---|---|---|---|---|
| 1 | EEG power features | Fp1, AF3, F3, F7, FC5, FC1, C3, T7, CP5, CP1, P3, P7, PO3, O1, Oz, Pz, Fp2, AF4, Fz, F4, F8, FC6, FC2, Cz, C4, T8, CP6, CP2, P4, P8, PO4, O2 | Theta (4–8) slow-alpha (8–10) alpha (8–12) beta (12–30) gamma (30–45) | Average PSD | 160 |
|
| |||||
| 2 | EEG power differences | (Fp2- Fp1), (AF4- AF3), (F4-F3), (F8-F7), (FC6-FC5), (FC2-FC1), (C4- C3), (T8-T7), (CP6- CP5), (CP2-CP1), (P4- P3), (P8-P7), (PO4- PO3), (O2- O1) | Theta alpha beta gamma | Difference of average PSD in theta, alpha, beta, and gamma bands for 14 EEG channel pairs between right and left scalp | 56 |
|
| |||||
| 3 | EEG time- domain features | Fp1, AF3, F3, F7, FC5, FC1, C3, T7, CP5, CP1, P3, P7, PO3, O1, Oz, Pz, Fp2, AF4, Fz, F4, F8, FC6, FC2, Cz, C4, T8, CP6, CP2, P4, P8, PO4, O2 | — | Mean variance zero-crossing rate | 128 |
EEG Frequency band.
| Frequency | Frequency range | Description | Occurrence |
|---|---|---|---|
| Delta | Delta <4/5 | Delta frequency band waves have the highest amplitude and the lowest frequency. | It is a wave shape that appears in a relaxed state like deep and unconscious sleep. It describes a person in a state of anesthesia and unconsciousness. Similar EEG frequencies appear in epileptic seizures, loss of consciousness, and some coma states. |
|
| |||
| Theta | 3/5 < theta <7/5 | Theta frequency band waves are a fast irregular activity. | Theta waves are associated with natural consciousness or thinking and anxiety and concentration. Beta is usually seen with a symmetrical distribution on both sides of the brain but is more pronounced in the frontal lobe. It may not be present or reduced in areas where the cortex is damaged. |
|
| |||
| Alpha | 7/5 < alpha <13 | Alpha frequency band waves are generated by the simultaneous electrical activity of large groups of neurons. | They are usually found with the eyes closed but still awake in signals recorded from the scalp more than the occipital lobe during periods of relaxation. Open eyes also reduce drowsiness and sleepiness. It mostly indicates a state of consciousness |
|
| |||
| Beta | 12 < beta <25 | Beta frequency band waves are a fast irregular activity, where the cortex is damaged. | Beta waves are associated with natural consciousness or thinking and anxiety and concentration. Beta usually occurs on both sides of the brain with a symmetrical distribution but is mainly seen in the frontal lobe. It may not be present or reduced in areas |
|
| |||
| Gamma | 26 < gamma <70 | Gamma waves are thought to be a sign of the active exchange of information between the cerebral cortex and other areas. | Gamma waves are usually generated in the brain when people are conscious and when the eyes move rapidly. Gamma and beta waves may overlap within the range of natural frequencies, and the exact boundary between these two frequency bands is not clear and yet is a judgment for experts. |
Figure 5EEG recorded data in 6 (s) from a single channel.
Figure 6EEG recorded data in the Theta frequency band in 6 (s) from a single channel.
Figure 7EEG recorded data in the Slow-Alpha frequency band in 6 (s) from a single channel.
Figure 8EEG recorded data in the Alpha frequency band in 6 (s) from a single channel.
Figure 9EEG recorded data in the Beta frequency band in 6 (s) from a single channel.
Figure 10EEG recorded data in Gamma frequency band in 6 (s) from a single channel.
Figure 11Average PSD for 32 EEG channels in theta frequency band.
Figure 12Average PSD for 32 EEG channels in the slow-alpha frequency band.
Figure 13Average PSD for 32 EEG channels in the alpha frequency band.
Figure 14Average PSD for 32 EEG channels in the beta frequency band.
Figure 15Average PSD for 32 EEG channels in gamma frequency band.
Figure 16Difference of average PSD in theta frequency band.
Figure 17Difference of average PSD in alpha frequency band.
Figure 18Difference of average PSD in gamma frequency band.
Figure 19Difference of average PSD in beta frequency band.
Figure 20Mean of EEG recorded data for each channel in 60 (s).
Figure 21The variance of EEG recorded data for each channel in 60 (s).
Figure 22Zero crossing Rate of EEG recorded data for each channel in 60 (s).
Figure 23CWGAN network diagram.
Encoded target.
| Row_ Index | Arousal | Valence |
|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 24GAN network diagram.
Figure 25Generator network.
Figure 26Discriminator network.
Figure 27The Loss function of CWGAN networks during training.
Figure 28Distribution plot of real and fake data.
SVM classification results.
| Augmented data real data + fake data | NO. test data | Arousal mean accuracy STD (%) | Valence mean accuracy STD (%) | ||
|---|---|---|---|---|---|
| 1152 + 0 | 128 | 64.3 | % ± 2.3 | 60.1 | % ± 5.2 |
|
|
|
|
|
|
|
| 1152 + 5000 | 128 | 62.4 | % ± 3.8 | 59.6 | % ± 3.1 |
DNN classification results.
| Augmented data real data + fake data | NO. test data | Arousal mean accuracy STD (%) | Valence mean accuracy STD (%) | ||
|---|---|---|---|---|---|
| 1152 + 0 | 128 | 65.4 | % ± 3.4 | 64.3 | % ± 4.3 |
| 1152 + 1152 | 128 | 65.2 | % ± 4.2 | 62.3 | % ± 4.7 |
|
|
|
|
|
|
|
Comparison of proposed work with similar work on SVM classifier.
| Model | Features | NO. Augmented data | Classification type | Mean accuracy | STD | Arousal | Valence | ||
|---|---|---|---|---|---|---|---|---|---|
| M.Acc | STD | M.Acc | STD | ||||||
| Proposed model | 344 extracted features | 0 | Binary | — | — | 64.3% | % ± 2.3 | 60.1% | % ± 5.2 |
| Proposed model | 344 extracted features | 2 × real data | Binary | — | — |
|
|
|
|
| Proposed model | 344 extracted features | Real data + 5000 | Binary | — | — | 62.4% | % ± 3.8 | 59.6% | % ± 3.1 |
| [ | DE | 0 | Categorical | 45.4% | 8.2% | — | — | — | — |
| [ | DE |
| Categorical |
|
| — | — | — | — |
| [ | PSD | 0 | Categorical | 42.7% | 9.6% | — | — | — | — |
| [ | PSD | 5000 | Categorical | 45.0% | 8.9% | — | — | — | — |
Comparison of proposed work with similar work on DNN classifier.
| Model | Features | NO. augmented data | Classification type | Mean accuracy | STD | Arousal | Valence | ||
|---|---|---|---|---|---|---|---|---|---|
| M.Acc | STD | M.Acc | STD | ||||||
| Proposed model | 344 extracted features | 0 | Binary | — | — | 65.4% | % ± 3.4 | 64.3% | % ± 4.3 |
| Proposed model | 344 extracted features | 2 × real data | Binary | — | — | 65.2% | % ± 4.2 | 62.3% | % ± 4.7 |
| Proposed model | 344 extracted features | Real data + 5000 | Binary | — | — |
|
|
|
|
| [ | DE | 0 | Categorical | 44.9% | 4.0% | — | — | — | — |
| [ | DE |
| Categorical |
|
| — | — | — | — |