| Literature DB >> 36129927 |
Hirokatsu Shimizu1, Ramesh Srinivasan1,2.
Abstract
Decoding brain activity related to specific tasks, such as imagining something, is important for brain computer interface (BCI) control. While decoding of brain signals, such as functional magnetic resonance imaging (fMRI) signals and electroencephalography (EEG) signals, during observing visual images and while imagining images has been previously reported, further development of methods for improving training, performance, and interpretation of brain data was the goal of this study. We applied a Sinc-EEGNet to decode brain activity during perception and imagination of visual stimuli, and added an attention module to extract the importance of each electrode or frequency band. We also reconstructed images from brain activity by using a generative adversarial network (GAN). By combining the EEG recorded during a visual task (perception) and an imagination task, we have successfully boosted the accuracy of classifying EEG data in the imagination task and improved the quality of reconstruction by GAN. Our result indicates that the brain activity evoked during the visual task is present in the imagination task and can be used for better classification of the imagined image. By using the attention module, we can derive the spatial weights in each frequency band and contrast spatial or frequency importance between tasks from our model. Imagination tasks are classified by low frequency EEG signals over temporal cortex, while perception tasks are classified by high frequency EEG signals over occipital and frontal cortex. Combining data sets in training results in a balanced model improving classification of the imagination task without significantly changing performance in the visual task. Our approach not only improves performance and interpretability but also potentially reduces the burden on training since we can improve the accuracy of classifying a relatively hard task with high variability (imagination) by combining with the data of the relatively easy task, observing visual images.Entities:
Mesh:
Year: 2022 PMID: 36129927 PMCID: PMC9491577 DOI: 10.1371/journal.pone.0274847
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Architecture of the model.
| Layer | Parameters |
|---|---|
| Input | Shape: (128, 125, 1) |
| Sinc Convolution | Num of Filters: 16, Kernel Size: (1, 65), Freq Min: 1.0, Band Min: 4.0 |
| Batch Normalization | |
| Depth-wise Convolution | Kernel Size: (128, 1), Multiplier: 2 |
| Batch Normalization | |
| Activation | Elu |
| Average Pooling | Size: (1, 4) |
| Dropout | Rate: 0.5 |
| Separable Convolution | Num of Filters: 32, Kernel Size: (1, 8) |
| Batch Normalization | |
| Activation | Elu |
| Average Pooling | Size: (1, 3) |
| Dropout | Rate: 0.5 |
| Flatten | |
| Dense | Size: 512, Activation: Relu |
| Dropout | Rate: 0.5 |
| Dense | Size: 2048, Activation: Relu |
| Dropout | Rate: 0.5 |
| Dense | Size: 40, Activation: Softmax |
Fig 1Diagram of the attention module.
As illustrated, an element-wise summation of each output from the shared network, a sigmoid function activation, an element-wise multiplication between attention and initial features, and an element-wise summation with initial features are performed after the shared network.
Architecture of the generator.
| Block | Filters | Kernel Size | Strides | Padding |
|---|---|---|---|---|
| 1 | 512 | 4 | 1 | valid |
| 2 | 256 | 4 | 2 | same |
| 3 | 128 | 4 | 2 | same |
| 4 | 64 | 4 | 2 | same |
| 5 | 3 | 4 | 2 | same |
Fig 2Validation curve.
(A) Validation Accuracy curve (upper) and Validation Loss curve (lower) on the validation set of the visual experiments. (B) Validation Accuracy curve (upper) and Validation Loss curve (lower) on the validation set of the imagination experiments. The chance accuracy is 1/40 = 0.025 (shown by the gray line).
Averaged accuracy of classifying the imagination test set.
| Imagine | Mix | |
|---|---|---|
| Subject 1 | 0.211 ± 0.021 | 0.334 ± 0.034 |
| Subject 2 | 0.176 ± 0.020 | 0.260 ± 0.045 |
| Subject 3 | 0.094 ± 0.035 | 0.155 ± 0.041 |
| Subject 4 | 0.132 ± 0.058 | 0.249 ± 0.037 |
| All | 0.134 ± 0.042 | 0.252 ± 0.022 |
Test accuracy was improved for both each subject’s result and combined result when we used the “Mix” model.
Fig 3Sorted frequency band of each model.
Sorting was performed based on the attention map averaged over the test set. “Top 1” means the filter with the highest attention value. (A) Frequency band of each spatial filter. (B) Center frequency of each spatial filter.
Fig 4Weighted sum results.
(A) Weighted sum of absolute values of spatial filters. Each result was normalized so that the maximum value is 1. (B) FFT results of weighted summed sinc filters.
Fig 5Difference between filters.
The map of the difference between the weighted sums of the spatial filters in the “Mix” model and the “Visual” model which were derived from the visual test set (Left), and between the “Mix” model and the “Imagine” model which were derived from the imagination test set (Right).
Numbers of picked filters in each frequency block.
| Low | Mid | High | |
|---|---|---|---|
| Visual | 17, 21, 22, 24, 25, 30, 31, 32 | 3, 4, 5, 8, 20, 29 | 1, 2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 23, 26, 27, 28 |
| Imagine | 10, 11, 17, 18, 20, 22, 23, 24, 30, 31 | 1, 3, 4, 7, 9, 12, 13, 15, 16, 21, 25, 28, 29, 32 | 2, 5, 6, 8, 14, 19, 26, 27 |
| Mix (Visual) | 5, 6, 9, 11, 18, 25, 28, 31 | 2, 8, 10, 12, 13, 14, 19, 23, 24, 26, 27, 29 | 1, 3, 4, 7, 8, 15, 16, 17, 20, 21, 22, 29, 30, 32 |
| Mix (Imagine) | 3, 20, 22, 24, 26, 27, 28, 29 | 1, 7, 9, 10, 14, 16, 17, 18, 19, 25, 31, 32 | 2, 4, 5, 6, 8, 11, 12, 13, 15, 16, 18, 21, 23, 30 |
The numbers correspond to “Top #” in Fig 3. If the frequency band of a specific filter spans multiple blocks, the filter is used for each block.
Fig 6Weighted summed filters in each frequency block.
Each map was normalized so that the maximum value is 1. “Low” block mainly contains alpha band or below, “Mid” block mainly contains beta band, and “High” block mainly contains gamma band.
Fig 7Results generated by “Visual” GAN.
(A) Sample of images generated from the visual train dataset. (B) Sample of images generated from the visual test set. (C) The classification accuracy map of images generated from the visual train dataset. (D) The classification accuracy map of images generated from the visual test set.
Fig 8Results generated by “Mix” GAN.
(A) Sample of images generated from the visual train dataset. (B) Sample of images generated from the visual test set. (C) The classification accuracy map of images generated from the visual train dataset. (D) The classification accuracy map of images generated from the visual test set.
Fig 9Results generated from imagination dataset.
(A) Sample of images generated by the “Visual” GAN. (B) Sample of images generated by the “Mix” GAN. (C) The classification accuracy map of images generated by the “Visual” GAN. (D) The classification accuracy map of images generated by the “Mix” GAN.