| Literature DB >> 36051650 |
Zheng Yubo1, Luo Yingying1, Zou Bing1, Zhang Lin1, Li Lei1.
Abstract
Pandemic-related sleep disorders affect human physical and mental health. The artificial intelligence (AI) based sleep staging with multimodal electrophysiological signals help people diagnose and treat sleep disorders. However, the existing AI-based methods could not capture more discriminative modalities and adaptively correlate these multimodal features. This paper introduces a multimodal attention network (MMASleepNet) to efficiently extract, perceive and fuse multimodal features of electrophysiological signals. The MMASleepNet has a multi-branch feature extraction (MBFE) module followed by an attention-based feature fusing (AFF) module. In the MBFE module, branches are designed to extract multimodal signals' temporal and spectral features. Each branch has two-stream convolutional networks with a unique kernel to perceive features of different time scales. The AFF module contains a modal-wise squeeze and excitation (SE) block to adjust the weights of modalities with more discriminative features and a Transformer encoder (TE) to generate attention matrices and extract the inter-dependencies among multimodal features. Our MMASleepNet outperforms state-of-the-art models in terms of different evaluation matrices on the datasets of Sleep-EDF and ISRUC-Sleep. The implementation code is available at: https://github.com/buptantEEG/MMASleepNet/.Entities:
Keywords: attention network; automatic sleep staging; electrophysiological signals; features fusion; multimodal
Year: 2022 PMID: 36051650 PMCID: PMC9424881 DOI: 10.3389/fnins.2022.973761
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Figure 1The waveforms of EEG, EOG, and EMG in each sleep stage. The data is randomly selected from the Sleep-EDF-78 dataset, and each epoch is 30 s.
Figure 2The architecture of the proposed network. It consists of a multi-branch feature extraction module, an attention based feature fusion module and a classification module. ⊕ is the point-wise addition and ⊗ is the point-wise multiplication. Conv is the convolutional layer, Pool is the pooling layer, FC is the fully connection layer, Norm is the normalization layer.
Summary of the datasets and selected channels.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| Sleep-EDF-20 | 20 | 42,308 | 19.58 | 6.63 | 42.07 | 13.48 | 18.24 | R&K | 20 |
| Sleep-EDF-78 | 78 | 195,479 | 33.74 | 11.01 | 35.37 | 6.67 | 13.22 | R&K | 10 |
| ISRUC-Sleep-1 | 100 | 87,187 | 22.95 | 12.85 | 31.51 | 19.45 | 13.23 | AASM | 5 |
| ISRUC-Sleep-3 | 10 | 8,589 | 20.44 | 14.04 | 30.12 | 22.90 | 12.50 | AASM | 10 |
Parameters of the MBFE module. Size is the size of convolutional kernel, N is the numbers of filters and d is the number of kernels at the last convolutional layer.
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |||
| EEG | Small | Size | Size | Size | Size | Size | Size | 128 |
| Large | Size | Size | Size | Size | Size | Size | 128 | |
| EOG | Small | Size | Size | Size | Size | Size | Size | 64 |
| Large | Size | Size | Size | Size | Size | Size | 64 | |
| EMG | Small | Size | Size | Size | Size | Size | Size | 64 |
| Large | Size | Size | Size | Size | Size | Size | 64 | |
Comparison among MMASleepNet and baseline models.
|
|
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| W | N1 | N2 | N3 | REM |
| κ |
| |||
| Sleep-EDF-20 | AttnSleepNet | 79.02 | 32.70 | 87.03 | 85.67 | 72.36 | 79.10 | 71.35 | 71.43 | 66.34 |
| SleepPrintNet | 88.77 | 47.99 | 86.72 | 86.21 | 80.26 | 83.08 | 77.99 | 76.67 | 76.34 | |
| SalientSleepNet | 90.79 | 49.86 | 89.03 | 84.77 |
| 86.28 | 80.58 | 81.02 | 77.32 | |
| MMASleepNet |
|
|
|
| 86.41 |
|
|
|
| |
| Sleep-EDF-78 | AttnSleepNet | 92.08 | 36.98 | 84.70 |
| 73.61 | 81.12 | 73.80 | 73.75 | 68.64 |
| SleepPtintNet | 92.65 | 47.39 | 83.59 | 79.97 | 78.75 | 81.64 | 76.47 | 74.70 | 74.27 | |
| SalientSleepNet | 92.28 |
| 84.37 | 71.17 |
| 82.61 | 76.51 | 75.92 | 73.42 | |
| MMASleepNet |
| 49.05 |
| 81.26 | 79.75 |
|
|
|
| |
| ISRUC-SLEEP-1 | AttnSleepNet | 84.19 | 43.80 | 71.52 | 81.93 | 61.12 | 71.65 | 68.53 | 63.70 | 67.43 |
| SleepPtintNet | 79.12 | 40.12 | 58.22 | 68.80 | 73.67 | 65.40 | 63.99 | 56.02 | 62.47 | |
| SalientSleepNet | 85.24 | 51.34 | 76.41 | 83.50 | 79.25 | 76.95 | 75.15 | 70.31 | 74.25 | |
| MMASleepNet |
|
|
|
|
|
|
|
|
| |
| ISRUC-SLEEP-3 | AttnSleepNet | 67.58 | 26.91 | 66.31 | 84.08 | 54.33 | 64.24 | 59.85 | 54.88 | 55.83 |
| SleepPrintNet | 85.15 | 52.53 | 74.95 | 87.28 | 74.84 | 76.88 | 74.95 | 70.29 | 73.69 | |
| SalientSleepNet | 78.37 | 50.64 | 77.33 |
| 75.47 | 76.11 | 73.96 | 69.39 | 73.20 | |
| MMASleepNet |
|
|
| 87.00 |
|
|
|
|
| |
The best values on each dataset are highlighted in bold.
Figure 3The confusion matrices of MMASleepNet, (A) is the confusion matrix valuated on SleepEDF-20 dataset, (B) is the confusion matrix valuated on SleepEDF-78 dataset, (C) is the confusion matrix valuated on ISRUC-Sleep-1 dataset, (D) is the confusion matrix valuated on ISRUC-Sleep-3 dataset.
Figure 4The results of ablation experiments, panel (A) is for the module ablation, panel (B) is for the modalities ablation.
Figure 5The features before and after attention mechanism of MMASleepNet. The data was selected randomly from the Sleep-EDF-20 dataset. Panel (A) is before the attention module, panel (B) is for the modalities ablation.