| Literature DB >> 35323425 |
Kangkyu Kwon1,2, Shinjae Kwon2,3, Woon-Hong Yeo2,3,4,5.
Abstract
Sleep stage classification is an essential process of diagnosing sleep disorders and related diseases. Automatic sleep stage classification using machine learning has been widely studied due to its higher efficiency compared with manual scoring. Typically, a few polysomnography data are selected as input signals, and human experts label the corresponding sleep stages manually. However, the manual process includes human error and inconsistency in the scoring and stage classification. Here, we present a convolutional neural network (CNN)-based classification method that offers highly accurate, automatic sleep stage detection, validated by a public dataset and new data measured by wearable nanomembrane dry electrodes. First, our study makes a training and validation model using a public dataset with two brain signal and two eye signal channels. Then, we validate this model with a new dataset measured by a set of nanomembrane electrodes. The result of the automatic sleep stage classification shows that our CNN model with multi-taper spectrogram pre-processing achieved 88.85% training accuracy on the validation dataset and 81.52% prediction accuracy on our laboratory dataset. These results validate the reliability of our classification method on the standard polysomnography dataset and the transferability of our CNN model for other datasets measured with the wearable electrodes.Entities:
Keywords: automatic sleep stage classification; convolutional neural network; multi-taper spectrogram; nanomembrane electrode
Mesh:
Year: 2022 PMID: 35323425 PMCID: PMC8946692 DOI: 10.3390/bios12030155
Source DB: PubMed Journal: Biosensors (Basel) ISSN: 2079-6374
Figure 1Overview of a public dataset (ISRUC) and the measured lab dataset. (A) Detailed information of both datasets. (B) Data recording system using nanomembrane bioelectrodes (left) and the sensor mounting locations (right) were the upper center of the forehead (EEG1 and EEG2) to measure two-channel EEG, one electrode on the lower-left corner of the left eye (EOG1) to measure two-channel EOG, and another electrode on the upper-right corner of the right eye (EOG2). (C) Measured EOG signals of five different sleep stages: W, N1, N2, N3, and R. (D) Measured EEG signals of five different sleep stages: W, N1, N2, N3, and R. (E) Examples of multi-taper spectrograms of both the ISRUC public and the lab datasets with five sleep stages. From top-to-bottom and left-to-right, the spectrograms show channels F3-A2, F4-A1, EOGL, and EOGR of the public dataset, and channels EEG1, EEG2, EOG1, and EOG2 of the lab dataset.
Figure 2(A) Arrangement of four multi-taper spectrograms for deep learning dataset input. (B) Flow chart capturing data processing overview. (C,D) Proposed machine learning architectures for multi-taper spectrograms (C) and raw signals (D); in this figure, Conv: convolution, F.C.: fully connected layers, and BN: batch normalization.
Number of epochs of each sleep stage for training, validation, and testing.
| Input Type | Number of Epochs (ISRUC Public Dataset) | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training Set | Validation Set | Test Set | |||||||||||||
| Aw | N1 | N2 | N3 | R | Aw | N1 | N2 | N3 | R | Aw | N1 | N2 | N3 | R | |
| Raw signal | 10,968 | 3299 | 13,366 | 8264 | 6197 | 3591 | 1148 | 4570 | 2692 | 2031 | 3617 | 1065 | 4492 | 2739 | 2119 |
| Spectrogram | 11,013 | 3275 | 13,210 | 8189 | 6207 | 3568 | 1073 | 4729 | 2675 | 1987 | 3575 | 1144 | 4469 | 2791 | 2053 |
Figure 3CNN classification results. (A) Loss curves of the training on the ISRUC dataset (red: loss on the training; blue: loss on the testing). (B) Accuracy curves of the training on the ISRUC dataset (red: accuracy on the training; blue: accuracy on the testing). (C) Confusion matrix with the public dataset (accuracy 88.85%, Cohen’s kappa = 0.854). (D) Confusion matrix with lab dataset (accuracy 81.52%, Cohen’s kappa = 0.734). (E,F) Hypnograms of prediction results from the ISRUC (E) and lab datasets (F).
Public dataset based on classification models trained and tested with raw signals and multi-taper spectrograms.
| Input Type | ISRUC Public Dataset | |
|---|---|---|
| Test Accuracy | Cohen’s Kappa | |
| Raw signal | 87.05% | 0.829 |
| Multi-taper spectrogram | 88.85% | 0.854 |
Lab dataset prediction based on classification models trained and tested with raw signal and multi-taper spectrograms, and the combined number of epochs of each sleep stage.
| Input Type | Lab Dataset | Number of Epochs (Lab Dataset) | |||||
|---|---|---|---|---|---|---|---|
| Prediction Accuracy | Cohen’s Kappa | Prediction Set | |||||
| Aw | N1 | N2 | N3 | R | |||
| Raw signal | 72.94% | 0.608 | 230 | 111 | 1091 | 721 | 554 |
| Multi-taper spectrogram | 81.52% | 0.734 | 230 | 111 | 1091 | 721 | 554 |
Comparison of sleep-stage classification performance with prior works.
| Ref. | Year | Data Type | Input Data | Number of Subjects | Public Dataset | Private Dataset | Number of Channels | Classification |
|---|---|---|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | |||||||
| This work | 2022 | ISRUC and Lab dataset | Multi-taper spectrogram and | 100 | 88.85/0.854 | 81.52/0.734 | 2 EEG, 2 EOG | CNN |
| [ | 1993 | Private data | Extracted features | 12 | - | 80.60/- | 2 EEG, 1 EOG, 1 EMG | Multilayer Neural Network |
| [ | 2005 | SIESTA | Extracted features | 590 | 79.6/0.72 | - | 1 EEG, 2 EOG, 1 EMG | LDA, Decision tree |
| [ | 2014 | Sleep-EDF | Extracted features | 1 | 88.9/- | - | 1 EEG | SVM |
| [ | 2016 | Sleep-EDF | Raw data | 20 | 74/0.65 | - | 1 EEG | CNN |
| [ | 2016 | Sleep-EDF | Extracted features | 20 | 78/- | - | 1 EEG | Stacked Sparse Autoencoders |
| [ | 2017 | Montreal archive | Extracted features | 62 | 83.35/- | - | 1 EEG | Mixed Neural Network |
| [ | 2017 | Sleep-EDF & Montreal | Raw data | 32 | 86.2/0.80 | - | 1 EEG | DeepSleepNET (CNN + LSTM) |
| [ | 2018 | Montreal archive | Raw data | 61 | 78/0.80 | - | 6 EEG, 2 EOG, 3 EMG | Multivariate Network |
| [ | 2018 | Private dataset | Extracted features | 76 | - | -/0.8 | 1 EEG, 2 EOG | Random Forest, CNN, LSTM |
| [ | 2018 | SHHS | Raw data | 5728 | 87/0.81 | - | 1 EEG | CNN |
| [ | 2018 | 12 sleep centers | Raw data | 1086 | 87/0.766 | - | 4 EEG, 2 EOG, 1 EMG | CNN |
| [ | 2018 | ISRUC | Extracted features | 100 | 75.29/- | - | 6 EEG | Random Forest |
| [ | 2018 | ISRUC | Raw data | 116 | 92.2/- | - | 6 EEG, 2 EOG, 3 EMG | CNN |
| [ | 2018 | SIESTA/private data | Raw data | 147 | -/0.760 | -/0.703 | 1 EEG, 2 EOG | RNN |
| [ | 2019 | ISRUC | Extracted features | 10 | 79.64/0.74 | - | 6 EEG | HMM |
| [ | 2019 | Sleep-EDF | Raw data | 61 | 91.22/- | - | 1 EEG, 1 EOG | CNN |
| [ | 2019 | Montreal archive | Extracted features | 200 | 83.6/- | - | 1 EEG, 1EOG, 1EMG | CNN |
| [ | 2020 | ISRUC | Extracted features | 10 | 81.65/0.76 | - | 1 EEG | IMBEFs |
| [ | 2020 | Sleep-EDF | Raw data | 100 | 85.52/- | - | 2 EEG | CNN |
| [ | 2020 | ISRUC | Raw data | 294 | 81.8/0.72 | - | 2 EEG, 2 EOG, 1 EMG, 1 ECG | CNN + RNN |