| Literature DB >> 31579607 |
Tao Wang1, Changhua Lu1,2, Guohao Shen1, Feng Hong1.
Abstract
Sleep apnea (SA) is the most common respiratory sleep disorder, leading to some serious neurological and cardiovascular diseases if left untreated. The diagnosis of SA is traditionally made using Polysomnography (PSG). However, this method requires many electrodes and wires, as well as an expert to monitor the test. Several researchers have proposed instead using a single channel signal for SA diagnosis. Among these options, the ECG signal is one of the most physiologically relevant signals of SA occurrence, and one that can be easily recorded using a wearable device. However, existing ECG signal-based methods mainly use features (i.e. frequency domain, time domain, and other nonlinear features) acquired from ECG and its derived signals in order to construct the model. This requires researchers to have rich experience in ECG, which is not common. A convolutional neural network (CNN) is a kind of deep neural network that can automatically learn effective feature representation from training data and has been successfully applied in many fields. Meanwhile, most studies have not considered the impact of adjacent segments on SA detection. Therefore, in this study, we propose a modified LeNet-5 convolutional neural network with adjacent segments for SA detection. Our experimental results show that our proposed method is useful for SA detection, and achieves better or comparable results when compared with traditional machine learning methods. ©2019 Wang et al.Entities:
Keywords: Automaticfeature-extraction; Convolutional neural network; ECG; LeNet-5; Sleep apnea
Year: 2019 PMID: 31579607 PMCID: PMC6756143 DOI: 10.7717/peerj.7731
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1PhysioNet Apnea-ECG dataset preprocessing scheme.
Note: In this study, the labeled segment and its surrounding ±2 segments of the ECG signal (five 1-minute segments in total) was extracted as a whole for processing.
Figure 2Architecture of our modified LeNet-5.
It can be seen as a combination of convolutional neural networks (CNN) for feature extraction and full connection (FC, also known as MLP) as classifier.
Details of our modified LeNet-5 convolutional neural network.
| Input | – | (None, 900, 2) | 0 |
| Conv1 | 32 × 5 × 2, stride 2, pad 0 | (None, 448, 32) | 352 |
| Max pooling2 | 3, stride 3, pad 0 | (None, 149, 32) | 0 |
| Conv3 | 64 × 5 × 2, stride 2, pad 0 | (None, 73, 64) | 10304 |
| Max pooling4 | 3, stride 3, pad 0 | (None, 24, 64) | 0 |
| Dropout5 | 0.8 rate | (None, 24, 64) | 0 |
| FC6 | 32, relu | (None, 32) | 49184 |
| Output | 2, softmax | (None, 2) | 66 |
Notes.
The number of parameters generated by the corresponding operation.
Feature set extracted based on previous studies.
| RMSSD | × | Square root of the average of the squared difference between adjacent RR intervals. | |
| SDNN | × | Standard deviation of the difference between adjacent RR intervals. | |
| NN50 | × | Number of adjacent RR intervals exceeds 50 ms. | |
| pNN50 | × | NN50 divides by the number of RR intervals. | |
| Mean RR | × | Mean of RR intervals. | |
| Mean HR | × | Mean of heart rate (HR), which is derived from RR intervals. | |
| Normalized VLF | × | × | Normalized very low frequency (VLF) component of the corresponding signal. |
| Normalized LF | × | × | Normalized Low frequency (LF) component of the corresponding signal. |
| Normalized HF | × | × | Normalized high frequency (HF) component of the corresponding signal. |
| LF/HF | × | × | The ratio of LF to HF of the corresponding signal. |
| LF/(LF + HF) | × | × | The ratio of LF to LF+HF of the corresponding signal. |
| HF/(LF + HF) | × | × | The ratio of HF to LF+HF of the corresponding signal. |
Notes.
RR intervals of single-lead ECG signal.
Amplitudes of single-lead ECG signal.
The overall performance of our modified LeNet-5 and traditional machine learning methods in per-segment SA detection.
| SVM | 81.4 | 76.9 | 84.3 | 0.887 |
| LR | 80.8 | 75.7 | 84.0 | 0.884 |
| KNN | 77.5 | 68.1 | 83.4 | 0.826 |
| MLP | 81.1 | 71.3 | 87.2 | 0.898 |
| LeNet-5 | 87.6 | 83.1 | 90.3 | 0.950 |
The overall performance of our modified LeNet-5 and traditional machine learning methods in per-recording classification.
| SVM | 88.6 | 100.0 | 66.7 | 0.978 | 0.852 |
| LR | 88.6 | 100.0 | 66.7 | 0.982 | 0.841 |
| KNN | 82.9 | 100.0 | 50.0 | 0.986 | 0.845 |
| MLP | 85.7 | 95.7 | 66.7 | 0.949 | 0.814 |
| LeNet-5 | 97.1 | 100.0 | 91.7 | 0.996 | 0.943 |
Notes.
The correlation value between the actual AHI and the experimentally determined AHI.
Figure 3Comparison of ROC curves of our modified LeNet-5 and MLP in per-segment SA detection.
Figure 4Comparison of the per-segment detection accuracy of five classifiers calculated on 10 different test groups.
The per-segment SA detection and per-recording classification performance in the UCDDB database.
| SVM | 70.6 | 32.7 | 83.3 | 92.3 | 100.0 | 50.0 | 0.251 |
| LR | 69.6 | 34.7 | 81.3 | 84.6 | 90.9 | 50.0 | 0.107 |
| KNN | 66.1 | 38.1 | 75.4 | 84.6 | 100.0 | 0.0 | 0.373 |
| MLP | 67.2 | 38.5 | 76.8 | 92.3 | 100.0 | 50.0 | 0.263 |
| LeNet-5 | 71.8 | 26.6 | 86.9 | 92.3 | 90.9 | 100.0 | 0.624 |
Comparison between the per-segment SA detection performance of our modified LeNet-5 and existing works.
| Feature Engineering | LS-SVM | 84.7 | 84.7 | 84.7 | |
| Feature Engineering | HMM-SVM | 86.2 | 82.6 | 88.4 | |
| Feature Engineering | LS-SVM | 83.4 | 79.5 | 88.4 | |
| Auto encoder | Decision fusion | 83.8 | 88.9 | 88.4 | |
| Our study | CNN | LeNet-5 | 87.6 | 83.1 | 90.3 |
Comparison between the per-recording classification performance of our modified LeNet-5 and existing works.
| PNN | 93.8 | 92.4 | 95.9 | – | |
| LS-SVM | 97.1 | 95.8 | 100 | 0.841 | |
| HMM-SVM | 97.1 | 95.8 | 100 | 0.860 | |
| LR | 89.7 | 92.0 | 85.4 | – | |
| Decision fusion | 100 | 100 | 100 | – | |
| Our study | LeNe-5t | 97.1 | 100.0 | 91.7 | 0.943 |