| Literature DB >> 31578345 |
Mustafa Radha1,2, Pedro Fonseca3,4, Arnaud Moreau5, Marco Ross5, Andreas Cerny5, Peter Anderer5, Xi Long3,4, Ronald M Aarts3,4.
Abstract
Automated sleep stage classification using heart rate variability (HRV) may provide an ergonomic and low-cost alternative to gold standard polysomnography, creating possibilities for unobtrusive home-based sleep monitoring. Current methods however are limited in their ability to take into account long-term sleep architectural patterns. A long short-term memory (LSTM) network is proposed as a solution to model long-term cardiac sleep architecture information and validated on a comprehensive data set (292 participants, 584 nights, 541.214 annotated 30 s sleep segments) comprising a wide range of ages and pathological profiles, annotated according to the Rechtschaffen and Kales (R&K) annotation standard. It is shown that the model outperforms state-of-the-art approaches which were often limited to non-temporal or short-term recurrent classifiers. The model achieves a Cohen's k of 0.61 ± 0.15 and accuracy of 77.00 ± 8.90% across the entire database. Further analysis revealed that the performance for individuals aged 50 years and older may decline. These results demonstrate the merit of deep temporal modelling using a diverse data set and advance the state-of-the-art for HRV-based sleep stage classification. Further research is warranted into individuals over the age of 50 as performance tends to worsen in this sub-population.Entities:
Mesh:
Year: 2019 PMID: 31578345 PMCID: PMC6775145 DOI: 10.1038/s41598-019-49703-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
A list of best-performing methods for wake-REM-N1/N2-N3 classification (30-s basis) using autonomic activity.
| Author, year | Participants | Sensors/signals | Algorithm | Cohen’s | Accuracy |
|---|---|---|---|---|---|
| Hwang[ | 12 healthy, 13 apnea | Bed sensors | Decision rules | 0.48 | 70.9% |
| Tataraidze[ | 685 healthy | RIP | XGB | 0.56 | — |
| Beattie[ | 60 healthy | ACT, PPG | Linear discriminant | 0.52 | 69.0% |
| Fonseca[ | 100 healthy | ECG, RIP | CRF | 0.53 | 70.8% |
| Aggarwal[ | 400 apnea | Nasal flow | Neural CRF | 0.57 | 74.1% |
| Li[ | 5793 | ECG | Deep CNN | 0.47 | 65.9% |
| This study | 195 healthy, 97 patients | ECG | LSTM | 0.61 | 77.0% |
ACT: actigraphy, RIP: respiratory inductance plethysmography, ECG: electrocardiography, RF: radio frequency, XGB: extreme gradient boosting, CRF: conditional random field. CNN: convolutional neural network.
Demographics and sleep statistics of participants in the Siesta data set. Sleep statistics are computed based on the sleep stage annotation of the data set.
| Parameter | Mean (SD) | Range |
|---|---|---|
| Age (year) | 51.5 (17.3) | 20.0–95.0 |
| BMI (kg/m2) | 25.6 (4.5) | 16.5–43.3 |
| TIB (hour) | 8.0 (0.5) | 5.8–9.6 |
| SE (%) | 80.8 (12.8) | 14.6–99.1 |
| N1 (%) | 13.1 (8.4) | 2.4–77.1 |
| N2 (%) | 53.8 (8.8) | 13.6–78.8 |
| N3 (%) | 13.8 (8.4) | 0.0–44.5 |
| REM (%) | 18.2 (5.9) | 0.0–34.8 |
N1, N2, N3, and REM percentages were calculated over the total sleep time for each recording.
BMI: body mass index, TIB: time in bed, SE: sleep efficiency.
Cardiac features used in the study.
| Count | Feature name |
|---|---|
|
|
|
| 4 | Means and medians of HR and RR (both detrended and absolute)[ |
| 12 | SDNN, RR range, pNN50, RMSSD, and SDSD[ |
| 28 | Percentiles (5%, 10%, 25%, 50%, 75%, 90% and 95%) of detrended and absolute HR/RR[ |
| 6 | RR DFA, its short, long exponents and all scales, and WDFA over 330 s and PDFA over non-overlapping segments of 64 heartbeats[ |
|
|
|
| 4 | RR logarithmic VLF, LF, and HF power and LF-to-HF ratio on 270 s windows[ |
| 4 | Boundary-adapted RR logarithmic VLF, LF, and HF power and LF-to-HF ratio on 270 s windows[ |
| 4 | RR mean respiratory frequency and power, max phase and module in HF pole[ |
|
|
|
| 20 | Multiscale sample entropy 1 of RR intervals at length 1 and 2, scales 1–10 over 510 s[ |
| 1 | Sample entropy of symbolic binary changes in RR intervals[ |
| 2 | Short- and long-range phase coordination of R-R intervals in patterns of up to 8 consecutive heartbeats[ |
| 7 | Phase synchronization for 6:2, 7:2, 8:2 and 9:2 phases, dominant ratio, short- and long-term coordination[ |
| 1 | Higuchi’s fractal dimension of the normalized IBI sequence[ |
|
|
|
| 21 | Mean teager energy, % of transition points and maxima and mean and sd of intervals between them, mean and sd of the amplitude of normalized IBIs at transition points and maxima[ |
| 5 | Arousal probabilities (max, mean, median, min, sd)[ |
| 13 | Visibility graph features[ |
HR heart rate; RR R-R interval; SDNN standard deviation of RR; pNN50 pecentage of successive RR differences >50 ms; RMSSD, root mean square of successive RR differences; SDSD, standard deviation of successive RR differences; MAD, mean absolute difference; VLF, very low frequency; LF, low frequency; HF, high frequency; DFA, detrended fluctuation analysis; PDFA, progressive DFA; WDFA, windowed DFA; PSD, power spectral density.
1The estimation accuracy of sample entropy is lower in series shorter than 10 (where m is the pattern length, in samples)[66,67]. In practice this means that this feature will be accurate for all scales with m = 1 and for scales below 6 with m = 2. The choice of window size was discussed in our earlier work[4].
Figure 1Computational graph of the neural network architecture. The three large blocks denote LSTM cells[34]. Dotted black lines denote recurrent connections that pass computed values to the next epoch in the sequence. The sigmoid-like functions are a linear combination of all inputs.
Performance of the model over all folds, as well as the overall performance for alternative architectures.
| Model | Nr of participants | Cohen’s | Accuracy % ± sd |
|---|---|---|---|
|
| |||
| Fold 1 | 73 | 0.60 ± 0.15 | 76.53 ± 8.47 |
| Fold 2 | 73 | 0.60 ± 0.14 | 76.28 ± 8.92 |
| Fold 3 | 73 | 0.64 ± 0.15 | 78.60 ± 10.15 |
| Fold 4 | 73 | 0.61 ± 0.16 | 76.58 ± 10.15 |
|
|
| ||
|
| |||
| Healthy subgroup | 195 | 0.63 ± 0.16 | 76.53 ± 10.14 |
| Sleep apnea subgroup | 51 | 0.60 ± 0.15 | 78.50 ± 7.90 |
| Insomnia subgroup | 26 | 0.65 ± 0.14 | 78.50 ± 7.07 |
| Parkinson’s disease subgroup | 15 | 0.43 ± 0.17 | 65.38 ± 10.04 |
| PLMD | 5 | 0.62 ± 0.15 | 78.33 ± 4.81 |
|
| |||
| LSTM layer, 64 cells | 292 | 0.59 ± 0.14 | 75.92 ± 8.64 |
| LSTM layers, 32 cells | 292 | 0.59 ± 0.15 | 75.54 ± 9.26 |
| LSTM layers, 128 cells | 292 | 0.61 ± 0.15 | 77.00 ± 9.02 |
| LSTM layers, 64 cells | 292 | 0.61 ± 0.14 | 76.64 ± 8.76 |
Figure 2Distribution of Cohen’s κ over three key demographic factors: age, sex and BMI.
Figure 3Distribution of performance for each sleep stage, reported in precision, recall, Cohen’s κ and accuracy, split across young and old age groups.
Figure 4Distribution of performance in Cohen’s κ per patient group, broken down into young and old age segments. Every point represent a single night (multiple nights could belong to the same person).