| Literature DB >> 34234595 |
Bernice M Wulterkens1,2, Pedro Fonseca1,2, Lieke W A Hermans1, Marco Ross3, Andreas Cerny3, Peter Anderer3, Xi Long1,2, Johannes P van Dijk1,4, Nele Vandenbussche4, Sigrid Pillen1,4, Merel M van Gilst1,4, Sebastiaan Overeem1,4.
Abstract
PURPOSE: There is great interest in unobtrusive long-term sleep measurements using wearable devices based on reflective photoplethysmography (PPG). Unfortunately, consumer devices are not validated in patient populations and therefore not suitable for clinical use. Several sleep staging algorithms have been developed and validated based on ECG-signals. However, translation from these techniques to data derived by wearable PPG is not trivial, and requires the differences between sensing modalities to be integrated in the algorithm, or having the model trained directly with data obtained with the target sensor. Either way, validation of PPG-based sleep staging algorithms requires a large dataset containing both gold standard measurements and PPG-sensor in the applicable clinical population. Here, we take these important steps towards unobtrusive, long-term sleep monitoring.Entities:
Keywords: heart rate variability; hypnogram; pediatrics; polysomnography; sleep staging; wearable
Year: 2021 PMID: 34234595 PMCID: PMC8253894 DOI: 10.2147/NSS.S306808
Source DB: PubMed Journal: Nat Sci Sleep ISSN: 1179-1608
Demographic Information for Participants in the Training and Validation Datasets
| Parameter | Training dataset | Validation dataset | ||||
|---|---|---|---|---|---|---|
| Total | Healthy sleepers | Sleep disordered patients | Total | Adults | Children/adolescents | |
| N (participants) | 543 | 121 | 422 | 292 | 244 | 48 |
| N Female [%] | 225 [41.4] | 67 [55.4] | 158 [37.4] | 106 [36.3] | 86 [35.2] | 20 [41.7] |
| Age [min., max.] (yrs) | 49.4 ± 15.4 [3, 86] | 45.7 ± 13.8 [18, 69] | 50.5 ± 15.7 [3, 86] | 42.3 ± 19.7 [3, 82] | 48.4 ± 15.3 [19, 82] | 11.6 ± 4.4 [3, 17] |
| BMI (kg/m2) | 27.0 ± 5.0 | 25.2 ± 3.7 | 27.5 ± 5.23 | – | 27.2 ± 5.0 | – |
Prevalence of Sleep Disorders in the Validation Dataset
| Adults | Children/Adolescents | |||
|---|---|---|---|---|
| Group | Total Prevalence | Single Primary Disorder | Total Prevalence | Single Primary Disorder |
| Sleep disordered breathing | 114 | 86 | 15 | 15 |
| Insomnia | 71 | 40 | 11 | 8 |
| Movement disorder | 35 | 14 | 2 | 2 |
| Behavioral sleep disorder | 22 | 9 | 4 | 3 |
| Non-REM parasomnia | 15 | 12 | 6 | 4 |
| REM parasomnia | 17 | 6 | 0 | 0 |
| Circadian disorder* | - | - | 3 | 3 |
| Other | 34 | 17 | 2 | 1 |
| None | 0 | - | 8 | - |
Notes: The number of participants with the respective diagnoses are shown as a total; as well as the number of participants in whom the respective diagnosis was the single primary sleep disorder. *Circadian disorder was evaluated as a separate group for children/adolescents. For adults, the circadian disorder group was incorporated in the category “other” as it contained less than 10 participants.
Overall Epoch-per-Epoch Agreement for Both Adults and Children/Adolescents
| Task | κ (-) | Accuracy (%) | Sensitivity (%) | Specificity (%) | PPV (%) |
|---|---|---|---|---|---|
| Wake/N1+N2/N3/REM | 0.62 ± 0.12 | 76.4 ± 7.3 | n/a | n/a | n/a |
| Wake/NREM/REM | 0.68 ± 0.11 | 85.2 ± 5.8 | n/a | n/a | n/a |
| Wake (vs Sleep)a | 0.66 ± 0.14 | 91.5 ± 5.4 | 73.1 ± 16.8 | 94.6 ± 5.9 | 74.3 ± 16.6 |
| N1+N2a | 0.54 ± 0.13 | 77.5 ± 6.7 | 78.0 ± 9.0 | 76.7 ± 11.6 | 78.8 ± 11.3 |
| N3a | 0.60 ± 0.22 | 90.8 ± 4.9 | 69.5 ± 24.3 | 94.8 ± 4.9 | 69.1 ± 24.8 |
| REMa | 0.69 ± 0.18 | 93.0 ± 3.9 | 78.2 ± 19.6 | 95.2 ± 3.7 | 71.8 ± 16.1 |
Note: aBinary classification tasks were assessed by a one versus the rest strategy, where one single class (Wake, N1+N2, N3 or REM) was considered as the “positive” class and the remaining classes were aggregated in a single “negative” class.
Abbreviation: PPV, positive predictive value.
Confusion Matrix for Sleep Stage Classification in All Epochs of All Recordings (N = 298,219)
| Pred → | Wake | N1+N2 | N3 | REM | Prev. (-,(%)) | Sens. (%) | κ (-) |
|---|---|---|---|---|---|---|---|
| Wake | 41,962 (14.1%/75.7%) | 11,900 (4.0%/21.5%) | 125 (0.04%/0.2%) | 1468 (0.5%/2.6%) | 55.455 (18.6) | 75.7 | 0.72 |
| N1+N2 | 10,595 (3.6%/6.8%) | 122,603 (41.1%/78.2%) | 12,965 (4.3%/8.3%) | 10,570 (3.5%/6.7%) | 156.733 (52.6) | 78.2 | 0.55 |
| N3 | 310 (0.1%/0.7%) | 13,480 (4.5%/30.0%) | 30,795 (10.3%/68.8%) | 279 (0.09%/0.6%) | 201.907 (15.0) | 68.6 | 0.64 |
| REM | 837 (0.3%/2.0%) | 7560 (2.5%/18.4%) | 256 (0.09%/0.6%) | 32,514 (10.9%/79.0%) | 41.167 (13.8) | 79.0 | 0.72 |
| PPV (%) | 78.1 | 78.8 | 69.8 | 72.5 |
Notes: Each entry in the confusion matrix indicates the number of epochs. Between parentheses, the percentage relative to the total number of epochs of all classes is listed, followed by the percentage relative to the total number of epochs with the corresponding reference sleep stage for that row.
Abbreviations: Prev., prevalence; Sens., sensitivity; PPV, positive predictive value.
Performance for 4-Class Sleep Staging in Diagnostic Subgroups
| Conditiona | N | κ (-) | Accuracy (%) | ||||
|---|---|---|---|---|---|---|---|
| Mean ± SD | Median | P-valueb | Mean ± SD | Median | P-valuec | ||
| Sleep disordered breathing | 75.57 ± 7.74 | 77.53 | 0.15 | ||||
| Insomnia | 82 | 0.63 ± 0.11 | 0.63 | 0.43 | 77.16 ± 6.69 | 77.15 | 0.64 |
| Movement disorder | 37 | 0.61 ± 0.12 | 0.61 | 0.60 | 76.38 ± 8.24 | 78.38 | 0.89 |
| Behavioral | 26 | 0.62 ± 0.10 | 0.62 | 0.81 | 76.97 ± 5.80 | 77.16 | 0.96 |
| Non-REM parasomnia | |||||||
| REM parasomnia | |||||||
Notes: aSubgroup of patients for whom the primary diagnosis includes that disorder. b,cWilcoxon rank-sum test of differences in κ and accuracy, respectively, between the subgroup of patients with that disorder and without. Bold values are statistically significant. **p < 0.01, *p < 0.05.
Figure 1Representative hypnograms of three patients with a single primary diagnosis of sleep disordered breathing (left panel: patient (A–C)) and three patients with a single primary diagnosis of insomnia (right panel: patient (D–F)). Hypnograms are shown based on the PSG reference (top) and the PPG/accelerometer algorithm (bottom). Hypnograms were taken from the 25, 50 and 75 percentiles of overall kappa with the lowest performance on top. REM sleep is marked with a red line. Some clinical aspects are relevant to mention. Patient (B) was diagnosed with very mild, but treatment responsive obstructive sleep apnea, but also had parasomnia complaints. Patient (D and E) had both sleep misperception and were thus diagnosed with paradoxical insomnia. Patient (E) was also diagnosed with a delayed sleep phase, explaining the occurrence of N3-sleep later in the night. Patient (F) was diagnosed with insomnia, receiving quetiapine at the time of PSG with good results.
Sleep Statistics for Both Adults and Children/Adolescents
| Parameter | PSG | PSG – Sleep Statistic Calculated by the Algorithm | |||
|---|---|---|---|---|---|
| Mean ± SD | Range [min., max.] | Mean Error ± SD | 95% LoA | RMSE | |
| SOL (min) | 18.68 ± 22.46 | [0.00, 221.00] | −5.28 ± 24.83 | [−53.94, 43.38] | 25.34 |
| WASO (min) | 72.33 ± 65.55 | [4.50, 391.00] | 9.80 ± 39.45 | [−67.52, 87.13] | 40.59 |
| TWT (min) | 94.96 ± 76.17 | [5.00, 492.00] | 2.93 ± 33.98 | [−63.67, 69.53] | 34.05 |
| TST (min) | 415.00 ± 85.32 | [102.00, 651.50] | −3.99 ± 34.08 | [−70.78, 62.81] | 34.26 |
| SE (%) | 81.25 ± 14.63 | [19.01, 99.01] | −0.70 ± 6.63 | [−13.70, 12.30] | 6.66 |
| Time in N1+N2 (min) | 267.68 ± 60.99 | [47.00, 434.00] | 1.24 ± 49.95 | [−96.66, 99.15] | 49.88 |
| Time in N1+N2 (%) | 65.02 ± 10.63 | [33.33, 100.00] | 0.59 ± 10.65 | [−20.29, 21.47] | 10.65 |
| Time in N3 (min) | 76.82 ± 40.69 | [0.00, 237.50] | 1.19 ± 40.25 | [−77.70, 80.08] | 40.20 |
| Time in N3 (%) | 18.49 ± 9.36 | [0.00, 66.67] | −0.62 ± 9.56 | [−18.12, 19.35] | 9.56 |
| Time in REM (min) | 70.49 ± 30.77 | [0.00, 164.50] | −6.42 ± 22.92 | [−51.36, 38.30] | 23.77 |
| Time in REM (%) | 16.49 ± 5.96 | [0.00, 34.63] | −1.21 ± 5.80 | [−12.58, 10.16] | 5.92 |
Abbreviations: PSG, polysomnography; SD, standard deviation; min., minimum; max., maximum; LoA, limits of agreement; RMSE, root mean square error; SOL, sleep onset latency; WASO, wake after sleep onset; TWT, total wake time; TST, total sleep time; SE, sleep efficiency; min, minutes.