| Literature DB >> 34201861 |
Marco Altini1,2, Hannu Kinnunen1.
Abstract
Consumer-grade sleep trackers represent a promising tool for large scale studies and health management. However, the potential and limitations of these devices remain less well quantified. Addressing this issue, we aim at providing a comprehensive analysis of the impact of accelerometer, autonomic nervous system (ANS)-mediated peripheral signals, and circadian features for sleep stage detection on a large dataset. Four hundred and forty nights from 106 individuals, for a total of 3444 h of combined polysomnography (PSG) and physiological data from a wearable ring, were acquired. Features were extracted to investigate the relative impact of different data streams on 2-stage (sleep and wake) and 4-stage classification accuracy (light NREM sleep, deep NREM sleep, REM sleep, and wake). Machine learning models were evaluated using a 5-fold cross-validation and a standardized framework for sleep stage classification assessment. Accuracy for 2-stage detection (sleep, wake) was 94% for a simple accelerometer-based model and 96% for a full model that included ANS-derived and circadian features. Accuracy for 4-stage detection was 57% for the accelerometer-based model and 79% when including ANS-derived and circadian features. Combining the compact form factor of a finger ring, multidimensional biometric sensory streams, and machine learning, high accuracy wake-sleep detection and sleep staging can be accomplished.Entities:
Keywords: accelerometer; heart rate variability; machine learning; sleep staging; wearables
Mesh:
Year: 2021 PMID: 34201861 PMCID: PMC8271886 DOI: 10.3390/s21134302
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Technical illustration of the second generation Oura ring. The ring has a titanium cover, battery, power handling circuit, double core processor, memory, two LEDs, a photosensor, temperature sensors, 3-D accelerometer, and Bluetooth connectivity to a smartphone app.
Figure 2Accelerometer and temperature data for one participant (Dataset 1: Singapore, 15 years old) and one night. Sleep stages annotated from PSG data are color-coded.
Figure 3Heart rate and HRV (rMSSD) data for one participant (Dataset 1: Singapore, 15 years old) and one night. Sleep stages annotated from PSG data are color-coded.
Figure 4Cosine, decay, and linear functions used to model sensor-independent circadian features.
Bias and limits of agreement for total sleep time (TST), 2-stage classification, and the four models analyzed in this paper.
| Model | Device | Reference | Bias | LOA.Lower | LOA.Upper |
|---|---|---|---|---|---|
| ACC | 429.67 (61.05) | 430.66 (61.12) | 16.38 + −0.04 x ref | bias − 2.46(17.19 + −0.01 x ref) | bias + 2.46(17.19 + −0.01 x ref) |
| ACC+T | 430.1 (61) | 430.66 (61.12) | 14.98 + −0.04 x ref | bias − 2.46(18.51 + −0.02 x ref) | bias + 2.46(18.51 + −0.02 x ref) |
| ACC+T+HRV | 431.2 (60.48) | 430.66 (61.12) | 15.49 + −0.03 x ref | bias − 2.46(18.54 + −0.02 x ref) | bias + 2.46(18.54 + −0.02 x ref) |
| ACC+T+HRV+C | 432 (60.34) | 430.66 (61.12) | 16.27 + −0.03 x ref | bias − 2.46(16.02 + −0.02 x ref) | bias + 2.46(16.02 + −0.02 x ref) |
Figure 5Bland-Altman plots for total sleep time (TST), 2-stage classification, and the four models compared in this paper.
Figure 6Epoch by epoch sensitivity for sleep and wake and the four models compared in this paper. Whiskers are computed as 1.5 times the interquartile range.
Epoch by epoch sensitivity for wake and the four models compared in this paper.
| Model | Sensitivity | Specificity |
|---|---|---|
| ACC | 72.08 (18.44) [70.35, 73.86] | 96.82 (3.04) [96.54, 97.11] |
| ACC+T | 73.71 (17.9) [72.06, 75.42] | 97.05 (2.79) [96.8, 97.31] |
| ACC+T+HRV | 77.18 (16.77) [75.62, 78.76] | 97.61 (2.11) [97.42, 97.81] |
| ACC+T+HRV+C | 80.74 (14.12) [79.44, 82.07] | 98.15 (1.87) [97.98, 98.33] |
Bias and limits of agreement for TST, time in light, deep, and REM sleep for 4-stage classification and the four models analyzed in this paper.
| Model | Measure | Device | Reference | Bias | LOA.Lower | LOA.Upper |
|---|---|---|---|---|---|---|
| ACC | TST (min) | 429.49 (61.05) | 430.66 (61.12) | 16.23 + −0.04 x ref | bias − 2.46(16.96 + −0.01 x ref) | bias + 2.46(16.96 + −0.01 x ref) |
| ACC | Light (min) | 269.78 (71.24) | 247.31 (45.3) | 90.71 + −0.28 x ref | bias − 2.46(24.26 + 0.11 x ref) | bias + 2.46(24.26 + 0.11 x ref) |
| ACC | Deep (min) | 97.45 (59.24) | 93.34 (34.19) | 4.11 (45.55) | −85.16 | 93.38 |
| ACC | REM (min) | 62.26 (35.36) | 90.01 (26.24) | 34.95 + −0.7 x ref | bias − 67.53 | bias + 67.53 |
| ACC+T | TST (min) | 430.1 (61) | 430.66 (61.12) | 14.98 + −0.04 x ref | bias − 2.46(18.51 + −0.02 x ref) | bias + 2.46(18.51 + −0.02 x ref) |
| ACC+T | Light (min) | 276.39 (64.67) | 247.31 (45.3) | 101.27 + −0.29 x ref | bias − 2.46(29.01 + 0.07 x ref) | bias + 2.46(29.01 + 0.07 x ref) |
| ACC+T | Deep (min) | 89.72 (53.45) | 93.34 (34.19) | −3.62 (41.93) | −85.8 | 78.56 |
| ACC+T | REM (min) | 63.98 (32.59) | 90.01 (26.24) | 31.86 + −0.64 x ref | bias − 61.18 | bias + 61.18 |
| ACC+T+HRV | TST (min) | 431.2 (60.48) | 430.66 (61.12) | 15.49 + −0.03 x ref | bias − 2.46(18.54 + −0.02 x ref) | bias + 2.46(18.54 + −0.02 x ref) |
| ACC+T+HRV | Light (min) | 249.12 (48.06) | 247.31 (45.3) | 72.1 + −0.28 x ref | bias − 2.46(20.61 + 0.03 x ref) | bias + 2.46(20.61 + 0.03 x ref) |
| ACC+T+HRV | Deep (min) | 90.69 (40.12) | 93.34 (34.19) | 21.61 + −0.26 x ref | bias − 2.46(28.47 + −0.04 x ref) | bias + 2.46(28.47 + −0.04 x ref) |
| ACC+T+HRV | REM (min) | 91.39 (25.24) | 90.01 (26.24) | 47.76 + −0.52 x ref | bias − 2.46(11.74 + 0.07 x ref) | bias + 2.46(11.74 + 0.07 x ref) |
| ACC+T+HRV+C | TST (min) | 432 (60.34) | 430.66 (61.12) | 16.27 + −0.03 x ref | bias − 2.46(16.02 + −0.02 x ref) | bias + 2.46(16.02 + −0.02 x ref) |
| ACC+T+HRV+C | Light (min) | 249.31 (48.24) | 247.31 (45.3) | 58.1 + −0.23 x ref | bias − 65 | bias + 65 |
| ACC+T+HRV+C | Deep (min) | 90.43 (35.54) | 93.34 (34.19) | 26.85 + −0.32 x ref | bias − 52.63 | bias + 52.63 |
| ACC+T+HRV+C | REM (min) | 92.26 (26.19) | 90.01 (26.24) | 42.83 + −0.45 x ref | bias − 2.46(10.58 + 0.08 x ref) | bias + 2.46(10.58 + 0.08 x ref) |
Figure 7Bias and limits of agreement for TST, 4-stage classification, and the four models analyzed in this paper.
Figure 8Bias and limits of agreement for time in light sleep, 4-stage classification, and the four models analyzed in this paper.
Figure 9Bias and limits of agreement for time in deep sleep, 4-stage classification, and the four models analyzed in this paper.
Figure 10Bias and limits of agreement for time in REM sleep, 4-stage classification, and the four models analyzed in this paper.
Figure 11Epoch by epoch sensitivity for 4-stage classification and the four models compared in this paper. Whiskers are computed as 1.5 times the interquartile range.
Figure 12Epoch by epoch specificity for 4-stage classification and the four models compared in this paper. Whiskers are computed as 1.5 times the interquartile range.
Epoch by epoch sensitivity and specificity for wake, light, deep, and REM sleep and the four models compared in this paper.
| Model | Stage | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| ACC | Wake | 94.53 (3.79) [94.18, 94.88] | 72.07 (17.61) [70.44, 73.74] | 96.85 (2.94) [96.58, 97.14] |
| ACC | Light | 61.27 (5.72) [60.74, 61.8] | 67.74 (14.22) [66.45, 69.03] | 53.11 (16.2) [51.63, 54.62] |
| ACC | Deep | 80.44 (6.01) [79.87, 81] | 48.14 (28.42) [45.51, 50.77] | 87.15 (7.88) [86.42, 87.9] |
| ACC | REM | 78.98 (5.2) [78.5, 79.47] | 28.99 (19.37) [27.19, 30.81] | 90.72 (5.89) [90.17, 91.27] |
| ACC+T | Wake | 94.79 (3.93) [94.43, 95.17] | 73.71 (17.9) [72.07, 75.41] | 97.05 (2.79) [96.79, 97.32] |
| ACC+T | Light | 63.09 (6.8) [62.46, 63.72] | 70.86 (12) [69.76, 71.98] | 53.77 (16.32) [52.27, 55.31] |
| ACC+T | Deep | 82.69 (5.91) [82.14, 83.25] | 50.06 (28.88) [47.33, 52.71] | 89.67 (6.66) [89.07, 90.29] |
| ACC+T | REM | 79.9 (5.35) [79.4, 80.4] | 32.38 (19.33) [30.55, 34.18] | 91.04 (5.44) [90.55, 91.57] |
| ACC+T+HRV | Wake | 95.58 (3.5) [95.25, 95.92] | 77.18 (16.77) [75.65, 78.76] | 97.61 (2.11) [97.42, 97.81] |
| ACC+T+HRV | Light | 77.48 (6.14) [76.91, 78.05] | 79.13 (7.38) [78.45, 79.82] | 75.73 (11.75) [74.62, 76.84] |
| ACC+T+HRV | Deep | 89.11 (4.25) [88.72, 89.51] | 69.57 (23.84) [67.39, 71.8] | 93.73 (4.16) [93.35, 94.12] |
| ACC+T+HRV | REM | 90.16 (4.18) [89.78, 90.57] | 75.89 (18.09) [74.23, 77.56] | 93.75 (3.4) [93.44, 94.06] |
| ACC+T+HRV+C | Wake | 96.38 (3.12) [96.1, 96.68] | 80.74 (14.12) [79.43, 82.06] | 98.15 (1.87) [97.98, 98.33] |
| ACC+T+HRV+C | Light | 80.2 (5.53) [79.68, 80.73] | 81.7 (6.97) [81.06, 82.35] | 78.67 (10.63) [77.7, 79.67] |
| ACC+T+HRV+C | Deep | 90.64 (3.75) [90.29, 90.99] | 74.44 (20.28) [72.56, 76.34] | 94.63 (3.77) [94.28, 94.99] |
| ACC+T+HRV+C | REM | 90.87 (4.12) [90.49, 91.26] | 78.08 (17.39) [76.5, 79.71] | 94.12 (3.41) [93.8, 94.44] |
Figure 13Example hypnogram for an average night (f1 = 0.78) for the model, including all features (ACC+T+HRV+C).