| Literature DB >> 33623459 |
Nicholas I Y N Chee1, Shohreh Ghorbani1, Hosein Aghayan Golkashani1, Ruth L F Leong1, Ju Lynn Ong1, Michael W L Chee1.
Abstract
BACKGROUND: Wearable devices have tremendous potential for large-scale longitudinal measurement of sleep, but their accuracy needs to be validated. We compared the performance of the multisensor Oura ring (Oura Health Oy, Oulu, Finland) to polysomnography (PSG) and a research actigraph in healthy adolescents.Entities:
Keywords: actigraphy; adolescents; polysomnography; validation; wearable
Year: 2021 PMID: 33623459 PMCID: PMC7894804 DOI: 10.2147/NSS.S286070
Source DB: PubMed Journal: Nat Sci Sleep ISSN: 1179-1608
Polysomnography-Determined Sleep Architecture
| 6.5-Hour TIB (N = 22) | 8-Hour TIB (N = 28) | 9-Hour TIB (N = 52) | |
|---|---|---|---|
| TIB | 390.50 (0.98) | 480.10 (1.51) | 540.11 (0.34) |
| TST | 353.81 (18.70) | 443.04 (17.65) | 489.29 (29.19) |
| Stage N1 sleep | 5.73 (5.41) | 8.03 (6.01) | 10.18 (7.28) |
| Stage N2 sleep | 180.93 (25.38) | 227.94 (25.12) | 261.97 (31.23) |
| Stage N1 + N2 sleep | 186.66 (25.85) | 235.96 (25.03) | 272.15 (32.04) |
| Stage N3 sleep | 99.76 (27.24) | 115.96 (21.05) | 111.21 (27.18) |
| REM sleep | 67.38 (16.67) | 91.12 (19.20) | 105.94 (22.58) |
| WASO | 7.39 (7.52) | 11.83 (11.04) | 14.27 (14.74) |
| Sleep efficiency (%) | 90.60 (4.78) | 92.20 (3.84) | 90.59 (5.40) |
Notes: Data presented as mean (standard deviation) in minutes unless otherwise indicated.
Abbreviations: REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
Biases from PSG for Each Device Across TIB Conditions
| M10 | H5 | Oura | F | |
|---|---|---|---|---|
| TST | −25.83 (13.89)**a,b | −2.18 (13.36)a,c | −32.76 (17.05)**b,c | 69.35 |
| Stage N1+N2 sleep | – | – | −51.14 (28.33)** | |
| Stage N3 sleep | – | – | 31.51 (35.39)** | |
| REM sleep | – | – | −13.13 (20.45)* | |
| WASO | 27.99 (11.76)**a | 14.42 (8.03)**a,c | 30.71 (16.34)**c | 21.35 |
| TST | −33.61 (22.79)**a,b | −7.54 (15.76)a,c | −46.08 (19.70)**b,c | 78.33 |
| Stage N1+N2 sleep | – | – | −69.85 (32.42)** | |
| Stage N3 sleep | – | – | 43.27 (32.28)** | |
| REM sleep | – | – | −19.52 (26.74)** | |
| WASO | 37.76 (20.41)**a | 18.03 (12.05)**a,c | 41.64 (17.06)**c | 43.134 |
| TST | −33.86 (19.25)**a,b | −5.30 (17.78)a,c | −47.26 (24.59)**b,c | 116.06 |
| Stage N1+N2 sleep | – | – | −81.21 (32.18)** | |
| Stage N3 sleep | – | – | 46.76 (36.28)** | |
| REM sleep | – | – | −12.81(28.92)* | |
| WASO | 37.94 (15.97)**a,b | 19.11 (12.00)**a,c | 46.33 (22.03)**b,c | 64.18 |
Notes: Data presented as mean (standard deviation) in minutes. Significant biases using one-sample t-test against zero. Bonferroni corrected p-values: *P < 0.05; **P < 0.001. Analyses of variance of TST, and WASO biases were all significant within each TIB condition (P < 0.001). Negative values represent underestimations. aM10 significantly different from H5 (P < 0.05). bM10 significantly different from Oura (P < 0.05). cH5 significantly different from Oura (P < 0.05).
Abbreviations: M10, The default Actiwatch setting that uses a medium wake threshold with 40 counts per epoch with 10 immobile minutes for sleep onset and termination; H5, Actiwatch setting that has a higher wake threshold of 80 counts per epoch and 5 immobility minutes for sleep onset and termination; REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
Figure 1Bland−Altman plots of mean bias with upper and lower bands of agreement between polysomnography (PSG), Oura and Actiwatch H5 and M10 settings for each TIB condition (Red: 6.5-hour, green: 8-hour, and blue: 9-hour). A mean bias line above and below zero demonstrates overestimation and underestimation of the device against PSG, respectively. Bland-Altman plots of (A) total sleep time (TST), and (B) wake after sleep onset (WASO) for Actiwatch M10, H5, and Oura, respectively. The solid line indicates the mean value of bias and the dashed line represent 1.96 SD limits of agreement.
Figure 2Bland–Altman plots of (A) light, (B) deep, and (C) REM sleep, respectively between polysomnography (PSG) and Oura. Each TIB condition is color-coded with red: 6.5-hour, green: 8-hour, and blue: 9-hour sleep. The solid line indicates the mean value of bias and the dashed line represent 1.96 SD limits of agreement.
Confusion Matrices of Each Device Setting by TIB Condition
| M10 | H5 | Oura | ||||||
|---|---|---|---|---|---|---|---|---|
| Sleep | Wake | Sleep | Wake | Sleep | Wake | |||
| PSG | Sleep | 0.10(0.04) | 0.05(0.02) | 0.11(0.04) | ||||
| 0.11(0.09) | 0.06(0.08) | 0.11(0.04) | ||||||
| 0.09(0.04) | 0.05(0.02) | 0.12(0.05) | ||||||
| Wake | 0.14(0.11) | 0.30(0.20) | 0.11(0.07) | |||||
| 0.14(0.09) | 0.29(0.14) | 0.11(0.06) | ||||||
| 0.14(0.14) | 0.31(0.20) | 0.11(0.08) | ||||||
Notes: Mean (standard deviation) of proportions, referenced to PSG, of sleep/wake agreements. The classification accuracy of epochs into sleep or wake, specificities for sleep/wake categories; are highlighted in bold.
Abbreviations: M10, the default Actiwatch setting that uses a medium wake threshold with 40 counts per epoch with 10 immobile minutes for sleep onset and termination; H5, Actiwatch setting that has a higher wake threshold of 80 counts per epoch and 5 immobility minutes for sleep onset and termination; REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
EBE Agreement Metrics, Referenced to PSG, of Each Device Setting Grouped by TIB Condition
| M10 | H5 | Oura | F | |
|---|---|---|---|---|
| 6.5-Hour TIB | ||||
| Sleep-wake accuracy | 0.90 (0.03)a | 0.93(0.02)a,c | 0.89(0.04)c | 15.03 |
| Wake specificity | 0.86(0.11)a | 0.70(0.20)a,c | 0.89(0.07)c | 26.86 |
| Sleep sensitivity | 0.90(0.04)a | 0.95(0.02)a,c | 0.89(0.04)c | 40.76 |
| Sleep stage accuracies | ||||
| Light sleep | – | – | 0.52(0.05) | – |
| Deep sleep | – | – | 0.79(0.12) | – |
| REM sleep | – | – | 0.53(0.18) | – |
| 8-Hour TIB | ||||
| Sleep-wake accuracy | 0.90(0.04)a,b | 0.94(0.02)a,c | 0.89(0.04)b,c | 26.84 |
| Wake specificity | 0.86(0.09)a | 0.71(0.14)a,c | 0.89(0.07)c | 35.70 |
| Sleep sensitivity | 0.91(0.05)a,b | 0.95(0.02)a,c | 0.89(0.04)b,c | 49.52 |
| Sleep stage accuracies | ||||
| Light sleep | – | – | 0.52(0.08) | – |
| Deep sleep | – | – | 0.83(0.10) | – |
| REM sleep | – | – | 0.51(0.17) | – |
| 9-Hour TIB | ||||
| Sleep-wake accuracy | 0.91(0.03)a,b | 0.93(0.03)a,c | 0.89(0.04)b,c | 32.19 |
| Wake specificity | 0.87(0.11)a | 0.70(0.19)a,c | 0.89(0.08)c | 52.25 |
| Sleep sensitivity | 0.91(0.04)a,b | 0.95(0.02)a,c | 0.88(0.05)b,c | 91.83 |
| Sleep stage accuracies | ||||
| Light sleep | – | – | 0.52(0.07) | – |
| Deep sleep | – | – | 0.79(0.11) | – |
| REM sleep | – | – | 0.53(0.17) | – |
Notes: Analyses of variance of sleep sensitivities, wake specificities and sleep-wake accuracies within each TIB condition were all significant (P < 0.001). aM10 significantly different from H5 (P < 0.05). bM10 significantly different from Oura (P < 0.05). cH5 significantly different from Oura (P < 0.05).
Abbreviations: EBE, epoch by epoch; M10, The default Actiwatch setting that uses a medium wake threshold with 40 counts per epoch with 10 immobile minutes for sleep onset and termination; H5, Actiwatch setting that has a higher wake threshold of 80 counts per epoch and 5 immobility minutes for sleep onset and termination; REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
Confusion Matrices of Oura Sleep Staging by TIB Condition
| Oura | ||||||
|---|---|---|---|---|---|---|
| Wake | Light Sleep | Deep Sleep | REM Sleep | |||
| PSG | Wake | 0.05(0.04) | 0.05(0.04) | 0.01(0.02) | ||
| 0.05(0.04) | 0.04(0.03) | 0.02(0.02) | ||||
| 0.05(0.05) | 0.04(0.03) | 0.02(0.02) | ||||
| Stage N1 + N2 Sleep | 0.13(0.05) | 0.25(0.08) | 0.10(0.04) | |||
| 0.13(0.05) | 0.25(0.07) | 0.10(0.05) | ||||
| 0.13(0.06) | 0.23(0.07) | 0.12(0.05) | ||||
| Stage N3 Sleep | 0.02(0.02) | 0.18(0.12) | 0.01(0.02) | |||
| 0.02(0.02) | 0.13(0.08) | 0.02(0.02) | ||||
| 0.02(0.02) | 0.17(0.10) | 0.02(0.03) | ||||
| REM Sleep | 0.18(0.16) | 0.23(0.12) | 0.06(0.07) | |||
| 0.17(0.09) | 0.28(0.12) | 0.04(0.04) | ||||
| 0.18(0.13) | 0.24(0.12) | 0.05(0.05) | ||||
Notes: Mean (standard deviation) of proportions, referenced to PSG, of each sleep stage classification. Classification accuracies for each sleep stage are highlighted in bold.
Abbreviations: REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
Biases Proportional to the Sleep Duration Observed in Each Device Across TIB Conditions
| 6.5-Hour TIB (N=22) | 8-Hour TIB (N=28) | 9-Hour TIB (N=52) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| M10 | H5 | Oura | M10 | H5 | Oura | M10 | H5 | Oura | |
| TST | 0.17 (0.23) | −0.04 (0.25) | 0.31 (0.28) | 1.00 (0.27)** | 0.41 (0.27) | 0.82 (0.24)* | 0.39 (0.10)** | 0.22 (0.11) | 0.37 (0.16)* |
| Stage N1 + N2 sleep | −0.21 (0.35) | 0.20 (0.30) | 0.21 (0.16) | ||||||
| Stage N3 sleep | −0.21 (0.49) | 0.65 (0.37) | 0.32 (0.25) | ||||||
| REM sleep | 0.21 (0.43) | 0.77 (0.33)** | 0.62 (0.24)* | ||||||
| WASO | 1.39 (0.23)** | 0.98 (0.24)** | 1.64 (0.19)** | 1.33 (0.20)** | 0.69 (0.26)* | 1.13 (0.19)** | 0.60 (0.15)** | 0.09 (0.16) | 0.95 (0.16)** |
Notes: Data presented as B (standard error) in minutes. Biases were linearly regressed onto the mean of polysomnography and device setting duration to determine if the estimated TST and WASO for each device setting would predict the bias magnitude in each TIB condition. *P < 0.05; **P < 0.001.
Abbreviations: M10, the default Actiwatch setting that uses a medium wake threshold with 40 counts per epoch with 10 immobile minutes for sleep onset and termination; H5, Actiwatch setting that has a higher wake threshold of 80 counts per epoch and 5 immobility minutes for sleep onset and termination; REM, rapid eye movement; TIB, time in bed; TST, total sleep time; WASO, wake after sleep onset.
Figure 3TST, WASO, and sleep stages measured by PSG (blue lines) and Oura ring (red lines) for the Continuous (dotted lines) and Split (solid lines) sleep groups across the manipulation nights. Error bars denote 95% confidence intervals. Blue asterisks denote significant differences between groups with PSG measures, while red asterisks denote significant differences between groups with Oura measures. *P < 0.05; **P < 0.01; ***P < 0.001.