| Literature DB >> 33378539 |
Evan D Chinoy1,2, Joseph A Cuellar1,2, Kirbie E Huwa1,2, Jason T Jameson1,2, Catherine H Watson1,3, Sara C Bessman1,4, Dale A Hirsch1, Adam D Cooper1,3, Sean P A Drummond5, Rachel R Markwald1.
Abstract
STUDYEntities:
Keywords: actigraphy; nearables; noncontact; polysomnography; sensors; sleep technology; validation; wearables
Mesh:
Year: 2021 PMID: 33378539 PMCID: PMC8120339 DOI: 10.1093/sleep/zsaa291
Source DB: PubMed Journal: Sleep ISSN: 0161-8105 Impact factor: 5.849
Sleep summary: total sleep time (TST)
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Actiwatch | 98 | 418.2 ± 40.7 | 442.1 ± 19.7 | 23.9 | –40.5 | 88.3 | 7.3 | 0.74 | 0.51 |
| Fatigue Science Readiband | 41 | 416.0 ± 47.9 | 429.3 ± 53.0 | 13.3 | –93.2 | 119.8 | 1.6 (0.118) | 0.26 | 0.01 (0.482) |
| Fitbit Alta HR | 49 | 425.1 ± 33.1 | 427.7 ± 19.7 | 2.6 | –42.0 | 47.1 | 0.8 (0.421) | 0.09 | 0.41 |
| Garmin Fenix 5S | 29 | 413.1 ± 53.0 | 456.8 ± 21.0 | 43.7 | –47.0 | 134.4 | 5.2 | 1.07 | 0.61 |
| Garmin Vivosmart 3 | 43 | 414.7 ± 48.4 | 461.5 ± 16.4 | 46.8 | –39.5 | 133.1 | 7.1 | 1.28 | 0.69 |
| Earlysense Live | 51 | 421.6 ± 34.8 | 435.2 ± 30.1 | 13.6 | –45.1 | 72.3 | 3.3 | 0.42 | 0.03 (0.214) |
| ResMed S+ | 51 | 422.3 ± 33.8 | 422.0 ± 40.0 | –0.3 | –70.7 | 70.2 | –0.1 (0.953) | –0.01 | 0.04 (0.159) |
| SleepScore Max | 42 | 413.6 ± 48.9 | 421.1 ± 37.1 | 7.5 | –60.7 | 75.7 | 1.4 (0.162) | 0.17 | 0.14 |
Summary results for minutes of TST for the devices versus polysomnography (PSG). All nights with available TST data for both the device and PSG are included, with the total number of nights (n) indicated in each row. Mean and standard deviation (SD) are shown for PSG and each device. Bias represents the mean difference between PSG and the device, with positive and negative bias values indicating the device showed an overestimation or underestimation compared with PSG, respectively. Lower and upper limits of agreement represent two SDs from the bias. Statistical significance between each device and PSG was assessed with paired t-tests and corresponding p-values. Effect sizes (Hedges’ g) and proportional biases (R2) with corresponding p-values are also shown. p-values at the p < 0.05 level were considered statistically significant and are shown in bold and italic.
Sleep summary: sleep efficiency (SE)
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Actiwatch | 98 | 87.1 ± 8.5 | 92.1 ± 4.1 | 5.0 | –8.5 | 18.4 | 7.3 | 0.74 | 0.51 |
| Fatigue Science Readiband | 41 | 86.7 ± 10.0 | 89.4 ± 11.0 | 2.8 | –19.4 | 24.9 | 1.6 (0.117) | 0.26 | 0.01 (0.487) |
| Fitbit Alta HR | 49 | 88.6 ± 6.9 | 89.4 ± 4.0 | 0.9 | –8.4 | 10.2 | 1.3 (0.191) | 0.16 | 0.45 |
| Garmin Fenix 5S | 29 | 86.1 ± 11.0 | 96.6 ± 2.9 | 10.6 | –9.0 | 30.1 | 5.8 | 1.29 | 0.82 |
| Garmin Vivosmart 3 | 43 | 86.4 ± 10.1 | 96.5 ± 3.1 | 10.1 | –7.3 | 27.6 | 7.6 | 1.34 | 0.76 |
| Earlysense Live | 51 | 87.8 ± 7.3 | 90.8 ± 6.1 | 2.9 | –9.2 | 15.1 | 3.4 | 0.43 | 0.04 (0.148) |
| ResMed S+ | 51 | 88.0 ± 7.0 | 88.0 ± 8.3 | 0.0 | –14.7 | 14.7 | 0.0 (0.996) | 0.00 | 0.04 (0.158) |
| SleepScore Max | 42 | 86.2 ± 10.2 | 87.8 ± 7.8 | 1.6 | –12.6 | 15.8 | 1.5 (0.150) | 0.18 | 0.13 |
Summary results for the percentage of SE for the devices versus polysomnography (PSG). See Table 1 caption for additional table details.
Figure 1.Bland–Altman plots: total sleep time (TST). Bland–Altman plots depicting the mean bias (blue dashed line) and upper and lower limits of agreement (two standard deviations from bias; black dashed lines) for minutes of TST for the devices compared with polysomnography (PSG). Black circles are individual nights. Solid blue curves represent the best-fit of data, with surrounding gray shaded regions representing 95% confidence bands. The solid black line at zero represents no difference, with positive and negative y-axis values indicating an overestimation or underestimation, respectively, compared with PSG.
Figure 2.Bland–Altman plots: sleep efficiency (SE). Bland–Altman plots depicting the percentage of SE for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Sleep summary: sleep onset latency (SOL)
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Actiwatch | 102 | 9.7 ± 8.5 | 2.1 ± 1.2 | –7.6 | –24.1 | 8.8 | –9.4 | –1.25 | 0.93 |
| Fatigue Science Readiband | 42 | 9.8 ± 7.4 | 9.0 ± 10.6 | –0.7 | –18.2 | 16.7 | –0.5 (0.593) | –0.08 | 0.17 |
| Fitbit Alta HR | 57 | 8.9 ± 7.7 | 5.8 ± 4.7 | –3.1 | –19.0 | 12.8 | –2.9 | –0.48 | 0.22 |
| Garmin Fenix 5S | 30 | 9.5 ± 8.1 | 10.3 ± 13.9 | 0.8 | –26.4 | 28.0 | 0.3 (0.750) | 0.07 | 0.26 |
| Garmin Vivosmart 3 | 44 | 9.6 ± 7.3 | 8.5 ± 7.5 | –1.1 | –12.1 | 9.9 | –1.3 (0.192) | –0.15 | 0.00 (0.789) |
| Earlysense Live | 55 | 9.7 ± 9.6 | 10.5 ± 6.9 | 0.8 | –15.2 | 16.8 | 0.8 (0.451) | 0.10 | 0.14 |
| ResMed S+ | 54 | 10.1 ± 9.6 | 14.1 ± 15.9 | 4.0 | –25.1 | 33.1 | 2.0 | 0.30 | 0.26 |
| SleepScore Max | 44 | 9.6 ± 7.2 | 14.0 ± 14.3 | 4.4 | –18.9 | 27.6 | 2.5 | 0.38 | 0.45 |
Summary results for minutes of SOL for the devices versus polysomnography (PSG). See Table 1 caption for additional table details.
Figure 3.Bland–Altman plots: sleep onset latency (SOL). Bland–Altman plots depicting the minutes of SOL for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Sleep summary: wake after sleep onset (WASO)
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Actiwatch | 98 | 52.5 ± 39.5 | 35.9 ± 19.4 | –16.6 | –81.0 | 47.9 | –5.1 | –0.53 | 0.47 |
| Fatigue Science Readiband | 41 | 54.1 ± 47.5 | 41.6 ± 52.3 | –12.5 | –119.0 | 94.0 | –1.5 (0.140) | –0.25 | 0.01 (0.506) |
| Fitbit Alta HR | 49 | 46.6 ± 30.8 | 44.5 ± 19.4 | –2.1 | –42.9 | 38.7 | –0.7 (0.472) | –0.08 | 0.35 |
| Garmin Fenix 5S | 29 | 57.2 ± 52.9 | 7.7 ± 11.7 | –49.5 | –144.2 | 45.1 | –5.6 | –1.27 | 0.87 |
| Garmin Vivosmart 3 | 43 | 55.6 ± 48.1 | 8.0 ± 12.8 | –47.6 | –129.0 | 33.8 | –7.7 | –1.34 | 0.85 |
| Earlysense Live | 51 | 49.2 ± 32.2 | 33.9 ± 26.7 | –15.3 | –73.2 | 42.6 | –3.8 | –0.52 | 0.05 (0.124) |
| ResMed S+ | 51 | 48.3 ± 31.1 | 44.9 ± 34.0 | –3.4 | –67.8 | 61.1 | –0.8 (0.457) | –0.10 | 0.01 (0.460) |
| SleepScore Max | 42 | 56.7 ± 48.6 | 44.6 ± 34.1 | –12.1 | –79.5 | 55.3 | –2.3 | –0.29 | 0.22 |
Summary results for total minutes of WASO, from sleep onset latency (SOL), for the devices versus polysomnography (PSG). See Table 1 caption for additional table details.
Figure 4.Bland–Altman plots: wake after sleep onset (WASO). Bland–Altman plots depicting the minutes of WASO from sleep onset latency (SOL) for the devices compared with polysomnography (PSG). See Figure 1 caption for additional details on the interpretation of Bland–Altman plots.
Epoch-by-epoch (EBE) agreement: sleep versus wake
| Device | Sensitivity | Specificity | PPV | NPV | Accuracy | PABAK |
|---|---|---|---|---|---|---|
| Actiwatch | 0.97 | 0.39 | 0.91 | 0.63 | 0.89 | 0.78 |
| Fatigue Science Readiband | 0.94 | 0.45 | 0.92 | 0.55 | 0.88 | 0.75 |
| Fitbit Alta HR | 0.95 | 0.54 | 0.94 | 0.58 | 0.90 | 0.80 |
| Garmin Fenix 5S | 0.99 | 0.18 | 0.88 | 0.74 | 0.88 | 0.74 |
| Garmin Vivosmart 3 | 0.99 | 0.19 | 0.89 | 0.74 | 0.88 | 0.76 |
| EarlySense Live | 0.96 | 0.47 | 0.93 | 0.62 | 0.90 | 0.79 |
| ResMed S+ | 0.93 | 0.51 | 0.93 | 0.51 | 0.88 | 0.75 |
| SleepScore Max | 0.94 | 0.50 | 0.92 | 0.56 | 0.88 | 0.75 |
Proportions for EBE agreement metrics are shown for sleep epochs (versus wake epochs) on all nights for the devices, compared with the corresponding epochs from polysomnography (PSG). Higher values (closer to 1.0) indicate better performance on that metric. PPV = positive predictive value, NPV = negative predictive value, PABAK = prevalence and bias-adjusted kappa.
Epoch-by-epoch (EBE) agreement: light sleep epochs
| Device | Sensitivity | Specificity | PPV | NPV | Accuracy | PABAK |
|---|---|---|---|---|---|---|
| Fitbit Alta HR | 0.76 | 0.67 | 0.70 | 0.74 | 0.72 | 0.42 |
| Garmin Fenix 5S | 0.68 | 0.54 | 0.58 | 0.64 | 0.60 | 0.19 |
| Garmin Vivosmart 3 | 0.70 | 0.55 | 0.60 | 0.66 | 0.63 | 0.24 |
| EarlySense Live | 0.57 | 0.69 | 0.64 | 0.62 | 0.63 | 0.25 |
| ResMed S+ | 0.67 | 0.61 | 0.63 | 0.65 | 0.64 | 0.27 |
| SleepScore Max | 0.68 | 0.60 | 0.62 | 0.66 | 0.64 | 0.26 |
Proportions for EBE agreement metrics are shown for light sleep epochs (versus the combination of all other classifications—wake, deep, and REM) on all nights, compared with the corresponding polysomnography (PSG) epochs. Results are shown for all devices that output sleep stage classifications. See Table 8 caption for additional table details.
Epoch-by-epoch (EBE) agreement: deep sleep epochs
| Device | Sensitivity | Specificity | PPV | NPV | Accuracy | PABAK |
|---|---|---|---|---|---|---|
| Fitbit Alta HR | 0.53 | 0.92 | 0.58 | 0.91 | 0.86 | 0.71 |
| Garmin Fenix 5S | 0.56 | 0.92 | 0.55 | 0.92 | 0.87 | 0.73 |
| Garmin Vivosmart 3 | 0.56 | 0.92 | 0.54 | 0.93 | 0.87 | 0.73 |
| EarlySense Live | 0.68 | 0.84 | 0.46 | 0.93 | 0.81 | 0.62 |
| ResMed S+ | 0.59 | 0.88 | 0.50 | 0.91 | 0.83 | 0.66 |
| SleepScore Max | 0.59 | 0.88 | 0.44 | 0.93 | 0.84 | 0.67 |
Proportions for EBE agreement metrics are shown for deep sleep epochs (versus the combination of all other classifications—wake, light, and REM) on all nights, compared with the corresponding polysomnography (PSG) epochs. Results are shown for all devices that output sleep stage classifications. See Table 8 caption for additional table details.
Epoch-by-epoch (EBE) agreement: rapid eye movement (REM) sleep epochs
| Device | Sensitivity | Specificity | PPV | NPV | Accuracy | PABAK |
|---|---|---|---|---|---|---|
| Fitbit Alta HR | 0.69 | 0.94 | 0.77 | 0.91 | 0.89 | 0.77 |
| Garmin Fenix 5S | 0.54 | 0.84 | 0.51 | 0.86 | 0.77 | 0.53 |
| Garmin Vivosmart 3 | 0.50 | 0.82 | 0.46 | 0.84 | 0.75 | 0.48 |
| EarlySense Live | 0.64 | 0.89 | 0.62 | 0.90 | 0.84 | 0.67 |
| ResMed S+ | 0.50 | 0.95 | 0.71 | 0.87 | 0.85 | 0.69 |
| SleepScore Max | 0.49 | 0.95 | 0.74 | 0.86 | 0.84 | 0.68 |
Proportions for EBE agreement metrics are shown for REM sleep epochs (versus the combination of all other classifications—wake, light, and deep) on all nights, compared with the corresponding polysomnography (PSG) epochs. Results are shown for all devices that output sleep stage classifications. See Table 8 caption for additional table details.
Epoch-by-epoch (EBE) agreement: device sleep stage misclassification errors
| Device | Wake epochs | Light sleep epochs | Deep sleep epochs | REM sleep epochs | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Light | Deep | REM | Wake | Deep | REM | Wake | Light | REM | Wake | Light | Deep | |
| Fitbit Alta HR | 0.35 | 0.01 | 0.09 | 0.06 | 0.12 | 0.06 | 0.03 | 0.43 | 0.01 | 0.05 | 0.24 | 0.02 |
| Garmin Fenix 5S | 0.54 | 0.06 | 0.22 | 0.02 | 0.10 | 0.20 | 0.00 | 0.40 | 0.04 | 0.00 | 0.46 | 0.04 |
| Garmin Vivosmart 3 | 0.53 | 0.08 | 0.20 | 0.02 | 0.09 | 0.18 | 0.00 | 0.41 | 0.03 | 0.00 | 0.42 | 0.04 |
| EarlySense Live | 0.35 | 0.03 | 0.15 | 0.06 | 0.25 | 0.12 | 0.02 | 0.27 | 0.03 | 0.02 | 0.32 | 0.02 |
| ResMed S+ | 0.38 | 0.01 | 0.09 | 0.08 | 0.19 | 0.06 | 0.05 | 0.35 | 0.01 | 0.06 | 0.42 | 0.02 |
| SleepScore Max | 0.40 | 0.01 | 0.09 | 0.07 | 0.20 | 0.05 | 0.05 | 0.35 | 0.01 | 0.05 | 0.44 | 0.02 |
Proportions for EBE misclassification errors of sleep stage epochs versus polysomnography (PSG). PSG-scored classifications are the larger column categories, with the three possible device-scored misclassifications under each category. Results are shown for all devices that output sleep stage classifications.
Sleep summary: light sleep
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Fitbit Alta HR | 49 | 236.6 ± 28.5 | 256.7 ± 30.1 | 20.0 | –54.1 | 94.2 | 3.8 | 0.68 | 0.00 (0.714) |
| Garmin Fenix 5S | 29 | 238.3 ± 36.2 | 267.3 ± 35.8 | 29.0 | –74.4 | 132.4 | 3.0 | 0.80 | 0.00 (0.957) |
| Garmin Vivosmart 3 | 43 | 238.3 ± 31.9 | 273.0 ± 36.1 | 34.7 | –60.5 | 129.8 | 4.8 | 1.01 | 0.02 (0.431) |
| Earlysense Live | 51 | 237.3 ± 31.0 | 215.0 ± 51.5 | –22.3 | –133.6 | 89.0 | –2.9 | –0.52 | 0.22 |
| ResMed S+ | 51 | 235.9 ± 31.1 | 253.0 ± 34.3 | 17.1 | –58.5 | 92.6 | 3.2 | 0.52 | 0.01 (0.468) |
| SleepScore Max | 42 | 236.5 ± 32.2 | 259.1 ± 38.7 | 22.7 | –51.7 | 97.0 | 4.0 | 0.63 | 0.04 (0.196) |
Summary results for total minutes of light sleep for the devices versus polysomnography (PSG). For PSG, light sleep was calculated as the combination of N1 and N2 sleep stages. Results are shown for all devices that output sleep stage classifications. See Table 1 caption for additional table details.
Sleep summary: deep sleep
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Fitbit Alta HR | 49 | 81.1 ± 28.1 | 75.0 ± 21.1 | –6.0 | –73.8 | 61.7 | –1.2 (0.219) | –0.24 | 0.08 (0.052) |
| Garmin Fenix 5S | 29 | 63.3 ± 22.9 | 69.4 ± 29.3 | 6.1 | –53.0 | 65.1 | 1.1 (0.278) | 0.23 | 0.07 (0.170) |
| Garmin Vivosmart 3 | 43 | 66.7 ± 23.1 | 70.4 ± 28.9 | 3.7 | –57.5 | 64.9 | 0.8 (0.431) | 0.14 | 0.06 (0.130) |
| Earlysense Live | 51 | 79.6 ± 30.3 | 115.5 ± 42.9 | 35.9 | –66.8 | 138.6 | 5.0 | 0.96 | 0.11 |
| ResMed S+ | 51 | 81.5 ± 27.8 | 96.0 ± 30.5 | 14.5 | –54.8 | 83.8 | 3.0 | 0.49 | 0.01 (0.504) |
| SleepScore Max | 42 | 66.8 ± 23.7 | 87.4 ± 28.9 | 20.7 | –36.3 | 77.7 | 4.7 | 0.77 | 0.05 (0.166) |
Summary results for total minutes of deep sleep for the devices versus polysomnography (PSG). For PSG, deep sleep was calculated as the N3 sleep stage. Results are shown for all devices that output sleep stage classifications. See Table 1 caption for additional table details.
Sleep summary: rapid eye movement (REM) sleep
| Device |
| PSG | Device | Bias | Lower limit | Upper limit |
| Effect size |
|
|---|---|---|---|---|---|---|---|---|---|
| Fitbit Alta HR | 49 | 107.4 ± 20.6 | 96.0 ± 22.9 | –11.4 | –60.2 | 37.3 | –3.3 | –0.52 | 0.01 (0.426) |
| Garmin Fenix 5S | 29 | 111.4 ± 23.8 | 120.0 ± 37.8 | 8.6 | –74.9 | 92.1 | 1.1 (0.277) | 0.27 | 0.19 |
| Garmin Vivosmart 3 | 43 | 109.7 ± 22.6 | 118.1 ± 40.4 | 8.4 | –75.9 | 92.6 | 1.3 (0.200) | 0.25 | 0.28 |
| Earlysense Live | 51 | 104.7 ± 22.2 | 104.7 ± 43.1 | 0.0 | –82.9 | 82.9 | 0.0 (0.997) | 0.00 | 0.36 |
| ResMed S+ | 51 | 104.9 ± 20.7 | 73.0 ± 25.1 | –31.9 | –85.5 | 21.8 | –8.5 | –1.37 | 0.04 (0.159) |
| SleepScore Max | 42 | 110.4 ± 22.4 | 74.6 ± 23.0 | –35.8 | –95.3 | 23.7 | –7.8 | –1.57 | 0.00 (0.856) |
Summary results for total minutes of REM sleep for the devices versus polysomnography (PSG). Results are shown for all devices that output sleep stage classifications. See Table 1 caption for additional table details.