| Literature DB >> 35904853 |
Weizhuang Zhou1, Yu En Chan1, Chuan Sheng Foo1, Jingxian Zhang1, Jing Xian Teo2, Sonia Davila2,3,4, Weiting Huang5, Jonathan Yap5,6, Stuart Cook4, Patrick Tan2,7,8,9, Calvin Woon-Loong Chin5,6, Khung Keong Yeo2,5,6, Weng Khong Lim2,3,7, Pavitra Krishnaswamy1.
Abstract
BACKGROUND: Consumer-grade wearable devices enable detailed recordings of heart rate and step counts in free-living conditions. Recent studies have shown that summary statistics from these wearable recordings have potential uses for longitudinal monitoring of health and disease states. However, the relationship between higher resolution physiological dynamics from wearables and known markers of health and disease remains largely uncharacterized.Entities:
Keywords: cardiometabolic disease; digital phenotypes; free-living; heart rate; machine learning; polygenic risk scores; risk prediction; time series analysis; wearable device
Mesh:
Substances:
Year: 2022 PMID: 35904853 PMCID: PMC9377462 DOI: 10.2196/34669
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Summary of demographic, clinical, and consumer wearable data for participants with wearable recordings (N=692) in the SingHEART study cohort.
|
| Female (n=370, 53.5%) | Male (n=322, 46.5%) | |||
|
| Value, mean (SD) | Participants, na (%) | Value, mean (SD) | Participants, na (%) | |
| Age (years) | 45.47 (11.71) | 0 (0) | 44.46 (13.29) | 0 (0) | |
| BMI (kg/m2) | 22.87 (3.94) | 0 (0) | 24.33 (3.39) | 0 (0) | |
| WCb (cm) | 78.91 (10.98) | 0 (0) | 86.96 (9.86) | 0 (0) | |
| SBPc (mm Hg) | 122.51 (17.74) | 0 (0) | 132.20 (14.96) | 0 (0) | |
| DBPd (mm Hg) | 73.38 (12.80) | 0 (0) | 82.18 (10.97) | 1 (0.3) | |
| Wearable-derived resting heart rate (bpm; Fitbit) | 70.66 (6.55) | 0 (0) | 69.37 (6.59) | 0 (0) | |
| ECG_HRe (bpm) | 64.46 (9.17) | 10 (2.7) | 63.67 (9.87) | 12 (3.7) | |
| Total cholesterol (mmol/L) | 5.34 (0.94) | 6 (1.6) | 5.33 (0.97) | 5 (1.6) | |
| LDLf (mmol/L) | 3.32 (0.81) | 7 (1.9) | 3.40 (0.89) | 6 (1.9) | |
| HDLg (mmol/L) | 1.59 (0.32) | 6 (1.6) | 1.36 (0.30) | 5 (1.6) | |
| TGsh (mmol/L) | 0.99 (0.51) | 6 (1.6) | 1.30 (0.76) | 5 (1.6) | |
| Glucose (mmol/L) | 5.17 (0.49) | 8 (2.2) | 5.36 (0.71) | 5 (1.6) | |
| Average daily step counti | 10,349.81 (4180.35) | 30 (8.1) | 10,972.86 (3919.10) | 20 (6.2) | |
| Average daily sedentary minutes | 633.45 (96.48) | 102 (27.6) | 656.49 (95.58) | 88 (27.3) | |
| Average daily sleep minutes | 395.92 (61.18) | 102 (27.6) | 374.49 (65.15) | 88 (27.3) | |
aRefers to number of participants with missing or incomplete values for the respective fields.
bWC: waist circumference.
cSBP: systolic blood pressure.
dDBP: diastolic blood pressure.
eECG_HR: electrocardiogram heart rate.
fLDL: low-density lipoprotein.
gHDL: high-density lipoprotein.
hTG: triglyceride.
iThe average daily step count was derived by taking the sum of steps for each day and then averaging over days. Only days with ≥20 hours of valid data were considered.
Figure 1Wearable data processing pipeline. (A) Construction of low-resolution features based on summary statistics. (B) Construction of high-resolution features based on the Canonical Time-series Characteristics 22 (Catch22) algorithm. (C) UpSet plot of the 692 participants with features from the various categories. Only nonempty set intersections are presented. Intersection size indicates the number of participants found within the intersections of given sets. Of the largest intersection with 328 participants, 321 also had laboratory measurement recordings.
Figure 2Illustration of wearable-derived high-resolution heart rate features. The distributions of 6 high-resolution features from the 321 participants, based on 2 Canonical Time-series Characteristics 22 features obtained from time series recordings in each of the 3 activity levels. The selected participants are at the 2.5th, 25th, 50th, 75th and 97.5th percentiles of each distribution, and the time series for the participant is plotted in the corresponding color. (A-C) CO_trev1_num is the time-reversibility statistic; higher values tend to correspond to “spikier” or irregular time series. (D-F) DN_HistogramMode_5 takes a time series and groups the z-scored values into 5 linearly spaced bins and reports the mode of the bins.
Description of the different model types.
| Model name | Features included | Features, n |
| Baseline [ | Age+gender | 2 |
| RestingHR | Baseline features+wearable-derived resting heart rate | 3 |
| SummaryStats | Baseline features+wearable summary stats | 12 |
| HighRes.ActiveSeg | Baseline features+Catch22a (active) | 24 |
| HighRes.SedenSeg | Baseline features+Catch22 (sedentary) | 24 |
| HighRes.SleepSeg | Baseline features+Catch22 (sleep) | 24 |
aCatch22: Canonical Time-series Characteristics 22.
Laboratory measurements and corresponding thresholds.
| Laboratory measurement | Threshold to be considered at risk | |
| I. Systolic blood pressure (mm Hg) | >140 | |
| II. Diastolic blood pressure (mm Hg) | >90 | |
| III. Triglycerides (mmol/L) | >2.3 | |
| IV. Total cholesterol (mmol/L) | >6.2 | |
| V. HDLa (mmol/L) | <1 | |
| VI. LDLb (mmol/L) | >4.1 | |
| VII. Fasting blood glucose level (mmol/L) | >6 | |
|
| ||
|
| Male | >100 |
|
| Female | >90 |
| IX. BMI (kg/m2) | >27.5 | |
aHDL: high-density lipoprotein.
bLDL: low-density lipoprotein.
Figure 3High-resolution (Canonical Time-series Characteristics 22 [Catch22]) wearable features from 3 different activity states. (A) Frequency polygons of the feature values based on the training set. The colors indicate activity states. (B) Pearson correlation coefficients between pairs of Catch22 features from different physical activity states (sleep, active, and sedentary). Two features from the active period (SC_FluctAnal_2_rsrangefit_50_1_logi_prop_r1 and SC_FluctAnal_2_dfa_50_1_2_logi_prop_r1) are uniformly 0; hence, correlation coefficients involving these 2 features are undefined (white squares).
Model performance on cardiometabolic risk targets. Out-of-bag model performance for each of the 5 model types computed for the 4 targets. A smaller Brier score indicates a better performing model for a given target.
|
| Baselinea, mean (SD) | RestingHRb, mean (SD) | HighRes.ActiveSegc, mean (SD) | HighRes.SedenSegc, mean (SD) | HighRes.SleepSegc, mean (SD) | SummaryStats, mean (SD) |
| anyRISKoutof9 | 0.291 (−5.87×10−4) | 0.258 (7.7×10−4) | 0.253 (8.52×10−4) | 0.239 (−9×10−4) | 0.245 (8.43×10−4) | 0.247 (7.66×10−4) |
| bp_abnormal | 0.227 (4.79×10−4) | 0.223 (5.61×10−4) | 0.217 (7.88×10−4) | 0.222 (8.14×10−4) | 0.225 (8.32×10−4) | 0.225 (7.9×10−4) |
| obesity | 0.246 (6.64×10−4) | 0.227 (7.91×10−4) | 0.221 (8.92×10−4) | 0.214 (9.34×10−4) | 0.226 (8.64×10−4) | 0.227 (8.54×10−4) |
| lipids_abnormal | 0.271 (5.84×10−4) | 0.261 (6.64×10−4) | 0.238 (8.08×10−4) | 0.225 (7.58×10−4) | 0.241 (8.27×10−4) | 0.236 (7.3×10−4) |
aFor each risk target, the Brier scores of the baseline model were significantly different from those of all other models (P<.001).
bFor each risk target, Brier scores of the resting heart rate model (RestingHR) were significantly different from all other models (P<.001).
cFor each risk target, Brier scores of the 3 HighRes models were significantly different from each other (P<.001).
Figure 4Random forest variable importance. The variable importance of each feature for prediction of the 4 cardiometabolic disease risk targets. We averaged each importance value across 200 simulations and used the results to rank the top 10 features to retain for each cardiometabolic disease risk target. This resulted in a total of 26 features across all 4 targets, as shown in the figure. Catch22: Canonical Time-series Characteristics 22.
Degree of association with genomic risk targets. Out-of-bag performance for each of the 5 model types computed for the 3 targets. A smaller Brier score indicates better performing model for a given target.
|
| Baselinea, mean (SD) | RestingHRb, mean (SD) | HighRes.ActiveSeg, mean (SD) | HighRes.SedenSeg, mean (SD) | HighRes.SleepSeg, mean (SD) | SummaryStats, mean (SD) |
| Blood pressure | 0.248 (2.0×10−3) | 0.245 (8.55×10−4) | 0.215 (1.08×10−3) | 0.214 (1.09×10−3) | 0.215 (9.93×10−4) | 0.212 (9.64×10−4) |
| Obesity | 0.245 (2.31×10−3) | 0.246 (9.03×10−4) | 0.205 (1.15×10−3) | 0.192 (1.06×10−3) | 0.199 (1.21×10−3) | 0.203 (1.06×10−3) |
| Lipids | 0.294 (3.02×10−3) | 0.308 (6.36×10−4) | 0.254 (9.07×10−4) | 0.254 (8.82×10−4) | 0.259 (8.92×10−4) | 0.268 (8.86×10−4) |
aFor each risk target, the Brier scores of the baseline model were significantly different from all other models (P<.001).
bFor each risk target, Brier scores of the resting heart rate model (RestingHR) were significantly different from those of the 3 HighRes and SummaryStats models (P<.001).
Illustrative profiles of 5 participants with actualized cardiometabolic events. Participant profiles include demographic information, type of cardiometabolic disease, key physical measurements, clinical and genomic risk markers, and the top 5 important wearable-derived heart rate features (as per Shapley values).
| Participant profiles | Participant | |||||
|
| A | B | C | D | E | |
|
| ||||||
|
| Age (years) | 54 | 57 | 56 | 55 | 61 |
|
| Gender | Male | Male | Male | Female | Male |
|
| Wearable-derived resting heart rate | 72.8 | 58.2 | 73.0 | 69.0 | 55.7 |
|
| ||||||
|
| BMI (kg/m2) | 28.05 | 18.79 | 21.27 | 22.95 | 25.95 |
|
| Blood pressure: SBPa/DBPb (mm Hg) | 166/109 | 108/65 | 164/105 | 112/48 | 133/89 |
|
| Glucose (mmol/L) | 6.8 | 4.8 | 7.4 | 5.3 | 5.3 |
|
| Total cholesterol (mmol/L) | 5.27 | 6.63 | 6.60 | 5.05 | 4.45 |
|
| anyRISKoutof9 | Truec | True | True | Falsed | False |
|
| ||||||
|
| Lipids abnormalities | True | True | False | False | False |
|
| Blood pressure abnormalities | True | True | False | True | False |
|
| Obesity | True | False | False | False | False |
|
| ||||||
|
| Cardiovascular disease | True | True | False | False | True |
|
| Dyslipidemia | True | False | True | True | False |
|
| Hypertension | True | False | False | False | False |
|
| ||||||
|
| CO_f1ecac.sedentary | False | False | False | True | False |
|
| FC_LocalSimple_mean3_stderr.sedentary | True | False | False | False | False |
|
| SB_MotifThree_quantile_hh.sedentary | True | False | False | False | False |
|
| SB_TransitionMatrix_3ac_sumdiagcov.sedentary | False | False | False | True | False |
|
| CO_trev_1_num.sedentary | False | False | False | False | True |
|
| CO_HistogramAMI_even_2_5.sedentary | False | False | True | False | False |
|
| DN_OutlierInclude_p_001_mdrmd.sedentary | True | True | False | False | False |
|
| CO_Embed2_Dist_tau_d_expfit_meandiff.sedentary | False | True | False | True | True |
|
| DN_HistogramMode_10.sedentary | False | False | True | False | False |
|
| DN_HistogramMode_5.sedentary | True | True | True | True | True |
|
| Gender | True | True | True | False | True |
|
| Age (years) | False | True | True | True | True |
aSBP: systolic blood pressure.
bDBP: diastolic blood pressure.
cTrue indicates true or that there is a presence of categorical variables.
dFalse indicates false or absence of categorical variables.