| Literature DB >> 35798763 |
Rahul Ghosal1, Vijay R Varma2, Dmitri Volfson3, Jacek Urbanek4, Jeffrey M Hausdorff5,6,7, Amber Watts8, Vadim Zipunnikov9.
Abstract
Wearable data is a rich source of information that can provide a deeper understanding of links between human behaviors and human health. Existing modelling approaches use wearable data summarized at subject level via scalar summaries in regression, temporal (time-of-day) curves in functional data analysis (FDA), and distributions in distributional data analysis (DDA). We propose to capture temporally local distributional information in wearable data using subject-specific time-by-distribution (TD) data objects. Specifically, we develop scalar on time-by-distribution regression (SOTDR) to model associations between scalar response of interest such as health outcomes or disease status and TD predictors. Additionally, we show that TD data objects can be parsimoniously represented via a collection of time-varying L-moments that capture distributional changes over the time-of-day. The proposed method is applied to the accelerometry study of mild Alzheimer's disease (AD). We found that mild AD is significantly associated with reduced upper quantile levels of physical activity, particularly during morning hours. In-sample cross validation demonstrated that TD predictors attain much stronger associations with clinical cognitive scales of attention, verbal memory, and executive function when compared to predictors summarized via scalar total activity counts, temporal functional curves, and quantile functions. Taken together, the present results suggest that SOTDR analysis provides novel insights into cognitive function and AD.Entities:
Mesh:
Year: 2022 PMID: 35798763 PMCID: PMC9263176 DOI: 10.1038/s41598-022-15528-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Summary statistics for the complete, AD and CNC samples.
| Characteristic | Complete sample | AD | CNC | ||||
|---|---|---|---|---|---|---|---|
| Mean/Freq | SD | Mean/Freq | SD | Mean/Freq | SD | ||
| Age | 73.36 | 7.11 | 73.59 | 7.92 | 73.19 | 6.53 | 0.797 |
| % Female | 52.17 | N/A | 28.20 | N/A | 69.81 | N/A | < 0.001 |
| Years of edu | 16.56 | 3.24 | 15.53 | 2.77 | 17.32 | 3.38 | 0.0064 |
| BMI | 26.78 | 4.52 | 27.28 | 5.04 | 26.42 | 4.11 | 0.3892 |
| VO2 max | 21.99 | 5.34 | 21.61 | 5.24 | 22.24 | 5.43 | 0.592 |
No statistical difference between the AD and CNC groups are observed across age, BMI, or V02 max. However, AD group had a smaller percentage of females (28.2 vs 69.8 for CNC) and lower education (15.5 years vs 17.3 years for CNC).
Figure 1Top left: violin plot of subject-specific averages for CNC and AD participants. top right: smoothed diurnal activity profiles averaged across CNC (blue) and AD (red) participants. Bottom left: average quantile functions of physical activity for AD and CNC participants.
Figure 2The average bivariate time-by-distribution PA surface for CNC (top left) and AD (top right) groups. The difference between CNC and AD (bottom left) and the difference between subjects with high (above percentile) and low (below -percentile) of cognitive attention (ATTN) scores (bottom right).
Figure 3The first four time-varying L-moments of daily physical activity averaged within CNC (blue) and AD (red) groups.
The results of modelling cognitive status (CNC vs AD) and physical activity using Model 1–4 with an adjustment for age, sex, and education.
| Dependent variable: cognitive status (CNC vs AD) | ||||
|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | Model 4 | |
| Intercept | 7.608** | 6.549* | 10.588** | 12.368*** |
| (3.567) | (3.615) | (4.139) | (4.591) | |
| Age | −0.051 | −0.040 | − 0.072* | − 0.089* |
| (0.038) | (0.039) | (0.043) | (0.047) | |
| Sex | 2.134*** | 2.111*** | 2.527*** | 2.637*** |
| (0.554) | (0.553) | (0.624) | (0.676) | |
| Education | − 0.224** | − 0.213** | − 0.167* | − 0.174* |
| (0.091) | (0.091) | (0.092) | (0.095) | |
| − 0.005*** | NA | NA | NA | |
| (0.002) | ||||
| NA | NA | NA | ||
| NA | NA | NA | ||
| NA | NA | NA | ||
| Observations | 92 | 92 | 92 | 92 |
| cvAUC | 0.781 | 0.773 | 0.792 | 0.811 |
The standard deviation of the estimated coefficients for the scalar predictors are indicated in the parenthesis. Predictors: model 1-scalar average PA, model 2–diurnal PA curves, model 3-quantile functions, model 4-SOTDR with time-by-distribution data objects.
*; **; ***.
Figure 4The estimated regression coefficients for Models 2–4. Estimated temporal effect (top left. t denoting time of the day). Estimated distributional effect (top right, ). Estimated bivariate effect of time-by-distribution PA surface (bottom left).
Figure 5Estimated diurnal effect of of PA on log odds of AD.
The results of modelling attention score and physical activity using Model 1–4 with an adjustment for age, sex, and education.
| Model 1 | Model 2 | Model 3 | Model 4 | |
|---|---|---|---|---|
| Intercept | − 1.423 | − 1.157 | − 2.045** | − 3.696*** |
| (0.929) | (0.960) | (0.927) | (0.988) | |
| Age | 0.002 | − 0.001 | 0.006 | 0.021* |
| (0.011) | (0.011) | (0.010) | (0.011) | |
| Sex | − 0.354** | − 0.349** | − 0.443*** | − 0.476*** |
| (0.150) | (0.150) | (0.150) | (0.134) | |
| Education | 0.083*** | 0.080*** | 0.072*** | 0.069*** |
| (0.023) | (0.023) | (0.023) | (0.021) | |
| 0.0005 | NA | NA | NA | |
| (0.0005) | ||||
| NA | NA | NA | ||
| NA | NA | NA | ||
| NA | NA | NA | ||
| Observations | 92 | 92 | 92 | 92 |
| Adjusted | 0.161 | 0.163 | 0.218 | 0.378 |
| cv | 0.167 | 0.189 | 0.240 | 0.333 |
The standard deviation of the estimated coefficients for the scalar predictors are indicated in the parenthesis. Predictors: model 1-scalar average PA, model 2-diurnal PA curves, model 3-quantile functions, model 4-SOTDR with time-by-distribution data objects. All models are adjusted for age, sex, years of education.
*; **; ***.
Figure 6The estimated effects of the different PA metrics (Model 2-4) on ATTN score. Estimated temporal effect (solid line) (top left). Estimated distributional effect (top right). Estimated bivariate effect of time-by-distribution PA surface (bottom left). The same plot (zoomed-in) with p restricted to the distributional domain (0.5, 1) (bottom right).
Figure 7Scatterplots of the estimated weighted scores corresponding to the predictors average PA, diurnal PA curve, PA quantile function and time-by-distribution PA metric respectively for cognitive status (left) and ATTN (right). Note: corresponds to average daily total count, corresponds to temporal curves of PA, corresponds to distribution representation (via quantile functions) of PA and corresponds to time-by-distribution representation of PA.