| Literature DB >> 29230836 |
Andrew Leroux1, Luo Xiao2, Ciprian Crainiceanu1, William Checkley3.
Abstract
In many studies, it is of interest to predict the future trajectory of subjects based on their historical data, referred to as dynamic prediction. Mixed effects models have traditionally been used for dynamic prediction. However, the commonly used random intercept and slope model is often not sufficiently flexible for modeling subject-specific trajectories. In addition, there may be useful exposures/predictors of interest that are measured concurrently with the outcome, complicating dynamic prediction. To address these problems, we propose a dynamic functional concurrent regression model to handle the case where both the functional response and the functional predictors are irregularly measured. Currently, such a model cannot be fit by existing software. We apply the model to dynamically predict children's length conditional on prior length, weight, and baseline covariates. Inference on model parameters and subject-specific trajectories is conducted using the mixed effects representation of the proposed model. An extensive simulation study shows that the dynamic functional regression model provides more accurate estimation and inference than existing methods. Methods are supported by fast, flexible, open source software that uses heavily tested smoothing techniques.Entities:
Keywords: covariance function; fPCA; face; longitudinal data; mixed effects; penalized splines; sparse functional data
Mesh:
Year: 2017 PMID: 29230836 PMCID: PMC5847461 DOI: 10.1002/sim.7582
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 2Length‐for‐age z‐score and weight‐for‐age z‐score curves for 4 children. Length‐for‐age z‐score is presented as ∘ and weight‐for‐age z‐score is presented as +
Figure 1Distribution of, A, length‐for‐age z‐scores (LAZ), B, weight‐for‐age z‐scores (WAZ), C, weight‐for‐length z‐scores (WLZ) binned into monthly categories, and, D, estimated empirical correlation between LAZ and WAZ at each month. Correlations were calculated using Z‐scores projected onto an evenly spaced grid of ages {0.5,1.5,…,23.5} for all subjects using face::face.sparse() applied separately to LAZ and WAZ
Figure 3Illustration of dynamic prediction using a subject from the CONTENT study. Conditional on the weight‐for‐age z‐score (WAZ) and length‐for‐age z‐score (LAZ) data in blue, the interest is to predict the length‐for‐age z‐score (LAZ) in red
Figure 4Estimated coefficient functions (solid lines) and associated 95% pointwise confidence intervals (dashed lines). WAZ, weight‐for‐age z‐score
Figure 5Left to right: A, estimated correlation function; B, estimated variance function; C, first 5 estimated eigenfunctions (ϕ) and corresponding eigenvalues (λ)
Figure 6Example of dynamic prediction for 4 subjects. Points represent subjects' observed length‐for‐age z‐score (LAZs), solid black/red lines represent the predicted curves, dashed black/red lines indicate 95% pointwise confidence intervals for the trajectories. Only observed data on and to the left of the vertical grey dashed line (blue points) are used for prediction
Various models
| AM: | LAZ |
| AMM: | LAZ |
| FCR: | LAZ |
| FRI: | LAZ |
Abbreviations: AM, additive model with the same mean structure; AMM, additive model with scalar random effects; FCR, functional concurrent regression; FRI, functional random intercept; LAZ, length‐for‐age z‐score.
100×Median (interquartile range) of integrated squared error for predicting coefficient functions using FCR, AMM, and AM across 500 simulations for each combination of (N,σ )
| N =100 | N =200 | ||||||
|---|---|---|---|---|---|---|---|
| FCR | AMM | AM | FCR | AMM | AM | ||
|
| |||||||
|
|
| 0.48 (0.71) | 0.57 (0.76) | 0.63 (0.82) | 0.25 (0.40) | 0.27 (0.40) | 0.34 (0.44) |
|
| 0.74 (1.73) | 1.03 (1.83) | 1.24 (2.01) | 0.38 (0.69) | 0.43 (0.82) | 0.55 (0.96) | |
|
| 0.26 (0.30) | 0.30 (0.34) | 0.42 (0.53) | 0.14 (0.16) | 0.15 (0.18) | 0.25 (0.24) | |
|
|
| 0.54 (0.84) | 0.57 (0.90) | 0.65 (0.95) | 0.28 (0.39) | 0.28 (0.40) | 0.36 (0.43) |
|
| 0.92 (1.68) | 1.12 (1.92) | 1.29 (1.87) | 0.43 (0.84) | 0.51 (0.86) | 0.66 (1.00) | |
|
| 0.30 (0.30) | 0.33 (0.38) | 0.49 (0.47) | 0.16 (0.17) | 0.17 (0.17) | 0.26 (0.30) | |
|
| |||||||
|
|
| 0.50 (0.74) | 0.60 (0.80) | 0.71 (0.91) | 0.22 (0.38) | 0.26 (0.42) | 0.32 (0.44) |
|
| 0.87 (1.46) | 1.12 (1.83) | 1.31 (1.94) | 0.36 (0.73) | 0.49 (0.73) | 0.57 (0.87) | |
|
| 0.21 (0.25) | 0.29 (0.37) | 0.38 (0.50) | 0.11 (0.13) | 0.14 (0.18) | 0.21 (0.27) | |
|
|
| 0.54 (0.73) | 0.55 (0.84) | 0.61 (0.82) | 0.27 (0.40) | 0.27 (0.42) | 0.32 (0.46) |
|
| 0.81 (1.60) | 0.92 (1.68) | 1.11 (1.73) | 0.39 (0.79) | 0.50 (0.88) | 0.59 (0.93) | |
|
| 0.26 (0.26) | 0.30 (0.35) | 0.44 (0.50) | 0.12 (0.13) | 0.13 (0.15) | 0.23 (0.24) | |
Abbreviations: AM, additive model with the same mean structure; AMM, additive model with scalar random effects; FCR, functional concurrent regression.
Average coverage probabilities for 95% confidence bands using FCR, AMM, and AM under various simulation scenarios
| N = 100 | N = 200 | ||||||
|---|---|---|---|---|---|---|---|
| FCR | AMM | AM | FCR | AMM | AM | ||
|
| |||||||
|
|
| 0.93 | 0.94 | 0.56 | 0.93 | 0.93 | 0.60 |
|
| 0.91 | 0.93 | 0.53 | 0.93 | 0.94 | 0.58 | |
|
| 0.84 | 0.71 | 0.63 | 0.84 | 0.72 | 0.64 | |
|
|
| 0.93 | 0.93 | 0.60 | 0.93 | 0.93 | 0.63 |
|
| 0.93 | 0.94 | 0.57 | 0.94 | 0.95 | 0.57 | |
|
| 0.86 | 0.79 | 0.65 | 0.88 | 0.80 | 0.67 | |
|
| |||||||
|
|
| 0.91 | 0.92 | 0.50 | 0.93 | 0.94 | 0.55 |
|
| 0.92 | 0.92 | 0.47 | 0.94 | 0.94 | 0.49 | |
|
| 0.84 | 0.64 | 0.58 | 0.85 | 0.65 | 0.61 | |
|
|
| 0.92 | 0.94 | 0.55 | 0.93 | 0.94 | 0.58 |
|
| 0.93 | 0.95 | 0.53 | 0.93 | 0.94 | 0.51 | |
|
| 0.86 | 0.75 | 0.59 | 0.90 | 0.78 | 0.63 | |
Abbreviations: AM, additive model with the same mean structure; AMM, additive model with scalar random effects; FCR, functional concurrent regression.
Coverage is assessed at 500 equally spaced points between 0 and 24 and then averaged across the 500 points for each simulation scenario.
10×Median (interquartile range) of mean integrated squared error for dynamically predicting subject‐specific curves using the FCR, fPCA, and AMM across 500 simulations for N=100,m ∼Unif(25,35),σ =0.18
| Prediction Time Window | |||||
|---|---|---|---|---|---|
| 8‐12 mo | 14‐18 mo | 20‐24 mo | |||
| Observed Data | 0‐6 mo | 2.06 (0.56) | 3.41 (0.97) | 3.74 (1.20) | FCR |
| 2.25 (0.64) | 3.66 (1.07) | 3.88 (1.17) | FRI | ||
| 2.21 (0.60) | 3.68 (1.07) | 4.71 (1.55) | AMM | ||
| 0‐12 mo | 1.80 (0.49) | 2.42 (0.63) | FCR | ||
| 1.91 (0.56) | 2.54 (0.70) | FRI | |||
| 2.17 (0.63) | 4.77 (1.47) | AMM | |||
| 0‐18 mo | 1.18 (0.36) | FCR | |||
| 1.27 (0.36) | FRI | ||||
| 2.50 (0.76) | AMM | ||||
Abbreviations: AMM, additive model with scalar random effects; FCR, functional concurrent regression; FRI, functional random intercept; fPCA, functional principal component analysis.
All dynamic prediction errors are assessed using 50 subjects that were not included in the model fitting procedure are used to evaluate the dynamic prediction error presented here. Note that although simulations were performed on the unit interval, the time periods have been rescaled to reflect the time domain of the CONTENT data.