| Literature DB >> 31800576 |
Sarah C Gadd1,2, Peter W G Tennant1,3,4, Alison J Heppenstall1,2,4, Jan R Boehnke5, Mark S Gilthorpe1,3,4.
Abstract
Longitudinal data is commonly analysed to inform prevention policies for diseases that may develop throughout life. Commonly methods interpret the longitudinal data as a series of discrete measurements or as continuous patterns. Some of the latter methods condition on the outcome, aiming to capture 'average' patterns within outcome groups, while others capture individual-level pattern features before relating these to the outcome. Conditioning on the outcome may prevent meaningful interpretation. Repeated measurements of a longitudinal exposure (weight) and later outcome (glycated haemoglobin levels) were simulated to match three scenarios: one with no causal relationship between growth rate and glycated haemoglobin; two with a positive causal effect of growth rate on glycated haemoglobin. Two methods that condition on the outcome and one that did not were applied to the data in 1000 simulations. The interpretation of the two-step method matched the simulation in all causal scenarios, but that of the methods conditioning on the outcome did not. Methods that condition on the outcome do not accurately represent a causal relationship between a longitudinal pattern and outcome. Researchers considering longitudinal data should carefully determine if they wish to analyse longitudinal data as a series of discrete time points or by extracting pattern features.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31800576 PMCID: PMC6892534 DOI: 10.1371/journal.pone.0225217
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Directed acyclic graph showing the structure of causal relationships between variables in simulated scenarios A, B and C. Growth represents the growth rate of an individual and is not simulated or measured in the scenario. U is an unknown and unmeasured variable. The age in years at which known variables are measured is shown in subscript. Arrows show the direction of causal relationships and numbers attached to these arrows show the correlations induced by them.
Parameters of latent variables and error terms used to simulate data in section 3.
| Variable | Weight0 (kg) | Weight1 (kg) | Weight2 (kg) | HbA1c (%) |
|---|---|---|---|---|
| Mean | 4 | 8 | 12 | 5.8 |
| Standard deviation | 2 | 2 | 2 | 1 |
The path diagram used to generate observed variables from these is shown in Fig 1.
Summary of simulated variables in scenario A.
| Weight0 (kg) | Weight1 (kg) | Weight2 (kg) | HbA1c40 (%) | |||||
|---|---|---|---|---|---|---|---|---|
| Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | |
| Mean | 4.003 | 3.879, 4.129 | 8.003 | 7.889, 8.133 | 11.997 | 11.863, 12.117 | 5.800 | 5.737, 5.862 |
| SD | 2.000 | 1.910, 2.093 | 1.998 | 1.910, 2.090 | 2.000 | 1.911, 2.092 | 0.999 | 0.955, 1.043 |
| Correlation with HbA1c | 0.699 | 0.664, 0.729 | 0.029 | -0.033, 0.091 | -0.105 | -0.167, -0.044 | 1 | |
95%CI represents 95% empirical confidence intervals.
Summary of simulated variables in scenario B.
| Weight0 (kg) | Weight1 (kg) | Weight2 (kg) | HbA1c40 (%) | |||||
|---|---|---|---|---|---|---|---|---|
| Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | |
| Mean | 4.001 | 3.870, 4.124 | 7.999 | 7.876, 8.127 | 11.997 | 11.885, 12.118 | 5.801 | 5.741, 5.859 |
| SD | 2.000 | 1.912, 2.088 | 2.000 | 1.917, 2.084 | 1.999 | 1.915, 2.09 | 1.000 | 0.956, 1.044 |
| Correlation with HbA1c | 0.027 | -0.034, 0.088 | 0.699 | 0.666, 0.731 | 0.229 | 0.169, 0.283 | 1 | |
95%CI represents 95% empirical confidence intervals.
Summary of simulated variables in scenario C.
| Weight0 (kg) | Weight1 (kg) | Weight2 (kg) | HbA1c40 (%) | |||||
|---|---|---|---|---|---|---|---|---|
| Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | Mean | 95%CI | |
| Mean | 3.997 | 3.873, 4.113 | 8.003 | 7.882, 8.122 | 12.001 | 11.875, 12.122 | 5.801 | 5.738, 5.861 |
| SD | 2.001 | 1.919, 2.087 | 2.000 | 1.911, 2.087 | 2.003 | 1.915, 2.101 | 1.001 | 0.959, 1.047 |
| Correlation with HbA1c | -0.106 | -0.166, -0.043 | 0.229 | 0.167, 0.288 | 0.700 | 0.666, 0.731 | 1 | |
95%CI represents 95% empirical confidence intervals.
Fig 2Z-score plots of weight from birth to age 2 years for scenarios A, B and C. Dotted lines show the group diagnosed with diabetes at age 40 and dashed those without a diagnosis. Error bars show empirical 95% confidence intervals.
Average parameter estimates from multilevel models of weight (outcome as covariate).
| Parameter | Scenario A | Scenario B | Scenario C | |||
|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | Mean | 95% CI | |
| Diabetes | 0.729 | 0.566, 0.900 | 1.070 | 0.899, 1.25 | 0.939 | 0.769, 1.111 |
| Age | 4.327 | 4.223, 4.430 | 3.914 | 3.807, 4.021 | 3.670 | 3.563, 3.767 |
| Diabetes*Age | -1.372 | -1.572, -1.166 | 0.340 | 0.133, 0.573 | 1.375 | 1.166, 1.573 |
| Intercept | 7.823 | 7.737, 7.911 | 7.740 | 7.655, 7.821 | 7.770 | 7.690, 7.862 |
| Intercept variance | 0.481 | 0.348, 0.601 | 0.503 | 0.127, 0.816 | 0.099 | 0.000, 0.255 |
| Age Variance | 0.680 | 0.547, 0.799 | 0.897 | 0.714, 1.08 | 0.567 | 0.000, 0.711 |
| Residual Variance | 1.770 | 1.710, 1.836 | 1.727 | 1.54, 1.849 | 1.840 | 1.778, 1.908 |
| Constant-Age Covariance | 0.973 | 0.894, 0.988 | 0.268 | 0.02, 0.845 | -0.729 | -0.957, 0.579 |
| Autocorrelation parameter | 0.120 | 0.069, 0.169 | 0.042 | -0.118, 0.134 | 0.160 | 0.119, 0.196 |
Fig 3Fitted weight values from multilevel models (outcome as covariate) and average mean weight values for scenarios A, B and C. Dotted lines (fitted values) and circular points (average mean weight values) represent fitted values for the group with a diabetes diagnosis at age 40. Dashed lines (fitted values) and triangular points (average mean weight values) represent those without a diagnosis. The grey ribbon represents an empirical 95% confidence band around the fitted values.
Average parameter estimates from the logistic regression model of diabetes status on weight growth rate.
| Parameter | Scenario A | Scenario B | Scenario C | |||
|---|---|---|---|---|---|---|
| Mean | 95% CI | Mean | 95% CI | Mean | 95% CI | |
| Growth rate | 1.000 | 0.943, 1.057 | 1.194 | 1.122, 1.316 | 1.679 | 1.477, 2.191 |
| Weight0 | 2.000 | 2.060, 2.745 | 1.000 | 1.149, 1.429 | 2.000 | 1.339, 2.265 |
| Constant | 6.030x10-03 | 3.654x10-04, 9.185x10-02 | 9.309x10-05 | 1.659x10-06, 1.291x10-03 | 2.221x10-11 | 2.552x10-16, 7.591x10-09 |
Growth rate was estimated using a multilevel model of weight over age (agnostic to the outcome, diabetes status)