| Literature DB >> 31093555 |
Marieke Welten1, Marlou L A de Kroon2, Carry M Renders3, Ewout W Steyerberg2,4, Hein Raat2, Jos W R Twisk1, Martijn W Heymans1.
Abstract
BACKGROUND: In literature, not much emphasis has been placed on methods for analyzing repeatedly measured independent variables, even less so for the use in prediction modeling specifically. However, repeated measurements could especially be interesting for the construction of prediction models. Therefore, our objective was to evaluate different methods to model a repeatedly measured independent variable and a long-term fixed outcome variable into a prediction model.Entities:
Keywords: Longitudinal studies; Prediction models; Repeated measurements; Risk modeling; Statistical methods
Year: 2018 PMID: 31093555 PMCID: PMC6460730 DOI: 10.1186/s41512-018-0024-7
Source DB: PubMed Journal: Diagn Progn Res ISSN: 2397-7523
Characteristics of the population for analysis
| Broken stick-data | ||
|---|---|---|
| Male, no (%) | 343 (47.0%) | |
| Visit 0 days | Age (years) | 0.0 (0.0; 0.0) |
| BMI-SDS | −0.6 (0.9) | |
| Visit 3 months | Age (years) | 0.3 (0.3; 0.3) |
| BMI-SDS | −0.4 (0.8) | |
| Visit 6 months | Age (years) | 0.5 (0.5; 0.5) |
| BMI-SDS | −0.4 (0.8) | |
| Visit 14 months | Age (years) | 1.2 (1.2; 1.2) |
| BMI-SDS | 0.2 (0.8) | |
| Visit 2 years | Age (years) | 2.0 (2.0; 2.0) |
| BMI-SDS | 0.1 (0.8) | |
| Visit 3 years | Age (years) | 3.0 (3.0; 3.0) |
| BMI-SDS | −0.1 (0.8) | |
| Visit 5.5 years | Age (years) | 5.5 (5.5; 5.5) |
| BMI-SDS | −0.2 (0.7) | |
| Visit 10 years | Age (years) | 9.9 (9.1; 10.4) |
| BMI-SDS | −0.2 (1.0) | |
| Overweight, no (%) | 90 (12.3%) |
Values are expressed as the mean (SD), median (95% range) or number (%) of age at visit, BMI standard deviation score (-SDS) at visit, sex, and overweight
Fig. 1Mean BMI-SDS at ages 0 to 6 years of overweight and non-overweight 10-year-old children
The predictive quality of prediction models developed using different methods to include longitudinal predictor BMI-SDS
| Method | Model includes | Outcome at 10y | |||
|---|---|---|---|---|---|
| Overweight | BMI-SDS | ||||
| Nk R2 | AUC |
| AUC | ||
| 1. All original measurements | BMI-SDS at age 0 days, 3 months, 6 months, 14 months, 2 years, 3 years, 5.5 years | 0.244a | 0.807a | 0.339b | 0.801b |
| 2. Single ‘best’ measurement | BMI-SDS at age 5.5 years | 0.230 | 0.799 | 0.329 | 0.799 |
| 3. Summary measurement | Mean (BMI-SDS at age 0 days, 3 months, 6 months, 14 months, 2 years, 3 years, 5.5 years) | 0.168 | 0.767 | 0.238 | 0.767 |
| 3. Summary measurement | Maximum (BMI-SDS at age 0 days, 3 months, 6 months, 14 months, 2 years, 3 years, 5.5 years) | 0.130 | 0.737 | 0.177 | 0.737 |
| 4. Change between measurements | BMI-SDS at age 0 days and BMI-SDS changes between ages 3m-0d, 6m-3m, 14m-6m, 2y-14m, 3y-2y, 5.5y-3y | 0.244c | 0.807c | 0.339d | 0.801d |
| 5. Conditional measurements | BMI-SDS at age 0 days and conditional BMI-SDS at age 3 months, 6 months, 14 months, 2 years, 3 years, 5.5 years | 0.244e | 0.807e | 0.348 | 0.806 |
| 6. Growth curve parameters | Mean and regression coefficients of the cubic growth curve ( | 0.241 | 0.803 | 0.337 | 0.803 |
Values are the explained variance of each prediction model developed in the broken stick dataset expressed in adjusted Nagelkerke R2 (Nk R2) or adjusted R2 (R2) and the area under the curve (AUC). The models predicting the dichotomous outcome overweight no/yes were analyzed using logistic regression. The prediction models predicting the continuous outcome BMI-SDS at age 10 were analyzed using linear regression
Due to collinearity: a. The model did not contain BMI-SDS at 5.5 years; b. The model did not contain BMI-SDS at 3 years; c. The model did not contain ΔBMI-SDS between 5.5y-3y; d. The model did not contain BMI-SDS at 0 days; e. The model did not contain conditional BMI-SDS at 5.5 years
Characteristics of the methods for developing a prediction model with a longitudinal predictor
| Method | Flexible with missing values | Flexible with timing of measurements | Encompasses all information on the development of the predictor | Capable of dealing with a great number of repeated measurements | Capable of dealing with a small number of repeated measurements | Straightforward predictor computation (no additional steps that need to be performed before prediction model can be made) |
|---|---|---|---|---|---|---|
| 1. All original measurements | + | + | + | |||
| 2. Single “best” measurement | + | + | + | |||
| 3. Summary (mean or maximum etc.) | + | + | + | + | * | |
| 4. Change between measurements | * | + | + | * | ||
| 5. Conditional measurements | + | + | * | |||
| 6. Growth curve parameters | + | + | + | + |
+advantage that is present; *advantage that is partially present; an empty cell indicates an advantage that is not present. See discussion section "Methods to develop prediction models with a longitudinal predictor" for more information