| Literature DB >> 24108269 |
Laura D Howe1, Kate Tilling2, Alicia Matijasevich3, Emily S Petherick4, Ana Cristina Santos5, Lesley Fairley4, John Wright4, Iná S Santos3, Aluísio Jd Barros3, Richard M Martin6, Michael S Kramer7, Natalia Bogdanovich8, Lidia Matush8, Henrique Barros5, Debbie A Lawlor9.
Abstract
Childhood growth is of interest in medical research concerned with determinants and consequences of variation from healthy growth and development. Linear spline multilevel modelling is a useful approach for deriving individual summary measures of growth, which overcomes several data issues (co-linearity of repeat measures, the requirement for all individuals to be measured at the same ages and bias due to missing data). Here, we outline the application of this methodology to model individual trajectories of length/height and weight, drawing on examples from five cohorts from different generations and different geographical regions with varying levels of economic development. We describe the unique features of the data within each cohort that have implications for the application of linear spline multilevel models, for example, differences in the density and inter-individual variation in measurement occasions, and multiple sources of measurement with varying measurement error. After providing example Stata syntax and a suggested workflow for the implementation of linear spline multilevel models, we conclude with a discussion of the advantages and disadvantages of the linear spline approach compared with other growth modelling methods such as fractional polynomials, more complex spline functions and other non-linear models.Entities:
Keywords: ALSPAC; Born in Bradford; Generation XXI; PROBIT; Pelotas; child; growth; height; longitudinal; multilevel models; spline; weight
Mesh:
Year: 2013 PMID: 24108269 PMCID: PMC4074455 DOI: 10.1177/0962280213503925
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Characteristics of birth cohorts.
| ALSPAC | Born in Bradford (White British) | Born in Bradford (Pakistani) | Generation XXI | Pelotas 2004 cohort | PROBIT | |
|---|---|---|---|---|---|---|
| Total sample size included in growth analysis | 14,048 | 613 | 777 | 5282 | 4188 | 17,046 |
| % male | 51.5 | 51.6 | 50.1 | 50.5 | 51.8 | 51.8 |
| % preterm birth (<37 weeks gestation) | 6.1 | 6.1 | 4.6 | 7.2 | 14.0 | –[ |
| Mean maternal height in cm (SD) | 164.0 (6.7) | 164.0 (6.2) | 159.4 (5.8) | 160.5 (6.1) | 158.7 (6.4) | 164.4 (5.6) |
| % mothers who smoked in pregnancy | 25.6 | 34.6 | 3.6 | 20.7 | 27.4 | 2.0 |
ALSPAC: The Avon Longitudinal Study of Parents and Children; PROBIT: the Promotion of Breastfeeding Intervention Trial.
Eligibility criteria for PROBIT mean that all infants were term births (≥37 weeks gestation) with a birth weight ≥2.5 kg.
Features of models for childhood growth in each birth cohort.
| ALSPAC | Born in Bradford | Generation XXI | Pelotas 2004 cohort | PROBIT | |
|---|---|---|---|---|---|
| Age range | 0–10 years | 0–2 years | 0–6 years | 0–4 years | 0–5 years |
| Knot points for weight model | 3 months, 1 and 7 years | 4 and 9 months | 10 days, 3 months, 1 and 3 years | 3 months, 1 year | 3 months, 1 year |
| Knot points for length/height model | 3 months, 1 and 3 years | 4 and 9 months | 3 months, 1 and 3 years | 3 months, 1 and 2 years | 3 months, 1 year |
| Process for selecting knot points | (i) Identified fractional polynomial providing the best smooth curve, (ii) used this curve to identify approximate number and position of knot points, (iii) ran multiple models using alternative positions for knot points at ages in whole months, (iv) selected the combination of knot points giving the best model fit judged by log likelihood, (v) verified that model fit was not substantially compromised when choosing knot points at whole years | (i) Identified fractional polynomial providing the best smooth curve, (ii) used this curve to identify approximate number and position of knot points, (iii) ran multiple models using alternative positions for knot points at ages in whole months, (iv) selected the combination of knot points giving the best model fit judged by log likelihood | (i) Fit a model using the ALSPAC knot points, (ii) tested multiple models using alternative positions for knot points at ages in whole months around the ALSPAC knot points and verified that altering from the ALSPAC knot points did not improve model fit (judged by both log likelihood and differences between observed and predicted values), (iii) tested for improvement in model fit (judged by both log likelihood and differences between observed and predicted values) with the inclusion of an additional early knot point in the weight model | Knot points fit at planned follow-up ages | (i) Identified fractional polynomial providing the best smooth curve, (ii) fit knot points at planned clinic ages, (iii) chose the model which best fitted the fractional polynomial |
| Other covariates included in model[ | Measurement source (measured vs. parent-reported) | Ethnicity and ethnicity interactions with splines, measurement source (research vs. health visitor) |
ALSPAC: The Avon Longitudinal Study of Parents and Children; PROBIT: the Promotion of Breastfeeding Intervention Trial.
All models included gender and interactions between gender and splines as fixed effects.
Figure 1.Distribution of the ages at measurement in (a) ALSPAC and (b) Pelotas 2004 cohorts.
a. ALSPAC: b. Pelotas 2004 cohort
ALSPAC: The Avon Longitudinal Study of Parents and Children.
Note: Illustrating the greater inter-individual variability in ages at measurement in ALSPAC compared with the Pelotas 2004 cohort. Other cohorts are shown in Supplementary Figure 1.
Differences between observed measurements and those predicted by the multilevel model for ALSPAC.
| Mean actual measurement (SD) | Mean predicted measurement (SD) | Mean difference (SD) | |
|---|---|---|---|
|
| |||
| Length/height models | |||
| Birth length | 50.61 (2.36) | 50.72 (1.65) | −0.11 (1.19) |
| >0 to ≤3 months | 57.25 (3.11) | 57.18 (2.66) | 0.07 (1.33) |
| >3 months to ≤1 year | 69.95 (4.78) | 70.06 (4.39) | −0.11 (1.38) |
| >1 year to ≤3 years | 83.64 (5.56) | 83.61 (5.12) | 0.02 (1.48) |
| >3 years to ≤10 years | 117.70 (15.26) | 117.65 (15.00) | 0.05 (1.71) |
| Weight models | |||
| Birth weight | 3.40 (0.55) | 3.36 (0.47) | 0.04 (0.15) |
| >0 to ≤3 months | 4.61 (0.91) | 4.65 (0.87) | −0.04 (0.17) |
| >3 months to ≤1 year | 8.49 (1.48) | 8.45 (1.44) | 0.05 (0.27) |
| >1 year to ≤7 years | 15.28 (4.26) | 15.29 (4.11) | −0.01 (0.79) |
| >7 years to ≤10 years | 29.41 (6.48) | 29.42 (6.14) | −0.01 (1.06) |
ALSPAC: The Avon Longitudinal Study of Parents and Children.
Growth rates predicted by linear spline multilevel models for girls.
| Mean (SD) growth rates | ||||||
|---|---|---|---|---|---|---|
| ALSPAC | Born in Bradford (White British ethnicity) | Born in Bradford (Pakistani ethnicity) | Generation XXI | Pelotas 2004 | PROBIT | |
|
| ||||||
| Girls | 6731 | 316 | 396 | 2465 | 2018 | 7905 |
| Birth length (cm) | 50.00 (1.57) | 48.36 (1.16) | 47.91 (1.04) | 48.30 (1.86) | 47.77 (1.87) | 51.42 (1.66) |
| Early infancy (cm/month) | 3.57 (0.16) | 3.54 (0.39) | 3.92 (0.36) | 3.79 (0.22) | 3.83 (0.31) | 2.96 (0.45) |
| Late infancy (cm/month) | 1.64 (0.15) | 1.63 (0.27) | 1.59 (0.25) | 1.70 (0.17) | 1.59 (0.15) | 1.77 (0.17) |
| Early childhood (cm/month) | 0.83 (0.07) | 0.94 (0.05) | 0.95 (0.05) | 0.90 (0.08) | 1.04 (0.12) | 0.76 (0.04) |
| Later childhood (cm/month) | 0.53 (0.04) | N/A | N/A | 0.53 (0.02) | 0.65 (0.05) | N/A |
|
| ||||||
| Girls | 6731 | 316 | 396 | 2616 | 2018 | 7905 |
| Birth weight (kg) | 3.30 (0.44) | 3.16 (0.51) | 2.98 (0.47) | 3.11 (0.40) | 3.10 (0.33) | 3.33 (0.31) |
| Early neonatal (kg /month) | N/A | N/A | N/A | 0.42 (0.64) | N/A | N/A |
| Early infancy (kg /month) | 0.88 (0.14) | 0.83 (0.17) | 0.82 (0.15) | 0.87 (0.16) | 0.81 (0.11) | 0.87 (0.11) |
| Late infancy (kg /month) | 0.45 (0.09) | 0.44 (0.12) | 0.42 (0.13) | 0.40 (0.09) | 0.37 (0.08) | 0.53 (0.07) |
| Early childhood (kg /month) | 0.19 (0.04) | 0.22 (0.04) | 0.24 (0.05) | 0.22 (0.05) | 0.23 (0.05) | 0.15 (0.02) |
| Later childhood (kg /month) | 0.32 (0.10) | N/A | N/A | 0.19 (0.05) | N/A | N/A |
ALSPAC: The Avon Longitudinal Study of Parents and Children; PROBIT: the Promotion of Breastfeeding Intervention Trial; BiB: Born in Bradford.
Growth rates for early neonatal, early infancy, late infancy, early childhood and late childhood, respectively: ALSPAC length/height: n/a, 0–3 months, 3–12 months, 1–3 years, 3–10 years; ALSPAC weight: n/a, 0–3 months, 3–12 months, 1–7 years, 7–10 years; BiB length and weight: n/a, 0–4 months, 4–9 months, 9–24 months, n/a; Generation XXI length/height: n/a, 0–3 months, 3–12 months, 1–3 years, 3–6 years; Generation XXI weight: 0–10 days, 10 days–3 months, 3–12 months, 1–3 years, 3–6 years; Pelotas length/height: n/a, 0–3 months, 3–12 months, 1–2 years, 2–4 years; Pelotas weight: n/a, 0–3 months, 3–24 months, 2–4 years, n/a; PROBIT length/height and weight: n/a, 0–3 months, 3–12 months, 1–3 years, 3–5 years.
Suggested workflow for the application of linear spline multilevel models.
| 1. Choose approximate knot point locations. Possible strategies include the following: |
| a. Use knowledge of the underlying biology of growth patterns to decide the choice of knot point positioning. |
| b. Identify the best-fitting curve (using fractional polynomials or other method) and use the derivatives of the curve to identify the number and approximate timings of knot points. |
| c. Identify the best-fitting curve (using fractional polynomials or other method) and visually estimate the number and approximate timings of knot points. |
| d. Start with a large number of knot points, gradually reducing the number until a ‘smooth’ curve is achieved. |
| e. Place knot points at the centiles of the distribution of age. |
| f. Use stepwise regression to select knots where there is statistical evidence of a difference between the linear slopes either side of the knot point. |
| g. For datasets with a small number of follow-up occasions and limited inter-individual variability in the timing of follow-ups, knot points can be placed at the target ages of follow-up occasions. |
| h. When modelling child growth data, it may be possible to start with the knot points identified in the five cohorts shown in this paper (i.e. 10 days in data permits, then 3–4 months, 9–12 months, 3 years (height) and 7 years (weight)). |
| 2. Test various models to determine the final knot point locations. We suggest that this choice should be a combination of model fit (which may be assessed by likelihood values, other model fit statistics, residual standard deviation, differences between observed and predicted measurements, etc.) and interpretability. |
| 3. Identify which covariates will be included and how these will be modelled, for example, as fixed effects only or as random effects, with or without interactions between the covariates and the linear splines. |
| 4. Determine whether it is necessary to model complex level 1 variation (e.g. this will be more important when the variance of the outcome changes over time) and how best to model this (e.g. age could be modelled as a continuous term or as each of the linear splines) – comparisons of model fit will be important in this process. |
| 5. Perform model checks. Potential model checks include the following: |
| a. Examine the difference between observed and predicted measurements (range, mean and reference range) overall and in different periods of time to check whether model fit is consistent across the time range of the model. |
| b. Assess the normality of level 1 and level 2 residuals. |
| c. Plot the difference between consecutive level 1 residuals for the same individual against the time difference between the consecutive measures as a check for autocorrelation. If autocorrelation is detected, some statistical software packages can model this. |
| d. Depending on the nature of the dataset, it may be prudent to conduct various sensitivity analyses, for example, verify that models are not dominated by individuals with a greater than average number of measurements, verify that the selected knot points are appropriate for all sub-groups of the population, verify that model fit is similar for all sub-groups of the population, etc. |