| Literature DB >> 30094965 |
Craig Anderson1,2,3, Ryan Hafen4, Oleg Sofrygin5, Louise Ryan1,2.
Abstract
The Bill and Melinda Gates Foundation's Healthy Birth, Growth and Development knowledge integration project aims to improve the overall health and well-being of children across the world. The project aims to integrate information from multiple child growth studies to allow health professionals and policy makers to make informed decisions about interventions in lower and middle income countries. To achieve this goal, we must first understand the conditions that impact on the growth and development of children, and this requires sensible models for characterising different growth patterns. The contribution of this paper is to provide a quantitative comparison of the predictive abilities of various statistical growth modelling techniques based on a novel leave-one-out validation approach. The majority of existing studies have used raw growth data for modelling, but we show that fitting models to standardised data provide more accurate estimation and prediction. Our work is illustrated with an example from a study into child development in a middle income country in South America.Entities:
Keywords: child development; growth curve; trajectory
Mesh:
Year: 2018 PMID: 30094965 PMCID: PMC6767565 DOI: 10.1002/sim.7693
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Summary of relevant studies within the Healthy Birth, Growth and Development knowledge integration project
| Obs Per Child | Child Age, d | |||||||
|---|---|---|---|---|---|---|---|---|
| Dataset | No. of Children | No. of Obs | Min | Max | Median | Min | Max | Median |
|
| 7637 | 18983 | 1 | 4 | 2 | 168 | 927 | 541 |
|
| 197 | 2352 | 1 | 15 | 14 | 95 | 1903 | 804 |
|
| 373 | 12478 | 23 | 37 | 34 | 1 | 1111 | 558 |
|
| 3125 | 35506 | 1 | 37 | 9 | 1 | 1846 | 446 |
|
| 197 | 4405 | 10 | 41 | 21 | 1 | 702 | 116 |
|
| 20510 | 158892 | 1 | 19 | 6 | 1 | 6954 | 718 |
|
| 380 | 8436 | 2 | 26 | 23 | 1 | 1175 | 343 |
|
| 1544 | 28823 | 1 | 77 | 16 | 1 | 6954 | 2746 |
|
| 315 | 2548 | 1 | 13 | 10 | 119 | 493 | 269 |
|
| 22545 | 43158 | 1 | 2 | 2 | 5 | 1908 | 1942 |
|
| 203 | 1427 | 1 | 17 | 7 | 1 | 521 | 136 |
|
| 27363 | 122139 | 1 | 6 | 5 | 1 | 960 | 92 |
|
| 2954 | 41587 | 1 | 69 | 13 | 0 | 900 | 309 |
|
| 2144 | 46499 | 1 | 25 | 25 | 1 | 732 | 336 |
|
| 153 | 1839 | 1 | 16 | 13 | 1 | 679 | 185 |
|
| 412 | 2279 | 1 | 8 | 7 | 193 | 1282 | 628 |
|
| 16898 | 174233 | 1 | 14 | 11 | 1 | 3287 | 275 |
|
| 302 | 1140 | 2 | 4 | 4 | 153 | 457 | 265 |
|
| 278 | 3177 | 1 | 33 | 13 | 1 | 525 | 211 |
|
| 2027 | 15637 | 1 | 10 | 8 | 18 | 1095 | 280 |
|
| 14086 | 64867 | 1 | 10 | 5 | 1 | 1132 | 115 |
Figure 1Fitted growth trajectory of a single child based on fitting each of our 6 models on the raw scale. The blue dots are the data, the black dot is the held out datapoint, and the red line is the model fit. FACE, fast covariance estimation; SITAR, Superimposition by Translation and Rotation [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 2Fitted growth trajectory of a single child based on fitting 5 of our models on the Z scale (SITAR was not fitted on this scale). The blue dots are the data, the black dot is the held out datapoint, and the red line is the model fit. FACE, fast covariance estimation; HAZ, height‐for‐age Z score; SITAR, Superimposition by Translation and Rotation [Colour figure can be viewed at wileyonlinelibrary.com]
MSE results for random value holdout cross‐validation
| Brokenstick (raw) | Brokenstick (Z) | FACE (raw) | FACE (Z) | Penalised Splines (raw) | Penalised Splines (Z) | |
|---|---|---|---|---|---|---|
|
| 0.17 |
| 0.17 | 0.17 | 1.58 | 0.19 |
|
| 0.10 | 0.06 | 0.10 |
| 1.37 | 0.38 |
|
| 0.36 | 0.27 |
| 0.24 | 0.26 | 0.29 |
|
| 0.55 | 0.29 |
| 0.27 | 0.32 | 0.32 |
|
| 0.07 | 0.05 |
| 0.04 | 0.07 | 0.07 |
|
| 3.14 |
| 1.05 | 0.61 | ||
|
| 0.21 | 0.18 | 0.12 |
| 0.16 | 0.16 |
|
| 1.18 |
| 0.25 | 0.14 | ||
|
| 0.07 | 0.07 | 0.06 |
| 2.01 | 0.17 |
|
| 0.36 |
| 0.36 | 0.36 | ||
|
| 0.44 | 0.42 |
| 0.41 | 0.50 | 0.51 |
|
| 0.89 |
| 0.42 | 0.42 | ||
|
|
| 1.20 | 0.62 | 0.81 | ||
|
| 0.22 |
| 0.19 | 0.19 | ||
|
| 0.18 | 0.13 | 0.12 |
| 1.03 | 0.16 |
|
| 0.06 | 0.06 |
| 0.06 | 2.11 | 0.13 |
|
| 1.55 | 0.91 | 0.84 |
| ||
|
| 0.08 | 0.08 | 0.08 |
| 3.07 | 0.12 |
|
| 0.54 | 0.52 | 0.47 |
| 0.56 | 0.54 |
|
| 0.35 | 0.26 | 0.26 |
| 4.37 | 0.58 |
|
| 1.44 | 1.18 | 1.18 |
|
Abbreviations: FACE, fast covariance estimation; MSE, mean squared error. The lowest MSE for each dataset is displayed in bold.
MSE results for last value holdout cross‐validation
| Brokenstick (raw) | Brokenstick (Z) | FACE (raw) | FACE (Z) | Penalised Splines (raw) | Penalised Splines (Z) | |
|---|---|---|---|---|---|---|
|
|
| 0.16 | 0.17 | 0.17 | 0.82 | 0.18 |
|
| 0.08 | 0.10 | 0.07 |
| 1.17 | 0.38 |
|
| 0.10 | 0.15 |
| 0.13 | 0.17 | 0.30 |
|
| 0.31 | 0.25 |
| 0.25 | 0.29 | 0.35 |
|
|
| 0.04 | 0.04 | 0.04 | 0.07 | 0.07 |
|
| 0.94 |
| 0.59 | 0.56 | ||
|
| 0.04 | 0.06 |
| 0.04 | 0.06 | 0.06 |
|
| 0.41 |
| 0.27 | 0.18 | ||
|
|
| 0.06 | 0.06 | 0.06 | 0.91 | 0.12 |
|
| 0.34 |
| 0.35 | 0.36 | ||
|
| 0.36 | 0.36 | 0.36 |
| 0.44 | 0.44 |
|
| 0.66 |
| 0.50 | 0.50 | ||
|
|
| 0.57 | 0.55 | 0.60 | ||
|
|
| 0.14 | 0.14 | 0.14 | ||
|
| 0.16 |
| 0.24 | 0.30 | 0.60 | 0.21 |
|
|
| 0.10 | 0.10 | 0.10 | 1.71 | 0.16 |
|
|
| 0.62 | 0.70 | 0.59 | ||
|
| 0.11 | 0.11 | 0.11 |
| 0.74 | 0.15 |
|
| 0.51 | 0.51 |
| 0.49 | 0.56 | 0.55 |
|
| 0.72 | 0.72 | 0.71 |
| 2.82 | 1.49 |
|
| 1.11 |
| 1.04 | 1.03 |
Abbreviations: FACE, fast covariance estimation; MSE, mean squared error. The lowest MSE for each dataset is displayed in bold.
Figure 3Comparison of results from fitting the brokenstick model on raw data and on Z‐transformed data. Each point represents the MSE values obtained from random holdout on 1 dataset. HAZ, height‐for‐age Z score; MSE, mean squared error [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 4Comparison of results from fitting the brokenstick model on raw data and on Z‐transformed data. Each point represents the MSE values obtained from last value holdout on 1 dataset. HAZ, height‐for‐age Z score; MSE, mean squared error [Colour figure can be viewed at wileyonlinelibrary.com]
Mean squared error for brokenstick model with different numbers of knots
| Knots | Random Holdout | Last Value Holdout |
|---|---|---|
| 3 | 0.083 | 0.088 |
| 4 | 0.059 | 0.050 |
| 5 | 0.054 | 0.042 |
| 6 | 0.051 | 0.042 |
| 7 | 0.049 | 0.044 |
Mean squared error for fast covariance estimation model with different numbers of knots
| Knots | Random Holdout | Last Value Holdout |
|---|---|---|
| 4 | 0.051 | 0.046 |
| 6 | 0.049 | 0.043 |
| 8 | 0.045 | 0.044 |
| 10 | 0.043 | 0.043 |
| 12 | 0.042 | 0.043 |
Mean squared error for penalised spline model with different numbers of knots
| Population Knots | Individual Knots | Random Holdout | Last Value Holdout |
|---|---|---|---|
| 5 | 3 | 0.120 | 0.077 |
| 5 | 5 | 0.111 | 0.066 |
| 5 | 7 | 0.230 | 0.203 |
| 10 | 3 | 0.120 | 0.076 |
| 10 | 5 | 0.112 | 0.065 |
| 10 | 7 | 0.227 | 0.205 |
| 15 | 3 | 0.122 | 0.077 |
| 15 | 5 | 0.112 | 0.066 |
| 15 | 7 | 0.228 | 0.205 |