| Literature DB >> 31009509 |
Robert Hammond1, Rodoniki Athanasiadou1, Silvia Curado1,2, Yindalon Aphinyanaphongs1,3, Courtney Abrams1,3, Mary Jo Messito1,4, Rachel Gross1,4, Michelle Katzow1,4, Melanie Jay1,3,5, Narges Razavian1,3,6, Brian Elbel1,3,7.
Abstract
BACKGROUND: Because of the strong link between childhood obesity and adulthood obesity comorbidities, and the difficulty in decreasing body mass index (BMI) later in life, effective strategies are needed to address this condition in early childhood. The ability to predict obesity before age five could be a useful tool, allowing prevention strategies to focus on high risk children. The few existing prediction models for obesity in childhood have primarily employed data from longitudinal cohort studies, relying on difficult to collect data that are not readily available to all practitioners. Instead, we utilized real-world unaugmented electronic health record (EHR) data from the first two years of life to predict obesity status at age five, an approach not yet taken in pediatric obesity research. METHODS ANDEntities:
Mesh:
Year: 2019 PMID: 31009509 PMCID: PMC6476510 DOI: 10.1371/journal.pone.0215571
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Factors at the prenatal and infancy periods associated with early childhood obesity by age five.
Adapted from González-Muniesa et al. [21].
Number of children included at each selection criteria.
| Selection Criteria (in order) | N Boys | N Girls | N |
|---|---|---|---|
| 1) Full data set | 26,507 | 26,438 | 52,945 |
| 2a) BMI reading between 4.5 and 5.5 years | 5,775 | 5,719 | 11,494 |
| 2b) BMI reading is valid | 5,770 | 5,714 | 11,484 |
| 3) At least one data point prior to 2 years of age | 2,860 | 2,886 | 5,746 |
| 4) Maternal data available | 1,751 | 1,700 | 3,451 |
Number of features by category.
| Feature Category | Number of Features | Number of Features with at Least 1 Occurrence | Number of Features with at Least 5 Occurrences |
|---|---|---|---|
| Diagnosis | 566 | 160 | 107 |
| Lab | 549 | 73 | 57 |
| Medication | 2,968 | 78 | 14 |
| Gender | 2 | 2 | 2 |
| Ethnicity | 2 | 2 | 2 |
| Race | 11 | 9 | 8 |
| Vital | 475 | 255 | 255 |
| Number of visits | 1 | 1 | 1 |
| Zip code | 652 | 207 | 86 |
| Census | 34 | 34 | 34 |
| Maternal diagnosis | 3,962 | 473 | 257 |
| Newborn diagnosis | 566 | 52 | 22 |
| Maternal ethnicity | 4 | 3 | 3 |
| Primary insurance | 419 | 67 | 29 |
| Secondary insurance | 120 | 16 | 3 |
| Maternal race | 7 | 7 | 5 |
| Maternal language | 30 | 7 | 5 |
| Maternal nationality | 126 | 61 | 24 |
| Maternal marriage status | 7 | 5 | 5 |
| Maternal birthplace | 142 | 56 | 23 |
| Maternal delivery age | 1 | 1 | 1 |
| Maternal lab history | 5,700 | 573 | 477 |
| Maternal procedure history | 2,946 | 169 | 89 |
Individual feature associations with obesity between ages 4.5 and 5.5.
| Variable | % of EHR Population | Girls | Boys | ||||
|---|---|---|---|---|---|---|---|
| % of Cohort | Odds Ratio (95% CI) | p-value for OR | % of Cohort | Odds Ratio (95% CI) | p-value for OR | ||
| 52,945 | 1,698 | - | - | 1751 | - | - | |
| Not Hispanic/Latina | 24% | 17% | 0.587 (0.395, 0.874) | 0.009 | 21% | 0.714 (0.529, 0.963) | 0.027 |
| Hispanic/Latino | 49% | 82% | 1.546 (1.053, 2.269) | 0.026 | 79% | 1.399 (1.039, 1.884) | 0.027 |
| Other/Not Reported | 27% | 0% | - | - | 0% | - | - |
| Caucasian/White | 15% | 5% | 1.151 (0.65, 2.038) | 0.630 | 4% | 0.827 (0.482, 1.421) | 0.492 |
| African Amer/Black | 13% | 5% | 1.913 (1.119, 3.27) | 0.018 | 5% | 0.907 (0.526, 1.565) | 0.725 |
| Asian | 10% | 9% | 0.204 (0.089, 0.466) | p<0.001 | 10% | 0.623 (0.419, 0.925) | 0.019 |
| Multiracial | 42% | 77% | 1.085 (0.786, 1.497) | 0.622 | 68% | 1.283 (0.979, 1.681) | 0.071 |
| Other | 14% | 3% | 1.828 (0.965, 3.462) | 0.064 | 1.321 (0.748, 2.332) | 0.337 | |
| Unknown/No Response | 6% | 0% | - | - | 0% | - | - |
| Married | 7% | 36% | 0.702 (0.526, 0.938) | 0.017 | 36% | 0.875 (0.689, 1.111) | 0.272 |
| Divorced | 0% | 0% | 1.885 (0.378, 9.389) | 0.439 | 1% | 2.546 (0.804, 8.067) | 0.112 |
| Partnered | 4% | 32% | 1.135 (0.856, 1.504) | 0.379 | 30% | 1.067 (0.836, 1.363) | 0.601 |
| Single | 6% | 31% | 1.15 (0.867, 1.524) | 0.332 | 33% | 1.057 (0.832, 1.343) | 0.651 |
| Other/Unknown/No Response | 0% | 1% | 5.725 (1.645, 19.92) | 0.006 | 0.587 (0.131, 2.635) | 0.487 | |
| No Data Available | 83% | 0% | - | - | 0% | - | - |
| United States | 3% | 12% | 1.436 (0.992, 2.081) | 0.055 | 13% | 1.041 (0.749, 1.447) | 0.810 |
| China | 2% | 9% | 0.25 (0.115, 0.539) | p<0.001 | 11% | 0.64 (0.431, 0.951) | 0.027 |
| Dominican Republic | 1% | 3% | 1.681 (0.871, 3.243) | 0.122 | 4% | 2.369 (1.441, 3.896) | p<0.001 |
| Ecuador | 1% | 5% | 2.443 (1.475, 4.048) | p<0.001 | 5% | 1.011 (0.591, 1.729) | 0.968 |
| Mexico | 7% | 53% | 0.788 (0.604, 1.028) | 0.079 | 50% | 1.078 (0.86, 1.351) | 0.515 |
| El Salvador | 0% | 3% | 1.425 (0.703, 2.887) | 0.326 | 3% | 1.011 (0.496, 2.06) | 0.977 |
| Guatemala | 1% | 5% | 0.589 (0.292, 1.187) | 0.139 | 5% | 0.695 (0.401, 1.203) | 0.193 |
| Other | 2% | 10% | 1.418 (0.94, 2.139) | 0.096 | 0.905 (0.603, 1.358) | 0.629 | |
| No Data Available | 83% | 0% | - | - | 0% | - | - |
| Diabetes Mellitus in pregnancy | - | 10% | 2.045 (1.396, 2.995) | p<0.001 | 11% | 1.605 (1.15, 2.24) | 0.005 |
| Diabetes Mellitus without complications | - | 5% | 2.093 (1.262, 3.47) | 0.004 | 5% | 1.935 (1.216, 3.08) | 0.005 |
| Hypertension in pregnancy | - | 9% | 1.745 (1.167, 2.61) | 0.007 | 12% | 1.377 (0.987, 1.92) | 0.060 |
| Complications at birth | - | 43% | 1.29 (0.988, 1.685) | 0.061 | 45% | 1.158 (0.923, 1.452) | 0.204 |
| OB-related perin trauma | - | 41% | 0.781 (0.592, 1.029) | 0.078 | 39% | 0.815 (0.645, 1.031) | 0.088 |
| Pelvic obstruction | - | 2% | 1.36 (0.552, 3.35) | 0.503 | 2% | 1.931 (1.02, 3.653) | 0.043 |
| Nutritional diagnosis | - | 0% | 0 (0, 0) | 0.083 | 0% | 0 (0, 0) | 0.000 |
| Epilepsy/convulsions | - | 1% | 2.483 (1.053, 5.853) | 0.766 | 1% | 2.483 (1.053, 5.853) | 0.038 |
| Liver Diseases | - | 10% | 0.743 (0.492, 1.122) | 0.153 | 10% | 0.743 (0.492, 1.122) | 0.158 |
| Skin Diseases | - | 11% | 1.022 (0.735, 1.419) | 0.252 | 14% | 1.022 (0.735, 1.419) | 0.899 |
| Kidney Diseases | - | 1% | 1.144 (0.556, 2.356) | 0.334 | 2% | 1.144 (0.556, 2.356) | 0.714 |
| Circulatory Diseases | - | 1% | 2.386 (0.968, 5.88) | 0.000 | 1% | 2.386 (0.968, 5.88) | 0.059 |
Individual feature associations for girls with obesity between ages 4.5 and 5.5.
| Variable | Total Number | Total Average (SD) | % Obese (N) | Obese Average (SD) | % Not Obese (N) | Not Obese Average (SD) | p-value |
|---|---|---|---|---|---|---|---|
| Weight for Length Z-score (average 19 to 24 months) | 1,347 | 1.042 (1.106) | 22.7% (316) | 1.899 (1.029) | 77.3% (1,076) | 0.79 (0.996) | p<0.001 |
| BMI (average 19 to 24 months) | 1,355 | 17.547 (1.786) | 22.7% (318) | 18.869 (1.818) | 77.3% (1,083) | 17.158 (1.578) | p<0.001 |
| Weight for Length Z-score (latest available reading) | 1,612 | 0.99 (1.166) | 22.1% (368) | 1.806 (1.135) | 77.9% (1,297) | 0.759 (1.066) | p<0.001 |
| BMI (latest available reading) | 1,624 | 17.509 (1.806) | 22.1% (371) | 18.734 (1.953) | 77.9% (1,304) | 17.161 (1.599) | p<0.001 |
Individual feature associations for boys with obesity between ages 4.5 and 5.5.
| Variable | Total Number | Total Average (SD) | % Obese (N) | Obese Average (SD) | % Not Obese (N) | Not Obese Average (SD) | p-value |
|---|---|---|---|---|---|---|---|
| Weight for Length Z-score (average 19 to 24 months) | 1,392 | 1.042 (1.106) | 23.5% (316) | 1.899 (1.029) | 79.9% (1,076) | 0.79 (0.996) | p<0.001 |
| BMI (average 19 to 24 months) | 1,401 | 17.547 (1.786) | 23.5% (318) | 18.869 (1.818) | 79.9% (1,083) | 17.158 (1.578) | p<0.001 |
| Weight for Length Z-score (latest available reading) | 1,665 | 0.99 (1.166) | 22.8% (368) | 1.806 (1.135) | 80.5% (1,297) | 0.759 (1.066) | p<0.001 |
| BMI (latest available reading) | 1,675 | 17.509 (1.806) | 22.8% (371) | 18.734 (1.953) | 80.3% (1,304) | 17.161 (1.599) | p<0.001 |
Fig 2ROC curves for the top performing model compared to individual feature predictions.
Fig 3Precision recall curves for the top performing model compared to individual feature predictions.
Performance tradeoffs for the best performing model for girls.
| Sensitivity | PPV | Specificity | Accuracy | F1 | MCC | N Obese (TP + FP) | N Not Obese (TN + FN) |
|---|---|---|---|---|---|---|---|
| 0.145 | 0.889 | 0.996 | 0.858 | 0.250 | 0.352 | 9 | 330 |
| 0.200 | 0.786 | 0.989 | 0.861 | 0.319 | 0.126 | 14 | 325 |
| 0.291 | 0.571 | 0.958 | 0.850 | 0.386 | 0.030 | 28 | 311 |
| 0.418 | 0.535 | 0.930 | 0.847 | 0.469 | 0.021 | 43 | 296 |
| 0.491 | 0.519 | 0.912 | 0.844 | 0.505 | 0.018 | 52 | 287 |
| 0.600 | 0.371 | 0.803 | 0.770 | 0.458 | 0.007 | 89 | 250 |
| 0.691 | 0.355 | 0.757 | 0.746 | 0.469 | 0.006 | 107 | 232 |
| 0.800 | 0.293 | 0.627 | 0.655 | 0.429 | 0.004 | 150 | 189 |
| 0.891 | 0.261 | 0.511 | 0.572 | 0.403 | 0.003 | 188 | 151 |
Performance tradeoffs for the best performing model for boys.
| Sensitivity | PPV | Specificity | Accuracy | F1 | MCC | N Obese (TP + FP) | N Not Obese (TN + FN) |
|---|---|---|---|---|---|---|---|
| 0.084 | 0.700 | 0.989 | 0.774 | 0.151 | 0.071 | 10 | 340 |
| 0.205 | 0.567 | 0.951 | 0.774 | 0.301 | 0.021 | 30 | 320 |
| 0.301 | 0.543 | 0.921 | 0.774 | 0.388 | 0.015 | 46 | 304 |
| 0.398 | 0.458 | 0.854 | 0.746 | 0.426 | 0.008 | 72 | 278 |
| 0.506 | 0.442 | 0.801 | 0.731 | 0.472 | 0.007 | 95 | 255 |
| 0.602 | 0.435 | 0.757 | 0.720 | 0.505 | 0.006 | 115 | 235 |
| 0.699 | 0.397 | 0.670 | 0.677 | 0.507 | 0.005 | 146 | 204 |
| 0.795 | 0.346 | 0.532 | 0.594 | 0.482 | 0.003 | 191 | 159 |
| 0.904 | 0.306 | 0.363 | 0.491 | 0.457 | 0.003 | 245 | 105 |