| Literature DB >> 31394869 |
Colleen X Muñoz1, Michael Wininger2,3,4.
Abstract
With the collection of water-intake data, the National Health and Nutrition Examination Survey (NHANES) is becoming an increasingly popular resource for large-scale inquiry into human hydration. However, are we leveraging this resource properly? We sought to identify the opportunities and limitations inherent in hydration-related inquiry within a commonly studied database of hydration and nutrition. We also sought to critically review models published from this dataset. We reproduced two models published from the NHANES dataset, assessing the goodness of fit through conventional means (proportion of variance, R2). We also assessed model sensitivity to parameter configuration. Models published from the NHANES dataset typically yielded a very low goodness of fit R2 < 0.15. A reconfiguration of variables did not substantially improve model fit, and the goodness of fit of models published from the NHANES dataset may be low. Database-driven inquiry into human hydration requires the complete reporting of model diagnostics in order to fully contextualize findings. There are several emergent opportunities to potentially increase the proportion of explained variance in the NHANES dataset, including novel biomarkers, capturing situational variables (meteorology, for example), and consensus practices for adjustment of co-variates.Entities:
Keywords: NHANES; big data; chronic disease; database; hydration; modeling; obesity; water intake
Mesh:
Year: 2019 PMID: 31394869 PMCID: PMC6722508 DOI: 10.3390/nu11081828
Source DB: PubMed Journal: Nutrients ISSN: 2072-6643 Impact factor: 5.717
Figure 1Univariate regressions from the National Health and Nutrition Examination Survey (NHANES) 2009–2010 and 2011–2012 datasets, based on Chang et al. (Left) and Rosinger et al. (Right). Both regressions have highly significant p-values but negligible R2.
Figure 2Model goodness (R2) versus hydration threshold (via urine osmolality) in sensitivity analysis of two database studies on hydration in relation to body composition.
Figure 3Model goodness (R2) comparison, incorporating hydration as a continuous variable versus categorical variable R2 < 0.12 in all models.
Goodness of fit of Rosinger et al. with various water intake stratifications used in previous publications. Cells contain R2 values obtained from regression models built from the published model. Top row is the replicated model from Rosinger et al.; all other rows use same dataset and same model as Rosinger et al., but with the altered stratification of a single variable (water intake). Some stratifications showed an improved goodness of fit (R2 greater than Rosinger et al.); some stratifications showed a degraded goodness of fit (R2 less than Rosinger et al.). All models showed a generally weak model fit (R2 < 0.15).
| Norm. | OWt | Obese | All | Strata (Water Intake (mL/day)) | |
|---|---|---|---|---|---|
| Rosinger, 2016 [ | 0.111 | 0.110 | 0.107 | 0.095 | <2700 (F), <3700 (M), <3800 (Lactating F) |
| Armstrong, 2012 [ | 0.132 | 0.109 | 0.114 | 0.101 | {0, 1507, 1745, 2109, 2507, 2945, 3407, ∞} |
| Armstrong, 2010 [ | 0.132 | 0.110 | 0.113 | 0.100 | {0, 1382, 2008, 2048, 2453, 2614, 3261, ∞} |
| Johnson, 2015 [ | 0.127 | 0.108 | 0.107 | 0.099 | {0, 1620, 3210, ∞} |
| Muñoz, 2015 [ | 0.126 | 0.106 | 0.111 | 0.099 | {0, 1500, 2250, 3130, ∞} |
| Sontrop, 2013 [ | 0.114 | 0.109 | 0.112 | 0.097 | {0, 2000, 4300} |
| Pross, 2014 [ | 0.101 | 0.095 | 0.093 | 0.084 | {0, 1200, 2000, ∞} |
| Perrier, 2013 [ | 0.107 | 0.099 | 0.103 | 0.080 | {0, 1200; 2000, 4000} 1 |
| Roussel, 2011 [ | 0.080 | 0.086 | 0.094 | 0.077 | {0, 500, 1000, ∞} |
Norm = Normal Weight; OWt = Overweight. Strata defined or inspired by recent studies in hydration inquiry. 1 Middle hydration group (1200–2000 mL/d) and extremely hydrated (>4000 mL/d) censored.