| Literature DB >> 35954804 |
Ben Allen1, Morgan Lane2, Elizabeth Anderson Steeves3, Hollie Raynor3.
Abstract
Ecological theories suggest that environmental, social, and individual factors interact to cause obesity. Yet, many analytic techniques, such as multilevel modeling, require manual specification of interacting factors, making them inept in their ability to search for interactions. This paper shows evidence that an explainable artificial intelligence approach, commonly employed in genomics research, can address this problem. The method entails using random intersection trees to decode interactions learned by random forest models. Here, this approach is used to extract interactions between features of a multi-level environment from random forest models of waist-to-height ratios using 11,112 participants from the Adolescent Brain Cognitive Development study. This study shows that methods used to discover interactions between genes can also discover interacting features of the environment that impact obesity. This new approach to modeling ecosystems may help shine a spotlight on combinations of environmental features that are important to obesity, as well as other health outcomes.Entities:
Keywords: adolescent obesity; ecological theory; explainable artificial intelligence; household income; machine learning; neighborhood education; neighborhood poverty; parent education
Mesh:
Year: 2022 PMID: 35954804 PMCID: PMC9367834 DOI: 10.3390/ijerph19159447
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Participant demographics and Waist-to-Height Ratio Z-score (N = 11,112).
|
| |
| Male, n (%) | 5811 (52.3%) |
| Female, n (%) | 5301 (47.7%) |
|
| |
| Mean (SD) | 9.92 (0.626) |
| Median [Min, Max] | 9.92 [8.92, 11.1] |
|
| |
| Mean (SD) | 0.208 (1.05) |
| Median [Min, Max] | 0.215 [−3.99, 3.93] |
Figure 1Ten most important features in predicting waist-to-height ratio z-scores. Dotplot showing the decrease in residual sum of squares when the decision trees include each of the most important features. Importance values are averages of the two prediction models. The most important features all characterize some aspect of the neighborhood.
Figure 2Cross-validated interactions predictive of waist-to-height ratio z-scores. Evaluation of the interactions was based on how much variance the interaction explained in the holdout data based on the coefficient of determination (R-square). R-square values presented above are the average of the two predictive models.
Decision Rules and Median Splitting Values for each interacting variable.
| Feature A | Feature B |
|---|---|
| Parent Education < Bachelor’s Degree | <92% of Neighborhood with High School Degree |
| Household Income < $50,000 | ≥18% of Neighborhood Living in Poverty |
| Household Income < $50,000 | Neighborhood Small Particle Pollution <7.9 µg/m3 |
| Median Neighborhood Income < $72,341 | <23 min of weekly sports |
| <92% of Neighborhood with High School Degree | Median home values ≥ $215,825 |
| ≥18% of Neighborhood below poverty line | <16% Single-parent homes |
Figure 3Feature interactions predicted waist-to-height ratio z-scores. 3d surface map showing waist-to-height ratio z-scores by quantiles of interacting features. The predicted waist-to-height ratio z-scores as a function of parent education level and % of neighborhood with a high school diploma (A), total household income and % of neighborhood living in poverty (B), total household income and particle pollution (C), median neighborhood income and weekly involvement in sports (D), % of single parent homes in neighborhood and % of neighborhood living in poverty (E), % of neighborhood with a high school diploma and median neighborhood home values (F). All surface maps are drawn using the first prediction model and holdout data to show the generalizability of each interaction.