| Literature DB >> 35533073 |
Christy L Avery1,2, Annie Green Howard3,2, Anna F Ballou1, Victoria L Buchanan1, Jason M Collins1, Carolina G Downie1, Stephanie M Engel1, Mariaelisa Graff1, Heather M Highland1, Moa P Lee1, Adam G Lilly2,4, Kun Lu5, Julia E Rager5, Brooke S Staley1, Kari E North1, Penny Gordon-Larsen6,2.
Abstract
Advances in technologies to measure a broad set of exposures have led to a range of exposome research efforts. Yet, these efforts have insufficiently integrated methods that incorporate genetic data to strengthen causal inference, despite evidence that many exposome-associated phenotypes are heritable. Objective: We demonstrate how integration of methods and study designs that incorporate genetic data can strengthen causal inference in exposomics research by helping address six challenges: reverse causation and unmeasured confounding, comprehensive examination of phenotypic effects, low efficiency, replication, multilevel data integration, and characterization of tissue-specific effects. Examples are drawn from studies of biomarkers and health behaviors, exposure domains where the causal inference methods we describe are most often applied. Discussion: Technological, computational, and statistical advances in genotyping, imputation, and analysis, combined with broad data sharing and cross-study collaborations, offer multiple opportunities to strengthen causal inference in exposomics research. Full application of these opportunities will require an expanded understanding of genetic variants that predict exposome phenotypes as well as an appreciation that the utility of genetic variants for causal inference will vary by exposure and may depend on large sample sizes. However, several of these challenges can be addressed through international scientific collaborations that prioritize data sharing. Ultimately, we anticipate that efforts to better integrate methods that incorporate genetic data will extend the reach of exposomics research by helping address the challenges of comprehensively measuring the exposome and its health effects across studies, the life course, and in varied contexts and diverse populations. https://doi.org/10.1289/EHP9098.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35533073 PMCID: PMC9084332 DOI: 10.1289/EHP9098
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 11.035
Figure 1.Conceptual diagram of the exposome. By placing genetic data in the middle of four exposome domains (the natural and built environment, the social environment, the chemical environment, and health behaviors), the central role of genetic data is emphasized. Figure adapted from Vermeulen R, Schymanski EL, Barabasi AL, Miller GW. 2020. The exposome and health: Where chemistry meets biology. Science 367:392–396. Reprinted with permission from American Association for the Advancement of Science (AAAS).
Six methods or approaches that leverage genetic data to address challenges facing exposomics research and empower causal inference.
| Challenge | Statistical method or approach afforded by genetic data |
|---|---|
| Reverse causation and unmeasured confounding | Mendelian randomization |
| Comprehensive phenotype measurement and characterization of phenotypic effects | PheWAS of genetically predicted exposures in large biobanks or populations with EHR. |
| Decreased efficiency from data missing by design or from detection limits | Joint models that address missing data from exposure measured on subset of participants and detection limits by leveraging the information available from any associations between genetics and covariates with exposomic data |
| Difficulty replicating findings, particularly if exposure is not measured broadly, not measured with comparable protocols, or unidentified | External replication using genetically predicted exposures. |
| Multilevel data that may be difficult to integrate | Integrative approaches that use genetic data as a framework to link multi-omic data. |
| Limited ability to characterize tissue-specific effects | Imputation of tissue-specific biomarkers of exposure and internal dose (e.g., transcripts, methylation, metabolomics, proteins) using publicly available data. |
Figure 2.Example causal diagram representing the relationship between genetic variants G, exposure X, and outcome Y. The hypothesis tested by Mendelian randomization is shown by the dotted arrow where G serves as an instrumental variable for X (solid arrow).