| Literature DB >> 26809563 |
Abstract
It seems intuitive that disease risk is influenced by the interaction between inherited genetic variants and environmental exposure factors; however, we have few documented interactions between variants and exposures. Advances in technology may enable the simultaneous measurement (i.e., on the same individuals in an epidemiological study) of millions of genome variants with thousands of environmental "exposome" factors, significantly increasing the number of possible factor pairs available for testing for the presence of interactions. The burden of analytic complexity, or sheer number of genetic and exposure factors measured, poses a considerable challenge for discovery of interactions in population-scale data. Advances in analytic approaches, large sample sizes, less conservative methods to mitigate multiple testing, and strong biological priors will be required to prune the search space to find reproducible and robust gene-by-environment interactions in observational data.Entities:
Keywords: Environment-wide association study; Exposome; Gene-by-environment interaction; Genome; Genome-wide association study
Mesh:
Year: 2016 PMID: 26809563 PMCID: PMC4789192 DOI: 10.1007/s40572-016-0080-5
Source DB: PubMed Journal: Curr Environ Health Rep ISSN: 2196-5412
Fig. 1The “space” of all possible G × E interactions. Given e number of environmental exposures of the exposome and g number of genetic variants of the genome, each putative pair of exposome and genomic factors can be potentially tested for interactions in a disease; however, it may be a challenge to detect any interactions given the breadth of the space. Paring down the space of interactions to test (seen in orange or blue) will increase power for detection of interactions (Fig. 2)
Fig. 2Power to search for one million SNPs by e number of environmental exposure interaction pairs as a function of number of factors of the exposome (x-axis), average exposure prevalence in the population (red: 5 %, green: 10 %, blue: 20 % prevalence), sample size (in columns), and effect size (odds ratio) for interaction (odds ratio for disease for both exposure and genetic variant versus neither) in rows. Other assumptions include disease prevalence is 10 % (e.g., type 2 diabetes prevalence), a case-control study (1:1 case:control ratio), the number of variants (g) is 1,000,000, risk variant frequency in the population is 10 %, and the main effects of each G is on average 1.1 (roughly what is observed in GWAS), and main effect of E is 1.5. Black horizontal line denotes 80 % power