| Literature DB >> 22689751 |
Chirag J Patel1, Rong Chen, Atul J Butte.
Abstract
MOTIVATION: Complex diseases, such as Type 2 Diabetes Mellitus (T2D), result from the interplay of both environmental and genetic factors. However, most studies investigate either the genetics or the environment and there are a few that study their possible interaction in context of disease. One key challenge in documenting interactions between genes and environment includes choosing which of each to test jointly. Here, we attempt to address this challenge through a data-driven integration of epidemiological and toxicological studies. Specifically, we derive lists of candidate interacting genetic and environmental factors by integrating findings from genome-wide and environment-wide association studies. Next, we search for evidence of toxicological relationships between these genetic and environmental factors that may have an etiological role in the disease. We illustrate our method by selecting candidate interacting factors for T2D.Entities:
Mesh:
Year: 2012 PMID: 22689751 PMCID: PMC3371861 DOI: 10.1093/bioinformatics/bts229
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Method of integrating epidemiological and toxicological data to create variant by environment interaction candidates. (A) Summary of epidemiological databases, VARIMED (source of genetic associations, red) and NHANES (source of environmental data, green). (B) Factors are chosen by their strength of association to T2D depicted by Manhattan plots from VARIMED or EWAS process with NHANES. A threshold p and q are chosen to choose candidate factors (examples in italics). (C) Variants are mapped to gene promoter, exon or intron; environmental factors are mapped to MeSH ID. (D) Factors are queried for curated evidence regarding molecular-level interaction in CTD. Knowledge is coded in CTD in an example on right. (E) Presence of interacting pair in CTD is a candidate for further study in context of T2D
Fig. 2.Proportion of VARIMED genes covered in CTD versus number of total genes associated for a particular phenotype. We associated a gene with a phenotype bases on presence of a variant in an intron, exon or promoter region for that gene. Variants are associated with a phenotype with a P-value <1 × 10−6. Phenotypes with >20 genes, such as ‘Metabolic Trait’, ‘Height’ and ‘Rheumatoid arthritis’ are labeled with numbers (key on right panel). T2D is depicted as 17, with 100% of 43 genes having some documented molecular interaction in the CTD. Values on the x-axis jittered to show density
Fig. 3.Number of candidate genetic factor by environmental factor interactions derived from varying significance thresholds from genetic association (p, x-axis) and EWAS FDR (or q, in different colors). Solid lines depict total gene-by-environment interactions. Dotted lines depict number of variant-by-environment interactions. For example, there are six interacting pairs in the CTD for genes and exposures found at a p=5×10−7 and q=10%, respectively
Predicted variant by environment interactions for factors prioritized by strength of genetic or environmental association to T2D
| Gene (snp) (reference) | Variant OR | CTD-curated molecular interaction (reference) | ||
|---|---|---|---|---|
| (95% CI) | (95% CI) | |||
| γ-Tocopherol | 1.38 (1.20–1.60) | PPARG [rs1801282] ( | 1.14 (1.08–1.2) | Increases gene expression ( |
| Cis-β-carotene | 0.66 (0.55–0.79) | PPARG [rs1801282] ( | 1.14 (1.08–1.2) | Decreases gene expression ( |
| Vitamin D | 0.68 (0.57–0.81) | PPARG [rs1801282] ( | 1.14 (1.08–1.2) | Affects gene expression ( |
| Trans-β-carotene | 0.70 (0.58–0.82) | PPARG [rs1801282] ( | 1.07 (1.05–1.09) | Decreases gene expression ( |
| PCB187 | 1.89 (1.3–2.8) | PRC1 [rs8042680] (Voight | 1.07 (1.05–1.09) | Increases gene expression ( |
| Total β-carotene | 0.71 (0.60–0.85) | PPARG [rs1801282] ( | 1.14 (1.08–1.2) | Decreases gene expression ( |
Each row depicts a candidate interaction pair, its marginal effect size (Odds ratio) and type, and citation of molecular interaction (right column). For example, in the first row, γ-tocopherol has a documented molecular interaction with PPARG (‘increasing gene expression’). Odds ratios for environmental factor are for 1 SD change in logged exposure variable. Variant odds ratios are for an additive genetic model. Factors chosen based on strength of marginal association (p≤5×10−6 and q≤10%).