| Literature DB >> 25914450 |
Satoshi Nagaie1, Soichi Ogishima1, Jun Nakaya2, Hiroshi Tanaka3.
Abstract
UNLABELLED: Genome-wide association studies (GWAS) and linkage analysis has identified many single nucleotide polymorphisms (SNPs) related to disease. There are many unknown SNPs whose minor allele frequencies (MAFs) as low as 0.005 having intermediate effects with odds ratio between 1.5~3.0. Low frequency variants having intermediate effects on disease pathogenesis are believed to have complex interactions with environmental factors called gene-environment interactions (GxE). Hence, we describe a model using 3D Manhattan plot called GxE landscape plot to visualize the association of p-values for gene-environment interactions (GxE). We used the Gene-Environment iNteraction Simulator 2 (GENS2) program to simulate interactions between two genetic loci and one environmental factor in this exercise. The dataset used for training contains disease status, gender, 20 environmental exposures and 100 genotypes for 170 subjects, and p-values were calculated by Cochran-Mantel-Haenszel chi-squared test on known data. Subsequently, we created a 3D GxE landscape plot of negative logarithm of the association of p-values for all the possible combinations of genetic and environmental factors with their hierarchical clustering. Thus, the GxE landscape plot is a valuable model to predict association of p-values for GxE and similarity among genotypes and environments in the context of disease pathogenesis. ABBREVIATIONS: GxE - Gene-environment interactions, GWAS - Genome-wide association study, MAFs - Minor allele frequencies, SNPs - Single nucleotide polymorphisms, EWAS - Environment-wide association study, FDR - False discovery rate, JPT+CHB - HapMap population of Japanese in Tokyo, Japan - Han Chinese in Beijing.Entities:
Year: 2015 PMID: 25914450 PMCID: PMC4404419 DOI: 10.6026/97320630011161
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Overview of methodology for creating GxE landscape plot. Chart of the 4 steps that have been used to generate GxE landscape plot with simulated data using GENS2 program: (1) preparation of data set by GENS program, (2) calculation of pvalues by Cochran-Mantel-Haenszel chi-squared test on 3 × 2 × 2 contingency table, (3) hierarchical clustering for genetic and environmental factors, (4) visualization of association p-values for all the combinations of genetic and environmental factors.
Figure 2GxE landscape plot. 3D GxE landscape plot of negative logarithm of the association p-values for all the possible combinations of genetic and environmental factors with their hierarchical clustering. Genetic and environmental factors were roughly clustered into two groups in combination of genetic and environmental factors, depicted as two peaks surrounded by yellow ellipse. (A) GxE landscape plot with hierarchical clustering for genetic and environmental factors. (B) Hierarchical clustering of genetic factors (SNPs) and 20 environmental factors. The data are shown in a table format, in which rows represent individual environmental factors and columns represent individual SNPs. The color in each cell reflects the negative logarithm of the association p-values for combinations of genetic and environmental factors. As for genetic factors, except for 2 SNPs, other SNPs were clustered into two groups surrounded by blue and red squares. As for environmental factors, all the environmental factors were also clustered into two groups clearly surrounded by blue and red squares.