| Literature DB >> 23626757 |
Zhixiang Zhu1, Xiaoran Tong, Zhihong Zhu, Meimei Liang, Wenyan Cui, Kunkai Su, Ming D Li, Jun Zhu.
Abstract
Although genome-wide association studies (GWAS) have identified a significant number of single-nucleotide polymorphisms (SNPs) associated with many complex human traits, the susceptibility loci identified so far can explain only a small fraction of the genetic risk. Among other possible explanations, the lack of a comprehensive examination of gene-gene interaction (G×G) is often considered a source of the missing heritability. Previously, we reported a model-free Generalized Multifactor Dimensionality Reduction (GMDR) approach for detecting G×G in both dichotomous and quantitative phenotypes. However, the computational burden and less efficient implementation of the original programs make them impossible to use for GWAS. In this study, we developed a graphics processing unit (GPU)-based GMDR program (named GWAS-GPU), which is able not only to analyze GWAS data but also to run much faster than the earlier version of the GMDR program. As a demonstration of the program, we used the GMDR-GPU software to analyze a publicly available GWAS dataset on type 2 diabetes (T2D) from the Wellcome Trust Case Control Consortium. Through an exhaustive search of pair-wise interactions and a selected search of three- to five-way interactions conditioned on significant pair-wise results, we identified 24 core SNPs in six genes (FTO: rs9939973, rs9940128, rs9922047, rs1121980, rs9939609, rs9930506; TSPAN8: rs1495377; TCF7L2: rs4074720, rs7901695, rs4506565, rs4132670, rs10787472, rs11196205, rs10885409, rs11196208; L3MBTL3: rs10485400, rs4897366; CELF4: rs2852373, rs608489; RUNX1: rs445984, rs1040328, rs990074, rs2223046, rs2834970) that appear to be important for T2D. Of these core SNPs, 11 in FTO, TSPAN8, and TCF7L2 have been reported to be associated with T2D, obesity, or both, providing an independent replication of previously reported SNPs. Importantly, we identified three new susceptibility genes; i.e., L3MBTL3, CELF4, and RUNX1, for T2D, a finding that warrants further investigation with independent samples.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23626757 PMCID: PMC3633958 DOI: 10.1371/journal.pone.0061943
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Working process of the GMDR-GPU program for conducting a two-dimensional interaction search on a sample consisting of four SNPs, two covariates, and a quantitative phenotype.
Comparison of time required for exhaustive pair-wise search in 5,000 subjects with 103 to 106 SNPs per sample among the 1- and 6-GPU, C++ CPU, and Java CPU implementations of GMDR-GPU.
| Implementation | 103 SNPs | 104 SNPs | 105 SNPs | 106 SNPs |
| CPU (Java) | 2.7 hr | 11 d | 3 yr | 300 yr |
| CPU (C++) | 24 min | 1.7 d | 170 d | 48 yr |
| 1 GPU | 2 sec | 4.5 min | 7.3 hr | 31 d |
| 6 GPUs | <1 sec | 45 sec | 75 min | 5 d |
Estimated from the running time of searches of smaller data sets.
Results of two- to five-way SNP-SNP interaction analysis of WTCCC T2D data.
| Characteristic | Two-way | Three-way | Four-way | Five-way |
| SNPs | 351,976 | 281 | 281 | 281 |
| Running time (hr) | 31 | 15.2 | 8.2 | 35.9 |
| Detected interactions | 203 | 16 | 13 | 175 |
| Highest testing accuracy | 0.557 | 0.559 | 0.567 | 0.569 |
| P value | <10−7 | <10−7 | <10−7 | <10−7 |
Figure 2Determination of core SNPs for T2D through two-way to five-way interaction analysis using GMDR-GPU.
Each SNP is represented by a red dot and each interaction by a vertical line. The red dots in the same horizontal line correspond to the same unique SNP. Different colors of vertical lines represent different interaction dimensions (green: two-way; blue: three-way; pink: four-way; yellow: five-way). All identified core SNPs are indicated with their IDs.
Best interaction models and their cross-validation consistencies (CVC), testing accuracies, and permutated P values based on 107 permutated samples for the identified 24 core SNPs.
| Core SNP | Best Interaction Modelrs SNP IDs (Chrom) | CVC | Testing Accuracy | Permuted P value |
| rs10485400 |
| 9/10 | 0.568 | <10−7 |
| rs4897366 |
| 9/10 | 0.569 | <10−7 |
| rs4074720 | rs1469244-rs2452941- | 8/10 | 0.565 | <10−7 |
| rs7901695 | rs4897366- | 9/10 | 0.569 | <10−7 |
| rs4506565 | rs4897366- | 9/10 | 0.569 | <10−7 |
| rs4132670 | rs10485400- | 9/10 | 0.568 | <10−7 |
| rs10787472 | rs2452941- | 7/10 | 0.559 | <10−7 |
| rs11196205 | rs955436-rs6470289- | 7/10 | 0.565 | <10−7 |
| rs10885409 | rs12154976- | 8/10 | 0.557 | <10−7 |
| rs11196208 | rs17608635-rs2736010- | 7/10 | 0.568 | <10−7 |
| rs1495377 | rs4708273-rs4506565- | 7/10 | 0.567 | <10−7 |
| rs9939973 | rs4897366-rs4506565- | 10/10 | 0.568 | <10−7 |
| rs9940128 | rs4897366-rs7901695- | 9/10 | 0.568 | <10−7 |
| rs9922047 | rs10485400-rs4132670- | 7/10 | 0.565 | <10−7 |
| rs1121980 | rs4897366-rs7901695- | 9/10 | 0.569 | <10−7 |
| rs9939609 | rs17608635-rs2736010-rs11196208-rs12879941- | 7/10 | 0.568 | <10−7 |
| rs9930506 | rs10485400-rs4132670- | 8/10 | 0.568 | <10−7 |
| rs2852373 | rs4897366-rs4506565-rs1121980- | 8/10 | 0.569 | <10−7 |
| rs608489 | rs4897366-rs7901695-rs1121980- | 9/10 | 0.569 | <10−7 |
| rs445984 | rs4897366-rs7901695-rs1121980-rs608489- | 9/10 | 0.569 | <10−7 |
| rs1040328 | rs10485400-rs7901695-rs9930506-rs608489- | 8/10 | 0.567 | <10−7 |
| rs990074 | rs10485400-rs7901695-rs9930506-rs608489- | 7/10 | 0.565 | <10−7 |
| rs2223046 | rs10485400-rs4132670-rs9930506-rs608489- | 8/10 | 0.568 | <10−7 |
| rs2834970 | rs10485400-rs4132670-rs9922047-rs608489- | 7/10 | 0.565 | <10−7 |
Note: In our GMDR-GPU analysis, age and sex were used as covariates, and BMI was not adjusted for.
Core SNPs associated with T2D based on G×G analysis at different orders with GMDR-GPU program.
| Core SNP ID (Alleles) | Gene and Location | Physical Location | Coding | Chrom | RefSeq | Reported Disease & Reference |
| rs10485400 (A/G) |
| 130485817 | Intron | 6 | NC_000006.10 | |
| rs4897366 (A/C) | 130499771 | Intron | ||||
| rs4074720 (A/G) | 114738487 | Intron | ||||
| rs7901695 (C/T) |
| 114744078 | Intron | T2D | ||
| rs4506565 (A/T) | (transcription | 114746031 | Intron | T2D | ||
| rs4132670 (C/T) | factor 7-like 2; | 114757761 | Intron | 10 | NC_000010.9 | T2D |
| rs10787472 (A/C) | T-cell specific, | 114771287 | Intron | |||
| rs11196205 (C/G) | HMG-box) | 114797037 | Intron | T2D | ||
| rs10885409 (C/T) | 114798062 | Intron | T2D | |||
| rs11196208 (C/T) | 114801306 | Intron | ||||
| rs1495377 (C/G) |
| 69863368 | Intron | 12 | NC_000012.10 | T2D |
| rs9939973 (A/G) | 52358069 | Intron | Obesity | |||
| rs9940128 (A/G) |
| 52358255 | Intron | T2D | ||
| rs9922047 (C/G) | and obesity | 52363781 | Intron | 16 | NC_000016.8 | |
| rs1121980 (C/T) | associated) | 52366748 | Intron | Obesity | ||
| rs9939609 (A/T) | 52378028 | Intron | T2D | |||
| rs9930506 (A/G) | 52387966 | Intron | Obesity | |||
| rs2852373 (C/T) |
| 33157857 | Intron | 18 | NC_000018.8 | |
| rs608489 (A/G) | 33159109 | Intron | ||||
| rs445984 (C/T) | 35819728 | Intron | ||||
| rs1040328 (C/T) |
| 35832534 | Intron | |||
| rs990074 (C/T) | related | 35833022 | Intron | 21 | NC_000021.7 | |
| rs2223046 (A/G) | transcription | 35833974 | Intron | |||
| rs2834970 (A/G) | factor 1) | 35847430 | Intron |
Gene locations were retrieved from Enzembl Genome Browser.
Physical location of each gene determined according to NCBI build 36.
Figure 3Interaction/association network among the six genes containing at least one core SNPs identified by analyzing WTCCC T2D data with GMDR-GPU.
Beyond the six susceptibility genes identified in this work, three other genes, CTNNB1, RUNX3, and CBFB, were found by a literature search. Although these genes have been associated with a number of human disorders, only two closely related diseases, i.e., T2D and colorectal cancer, are shown.