| Literature DB >> 25077411 |
Min-Seok Kwon, Mira Park, Taesung Park.
Abstract
BACKGROUND: With the development of high-throughput genotyping and sequencing technology, there are growing evidences of association with genetic variants and complex traits. In spite of thousands of genetic variants discovered, such genetic markers have been shown to explain only a very small proportion of the underlying genetic variance of complex traits. Gene-gene interaction (GGI) analysis is expected to unveil a large portion of unexplained heritability of complex traits.Entities:
Mesh:
Year: 2014 PMID: 25077411 PMCID: PMC4101351 DOI: 10.1186/1755-8794-7-S1-S6
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Figure 1Exhaustive approach and stepwise approach in IGENT. t is threshold, is p-value for jth combination in k-order interaction. is ith ordered p-value among p-values of all combinations in k-order interaction. his the number of combinations over the threshold in k-order interaction.
Computation time of IGENT, BOOST, MDR, RF, and SVM.
| SNP size | IGENT_exhaust | BOOST | MDR | RF | SVM |
|---|---|---|---|---|---|
| 50 | <1s | <1s | 1s | 11s | 13s |
| 100 | <1s | <1s | 4s | 46s | 53s |
| 500 | <1s | <1s | 1m 8s | 20m | 23m |
| 1K | 3s | 1s | 4m 25s | 1h 15m | 1h 29m |
| 2K | 8s | 6s | 19m 52s | 5h | 5h 50m |
| 5K | 38s | 30s | 2h 4m | 1d 6h | 1d 12h |
| 10K | 2m 34s | 2m 7s | *8h 16m | *5d 5h | *6d 3h |
| 100K | 4h 23m | 3h 32m | *35d | *520d | *614d |
| 350K | 2d 4h | 1d 19h | *422d | *6366d | *7524d |
| 500K | 4d 10h | 3d 15h | *861d | *12992d | *15353d |
Computation time is measured in simulation 1 dataset which have 2000 individuals. All methods used an exhaustive search strategy for 2nd order interaction analysis.
* This computing time is estimated from the computing time in simulation data with 5000 SNPs.
All analysis are carried out on single core of a 3.16 GHz CPU with 4G memory on LINUX.
Comparison of the type I error in null simulation
| False Positive Rate | ||
|---|---|---|
| IGENT | BOOST | |
| 0.01 | 0.012 | 0.011 |
| 0.05 | 0.057 | 0.054 |
| 0.10 | 0.112 | 0.108 |
| 0.15 | 0.166 | 0.153 |
| 0.20 | 0.219 | 0.215 |
| 0.25 | 0.270 | 0.264 |
| 0.30 | 0.321 | 0.318 |
Figure 2The power comparison between IGENT and BOOST on four disease models with main effects. Results are shown in separate panels for each sample size (800 and 1600). MAF are presented on the x-axis. Model 2-1 is a multiplicative model. Model 2-2 is an epistasis model that has been used to describe handedness and the colour of swine. Model 2-3 is a classical epistasis model. Model 2-4 is the XOR model.
Figure 3Performance comparison with IGENT, BOOST in 70 simulation models.
Efficiency of stepwise analysis
| Model | Powera in Stepwise approach | Power in exhaustive approach | ratio of powerb | Computation in stepwise approachc | ratio of computationd |
|---|---|---|---|---|---|
| 0.69 | 1.00 | 148.4 | |||
| 0.71 | 0.92 | 149.7 | |||
| 0.67 | 0.80 | 154.7 | |||
| 0.87 | 0.94 | 147.6 | |||
| 0.62 | 0.88 | 147.0 | |||
| 0.63 | 0.96 | 145.3 | |||
| 0.19 | 0.25 | 167.3 | |||
| 0.15 | 0.17 | 445.6 |
a Detection probability,
b the ratio of power between stepwise approach and exhaustive approach
c Average number of combinations to be computed in stepwise approach
dComputation ratio is the ratio of computation amount of stepwise approach and computation amount of exhaustive approach. The computation of exhaustive approach is calculated using 2C50 = 1225.
Hub genes (degree of nodes ≥ 10) in two-way interactions of WTCCC-BD
| Hub gene | degree | location | SNP(s) | Referencea |
|---|---|---|---|---|
| B3GALT5 | 115 | 21q22.2b | rs980184 | [ |
| LOC442261 | 98 | 6q23.2d | rs4896044 | |
| PI15 | 32 | 8q21.11b | rs2954873 | [ |
| LOC390730 | 26 | 16q12.2a | rs7188309 rs11640993 rs8056052 | [ |
| PHF20 | 24 | 20q11.23a | rs6060710 | |
| TLE4 | 13 | 9q21.31b | rs914715 rs11138278 | [ |
| DPP10 | 12 | 2q14.1b | rs11123306 rs708647 rs1375144 rs6741692 | [ |
| AKAP10 | 10 | 17p11.2d | rs203466 rs203457 rs119672 rs2108978 | [ |
| CHST2 | 10 | 3q23d | rs4683457 | [ |
a. Reference is literature related with bipolar disorder.
Figure 4Gene-gene interaction network for WTCCC-BD dataset. Red nodes represent genes reported in previous GWAS literature with bipolar disorder dataset. Blue nodes are the genes related with bipolar disorder in previous literature. Green nodes are the genes related with other psychiatric disorders (schizophrenia and depression disorder). Width of edge is the significance level of interaction.
Interaction analysis result using AMD data set
| rank | SNP |
|
|---|---|---|
| 1 | CFH(rs380390) SGCD(rs931798) | 8.454 × 10-12 |
| 2 | CFH(rs1329428) MED27(rs9328536) | 1.943 × 10-10 |
| 3 | CFH(rs380390) | 2.087 × 10-7 |
| 4 | INPP4B(rs3775640) | 3.128 × 10-7 |
| 5 | CFH(rs1329428) | 1.166 × 10-6 |