| Literature DB >> 22618535 |
Attila Gyenesei1, Jonathan Moody, Colin A M Semple, Chris S Haley, Wen-Hua Wei.
Abstract
MOTIVATION: Gene-gene interactions (epistasis) are thought to be important in shaping complex traits, but they have been under-explored in genome-wide association studies (GWAS) due to the computational challenge of enumerating billions of single nucleotide polymorphism (SNP) combinations. Fast screening tools are needed to make epistasis analysis routinely available in GWAS.Entities:
Mesh:
Year: 2012 PMID: 22618535 PMCID: PMC3400955 DOI: 10.1093/bioinformatics/bts304
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Comparison of power of detection of epistasis in binary traits between BiForce and BOOST. Model 1: multiplicative model, Models 2 and 3: missing lethal genotype model (aabb does not lead to disease, AaBb does in Model 3 but not in Model 2), Model 4: exclusive OR model
Fig. 2.Comparison of power of detection of epistasis in quantitative traits between BiForce and PLINK. Model 1: multiplicative model, Models 2 and 3: missing lethal genotype model (aabb does not lead to disease, AaBb does in Model 3 but not Model 2), Model 4: exclusive OR model
Fig. 3.FPR profiles of BiForce in detection of epistasis in binary and quantitative traits
Computing performance (in h) comparison between BiForce and BOOST in analysing different GWAS datasets (1000 samples)
| SNPs | BOOST | BiForce (1 thread) | BiForce (8 threads) | BiForce (cluster) |
|---|---|---|---|---|
| 100 k | 2.90 | 2.29 | 0.60 | 0.01 |
| 200 k | 11.61 | 8.89 | 2.29 | 0.29 |
| 300 k | 26.11 | 20.08 | 5.11 | 0.65 |
| 400 k | 46.36 | 35.97 | 8.83 | 1.16 |
| 500 k | 72.64 | 55.68 | 14.03 | 1.82 |
| 1000 k | 295.97 | 221.98 | 55.96 | 7.40 |
aBOOST, BiForce (one thread) and BiForce (eight threads) each ran on an iMAC workstation with 4 GB RAM and 4 Intel Cores each with two threads running at 2.8 GHz. BiForce (cluster) used a 32-node computer cluster each with 4 CPU cores (two threads per core).
Genome-wide significant epistatic pairs identified from the NFBC199 cohort
| Trait | SNP1 | SNP2 | Distance | LD ( | |
|---|---|---|---|---|---|
| CRP | rs1811472 | rs2592887 | 3.0E-12 | 10 590 | 0.86 |
| CRP | rs1811472 | rs2794520 | 3.5E-11 | 36 467 | 0.62 |
| CRP | rs2592887 | rs2794520 | 2.9E-12 | 25 877 | 0.70 |
| CRP | rs2650000 | rs7953249 | 2.6E-09 | 14 762 | 0.76 |
| CRP | rs1169300 | rs2464196 | 3.4E-10 | 4202 | 0.99 |
| GLU | rs560887 | rs563694 | 1.3E-08 | 10 923 | 0.81 |
| HDL | rs3764261 | rs1532624 | 2.0E-14 | 12 155 | 0.53 |
| LDL | rs157580 | rs405509 (19q13.32; 0.46) | 6.9E-10 | 13 570 | 0.35 |
| TRI | rs1260326 | rs780094 (2p23.3; 0.36) | 5.8E-08 | 10 297 | 0.95 |
aAll SNP pairs listed were detected as marginal SNP interactions, with the threshold of 1.5E-08 for CRP, 2.2E-08 for HDL, 3.9E-08 for GLU and LDL, 7.7E-07 for TRI; SNP1 (SNP2)—name, genomic location and MAF (the latter two in bracket) of the first (second) SNP; P—P-value of the interaction test; distance—the distance in base pairs between two SNPs; LD—linkage disequilibrium (in r2) between a pair of SNPs; the SNP pair in HDL was also detected via the pair-wise genome scan (P < 9.54E-13).
bThe marginal SNP.
Genme-wide significant epistatic pairs identified from the WTCCC datasets
| Trait | SNP1 | SNP2 | Distance | LD ( | |
|---|---|---|---|---|---|
| BD | rs11162341 (1p31.1; 0.13) | rs6658302 (1p31.1; 0.25) | 1.7E-14 | 11 272 | 0.02 |
| BD | rs11096892 (4p15.1; 0.05) | rs6531531 (4p15.1; 0.28) | 9.1E-15 | 2196 | 0.03 |
| BD | rs4246045 | rs4958847 (5q33.1; 0.12) | 5.5E-09 | 62 490 | 0.82 |
| BD | rs11949556 (5q33.1; 0.12) | rs4246045 | 4.7E-09 | 52 704 | 0.82 |
| BD | rs2747436 (6p22.1; 0.29) | rs29254 (6p22.1; 0.06) | 4.2E-13 | 28 341 | 0.03 |
| CD | rs11162341 (1p31.1; 0.13) | rs6658302 (1p31.1; 0.25) | 3.1E-14 | 11 272 | 0.02 |
| CD | rs1735558 (3q21.3; 0.15) | rs6439119 (3q21.3; 0.24) | 2.2E-16 | 136 423 | 0.47 |
| CD | rs2248668 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 8.9E-19 | 32 438 | 0.49 |
| CD | rs2811472 (3q21.3; 0.15) | rs6439119 (3q21.3; 0.24) | 8.0E-16 | 44 316 | 0.48 |
| CD | rs2811483 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 4.8E-19 | 8514 | 0.50 |
| CD | rs2811484 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 6.0E-19 | 8334 | 0.50 |
| CD | rs2811510 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 6.3E-19 | 8675 | 0.50 |
| CD | rs2955125 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 6.8E-19 | 2005 | 0.50 |
| CD | rs2955132 (3q21.3; 0.14) | rs6439119 (3q21.3; 0.24) | 1.1E-18 | 17 039 | 0.50 |
| CD | rs1554534 (3q21.3; 0.15) | rs6439119 (3q21.3; 0.24) | 8.8E-15 | 123 518 | 0.47 |
| CD | rs11096892 (4p15.1; 0.05) | rs6531531 (4p15.1; 0.28) | 2.8E-14 | 2196 | 0.03 |
| CD | rs2747436 (6p22.1; 0.29) | rs29254 (6p22.1; 0.06) | 3.4E-14 | 28 341 | 0.03 |
aThreshold for the pair-wise genome scan was 8.3E-13 for BD and 8.2E-13 for CD; threshold for marginal SNP interactions was 4.8E-08 in BD (three marginal SNPs were detected); SNP1 (SNP2)—name, genomic location and MAF (the latter two in bracket) of the first (second) SNP; Pint—P-value of the interaction test; distance—the distance in base pairs between two SNPs; LD—linkage disequilibrium (in r2) between a pair of SNPs.
bThe marginal SNP.