| Literature DB >> 22689639 |
Attila Gyenesei1, Jonathan Moody, Asta Laiho, Colin A M Semple, Chris S Haley, Wen-Hua Wei.
Abstract
Genome-wide association studies (GWAS) have discovered many loci associated with common disease and quantitative traits. However, most GWAS have not studied the gene-gene interactions (epistasis) that could be important in complex trait genetics. A major challenge in analysing epistasis in GWAS is the enormous computational demands of analysing billions of SNP combinations. Several methods have been developed recently to address this, some using computers equipped with particular graphical processing units, most restricted to binary disease traits and all poorly suited to general usage on the most widely used operating systems. We have developed the BiForce Toolbox to address the demand for high-throughput analysis of pairwise epistasis in GWAS of quantitative and disease traits across all commonly used computer systems. BiForce Toolbox is a stand-alone Java program that integrates bitwise computing with multithreaded parallelization and thus allows rapid full pairwise genome scans via a graphical user interface or the command line. Furthermore, BiForce Toolbox incorporates additional tests of interactions involving SNPs with significant marginal effects, potentially increasing the power of detection of epistasis. BiForce Toolbox is easy to use and has been applied in multiple studies of epistasis in large GWAS data sets, identifying interesting interaction signals and pathways.Entities:
Mesh:
Year: 2012 PMID: 22689639 PMCID: PMC3394281 DOI: 10.1093/nar/gks550
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.BiForce Toolbox workflow. GWAS data are generated by microarray or next-generation sequencing platforms (A). Example of input data formats for disease and quantitative traits (B). Example of genotype data conversion and storage in bitwise arrays (C). Snapshot of the BiForce Toolbox GUI after loading and conversion of the BMI data of the NFBC1966 cohort (note the multi-tab options such as working with subsets of SNPs or SNP-pairs and performing single SNP-based genome-wide association) (D). Snapshot of the BiForce Toolbox GUI running the pairwise genome scan of the BMI data with progress reporting at the bottom (note SNP information, genotype counts, contingency table are showed for a retained pair of SNPs; the retained results of the pairwise genome scan can be exported as tab-separated text file or Excel compatible spreadsheet for further analyses by users) (E). A graphic view of positions of interaction signals in BMI of the NFBC1966 cohort generated by a third party tool Circos, where chromosome ideograms are shown around the outer ring and are oriented pter–qter in a clockwise direction with centromeres indicated as red bands (F). Snapshot of the BiForce Toolbox GUI running pathway enrichment analysis after the pairwise genome scan using epistatic genes with interaction −log10 P values greater than 7.3 (i.e. P < 5.0e−08; note the member epistatic genes and associated SNPs are displayed for a pathway selected) (G).
A short list of the results of the pairwise genome scan of BMI in NFBC1966 cohort
| SNP1 | SNP2 | Anova_Pair | Anova_Int | Pair_−log10P | Int_−log10P | GenoClassNo |
|---|---|---|---|---|---|---|
| rs9873966 | rs950604 | 7.704 | 14.850 | 9.587 | 11.341 | 9 |
| rs9873966 | rs4406373 | 7.646 | 14.785 | 9.496 | 11.287 | 9 |
| rs2310173 | rs9691472 | 7.615 | 14.311 | 9.448 | 10.893 | 9 |
| rs7536830 | rs11169063 | 7.644 | 13.928 | 9.492 | 10.576 | 9 |
| rs1453405 | rs2289025 | 7.237 | 13.744 | 8.862 | 10.423 | 9 |
| rs2962896 | rs6890673 | 6.810 | 13.466 | 8.203 | 10.193 | 9 |
| rs983936 | rs1074651 | 7.380 | 13.465 | 9.083 | 10.192 | 9 |
| rs983936 | rs1541694 | 7.324 | 13.447 | 8.997 | 10.177 | 9 |
| rs10490096 | rs17816002 | 7.305 | 16.789 | 8.018 | 10.124 | 8 |
| rs3980965 | rs11763972 | 7.260 | 13.001 | 8.897 | 9.808 | 9 |
aSNP1 (SNP2): the first (second) SNP; Anova_Pair (Anova_Int): F ratio of a whole pair of SNPs with interaction (interaction between a pair of SNPs); Pair_−log10P (Int_−log10P): −log10 P value of a whole pair of SNPs with interaction (interaction between a pair of SNPs); GenoClassNo: the number of joint genotype classes (9 in total) with samples.