| Literature DB >> 18644146 |
Li Ma1, H Birali Runesha, Daniel Dvorkin, John R Garbe, Yang Da.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS.Entities:
Mesh:
Year: 2008 PMID: 18644146 PMCID: PMC2503991 DOI: 10.1186/1471-2105-9-315
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Estimated single-processor computing time on the SGI Altix XE 1300 Linux cluster system with 2.66 GHz Intel Clovertown processor, and the total number of tests for two-locus and three-locus analysis.
| Number of SNPs (N) | Two-locus analysis | Three-locus analysis | |
| 500,000 | Computing time (T) | T ≈ 1.2 years | T ≈ 200,000 years |
| Number of tests (M) | M = (1.25) 1011 | M = (2.08) 1016 | |
| 1,000,000 | Computing time (T) | T ≈ 5 years | T ≈ 1.5 million years |
| Number of tests (M) | M = (5.0) 1011 | M = (1.67) 1017 |
Example of distributing N SNPs to m(m+1)/2 processor cores (Pi, i = 1, m(m+1)/2) for the case where N/m is an integer, where m = N/n = number of subsets of SNPs with each subset having n SNPs (m and n are assumed integers).
| Subset 1: | Subset 2: | ... ... | Subset m: | |
| P1 | P2 | ... ... | Pm | Subset 1: |
| Pm+1 | ... ... | P2m-1 | Subset 2: | |
| ... ... | ... ... | ... ... | ||
| Pm(m+1)/2 | Subset m: | |||
Each diagonal core receives one subset of n SNPs and computes [3n + 5n(n - 1)/2] tests, and each off-diagonal core receives two subsets of total 2n SNPs and computes 5n2 tests.
Figure 1Observed and predicted run times of the EPISNPmpi program on Minnesota Supercomputing Institute's 2.6 GHz IBM BladeCenter Linux cluster (Blade) and the SGI Altix XE 1300 Linux cluster system with 2.66 GHz Intel Clovertown processor (Calhoun). The observed run times (circles representing Blade and squares representing Calhoun) matched well with the predicted run times under ideal speedup and scalability (solid line representing Blade and dotted line representing Calhoun). Analyses in this figure used a hypothetical GWAS data set with 50,000 SNPs and 2000 individuals.
Figure 2Examples of chromosome view of single-locus significance and sample size produced by the EPISNPPLOT program that draws chromosome views for all chromosomes by one command. The figure on the left is an example of chromosome view based on the original marker order, and the figure on the right is an example of chromosome view in ascending order of significant dominance effects.
Figure 3Examples of SNP epistasis network of a phenotype produced by the EPINET program that by default draws the 10 largest epistasis networks from the input test results. Line color: black = I-effect, red = A × A, purple = A × D, blue = D × A, green = D × D. Node color: red: p < 10-8, cyan: p < 10-7, green: p < 10-6, yellow: p < 10-5.
Currently supported processors type, MPI libraries, compilers and corresponding binaries
| Voltaire MPI | Intel | Intel | EPISNPmpi_2.0_Voltaire_intel_intel.tar.gz |
| Voltaire MPI | Intel | AMD | EPISNPmpi_2.0_Voltaire_intel_AMD.tar.gz |
| Voltaire MPI | Intel | Intel (EM64T) | EPISNPmpi_2.0_Voltaire_suse_EM64T.tar.gz |
| PathMPI | Pathscale | AMD | EPISNPmpi_2.0_Pathscale_suse_AMD.tar.gz |
| IntelMPI | Intel | AMD | EPISNPmpi_2.0_intelMPI.suse_AMD.tar.gz |
| OpenMPI | Intel | Intel (EM64T) | EPISNPmpi_2.0_OpenMPI_suse_EM64T.tar.gz |
| IBM MPI | Intel | Power4 | EPISNPmpi_2.0_IBM_AIX_pwr.tar.gz |
| MPT | Intel | Itanium | EPISNPmpi_2.0_SGI-Altix_SUSE_itanium.tar.gz |
Currently supported operation systems, processors types, and compilers used to generate binaries
| Widows | Intel | Intel/AMD | epiSNP_2.0_Widows.zip |
| Irix | SGI | MIPS | epiSNP_2.0_SGI_Irix_Mips.tar.gz |
| Linux (SUSE) | Intel | AMD | epiSNP_2.0_intel_suse_AMD.tar.gz |
| Linux (SUSE) | Intel | Intel (EM64T) | epiSNP_2.0_intel_suse_EM64T.tar.gz |
| Linux | Portland | Intel (32bit) | epiSNP_2.0_Linux_Portland_Intel.tar.gz |
| Linux (SUSE) | Pathscale | AMD | epiSNP_2.0_Pathscale_suse_AMD.tar.gz |
| Unix (AIX) | XLF | Power4 | epiSNP_2.0_xlf_AIX_power.tar.gz |