| Literature DB >> 21176216 |
Randall C Johnson1, George W Nelson, Jennifer L Troyer, James A Lautenberger, Bailey D Kessing, Cheryl A Winkler, Stephen J O'Brien.
Abstract
BACKGROUND: As we enter an era when testing millions of SNPs in a single gene association study will become the standard, consideration of multiple comparisons is an essential part of determining statistical significance. Bonferroni adjustments can be made but are conservative due to the preponderance of linkage disequilibrium (LD) between genetic markers, and permutation testing is not always a viable option. Three major classes of corrections have been proposed to correct the dependent nature of genetic data in Bonferroni adjustments: permutation testing and related alternatives, principal components analysis (PCA), and analysis of blocks of LD across the genome. We consider seven implementations of these commonly used methods using data from 1514 European American participants genotyped for 700,078 SNPs in a GWAS for AIDS.Entities:
Mesh:
Year: 2010 PMID: 21176216 PMCID: PMC3023815 DOI: 10.1186/1471-2164-11-724
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of Analysis Results
| Method | Significance Threshold | Corresponding α level |
|---|---|---|
| Bonferroni | 0.71 × 10-7 | 0.046 |
| PRESTO | 0.76 × 10-7 | 0.05 |
| simpleℳ | 0.82 × 10-7 | 0.053 |
| SLIDE | 1.09 × 10-7 | 0.068 |
| Gabriel | 2.72 × 10-7 | 0.151 |
| 4-Gamete | 3.06 × 10-7 | 0.166 |
| Solid spine | 3.71 × 10-7 | 0.195 |
The significance threshold for each method is shown (αGWAS = 0.05), as well as the corresponding genome-wide α level when compared with the PRESTO method. A strict Bonferroni significance threshold is also given.
Difference in Significance Threshold in a Subset of the Data
| Method | Δ Significance Threshold |
|---|---|
| simpleℳ | -8 × 10-11 |
| 4-Gamete | -8 × 10-10 |
| SLIDE | -5 × 10-9 |
| Gabriel | -6 × 10-8 |
| PRESTO | 7 × 10-7 |
| Solid spine | -8 × 10-7 |
The difference in significance threshold is given, comparing an analysis of the full data set to a subset of the data with an equal number of cases and controls (1,514 and 518 individuals, respectively).
Figure 1Change in computation time and significance threshold for varying region sizes. The change in serial computation time (solid black line) and significance threshold (dotted blue line) are plotted as a function of the mean number of SNPs in each region in a GWAS-wide analysis using the simpleℳ method.
Comparison of α levels when restricting the definition of a haplotype
| Method | Parameters | Significance Threshold | Corresponding α level |
|---|---|---|---|
| Gabriel | D'U > 0.98 | 2.72 × 10-7 | 0.151 |
| D'L > 0.70 | |||
| D'U > 0.98 | 2.11 × 10-7 | 0.12 | |
| D'L > 0.85 | |||
| 4-Gamete | Cutoff = 1% | 3.06 × 10-7 | 0.166 |
| Cutoff = 0.5% | 2.50 × 10-7 | 0.139 | |
| Solid spine | D' = 0.80 | 3.71 × 10-7 | 0.195 |
| D' = 0.95 | 2.79 × 10-7 | 0.155 |
Significance thresholds with corresponding α levels are given for each haplotype calling method with the standard parameters and a set of more restricted parameters. D'U and D'L represent the, upper and lower confidence limits of D', respectively.