| Literature DB >> 18976480 |
Priya Duggal1, Elizabeth M Gillanders, Taura N Holmes, Joan E Bailey-Wilson.
Abstract
BACKGROUND: By assaying hundreds of thousands of single nucleotide polymorphisms, genome wide association studies (GWAS) allow for a powerful, unbiased review of the entire genome to localize common genetic variants that influence health and disease. Although it is widely recognized that some correction for multiple testing is necessary, in order to control the family-wide Type 1 Error in genetic association studies, it is not clear which method to utilize. One simple approach is to perform a Bonferroni correction using all n single nucleotide polymorphisms (SNPs) across the genome; however this approach is highly conservative and would "overcorrect" for SNPs that are not truly independent. Many SNPs fall within regions of strong linkage disequilibrium (LD) ("blocks") and should not be considered "independent".Entities:
Mesh:
Year: 2008 PMID: 18976480 PMCID: PMC2621212 DOI: 10.1186/1471-2164-9-516
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Affymetrix 500 K using CEU HapMap Samples
| 1 | 31876 | 4447 | 833 | 5280 |
| 2 | 33610 | 4626 | 787 | 5413 |
| 3 | 27588 | 3903 | 723 | 4626 |
| 4 | 25811 | 3514 | 689 | 4203 |
| 5 | 26548 | 3601 | 646 | 4247 |
| 6 | 26550 | 3487 | 604 | 4091 |
| 7 | 21544 | 3061 | 618 | 3679 |
| 8 | 22550 | 3053 | 563 | 3616 |
| 9 | 19086 | 2664 | 541 | 3205 |
| 10 | 23531 | 3046 | 510 | 3556 |
| 11 | 21477 | 2761 | 528 | 3289 |
| 12 | 20549 | 2821 | 499 | 3320 |
| 13 | 15700 | 2116 | 392 | 2508 |
| 14 | 12839 | 1820 | 371 | 2191 |
| 15 | 11560 | 1857 | 396 | 2253 |
| 16 | 12339 | 1944 | 454 | 2398 |
| 17 | 8473 | 1385 | 344 | 1729 |
| 18 | 11966 | 1748 | 374 | 2122 |
| 19 | 5177 | 954 | 305 | 1259 |
| 20 | 10292 | 1519 | 331 | 1850 |
| 21 | 5873 | 843 | 204 | 1047 |
| 22 | 5053 | 828 | 213 | 1041 |
Illumina 317 K SNPs using CEU HapMap Samples
| 1 | 23055 | 4959 | 1336 | 6295 |
| 2 | 25103 | 5258 | 1348 | 6606 |
| 3 | 21332 | 4505 | 1268 | 5773 |
| 4 | 18923 | 3979 | 1055 | 5034 |
| 5 | 19062 | 3966 | 979 | 4945 |
| 6 | 20524 | 4044 | 950 | 4994 |
| 7 | 16493 | 3472 | 977 | 4449 |
| 8 | 18053 | 3658 | 940 | 4598 |
| 9 | 15691 | 3305 | 936 | 4241 |
| 10 | 15423 | 3263 | 899 | 4162 |
| 11 | 14498 | 3037 | 827 | 3864 |
| 12 | 14844 | 3097 | 918 | 4015 |
| 13 | 11411 | 2373 | 620 | 2993 |
| 14 | 9767 | 2086 | 592 | 2678 |
| 15 | 8817 | 1942 | 631 | 2573 |
| 16 | 8924 | 2078 | 705 | 2783 |
| 17 | 8279 | 1859 | 603 | 2462 |
| 18 | 10390 | 2183 | 678 | 2861 |
| 19 | 5833 | 1408 | 545 | 1953 |
| 20 | 7758 | 1736 | 496 | 2232 |
| 21 | 5430 | 1130 | 318 | 1448 |
| 22 | 5398 | 1156 | 379 | 1535 |
Thresholds for Genome Wide Association Using CEU and YRI Population Samples
| Affymetrix CEU 500 K (n = 66,923) | 1.49 × 10-05 | 7.47 × 10-07 | 1.49 × 10-08 |
| Illumina 317 K (n = 82,494) | 1.21 × 10-05 | 6.06 × 10-07 | 1.21 × 10-08 |
| HapMap YRI (n = 289,175) | 3.45 × 10-06 | 1.73 × 10-07 | 3.45 × 10-09 |
| HapMap CEU (n = 164,296) | 6.09 × 10-06 | 3.04 × 10-07 | 6.09 × 10-09 |
| HapMap CEU (D' > 0.7)* | 8.37 × 10-06 | 4.19 × 10-07 | 8.37 × 10-09 |
| HapMap CEU (D' > 0.9)* | 4.38 × 10-06 | 2.19 × 10-07 | 4.38 × 10-09 |
*extrapolated from Chromosome 1 data. P-values in parentheses in the header line indicate the family-wide error rate that corresponds to the Bonferroni-corrected significance thresholds given in the columns below.
HapMap SNPs using CEU HapMap Samples
| 1 | 184403 | 10740 | 1815 | 12555 |
| 2 | 211913 | 11219 | 1510 | 12729 |
| 3 | 166801 | 9431 | 1426 | 10857 |
| 4 | 155953 | 10204 | 1745 | 10363 |
| 5 | 161666 | 8725 | 1238 | 9963 |
| 6 | 174458 | 8677 | 1743 | 10420 |
| 7 | 137148 | 8050 | 1140 | 9190 |
| 8 | 141925 | 7707 | 1076 | 8783 |
| 9 | 116824 | 7092 | 1105 | 8197 |
| 10 | 132087 | 7428 | 1250 | 8607 |
| 11 | 124354 | 6821 | 1037 | 7858 |
| 12 | 118973 | 6959 | 991 | 7950 |
| 13 | 99669 | 5290 | 793 | 6083 |
| 14 | 80500 | 4690 | 893 | 5583 |
| 15 | 69104 | 4690 | 814 | 5504 |
| 16 | 68205 | 5212 | 817 | 6029 |
| 17 | 56026 | 4127 | 715 | 4842 |
| 18 | 73392 | 4486 | 742 | 5228 |
| 19 | 35412 | 3109 | 570 | 3679 |
| 20 | 60421 | 3896 | 606 | 4502 |
| 21 | 32740 | 2141 | 380 | 2521 |
| 22 | 33369 | 2491 | 421 | 2853 |
HapMap SNPs using YRI HapMap Samples
| 1 | 209439 | 17517 | 4169 | 21686 |
| 2 | 238828 | 19081 | 5688 | 24769 |
| 3 | 184337 | 15409 | 3635 | 19044 |
| 4 | 174670 | 14673 | 2754 | 17427 |
| 5 | 176975 | 14478 | 3063 | 17541 |
| 6 | 187787 | 14073 | 3127 | 17200 |
| 7 | 149764 | 12884 | 2451 | 15335 |
| 8 | 158800 | 13069 | 2465 | 15534 |
| 9 | 128582 | 11602 | 3185 | 14787 |
| 10 | 147710 | 12065 | 3778 | 15843 |
| 11 | 136474 | 11261 | 2793 | 14054 |
| 12 | 130298 | 11142 | 2383 | 13525 |
| 13 | 112162 | 8767 | 1470 | 10237 |
| 14 | 88022 | 7549 | 1240 | 8789 |
| 15 | 77885 | 7979 | 1657 | 9636 |
| 16 | 78364 | 8334 | 1810 | 10144 |
| 17 | 62720 | 6622 | 1754 | 8376 |
| 18 | 87027 | 7466 | 5294 | 12760 |
| 19 | 39729 | 4514 | 1037 | 5551 |
| 20 | 68828 | 6397 | 1344 | 7741 |
| 21 | 37450 | 3717 | 744 | 4461 |
| 22 | 36468 | 3945 | 790 | 4735 |
Altering Block Definitions for Chromosome 1
| Total Number of Blocks | Total Number of Interblock SNPs | Total Number of SNPs and Blocks | Average Number of SNPs per block | Average D' per block | |
| Solid Spine LD | 10740 | 1815 | 12555 | 18.4 | 0.804 |
| Gabriel | 10115 | 38037 | 48152 | 15.7 | 0.805 |
| 4-Gamete Rule | 18967 | 9084 | 28051 | 9.5 | 0.841 |
Examples of sample sizes required to have 80% power to attain significant association (family-wide error of 0.05) when using 'LD-adjusted' and unadjusted Bonferroni-corrected significance thresholds in CEU and YRI under different genetic models
| P-value | Population | Genotype Relative Risk Aa/AA | Sample Size |
| 3.04 × 10-07 | CEU HapMap LD adjusted | 1.4 | 5270 (-890) |
| 2.08 × 10-08 | CEU HapMap | 1.4 | 6160 |
| 3.04 × 10-07 | CEU HapMap LD adjusted | 1.6 | 2550 (-431) |
| 2.08 × 10-08 | CEU HapMap | 1.6 | 2981 |
| 1.73 × 10-07 | YRI HapMap LD adjusted | 1.4 | 5457 (-742) |
| 1.85 × 10-08 | YRI HapMap | 1.4 | 6199 |
| 1.73 × 10-07 | YRI HapMap LD adjusted | 1.6 | 2641 (-358) |
| 1.85 × 10-08 | YRI HapMap | 1.6 | 2999 |
Sample size is calculated with a high risk allele frequency of 10%, disease prevalence of 20%, and power of 0.80, with a difference in allele frequency between the causal marker and the genotyped marker of 10% (D' = 1.0). Sample size indicates the number of cases required (an equal number of controls is also required). The number in parentheses for sample size indicates the difference between the sample size required when using the LD adjusted Bonferroni correction versus using the unadjusted Bonferroni correction (which corrects for 2.4 million CEU HapMap SNPs and 2.7 million YRI HapMap SNPs.