| Literature DB >> 22689754 |
Gregory Darnell1, Dat Duong, Buhm Han, Eleazar Eskin.
Abstract
UNLABELLED: Recent technological developments in measuring genetic variation have ushered in an era of genome-wide association studies which have discovered many genes involved in human disease. Current methods to perform association studies collect genetic information and compare the frequency of variants in individuals with and without the disease. Standard approaches do not take into account any information on whether or not a given variant is likely to have an effect on the disease. We propose a novel method for computing an association statistic which takes into account prior information. Our method improves both power and resolution by 8% and 27%, respectively, over traditional methods for performing association studies when applied to simulations using the HapMap data. Advantages of our method are that it is as simple to apply to association studies as standard methods, the results of the method are interpretable as the method reports p-values, and the method is optimal in its use of prior information in regards to statistical power. AVAILABILITY: The method presented herein is available at http://masa.cs.ucla.edu.Entities:
Mesh:
Year: 2012 PMID: 22689754 PMCID: PMC3371867 DOI: 10.1093/bioinformatics/bts235
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Summary of the power comparison among all 10 ENCODE regions
| Pop. | No. of | No. of | All SNPs | SNPs with power between 0.1 and 0.9 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| tags | SNPs | Trad | Mult (MAF), | Mult (LD), | MVN, | Trad | Mult (MAF), | Mult (LD), | MVN, | |
| CEU | 678 | 10 710 | 0.664 | 0.660 (−0.6) | 0.672 (1.3) | 0.697 (5.0) | 0.698 | 0.706 (1.1) | 0.733 (5.0) | 0.784 (12.2) |
| YRI | 708 | 13 176 | 0.445 | 0.437 (−1.6) | 0.451 (1.5) | 0.537 (20.8) | 0.586 | 0.577 (−1.6) | 0.604 (3.1) | 0.731 (24.7) |
| CHB | 606 | 8934 | 0.716 | 0.710 (−0.9) | 0.726 (1.4) | 0.760 (6.1) | 0.708 | 0.708 (−0.1) | 0.740 (4.5) | 0.813 (14.7) |
| JPT | 608 | 9248 | 0.684 | 0.675 (−1.4) | 0.690 (0.9) | 0.722 (5.6) | 0.712 | 0.696 (−2.3) | 0.736 (3.4) | 0.803 (12.8) |
| Average | 0.627 | 0.621 (−1.1) | 0.635 (1.2) | 0.679 (8.3) | 0.676 | 0.671 (−0.7) | 0.703 (4.0) | 0.782 (15.7) | ||
The numbers in parentheses are the power gain compared with the traditional method
Summary of the resolution comparison among all 10 ENCODE regions
| Pop. | All SNPs | SNPs with power between 0.1 and 0.9 | ||||||
|---|---|---|---|---|---|---|---|---|
| Trad | Mult (MAF), | Mult (LD), | MVN, | Trad | Mult (MAF), | Mult (LD), | MVN, | |
| CEU | 33 334 | 33 377 (−0.1) | 33 078 (0.8) | 27 153 (18.5) | 42 983 | 43 658 (−1.6) | 43 040 (−0.1) | 27 505 (36.0) |
| YRI | 47 017 | 49 128 (−4.5) | 45 561 (3.1) | 30 247 (35.7) | 49 245 | 50 828 (−3.2) | 45 690 (7.2) | 27 299 (44.6) |
| CHB | 26 582 | 25 340 (4.7) | 25 977 (2.3) | 20 045 (24.6) | 36 633 | 35 182 (4.0) | 34 875 (4.8) | 19 981 (45.5) |
| JPT | 30 740 | 30 195 (1.8) | 29 808 (3.0) | 22 917 (25.4) | 42 342 | 40 771 (3.7) | 40 731 (3.8) | 20 183 (52.3) |
| Average | 34 418 | 34 510 (−0.3) | 33 606 (2.4) | 25 090 (27.1) | 42 801 | 42 610 (0.4) | 41 084 (4.0) | 23 742 (44.5) |
The unit of resolution is basepairs. The numbers in parentheses are the improvement percentage in resolution compared with the traditional method
Fig. 1.Average power under varying relative risks