| Literature DB >> 22373475 |
Wenan Chen1, Xi Gao2, Jiexun Wang1, Chuanyu Sun1, Wen Wan1, Degui Zhi3, Nianjun Liu3, Xiangning Chen4, Guimin Gao1.
Abstract
We evaluate four association tests for rare variants-the combined multivariate and collapsing (CMC) method, two weighted-sum methods, and a variable threshold method-by applying them to the simulated data sets of unrelated individuals in the Genetic Analysis Workshop 17 (GAW17) data. The family-wise error rate (FWER) and average power are used as criteria for evaluation. Our results show that when all nonsynonymous SNPs (rare variants and common variants) in a gene are jointly analyzed, the CMC method fails to control the FWER; when only rare variants (single-nucleotide polymorphisms with minor allele frequency less than 0.05) are analyzed, all four methods can control FWER well. All four methods have comparable power, which is low for the analysis of the GAW17 data sets. Three of the methods (not including the CMC method) involve estimation of p-values using permutation procedures that either can be computationally intensive or generate inflated FWERs. We adapt a fast permutation procedure into these three methods. The results show that using the fast permutation procedure can produce FWERs and average powers close to the values obtained from the standard permutation procedure on the GAW17 data sets. The standard permutation procedure is computationally intensive.Entities:
Year: 2011 PMID: 22373475 PMCID: PMC3287927 DOI: 10.1186/1753-6561-5-S9-S86
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Family-wise error rate (FWER) and average power of weighted-sum method I using different permutation procedures
| Permutation procedurea | FWER | Power | Computing time (approximate)b | ||
|---|---|---|---|---|---|
| Disease | |||||
| Normal 103 | 0.1600 | 0.1200 | 0.0123 | 0.0069 | 36 minutes |
| Normal 106 | 0.1200 | 0.1200 | 0.0085 | 0.0074 | 16 days |
| Standard permutation | 0.0650 | 0.1183 | 0.0065 | 0.0069 | 16 days |
| Fast permutation | 0.0550 | 0.1172 | 0.0065 | 0.0065 | 4.5 days |
Data used here are the 200 data sets from GAW17 with all nonsynonymous SNPs.
aNormal 103 is the normal approximation with 1,000 permutations; normal 106 is the normal approximation with 106 permutations; standard permutation is the standard permutation procedure with 106 permutations; and fast permutation is the fast permutation procedure with the maximum number of permutations, 106.
bAll studies are performed in Matlab on a computer with a CPU of 2.66 GHz and 32 GB memory. The computing time is the total running time for all four phenotypes.
Results for the 200 data sets with all nonsynonymous SNPs
| Association method | FWER | Power | ||
|---|---|---|---|---|
| Disease | ||||
| CMC | 0.1150 | 0.1439 | 0.0062 | 0.0044 |
| Weighed-sum method I | 0.0550 | 0.1172 | 0.0065 | 0.0065 |
| Weighted-sum method II | 0.0400 | 0.1511 | 0.0042 | 0.0019 |
| Variable threshold | 0.0250 | 0.1306 | 0.0031 | 0.0064 |
Results for the 200 data sets with nonsynonymous SNPs with MAF < 0.05
| Association method | FWER | Power | ||
|---|---|---|---|---|
| Disease | ||||
| CMC | 0.0100 | 0.1417 | 0.0023 | 0.0013 |
| Weighted-sum method I | 0.0150 | 0.1467 | 0.0015 | 0.0030 |
| Weighted-sum method II | 0.0100 | 0.1194 | 0.0023 | 0.0004 |
| Variable threshold | 0 | 0.1289 | 0.0031 | 0.0014 |