| Literature DB >> 22373457 |
Jing Jin1, Jane E Cerise, Sun Jung Kang, Eun Jung Yoon, Seungtai Yoon, Nancy R Mendell, Stephen J Finch.
Abstract
Statistical tests on rare variant data may well have type I error rates that differ from their nominal levels. Here, we use the Genetic Analysis Workshop 17 data to estimate type I error rates and powers of three models for identifying rare variants associated with a phenotype: (1) by using the number of minor alleles, age, and smoking status as predictor variables; (2) by using the number of minor alleles, age, smoking status, and the identity of the population of the subject as predictor variables; and (3) by using the number of minor alleles, age, smoking status, and ancestry adjustment using 10 principal component scores. We studied both quantitative phenotype and a dichotomized phenotype. The model with principal component adjustment has type I error rates that are closer to the nominal level of significance of 0.05 for single-nucleotide polymorphisms (SNPs) in noncausal genes for the selected phenotype than the model directly adjusting for population. The principal component adjustment model type I error rates are also closer to the nominal level of 0.05 for noncausal SNPs located in causal genes for the phenotype. The power for causal SNPs with the principal component adjustment model is comparable to the power of the other methods. The power using the underlying quantitative phenotype is greater than the power using the dichotomized phenotype.Entities:
Year: 2011 PMID: 22373457 PMCID: PMC3287905 DOI: 10.1186/1753-6561-5-S9-S66
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Distribution of minor allele frequencies of SNPs in the genes studied
| Genes | SNPs | MAF < 0.005 | 0.005 < MAF < 0.01 | 0.01 < MAF < 0.05 | 0.05 < MAF < 0.5 | Total | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Q1 | Q2 | Q1 | Q2 | Q1 | Q2 | Q1 | Q2 | Q1 | Q2 | ||
| Causal | Noncausal | 60 | 100 | 5 | 9 | 12 | 15 | 9 | 15 | 86 | 139 |
| Causal | 32 | 61 | 0 | 5 | 5 | 4 | 2 | 2 | 39 | 72 | |
| Noncausal | Noncausal | 1,422 | 532 | 189 | 57 | 295 | 90 | 288 | 80 | 2,194 | 759 |
The Q1 causal genes are ARNT, ELAVL4, FLT1, FLT4, HIF1A, HIF3A, KDR, VEGFA, and VEGFC. The Q2 causal genes are BCHE, GCKR, INSIG1, LPL, PDGFD, PLAT, RARB, SIRT1, SREBF1, VLDLR, VNN1, VNN3, and VWF. The noncausal genes are all genes on chromosomes 12, 21, and 22 for Q1 and all genes on chromosomes 21 and 22 for Q2.
Type I error rates for Q1 and Q2 using all noncausal SNPs in noncausal genes
| Model | MAF < 0.005 | 0.005 < MAF < 0.01 | 0.01 < MAF < 0.05 | 0.05 < MAF < 0.5 | Total | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | |
| Q1 | ||||||||||
| SNP | 9.7 | 6.0 | 9.0 | 9.3 | 17.0 | 21.2 | 20.9 | 31.3 | 12.1 | 11.7 |
| Population adjustment | 9.1 | 5.8 | 9.0 | 8.5 | 16.3 | 17.9 | 20.0 | 23.6 | 11.5 | 10.0 |
| PC adjustment | 6.6 | 5.7 | 6.3 | 6.4 | 6.8 | 6.9 | 5.7 | 6.7 | 6.5 | 6.0 |
| Q2 | ||||||||||
| SNP | 5.9 | 5.6 | 6.8 | 7.0 | 5.9 | 5.8 | 6.7 | 7.9 | 6.1 | 6.0 |
| Population adjustment | 6.1 | 5.6 | 7.7 | 7.1 | 5.9 | 5.6 | 5.6 | 5.7 | 6.2 | 5.7 |
| PC adjustment | 5.7 | 5.4 | 6.0 | 6.0 | 5.3 | 5.0 | 4.8 | 4.9 | 5.6 | 5.4 |
D is the dichotomized phenotype; Q is the quantitative phenotype. Noncausal SNPs for Q1 come from chromosomes 12, 21, and 22. Noncausal SNPs for Q2 come from chromosomes 21 and 22. Nominal type I error rate = 0.05, 200 replicates.
Type I error rates and power for Q1 using all SNPs in causal genes
| Model | MAF < 0.005 | 0.005 < MAF < 0.01 | 0.01 < MAF < 0.05 | 0.05 < MAF < 0.5 | Total | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | |
| Noncausal SNPs | ||||||||||
| SNP | 13.5 | 7.8 | 13.8 | 14.4 | 16.9 | 16.7 | 18.4 | 23.6 | 14.5 | 11.1 |
| Population adjustment | 11.3 | 7.8 | 10.0 | 11.7 | 15.8 | 14.4 | 8.1 | 9.7 | 11.5 | 9.1 |
| PC adjustment | 5.7 | 6.3 | 5.4 | 9.3 | 6.1 | 4.2 | 7.6 | 8.9 | 5.9 | 6.4 |
| Causal SNPs | ||||||||||
| SNP | 25.6 | 21.4 | NA | NA | 93.1 | 98.2 | 91.8 | 99.3 | 37.6 | 35.2 |
| Population adjustment | 22.6 | 21.7 | NA | NA | 91.3 | 97.7 | 85.3 | 98.3 | 34.7 | 35.4 |
| PC adjustment | 14.3 | 19.8 | NA | NA | 64.0 | 75.6 | 64.5 | 85.5 | 23.2 | 30.3 |
D is the dichotomized phenotype; Q is the quantitative phenotype. Nominal type I error rate = 0.05, 200 replicates.
Type I error rates and power for Q2 using all SNPs in causal genes
| Model | MAF < 0.005 | 0.005 < MAF < 0.01 | 0.01 < MAF < 0.05 | 0.05 < MAF < 0.5 | Total | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | D (%) | Q (%) | |
| Noncausal SNPs | ||||||||||
| SNP | 5.2 | 5.9 | 7.3 | 6.1 | 5.8 | 6.5 | 5.5 | 6.0 | 5.4 | 6.0 |
| Population adjustment | 6.3 | 5.7 | 11.2 | 7.8 | 6.1 | 5.4 | 4.9 | 5.7 | 6.5 | 5.8 |
| PC adjustment | 5.4 | 5.6 | 6.9 | 4.1 | 5.2 | 5.3 | 4.9 | 5.2 | 5.4 | 5.4 |
| Causal SNPs | ||||||||||
| SNP | 11.9 | 13.3 | 30.4 | 31.5 | 39.8 | 45.6 | 55.8 | 80.8 | 16.0 | 18.2 |
| Population adjustment | 11.2 | 12.9 | 31.1 | 29.1 | 39.4 | 46.5 | 48.0 | 70.8 | 15.2 | 17.5 |
| PC adjustment | 11.2 | 12.7 | 26.5 | 24.5 | 36.5 | 44.4 | 45.3 | 69.8 | 14.6 | 16.9 |
D is the dichotomized phenotype; Q is the quantitative phenotype. Nominal type I error rate = 0.05, 200 replicates.