| Literature DB >> 34828328 |
Zhongxue Chen1, Yong Zang2,3.
Abstract
The additive genetic model as implemented in logistic regression has been widely used in genome-wide association studies (GWASs) for binary outcomes. Unfortunately, for many complex diseases, the underlying genetic models are generally unknown and a mis-specification of the genetic model can result in a substantial loss of power. To address this issue, the MAX3 test (the maximum of three separate test statistics) has been proposed as a robust test that performs plausibly regardless of the underlying genetic model. However, the original implementation of MAX3 utilizes the trend test so it cannot adjust for any covariates such as age and gender. This drawback has significantly limited the application of the MAX3 in GWASs, as covariates account for a considerable amount of variability in these disorders. In this paper, we extended the MAX3 and proposed the CMAX3 (covariate-adjusted MAX3) based on logistic regression. The proposed test yielded a similar robust efficiency as the original MAX3 while easily adjusting for any covariate based on the likelihood framework. The asymptotic formula to calculate the p-value of the proposed test was also developed in this paper. The simulation results showed that the proposed test performed desirably under both the null and alternative hypotheses. For the purpose of illustration, we applied the proposed test to re-analyze a case-control GWAS dataset from the Collaborative Studies on Genetics of Alcoholism (COGA). The R code to implement the proposed test is also introduced in this paper and is available for free download.Entities:
Keywords: GWAS; MAX3 test; genetic model; genotype; phenotype; risk allele; score test
Mesh:
Year: 2021 PMID: 34828328 PMCID: PMC8622598 DOI: 10.3390/genes12111723
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Empirical type I error rates of CMAX3 with different nominal levels.
| Nominal | Sample |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| level | size | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 |
|
| 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
| |
|
| 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
| |
|
| 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
| |
|
| 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
|
Empirical type I error rates of CMAX3 and OMAX3 with nominal levels 5%.
| Test | Sample |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| size | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 | |
| CMAX3 | 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
| |
| OMAX3 | 2000 |
|
|
|
|
|
|
| 1000 |
|
|
|
|
|
| |
| 500 |
|
|
|
|
|
|
Empirical powers from simulation study using sample size 2000 and nominal level 5%.
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|
| Test | Model | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 | MAF = 0.1 | MAF = 0.3 |
| REC | 81.3 | 94.7 | 82.4 | 93.3 | 63.5 | 86.6 | |
| 33.0 | 72.6 | 29.6 | 69.6 | 21.9 | 58.6 | ||
| 14.5 | 23.6 | 11.7 | 21.9 | 8.0 | 17.8 | ||
| LRT | 71.4 | 86.8 | 71.8 | 86.8 | 52.1 | 76.5 | |
| CMAX3 | 75.0 | 90.9 | 75.3 | 89.8 | 55.4 | 80.5 | |
| ADD | 18.1 | 49.7 | 19.5 | 48.5 | 16.4 | 48.1 | |
| 70.0 | 81.9 | 71.8 | 82.1 | 59.6 | 77.7 | ||
| 61.4 | 74.6 | 62.5 | 74.2 | 50.8 | 68.4 | ||
| LRT | 58.3 | 73.5 | 59.7 | 72.0 | 47.8 | 67.3 | |
| CMAX3 | 66.3 | 78.5 | 69.5 | 76.8 | 54.4 | 72.7 | |
| DOM | 8.7 | 13.1 | 7.4 | 11.7 | 8.2 | 11.8 | |
| 71.0 | 72.6 | 69.8 | 71.0 | 57.5 | 63.2 | ||
| 78.1 | 81.2 | 78.0 | 77.9 | 66.8 | 73.0 | ||
| LRT | 67.6 | 70.7 | 66.9 | 69.9 | 55.5 | 62.5 | |
| CMAX3 | 76.9 | 74.2 | 75.6 | 74.2 | 64.6 | 68.2 |
Top ten SNPs with most significant p-values based on the CMAX3 test. For each SNP, the smallest p-value obtained by all methods is in bold.
| Test | |||||
|---|---|---|---|---|---|
| SNPs | MAX3 | LRT | |||
| rs17719726 |
|
|
|
|
|
| rs2082371 |
|
|
|
|
|
| rs2322631 |
|
|
|
|
|
| rs13247743 |
|
|
|
|
|
| rs28760505 |
|
|
|
|
|
| rs7577225 |
|
|
|
|
|
| rs1195812 |
|
|
|
|
|
| rs12561944 |
|
|
|
|
|
| rs7698703 |
|
|
|
|
|
| rs701837 |
|
|
|
|
|
Figure 1R code to analyze the example dataset.