| Literature DB >> 29743933 |
Abstract
Multiple correlated traits are often collected in genetic studies. By jointly analyzing multiple traits, we can increase power by aggregating multiple weak effects and reveal additional insights into the genetic architecture of complex human diseases. In this article, we propose a multivariate linear regression-based method to test the joint association of multiple quantitative traits. It is flexible to accommodate any covariates, has very accurate control of type I errors, and offers very competitive performance. We also discuss fast and accurate significance p value computation especially for genome-wide association studies with small-to-medium sample sizes. We demonstrate through extensive numerical studies that the proposed method has competitive performance. Its usefulness is further illustrated with application to genome-wide association analysis of diabetes-related traits in the Atherosclerosis Risk in Communities (ARIC) study. We found some very interesting associations with diabetes traits which have not been reported before. We implemented the proposed methods in a publicly available R package.Entities:
Mesh:
Year: 2018 PMID: 29743933 PMCID: PMC5878919 DOI: 10.1155/2018/2564531
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Type I error of multitrait tests (m = 2, p0 = 0.1) divided by the nominal significance level α. The MAFs of SNP are 0.1 and 0.2 in the two populations, respectively. Q is the m-DF omnibus Wald test; T and T′ are the 1-DF Wald tests assuming a common or common scaled effect. (Q, T, T′) are the corresponding GEE-based m-DF omnibus test and 1-DF tests assuming a common effect or common scaled effect.
|
| |
| ||||
|---|---|---|---|---|---|---|
| 10−5 | 10−4 | 10−3 | 10−5 | 10−4 | 10−3 | |
| | 0.69 | 0.79 | 0.89 | 0.67 | 0.79 | 0.89 |
| | 0.74 | 0.85 | 0.93 | 0.71 | 0.83 | 0.92 |
| | 0.74 | 0.85 | 0.89 | 0.71 | 0.83 | 0.92 |
|
| 1.04 | 1.00 | 1.00 | 1.03 | 1.01 | 1.00 |
| | 0.98 | 0.99 | 1.01 | 0.97 | 0.99 | 1.00 |
| | 0.96 | 0.98 | 1.00 | 0.96 | 0.98 | 0.99 |
Type I error divided by the nominal significance level α for multitrait tests (m = 8, p0 = 0.1).
| |
|
| ||||
|---|---|---|---|---|---|---|
| 10−5 | 10−4 | 10−3 | 10−5 | 10−4 | 10−3 | |
| | 0.43 | 0.62 | 0.75 | 0.44 | 0.60 | 0.75 |
| | 0.74 | 0.84 | 0.93 | 0.77 | 0.85 | 0.93 |
| | 0.74 | 0.84 | 0.93 | 0.78 | 0.85 | 0.93 |
| | 0.94 | 0.99 | 1.00 | 0.94 | 1.00 | 1.00 |
| | 1.03 | 1.03 | 1.02 | 1.05 | 1.04 | 1.03 |
| | 1.03 | 1.03 | 1.03 | 1.03 | 0.99 | 0.99 |
Power of multitrait tests for m = 2 continuous traits (Y1, Y2) under significance level α = 10−4. The MAFs of SNP are 0.1 and 0.2 in the two populations, respectively. Q is the m-DF omnibus Wald test; T and T′ are the 1-DF Wald tests assuming common or common scaled effect. (Q, T, T′) are the corresponding GEE-based m-DF omnibus test and 1-DF tests assuming a common effect or common scaled effect. σ is the standard error of Y and γ is the SNP coefficient, i = 1,2. The highest powered tests are bold-faced.
| ( |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
|
| |||||||
|
| |||||||
| (0.3,0) | (0.21,0) |
| 0.001 | 0.024 | 0.334 | 0.001 | 0.019 |
| (0.3,0.1) | (0.21,0.1) |
| 0.047 | 0.146 | 0.177 | 0.039 | 0.126 |
| (0.25,0.18) | (0.18,0.18) | 0.180 | 0.221 |
| 0.154 | 0.194 | 0.233 |
| (0.3,0.25) | (0.21,0.25) | 0.523 | 0.617 |
| 0.476 | 0.573 | 0.582 |
| (0.2,0.2) | (0.14,0.2) | 0.179 |
| 0.215 | 0.154 | 0.23 | 0.193 |
| (0.2,0.25) | (0.14,0.25) | 0.410 |
| 0.369 | 0.367 | 0.466 | 0.333 |
| (0.25,0.25) | (0.18,0.25) | 0.449 |
| 0.492 | 0.403 | 0.521 | 0.455 |
| (0,0.25) | (0,0.25) |
| 0.278 | 0.052 | 0.59 | 0.247 | 0.040 |
| (0,0.3) | (0,0.3) |
| 0.525 | 0.121 | 0.865 | 0.477 | 0.093 |
| (0.1,0.25) | (0.07,0.25) | 0.465 |
| 0.372 | 0.418 | 0.448 | 0.330 |
| (0.1,0.3) | (0.07,0.3) |
| 0.726 | 0.590 | 0.700 | 0.688 | 0.534 |
| (0.2,0.3) | (0.14,0.3) | 0.845 |
| 0.842 | 0.810 | 0.870 | 0.810 |
|
| |||||||
|
| |||||||
|
| |||||||
| (0.3,0) | (0.21,0) |
| 0.026 | 0.063 | 0.178 | 0.020 | 0.051 |
| (0.3,0.1) | (0.21,0.1) | 0.316 | 0.249 |
| 0.278 | 0.215 | 0.304 |
| (0.25,0.18) | (0.18,0.18) | 0.419 | 0.510 |
| 0.376 | 0.471 | 0.494 |
| (0.3,0.25) | (0.21,0.25) | 0.830 | 0.891 |
| 0.796 | 0.868 | 0.870 |
| (0.2,0.2) | (0.14,0.2) | 0.375 |
| 0.462 | 0.333 | 0.449 | 0.427 |
| (0.2,0.25) | (0.14,0.25) | 0.631 |
| 0.677 | 0.584 | 0.692 | 0.636 |
| (0.25,0.25) | (0.18,0.25) | 0.734 |
| 0.801 | 0.690 | 0.792 | 0.771 |
| (0,0.25) | (0,0.25) |
| 0.249 | 0.134 | 0.36 | 0.217 | 0.107 |
| (0,0.3) | (0,0.3) |
| 0.485 | 0.29 | 0.657 | 0.437 | 0.235 |
| (0.1,0.25) | (0.07,0.25) |
| 0.385 | 0.165 | 0.406 | 0.356 | 0.140 |
| (0.1,0.3) | (0.07,0.3) |
| 0.639 | 0.301 | 0.728 | 0.605 | 0.257 |
| (0.2,0.3) | (0.14,0.3) | 0.700 |
| 0.545 | 0.655 | 0.713 | 0.500 |
Power of multitrait tests for m = 8 continuous traits under significance level α = 10−4. The MAFs of SNP are 0.1 and 0.2 in the two populations, respectively. Q is the m-DF omnibus Wald test; T and T′ are the 1-DF Wald tests assuming common or common scaled effect. (Q, T, T′) are the corresponding GEE-based m-DF omnibus test and 1-DF tests assuming a common effect or common scaled effect. The highest powered tests are bold-faced.
| ( |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| ||||||
|
| ||||||
|
|
| 0.001 | 0 | 0.229 | 0 | 0 |
| (.3, .2, .1, .05,0,…, 0) |
| 0 | 0.008 | 0.599 | 0 | 0.005 |
|
| 0.045 | 0.201 |
| 0.030 | 0.169 | 0.195 |
|
| 0.048 |
| 0.193 | 0.032 | 0.204 | 0.170 |
|
| ||||||
|
| ||||||
|
| ||||||
|
|
| 0.001 | 0.004 | 0.043 | 0.001 | 0.002 |
| (.3, .2, .1, .05,0,…, 0) |
| 0.156 | 0.224 | 0.372 | 0.102 | 0.152 |
|
| 0.934 | 0.996 |
| 0.887 | 0.992 | 0.993 |
|
| 0.912 |
| 0.994 | 0.855 | 0.989 | 0.988 |
Figure 1Ratio of the actual significance levels of m-DF chi-square test versus the F-test with (m, n − p − 1 − m) DFs. The x-axis is the type I error rate. (a) Shows the results for testing m = 4 traits with p = 2 covariates based on n individuals. (b) Shows the results for testing m = 8 traits with p = 2 covariates.
Six novel SNPs identified in the ARIC joint association test, which were not significant in the corresponding MAGIC consortium meta-analyses of fasting glucose (FG), fasting insulin (FI), and 2-hour fasting glucose (2hFG) but were significant in the MAGIC meta-analysis of fasting proinsulin (FP). We listed the ARIC joint test's p values (the proposed MLM Wald test and the GEE chi-square test) and the corresponding MAGIC consortium meta-analyses' p values for FG, FI, 2hFG, and FP.
| SNP | Chr | bp | ARIC joint test's | MAGIC meta-analysis' | ||||
|---|---|---|---|---|---|---|---|---|
| Wald | GEE | FG | FI | 2hFG | FP | |||
| rs4502156 | 15 | 62383155 | 5.4 | 7.9 | 8.4 | 6.7 | 8.2 | 3.8 |
| rs7163757 | 15 | 62391608 | 1.4 | 1.8 | 4.2 | 5.7 | 1.9 | 3.9 |
| rs8037894 | 15 | 62394264 | 1.2 | 1.6 | 4.1 | 4.8 | 3.5 | 8.7 |
| rs6494307 | 15 | 62394690 | 1.7 | 2.1 | 3.3 | 4.9 | 2.7 | 4.1 |
| rs7167878 | 15 | 62396189 | 1.7 | 2.1 | 4.6 | 4.5 | 2.4 | 4.1 |
| rs7172432 | 15 | 62396389 | 1.7 | 2.2 | 6.5 | 3.3 | 1.9 | 4.3 |