| Literature DB >> 28732532 |
Wei Zhang1,2, Liu Yang3, Larry L Tang4,5, Aiyi Liu6, James L Mills6, Yuanchang Sun7, Qizhai Li8.
Abstract
BACKGROUND: The association studies on human complex traits are admittedly propitious to identify deleterious genetic markers. Compared to single-trait analyses, multiple-trait analyses can arguably make better use of the information on both traits and markers, and thus improve statistical power of association tests prominently. Principal component analysis (PCA) is a well-known useful tool in multivariate analysis and can be applied to this task. Generally, PCA is first performed on all traits and then a certain number of top principal components (PCs) that explain most of the trait variations are selected to construct the test statistics. However, under some situations, only utilizing these top PCs would lead to a loss of important evidences from discarded PCs and thus makes the capability compromised.Entities:
Keywords: Biomedical study; Pleiotropic genetic associations; Power; Principal component analysis
Mesh:
Substances:
Year: 2017 PMID: 28732532 PMCID: PMC5521155 DOI: 10.1186/s12864-017-3928-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 3The empirical power of five tests for 100 correlated phenotypes sampled from Model 1 with correlation structure S5-S8. 1000 simulation replicates are conducted under the nominal significant level of 0.05
Fig. 1Schematic representations of two association models with indirect (Model 1) and direct (Model 2) genetic effects used to generate multiple correlated phenotypes. In Model 1, the genetic variant affects the correlated phenotypes via latent factors, while the genetic variant directly affects some single phenotypes in Model 2
The empirical type I errors of TATES, MANOVA, MultiPhen, mCPC, and GATE when the correlated phenotypes are sampled from indirect association model
| Scenario | MAF | TATES | MANOVA | MultiPhen | mCPC | GATE | |
|---|---|---|---|---|---|---|---|
|
| S1 | 0.05 | 0.046 | 0.044 | 0.048 | 0.046 | 0.043 |
| 0.15 | 0.048 | 0.045 | 0.043 | 0.044 | 0.041 | ||
| 0.30 | 0.058 | 0.054 | 0.064 | 0.052 | 0.053 | ||
| 0.50 | 0.047 | 0.047 | 0.058 | 0.050 | 0.045 | ||
| S2 | 0.05 | 0.062 | 0.053 | 0.049 | 0.043 | 0.053 | |
| 0.15 | 0.059 | 0.058 | 0.060 | 0.057 | 0.054 | ||
| 0.30 | 0.061 | 0.046 | 0.047 | 0.047 | 0.047 | ||
| 0.50 | 0.054 | 0.053 | 0.054 | 0.055 | 0.059 | ||
| S3 | 0.05 | 0.053 | 0.047 | 0.047 | 0.041 | 0.043 | |
| 0.15 | 0.051 | 0.051 | 0.049 | 0.051 | 0.052 | ||
| 0.30 | 0.064 | 0.053 | 0.057 | 0.061 | 0.065 | ||
| 0.50 | 0.050 | 0.061 | 0.065 | 0.066 | 0.062 | ||
| S4 | 0.05 | 0.062 | 0.042 | 0.047 | 0.046 | 0.042 | |
| 0.15 | 0.051 | 0.045 | 0.045 | 0.049 | 0.045 | ||
| 0.30 | 0.045 | 0.042 | 0.045 | 0.047 | 0.047 | ||
| 0.50 | 0.052 | 0.044 | 0.049 | 0.060 | 0.057 | ||
|
| S5 | 0.05 | 0.055 | 0.059 | 0.111 | 0.057 | 0.059 |
| 0.15 | 0.053 | 0.049 | 0.087 | 0.046 | 0.053 | ||
| 0.30 | 0.056 | 0.037 | 0.096 | 0.030 | 0.032 | ||
| 0.50 | 0.060 | 0.051 | 0.114 | 0.038 | 0.051 | ||
| S6 | 0.05 | 0.058 | 0.049 | 0.103 | 0.046 | 0.051 | |
| 0.15 | 0.056 | 0.056 | 0.093 | 0.063 | 0.058 | ||
| 0.30 | 0.062 | 0.043 | 0.098 | 0.040 | 0.049 | ||
| 0.50 | 0.076 | 0.046 | 0.116 | 0.037 | 0.050 | ||
| S7 | 0.05 | 0.057 | 0.052 | 0.104 | 0.051 | 0.048 | |
| 0.15 | 0.045 | 0.045 | 0.086 | 0.058 | 0.056 | ||
| 0.30 | 0.052 | 0.045 | 0.103 | 0.040 | 0.041 | ||
| 0.50 | 0.049 | 0.062 | 0.114 | 0.054 | 0.053 | ||
| S8 | 0.05 | 0.066 | 0.063 | 0.113 | 0.063 | 0.062 | |
| 0.15 | 0.063 | 0.046 | 0.085 | 0.051 | 0.055 | ||
| 0.30 | 0.039 | 0.052 | 0.116 | 0.041 | 0.038 | ||
| 0.50 | 0.066 | 0.071 | 0.126 | 0.066 | 0.054 |
The number of correlated phenotypes is 20 and 100. Scenario S1-S4 correspond to four correlation structures for m=20 and Scenario S5-S8 are for m=100. For each scenario, four MAFs including 0.05, 0.15, 0.30, and 0.50 are considered. The nominal significance level is 0.05 and 1000 simulations are conducted
Fig. 2The empirical power of five tests for 20 correlated phenotypes sampled from Model 1 with correlation structure S1-S4. 1000 simulation replicates are conducted under the nominal significant level of 0.05
The empirical type I errors of TATES, MANOVA, MultiPhen, mCPC, and GATE when the correlated phenotypes are sampled from direct association model
| Scenario | MAF | TATES | MANOVA | MultiPhen | mCPC | GATE | |
|---|---|---|---|---|---|---|---|
|
| S9 | 0.05 | 0.045 | 0.048 | 0.050 | 0.045 | 0.047 |
| 0.15 | 0.054 | 0.049 | 0.058 | 0.050 | 0.048 | ||
| 0.30 | 0.051 | 0.047 | 0.054 | 0.043 | 0.045 | ||
| 0.50 | 0.050 | 0.054 | 0.056 | 0.053 | 0.052 | ||
| S10 | 0.05 | 0.037 | 0.054 | 0.059 | 0.048 | 0.052 | |
| 0.15 | 0.035 | 0.055 | 0.050 | 0.046 | 0.045 | ||
| 0.30 | 0.038 | 0.049 | 0.055 | 0.052 | 0.043 | ||
| 0.50 | 0.031 | 0.043 | 0.049 | 0.047 | 0.047 | ||
| S11 | 0.05 | 0.041 | 0.050 | 0.050 | 0.052 | 0.050 | |
| 0.15 | 0.049 | 0.047 | 0.054 | 0.048 | 0.053 | ||
| 0.30 | 0.049 | 0.054 | 0.055 | 0.052 | 0.054 | ||
| 0.50 | 0.060 | 0.053 | 0.060 | 0.052 | 0.055 | ||
| S12 | 0.05 | 0.042 | 0.051 | 0.057 | 0.062 | 0.064 | |
| 0.15 | 0.040 | 0.047 | 0.045 | 0.044 | 0.048 | ||
| 0.30 | 0.041 | 0.043 | 0.048 | 0.045 | 0.046 | ||
| 0.50 | 0.045 | 0.048 | 0.052 | 0.048 | 0.051 | ||
|
| S13 | 0.05 | 0.049 | 0.044 | 0.086 | 0.042 | 0.044 |
| 0.15 | 0.060 | 0.047 | 0.089 | 0.046 | 0.040 | ||
| 0.30 | 0.060 | 0.063 | 0.119 | 0.052 | 0.056 | ||
| 0.50 | 0.053 | 0.041 | 0.094 | 0.039 | 0.051 | ||
| S14 | 0.05 | 0.031 | 0.053 | 0.100 | 0.059 | 0.048 | |
| 0.15 | 0.024 | 0.059 | 0.092 | 0.046 | 0.054 | ||
| 0.30 | 0.030 | 0.051 | 0.114 | 0.051 | 0.058 | ||
| 0.50 | 0.030 | 0.050 | 0.122 | 0.046 | 0.042 | ||
| S15 | 0.05 | 0.058 | 0.055 | 0.106 | 0.056 | 0.050 | |
| 0.15 | 0.047 | 0.051 | 0.102 | 0.059 | 0.043 | ||
| 0.30 | 0.042 | 0.066 | 0.118 | 0.061 | 0.051 | ||
| 0.50 | 0.048 | 0.039 | 0.103 | 0.045 | 0.043 | ||
| S16 | 0.05 | 0.036 | 0.055 | 0.099 | 0.059 | 0.053 | |
| 0.15 | 0.041 | 0.050 | 0.087 | 0.055 | 0.049 | ||
| 0.30 | 0.041 | 0.047 | 0.110 | 0.057 | 0.065 | ||
| 0.50 | 0.042 | 0.065 | 0.130 | 0.059 | 0.052 |
The number of correlated phenotypes is 20 and 100. Scenario S9-S12 correspond to four correlation structures for m=20 and Scenario S13-S16 are for m=100. For each scenario, four MAFs including 0.05, 0.15, 0.30, and 0.50 are considered. The nominal significance level is 0.05 and 1000 simulation replicates are conducted
Fig. 4The empirical power of five tests for 20 correlated phenotypes sampled from Model 2 with correlation structure S9-S12. 1000 simulation replicates are conducted under the nominal significant level of 0.05
Fig. 5The empirical power of five tests for 100 correlated phenotypes sampled from Model 2 with correlation structure S13-S16. 1000 replicates are conducted under the nominal significant level of 0.05
P-values of the selected 7 SNPs on mouse chromosome 19 for the association tests with 52 phenotypes using the TATES, MANOVA, MultiPhen, mCPC, and GATE methods
| snpid | TATES | MANOVA | MultiPhen | mCPC | GATE |
|---|---|---|---|---|---|
| rs13483499 | 3.27 ×10−3 | 1.79 ×10−4 | 3.34 ×10−4 | 1.63 ×10−4 | 6.30 ×10−5 |
| rs13459157 | 4.05 ×10−3 | 2.36 ×10−4 | 4.20 ×10−4 | 2.17 ×10−4 | 8.80 ×10−5 |
| rs13483502 | 4.23 ×10−2 | 1.01 ×10−3 | 2.22 ×10−4 | 9.97 ×10−4 | 1.67 ×10−4 |
| rs6259521 | 3.00 ×10−3 | 1.11 ×10−2 | 4.63 ×10−3 | 6.41 ×10−4 | 1.73 ×10−4 |
| rs13483579 | 1.19 ×10−3 | 6.15 ×10−2 | 6.17 ×10−2 | 5.44 ×10−3 | 1.74 ×10−4 |
| rs13483598 | 4.53 ×10−3 | 3.16 ×10−3 | 8.07 ×10−4 | 2.85 ×10−3 | 1.36 ×10−4 |
| rs13483601 | 3.82 ×10−3 | 2.82 ×10−3 | 7.45 ×10−4 | 1.84 ×10−3 | 8.50 ×10−5 |
“snpid” is the ID of the selected SNPs