| Literature DB >> 27189724 |
Zhengbang Li1, Wei Zhang2, Dongdong Pan3, Qizhai Li2.
Abstract
Principal component analysis (PCA) is a useful tool to identify important linear combination of correlated variables in multivariate analysis and has been applied to detect association between genetic variants and human complex diseases of interest. How to choose adequate number of principal components (PCs) to represent the original system in an optimal way is a key issue for PCA. Note that the traditional PCA, only using a few top PCs while discarding the other PCs, might significantly lose power in genetic association studies if all the PCs contain non-ignorable signals. In order to make full use of information from all PCs, Aschard and his colleagues have proposed a multi-step combined PCs method (named mCPC) recently, which performs well especially when several traits are highly correlated. However, the power superiority of mCPC has just been illustrated by simulation, while the theoretical power performance of mCPC has not been studied yet. In this work, we attempt to investigate theoretical properties of mCPC and further propose a novel and efficient strategy to combine PCs. Extensive simulation results confirm that the proposed method is more robust than existing procedures. A real data application to detect the association between gene TRAF1-C5 and rheumatoid arthritis further shows good performance of the proposed procedure.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27189724 PMCID: PMC4870571 DOI: 10.1038/srep26243
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Empirical type I error rates and powers of HT, oPC (k0.8), SSU, SKAT, mCPC (k0.8) and tCPC for Constant correlations.
| 40.5 cmType | HT | oPC (0.8) | SSU | SKAT | mCPC ( | tCPC | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 40.1 cmI 40.5 cm error | 0.20 | 0 | 0.20 | 0.20 | 0.047 | 0.047 | 0.044 | 0.048 | 0.044 | 0.048 |
| 0.50 | 0 | 0.20 | 0.20 | 0.051 | 0.052 | 0.043 | 0.049 | 0.051 | 0.045 | |
| 0.80 | 0 | 0.20 | 0.20 | 0.053 | 0.049 | 0.051 | 0.052 | 0.050 | 0.051 | |
| 0.95 | 0 | 0.20 | 0.20 | 0.046 | 0.053 | 0.049 | 0.052 | 0.047 | 0.047 | |
| 0.20 | ln 1.6 | 0.05 | 0.30 | 0.448 | 0.054 | 0.140 | 0.446 | 0.610 | 0.404 | |
| 0.20 | ln 1.4 | 0.20 | 0.20 | 0.754 | 0.736 | 0.772 | 0.762 | 0.700 | 0.766 | |
| 0.20 | ln 1.4 | 0.30 | 0.05 | 0.884 | 0.922 | 0.996 | 0.886 | 0.856 | 0.996 | |
| 0.50 | ln 1.6 | 0.05 | 0.30 | 0.450 | 0.106 | 0.288 | 0.430 | 0.524 | 0.496 | |
| 40.8 cm Power | 0.50 | ln 1.4 | 0.20 | 0.20 | 0.724 | 0.756 | 0.810 | 0.816 | 0.702 | 0.856 |
| 0.50 | ln 1.4 | 0.30 | 0.05 | 0.886 | 0.928 | 0.996 | 0.894 | 0.874 | 0.988 | |
| 0.80 | ln 1.6 | 0.05 | 0.30 | 0.450 | 0.144 | 0.352 | 0.440 | 0.456 | 0.528 | |
| 0.80 | ln 1.4 | 0.20 | 0.20 | 0.740 | 0.778 | 0.918 | 0.918 | 0.754 | 0.924 | |
| 0.80 | ln 1.4 | 0.30 | 0.05 | 0.872 | 0.942 | 0.984 | 0.818 | 0.910 | 0.964 | |
| 0.95 | ln 1.6 | 0.05 | 0.30 | 0.484 | 0.246 | 0.300 | 0.368 | 0.500 | 0.520 | |
| 0.95 | ln 1.4 | 0.20 | 0.20 | 0.736 | 0.940 | 0.972 | 0.972 | 0.924 | 0.948 | |
| 0.95 | ln 1.4 | 0.30 | 0.05 | 0.880 | 0.996 | 0.962 | 0.684 | 0.988 | 0.920 |
Empirical type-1 error rates and powers of HT, oPC (k 0.8), SSU, SKAT, mCPC (k 0.8) and tCPC for decreasing correlations.
| 40.5 cmType | HT | oPC (0.8) | SSU | SKAT | mCPC ( | tCPC | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 40.1 cm I 40.5 cm error | 0.20 | 0 | 0.20 | 0.20 | 0.062 | 0.048 | 0.054 | 0.052 | 0.052 | 0.040 |
| 0.50 | 0 | 0.20 | 0.20 | 0.036 | 0.032 | 0.040 | 0.044 | 0.038 | 0.042 | |
| 0.80 | 0 | 0.20 | 0.20 | 0.052 | 0.038 | 0.042 | 0.044 | 0.056 | 0.046 | |
| 0.95 | 0 | 0.20 | 0.20 | 0.036 | 0.046 | 0.052 | 0.054 | 0.042 | 0.048 | |
| 0.20 | ln 1.6 | 0.05 | 0.30 | 0.470 | 0.060 | 0.110 | 0.470 | 0.650 | 0.386 | |
| 0.20 | ln 1.4 | 0.20 | 0.20 | 0.746 | 0.756 | 0.766 | 0.738 | 0.700 | 0.688 | |
| 0.20 | ln 1.4 | 0.30 | 0.05 | 0.886 | 0.926 | 0.998 | 0.890 | 0.864 | 0.998 | |
| 0.50 | ln 1.6 | 0.05 | 0.30 | 0.490 | 0.074 | 0.126 | 0.466 | 0.568 | 0.390 | |
| 40.8 cm Power | 0.50 | ln 1.4 | 0.20 | 0.20 | 0.722 | 0.784 | 0.742 | 0.706 | 0.686 | 0.662 |
| 0.50 | ln 1.4 | 0.30 | 0.05 | 0.882 | 0.914 | 0.996 | 0.862 | 0.856 | 0.996 | |
| 0.80 | ln 1.6 | 0.05 | 0.30 | 0.482 | 0.120 | 0.148 | 0.350 | 0.478 | 0.400 | |
| 0.80 | ln 1.4 | 0.20 | 0.20 | 0.770 | 0.846 | 0.772 | 0.762 | 0.788 | 0.836 | |
| 0.80 | ln 1.4 | 0.30 | 0.05 | 0.888 | 0.946 | 0.994 | 0.852 | 0.912 | 0.988 | |
| 0.95 | ln 1.6 | 0.05 | 0.30 | 0.480 | 0.192 | 0.266 | 0.358 | 0.464 | 0.510 | |
| 0.95 | ln 1.4 | 0.20 | 0.20 | 0.746 | 0.898 | 0.858 | 0.858 | 0.860 | 0.864 | |
| 0.95 | ln 1.4 | 0.30 | 0.05 | 0.866 | 0.972 | 0.982 | 0.732 | 0.962 | 0.936 |
Parameter settings about means and covariance matrices.
| Covariance matrix | Scenarios | Mean vectors |
|---|---|---|
| Uniform correlation with | (S1) | |
| (S2) | ||
| (S3) | ||
| (S4) | ||
| Uniform correlation with | (S5) | |
| (S6) | ||
| (S7) | ||
| (S8) | ||
| Decreasing correlation with | (S9) | |
| (S10) | ||
| (S11) | ||
| (S12) | ||
| Decreasing correlation with | (S13) | |
| (S14) | ||
| (S15) | ||
| (S16) |
Eigenvalues, cumulative contribution rates and non-centrality parameters for Scenarios (S1) to (S8).
| S1–S4 | S1 | S2 | S3 | S4 | S5–S8 | S5 | S6 | S7 | S8 | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ω | Ω | Ω | Ω | Ω | Ω | Ω | Ω | |||||
| 1 | 16.2 | 81% | 4.94 | 0.20 | 0.05 | 0.01 | 4.8 | 24% | 1.67 | 2.34 | 0.59 | 0.04 |
| 2 | 0.2 | 82% | 0.00 | 4.92 | 1.56 | 0.28 | 0.2 | 28% | 0.00 | 0.00 | 0.00 | 0.00 |
| 3 | 0.2 | 83% | 0.00 | 0.68 | 0.07 | 0.00 | 0.2 | 32% | 0.00 | 3.61 | 0.19 | 0.21 |
| 4 | 0.2 | 84% | 0.00 | 1.26 | 2.46 | 0.00 | 0.2 | 36% | 0.00 | 0.02 | 1.31 | 0.00 |
| 5 | 0.2 | 85% | 0.00 | 1.88 | 3.83 | 0.00 | 0.2 | 40% | 0.00 | 1.74 | 3.09 | 4.29 |
| 6 | 0.2 | 86% | 0.00 | 0.58 | 2.68 | 17.22 | 0.2 | 44% | 0.00 | 0.45 | 0.25 | 0.00 |
| 7 | 0.2 | 87% | 0.00 | 0.01 | 0.01 | 0.00 | 0.2 | 48% | 0.00 | 0.00 | 0.16 | 0.00 |
| 8 | 0.2 | 88% | 0.00 | 0.08 | 0.09 | 0.00 | 0.2 | 52% | 0.00 | 0.65 | 0.00 | 0.00 |
| 9 | 0.2 | 89% | 0.00 | 0.06 | 0.13 | 0.00 | 0.2 | 56% | 0.00 | 0.2 | 0.16 | 0.00 |
| 10 | 0.2 | 90% | 0.00 | 0.09 | 0.06 | 0.00 | 0.2 | 60% | 0.00 | 0.38 | 0.16 | 0.00 |
| 11 | 0.2 | 91% | 0.00 | 0.08 | 0.01 | 0.00 | 0.2 | 64% | 0.00 | 0.28 | 0.05 | 0.00 |
| 12 | 0.2 | 92% | 0.00 | 0.47 | 0.19 | 0.00 | 0.2 | 68% | 0.00 | 2.11 | 1.47 | 0.00 |
| 13 | 0.2 | 93% | 0.00 | 0.07 | 0.08 | 0.00 | 0.2 | 72% | 0.00 | 0.80 | 0.09 | 0.00 |
| 14 | 0.2 | 94% | 0.00 | 0.16 | 0.01 | 0.00 | 0.2 | 76% | 0.00 | 0.62 | 0.49 | 0.00 |
| 15 | 0.2 | 95% | 0.00 | 0.07 | 0.25 | 0.00 | 0.2 | 80% | 0.00 | 1.39 | 0.14 | 0.00 |
| 16 | 0.2 | 96% | 0.00 | 2.41 | 0.07 | 0.00 | 0.2 | 84% | 0.00 | 0.80 | 2.48 | 0.12 |
| 17 | 0.2 | 97% | 0.00 | 0.23 | 0.39 | 0.00 | 0.2 | 88% | 0.00 | 0.12 | 0.23 | 0.00 |
| 18 | 0.2 | 98% | 0.00 | 0.10 | 0.09 | 0.00 | 0.2 | 92% | 0.00 | 0.10 | 0.23 | 0.00 |
| 19 | 0.2 | 99% | 0.00 | 0.12 | 0.01 | 0.00 | 0.2 | 96% | 0.00 | 0.00 | 0.00 | 0.00 |
| 20 | 0.2 | 100% | 0.00 | 2.73 | 0.01 | 1.50 | 0.2 | 100% | 0.00 | 0.79 | 0.06 | 0.13 |
Eigenvalues, cumulative contribution rates and non-centrality parameters for Scenarios (S9) to (S16).
| S9–S12 | S9 | S10 | S11 | S12 | S13–S16 | S13 | S14 | S15 | S16 | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ω | Ω | Ω | Ω | Ω | Ω | Ω | Ω | |||||
| 1 | 7.23 | 36.1% | 5.97 | 1.49 | 0.45 | 0.01 | 1.49 | 7.50% | 11.60 | 6.51 | 1.25 | 0.01 |
| 2 | 4.32 | 57.8% | 0.00 | 2.26 | 1.65 | 0.08 | 1.46 | 14.8% | 0.00 | 6.60 | 3.54 | 0.05 |
| 3 | 2.45 | 70.0% | 0.64 | 0.16 | 2.37 | 0.20 | 1.42 | 21.8% | 1.25 | 0.70 | 4.31 | 0.11 |
| 4 | 1.47 | 77.4% | 0.00 | 0.21 | 1.49 | 0.39 | 1.36 | 28.6% | 0.00 | 0.00 | 2.95 | 0.19 |
| 5 | 0.96 | 82.1% | 0.20 | 0.05 | 0.21 | 0.62 | 1.29 | 35.1% | 0.42 | 0.24 | 1.06 | 0.27 |
| 6 | 0.67 | 85.5% | 0.00 | 0.82 | 0.12 | 0.88 | 1.22 | 14.2% | 0.00 | 0.79 | 0.11 | 0.37 |
| 7 | 0.50 | 88.0% | 0.09 | 0.02 | 0.57 | 1.14 | 1.15 | 47.0% | 0.20 | 0.11 | 0.00 | 0.45 |
| 8 | 0.38 | 89.9% | 0.00 | 0.35 | 0.32 | 1.38 | 1.08 | 52.4% | 0.00 | 0.01 | 0.01 | 0.53 |
| 9 | 0.31 | 91.4% | 0.05 | 0.01 | 0.00 | 1.56 | 1.02 | 57.5% | 0.10 | 0.06 | 0.02 | 0.58 |
| 10 | 0.26 | 92.7% | 0.00 | 0.65 | 0.56 | 1.68 | 0.96 | 62.2% | 0.00 | 0.31 | 0.24 | 0.61 |
| 11 | 0.22 | 93.8% | 0.02 | 0.01 | 1.11 | 1.72 | 0.90 | 66.8% | 0.06 | 0.03 | 0.45 | 0.62 |
| 12 | 0.19 | 94.7% | 0.00 | 0.42 | 0.62 | 1.67 | 0.86 | 71.0% | 0.00 | 0.03 | 0.33 | 0.60 |
| 13 | 0.17 | 95.6% | 0.01 | 0.00 | 0.01 | 1.55 | 0.81 | 75.1% | 0.03 | 0.02 | 0.07 | 0.55 |
| 14 | 0.15 | 96.3% | 0.00 | 0.56 | 0.34 | 1.36 | 0.78 | 79.0% | 0.00 | 0.17 | 0.01 | 0.48 |
| 15 | 0.14 | 97.0% | 0.00 | 0.00 | 0.80 | 1.12 | 0.75 | 82.7% | 0.01 | 0.01 | 0.07 | 0.39 |
| 16 | 0.13 | 97.7% | 0.00 | 0.46 | 0.41 | 0.86 | 0.72 | 86.3% | 0.00 | 0.05 | 0.44 | 0.30 |
| 17 | 0.12 | 98.3% | 0.00 | 0.00 | 0.00 | 0.59 | 0.70 | 89.8% | 0.00 | 0.00 | 0.00 | 0.21 |
| 18 | 0.12 | 98.9% | 0.00 | 0.53 | 0.47 | 0.35 | 0.69 | 93.3% | 0.00 | 0.11 | 0.10 | 0.12 |
| 19 | 0.11 | 99.4% | 0.00 | 0.00 | 0.93 | 0.16 | 0.68 | 96.7% | 0.00 | 0.00 | 0.18 | 0.06 |
| 20 | 0.11 | 100% | 0.00 | 0.49 | 0.46 | 0.04 | 0.67 | 100% | 0.00 | 0.07 | 0.09 | 0.01 |
Figure 1Powers of mCPC (k) and oPC (k) under significant level α = 0.05 for Scenarios (S1) to (S4).
Figure 2Powers of mCPC (k) and oPC (k) under significant level α = 0.05 for Scenarios (S5) to (S8).
Figure 3Powers of mCPC (k) and oPC (k) under significant level α = 0.05 for Scenarios (S9) to (S12).
Figure 4Powers of mCPC (k) and oPC (k) under significant level α = 0.05 for Scenarios (S3) to (S16).