| Literature DB >> 31398229 |
Jianjun Zhang1, Qiuying Sha2, Guanfu Liu3, Xuexia Wang1.
Abstract
There is increasing evidence showing that pleiotropy is a widespread phenomenon in complex diseases for which multiple correlated traits are often measured. Joint analysis of multiple traits could increase statistical power by aggregating multiple weak effects. Existing methods for multiple trait association tests usually study each of the multiple traits separately and then combine the univariate test statistics or combine p-values of the univariate tests for identifying disease associated genetic variants. However, ignoring correlation between phenotypes may cause power loss. Additionally, the genetic variants in one gene (including common and rare variants) are often viewed as a whole that affects the underlying disease since the basic functional unit of inheritance is a gene rather than a genetic variant. Thus, results from gene level association tests can be more readily integrated with downstream functional and pathogenic investigation, whereas many existing methods for multiple trait association tests only focus on testing a single common variant rather than a gene. In this article, we propose a statistical method by Testing an Optimally Weighted Combination of Multiple traits (TOW-CM) to test the association between multiple traits and multiple variants in a genomic region (a gene or pathway). We investigate the performance of the proposed method through extensive simulation studies. Our simulation studies show that the proposed method has correct type I error rates and is either the most powerful test or comparable with the most powerful tests. Additionally, we illustrate the usefulness of TOW-CM based on a COPDGene study.Entities:
Mesh:
Year: 2019 PMID: 31398229 PMCID: PMC6688794 DOI: 10.1371/journal.pone.0220914
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The estimated type I error rates for TOW-CM, minP, MANOVA and MSKAT.
| Sample Size | TOW-CM | minP | MANOVA | MSKAT | |
|---|---|---|---|---|---|
| 1000 | | | 0.054 | 0.055 | 0.055 | 0.045 |
| | | 0.054 | 0.052 | 0.054 | 0.046 | |
| | | 0.052 | 0.049 | 0.053 | 0.048 | |
| 2000 | | | 0.050 | 0.053 | 0.052 | 0.049 |
| | | 0.048 | 0.050 | 0.052 | 0.049 | |
| | | 0.048 | 0.053 | 0.052 | 0.051 | |
| 3000 | | | 0.049 | 0.051 | 0.052 | 0.050 |
| | | 0.053 | 0.055 | 0.050 | 0.049 | |
| | | 0.048 | 0.049 | 0.053 | 0.050 | |
| 1000 | | | 0.012 | 0.010 | 0.010 | 0.009 |
| | | 0.011 | 0.008 | 0.011 | 0.010 | |
| | | 0.012 | 0.010 | 0.010 | 0.007 | |
| 2000 | | | 0.012 | 0.012 | 0.011 | 0.008 |
| | | 0.010 | 0.012 | 0.010 | 0.009 | |
| | | 0.010 | 0.010 | 0.011 | 0.010 | |
| 3000 | | | 0.010 | 0.013 | 0.010 | 0.011 |
| | | 0.012 | 0.011 | 0.010 | 0.010 | |
| | | 0.010 | 0.011 | 0.011 | 0.010 | |
| 1000 | | | 0.0014 | 0.0010 | 0.0012 | 0.0008 |
| | | 0.0004 | 0.0008 | 0.0010 | 0.0007 | |
| | | 0.0010 | 0.0011 | 0.0010 | 0.0009 | |
| 2000 | | | 0.0013 | 0.0012 | 0.0007 | 0.0012 |
| | | 0.0016 | 0.0012 | 0.0007 | 0.0008 | |
| | | 0.0011 | 0.0010 | 0.0010 | 0.0005 | |
| 3000 | | | 0.0005 | 0.0013 | 0.0008 | 0.0009 |
| | | 0.0011 | 0.0012 | 0.0008 | 0.0011 | |
| | | 0.0011 | 0.0011 | 0.0009 | 0.0005 | |
Fig 1Power comparison of four tests as a function of heritability for four continuous traits with the magnitude of correlation at 0.2, 0.5 and 0.8, respectively.
All four traits are associated with the gene for the left panel and only the first three traits are associated with the gene for the right panel. Sample size is 1,000 and 20% of rare variants are causal. All causal variants are risk variants. The powers are evaluated at a significance level of 0.05.
Fig 6Power comparison of four tests as a function of heritability for four continuous traits with the magnitude of correlation at 0.2, 0.5 and 0.8, respectively.
Only the first two traits are associated with the gene for left panel and only the first traits are associated with the gene for right panel. Sample size is 1,000 and 20% of rare variants are causal among which 80% of causal variants are risk variants and 20% of causal variants are protective variants. The powers are evaluated at a significance level of 0.05.
The p-values of significant genes in the genetic association analysis for COPD using these four different methods.
| Chr | Genes | Range of MAF | minP | MANOVA | MSKAT | TOW-CM |
|---|---|---|---|---|---|---|
| 1 | EPHX1 | (0.0214, 0.4620) | 0.0197 | 0.6055 | 0.5890 | 0.6257 |
| 1 | IL6R | (0.1680, 0.4398) | 0.2646 | 0.5214 | 0.5163 | 0.5148 |
| 1 | MFAP2 | (0.1789, 0.4842) | 0.0753 | 0.6986 | 0.9926 | 0.6869 |
| 1 | TGFB2 | (0.0139, 0.4858) | 0.2282 | 0.1831 | ||
| 2 | HDAC4 | (0.0147, 0.4906) | 0.0468 | 0.3393 | 0.2197 | 0.5026 |
| 2 | SERPINE2 | (0.0143, 0.4642) | 0.4671 | 0.9797 | 0.7706 | 0.9010 |
| 2 | SFTPB | (0.0784, 0.4766) | 0.0738 | 0.1017 | 0.1669 | 0.3921 |
| 2 | TNS1 | (0.0128, 0.4936) | 0.00727 | 0.2095 | ||
| 3 | MECOM | (0.0099, 0.4957) | 0.0359 | 0.9878 | 0.7211 | 0.9735 |
| 3 | RARB | (0.0278, 0.4942) | 0.0491 | 0.1988 | 0.7469 | 0.3973 |
| 4 | LOC105377462 | (0.0190, 0.4933) | 0.8310 | |||
| 4 | FAM13A | (0.0279, 0.4968) | 0.2169 | 0.0939 | 0.3925 | |
| 4 | GC | (0.0511, 0.4397) | 0.1743 | 0.1875 | 0.6499 | 0.5257 |
| 4 | GSTCD | (0.0343, 0.3872) | 0.1376 | |||
| 4 | HHIP | (0.0368, 0.4984) | 0.0150 | 0.4131 | ||
| 5 | HTR4 | (0.0396, 0.4889) | 0.0487 | 0.6622 | 0.6512 | 0.8906 |
| 5 | SPATA9 | (0.1059, 0.4077) | 0.1145 | 0.3118 | 0.5198 | 0.1964 |
| 6 | TNF | (0.0259, 0.0809) | 0.0320 | 0.1627 | 0.1542 | 0.3077 |
| 6 | ZKSCAN3 | (0.0137, 0.3036) | 0.4990 | 0.8575 | 0.9083 | 0.8793 |
| 6 | AGER | (0.0442, 0.1830) | ||||
| 6 | ARMC2 | (0.0187, 0.4695) | 0.1618 | 0.2481 | 0.1474 | 0.6233 |
| 6 | NCR3 | (0.0133, 0.0899) | 0.0735 | 0.4641 | 0.4145 | 0.5892 |
| 6 | SOX5 | (0.0193, 0.4972) | 0.0764 | 0.8376 | 0.6845 | 0.6386 |
| 10 | LRMDA | (0.0094, 0.4956) | 0.0394 | 0.4102 | 0.7190 | 0.3260 |
| 10 | CDC123 | (0.0240, 0.4561) | 0.0138 | 0.6846 | 0.4097 | 0.8836 |
| 10 | GSTO2 | (0.0538, 0.4547) | 0.1387 | 0.8731 | ||
| 10 | SFTPD | (0.0186, 0.4367) | 0.3699 | 0.9997 | 0.9767 | 0.9751 |
| 11 | GSTP1 | (0.3351, 0.3452) | 0.7053 | 0.1211 | 0.5043 | |
| 11 | MMP1 | (0.0519, 0.3916) | 0.1665 | 0.8614 | 0.6557 | 0.9449 |
| 11 | MMP12 | (0.0541, 0.1439) | 0.4073 | 0.9512 | 0.7372 | 0.8941 |
| 12 | LRP1 | (0.0271, 0.4071) | 0.0144 | 0.4530 | 0.5326 | 0.1812 |
| 12 | BICD1 | (0.0224, 0.4984) | 0.3045 | 0.3856 | 0.2186 | 0.4076 |
| 12 | CCDC38 | (0.0783, 0.4669) | 0.3525 | 0.1151 | 0.5888 | 0.2316 |
| 14 | SERPINA1 | (0.0212, 0.4171) | 0.0254 | 0.6161 | 0.0816 | 0.3506 |
| 14 | SERPINA3 | (0.1076, 0.4907) | 0.4336 | 0.8567 | 0.6572 | 0.7375 |
| 15 | CHRNA3 | (0.0515, 0.4234) | 0.1387 | |||
| 15 | CHRNA5 | (0.2170, 0.4178) | ||||
| 15 | HYKK | (0.1070, 0.4139) | ||||
| 15 | IREB2 | (0.1577, 0.4287) | 0.1376 | |||
| 15 | THSD4 | (0.0115, 0.4944) | 0.00725 | 0.0798 | 0.8496 | 0.0669 |
| 16 | CFDP1 | (0.0424, 0.4139) | 0.0991 | 0.9474 | 0.7127 | 0.9772 |
| 17 | TIMP2 | (0.0403, 0.4950) | 0.1828 | 0.3702 | 0.6836 | |
| 19 | CYP2A6 | (0.2386, 0.2505) | ||||
| 19 | EGLN2 | (0.0465, 0.3712) | 0.00870 | 0.3913 | 0.3705 | |
| 19 | MIA | (0.0459, 0.0691) | 0.1152 | 0.0647 | 0.4979 | |
| 19 | RAB4B | (0.1374, 0.4273) | 0.7020 | |||
| 19 | TGFB1 | (0.0274, 0.4899) | 0.0418 | 0.3039 | 0.7122 | 0.4531 |
| 20 | MMP9 | (0.0412, 0.4234) | 0.0896 | 0.8143 | 0.79926 | 0.7949 |
| 21 | KCNE2 | (0.1172, 0.2778) | 0.1283 | 0.3687 | 0.3776 | 0.6938 |
| 22 | HMOX1 | (0.0530, 0.4270) | 0.0109 | 0.1936 | 0.1181 | 0.1107 |
Note: significance level 0.05 for MANOVA, MSKAT, TOW-CM, and 0.05/7 for minP.