| Literature DB >> 33175855 |
Yamin Deng1,2, Shiman Wu1, Huifang Fan1.
Abstract
For complex diseases, genome-wide pathway association studies have become increasingly promising. Currently, however, pathway-based association analysis mainly focus on a single phenotype, which may insufficient to describe the complex diseases and physiological processes. This work proposes a combination model to evaluate the association between a pathway and multiple phenotypes and to reduce the run time based on asymptotic results. For a single phenotype, we propose a semi-supervised maximum kernel-based U-statistics (mSKU) method to assess the pathway-based association analysis. For multiple phenotypes, we propose the fisher combination function with dependent phenotypes (FC) to transform the p-values between the pathway and each marginal phenotype individually to achieve pathway-based multiple phenotypes analysis. With real data from the Alzheimer Disease Neuroimaging Initiative (ADNI) study and Human Liver Cohort (HLC) study, the FC-mSKU method allows us to specify which pathways are specific to a single phenotype or contribute to common genetic constructions of multiple phenotypes. If we only focus on single-phenotype tests, we may miss some findings for etiology studies. Through extensive simulation studies, the FC-mSKU method demonstrates its advantages compared with its counterparts.Entities:
Year: 2020 PMID: 33175855 PMCID: PMC7657528 DOI: 10.1371/journal.pone.0240910
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Under different settings, the experiential type 1 error of four methods.
| Data Dimension | Sample Size | Correlation | FC-mSKU | MultiPhen-GATES | O’Brien-VEGAS | RMMLR |
|---|---|---|---|---|---|---|
| 0.051 | 0.050 | 0.053 | 0.057 | |||
| 0.048 | 0.047 | 0.061 | 0.049 | |||
| 0.050 | 0.061 | 0.062 | 0.041 | |||
| 0.052 | 0.046 | 0.071 | 0.056 | |||
| 0.049 | 0.032 | 0.052 | 0.063 | |||
| 0.050 | 0,052 | 0.073 | 0.071 | |||
| 0.046 | 0.047 | 0.077 | 0.063 | |||
| 0.051 | 0.051 | 0.063 | 0.052 | |||
| 0.047 | 0.063 | 0.076 | 0.047 | |||
| 0.045 | 0.052 | 0.061 | 0.061 | |||
| 0.047 | 0.049 | 0.073 | 0.072 | |||
| 0.053 | 0.039 | 0.080 | 0.063 |
Fig 1The testing power of the four methods under four scenarios with ρ = 0.2.
Fig 2The observed power of the four methods with ρ = 0.7 under four scenarios.
Fig 3The Q-Q-plot of p-values shows the results of multivariate analysis.
The x-axis denotes the expected p-value (-log 10), while the y-axis shows the observed p-value (-log 10). The red diagonal line has slope 1 and intercept 0.
Fig 4It shows the Q-Q-plot of p-values by FC to test the association between pathways and multiple phenotypes.
The x-axis denotes expected-log 10 (p-value), while the y-axis shows observed-log 10 (p-value). The red diagonal line has slope 1 and intercept 0.