| Literature DB >> 31296220 |
Sungyoung Lee1, Sunmee Kim2, Yongkang Kim3, Bermseok Oh4, Heungsun Hwang2, Taesung Park5,6.
Abstract
BACKGROUNDS: Recent large-scale genetic studies often involve clustered phenotypes such as repeated measurements. Compared to a series of univariate analyses of single phenotypes, an analysis of clustered phenotypes can be useful for substantially increasing statistical power to detect more genetic associations. Moreover, for the analysis of rare variants, incorporation of biological information can boost weak effects of the rare variants.Entities:
Keywords: Clustered phenotypes; Generalized estimating equations; Pathway analysis; Rare variants
Mesh:
Year: 2019 PMID: 31296220 PMCID: PMC6624181 DOI: 10.1186/s12920-019-0517-4
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Results of type 1 error simulation. Rows represent the proportions of significant genes within the causal pathway (10 and 20%), and columns represent different phenotypic correlation (0.25 and 0.5). For each plot, type 1 errors of PHARAOH-GEE are shown with varying gene-level effects (0.1, 0.2, 0.5 and 1.0) and pathway-level effects (0.15, 0.2 and 0.25), and type 1 errors of GEEaSPU are shown with orange bars
Fig. 2Result of power analysis. Columns and rows represent different phenotypic correlations (0.25 and 0.5) and proportions of significant genes within the causal pathway (10 and 20%). For each plot, estimated statistical powers from 300 simulation datasets are shown with combinations of gene-level effects (0.1, 0.2, 0.5 and 1) and pathway-level effects (0.15, 0.2 and 0.25)
Fig. 3Q-Q plots of the real data analyses. a Q-Q plot of the univariate analyses using KEGG pathway database and b Biocarta pathway database. c Q-Q plot of the analysis of the five phenotypes using PHARAOH-GEE
Top five pathways from PHARAOH-GEE. The q-values after the multiple testing adjustment are presented in each cell, with their corresponding p-values within the brackets. The results of univariate PHARAOH are also provided on the right side of the table
| Pathway | PHARAOH-GEE | GEEaSPU | Univariate PHARAOH | ||||
|---|---|---|---|---|---|---|---|
| HDL | TG | FASTGLU | WAIST | BP | |||
| Glyoxylate and dicarboxylate metabolism | 0.0929 (0.00063) | 0.16 (0.00099) | 0.987 (0.902) | 0.721 (0.021) | 0.772 (0.023) | 0.91 (0.842) | 0.916 (0.202) |
| Glycosphingolipid biosynthesis ganglio series | 0.159 (0.0038) | 0.979 (0.804) | 0.987 (0.79) | 0.805 (0.658) | 0.855 (0.137) | 0.805 (0.359) | 0.695 (0.067) |
| MAPK signaling pathway | 0.159 (0.00404) | 0.468 (0.126) | 0.987 (0.327) | 0.721 (0.234) | 0.855 (0.072) | 0.953 (0.901) | 0.997 (0.45) |
| Valine-leucine and isoleucine biosynthesis | 0.159 (0.0043) | 0.979 (0.797) | 0.987 (0.242) | 0.871 (0.779) | 0.999 (0.801) | 0.91 (0.813) | 0.695 (0.067) |
| Fatty acid metabolism | 0.436 (0.0173) | 0.977 (0.459) | 0.987 (0.834) | 0.721 (0.143) | 0.999 (0.893) | 0.903 (0.647) | 0.997 (0.909) |
Fig. 4An example of the PHARAOH-GEE model