| Literature DB >> 26411870 |
Recep Colak1, TaeHyung Kim2, Hilal Kazan3, Yoomi Oh4, Miguel Cruz5, Adan Valladares-Salgado5, Jesus Peralta5, Jorge Escobedo6, Esteban J Parra7, Philip M Kim8, Anna Goldenberg9.
Abstract
MOTIVATION: Rapid advances in genotyping and genome-wide association studies have enabled the discovery of many new genotype-phenotype associations at the resolution of individual markers. However, these associations explain only a small proportion of theoretically estimated heritability of most diseases. In this work, we propose an integrative mixture model called JBASE: joint Bayesian analysis of subphenotypes and epistasis. JBASE explores two major reasons of missing heritability: interactions between genetic variants, a phenomenon known as epistasis and phenotypic heterogeneity, addressed via subphenotyping.Entities:
Mesh:
Year: 2015 PMID: 26411870 PMCID: PMC4708100 DOI: 10.1093/bioinformatics/btv504
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Effects of hidden subphenotypes on GWAS. (A) Classical CC GWAS wherein heterogeneity in the case population is hidden or is not accounted for; can recover only shared causal markers (SNP3) (B) Subphenotyping approach to the same data recovers both shared (SNP3) and subphenotype causal-specific markers (SNP2, SNP4 and SNP5)
Summary of the six algorithms used for comparison
| Algorithm | Epis | Subphe | Case versus | Univariate | Multivariate |
|---|---|---|---|---|---|
| tasis | notyping | control | phenotype | phenotype | |
| BEAM | |||||
| OSACC | |||||
| χ2 CC | |||||
| χ2 multiway | |||||
| Multinom | |||||
| JBASE |
Fig. 2.Disease model results: performance of all algorithms across four dimensions: subpopulation size (A, B), odds ratios (C, D), MAF (E, F) and disease model combinations (G, H). For each plot, the performance is averaged over dimensions other than the dimension in focus. For example, for (A) all MAF, odds ratios and disease model combinations are averaged over and broken into subpopulation size combinations (see also Supplementary Figs. S6 and S7 for additional results under various call confidence thresholds)
Summary of the discovered subphenotypes
| Mexico-1 | Mexico-2 | |||||
|---|---|---|---|---|---|---|
| Obese | Lean | Obese | Lean | |||
| BMI | 30.4 | 27.3 | 8.98 | 30.7 | 28.23 | 0.032 |
| WHR | 0.98 | 0.94 | 0.0016 | 0.94 | 0.91 | 9.1 |
P values are calculated with Wilcoxon rank-sum test.
Summary of the discovered association markers
| Mexico-1 | Mexico-2 | |||||
|---|---|---|---|---|---|---|
| Module | Type | Module | Proxy( | Type | ||
| Epis. | 1.71 | 1.0 | Epis. | 2 | ||
| 1.0 | Epis. | |||||
| Epis. | 3.45 | 0.84 | Epis. | 1 | ||
| 1 | Epis. | |||||
The Proxy column is the LD (as measured by r2) between the Mexico-1 marker and its paired proxy in Mexico-2 dataset. Joint marginal P-values are calculated with χ2 test.