| Literature DB >> 27194809 |
Eun Yong Kang1, Yurang Park2, Xiao Li3, Ayellet V Segrè3, Buhm Han4, Eleazar Eskin5.
Abstract
Meta-analysis has become a popular tool for genetic association studies to combine different genetic studies. A key challenge in meta-analysis is heterogeneity, or the differences in effect sizes between studies. Heterogeneity complicates the interpretation of meta-analyses. In this paper, we describe ForestPMPlot, a flexible visualization tool for analyzing studies included in a meta-analysis. The main feature of the tool is visualizing the differences in the effect sizes of the studies to understand why the studies exhibit heterogeneity for a particular phenotype and locus pair under different conditions. We show the application of this tool to interpret a meta-analysis of 17 mouse studies, and to interpret a multi-tissue eQTL study.Entities:
Keywords: GWAS; genetic association studies; heterogeneity; meta-analysis
Mesh:
Year: 2016 PMID: 27194809 PMCID: PMC4938634 DOI: 10.1534/g3.116.029439
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Seventeen mouse HDL studies with various environmental/genetic conditions are combined in this meta-analysis (Kang ). In this example, we want to focus on three BXH-wt(M) and four BXH-wt(F) studies. These BXH strains are F2 mice constructed from a cross between C57BL/6J × C3H/HeJ F2 wild-type strains under western diet conditions (van Nas ), but differing by sex. When we consider the effect size estimates only in forest plot format, two confidence intervals of effect estimates overlap each other, making it ambiguous if the observed heterogeneity is a result of stochastic errors. However, in the PM-Plot, since the m-values are calculated utilizing cross-study information, the posterior probabilities are well segregated for these two studies (m-value: 0.93 vs. 0.03), allowing us to hypothesize that the SNP effects on HDL in C57BL/6J × C3H/HeJ F2 strains under the western diet condition can be interacting with sex. Implicated genes are Fabp3, also known as fatty acid binding protein 3, which is a well-known gene playing a regulatory role at the nexus of lipid metabolism and signaling including HDL-cholesterol, LDL-cholesterol, and fasting insulin (Mitchell ; Zhang ). (A) Forest plot and (B) PM-plot for rs32595861 locus (Fabp3 gene) analyzing data from the Kang study. FE, fixed effects model; RE, random effects model.
Figure 2Thirteen multiple-tissue eQTL studies analyzed in GTEx Consortium (2015). In this example, 13 different tissue eQTLs were analyzed together for SEMA3B gene expression levels. The first column shows the P-value for each tissue specific eQTL study. The different colored dots represent the different tissues, the study name column shows the various tissue names included in this multi-tissue eQTL analysis. The forest plot shows that the SNP rs28559826 shows a better association with the SEMA3B gene expression level in three tissues (heart left ventricle, stomach, and thyroid), although the confidence intervals overlap between many tissues. On the other hand, the PM-plot clearly shows that association of the top three tissues (heart left ventricle, stomach, and thyroid) are outstanding compared to other tissue eQTLs. The gene SEMA3B is also known as the semaphorin/collapsin family of molecules. This gene plays a critical role in the guidance of growth cones during neuronal development. It has been shown to act as a tumor suppressor by inducing apoptosis (SEMA3B 2015). (A) Forest plot and (B) PM-plot for rs28559826 locus (SEMA3B gene) analyzing data from the GTex study (GTEx Consortium 2015). RE, random effects model.