| Literature DB >> 24415945 |
Eun Yong Kang1, Buhm Han2, Nicholas Furlotte1, Jong Wha J Joo3, Diana Shih4, Richard C Davis4, Aldons J Lusis5, Eleazar Eskin6.
Abstract
Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study thus explaining the large number of loci discovered in the combined study.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24415945 PMCID: PMC3886926 DOI: 10.1371/journal.pgen.1004022
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Application of Meta-GxE to Apoa2 locus.
The forest plot (A) shows heterogeneity in the effect sizes across different studies. The PM-plot (B) predicts that 7 studies have an effect at this locus, even though only 1 study (HMDP-chow(M)) is genome-wide significant with P-value.
17 HDL studies for meta analysis.
| Study ID | Strains | Conditions | Age | Sex | # Strains | # Samples | # Sig Loci | Ref |
| HMDPxB-chow(M) | (HMDP×BL/6) F1 | Leiden/CETP TG,chow diet | 8 weeks | M | 97 | 516 | 1 | U |
| HMDPxB-chow(F) | (HMDP×BL/6) F1 | Leiden/CETP TG, chow diet | 8 weeks | F | 95 | 468 | 0 | U |
| HMDPxB-ath(M) | (HMDP×BL/6) F1 | Leiden/CETP TG, highfat diet | 24 weeks | M | 97 | 408 | 0 | U |
| HMDPxB-ath(F) | (HMDP×BL/6) F1 | Leiden/CETP TG, highfat diet | 24 weeks | F | 93 | 457 | 3 | U |
| HMDP-chow(M) | HMDP | chow diet | 12 weeks | M | 111 | 749 | 6 |
|
| HMDP-fat(M) | HMDP | highfat diet | 16 weeks | M | 106 | 586 | 0 |
|
| HMDP-fat(F) | HMDP | highfat diet | 16 weeks | F | 92 | 475 | 0 |
|
| BxD-db-12(M) | (DBA×BL/6) F2 | BXD db/db, chow diet | 12 weeks | M | 125 | 125 | 0 |
|
| BxD-db-12(F) | (DBA×BL/6) F2 | BXD db/db, chow diet | 12 weeks | F | 122 | 122 | 0 |
|
| BxD-db-5(M) | (DBA×BL/6) F2 | BXD db/db, chow diet | 5 weeks | M | 109 | 109 | 1 |
|
| BxD-db-5(F) | (DBA×BL/6) F2 | BXD db/db, chow diet | 5 weeks | F | 139 | 139 | 0 |
|
| BxH-apoe(M) | (C3H×BL/6) F2 | BXH Apoe -/- | 24 weeks | M | 161 | 161 | 0 |
|
| BxH-apoe(F) | (C3H×BL/6) F2 | BXH Apoe -/- | 24 weeks | F | 174 | 174 | 0 |
|
| BxH-wt(M) | (C3H×BL/6) F2 | BXH wildtype, highfat diet | 20 weeks | M | 164 | 164 | 0 |
|
| BxH-wt(F) | (C3H×BL/6) F2 | BXH wildtype, highfat diet | 20 weeks | F | 144 | 144 | 0 |
|
| CxB-ldlr(M) | (BALB/cJ×BL/6) F2 | CXB LDLR -/-, highfat diet | 12 weeks | M | 124 | 124 | 0 | U |
| CxB-ldlr(F) | (BALB/cJ×BL/6) F2 | CXB LDLR -/-, highfat diet | 12 weeks | F | 64 | 64 | 0 | U |
Seventeen HDL studies are combined in the meta analysis. U in the Ref column represents a data set that is not yet published. Mice for the HMDPxB panel were created by breeding females of the various HMDP inbred strains to males carrying transgenes for both Apoe Leiden and for human Cholesterol Ester Transfer Protein (CETP) on a C57BL/6 genetic background. The Leiden/CETP transgenes [48], [49] cause high total cholesterol level and high LDL cholesterol level in the circulation, along with reduced HDL cholesterol. BxD db/db denotes a population of F2 mice from a cross between C57BL/6 DBA/2 with homozygous deficiency in leptin receptor (db/db), which results in obese mice. BxH Apoe -/- denotes denotes a population of F2 mice from a cross between C57BL/6 and C3H also carrying a deficiency in apolipoprotein E. CxB LDLR -/- denotes a population of F2 mice from a cross between C57BL/6 and BALB/cBy also carrying a deficiency in LDL receptor, which results in high LDL cholesterol level in the circulation BXH wildtype denotes a population of F2 mice from a cross between C57BL/6 and C3H.
Figure 2The prevalence of heterogeneity in effect size of significant loci.
Each dot represents association between SNPs and HDL phenotype from applying random effects based meta-analysis approach. Dots with larger value represents the existence of more heterogeneity at the locus between studies. The distribution of the heterogeneity statistic for significant SNPs (red dots) in the meta analysis is skewed toward higher heterogeneity while the non-significant SNPs are much less skewed.
Figure 3Power of mouse meta-analysis to identify gene-by-environment interactions in 4,965 animals from 17 studies under varying mean effect sizes and the per study variance of the effect size which corresponds to gene-by-environment effects.
Figure 4Power of (a) random-effect, (b) fixed-effect meta-analysis and (c) heterogeneity meta-analysis methods as a function of the effect size and the strength of the interaction effect (heterogeneity). (d) shows a comparison of the three methods with the color corresponding to the method with the highest power.
Figure 5Manhattan plots showing the results of Meta-GxE applied to (a) 17 HDL studies, (b) 9 HDL studies consisting only of male animals and (c) 8 studies consisting only of female animals.
26 significant loci identified by applying Meta-GxE analysis.
| SNP | Meta GxE | Meta GxE | Meta GxE | # of Studies w/ | HE Meta | # Studies | Genes | Gene |
| Location | (Male) | (Female) | (Male+Female) | Significant | (Male+Female) | E/A/N | in Region | Refs |
| Chr1:64752822 (rs31078051) | 1.12×10−6 | 2.37×10−3 |
| 0 | 3.82×10−3 | 2/14/1 |
|
|
| Chr1:107271282 (rs32203839) | 6.69×10−4 | 2.66×10−4 |
| 0 | 7.55×10−2 | 6/0/11 |
|
|
| Chr1:171199523 (rs32075748) |
|
|
| 1 | 5.80×10−5 | 8/6/3 |
|
|
| Chr2:77837584 (rs6273567) | 1.55×10−4 | 7.17×10−4 |
| 0 | 2.25×10−2 | 3/14/0 |
|
|
| Chr2:134421733 (rs27238693) | 7.69×10−6 | 1.66×10−3 |
| 0 | 1.63×10−2 | 5/12/0 |
|
|
| Chr3:32944259 (rs29869794) | 2.97×10−6 | 2.49×10−2 |
| 0 | 2.36×10−5 | 3/8/6 |
|
|
| Chr3:76066632 (rs31487078) | 5.89×10−3 | 2.29×10−5 |
| 0 | 6.56×10−2 | 4/13/0 |
| - |
| Chr3:107430396 (rs30013147) | 9.59×10−6 | 3.84×10−5 |
| 0 | 7.74×10−2 | 7/10/0 |
|
|
| Chr3:143466942 (rs30206761) | 1.82×10−3 | 3.97×10−5 |
| 0 | 8.60×10−2 | 7/10/0 |
|
|
| Chr4:131925523 (rs32595861) | 1.72×10−4 | 2.84×10−4 |
| 0 | 8.65×10−4 | 6/8/3 |
|
|
| Chr5:119034507 (rs33131194) | 1.23×10−4 | 2.59×10−3 |
| 0 | 3.94×10−1 | 9/8/0 |
|
|
| Chr8:46903188 (rs33272858) |
| 6.52×10−1 | 1.66×10−6 | 1 | 1.62×10−4 | 2/11/4 |
|
|
| Chr8:64150094 (rs31750594) |
| 1.89×10−4 |
| 0 | 8.34×10−1 | 11/6/0 |
|
|
| Chr8:84073148 (rs33435859) |
| 4.53×10−4 |
| 0 | 8.33×10−1 | 12/5/0 |
|
|
| Chr9:101972687 (rs6333310) | 1.22×10−4 | 1.22×10−5 |
| 0 |
| 2/1/14 |
|
|
| Chr10:21399819 (rs29363941) | 9.07×10−4 |
|
| 0 | 1.18×10−2 | 3/12/2 |
|
|
| Chr10:90146088 (rs29370592) |
| 0.756 | 1.02×10−5 | 1 | 8.94×10−4 | 2/14/1 |
|
|
| Chr11:69906552 (rs29477071) |
| 1.35×10−5 |
| 0 |
| 6/9/2 |
|
|
| Chr11:114083173 (rs29416888) | 1.10×10−4 | 7.83×10−5 |
| 0 | 5.28×10−5 | 3/13/1 |
|
|
| Chr14:33632464 (rs31061259) | 1.96×10−4 | 1.65×10−3 |
| 0 | 2.02×10−5 | 3/10/4 |
|
|
| Chr15:21194226 (rs31670969) |
| 1.29×10−2 |
| 1 |
| 3/2/12 |
| - |
| Chr15:59860191 (rs3718217) | 5.64×10−6 | 1.45×10−5 |
| 0 | 9.92×10−5 | 5/10/2 |
|
|
| Chr17:46530712 (rs33259313) | 1.09×10−5 | 4.90×10−3 |
| 0 | 3.53×10−5 | 5/10/2 |
|
|
| Chr18:82240606 (rs13483466) | 2.05×10−4 | 2.23×10−4 |
| 0 | 9.05×10−1 | 5/12/0 |
|
|
| Chr19:3319089 (rs31004232) | 5.58×10−2 |
|
| 0 | 1.08×10−1 | 3/14/0 |
|
|
| ChrX:151384614 (rs31202008) | 2.59×10−4 | 4.72×10−6 |
| 0 | 1.12×10−1 | 5/5/0 |
|
|
Twentysix significant loci identified by applying Meta-GxE analysis of both random effects meta-analysis and heterogeneity testing to 17 mouse HDL studies under different environments containing 4,965 total animals. # studies E denotes the number of studies with an effect on HDL phenotype. # studies N denotes the number of studies with no effect on HDL phenotype. # studies A denotes the number of studies with an ambiguous effect size. Genes in region denotes candidate genes for each locus based on close proximity to the peak SNP and previously suggested role in lipid or apolipoprotein metabolism: Pikfyve (phosphoinositide kinase), Bcl2 (B cell leukemia/lymphoma 2), Apoa2 (apolipoprotein A-II), Agps (alkylglycerone phosphate synthase), Jag1 (jagged 1), Prkci (protein kinase C), Prkci (colony stimulating factor 1 (macrophage)), Hs2st1 (heparan sulfate 2-O-sulfotransferase 1), Fabp3 (fatty acid binding protein 3), Nos1 (nitric oxide synthase 1), Acsl1 (acyl-CoA synthetase long-chain family member 1), Cpe (carboxypeptidase E), Prkaca (protein kinase, cAMP dependent, catalytic, alpha), Acox1 (peroxisomal acyl-coenzyme A oxidase 1), Ppyr1 (pancreatic polypeptide receptor 1), Trib1(tribbles homolog 1), Sqle (squalene epoxidase), Gnmt (glycine N-methyltransferase), Mbp(myelin basic protein), Lrp5 (low density lipoprotein receptor-related protein 5), Htr2c (5-hydroxytryptamine (serotonin) receptor 2C).
Figure 6Peak SNP in chromosome 8 shows interesting gene-by-environment interactions between sex×mutation-driven LDL levels.