Literature DB >> 28978616

Integrative analysis of GWAS, eQTLs and meQTLs data suggests that multiple gene sets are associated with bone mineral density.

W Wang1, S Huang2, W Hou3, Y Liu4, Q Fan5, A He6, Y Wen6, J Hao6, X Guo6, F Zhang7.   

Abstract

OBJECTIVES: Several genome-wide association studies (GWAS) of bone mineral density (BMD) have successfully identified multiple susceptibility genes, yet isolated susceptibility genes are often difficult to interpret biologically. The aim of this study was to unravel the genetic background of BMD at pathway level, by integrating BMD GWAS data with genome-wide expression quantitative trait loci (eQTLs) and methylation quantitative trait loci (meQTLs) data
METHOD: We employed the GWAS datasets of BMD from the Genetic Factors for Osteoporosis Consortium (GEFOS), analysing patients' BMD. The areas studied included 32 735 femoral necks, 28 498 lumbar spines, and 8143 forearms. Genome-wide eQTLs (containing 923 021 eQTLs) and meQTLs (containing 683 152 unique methylation sites with local meQTLs) data sets were collected from recently published studies. Gene scores were first calculated by summary data-based Mendelian randomisation (SMR) software and meQTL-aligned GWAS results. Gene set enrichment analysis (GSEA) was then applied to identify BMD-associated gene sets with a predefined significance level of 0.05.
RESULTS: We identified multiple gene sets associated with BMD in one or more regions, including relevant known biological gene sets such as the Reactome Circadian Clock (GSEA p-value = 1.0 × 10-4 for LS and 2.7 × 10-2 for femoral necks BMD in eQTLs-based GSEA) and insulin-like growth factor receptor binding (GSEA p-value = 5.0 × 10-4 for femoral necks and 2.6 × 10-2 for lumbar spines BMD in meQTLs-based GSEA).
CONCLUSION: Our results provided novel clues for subsequent functional analysis of bone metabolism, and illustrated the benefit of integrating eQTLs and meQTLs data into pathway association analysis for genetic studies of complex human diseases.Cite this article: W. Wang, S. Huang, W. Hou, Y. Liu, Q. Fan, A. He, Y. Wen, J. Hao, X. Guo, F. Zhang. Integrative analysis of GWAS, eQTLs and meQTLs data suggests that multiple gene sets are associated with bone mineral density. Bone Joint Res 2017;6:572-576.
© 2017 Wang et al.

Entities:  

Keywords:  GWAS; eQTLs; meQTLs

Year:  2017        PMID: 28978616      PMCID: PMC5670365          DOI: 10.1302/2046-3758.610.BJR-2017-0113.R1

Source DB:  PubMed          Journal:  Bone Joint Res        ISSN: 2046-3758            Impact factor:   5.853


The genetic predisposition of osteoporosis is poorly understood. Integrating the information from the genome-wide association studies (GWAS) of bone mineral density (BMD) with expression of quantitative trait loci (eQTLs) and methylation quantitative trait loci (meQTLs) data, however, may provide a novel insight into the mechanism of osteoporosis. Integration of GWAS and eQTLs (meQTLs) data identified multiple pathways associated with BMD. Many pathways were associated with BMD in more than one region, including the Reactome Circadian Clock insulin-like growth factor receptor binding. Strength: Our new analysis framework can provide more biological interpretable results by integrating eQTL and meQTL data. Strength: The method can be applied to widely existed, publicly available GWAS summary data. Limitation: Further eQTL and meQTL data are warranted to provide more tissue-specific exploration of GWAS results.

Introduction

Bone mineral density (BMD) is the most commonly used indicator for assessing the risk of a fracture from osteoporosis. It has been estimated that genetic factors contribute to over half of the risk of low BMD, suggesting that BMD is a highly inherited phenotype.[1,2] Genome-wide association studies (GWAS) are capable of simultaneously assessing the correlation between target phenotypes and millions of genetic loci. GWAS of osteoporosis and BMD have successfully identified multiple susceptibility genes.[1,2] However, isolated susceptibility genes are often difficult to interpret biologically and have largely been neglected. GWAS have limited power to detect the causal loci with moderate or weak genetic effects, due to their genome-wide threshold of strict statistical significance.[3] Since individual genes can participate in multiple cellular processes, identifying several disease-associated genes is insufficient when used for revealing the pathogenesis of complex human diseases. A better solution, therefore, is to test the associations between target diseases and multiple functionally related loci, or biological pathways, simultaneously. Inspired by the gene set enrichment analysis (GSEA) of microarray data, GWAS-based pathway association analysis has been proposed and applied in several studies of GWAS.[3-6] By integrating GWAS results with known gene set annotation databases, GWAS-based pathway association analysis has been shown to perform better in clarifying GWAS results.[7] However, traditional pathway association analysis does not usually take into account the genetic effect of eQTLs and meQTLs. A large proportion of genetic variants affect the phenotypes by causal regulatory effect rather than by directly influencing the protein structure.[8] For instance, it has been demonstrated that eQTLs and meQTLs play important roles in the pathogenesis of complex human diseases.[8,9] A genome-wide meQTLs dataset has been used for next-generation sequencing to assay blood DNA methylation at approximately 4.5 million loci, and for testing their associations with about 4.5 million single-nucleotide polymorphisms (SNPs).[9] Recently, Zhu et al[8] proposed using summary data-based Mendelian randomisation (SMR) analysis to study this problem. SMR can integrate GWAS and genome-wide eQTLs datasets to identify causal genes, the expression of which is associated with target diseases.[8] SMR was applied to five real GWAS datasets and identified multiple novel susceptibility genes for complex human diseases, demonstrating the power for integrating eQTLs datasets with GWAS in genetic analysis. Integrating the genome-wide eQTLs and meQTLs dataset in GWAS analysis could therefore provide novel clues for clarifying the genetic mechanism of BMD. However, Zhu et al[8] mainly focused on single gene mapping, while GWAS-based pathway association analysis, evaluating eQTLs and meQTLs, has more potential to discover BMD associated gene sets. In this study, we integrated the Genetic Factors for Osteoporosis Consortium (GEFOS) GWAS of BMD with the genome-wide eQTLs and meQTLs data sets, in order to scan for possible BMD-associated gene sets. We first computed the gene score by SMR software and meQTLs, which were aligned with the GWAS results. The GSEA algorithm was then applied to the gene score to scan for BMD-associated gene sets. This novel method involved identifying significant pathways with the aid of prior data sets so that the results reflected real biological circumstances.

Materials and Methods

Gene score calculation

GWAS of BMD were integrated with eQTLs and meQTLs data before conducting pathway association analysis. A detailed description of the GWAS, eQTLs, and meQTLs data is presented as supplementary information. To integrate eQTLs with GWAS, SMR (an eQTLs-based GWAS analytical approach) was adopted to perform single-gene expression association analysis for BMD.[8] SMR used disease related GWAS and eQTLs data to evaluate the effect of gene expression on phenotypes. The effect size of the most significant SNP in the SMR test was denoted as the gene score in eQTL-based gene score calculation. For meQTLs-based gene score calculation, meQTLs were eliminated without gene annotation from a total of 386 767 meQTLs. The BMD GWAS SNPs were then aligned with the significant meQTLs (meQTLs p < 4.05 × 10-5, corresponding to false discovery rate (FDR) < 0.01), in order to identify those that overlapped. The largest GWAS statistic of the meQTLs was denoted within each gene as its gene scores for subsequent analysis.

Gene set enrichment analysis

The GWAS-based GSEA algorithm was adopted for this study.[3,10] The latest gene set annotation database, the Molecular Signatures Database (MSigDB, Version 5.1, Broad Institute, Cambridge, Massachusetts), containing a total of 13 311 annotated gene sets, was downloaded from a publicly available database (http://software.broadinstitute.org/gsea/msigdb/index.jsp).[11] A total of 5000 permutations were conducted to calculate the p-value of each gene set.

Results

We found multiple eQTLs-based pathways for BMD, some of which showed significant associations in more than one organ. Within each region, several pathways achieved significance (Fig. 1). In more than one region, a total of 181 gene sets were found to be significantly associated with BMD. Analogously, for meQTLs-based results, a total of 114 gene sets were identified that also achieved significance in more than one region (Table I, Supplementary Tables i and ii).
Fig 1

Summary of the expression of eQTL-based and meQTL-based pathway association analysis. Different pathways were associated with BMD in different regions (FN, femoral neck; LS, lumbar spine; FA, forearm).

Table I.

Gene sets that achieved significance in all three regions.

Pathway namep-value[*]
ForearmFemoral neckLumbar spine
Expression quantitative trait loci (eQTL)-based results
GSE17721 PAM3CSK4 vs CPG 12H BMDM up0.0280.038< 0.001
GSE5589 IL6 KO vs IL10 KO LPS and IL10 Stim macrophage 45 minutes up0.0360.0430.002
Module 1950.0420.0110.017
Module 4800.0140.0050.014
NABA ECM glycoproteins0.0490.0410.028
KEGG pathways in cancer0.0490.0380.008
GSE14769 unstim vs 60 minutes LPS BMDM up0.0200.0080.021
YTTCCNNNGGAMR unknown0.0370.0020.032
Darwiche papilloma risk high up0.0400.0010.038
GUO hex targets DN0.0470.0450.024
Module 4270.0470.0010.003
Module 2790.039< 0.0010.003
Methylation quantitative trait loci (meQTL)-based results
Chang cycling genes0.0380.0100.009
GNF2 KISS10.0430.0490.013
GNF2 MMP110.0320.0430.013
chr6q140.0260.0060.030

Kolmogorov-Smirnov running sum statistics was used and p-values were decided based on permutation[3]

Summary of the expression of eQTL-based and meQTL-based pathway association analysis. Different pathways were associated with BMD in different regions (FN, femoral neck; LS, lumbar spine; FA, forearm). Gene sets that achieved significance in all three regions. Kolmogorov-Smirnov running sum statistics was used and p-values were decided based on permutation[3] The results included multiple pathways with a known biological relevance to BMD, such as Reactome Circadian Clock (p < 0.001 for lumbar spine and 0.027 for femoral neck, respectively, in eQTLs-based GSEA) and insulin-like growth factor (IGF) receptor binding (p < 0.001 for femoral neck BMD and 0.026 for lumbar spine BMD, respectively, in meQTLs-based GSEA).

Discussion

Multiple GWAS studies have been conducted using considerable sample sizes,1,2,12-14 yet the overall results have only partly explained the genetic variance for BMD. This implies that the commonly used GWAS analytical strategy lacks the power to interpret the large quantity of information available within the GWAS datasets.[15] To study the genetic background of bone metabolism further and to make full use of the existing large amounts of BMD GWAS data, we have conducted a re-investigation of previous results by integrating genome- and transcriptome- /epitome- level data, and have identified several gene sets associated with BMD. Our eQTLs-based pathway association analysis also identified an interesting gene set, namely the Reactome Circadian Clock, which we found to be significantly associated with BMD in the regions of the femoral neck and lumbar spine. The role of the circadian clock in bone physiology and pathology has been characterised in many studies. Histological studies have identified the existence of circadian rhythm in both bone formation and resorption.[16,17] Biochemical parameters of bone remodelling have been shown to have a clear circadian pattern.[18,19] Moreover, functional studies have revealed that genes involved in mineral deposition also behave in a circadian fashion.[20] An animal experiment reported that a deficiency of the circadian clock protein BMAL1 in mice resulted in a low bone mass phenotype.[21] Epidemiological studies have found that individuals undertaking shift work developed circadian disruption, resulting in lower BMD and an increased risk of osteoporosis.[22] These findings have all demonstrated a relationship between the circadian clock and BMD. Our study suggests that the circadian clock may affect the mineral metabolism of human bones by using the eQTLs effect. Another typical gene set was the IGF receptor binding. This small gene set contains several other genes that interact selectively with the IGF receptors, including IGF-1 and IGF-2, which are essential for maintaining bone mass. These potent anabolic peptides promote formation in bone modelling.[23] The synthesis of IGF-1 by osteoblasts is under the control of growth hormones and a deficiency of the growth hormone (GH)/IGF axis contributes to the development of low bone mass.[24] Our meQTLs-based gene set enrichment analysis reported significant associations between this gene set and the femoral neck and lumbar spine BMD. We also identified significant associations between Reactome GH receptor signalling and BMD in both the femoral neck and the lumbar spine, suggesting that genetically controlled methylation of the GH/IGF axis may play a critical role in bone metabolism. In our study, however, none of the previously mentioned gene sets reached significance in the forearm, which may be because of the relatively small sample size of this area in our study, or may be due to the tissue specificity in that region. Despite these factors, there were still pathways that achieved significance in all three regions, few of which have been reported before. For example, our eQTLs-based analysis found YTTCCNNNGGAMR, consisting of 52 genes located in the promoter regions, which has not previously been associated with BMD in any of the three regions that we have studied. To the best of our knowledge, the common motif within the gene sets does not match any known transcription factor. These findings could help in subsequent functional studies of BMD. The main purpose of this study was to demonstrate that SNPs identified by GWAS exert their effects on phenotypes mainly or partly by influencing the gene expression (GE) and by methylation.[8,9] These regulatory genetic variants, located in non-coding areas or regions, have often been neglected in biological and functional analysis. Our integration analysis took advantage of both pathway strategies and eQTLs and meQTLs results, which should allow for better anticipation of such variants. Our analysed framework does not require individual-level genotype data and can be applied to other open access large-scale GWAS summary data sets. Our study employed classic GSEA to conduct enrichment analysis, as described by Maciejewski.[25] GSEA, which is a preferable tool for gene set association analysis, produces biologically interpretable p-values and generally performs well.[25] Additionally, the GSEA statistic was employed in our study as it is one of the most popular choices for pathway association analysis,5,6,26-28 although other proper enrichment algorithms can be used.[29] There are limitations to our study. First, both the eQTLs and meQTLs data are based on blood samples, and as such may lose power for eQTLs/meQTLs analysis, especially when considering tissue-specific effects. Further high-quality eQTL and meQTL mapping studies will be required for more accurate analysis. Second, a standard SMR analysis incorporates a heterogeneity in dependent instruments (HEIDI) test to distinguish linkage from causal/pleiotropy association. However, such processes were evaluated to decrease the overall number of enrolled genes. As this may decrease the power of subsequent pathway analysis, these processes were not used in our study. The meQTL data were only employed as a filter, which may have less power compared with SMR strategy. SMR-based meQTL analysis, as well as other forms of a priori information, may also suit the framework of Mendelian randomisation and hence warrant further investigation. In conclusion, we have identified a number of candidate gene sets for BMD through integrating the GWAS, eQTLs and meQTLs summary data sets. Our results have provided novel clues for subsequent functional analysis and our study has illustrated the integration of eQTLs and meQTLs data into pathway association analysis for genetic studies of complex human diseases.
  29 in total

1.  PAPA: a flexible tool for identifying pleiotropic pathways using genome-wide association study summaries.

Authors:  Yan Wen; Wenyu Wang; Xiong Guo; Feng Zhang
Journal:  Bioinformatics       Date:  2015-11-14       Impact factor: 6.937

2.  Deficiency of circadian clock protein BMAL1 in mice results in a low bone mass phenotype.

Authors:  William E Samsa; Amit Vasanji; Ronald J Midura; Roman V Kondratov
Journal:  Bone       Date:  2016-01-14       Impact factor: 4.398

3.  Pathway-based approaches for analysis of genomewide association studies.

Authors:  Kai Wang; Mingyao Li; Maja Bucan
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

4.  Integrated pathway and epistasis analysis reveals interactive effect of genetic variants at TERF1 and AFAP1L2 loci on melanoma risk.

Authors:  Myriam Brossard; Shenying Fang; Christopher I Amos; Florence Demenais; Amaury Vaysse; Qingyi Wei; Wei V Chen; Hamida Mohamdi; Eve Maubec; Nolwenn Lavielle; Pilar Galan; Mark Lathrop; Marie-Françoise Avril; Jeffrey E Lee
Journal:  Int J Cancer       Date:  2015-05-26       Impact factor: 7.396

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  Genome-wide Survey of Runs of Homozygosity Identifies Recessive Loci for Bone Mineral Density in Caucasian and Chinese Populations.

Authors:  Tie-Lin Yang; Yan Guo; Ji-Gang Zhang; Chao Xu; Qing Tian; Hong-Wen Deng
Journal:  J Bone Miner Res       Date:  2015-06-16       Impact factor: 6.741

7.  Diurnal variation of bone mineral turnover in elderly men and women.

Authors:  S L Greenspan; R Dresner-Pollak; R A Parker; D London; L Ferguson
Journal:  Calcif Tissue Int       Date:  1997-05       Impact factor: 4.333

8.  Association of JAG1 with bone mineral density and osteoporotic fractures: a genome-wide association study and follow-up replication studies.

Authors:  Annie W C Kung; Su-Mei Xiao; Stacey Cherny; Gloria H Y Li; Yi Gao; Gloria Tso; Kam S Lau; Keith D K Luk; Jian-min Liu; Bin Cui; Min-Jia Zhang; Zhen-lin Zhang; Jin-wei He; Hua Yue; Wia-bo Xia; Lian-mei Luo; Shu-li He; Douglas P Kiel; David Karasik; Yi-Hsiang Hsu; L Adrienne Cupples; Serkalem Demissie; Unnur Styrkarsdottir; Bjarni V Halldorsson; Gunnar Sigurdsson; Unnur Thorsteinsdottir; Kari Stefansson; J Brent Richards; Guangju Zhai; Nicole Soranzo; Ana Valdes; Tim D Spector; Pak C Sham
Journal:  Am J Hum Genet       Date:  2010-01-21       Impact factor: 11.025

Review 9.  Skeletal effects of growth hormone and insulin-like growth factor-I therapy.

Authors:  Richard C Lindsey; Subburaman Mohan
Journal:  Mol Cell Endocrinol       Date:  2015-09-25       Impact factor: 4.102

10.  Identification of 153 new loci associated with heel bone mineral density and functional involvement of GPC6 in osteoporosis.

Authors:  John P Kemp; John A Morris; Carolina Medina-Gomez; Vincenzo Forgetta; Nicole M Warrington; Scott E Youlten; Jie Zheng; Celia L Gregson; Elin Grundberg; Katerina Trajanoska; John G Logan; Andrea S Pollard; Penny C Sparkes; Elena J Ghirardello; Rebecca Allen; Victoria D Leitch; Natalie C Butterfield; Davide Komla-Ebri; Anne-Tounsia Adoum; Katharine F Curry; Jacqueline K White; Fiona Kussy; Keelin M Greenlaw; Changjiang Xu; Nicholas C Harvey; Cyrus Cooper; David J Adams; Celia M T Greenwood; Matthew T Maurano; Stephen Kaptoge; Fernando Rivadeneira; Jonathan H Tobias; Peter I Croucher; Cheryl L Ackert-Bicknell; J H Duncan Bassett; Graham R Williams; J Brent Richards; David M Evans
Journal:  Nat Genet       Date:  2017-09-04       Impact factor: 38.330

View more
  5 in total

1.  Neuroarthropathy in diabetes: pathogenesis of Charcot arthropathy.

Authors:  S E Johnson-Lynn; A W McCaskie; A P Coll; A H N Robinson
Journal:  Bone Joint Res       Date:  2018-06-05       Impact factor: 5.853

2.  Integrated genomics analysis highlights important SNPs and genes implicated in moderate-to-severe asthma based on GWAS and eQTL datasets.

Authors:  Zhouzhou Dong; Yunlong Ma; Hua Zhou; Linhui Shi; Gongjie Ye; Lei Yang; Panpan Liu; Li Zhou
Journal:  BMC Pulm Med       Date:  2020-10-16       Impact factor: 3.317

3.  Integration of summary data from GWAS and eQTL studies identified novel risk genes for coronary artery disease.

Authors:  Yigang Zhong; Liuying Chen; Jingjing Li; Yinghao Yao; Qiang Liu; Kaimeng Niu; Yunlong Ma; Yizhou Xu
Journal:  Medicine (Baltimore)       Date:  2021-03-19       Impact factor: 1.817

4.  Evolutionary Genetic Signatures of Selection on Bone-Related Variation within Human and Chimpanzee Populations.

Authors:  Daryn A Stover; Genevieve Housman; Anne C Stone; Michael S Rosenberg; Brian C Verrelli
Journal:  Genes (Basel)       Date:  2022-01-21       Impact factor: 4.141

5.  Integrative genomics analysis of eQTL and GWAS summary data identifies PPP1CB as a novel bone mineral density risk genes.

Authors:  Yu Zhai; Lu Yu; Yang Shao; Jianwei Wang
Journal:  Biosci Rep       Date:  2020-04-30       Impact factor: 3.840

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.