Literature DB >> 22779046

Coanalysis of GWAS with eQTLs reveals disease-tissue associations.

Hyunseok Peter Kang1, Alex A Morgan, Rong Chen, Eric E Schadt, Atul J Butte.   

Abstract

Expression quantitative trait loci (eQTL), or genetic variants associated with changes in gene expression, have the potential to assist in interpreting results of genome-wide association studies (GWAS). eQTLs also have varying degrees of tissue specificity. By correlating the statistical significance of eQTLs mapped in various tissue types to their odds ratios reported in a large GWAS by the Wellcome Trust Case Control Consortium (WTCCC), we discovered that there is a significant association between diseases studied genetically and their relevant tissues. This suggests that eQTL data sets can be used to determine tissues that play a role in the pathogenesis of a disease, thereby highlighting these tissue types for further post-GWAS functional studies.

Entities:  

Year:  2012        PMID: 22779046      PMCID: PMC3392070     

Source DB:  PubMed          Journal:  AMIA Jt Summits Transl Sci Proc


Introduction

Genome wide association studies (GWAS) of common complex or multifactorial diseases have proliferated enormously over the last few years. They have also been successful in identifying a large number of loci at extraordinary levels of significance. However, this success has presented a new challenge: translating these findings into a full understanding of how the loci affect complex disease traits. Most of the reported variants do not affect protein function in an obvious manner and indeed a large number lie in introns or intergenic regions, indicating that they may function through more subtle regulation of gene expression. This is compatible with the hypothesis that such genetic variants, commonly known as expression quantitative trait loci (eQTL), are an important factor in disease susceptibility (1). eQTLs are discovered when samples are simultaneously studied using genotyping tools, yielding information on DNA variants, and expression measurement tools, yielding information on RNA levels. Each SNP can be represented by its alleles or genotypes, and statistically associated across samples with RNA levels. As a hypothetical example of one eQTL (Figure 1), as genotypes vary from CC, CT, TT at SNP rs7188573 in gene MMP25, expression levels of this gene decrease in monocytes. Note that the SNP may be within a certain distance from the gene, in which case the SNP is termed a cis eSNP, or might be distant, termed a trans eSNP. The degree of association is typically estimated statistically, using ANOVA or equivalent tests, controlled for multiple hypothesis testing. Nicolae, et al., recently demonstrated that SNPs associated with human traits are in general enriched for eQTLs (2). Other studies have shown that eQTLs calculated in the tissue of interest in a disease are enriched for disease-associated SNPs (3,4). These insights have been applied to use eQTLs to provide another layer of meaning to SNPs and prioritize GWAS results (5,6).
Figure 1.

Hypothetical example of the different expression values of MMP25 observed in monocytes with various genotypes at eSNP rs7188573.

Since gene expression profiles vary in different tissues, it is only to be expected that some eQTLs are tissue specific, and it has been reported that 33–69% of eQTLs, depending on the analysis method and the tissue type, are not discovered in other tissues (3,7,8), suggesting that the differences in eQTLs across tissues may provide additional functional information. This prompted us to investigate the correlation between the statistical significance of eQTLs calculated in different tissues (blood monocytes, liver, and adipose tissue) and their odds ratios in diseases studied by the Wellcome Trust Case Control Consortium (WTCCC) (9). Our results reveal the potential for this type of correlation analysis to be used to determine the tissue of interest for a disease, ultimately providing information that narrows the research focus or opening up novel avenues of inquiry.

Methods

The overall experimental design in shown in Figure 2. We obtained publicly-available eQTL data generated from 3 different tissue types in healthy Caucasians: peripheral blood monocytes (8), liver, and adipose tissue (10). The first data set specifically involved monocytes isolated from the blood of 1490 healthy individuals recruited in the Gutenberg Heart Study. The eQTLs were mapped using analysis of variance (ANOVA), with a p-value cutoff of 5.78E-12, corresponding to a family-wise error rate of 0.05. This study reported 37,403 associations, comprising 29,912 SNPs and 2,745 expression traits. The liver and adipose datasets involved samples obtained from 1008 morbidly obese individuals at the time of gastric bypass surgery. Association was determined using the Kruskal-Wallis test, and results were reported at a 10% false discovery rate (FDR) based on permutations of the SNP genotypes and gene expression levels, for a total of 24,513 eSNPs associated with 15,241 transcripts or 9931 distinct genes.
Figure 2.

Experimental design.

To compare against these, we used the results of the genome wide association studies reported by the Wellcome Trust Case Control Consortium for 7 different diseases: bipolar disorder (BD), coronary artery disease (CAD), Crohn’s disease (CD), hypertension (HTN), rheumatoid arthritis (RA), type 1 diabetes (T1D), and type 2 diabetes (T2D). We will refer to the p-value of eQTL association for an eSNP as peqtl, and the p-value of disease association for a given SNP as pgwas. For each of the three tissue types for which we had eQTL relationships, we studied the relationship of minimum eSNP p-value (peqtl) to odds ratio against any of the seven diseases. For each disease studied by the WTCCC, we selected eSNPs in each tissue that were in very strong linkage disequilibrium (LD, R2 = 1) with any SNP having uncorrected pgwas < 0.01 for that disease. Because testing multiple SNPs in LD with each other could result in spurious correlation, we selected one SNP with the minimum peqtl from each LD block and performed Kendall rank correlation analysis of -log(peqtl) and the absolute value of log(disease odds ratio) using the cor.test function in R (11). The resulting p-values of 21 statistical tests were provided as input to the qvalue package in R to calculate corresponding q-values (12).

Results

Our goal was to determine whether one can computationally assign a functional tissue to a disease, given only the genetic architecture found through GWAS, using tissue-specific eQTL data sets. As shown in Figure 2, we intersected two sets of variants, one from each disease studied by the WTCCC and the other from eSNPs reported in each tissue type. The total numbers of eSNPs reported to be significant in each tissue, and the number that were associated with each disease at p < 0.01, are shown in Table 1. For the eSNPs listed in this table, we investigated the correlation between peqtl and disease odds ratio. For example, Figure 3 shows the rank of log(odds ratio) plotted against log(peqtl) for monocyte eSNPs and type 1 diabetes. The log of Kendall rank correlation p-value for each disease is shown in Figure 4.
Table 1.

Number of eSNPs for each tissue type that are associated with each disease at p < 0.01.

Tissue typeMonocytesLiverAdipose
Total # of eSNPs30,898738015970
DiseaseNumber of eSNPs with p < 0.01 for disease
Bipolar disorder21370113
Coronary artery disease18357108
Crohn’s disease21166128
Hypertension18249117
Rheumatoid arthritis32884139
Type 1 diabetes481113206
Type 2 diabetes17872119
Figure 3.

Plot of rank of absolute log(odds ratio) vs. −log(peqtl) for monocyte eSNPs and type 1 diabetes.

Figure 4.

Correlation statistics for eSNP p-value and disease odds ratio. The red line represents a cutoff of p = 0.05.

For SNPs reported to be associated with gene expression in monocytes, we found that the correlation was significant at p < 0.05 in three diseases, Crohn’s disease, rheumatoid arthritis, and type 1 diabetes. These are all autoimmune diseases in which macrophages play various roles. For example, there are recent reports that Crohn’s disease may be a primary immunodeficiency of macrophages (13), and these cells are also involved in the pathogenesis of type 1 diabetes (14,15). Monocytes are also of central importance in rheumatoid arthritis (16). Closer inspection revealed several SNPs in the vicinity of -- but not within -- CARD9 that were strongly associated with the expression of this gene. CARD9 has previously been implicated in Crohn’s disease, although the association was not replicated in the WTCCC study (17). In liver tissue, we found that the correlation was significant for type 2 diabetes, and borderline for coronary artery disease. The liver is of course intimately involved in the metabolism of glucose and lipids, and liver pathology, such as nonalcoholic fatty liver disease seen in metabolic syndrome, is linked to coronary artery disease (18) and type 2 diabetes (19). We did not find any significant correlations in adipose tissue. The Kendall rank correlation statistics and q-values are summarized in Table 2. For the significant correlations, the q-values ranged from 0.025 to 0.205.
Table 2.

Summary of Kendall rank correlation statistics and q-values. Bold text indicates significant correlations.

Tissue typeMonocytesLiverAdipose
CorrelationτPqτPqτPq
Disease
Bipolar disorder−0.0380.4060.7470.1400.0880.250−0.0670.2940.594
Coronary artery disease−0.0230.6510.9020.1780.05050.205−0.0100.8820.902
Crohn’s disease0.1060.0220.1480.0510.5460.8920.0200.7370.902
Hypertension0.0280.5730.892−0.0220.8300.902−0.0090.8910.902
Rheumatoid arthritis0.0850.0220.1480.1330.0730.2460.0940.0990.250
Type 1 diabetes0.0990.0010.025−0.0130.8390.9020.0030.9480.914
Type 2 diabetes0.0610.2300.5160.1710.0330.1680.0130.8310.902

Discussion

We systematically investigated the correlation between the statistical significance of eQTLs calculated in different tissues and their odds ratios for different diseases studied by the WTCCC. Although several groups have used these ‘tissue eQTLs’ as an indicator of functional significance in interpreting GWAS results, to our knowledge, this is the first time the correlation of tissues to diseases has been systematically evaluated. If we consider eQTLs to be a marker of the functional significance of SNPs, our analysis reveals that in tissues that play a role in the disease this significance correlates with the disease odds ratio. These results suggest that the reverse may be true as well: tissues in which there is a correlation between functional significance and odds ratio are more likely to play a role in that disease. While the statistical associations are not very strong, the false discovery rates of 0.025 to 0.205 indicate that the majority of them are not spurious. We plan to validate these findings by performing the analysis on additional eQTL and GWAS datasets. Ultimately, eSNP-gene relationships in the tissue of interest may reveal novel candidate disease SNPs, providing additional clues to the pathogenesis of the disease. For example, rs7698608, a SNP with p = 0.0005 for type 2 diabetes, is associated with the expression of CISD2 in the liver. CISD2 is a causative gene for Wolfram syndrome type 2, and a glucose intolerance phenotype has been observed in CISD2−/− C57BL/6 mice (20). In conclusion, analyzing the correlation between tissue eQTL significance and disease odds ratio may provide another layer of tissue-specific information that can be used to decipher GWAS results. In order to facilitate this, more studies of tissue specific eQTLs must be performed and the data shared with the scientific community. Many researchers have been generous with their data, however the trend so far is to share mainly the p-values for the SNPs, and most of the datasets have not been collected in a centralized location. Providing the information equivalent to odds ratio in GWAS (the direction and magnitude of effect on gene expression) and submitting to a centralized database such as the GTEx (Genotype-Tissue Expression) eQTL Browser (http://www.ncbi.nlm.nih.gov/gtex/test/GTEX2/gtex.cgi) will help other investigators fully leverage the power of this data.
  19 in total

1.  Cis-acting variation in the expression of a high proportion of genes in human brain.

Authors:  Nicholas J Bray; Paul R Buckland; Michael J Owen; Michael C O'Donovan
Journal:  Hum Genet       Date:  2003-05-01       Impact factor: 4.132

2.  Gene expression in skin and lymphoblastoid cells: Refined statistical method reveals extensive overlap in cis-eQTL signals.

Authors:  Jun Ding; Johann E Gudjonsson; Liming Liang; Philip E Stuart; Yun Li; Wei Chen; Michael Weichenthal; Eva Ellinghaus; Andre Franke; William Cookson; Rajan P Nair; James T Elder; Gonçalo R Abecasis
Journal:  Am J Hum Genet       Date:  2010-12-10       Impact factor: 11.025

3.  A survey of the genetics of stomach, liver, and adipose gene expression from a morbidly obese cohort.

Authors:  Danielle M Greenawalt; Radu Dobrin; Eugene Chudin; Ida J Hatoum; Christine Suver; John Beaulaurier; Bin Zhang; Victor Castro; Jun Zhu; Solveig K Sieberts; Susanna Wang; Cliona Molony; Steven B Heymsfield; Daniel M Kemp; Marc L Reitman; Pek Yee Lum; Eric E Schadt; Lee M Kaplan
Journal:  Genome Res       Date:  2011-05-20       Impact factor: 9.043

4.  Genetics and beyond--the transcriptome of human monocytes and disease susceptibility.

Authors:  Tanja Zeller; Philipp Wild; Silke Szymczak; Maxime Rotival; Arne Schillert; Raphaele Castagne; Seraya Maouche; Marine Germain; Karl Lackner; Heidi Rossmann; Medea Eleftheriadis; Christoph R Sinning; Renate B Schnabel; Edith Lubos; Detlev Mennerich; Werner Rust; Claire Perret; Carole Proust; Viviane Nicaud; Joseph Loscalzo; Norbert Hübner; David Tregouet; Thomas Münzel; Andreas Ziegler; Laurence Tiret; Stefan Blankenberg; François Cambien
Journal:  PLoS One       Date:  2010-05-18       Impact factor: 3.240

Review 5.  The role of inflammation in insulitis and beta-cell loss in type 1 diabetes.

Authors:  Décio L Eizirik; Maikel L Colli; Fernanda Ortis
Journal:  Nat Rev Endocrinol       Date:  2009-04       Impact factor: 43.330

6.  Cisd2 deficiency drives premature aging and causes mitochondria-mediated defects in mice.

Authors:  Yi-Fan Chen; Cheng-Heng Kao; Ya-Ting Chen; Chih-Hao Wang; Chia-Yu Wu; Ching-Yen Tsai; Fu-Chin Liu; Chu-Wen Yang; Yau-Huei Wei; Ming-Ta Hsu; Shih-Feng Tsai; Ting-Fen Tsai
Journal:  Genes Dev       Date:  2009-05-15       Impact factor: 11.361

7.  Genetic analysis of innate immunity in Crohn's disease and ulcerative colitis identifies two susceptibility loci harboring CARD9 and IL18RAP.

Authors:  Alexandra Zhernakova; Eleanora M Festen; Lude Franke; Gosia Trynka; Cleo C van Diemen; Alienke J Monsuur; Marianna Bevova; Rian M Nijmeijer; Ruben van 't Slot; Roel Heijmans; H Marike Boezen; David A van Heel; Adriaan A van Bodegraven; Pieter C F Stokkers; Cisca Wijmenga; J Bart A Crusius; Rinse K Weersma
Journal:  Am J Hum Genet       Date:  2008-04-24       Impact factor: 11.025

8.  Schizophrenia susceptibility alleles are enriched for alleles that affect gene expression in adult human brain.

Authors:  A L Richards; L Jones; V Moskvina; G Kirov; P V Gejman; D F Levinson; A R Sanders; S Purcell; P M Visscher; N Craddock; M J Owen; P Holmans; M C O'Donovan
Journal:  Mol Psychiatry       Date:  2011-02-22       Impact factor: 15.992

9.  A trans-acting locus regulates an anti-viral expression network and type 1 diabetes risk.

Authors:  Matthias Heinig; Enrico Petretto; Chris Wallace; Leonardo Bottolo; Maxime Rotival; Han Lu; Yoyo Li; Rizwan Sarwar; Sarah R Langley; Anja Bauerfeind; Oliver Hummel; Young-Ae Lee; Svetlana Paskas; Carola Rintisch; Kathrin Saar; Jason Cooper; Rachel Buchan; Elizabeth E Gray; Jason G Cyster; Jeanette Erdmann; Christian Hengstenberg; Seraya Maouche; Willem H Ouwehand; Catherine M Rice; Nilesh J Samani; Heribert Schunkert; Alison H Goodall; Herbert Schulz; Helge G Roider; Martin Vingron; Stefan Blankenberg; Thomas Münzel; Tanja Zeller; Silke Szymczak; Andreas Ziegler; Laurence Tiret; Deborah J Smyth; Michal Pravenec; Timothy J Aitman; Francois Cambien; David Clayton; John A Todd; Norbert Hubner; Stuart A Cook
Journal:  Nature       Date:  2010-09-08       Impact factor: 49.962

10.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors: 
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

View more
  8 in total

Review 1.  Genome-wide significant loci: how important are they? Systems genetics to understand heritability of coronary artery disease and other common complex disorders.

Authors:  Johan L M Björkegren; Jason C Kovacic; Joel T Dudley; Eric E Schadt
Journal:  J Am Coll Cardiol       Date:  2015-03-03       Impact factor: 24.094

2.  Identification of candidate susceptibility genes for colorectal cancer through eQTL analysis.

Authors:  Adria Closa; David Cordero; Rebeca Sanz-Pamplona; Xavier Solé; Marta Crous-Bou; Laia Paré-Brunet; Antoni Berenguer; Elisabet Guino; Adriana Lopez-Doriga; Jordi Guardiola; Sebastiano Biondo; Ramon Salazar; Victor Moreno
Journal:  Carcinogenesis       Date:  2014-04-23       Impact factor: 4.944

3.  Consensus Genome-Wide Expression Quantitative Trait Loci and Their Relationship with Human Complex Trait Disease.

Authors:  Chen-Hsin Yu; Lipika R Pal; John Moult
Journal:  OMICS       Date:  2016-07

4.  Novel distal eQTL analysis demonstrates effect of population genetic architecture on detecting and interpreting associations.

Authors:  Matthew Weiser; Sayan Mukherjee; Terrence S Furey
Journal:  Genetics       Date:  2014-09-16       Impact factor: 4.562

5.  Dissecting the Effect of Genetic Variation on the Hepatic Expression of Drug Disposition Genes across the Collaborative Cross Mouse Strains.

Authors:  Aharon Nachshon; Hanifa J Abu-Toamih Atamni; Yael Steuerman; Roa'a Sheikh-Hamed; Alexandra Dorman; Richard Mott; Juliane C Dohm; Hans Lehrach; Marc Sultan; Ron Shamir; Sascha Sauer; Heinz Himmelbauer; Fuad A Iraqi; Irit Gat-Viks
Journal:  Front Genet       Date:  2016-10-05       Impact factor: 4.599

6.  An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci.

Authors:  Jin Hyun Ju; Sushila A Shenoy; Ronald G Crystal; Jason G Mezey
Journal:  PLoS Comput Biol       Date:  2017-05-15       Impact factor: 4.475

7.  Novel disease syndromes unveiled by integrative multiscale network analysis of diseases sharing molecular effectors and comorbidities.

Authors:  Haiquan Li; Jungwei Fan; Francesca Vitali; Joanne Berghout; Dillon Aberasturi; Jianrong Li; Liam Wilson; Wesley Chiu; Minsu Pumarejo; Jiali Han; Colleen Kenost; Pradeep C Koripella; Nima Pouladi; Dean Billheimer; Edward J Bedrick; Yves A Lussier
Journal:  BMC Med Genomics       Date:  2018-12-31       Impact factor: 3.063

8.  Impact of natural genetic variation on gene expression dynamics.

Authors:  Marit Ackermann; Weronika Sikora-Wohlfeld; Andreas Beyer
Journal:  PLoS Genet       Date:  2013-06-06       Impact factor: 5.917

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.