Literature DB >> 30188921

Cross-tissue eQTL enrichment of associations in schizophrenia.

Francesco Bettella1, Andrew A Brown2, Olav B Smeland1,3, Yunpeng Wang1, Aree Witoelar1, Alfonso A Buil Demur4, Wesley K Thompson3,4, Verena Zuber5, Anders M Dale3, Srdjan Djurovic1, Ole A Andreassen1.   

Abstract

The genome-wide association study of the Psychiatric Genomics Consortium identified over one hundred schizophrenia susceptibility loci. The number of non-coding variants discovered suggests that gene regulation could mediate the effect of these variants on disease. Expression quantitative trait loci (eQTLs) contribute to variation in levels of mRNA. Given the co-occurrence of schizophrenia and several traits not involving the central nervous system (CNS), we investigated the enrichment of schizophrenia associations among eQTLs for four non-CNS tissues: adipose tissue, epidermal tissue, lymphoblastoid cells and blood. Significant enrichment was seen in eQTLs of all tissues: adipose (β = 0.18, p = 8.8 × 10-06), epidermal (β = 0.12, p = 3.1 × 10-04), lymphoblastoid (β = 0.19, p = 6.2 × 10-08) and blood (β = 0.19, p = 6.4 × 10-06). For comparison, we looked for enrichment of association with traits of known relevance to one or more of these tissues (body mass index, height, rheumatoid arthritis, systolic blood pressure and type-II diabetes) and found that schizophrenia enrichment was of similar scale to that observed when studying diseases in the context of a more likely causal tissue. To further investigate tissue specificity, we looked for differential enrichment of eQTLs with relevant Roadmap affiliation (enhancers and promoters) and varying distance from the transcription start site. Neither factor significantly contributed to the enrichment, suggesting that this is equally distributed in tissue-specific and cross-tissue regulatory elements. Our analyses suggest that functional correlates of schizophrenia risk are prevalent in non-CNS tissues. This could be because of pleiotropy or the effectiveness of variants affecting expression in different contexts. This suggests the utility of large, single-tissue eQTL experiments to increase eQTL discovery power in the study of schizophrenia, in addition to smaller, multiple-tissue approaches. Our results conform to the notion that schizophrenia is a systemic disorder involving many tissues.

Entities:  

Mesh:

Year:  2018        PMID: 30188921      PMCID: PMC6126834          DOI: 10.1371/journal.pone.0202812

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Since the completion of the human genome project, genome-wide association studies (GWAS) have been useful instruments for improving our understanding of the genetics behind human traits. However, the promise of identifying the genetic basis of disease and determine its underlying mechanisms has only marginally been fulfilled. The schizophrenia GWAS carried out by the Psychiatric Genomics Consortium (PGC) identified many genomic loci plausibly involved in the etiology of the disorder [1] but much of the heritability, estimated from familial studies, is yet to be identified [2, 3]. A considerable portion of the variance in case-control status can be explained in many cohorts using available GWAS data [3, 4], but identifying mechanisms by which these variants act has proved difficult. The approach of GWAS is to independently test a vast genome-wide selection of representative genetic variants for associations with the trait of interest, disregarding any other genetic information. However, it is now known that there are many types of genetic variants with diverse propensities to associate with phenotypic traits [5, 6]. Including functional annotations in the analyses may thus aid in the quest of uncovering the genetic basis of the disease. As seen by the large numbers of non-coding variants identified by GWASes of complex traits, many causal variants are likely to act through the regulation of a gene rather than through changes in the coding sequence [7-10]. This has boosted the interest in studies of gene expression and has encouraged the development of transcriptomics and proteomics databases and tools [11-18]. Expression quantitative trait loci (eQTLs) are genomic loci involved in variation of mRNA levels. Many eQTLs have been associated with different human phenotypes [8] including schizophrenia [19]. The latter was associated in particular to genetic variations affecting gene expression in various brain tissues [17, 20, 21]. People suffering from schizophrenia are often also affected by disorders of other traits involving non-CNS tissues. Examples include high body mass index (BMI) and obesity, hypertension, cardiovascular disease, as well as disturbances of metabolism and immune system. This has led some to regard schizophrenia as a systemic disease [22]. Recent studies provided support to the hypothesis of common genetic mechanisms underpinning schizophrenia and other disorders [23, 24]. In line with such hypothesis, we looked for association enrichment of eQTLs in non-CNS tissues that may be important for schizophrenia: adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCLs) and whole blood [25]. In order to provide adequate terms of comparison, we also included in the study GWAS data related to other traits and diseases known to be polygenic or relevant to specific non-CNS tissues and thought to share at least part of their pathophysiology with schizophrenia: obesity (high BMI) and type-II diabetes involve metabolic processes known to be often affected in individuals with schizophrenia [26]; hypertension (high blood pressure) is one of the main risk factors for cardiovascular disease which is prevalent among individuals with schizophrenia [27] and may share genetic mechanisms with it [23]; rheumatoid arthritis is a disorder of the immune system which several lines of evidence [1, 24, 28–31] relate to schizophrenia; finally, height is one of the most polygenic traits and is also anticipated to inversely correlate to the propensity to suffer from SCZ [32]. The eQTLs are far from interchangeable but play different roles depending on what tissue they affect expression in and what functional regions of the genome they are situated in [25, 33, 34]. The Roadmap [35] epigenomics project was instrumental for detailing the functional annotation of large portions of the genome and expose the interplay of important regulation mechanisms therein. We investigated the differential enrichment between distal and proximal eQTLs and between eQTLs pertaining to different Roadmap functional annotation categories (see Materials and methods), which could provide additional evidence for tissue specificity.

Results

EQTL enrichment in schizophrenia

The Q-Q and fold enrichment plots suggest a ubiquitous enrichment of associations with schizophrenia (Fig 1) among non-CNS eQTLs that is comparable to the one observed among GTEx brain eQTLs [36] (S1 Fig). In fact, only CommonMind [17] brain eQTLs presented significantly higher schizophrenia association enrichment (S1 Fig, S1 Table) than non-CNS eQTLs, perhaps driven by GTEx-designated cerebellum and possibly hypothalamus eQTLs, although the low number of eQTLs available to the region-specific analyses provides only suggestive evidence in this respect (S2 Fig, S1 Table). The non-CNS eQTL enrichment is reflected in significant association chi-squared test statistics differences (S2 Table) between eQTLs and control variants. However, the deflection in the distribution of the association statistics may be due to factors other than the one we are interested in, i.e. eQTL status. The effects of other such factors were assessed by fitting linear models of the association chi-squared statistics including promoter and enhancer affiliation, and total LD as covariates.
Fig 1

Schizophrenia association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale.

Schizophrenia association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. The influence of LD on the linear models was pre-assessed by comparing the LD scores of all eQTL types to those of the respective control variants (S3 Table, S3 and S4 Figs). The eQTLs are ascribed in general more LD than their matched counterparts. Interestingly, this difference seems mostly due to proximal eQTLs (see Materials and methods) active in multiple tissues, since it disappears once these are excluded (data not shown). While potentially intriguing, as it may hint at some selective sweep process, these tests confirm the importance of controlling for LD when assessing enrichment [5]. If an excess of schizophrenia associations exists among eQTLs, the association chi-squared test statistics are expected to be higher for eQTLs than they are for control variants even while covarying with total LD and affiliation to genetic and regulatory categories. The schizophrenia coefficients for tissue-specific eQTLs and control variants, distal and proximal eQTLs, are reported in Table 1. All eQTL types have significantly higher schizophrenia association chi-squared (adipose , unadjusted p = 8.8E − 06; epidermal , unadjusted p = 3.1E − 04; lymphoblastoid , unadjusted p = 6.2E − 08; whole blood , unadjusted p = 6.4E − 06) than the matched control variants. The higher chi-squared are generally reflected in higher estimated proportions of non-null associations π1 and significantly discern eQTLs from control variants (Mann-Whitney tests). All test statistics are somewhat diminished by exclusion of CommonMind and GTEx brain eQTLs (S4 Table) but do not change in essence.
Table 1

Enrichment statistics and general linear model coefficients for squared schizophrenia association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

annotation β¯ β (low 95%)β (high 95%)pπ1pMW
AdiposeeQTL0.180.0990.258.75E-060.218.03E-12
control-0.083-0.12-0.0426.79E-050.07
EpidermaleQTL0.120.0550.190.000310.172.18E-06
control-0.079-0.13-0.0330.000770.076
LCLeQTL0.190.120.276.21E-080.145.20E-11
control-0.097-0.14-0.0554.23E-060.082
Whole bloodeQTL0.190.110.276.38E-060.140.0027
control-0.0021-0.0450.0410.920.098
Allprox0.240.180.301.75E-150.140.80
dist0.130.0830.185.38E-080.190.20
eQTL0.220.180.265.66E-310.177.41E-26

is the mean effect size over the general linear model replicas with functional genetic affiliation covariates; p is the corresponding unadjusted p-value (see methods for more details); π1 is the estimated proportion of non-null associations; pMW is the unadjusted Mann-Whitney test p-value for differences in association chi-squared between eQTL and respective matched control variants; prox stands for proximal eQTLs, dist for distal eQTLs.

is the mean effect size over the general linear model replicas with functional genetic affiliation covariates; p is the corresponding unadjusted p-value (see methods for more details); π1 is the estimated proportion of non-null associations; pMW is the unadjusted Mann-Whitney test p-value for differences in association chi-squared between eQTL and respective matched control variants; prox stands for proximal eQTLs, dist for distal eQTLs. To contextualize the observed enrichment, we intersected the TwinsUK eQTLs with LD windows at haplotype r2 ≥ 0.8 around the 128 LD-independent genomic loci of recognized significance for schizophrenia [1]. The result is reported in Table 2. The overlap corresponds to approximately 12% (15 out of 128) of the loci and is significantly larger than the fraction of independent genomic loci (∼1,000,000) represented by the TwinsUK eQTLs, which we estimate to be between ∼2.8% (OR = 0.24, p = 9.5 · 10−5) and ∼3.2% (OR = 0.27, p = 4.3 · 10−5), depending on whether one considers the designated cis-eQTLs (27,974) or an estimate of the number of independent Roadmap promoter loci for the four tissues of interest (∼32,000).
Table 2

Cross-tissue eQTLs in the loci with genome-wide significant association with schizophrenia.

ChrBase pair(GRCh37)GWASp-value [1]EQTLEnsembl geneHGNCTissue
184249841.17⋅10−9chr1:8464509ENSG00000142599REREA
1500314904.49⋅10−10chr1:149999764ENSG00000250661n.a.B
41706265521.47⋅10−9chr4:170646003ENSG00000109572CLCN3A
51401436644.85⋅10−8chr5:140107679ENSG00000146007ZMAT2L
chr5:140157427ENSG00000170445HARSA
chr5:140109155ENSG00000256453DND1E
101046123356.2⋅10−19chr10:104628873ENSG00000214435AS3MTBE
11575102942.24⋅10−9chr11:57424040ENSG00000156599ZDHHC5B
chr11:57585662ENSG00000213593TMX2L
12299172653.91⋅10−8chr12:29934586ENSG00000133687TMTC1L
574878142.13⋅10−8chr12:57490100ENSG00000166888STAT6E
chr12:123735937ENSG00000111325OGFOD2AE
chr12:123689386ENSG00000111328CDK2AP1L
1236651131.86⋅10−14chr12:123704844ENSG00000130921C12orf65A
chr12:123697007L
chr12:123689386ENSG00000235423n.a.L
15405672374.18⋅10−9chr15:40569884ENSG00000137841PLCB2B
914265608.3⋅10−14chr15:91426560ENSG00000140564FURINA
16299398774.55⋅10−11chr16:29924905ENSG00000149929HIRIP3A
586813931.87⋅10−8chr16:58681393ENSG00000103034NDRG4L
19500911994.69⋅10−8chr19:50100295ENSG00000126460PRRG2E
chr19:50103252ENSG00000126464PRR12A
22423408443.43⋅10−8chr22:42343091ENSG00000100197CYP2D6A

Tissue code: A = adipose, E = epidermal, L = LCL, B = whole blood

Tissue code: A = adipose, E = epidermal, L = LCL, B = whole blood The question of whether proximal and distal eQTLs have different incidence of associations with schizophrenia can be addressed with a test in all respects analogous to the one comparing eQTLs to control variants, where proximal and distal eQTLs act as the two categories. In spite of the larger estimated fraction of distal associations, the eQTL’s position relative to the TSS does not make a significant difference: both proximal and distal eQTLs have higher association chi-squared but in similar measure (Table 1).

Roadmap

The Roadmap functional affiliation effects were estimated upon fitting the linear models described above. These predictably suggest that the variants affiliated to any Roadmap functional elements are considerably more likely to associate to schizophrenia (S5 Table). However, when the linear model fit is restricted to eQTLs and control variants only, the functional affiliation is partially accounted for and the Roadmap functional affiliation association is visibly reduced (S6 Table). Curiously, the reduction is more prominent for enhancers even though these are expected to be farther removed from the coding elements than TwinsUK’s cis-eQTLs. Variations in schizophrenia association propensity between eQTLs acting in different regulatory elements were assessed using the same method. We fitted linear models of schizophrenia association chi-squared test statistics including eQTLxRoadmap “interaction” terms. If the eQTL functional category were of importance the effect size estimates for such terms would be expected to be significantly different from zero. We observed no effect size estimates significantly different from zero, suggesting that if a variant already has an eQTL designation, its placement within diverse genetic functional elements does not to play any important role (Fig 2 and S7 and S8 Tables). The exclusion of CommonMind and GTEx brain eQTls from the analyses did not noticeably alter the results (S9 and S10 Tables).
Fig 2

Schizophrenia association enrichment of eQTLs with different Roadmap functional annotations.

Chi-squared general linear model coefficients for eQTLs of different tissues (adipose, epidermal, lymphoblastoid cell lines (LCL), whole blood) and location (proximal, distal) affiliated to different Roadmap functional elements. “All” stands for all eQTLs (* p < 0.05, ** p < 0.001).

Schizophrenia association enrichment of eQTLs with different Roadmap functional annotations.

Chi-squared general linear model coefficients for eQTLs of different tissues (adipose, epidermal, lymphoblastoid cell lines (LCL), whole blood) and location (proximal, distal) affiliated to different Roadmap functional elements. “All” stands for all eQTLs (* p < 0.05, ** p < 0.001).

Comparison with other phenotypes

The results of the analyses of other GWASes are reported in the supplementary information (S5–S9 Figs, S11–S20 Tables). For comparison with schizophrenia, various measures of eQTL enrichment are reported in Fig 3 (see S10 Fig) for all GWASes. The trait names or acronyms are printed with sizes proportional to the GWAS chi-squared general linear model coefficients. Their coordinates were determined respectively by the estimated proportions of non-null associations (π1) and the Mann-Whitney −log10(p)-values for chi-squared differences among eQTLs and control variants. The height GWAS shows widespread eQTL enrichment in concert with a uniformly high estimated proportion of associations. Other GWASes like BMI, RA and SBP show various degrees of enrichment but not always at a significant level. Notably, the schizophrenia GWAS shows as high π1 as RA, if not higher, among lymphoblastoid eQTLs, and as high π1 as BMI and height among adipose tissue eQTLs. It must be noted that the various GWASes do not offer the same eQTL coverage. However, we found that the results did not appreciably vary upon restricting the analyses to the minimal common set of GWAS variants (data not shown).
Fig 3

Relationship between polygenicity and eQTL association enrichment across different GWASes.

Differences (Mann-Whitney test p-values) in association p-values between eQTLs and control variants of various types as functions of the estimated proportions of non-null associations. The GWAS names or acronyms are color-coded to represent different categories (azure = anthropometric [height]; red = cardiovascular, systolic blood pressure [SBP]; green = immune, rheumatoid arthritis [RA]; gold = metabolic, body mass index [BMI], type-II diabetes [T2D]; black = schizophrenia) and their sizes are proportional to the respective chi-squared linear model coefficients (* p < 0.05, ** p < 0.001).

Relationship between polygenicity and eQTL association enrichment across different GWASes.

Differences (Mann-Whitney test p-values) in association p-values between eQTLs and control variants of various types as functions of the estimated proportions of non-null associations. The GWAS names or acronyms are color-coded to represent different categories (azure = anthropometric [height]; red = cardiovascular, systolic blood pressure [SBP]; green = immune, rheumatoid arthritis [RA]; gold = metabolic, body mass index [BMI], type-II diabetes [T2D]; black = schizophrenia) and their sizes are proportional to the respective chi-squared linear model coefficients (* p < 0.05, ** p < 0.001).

Discussion

The main finding of the present study is that eQTLs for all investigated tissue types (adipose, epidermal, LCL, blood) are enriched of associations with schizophrenia, suggesting that a part of the risk for the illness involves genetic dysregulation in non-CNS tissue types. The non-CNS eQTL enrichment is comparable to that of GTEx [36] brain eQTLs and only slightly lower than that of CommonMind [17] brain eQTLs and is not driven by any of these. The observed enrichment is reflected in the intersection between the TwinsUK eQTLs and the genome-wide significant loci identified by the PGC. Instantly recognizable in Table 2 are the experimentally validated eQTLs for the CLCN3 and FURIN genes [17]. As the interaction analyses show, the eQTL prevalence of schizophrenia associations is statistically indifferent to their specific functional annotation. Given the gleaned [25] roles of promoters and enhancers in transcription, respectively context-free and context-specific, this indicates that schizophrenia risk variants are distributed among context-specific and generic transcription functional elements. Upon comparing different measures of enrichment across different GWASes, we find that the schizophrenia GWAS scores often as high as GWASes of traits for which the respective tissue is known to be of physiological relevance. In adipose tissue eQTLs for instance, the enrichment of schizophrenia associations equals that of BMI and height, while in LCL the estimated proportion of non-null schizophrenia associations is as high as that of non-null RA associations. Such observations may indicate an overlap between the pathophysiological substrate of schizophrenia and that of the other traits, which in turn may be a consequence of genetic pleiotropy, as suggested by earlier findings [23, 24]. A recent study has also posited an “omnigenic” model of genetic risk, and suggested that eQTLs that act in a large number of tissues may have as large a role to play as eQTLs that are tissue specific [37]. The enrichment of associations among non-CNS eQTLs does not go to the detriment of that of eQTLs of brain tissues, the natural substrates for schizophrenia’s pathophysiology. The latter predictably present an even higher incidence of associations with the disorder. The lack of relative enrichment among GTEx brain eQTLs is probably due to their lower mapping power, given the smaller sample they were obtained from. The GTEx consortium reported that brain eQTLs replicated most strongly in other brain tissues, demonstrating that a proportion of the eQTLs mapped are indeed specific to brain tissue, but the low sample size will mean that most tissue specific eQTLs will not be discovered, reducing power to observe enrichment. The enrichment observed for eQTLs of non-CNS tissues suggests that the genetic makeup of schizophrenia may also involve non-neural tissues. The non significant effect of the eQTL regulatory affiliation corroborates the involvement of specific non-CNS tissues in schizophrenia. This supports the long discussed [28] notion of schizophrenia as a systemic disease [22], and fosters the hypothesis of common genetic mechanisms, in line with proposed pleiotropy such as that with cardiovascular and immune diseases [23, 24]. The findings of the present study complement the ones of several recent studies identifying specific or generic eQTLs associations with schizophrenia in brain tissue [17, 20, 21, 38] in that they extend the search space to non-CNS tissue types. They suggest that schizophrenia risk loci affect a range of human tissues. As hinted by the slight overall effects of promoter and enhancer eQTL affiliation, it is possible for some non-CNS eQTLs to be proxy agents for CNS tissue eQTLs. However, the estimated pairwise overlap between eQTLs pertaining to different tissues is relatively low (∼10%). Although such estimates can be slightly biased by the small sample, they encourage to regard TwinsUK’s eQTLs as quite tissue-specific. This considerations added to the negligible effect of double agents, including those active in the brain, on the analyses (see Materials and methods), suggest that the ones detected here are largely non-CNS effects. Insofar as they indicate that genetic risk of schizophrenia associates with gene expression across different somatic tissues, our findings could also have clinical implications. At the very least they warrant further research to assess the relevance of altered gene expression for the high somatic co-morbidity associated with schizophrenia, which is a major health concern. In this respect, comprehensive studies of gene expression across the lifespan, such as the Human Brain Transcriptome [14] or the Brainspan [15, 16] studies carried out for brain tissues, would be of considerable interest for non-CNS tissues as well. A few limitations of the present study should be mentioned. First, some caution may have to be exercised when interpreting the eQTL association enrichment as, due to the intricate patterns of linkage disequilibrium, the overlap of associations with distinct traits could in part be coincidental [39, 40]. Second, the numbers presented here should be regarded in light of the scope of genetic cis-regulation. As highlighted by Buil et al. [25], cis-eQTLs are generally responsible for considerably less than a third and possibly as little as a sixth of the variation in gene expression. Variants affecting trans-regulation have previously been seen to be important in disease [41]. The importance of trans-regulation must therefore be kept in mind when weighing the implications of eQTL enrichment in GWAS. Some trans-regulation will be the downstream consequences of cis-eQTL effects along the genetic pathway, and we would expect to detect enrichment of such signals in our analysis; however, much trans-regulation is expected to be highly tissue-specific, and such enrichment will be detectable only when studying a small minority of tissues. Third, the higher the statistical power the easier it is to discern weak associations. An enrichment of associations is therefore more likely to be detected in the BMI GWAS (sample size N = 339, 224) than in the RA GWAS (sample size N = 103, 638). The widespread enrichment observed in the schizophrenia GWAS (sample size N = 150, 064) may be due in part to its relatively high statistical power. Its cross-tissue enrichment pattern, however, resembles the ones observed for BMI and height, two highly polygenic traits, and is quite different from the one observed for the T2D GWAS (sample size N = 149, 821), a roughly equally powered study. We therefore conclude that the different polygenic architectures must also be a relevant factor. In summary, we find that investigating functional correlates of schizophrenia risk loci in non-CNS tissues may be productive. The observed non-CNS enrichment of schizophrenia association could be due to pleiotropic effects or increased effectiveness of variants that work in many different environmental contexts. To further delineate the functional molecular mechanisms underlying schizophrenia, it could therefore be useful to complement multiple tissue eQTL experiments with larger single tissue eQTL experiments.

Materials and methods

Material

GWAS summary statistics

We used summary p-value statistics from schizophrenia (SCZ, N = 150, 064) [1] as well as body mass index (BMI, N = 339, 224) [42], height (N = 253, 288) [43], rheumatoid arthritis (RA, N = 103, 638) [44], systolic blood pressure (SBP, N = 203, 056) [45] and type-II diabetes (T2D, N = 149, 821) [46] GWASes. Two sets of about 2.5 and 9 million SNPs encompassing high-quality variants from all GWASes [5], were used for better comparisons in analyses involving non-CNS eQTLs only, and brain as well as non-CNS eQTLs, respectively.

TwinsUK eQTLs

The eQTL data were obtained from the TwinsUK sample [47] comprising 856 Caucasian female individuals. We regard these to be suitable due to the size of the TwinsUK sample and because the sample was collected from healthy living individuals and thus free of post-mortem biases. As previously reported [25], TwinsUK’s eQTLs are represented by the single nucleotide variants with strongest association within each locus, with cis-gene expression in adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCLs), and whole blood. The variants used in the current study (9166 adipose tissue, 8731 epidermal tissue, 9551 LCLs, and 5313 blood eQTLs; see [25] for a full description of the eQTL detection method) were designated as eQTLs at a 1% FDR level. Due to their location and the observed association, eQTLs have properties which can differ from other variants in the genome, including minor allele frequency, and regional conservation score and linkage disequilibrium structure, which may cause them to show enrichment in GWAS independently of their role in genetic regulation. To control for these factors when looking for enrichment, we used a set of 40,194 “control” eQTLs produced in [25], which were matched with the eQTLs on two proxies for these factors (minor allele frequency and distance to TSS). In addition to the criteria used in [25] we required that the control eQTLs did not present significant evidence of association with the expression levels of any genes (p-value ≥ 10−4). A census of the eQTLs included in the analyzed data sets is reported in S21 Table. As described by Buil et al. [25], gene regulation is not always tissue-specific and some loci act as eQTLs in more than one type of tissue. The pairwise overlap counts are shown in S11 Table. All analyses described below were repeated after exclusion of double agents and found to be robust to such exclusion.

Brain eQTLs

GTEx version 7 [36] and CommonMind release 1.2 [17] eQTL data were downloaded from the projects’ respective web-services (www.gtexportal.org, www.synapse.org). A wealth of region-specific (amygdala, anterior cingulate cortex, caudate basal ganglia, cerebellar hemisphere, cerebellum, cortex, frontal cortex, hippocampus, hypothalamus, nucleus accumbens basal ganglia, putamen basal ganglia, spinal cord cervical, substantia nigra) brain eQTLs at FDR < 0.01 were readily available in the GTEx data, allowing in detail as well as whole brain analyses. GTEx brain eQTLs were defined as eQTLs for any of the GTEx brain regions. The CommonMind eQTLs were from a larger set of dorsolateral prefrontal cortex tissue samples. The CommonMind brain eQTLs resulted from the intersection of the two eQTL sets designated respectively in the analysis including and the one not including surrogate variables. GTEx and CommonMind eQTLs at the same FDR level were subsequently intersected to obtain a consensus set of brain eQTLs.

Roadmap annotation

Even if all eQTLs considered here are cis-eQTLs, some differences may still exist between eQTLs located at different positions with respect to promoters. We therefore subdivided eQTLs into proximal and distal eQTLs, depending on their distance from the TSS. We set the threshold at 25 kbp to equally populate the two categories. In order to further detail the eventual enrichment patterns, eQTLs were further assigned one of four tissue-specific Roadmap functional annotations: strong enhancer, weak enhancer, active promoter, and weak promoter eQTLs. The functional annotations were extracted from adipose nuclei tracks (E063) for adipose tissue eQTLs, from normal human epidermal keratinocytes tracks (E127) for epidermal tissue eQTLs, from lymphoblastoid cell lines tracks (E116) for lymphoblastoid cell lines eQTLs, and from primary mononuclear cells tracks (E062) for whole blood eQTLs. The distribution of eQTLs across all functional elements is reported in S22 Table.

Statistics

Random linkage disequilibrium-based pruning

The TwinsUK eQTLs are not guaranteed to be independent, because different genes can have different eQTLs in linkage disequilibrium with one another. A possible way to account for the known intricate correlation structure of the genome data is to include linkage disequilibrium (LD) as an integral part of the analyses [48]. Another way is to reduce the bias caused by LD by restricting the analyses to near-independent variants. When pursuing the second avenue, the problem of choosing LD-independent representatives arises. To reduce sampling bias, we generated ten sets of near-independent variants (LD r2 < 0.2 within 1Mbase) picked at random [49] with no replacement to repeatedly perform our analyses on. In order for the genomic correction to be representative of these sets of variants, the genome-wide intergenic correction factor [5] was computed across such sets (median of medians) and applied to all GWAS summary statistics before the ensuing analyses. All results reported are robust to the choice of near-independent variants sets.

Conditional fold enrichment and quantile-quantile plots

It is common practice to use quantile-quantile (Q-Q) plots to compare two distributions; Q-Q plots are used in GWAS to compare the expected −log10(p)-value distribution to the one actually observed and thus assess the non-spurious association content of the latter. Further leftward deflections of the observed distribution in a subset of interest reflect a higher incidence, in other words, enrichment, of low p-values (high −log10(p)-values) in such subset. Fold enrichment plots are also used to visualize enrichment as a function of the association p-value [50]. They were obtained here by comparing the empirical cumulative distributions of −log10(p)-values for SNP association among eQTLs and control variants. Each eQTL subset fold enrichment was calculated as the cumulative distribution functions (CDFs) ratio between the −log10(p) cumulative distribution for and the −log10(p) cumulative distribution for the respective control variants. Left-cumulative association −log10(p)-value bin centers are reported on the x-axis, fold enrichment on the y-axis [51].

Chi-squared general linear models and enrichment estimates

The fold enrichment plots are effective in conveying a visual impression of enrichment. However, they do not provide any quantitative measure of enrichment and are not suited to control for potential mediating effects. General linear models allow to account for any such effects and, assuming no strong interdependence among predictor variables, to assess their significance. To get quantities with better statistical properties, we converted the association p-values to z-scores and squared these (z2 = F−1(p/2)2, where F is the standard normal cumulative distribution function) to recover the original association chi-squared test statistic. We then fitted a chi-squared general linear model [5] including eQTL and Roadmap categories [35], and total LD score [48] as covariates. Interactions between eQTL and Roadmap or genic categories were subsequently included to detail the effect of the eQTLs’ affiliation to specific functional elements. The reported effect sizes and p-values are meta-analysis effect sizes and p-values across the ten analyses relative to the single pruned sets of eQTL and control variants. In order to provide a full characterization of the enrichment, we also compared the association chi-squared among different eQTL types and corresponding matched control variants (Mann-Whitney tests) and estimated the proportion π1 of non-null associations in the two groups using R’s qvalue package [52, 53]. The Mann-Whitney tests and the π1 estimations were performed on the full set of variants.

Enrichment plots for CommonMind, GTEx, GTEx/CommonMind consensus brain and non-CNS eQTLs.

The top panels contain Q-Q and fold enrichment plots for the CommonMind, GTEx brain and TwinsUK non-CNS eQTLs. The bottom panels contain relative fold enrichment plots for CommonMind and GTEx and for GTEx/CommonMind consensus brain eQTLs compared to non-CNS eQTL variants. (PDF) Click here for additional data file.

Enrichment plots for GTEx/CommonMind consensus region-specific brain eQTLs.

The fold enrichment is relative to non-CNS eQTL variants. (PDF) Click here for additional data file.

Histograms of total LD.

Total LD-tagging power of eQTLs and control SNPs, and proximal (prox) and distal (dist) eQTLs. (PDF) Click here for additional data file.

Total LD for different eQTL types and control variants in the study.

Box plots are overlaid on kernel density plots [54]. (PDF) Click here for additional data file.

BMI association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. (PDF) Click here for additional data file.

Height association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. (PDF) Click here for additional data file.

Rheumatoid arthritis association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. (PDF) Click here for additional data file.

Systolic blood pressure association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. (PDF) Click here for additional data file.

Type-II diabetes association enrichment in eQTLs.

Q-Q and fold enrichment plots for adipose, epidermal, LCL and whole blood eQTLs. The baseline is determined by respectively matched control SNP sets. The fold enrichment is displayed in logarithmic scale. (PDF) Click here for additional data file.

Differences (Mann-Whitney test p-values) in association p-values between proximal and distal eQTLs as functions of the estimated proportions of non-null associations.

The GWAS names or acronyms are color-coded to represent different categories (azure = anthropometric, [height]; red = cardiovascular, systolic blood pressure [SBP]; green = immune, rheumatoid arthritis [RA]; gold = metabolic, body mass index [BMI], type-II diabetes [T2D]; black = schizophrenia) and their sizes are proportional to the respective ANCOVA coefficients (* p < 0.05, ** p < 0.001). (PDF) Click here for additional data file.

eQTL distribution across tissues.

As established in [25], eQTLs have some tendency to act in more than one tissue. (PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for GTEx brain and GTEx/CommonMind consensus brain eQTLs compared to non-CNS eQTLs.

The test statistics refer to a general linear model of all brain and non-CNS variants in the ∼9 million variant template. (PDF) Click here for additional data file.

Schizophrenia association chi-squared differences between various eQTL types and matching control variants.

(PDF) Click here for additional data file.

Total LD differences between various eQTL types and matching control variants.

(PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared schizophrenia association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

All eQTLs designated by CommonMind or GTEx as brain eQTLs were excluded from these analyses. (PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for the four Roadmap functional affiliations.

(PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for the four Roadmap functional affiliations restricted to eQTLs and control variants.

(PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for tissue-specific eQTLs with different functional affiliations.

The test statistics refer to the respective interaction terms. The interaction with TotLD represents the enrichment ascribable to the eQTLs irrespective of their LD-tagging power. Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for all, proximal or distal eQTLs with different functional affiliations.

The test statistics refer to the respective interaction terms. The interaction with TotLD represents the enrichment ascribable to the eQTLs irrespective of their LD-tagging power. Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for tissue-specific eQTLs with different functional affiliations upon exclusion of CommonMind and GTEx brain eQTLs.

The test statistics refer to the respective interaction terms. The interaction with TotLD represents the enrichment ascribable to the eQTLs irrespective of their LD-tagging power. Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Schizophrenia association chi-squared general linear model coefficients for all, proximal or distal eQTLs with different functional affiliations upon exclusion of CommonMind and GTEx brain eQTLs.

The test statistics refer to the respective interaction terms. The interaction with TotLD represents the enrichment ascribable to the eQTLs irrespective of their LD-tagging power. Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared BMI association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

(PDF) Click here for additional data file.

BMI association chi-squared general linear model coefficients for all eQTL types with the four Roadmap functional affiliations.

Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared Height association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

(PDF) Click here for additional data file.

Height association chi-squared general linear model coefficients for all eQTL types with the four Roadmap functional affiliations.

Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared RA association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

(PDF) Click here for additional data file.

Rheumatoid arthritis association chi-squared general linear model coefficients for all eQTL types with the four Roadmap functional affiliations.

Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared SBP association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

(PDF) Click here for additional data file.

Systolic blood pressure association chi-squared general linear model coefficients for all eQTL types with the four Roadmap functional affiliations.

Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

Enrichment statistics and general linear model coefficients for squared T2D association z-scores differences between adipose tissue, epidermal tissue, lymphoblastoid cell lines (LCL) and whole blood eQTLs, and matching control variants.

(PDF) Click here for additional data file.

Type-II diabetes association chi-squared general linear model coefficients for all eQTL types with the four Roadmap functional affiliations.

Enhancer and Promoter affiliations were assigned by Roadmap in the corresponding tissues. (PDF) Click here for additional data file.

eQTLs and matched control variants census in the data set used.

The eQTLs and control variants from [25] were projected onto templates of ∼2.5 million variants (∼9 million variants for analyses involving brain eQTLs) with known pairwise LD. (PDF) Click here for additional data file.

eQTL VS ENCODE demographics.

The numbers refer to the ∼9 million template. (PDF) Click here for additional data file.
  49 in total

1.  LD Score regression distinguishes confounding from polygenicity in genome-wide association studies.

Authors:  Brendan K Bulik-Sullivan; Po-Ru Loh; Hilary K Finucane; Stephan Ripke; Jian Yang; Nick Patterson; Mark J Daly; Alkes L Price; Benjamin M Neale
Journal:  Nat Genet       Date:  2015-02-02       Impact factor: 38.330

2.  Proteomics. Tissue-based map of the human proteome.

Authors:  Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén
Journal:  Science       Date:  2015-01-23       Impact factor: 47.728

Review 3.  The role of regulatory variation in complex traits and disease.

Authors:  Frank W Albert; Leonid Kruglyak
Journal:  Nat Rev Genet       Date:  2015-02-24       Impact factor: 53.242

4.  The NIH Roadmap Epigenomics Mapping Consortium.

Authors:  Bradley E Bernstein; John A Stamatoyannopoulos; Joseph F Costello; Bing Ren; Aleksandar Milosavljevic; Alexander Meissner; Manolis Kellis; Marco A Marra; Arthur L Beaudet; Joseph R Ecker; Peggy J Farnham; Martin Hirst; Eric S Lander; Tarjei S Mikkelsen; James A Thomson
Journal:  Nat Biotechnol       Date:  2010-10       Impact factor: 54.908

5.  Functional SNPs are enriched for schizophrenia association signals.

Authors:  S-A Bacanu; J Chen; J Sun; K Richardson; C-Q Lai; Z Zhao; M C O'Donovan; K S Kendler; X Chen
Journal:  Mol Psychiatry       Date:  2013-04-02       Impact factor: 15.992

6.  Spatio-temporal transcriptome of the human brain.

Authors:  Hyo Jung Kang; Yuka Imamura Kawasawa; Feng Cheng; Ying Zhu; Xuming Xu; Mingfeng Li; André M M Sousa; Mihovil Pletikos; Kyle A Meyer; Goran Sedmak; Tobias Guennel; Yurae Shin; Matthew B Johnson; Zeljka Krsnik; Simone Mayer; Sofia Fertuzinhos; Sheila Umlauf; Steven N Lisgo; Alexander Vortmeyer; Daniel R Weinberger; Shrikant Mane; Thomas M Hyde; Anita Huttner; Mark Reimers; Joel E Kleinman; Nenad Sestan
Journal:  Nature       Date:  2011-10-26       Impact factor: 49.962

7.  MicroRNAs enrichment in GWAS of complex human phenotypes.

Authors:  Luiz F Goulart; Francesco Bettella; Ida E Sønderby; Andrew J Schork; Wesley K Thompson; Morten Mattingsdal; Vidar M Steen; Verena Zuber; Yunpeng Wang; Anders M Dale; Ole A Andreassen; Srdjan Djurovic
Journal:  BMC Genomics       Date:  2015-04-16       Impact factor: 3.969

8.  Genetic variability in the regulation of gene expression in ten regions of the human brain.

Authors:  Adaikalavan Ramasamy; Daniah Trabzuni; Sebastian Guelfi; Vibin Varghese; Colin Smith; Robert Walker; Tisham De; Lachlan Coin; Rohan de Silva; Mark R Cookson; Andrew B Singleton; John Hardy; Mina Ryten; Michael E Weale
Journal:  Nat Neurosci       Date:  2014-08-31       Impact factor: 24.884

9.  Biological insights from 108 schizophrenia-associated genetic loci.

Authors: 
Journal:  Nature       Date:  2014-07-22       Impact factor: 49.962

10.  Identification of expression quantitative trait loci associated with schizophrenia and affective disorders in normal brain tissue.

Authors:  Oneil G Bhalala; Artika P Nath; Michael Inouye; Christopher R Sibley
Journal:  PLoS Genet       Date:  2018-08-24       Impact factor: 5.917

View more
  2 in total

1.  SCFD1 expression quantitative trait loci in amyotrophic lateral sclerosis are differentially expressed.

Authors:  Alfredo Iacoangeli; Isabella Fogh; Sashika Selvackadunco; Simon D Topp; Aleksey Shatunov; Wouter van Rheenen; Ahmad Al-Khleifat; Sarah Opie-Martin; Antonia Ratti; Andrea Calvo; Philip Van Damme; Wim Robberecht; Adriano Chio; Richard J Dobson; Orla Hardiman; Christopher E Shaw; Leonard H van den Berg; Peter M Andersen; Bradley N Smith; Vincenzo Silani; Jan H Veldink; Gerome Breen; Claire Troakes; Ammar Al-Chalabi; Ashley R Jones
Journal:  Brain Commun       Date:  2021-10-07

2.  A Multi-omic Integrative Scheme Characterizes Tissues of Action at Loci Associated with Type 2 Diabetes.

Authors:  Jason M Torres; Moustafa Abdalla; Anthony Payne; Juan Fernandez-Tajes; Matthias Thurner; Vibe Nylander; Anna L Gloyn; Anubha Mahajan; Mark I McCarthy
Journal:  Am J Hum Genet       Date:  2020-11-12       Impact factor: 11.025

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.