Literature DB >> 20369022

Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations.

Alexandra C Nica1, Stephen B Montgomery, Antigone S Dimas, Barbara E Stranger, Claude Beazley, Inês Barroso, Emmanouil T Dermitzakis.   

Abstract

The recent success of genome-wide association studies (GWAS) is now followed by the challenge to determine how the reported susceptibility variants mediate complex traits and diseases. Expression quantitative trait loci (eQTLs) have been implicated in disease associations through overlaps between eQTLs and GWAS signals. However, the abundance of eQTLs and the strong correlation structure (LD) in the genome make it likely that some of these overlaps are coincidental and not driven by the same functional variants. In the present study, we propose an empirical methodology, which we call Regulatory Trait Concordance (RTC) that accounts for local LD structure and integrates eQTLs and GWAS results in order to reveal the subset of association signals that are due to cis eQTLs. We simulate genomic regions of various LD patterns with both a single or two causal variants and show that our score outperforms SNP correlation metrics, be they statistical (r(2)) or historical (D'). Following the observation of a significant abundance of regulatory signals among currently published GWAS loci, we apply our method with the goal to prioritize relevant genes for each of the respective complex traits. We detect several potential disease-causing regulatory effects, with a strong enrichment for immunity-related conditions, consistent with the nature of the cell line tested (LCLs). Furthermore, we present an extension of the method in trans, where interrogating the whole genome for downstream effects of the disease variant can be informative regarding its unknown primary biological effect. We conclude that integrating cellular phenotype associations with organismal complex traits will facilitate the biological interpretation of the genetic effects on these traits.

Entities:  

Mesh:

Year:  2010        PMID: 20369022      PMCID: PMC2848550          DOI: 10.1371/journal.pgen.1000895

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


Introduction

The biological interpretation of genome-wide association study (GWAS) signals [1]–[5] is very challenging since most candidate loci fall either in gene deserts or in regions with many equally plausible causative genes. Following the concurrent progress in understanding the genetic basis of regulatory variation [6]–[9], differential gene expression has been proposed as a promising intermediate layer of information [10] to aid this interpretation [11]. Most commonly, interrogating the GWAS SNPs themselves for significant associations with gene expression [12]–[13] has been employed to explain some of the GWAS results. However, the ubiquity of regulatory variation throughout the human genome [6],[14] makes coincidental overlaps of eQTLs and complex trait loci very likely. This likelihood is a direct consequence of the correlation structure in the genome (linkage disequilibrium - LD), which makes functionally unrelated variants statistically correlated. As sample sizes increase, allowing the discovery of larger numbers of eQTLs of smaller effect size and as the expression experiments will be performed in a larger variety of tissues, we can envisage that almost every gene will have an associated eQTL under a certain condition. Consequently, the probability that any of these will map to a genomic region where a GWAS SNP also resides is very high. Therefore, it is important to emphasize that while it is very tempting to infer potential causal mechanisms based on such overlaps, this would be a naïve inference in the absence of additional supporting evidence for causality. In the long run, this will not only be an issue for gene expression, but also for any other cellular phenotype. Association studies for intermediate phenotypes with possible relevance to complex traits are underway and their results will overlap some of the GWAS signals. The biological meaning of these overlaps will again need to be evaluated in the context of the genome's correlation structure. It is not evident though how to model each genomic region with overlapping association signals in the absence of information about the history of the region. Accounting for the historical parameters of a region under the coalescent, while desirable, is computationally and practically not feasible since the human population history is too complex to properly model and small deviations or slightly incorrect assumptions could create false signals or reduce power. In order to distinguish such accidental colocalizations [15]–[16] from true sharing of causal variants, we propose here an empirical methodology instead. This directly combines eQTL and GWAS data while accounting for the LD of the region harbouring the GWAS SNP. We demonstrate the value of the approach by predicting the regulatory impact of several GWAS variants in cis and trans and we also show that the correlation strength (r2, D') between the GWAS SNP and the eQTL is not a sufficient predictor of regulatory mediated disease effects.

Results

Current GWAS signals are enriched for regulatory variants

To identify likely causal effects (not variants since we do not have full sequencing data) associated with complex traits and diseases we took advantage of published association data catalogued in the NHGRI [17] database and gene expression data generated in LCLs derived from HapMap 3 individuals (see Methods). In this study, we limited the expression analysis to the 109 CEU individuals, as they are the closest in ancestry to the majority of individuals in published GWAS studies. We used the NHGRI database (accessed 02.03.09) to extract 976 GWAS SNPs with minor allele frequency (MAF) >5% that were also genotyped in the HapMap 3 CEU, thus allowing to test the exact GWAS SNPs for associations with differential gene expression in LCLs. In total we examined 17673 genes. In order to discover eQTLs, we used Spearman Rank Correlation (SRC). This method [14] captures the vast majority of associations discovered with standard linear regression (LR) models, with the additional advantage that it's not affected by outliers and hence has more power and allows direct comparison of nominal P-values. We looked for both proximal (cis) and distal (trans) effects as follows: variants within 1Mb on either side of the transcription start site (TSS) of a gene are considered to be acting in cis, while those at least 5 Mb downstream or upstream of the TSS or on a different chromosome are considered to be acting in trans. In order to assess the overall impact of the currently known GWAS SNPs on expression, we contrasted their cis and trans effects to those of a random set of SNPs, representing the null. In a QQ plot (Figure 1), we compare the distributions of the best cis and trans association p-values per SNP for the 976 GWAS SNPs (observed) to 1000 sets of most significant p-values of 976 random SNPs each (expected). The 1000 random sets of 976 SNPs were sampled to have identical MAF distribution to the GWAS SNPs. In cis, we observe a much stronger regulatory signal in the GWAS data compared to random (Figure 1). The significant difference between the two becomes apparent above a −log10(P-value) = 4. In trans, we also detect a more significant regulatory signal for GWAS SNPs compared to random, however not as strong as in cis. This is to be expected given that the much greater statistical space we're exploring in trans limits the power to detect such effects. Nevertheless, despite their confinement to one tissue type - LCLs, these comparisons support the overall explanatory potential of regulatory variation for the biological effects of GWAS variants. As expected given the tissue nature, the phenotypes responsible for this enrichment are immunity related (Figure 2).
Figure 1

Excess of regulatory variants among GWAS signals.

QQ plot depicting the excess of significant regulatory signal in GWAS data (976 NHGRI SNPs). For both the cis and trans analyses, the −log10(P-value) of the best associations per SNP are plotted. In red, the distribution of these values for GWAS SNPs is compared to that of the median of 1,000 sets of 976 random SNPs with same MAF distribution. In black, the estimated upper limit of the 95% confidence interval is plotted.

Figure 2

Cis regulatory enrichment stratified by immunity relatedness.

The −log10(P-value) of the best associations per GWAS SNPs and a set of random SNPs are plotted. As expected given the tissue (LCLs), immunity related phenotypes are mainly responsible for the enrichment.

Excess of regulatory variants among GWAS signals.

QQ plot depicting the excess of significant regulatory signal in GWAS data (976 NHGRI SNPs). For both the cis and trans analyses, the −log10(P-value) of the best associations per SNP are plotted. In red, the distribution of these values for GWAS SNPs is compared to that of the median of 1,000 sets of 976 random SNPs with same MAF distribution. In black, the estimated upper limit of the 95% confidence interval is plotted.

Cis regulatory enrichment stratified by immunity relatedness.

The −log10(P-value) of the best associations per GWAS SNPs and a set of random SNPs are plotted. As expected given the tissue (LCLs), immunity related phenotypes are mainly responsible for the enrichment.

RTC score to distinguish between causal effects and coincidental overlaps

To identify the subset of causal effects from the regulatory enrichment observed, we focused only on the genomic regions harbouring either cis or trans eQTLs. We split the genome into recombination hotspot intervals based on genome-wide estimates of hotspot coordinates from McVean et.al. [18] Limiting the search space for causal effects to these intervals is a reasonable conventional approach, as few or no recombination events are expected between the reported associated SNPs and the functional variants they are tagging. Given the abundance of cis eQTLs in the genome, mere interval overlap in not sufficient to claim that a colocalized cis eQTL and a GWAS SNP are tagging the same functional variant. However, if the GWAS SNP and the eQTL do tag the same causal SNP, we expect that removing the genetic effect of the GWAS SNP will have a marked consequence on the eQTL association. Starting from this hypothesis, we developed an empirical method to uncover regulatory mediated associations with complex traits. For all genes with a significant cis eQTL (0.05 permutation threshold as defined in Stranger et.al. 2007, see Methods) in a given interval, we create corrected phenotypes from the residuals of the standard LR of the GWAS SNP against normalized expression values of the gene for which we have an eQTL. The residuals capture the remaining unexplained expression variance after the removal of the GWAS SNP effect. We redo the SRC analysis with the pseudo phenotype and retain the adjusted association P-value. Depending on the internal LD structure of the hotspot interval, the correlation between the GWAS SNP and the eQTL will vary, hence so will the P-values after and before correction. One way to assess the relevance of the GWAS SNP to the eQTL is to compare its correction impact to that of all other SNPs in the interval. For this purpose, we define a Regulatory Trait Concordance (RTC) Score for each gene-GWAS SNP combination as follows, taking into account the ranking of the correction with respect to all SNPs in the interval (RankGWAS SNP) and the total number of tested SNPs (NSNPs). The rank denotes the number of SNPs which when used to correct the expression data, have a higher impact on the eQTL (smaller adjusted P-value) than the GWAS SNP (i.e. Rank = 0 if the GWAS SNP is the same as the eQTL SNP, Rank = 1 if of all the SNPs in the interval, the GWAS SNP has the largest impact on the eQTL). Given this, the RTC Score will always be in the range (0,1], with values close to 1 indicating that the GWAS effect is the same as the eQTL effect.

RTC properties under different simulation scenarios

We investigated the properties and robustness of the RTC score under the null hypothesis (H0: eQTL and GWAS are tagging two different causal SNPs) and the alternative hypothesis (H1: same causal SNP). For this purpose, we have simulated causal SNPs (cSNP), eQTLs and dSNPs (see Methods) varying the LD levels between them as well as the LD pattern of the hotspot interval where they reside. We have then masked the cSNPs and calculated the RTC score under these different LD scenarios for both hypotheses. The RTC score is uniformly distributed under the null, when the simulated causal eQTL SNP (c-eQTL) and the causal disease SNP (c-dSNP) are different (Figure 3, left panel).
Figure 3

RTC score distribution.

The RTC score is uniformly distributed for simulated eQTLs and dSNPs tagging two different causal variants in the same interval (left panel). The RTC Score is right-skewed for simulated eQTLs and dSNPs tagging the same functional variant (middle panel). The RTC score is sensitive to associations tagging a common functional variant in non-simulated data, when the GWAS trait is gene expression (GenCord LCL samples – right panel).

RTC score distribution.

The RTC score is uniformly distributed for simulated eQTLs and dSNPs tagging two different causal variants in the same interval (left panel). The RTC Score is right-skewed for simulated eQTLs and dSNPs tagging the same functional variant (middle panel). The RTC score is sensitive to associations tagging a common functional variant in non-simulated data, when the GWAS trait is gene expression (GenCord LCL samples – right panel). Under the H1 on the other hand, the RTC score is right skewed, with a clear enrichment for values close to 1 recovering the single causal SNP effect (Figure 3, middle panel). The simulations show that the complexity and variability of the LD structure in the genome impede the simple use of correlation metrics to infer shared causal effects. The statistical correlation (r2) between the eQTL and the dSNP is not on its own sufficient to predict whether they tag the same cSNP (Figure 4). The RTC outperforms r2 as it is able to recover causal effects even for low correlated pairs. The historical correlation metric between eQTLs and dSNPs (D') is also not fully predictive of high RTC scores (Figure 5). We observe from the H0 simulation results that D' is not correlated with RTC, meaning that when the eQTL and dSNP tag different functional variants, the RTC score is not high just because D' is high. In addition, while high RTC scoring cases cluster much tighter around high D' values under the H1 compared to r2 previously, a high D' is not sufficient to predict causal effects. That is because it would be impossible to distinguish causal from coincidental effects given a perfect historical correlation scenario.
Figure 4

Properties of the RTC score when varying r2.

Simulation results depicting the relationship between the RTC score and the r2 (eQTL, dSNP) when they tag different causal SNPs (H0: left panel) versus one causal SNP (H1: right panel). The RTC increases as expected with increased r2 between the eQTL and the dSNP, but when tagging the same functional variant, various lower pairwise r2 combinations can determine a high RTC. This makes r2 on its own insufficient to detect shared causal effects.

Figure 5

RTC score properties when varying D'.

Simulation results depicting the relationship between the RTC score and the D' (eQTL, dSNP) when they tag different causal SNPs (H0: left panel) versus one causal SNP (H1: right panel). D' is not correlated with RTC, therefore it will not determine high scores on its own in the absence of a common functional variant. Under the H1, the majority of high RTC scoring pairs have high D', but in the case of a perfect historical correlation scenario, it's impossible to distinguish causal from coincidental effects with D' only.

Properties of the RTC score when varying r2.

Simulation results depicting the relationship between the RTC score and the r2 (eQTL, dSNP) when they tag different causal SNPs (H0: left panel) versus one causal SNP (H1: right panel). The RTC increases as expected with increased r2 between the eQTL and the dSNP, but when tagging the same functional variant, various lower pairwise r2 combinations can determine a high RTC. This makes r2 on its own insufficient to detect shared causal effects.

RTC score properties when varying D'.

Simulation results depicting the relationship between the RTC score and the D' (eQTL, dSNP) when they tag different causal SNPs (H0: left panel) versus one causal SNP (H1: right panel). D' is not correlated with RTC, therefore it will not determine high scores on its own in the absence of a common functional variant. Under the H1, the majority of high RTC scoring pairs have high D', but in the case of a perfect historical correlation scenario, it's impossible to distinguish causal from coincidental effects with D' only. Finally, we investigated the effect of the overall LD pattern in a region of interest on the RTC. For this purpose, we calculated the median r2 of each hotspot interval and checked its relationship to the RTC score under the null and alternative hypothesis. It is expected that RTC will perform better in intervals with overall low LD, where the correlation between the eQTL and other non-disease SNPs will decay much faster, making the correction for the dSNP stand out. However, we confirm that the LD of the region does not determine high scores by itself. Intervals of low LD where different c-eQTLs and c-dSNPs reside have a uniform distribution of RTC scores (Figure 6, left panel). As expected, we do observe from the H1 simulations that we have most power in intervals with low median r2 (Figure 6, right panel).
Figure 6

RTC score properties when varying the median r2 of the hotspot interval.

Simulation results depicting the relationship between the RTC score and the local LD structure (median r2) under the null (different causal SNPs - left panel) and alternative hypothesis (same causal SNP - right panel). Under H0, the RTC score is evenly distributed, therefore intervals with overall low LD will not determine high RTC scores. Under H1, the RTC performs best in intervals with overall low LD, where the correlation between the eQTL and other non-disease SNPs decays much faster, making the dSNP correction stand out.

RTC score properties when varying the median r2 of the hotspot interval.

Simulation results depicting the relationship between the RTC score and the local LD structure (median r2) under the null (different causal SNPs - left panel) and alternative hypothesis (same causal SNP - right panel). Under H0, the RTC score is evenly distributed, therefore intervals with overall low LD will not determine high RTC scores. Under H1, the RTC performs best in intervals with overall low LD, where the correlation between the eQTL and other non-disease SNPs decays much faster, making the dSNP correction stand out.

RTC scores when both traits are gene expression

As a positive control, we tested the method first on intervals harbouring already identified regulatory associations. We used published cis eQTLs (10−3 permutation threshold) discovered in the same tissue as the HapMap CEU eQTLs (LCLs) but derived from an independent set of samples: 75 individuals of Western European origin from the GenCord project [19]. In this experiment, we considered the GenCord eQTLs as the equivalent of GWAS SNPs and we limited our analysis to intervals with cis eQTLs in both datasets. Furthermore, we conditioned the associated genes for the same interval to be identical in the two expression datasets, expecting thus a common functional variant. As a result of this filtering, we tested SNPs in 157 hotspot intervals, associated with differential expression levels of 154 genes. As expected from the H1 simulations, the RTC Score distribution after correcting for the GenCord eQTLs is right-skewed (Figure 3, right panel), suggesting that the scoring method is sensitive to associations tagging the same functional variant. We detect 33 SNP-probe pairs with an RTC Score of 1 out of the total 185 tested pairs. Given the marked difference in genotyping density between HapMap and GenCord (∼1.2 millionSNPs versus ∼400,000 SNPs respectively) and our hypothesis that the 157 overlapping intervals share the same functional variant, we expect approximately 3 times more perfect scoring cases (99 pairs with RTC Score = 1) than what we observe, had individuals from both datasets been equally densely genotyped. We use the degree of sharing between the eQTLs in the two datasets to derive a reasonable, yet conservative threshold: currently, 105 SNP-probe pairs pass the 0.9 RTC threshold, making it thus a suitable stringent cut-off for calling significant discoveries.

Significant causal cis effects

We then applied the scoring method on the NHGRI GWAS SNPs. The 976 common GWAS SNPs map to 784 hotspot intervals. Of these, we focused the cis analysis on GWAS intervals (N = 130) where at least one significant cis eQTL at a 0.05 permutation P-value threshold also resides (Dataset S1). For the trans analysis, we ordered all 784 GWAS intervals by their most significant trans eQTL and kept the topmost 50 intervals for further examination (Dataset S2). Table 1 summarizes our most confident cis results ordered by RTC Score. We detect SNP-gene combinations passing the 0.9 threshold for 28 intervals out of the 130, twice as many than expected by chance (13 expected top 10% scoring intervals under the uniform distribution). Our method confirms prior results in the literature suggestive of disease effects mediated through expression (ORMDL3 for asthma risk [13], C8orf13 locus for lupus risk [20], SLC22A5 for Crohn's disease [12],[21]). In addition, we detect several other yet unknown candidate genes for a variety of conditions.
Table 1

Candidate cis results.

GWAS SNPComplex TraitGeneRTCChr
rs2064689Crohn's diseaseWDR7811
rs3129934Multiple sclerosisHLA-DRB116
rs2188962Crohn's diseaseSLC22A515
rs1015362Burning and frecklingTRPC4AP120
rs2735839Prostate cancerC19orf48119
rs6830062HeightLCORL14
rs2242330Parkinsons diseaseTMPRSS11A14
rs7498665Body mass index,WeightEIF3CL116
rs2872507Crohn's diseaseZPBP20.9917
rs255052HDL cholesterolAGRP0.9916
rs4549631HeightTRMT110.986
rs9469220Crohn's diseaseILMN_294120.986
rs11083846Chronic lymphocytic leukemiaSLC8A20.9819
rs13277113Systemic lupus erythematosusC8orf130.978
rs9272346Type 1 diabetesHLA-DRB10.966
rs12324805Body mass indexSTARD50.9615
rs3764261HDL cholesterolMT1H0.9616
rs3135388Multiple sclerosisHLA-DRB50.966
rs3814219Endothelial function traitsFAM26B0.9510
rs12708716Type 1 diabetesILMN_320840.9516
rs2269426Plasma eosinophil countHLA-DRB10.956
rs10769908Body mass indexC11orf170.9411
rs4130590Bipolar disorderILMN_173390.949
rs7216389AsthmaORMDL30.9417
rs3796619Recombination rate (males)CRIPAK0.934
rs1748195TriglyceridesDOCK70.931
rs2903692Type 1 diabetesILMN_320840.9316
rs3197999Crohn's diseaseSLC38A30.923
rs9858542Crohn's diseaseSLC38A30.923
rs6441961Celiac diseaseLIMD10.923
rs660895Rheumatoid arthritisPSMB90.916
rs9652490Essential tremorILMN_1113630.9115
rs1397048Hemostatic factorsOR8H20.9111
rs3825932Type 1 diabetesCTSH0.9115
rs2395185Ulcerative colitisILMN_294120.96

Candidate genes (RTC Score > = 0.9) for cis regulatory mediated GWAS effects. The higher the score, the more likely it is that the GWAS SNP and the eQTL for the gene shown are tagging the same functional variant.

Candidate genes (RTC Score > = 0.9) for cis regulatory mediated GWAS effects. The higher the score, the more likely it is that the GWAS SNP and the eQTL for the gene shown are tagging the same functional variant. An interesting example of a novel cis regulatory mediated effect is the one for Crohn's disease with gene SLC38A3, member 3 of the solute carrier family 38. Independent studies detected significant Crohn's associations of two SNPs in the same hotspot interval on chromosome 3 (rs3197999 [12], a non-synonymous SNP in gene MST1 and rs9858542 [1],[22], a synonymous SNP in nearby gene BSN). Suggestive literature evidence in addition to the disease associated non-synonymous SNP made MST1 the most attractive candidate gene out of the many present in that region [23]. However, our data supports an additional regulatory component underlying the susceptibility locus. For both GWAS SNPs, SLC38A3 is the highest scoring candidate in the region (RTC Score: 0.92). Interestingly, this is functionally similar to another Crohn's susceptibility gene SLC22A5 confirmed with our method (RTC Score: 1.0) and also encoding a sodium dependent multi-pass membrane protein (solute carrier family protein). The observed direction of effect is the same for both genes (eQTLs associate with low expression levels) as in previous expression datasets [12] and suggests a possible involvement of this gene family in the disease. This is in agreement with recent studies reporting that disease causative genes are functionally more closely related [24].

Overrepresentation of immunity-related results

The tissue under investigation is LCLs so we expect GWAS signals of immunity related traits (comprising here autoimmune disorders and diseases of the immune system e.g. AIDS progression) to more likely show an overlap with eQTLs. In order to evaluate the relevance of our results, we analyzed the distributions of the best RTC Scores per GWAS SNP stratified by the immunity relatedness of the complex trait they associate with (Figure 7). We observe a significant overrepresentation of high-scoring genes (> = 0.9) for immunity related traits compared to non-immunity related ones (Fisher's Exact Test, P-value = 0.0125) [25]. This suggests that the scoring scheme predicts regulatory effects of the relevant phenotypes. In addition, we observed that for GWAS signals with RTC score >0.9, only 10% of the nearest gene to the GWAS SNP was also the eQTL gene. These however, correspond as expected to instances when the eQTL gene is also the nearest gene to the eQTL itself. If that is not the case, the inference of relevance of a gene simply based on its proximity to the GWAS SNP is not informative.
Figure 7

Overrepresentation of immunity-related high-scoring cis signals.

Distribution of best RTC Scores per GWAS SNP stratified by immunity relatedness. Histogram contains results from the analysis of 130 hotspot intervals with colocalizing disease SNPs and cis eQTLs. We observe a significant overrepresentation of high-scoring (RTC > = 0.9) candidate genes (black bars) for immunity related complex traits compared to non-immunity related ones (grey bars) (Fisher's Exact Test, P-value = 0.0125).

Overrepresentation of immunity-related high-scoring cis signals.

Distribution of best RTC Scores per GWAS SNP stratified by immunity relatedness. Histogram contains results from the analysis of 130 hotspot intervals with colocalizing disease SNPs and cis eQTLs. We observe a significant overrepresentation of high-scoring (RTC > = 0.9) candidate genes (black bars) for immunity related complex traits compared to non-immunity related ones (grey bars) (Fisher's Exact Test, P-value = 0.0125).

Trans effects

Even if the causal SNP is not cis-regulatory, using gene expression to determine its downstream targets, coupled with information about the biological pathways these targets act in could help interpret the primary GWAS effect. We investigated this hypothesis in the topmost 50 GWAS intervals ordered by their trans eQTL significance. For each interval, we apply the RTC Scoring scheme on the subset of genes in the whole genome with a notable effect in trans (SRC nominal P-value <10−5). These signals amount to a total of 552 genes. We obtain SNP-gene combinations passing the 0.9 Score threshold for 24 of the 50 tested intervals (corresponding to a total of 85 genes). Six of these intervals contain GWAS SNPs associated with immunity related traits (Table 2). While not statistically significant - unsurprisingly given that we're only testing a small subset of the total GWAS intervals - these examples support the usefulness of the trans approach. As hypothesized, for the same complex trait associated SNP we can discover several potential candidate genes in trans, throughout the genome. Some of these are biologically plausible results and merit further investigation. However, many trans candidates are hard to interpret at this stage given their incomplete annotation and further functional studies will need to be performed for validation.
Table 2

Candidate trans results.

GWAS SNPComplex TraitGenesRTCSNP ChrGenes Chr
rs2251746Serum IgE levelsSLC25A180.99122
rs983332Response to TNF antagonistsRGS16, IGSF30.9711
rs983332Response to TNF antagonistsC17orf580.97117
rs653178Celiac diseasePAX8, DOK11122
rs17696736Type 1 diabetesPAX8, DOK10.98122
rs2542151Crohn's,Type 1 diabetesMMP1211811
rs2542151Crohn's,Type 1 diabetesSLC39A4, PSD3, AHNAK2, FAM108B1, CYP2S1, CLEC7A0.97188, 8, 14, 9, 19, 12
rs2542151Crohn's,Type 1 diabetesLENEP0.91181
rs3134792PsoriasisADRA2C164
rs3134792PsoriasisDPEP1, ARHGEF30.99616, 3
rs1265181PsoriasisPOU5F1P10.9668
rs1265181PsoriasisDPEP10.95616
rs1265181PsoriasisCYP4F8, ADRA2C0.94619, 4
rs1265181PsoriasisRGS90.92617
rs2395185Ulcerative colitisB4GALT2, ASB50.9761, 4
rs2395185Ulcerative colitisSTK32A0.9465
rs2395185Ulcerative colitisOXT0.93620
rs2395185Ulcerative colitisCSRP30.92611
rs2395185Ulcerative colitisLGALS40.91619
rs3135388Multiple sclerosisLIMS10.9562
rs477515Inflammatory bowel diseaseB4GALT2161
rs477515Inflammatory bowel diseaseASB50.9964
rs477515Inflammatory bowel diseaseSTK32A0.9565
rs477515Inflammatory bowel diseaseOXT0.94620
rs477515Inflammatory bowel diseaseCSRP30.93611
rs477515Inflammatory bowel diseaseDCHS20.9164
rs477515Inflammatory bowel diseaseLGALS40.9619
rs615672Rheumatoid arthritisDCHS20.9964
rs6457617Rheumatoid arthritisSMARCD30.9567
rs6457620Rheumatoid arthritisSMARCD30.9567
rs660895Rheumatoid arthritisRETSAT0.9962
rs660895Rheumatoid arthritisCALCR0.9867
rs9268877Ulcerative colitisLIMS10.9762
rs9268877Ulcerative colitisB4GALT20.9461
rs9268877Ulcerative colitisASB50.9164
rs9272346Type 1 diabetesLIMS10.9762
rs9272346Type 1 diabetesWHDC1L10.94615
rs9272346Type 1 diabetesASB50.9364
rs9272346Type 1 diabetesSEMA6D, OXT, B4GALT20.92615, 20, 1

Candidate trans genes likely involved in the same biological pathways, relevant to the GWAS SNPs. Signals relating to the same hotspot interval separated by a horizontal line. Table contains only the confident results (RTC Score > = 0.9) for the 6 immunity related intervals.

Candidate trans genes likely involved in the same biological pathways, relevant to the GWAS SNPs. Signals relating to the same hotspot interval separated by a horizontal line. Table contains only the confident results (RTC Score > = 0.9) for the 6 immunity related intervals.

RTC on GWAS data outperforms alternative correlation metrics

The power to detect significant associations between genotyped SNP proxies and a phenotype depends on the correlation between those proxies and the functional variant [26]. Just like for the simulated data, we tested whether the correlation between a GWAS SNP and its colocalizing eQTL is sufficient for predicting a shared causal effect. For both the cis and the trans analysis, we observe that the r2 between the eQTL and the disease SNP is not a direct predictor of the RTC Score, and in several cases we predict that even pairs with low r2 are likely tagging the same functional effect (Figure 8, top panel). The reason for this is that many of the high scoring pairs with poor statistical correlation (low r2) are actually historically correlated (D' = 1). Nevertheless, D' is not very informative either (Figure 8, bottom panel), the main problem here being that in regions with generally high D' among many SNPs, one cannot determine which of the pairs actually represents a common functional variant.
Figure 8

The RTC method compared to standard LD measurements in the observed data.

Neither r2 nor D' between the eQTL and the GWAS SNP are direct predictors of a high RTC Score. Highlighted here are the results from the cis and trans analyses. We obtain high scoring results (RTC Scores > = 0.8 in blue) for cases with a high correlation between the disease SNP and the eQTL as expected, but also for pairs with low statistical correlation (r2 – top panel). As shown in the bottom panel, many of these high scoring pairs are historically correlated (D' = 1), but so are many more by chance. Additionally, we can detect high scoring pairs with low D' as well. Hence, no obvious combination of the two LD measures can predict a high RTC Score.

The RTC method compared to standard LD measurements in the observed data.

Neither r2 nor D' between the eQTL and the GWAS SNP are direct predictors of a high RTC Score. Highlighted here are the results from the cis and trans analyses. We obtain high scoring results (RTC Scores > = 0.8 in blue) for cases with a high correlation between the disease SNP and the eQTL as expected, but also for pairs with low statistical correlation (r2 – top panel). As shown in the bottom panel, many of these high scoring pairs are historically correlated (D' = 1), but so are many more by chance. Additionally, we can detect high scoring pairs with low D' as well. Hence, no obvious combination of the two LD measures can predict a high RTC Score. Another metric of potential predictive value is the fraction of eQTL variance explained by the dSNP. Figure 9 indicates the relationship between the RTC score and the fraction of explained variance at the eQTL left unexplained after the dSNP correction (ratio of linear regression adjusted R∧2 after and before correction). As expected given the definition of the RTC, the highest density of good scoring results is registered for dSNPs that explain most of the eQTL variance. However, RTC outperforms the variance metric, scoring high even when that's not the case and thus making the setting of a threshold on the explained variance not sufficiently informative either.
Figure 9

The fraction of eQTL variance explained away by the dSNP versus the RTC score.

We contrast the LR adjusted R2 at the eQTL after and before correction of the dSNP and observe that while most high scoring pairs correspond to cases of lowest variance left unexplained, other interesting cases would be missed solely by using an arbitrary variance threshold.

The fraction of eQTL variance explained away by the dSNP versus the RTC score.

We contrast the LR adjusted R2 at the eQTL after and before correction of the dSNP and observe that while most high scoring pairs correspond to cases of lowest variance left unexplained, other interesting cases would be missed solely by using an arbitrary variance threshold.

Discussion

To aid the functional interpretation of complex trait association signals, we describe here an empirical methodology that directly integrates eQTL and GWAS data while correcting for the local correlation structure in the human genome. As regulatory variants are pervasive throughout the genome, coincidental overlaps of eQTLs and GWAS SNPs are very likely. Hence, current methods that limit themselves to asking whether disease intervals also harbour eQTLs are unreliable for distinguishing trait relevant regulatory effects from other eQTLs. Our methodology addresses and helps resolve this issue. This approach is not limited to gene expression, but could be generalized to any other phenotype. As new methods are developed and larger cohorts become available, various intermediate cellular phenotypes are interrogated via association studies with the hope to find explanatory links between genotypic variation and complex trait predisposition. However, the biological interpretation of these discoveries will also be hardened by the presence of tight LD. It is therefore necessary to evaluate them in a conservative manner, correcting for the local correlation structure in each genomic interval with overlapping association signals. In this paper, we discover causal regulatory effects and their affected candidate genes in cis and to some extent in trans by assessing the impact on the expression phenotype of the removal of the GWAS SNP effect. We compute a score (RTC) for each individual genomic interval that assesses the likelihood that the eQTL and the GWAS SNP are tagging the same functional variant. By ranking the effect of the removal of the GWAS SNP in comparison to the outcome for any other SNP in the region and by accounting for the number of SNPs tested, we produce a score comparable across intervals. We evaluate the performance of the score in various simulated LD scenarios and we present its robustness by its expected uniform distribution when the eQTL and GWAS SNP are tagging different functional variants. In comparison, we investigate how well do current SNP correlation metrics (r2, D') perform on their own. We show that the LD between the GWAS SNP and its colocalizing eQTL is not a good predictor of a shared functional effect. This is very important especially since most of the current replication and follow-up studies only focus on variants highly correlated (r2>0.8) with the initial discoveries. It is important to stress at this point that neither the eQTL nor the GWAS SNP is likely the causal variant. Therefore, what really matters is not the statistical correlation between two proxies but the correlation between each of the proxies and the causal variant, whose frequency is unknown. In any case, no obvious combination of LD measures can substitute the RTC scoring scheme and we thus conclude that many interesting candidate genes would be missed if one were to rely solely on correlation-based approaches. In this paper, we also explore the explanatory potential of regulatory variation given the currently published GWAS data. We observe a significant overrepresentation of eQTLs among GWAS SNPs, especially affecting genes in cis. Long-range trans effects are also present but less prevalent, possibly due to lower power to detect such associations. As expected given the tissue the expression data was measured in (LCLs), we observe a significant abundance of cis regulatory causal effects for immunity related traits. Our result reinforces the necessity to expand the tissue diversity [27]–[28] of genome-wide expression studies in order to facilitate such discoveries for a wider range of human conditions. By applying the RTC method on the NHGRI GWAS SNPs, we are able to confirm previously suspected regulatory mediated disease effects and discover novel candidate genes affected by GWAS SNPs. We provide a list of follow-up candidate genes affected in cis and in addition, we show the utility of genome-wide expression data irrespective of the nature of the primary SNP effect by predicting clusters of genes affected in trans. The individual examination of the candidates prioritized with our approach will undoubtedly assist the biological interpretation of the ever-increasing list of GWAS signals. As associations with more intermediate cellular phenotypes will be reported, the integration of all these signals will be crucial for understanding the biology of complex traits.

Methods

Gene expression measurements

RNA levels were measured in lymphoblastoid cell lines (LCLs) derived from the HapMap 3 individuals using a whole-genome expression array (Illumina Sentrix WG-6, Version 2) as previously described [14]. Each sample had two technical replicates. We analyzed here only expression data from the CEU, a HapMap 3 population of 109 unrelated individuals of Northern European ancestry. The mapping of Illumina probes to unique Ensembl gene IDs resulted in 21,811 probes corresponding to 17,673 autosomal genes available for association analysis. 1,186,075 SNPs (MAF >5%) genotyped in the same individuals were used in the eQTL analysis.

Post-experimental normalization of gene expression data

The log2 transformed raw intensity values were normalized as follows: quantile normalization of sample replicates (two intensity values per Illumina probe) followed by median normalization across all individuals.

Genome-wide association study (GWAS) results

All SNPs from the catalogue of genome-wide association studies maintained by the National Human Genome Research Institute (NHGRI www.genome.gov/26525384) and published by 02.03.2009 were downloaded. Of these, only the 976 unique common variants (MAF >5%) genotyped in the HapMap 3 CEU samples were kept for analysis.

Genotype-gene expression associations and multiple testing correction

Associations between SNP genotypes and normalized expression values were conducted using Spearman Rank Correlation (SRC). For the cis analysis, we considered only SNPs within a 1MB window from the TSS of genes, while in trans we test all SNPs further than 5MB away from the gene's TSS and all SNP-gene pairs on different chromosomes. We assess the statistical significance of the cis associations using permutations as previously described [7],[14]. We call a cis eQTL significant if the nominal association P-value is greater than the 0.01 tail of the minimal P-value distribution resulting from the SNP's associations with 10,000 permuted sets of expression values for each gene.

Recombination hotspot interval mapping

We mapped all common autosomal CEU HapMap 3 SNPs (1,186,075 SNPs) to recombination hotspot intervals as defined by McVean et.al. [18] For the cis analysis we selected the 130 hotspot intervals where at least one significant cis eQTL and a GWAS SNP colocalize while for the trans, we analyzed a subset of 50 of the total 784 unique intervals (where the 976 GWAS SNPs map to). These are the topmost intervals ordered by their most significant trans eQTL (nominal SRC P-value).

QQ plots of the abundance of regulatory signal in GWAS SNPs

For both the cis and trans GWAS analysis, the best P-value associations per SNP were stored. The set of the most significant P-values of the 976 GWAS SNPs was compared to 1000 sets of most significant P-values of 976 random SNPs. The 1000 random sets of 976 SNPs each were conditioned to have the same MAF distribution as the 976 GWAS set. The QQ plot showing the abundance of regulatory signal in GWAS data is the median QQ plot of 1000 (GWAS, random SNPs) comparisons. It shows the distribution of the −log10 quantile values of the GWAS best associations (observed) versus the median of the corresponding 1000 −log10 quantile values from each of the 1000 random SNP sets (expected). In order to assess the significance of the observed versus expected median QQ plot, we superimpose the upper limit of the 95% confidence interval. This is calculated from the sorted 0.95 quantiles of 10000 pairs of 976 random SNPs each.

Scoring scheme for determining causal regulatory effects

We assess the likelihood of a shared functional effect between a GWAS SNP and an eQTL by quantifying the change in the statistical significance of the eQTL after correcting for the genetic effect of the GWAS SNP. We redo the SRC association of the eQTL genotype with the residuals from the standard LR of the “corrected-for” SNP against normalized expression values. We account for the LD structure in each hotspot interval separately by ranking (RankGWAS SNP) the impact on the eQTL (quantified by the adjusted association P-value after correction) of the GWAS SNP correction to that of correcting for all other SNPs in the same interval. By taking into account the total number of SNPs in the interval (NSNPs), we can compare this ranking across different genes and intervals. For this purpose we define the regulatory trait concordance (RTC) Score ranked below ranging from 0 to 1, with values closer to 1 indicating causal regulatory effects.

Simulations of different causal SNPs (H0) and same causal SNP (H1) scenarios

We investigate the properties of the RTC score with respect to different correlation metrics under the null hypothesis (H0: eQTL and dSNP tag different functional variants) and the alternative hypothesis (H1: eQTL and dSNP tag the same functional variant). We use the HapMap3 CEU cis eQTLs (315 genes at 10−3 permutation threshold) to create a list of causal SNPs (cSNP). For the H0, we call these cSNPs causal eQTL SNPs (c-eQTL). For each c-eQTL, we sample a different causal disease SNP (c-dSNP) from the same interval, with the requirement that its MAF comes from a distribution identical to that of the 976 NHGRI GWAS SNPs. Subsequently, we sample up to five eQTL-dSNP pairs per interval where the eQTLs and dSNPs are the topmost correlated (r2) SNPs with the c-eQTL and the c-dSNP respectively. After sampling, we exclude cases where the eQTL and dSNP are identical, as these contradict the H0..c-eQTL-c-dSNP-eQTL-dSNP quartets mapping to 287 unique hotspot intervals were sampled and tested under H0. Under the H1, we sample up to five eQTL-dSNP pairs for each hotspot interval harbouring a cSNP as follows: the eQTLs are chosen as the top most significant SNPs per eQTL gene - excluding the cSNP; the dSNPs are randomly sampled from the same hotspot interval such that the r2 between each of them and the cSNP is in the range [0.5,0.9]. At any stage of the 5-step iteration per cSNP, the dSNP must be different from the cSNP and the eQTLs sampled up to that point. cSNP-eQTL-dSNP trios mapping to 290 unique hotspot intervals throughout the genome were sampled and tested under the H1. We use the LD values (r2) of all pairwise SNP combinations per interval to calculate the median r2, an estimate of the LD extent per region.

GenCord eQTLs as GWAS SNPs

To perform a control experiment where the trait is gene expression, we used cis eQTLs (10−3 P-value permutation threshold) detected in LCLs derived from 75 unrelated individuals of Western European origin from the GenCord project [19]. Hotspot intervals (N = 157) where both a HapMap and a GenCord eQTL associating with the same Ensembl gene reside were analysed with the RTC Scoring scheme. Cis analysis RTC results. Candidate cis regulatory effects as ranked by the RTC. No score filtering. (0.08 MB XLS) Click here for additional data file. Trans analysis RTC results. Candidate trans regulatory effects as ranked by the RTC in 50 genomic intervals. No score filtering. (0.30 MB XLS) Click here for additional data file.
  27 in total

1.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease.

Authors:  Jeffrey C Barrett; Sarah Hansoul; Dan L Nicolae; Judy H Cho; Richard H Duerr; John D Rioux; Steven R Brant; Mark S Silverberg; Kent D Taylor; M Michael Barmada; Alain Bitton; Themistocles Dassopoulos; Lisa Wu Datta; Todd Green; Anne M Griffiths; Emily O Kistner; Michael T Murtha; Miguel D Regueiro; Jerome I Rotter; L Philip Schumm; A Hillary Steinhart; Stephan R Targan; Ramnik J Xavier; Cécile Libioulle; Cynthia Sandor; Mark Lathrop; Jacques Belaiche; Olivier Dewit; Ivo Gut; Simon Heath; Debby Laukens; Myriam Mni; Paul Rutgeerts; André Van Gossum; Diana Zelenika; Denis Franchimont; Jean-Pierre Hugot; Martine de Vos; Severine Vermeire; Edouard Louis; Lon R Cardon; Carl A Anderson; Hazel Drummond; Elaine Nimmo; Tariq Ahmad; Natalie J Prescott; Clive M Onnie; Sheila A Fisher; Jonathan Marchini; Jilur Ghori; Suzannah Bumpstead; Rhian Gwilliam; Mark Tremelling; Panos Deloukas; John Mansfield; Derek Jewell; Jack Satsangi; Christopher G Mathew; Miles Parkes; Michel Georges; Mark J Daly
Journal:  Nat Genet       Date:  2008-06-29       Impact factor: 38.330

2.  From gene expression to disease risk.

Authors:  Emmanouil T Dermitzakis
Journal:  Nat Genet       Date:  2008-05       Impact factor: 38.330

Review 3.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

4.  A null mutation in human APOC3 confers a favorable plasma lipid profile and apparent cardioprotection.

Authors:  Toni I Pollin; Coleen M Damcott; Haiqing Shen; Sandra H Ott; John Shelton; Richard B Horenstein; Wendy Post; John C McLenithan; Lawrence F Bielak; Patricia A Peyser; Braxton D Mitchell; Michael Miller; Jeffrey R O'Connell; Alan R Shuldiner
Journal:  Science       Date:  2008-12-12       Impact factor: 47.728

5.  Gene-centric association mapping of chromosome 3p implicates MST1 in IBD pathogenesis.

Authors:  P Goyette; C Lefebvre; A Ng; S R Brant; J H Cho; R H Duerr; M S Silverberg; K D Taylor; A Latiano; G Aumais; C Deslandres; G Jobin; V Annese; M J Daly; R J Xavier; J D Rioux
Journal:  Mucosal Immunol       Date:  2008-01-16       Impact factor: 7.313

6.  Common polymorphic transcript variation in human disease.

Authors:  Hunter B Fraser; Xiaohui Xie
Journal:  Genome Res       Date:  2009-02-02       Impact factor: 9.043

7.  The fine-scale structure of recombination rate variation in the human genome.

Authors:  Gilean A T McVean; Simon R Myers; Sarah Hunt; Panos Deloukas; David R Bentley; Peter Donnelly
Journal:  Science       Date:  2004-04-23       Impact factor: 47.728

8.  Common regulatory variation impacts gene expression in a cell type-dependent manner.

Authors:  Antigone S Dimas; Samuel Deutsch; Barbara E Stranger; Stephen B Montgomery; Christelle Borel; Homa Attar-Cohen; Catherine Ingle; Claude Beazley; Maria Gutierrez Arcelus; Magdalena Sekowska; Marilyne Gagnebin; James Nisbett; Panos Deloukas; Emmanouil T Dermitzakis; Stylianos E Antonarakis
Journal:  Science       Date:  2009-07-30       Impact factor: 47.728

9.  Common variants near MC4R are associated with fat mass, weight and risk of obesity.

Authors:  Ruth J F Loos; Cecilia M Lindgren; Shengxu Li; Eleanor Wheeler; Jing Hua Zhao; Inga Prokopenko; Michael Inouye; Rachel M Freathy; Antony P Attwood; Jacques S Beckmann; Sonja I Berndt; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Sven Bergmann; Amanda J Bennett; Sheila A Bingham; Murielle Bochud; Morris Brown; Stéphane Cauchi; John M Connell; Cyrus Cooper; George Davey Smith; Ian Day; Christian Dina; Subhajyoti De; Emmanouil T Dermitzakis; Alex S F Doney; Katherine S Elliott; Paul Elliott; David M Evans; I Sadaf Farooqi; Philippe Froguel; Jilur Ghori; Christopher J Groves; Rhian Gwilliam; David Hadley; Alistair S Hall; Andrew T Hattersley; Johannes Hebebrand; Iris M Heid; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Blanca Herrera; Anke Hinney; Sarah E Hunt; Marjo-Riitta Jarvelin; Toby Johnson; Jennifer D M Jolley; Fredrik Karpe; Andrew Keniry; Kay-Tee Khaw; Robert N Luben; Massimo Mangino; Jonathan Marchini; Wendy L McArdle; Ralph McGinnis; David Meyre; Patricia B Munroe; Andrew D Morris; Andrew R Ness; Matthew J Neville; Alexandra C Nica; Ken K Ong; Stephen O'Rahilly; Katharine R Owen; Colin N A Palmer; Konstantinos Papadakis; Simon Potter; Anneli Pouta; Lu Qi; Joshua C Randall; Nigel W Rayner; Susan M Ring; Manjinder S Sandhu; André Scherag; Matthew A Sims; Kijoung Song; Nicole Soranzo; Elizabeth K Speliotes; Holly E Syddall; Sarah A Teichmann; Nicholas J Timpson; Jonathan H Tobias; Manuela Uda; Carla I Ganz Vogel; Chris Wallace; Dawn M Waterworth; Michael N Weedon; Cristen J Willer; Xin Yuan; Eleftheria Zeggini; Joel N Hirschhorn; David P Strachan; Willem H Ouwehand; Mark J Caulfield; Nilesh J Samani; Timothy M Frayling; Peter Vollenweider; Gerard Waeber; Vincent Mooser; Panos Deloukas; Mark I McCarthy; Nicholas J Wareham; Inês Barroso; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Peter Kraft; Susan E Hankinson; David J Hunter; Frank B Hu; Helen N Lyon; Benjamin F Voight; Martin Ridderstrale; Leif Groop; Paul Scheet; Serena Sanna; Goncalo R Abecasis; Giuseppe Albai; Ramaiah Nagaraja; David Schlessinger; Anne U Jackson; Jaakko Tuomilehto; Francis S Collins; Michael Boehnke; Karen L Mohlke
Journal:  Nat Genet       Date:  2008-05-04       Impact factor: 38.330

10.  Statistical independence of the colocalized association signals for type 1 diabetes and RPS26 gene expression on chromosome 12q13.

Authors:  Vincent Plagnol; Deborah J Smyth; John A Todd; David G Clayton
Journal:  Biostatistics       Date:  2008-11-27       Impact factor: 5.899

View more
  247 in total

Review 1.  Cellular genomics for complex traits.

Authors:  Emmanouil T Dermitzakis
Journal:  Nat Rev Genet       Date:  2012-02-14       Impact factor: 53.242

2.  Mapping rare and common causal alleles for complex human diseases.

Authors:  Soumya Raychaudhuri
Journal:  Cell       Date:  2011-09-30       Impact factor: 41.582

Review 3.  Allele-specific DNA methylation: beyond imprinting.

Authors:  Benjamin Tycko
Journal:  Hum Mol Genet       Date:  2010-09-20       Impact factor: 6.150

4.  RNA sequencing reveals the role of splicing polymorphisms in regulating human gene expression.

Authors:  Emilie Lalonde; Kevin C H Ha; Zibo Wang; Amandine Bemmo; Claudia L Kleinman; Tony Kwan; Tomi Pastinen; Jacek Majewski
Journal:  Genome Res       Date:  2010-12-20       Impact factor: 9.043

5.  Fast and efficient QTL mapper for thousands of molecular phenotypes.

Authors:  Halit Ongen; Alfonso Buil; Andrew Anand Brown; Emmanouil T Dermitzakis; Olivier Delaneau
Journal:  Bioinformatics       Date:  2015-12-26       Impact factor: 6.937

6.  Variants in an Hdac9 intronic enhancer plasmid impact Twist1 expression in vitro.

Authors:  Tyler E Siekmann; Madelyn M Gerber; Amanda Ewart Toland
Journal:  Mamm Genome       Date:  2015-12-31       Impact factor: 2.957

7.  A Burden of Rare Variants Associated with Extremes of Gene Expression in Human Peripheral Blood.

Authors:  Jing Zhao; Idowu Akinsanmi; Dalia Arafat; T J Cradick; Ciaran M Lee; Samridhi Banskota; Urko M Marigorta; Gang Bao; Greg Gibson
Journal:  Am J Hum Genet       Date:  2016-02-04       Impact factor: 11.025

Review 8.  Using chromatin marks to interpret and localize genetic associations to complex human traits and diseases.

Authors:  Gosia Trynka; Soumya Raychaudhuri
Journal:  Curr Opin Genet Dev       Date:  2013-11-25       Impact factor: 5.578

Review 9.  Determining causality and consequence of expression quantitative trait loci.

Authors:  A Battle; S B Montgomery
Journal:  Hum Genet       Date:  2014-04-26       Impact factor: 4.132

10.  Genome-wide approaches in pharmacogenomics: heritability estimation and pharmacoethnicity as primary challenges.

Authors:  Eric R Gamazon; Minoli Perera
Journal:  Pharmacogenomics       Date:  2012-07       Impact factor: 2.533

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.