Literature DB >> 19128478

Complex nature of SNP genotype effects on gene expression in primary human leucocytes.

Graham A Heap1, Gosia Trynka, Ritsert C Jansen, Marcel Bruinenberg, Morris A Swertz, Lotte C Dinesen, Karen A Hunt, Cisca Wijmenga, David A Vanheel, Lude Franke.   

Abstract

BACKGROUND: Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown.
METHODS: We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease - a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects.
RESULTS: In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected.
CONCLUSION: In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.

Entities:  

Year:  2009        PMID: 19128478      PMCID: PMC2628677          DOI: 10.1186/1755-8794-2-1

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

Human gene expression levels have a strong heritable component [1-4]. At some genes, the variance in gene expression levels is an order of magnitude greater between unrelated individuals, than between identical twins [5]. Quantitative mRNA levels are key regulators of phenotype and represent a link between genetic variation and phenotypic alterations. A term first introduced by Jansen & Nap [6], genetical genomics aims to identify the genetic variants that affect gene expression. By treating gene expression as a quantitative trait it is possible to correlate gene transcript expression levels with genomic locations such that expression quantitative trait loci (eQTLs) can be identified [7]. In the human genome, cis associations, where a genetic variant affects a transcript that maps to the same locus, have been predominantly reported [3,8]. Trans effects, where the genetic variant is distant to the transcript loci, are much harder to convincingly identify due to inherent multiple testing problems. Analysis of trans effects involves several magnitudes more statistical tests than for cis effects. Although individual studies have reported human trans associations, no effects have been convincingly replicated in multiple studies identified for the same transcript and variant [1,9,10]. Quantitative transcript expression and genotype relationships can be investigated via linkage or association based methodologies. Linkage studies use a genome wide series of markers in recombinant in-bred lines or families, to follow the heritability of a trait, whilst association studies usually compare large number of single nucleotide polymorphisms (SNPs) to transcript levels from unrelated individuals. Despite its extensive use in plant [11,12], mouse [13,14] and rat [15] models, transcript expression and genotype correlation studies have only recently been performed in humans [3,5,8]. Human association studies have centered on RNA obtained from leukocytes, predominantly Epstein-Barr virus (EBV) transformed B cell lines from HapMap individuals [3] or unrelated trios [1]. A large scale genetical genomics linkage study has recently identified more than 1000 cis regulatory loci across the genome, in primary cryopreserved human leukocytes [2], although these are broad genomic regions rather than variants in more precisely defined LD blocks. However, the former studies are limited by their ability to assess transcripts that are significantly expressed at the point of RNA isolation. Cell extraction methods, cryopreservation and EBV transformation all affect individual mRNA expression levels. These variations in RNA analysis make the choice of analysis tissue of paramount importance. Here we present an association based genetical genomics study using primary cell RNA from peripheral blood sampled from patients in remission from an immune-mediated disease. Immediate RNA preservation during blood sampling (using the PAXgene system) represents nearly in vivo human physiological gene expression. Celiac disease is a common (1% prevalence), inflammatory condition of the small intestine induced by intake of gluten in wheat, rye and barley. A strong genetic component has been established for disease with a monozygotic concordance of 75% [16] and 90% of cases possessing the HLA haplotype HLA-DQ2.5 [17] and the remainder mostly have HLA-DQ8 [18]. Despite the role of the HLA, the risk of disease is still greater in HLA matched monozygotic twins compared to HLA matched dizygotic twins [19]. We recently performed a genome wide association and replication study using single nucleotide polymorphisms (SNPs) and identified an additional eight susceptibility loci that predispose to celiac disease [20,21]. A surprising finding from this study was that seven of the eight regions contained promising candidate genes expressed in leucocytes of the immune system. This suggested the feasibility of genetical genomics approaches using peripheral blood to assess the biological function of the celiac disease associated risk variants. One goal of genetical genomics is to uncover previously unknown biological pathways. If genetic variation affects the expression of a gene in trans, this suggests a biological relationship exists between the two loci. To assess this, considerable amounts of effort have been devoted to the development and application of statistical frameworks that are capable of detecting these trans-eQTLs. However, detection of trans-eQTLs in human populations has proven less successful than in mouse, rat and plant recombinant inbred lines [13,22]. It has been suggested that the extensive genetic and environmental diversity between human individuals masks many of the existing trans-effects. We show that numerous cis-eQTLs can be identified through an expression analysis of peripheral blood RNA. We also show many of these are only detectable in peripheral blood RNA, and not in EBV-transformed B cell lines. Through a meta-analysis of these two datasets we identified numerous additional cis-eQTLs. We show that for some cis-eQTLs genetic variation does not lead to overall gene expression changes, but rather leads to shifts in the types of different splice isoforms that are produced. The complex nature of genotypic effects on gene expression limits our ability to fully elucidate the effect on RNA expression level or sequence and combined with the significant multiple testing problems prevented the accurate identification of trans effects.

Methods

Study population

115 UK celiac disease individuals were recruited from Barts and the London NHS Trust and the Oxford Radcliffe Hospitals NHS Trust after informed consent and with ethical approval. Individuals all had a small bowel endoscopic biopsy diagnosis of celiac disease, median age of 51 (23–88), a median age at diagnosis of 42 (1–75), a male to female sex ratio of 1:3, and a median length of treatment on a gluten free diet of 9.4 years (1–47). Celiac individuals responding to a gluten free diet typically show no detectable inflammation (which we confirmed at the mRNA level in peripheral blood). We also enrolled 22 healthy unrelated UK controls, with a male to female sex ratio of 1:1.2, but of unknown age.

PAXgene RNA Extraction

2.5 ml of peripheral blood was collected into a PAXgene tube (Becton Dickinson, UK, 762165). PAXgene vials were chosen to prevent density gradient centrifugation, immortalization or in vitro cell culture artifacts changing mRNA profiles. PAXgene tubes were mixed gently and incubated at room temperature for two hours. After collection, tubes were frozen at -20°C for at least 24 hours followed by storage at -80°C. RNA was isolated using the PAXgene Blood RNA isolation kit (Qiagen, UK, 762174). RNA was quantified using the Nanodrop (Nanodrop Technologies, USA). Total RNA integrity was analyzed using an Agilent Bioanalyzer (Agilent Technologies, USA).

Anti-sense RNA synthesis, amplification, purification and hybridization to Illumina expression chips

Anti-sense RNA was synthesized, amplified and purified using the Ambion Illumina TotalPrep Amplification Kit (Ambion, USA) following the manufacturers' protocol. Complementary RNA was hybridized to Illumina HumanRef-8 v2 Whole Genome BeadChips and scanned on the Illumina BeadArray Reader. Data was handled through the Illumina BeadStudio Gene Expression module v3.2. All gene expression (MIAME compliant) and genotype data has been made freely available by submission to GEO under GSE11501.

Quality control

Five celiac disease samples were excluded from subsequent analysis due to poor median probe intensity correlation with all other samples or incorrect sex assignment, based on an analysis of all the probes that mapped to the non-pseudoautosomal region of chromosome Y. A dataset of 110 celiac disease patients and 22 controls was then used for analysis.

Normalization

Expression probes were mapped to the cDNA sequence from Ensembl v45_36g [23] and the NCBI build 36 genome assembly if necessary. Probes that had less than 96% sequence homology or that mapped to multiple loci were removed. Subsequent analyses were confined to autosomal probes, in order to prevent sex specific effects on gene expression. After removal of probes that map to sex chromosomes, data was quantile-quantile normalized [24].

Celiac disease sample genotypes

All celiac disease patients were genotyped as previously described [20] using Illumina Infinium HumanHap300v1.0 BeadChips.

Peripheral blood eQTL association analysis and false discovery rate control

257,013 autosomal SNPs were tested for association with expression levels in the 110 celiac disease samples that met analysis criteria of minor allele frequency (MAF) > 0.1, exact Hardy-Weinberg equilibrium P-Value > 0.0001 and call-rate > 0.95. Analyses were confined to those probe-SNP pairs for which the distance from probe genomic midpoint to SNP genomic location was less than 250 kb or 500 kb, depending on the analysis performed. To prevent spurious associations due to outliers, a non-parametric Spearman's rank correlation analysis was performed. In order to correct for multiple testing we controlled the false discovery rate (FDR) [25]. The distribution of all the observed p-values was used to calculate the FDR, by comparing this distribution to a null-distribution, obtained from an identical analysis where the expression phenotypes, relative to the genotypes had been permuted. Through 1,000 permutations the Spearman's rank correlation P-value threshold could be determined that corresponded to an FDR of 0.01 or 0.05.

Validation panel: HapMap CEU samples

We compared the identified cis-eQTLs in the celiac peripheral blood dataset to a published human genetical genomics dataset [3]. We reanalyzed expression data from EBV-transformed B cell lines (further described as HapMap B cell line dataset) for 90 CEU HapMap samples [3]. Analyses were performed as described for the celiac disease samples. To enable a comparison between the celiac peripheral blood dataset and the HapMap B cell line dataset, only SNPs were tested that had been successfully called within HapMap and that were present on the Illumina HumanHap300 platform (257,013 SNPs). Although this is only a subset of all the SNPs that have been called for these HapMap samples, this subset of SNPs is known to capture most genetic Caucasian variation well [26].

Analysis of over- and underrepresented biological processes and function

We investigated over- or underrepresentation of certain biological processes or functions through an analysis of all significant cis-eQTL genes using the Panther Classification System [27] (Binomial P-Value, Bonferroni corrected).

Co-expression analysis

Many cis-eQTLs have been detected in human datasets, but only a few trans-eQTLs have been found [1,3,8,10]. These trans-eQTLs imply a biological relationship between the trans-locus and the trans-gene and as such can provide valuable biological insight. If several of these were to be identified, they would allow for the reconstruction of gene networks. A different, regularly used approach for reconstructing gene networks is by systematically assessing co-expression between pairs of genes. We performed such a co-expression analysis for each probe that comprised a cis-eQTL and contrasted this to an analysis where the genetic component of the cis-eQTL on the probe intensity level had been removed. We first investigated whether the identified probe that comprised a cis-eQTLs (probe distance of 250 kb, FDR of 0.05) showed co-expression with all other probes that mapped to different chromosomes for both the celiac peripheral blood dataset and the HapMap B cell line dataset. We correlated the measured intensity levels with all other probes that mapped on different chromosomes, through a Spearman's rank correlation analysis. By applying Bonferroni correction to account for multiple testing, sets of significantly correlated transcript-pairs could be identified. Subsequently we used partial-correlations, enabling us to redo this co-expression analysis while effectively removing the genotypic effect on the probe intensity level for the probes that constitute the significant cis-eQTLs. Again, by applying Bonferroni correction, sets of significantly correlated transcript-pairs could be identified.

Results

Celiac disease is an immune mediated disease, dominated by TH1 cytokine response. Given the importance of tissue specific RNA profiles, we felt that peripheral blood was an appropriate medium to study to investigate celiac disease associated genetic variants. Given the known genetic contribution to disease, we enrolled treated celiac disease individuals rather than healthy controls, leading to an enrichment for celiac disease associated genetic variants and underlying causal variants [20,28]. For 110 celiac disease samples that passed quality control, both expression at 19,867 transcripts and genotype data for 257,013 SNPs, were analyzed.

Gene expression in celiac disease versus healthy control samples

To obtain the most accurate reflection of mRNA levels in peripheral blood leukocytes, whole blood RNA was immediately fixed during venepuncture in PAXgene vials, giving a reflection of in vivo RNA expression from whole blood. One hundred and fifteen treated celiac patients, all of whom were successfully treated and compliant with a gluten free diet for at least six months, were enrolled. Seventy percent of celiac disease patients were female, (66.6% of adult cases diagnosed with celiac disease in the population are female [29]). In addition, 22 random healthy control samples were enrolled to obtain a background RNA expression profile, enabling us to determine whether the patients indeed did not show ongoing inflammation. In a sex matched comparison no known inflammatory disease associated cytokines, including IFNG [30] and IL2 [31] showed significantly increased expression (sex matched Wilcoxon Signed-Ranks Test P value < 0.05) in celiac versus control samples, as was expected since these patients had been treated with a gluten free diet.

Cis-associations of gene expression with SNPs

Gene expression levels for 19,867 transcripts were analyzed for significant genotypic effects at 257,013 autosomal SNPs, mapping within 500 kb of the centre of the transcript probes, resulting in 1,850,599 tests. By using a Spearman's rank correlation coefficient statistic and an FDR of 0.01 or 0.05, 1,360 and 2,178 different SNP-probe effects were detected, respectively. These reflect 394 and 658 unique probes, and 1,273 and 2,035 unique SNPs, respectively (see Table 1).
Table 1

Summary of cis-eQTL findings from celiac peripheral blood and HapMap B cell line data sets

Population110 celiac disease samples(Celiac peripheral blood dataset)90 Caucasian HapMap samples(HapMap B cell datasets)
Expression dataIllumina HumanRef-8 v2 WholeGenome BeadChip (19,867 mapped transcripts)Illumina HumanRef-6 v1 WholeGenome BeadChip (44,791 mapped transcripts)
Genotype dataIllumina Infinium HumanHap300 BeadChip (257,013 SNPs assessed)Subset of all HapMap Genotypes present on Illumina Infinium HumanHap300 BeadChip (257,013 SNPs assessed)
FDR 0.01FDR 0.05FDR 0.01FDR 0.05
SNP-Probe midpoint distance250 kb500 kb250 kb500 kb250 kb500 kb250 kb500 kb
FDR Spearman's Correlation P-Value Thresholdp < 1.67 × 10-5p < 6.57 × 10-6p < 1.36 × 10-4p < 5.47 × 10-5p < 1.08 × 10-5p < 4.20 × 10-6p < 8.79 × 10-5p < 3.47 × 10-5
Number of performed tests930,4561,850,599930,4561,850,5991,913,3423,820,1481,913,3423,820,148
Number of detected differentSNP-probe effects1,529 (235)1,360 (233)2,487 (372)2,178 (355)1,981 (664)1,799 (623)3,226 (1,068)2,839 (988)
Number of unique probes470 (65)394 (56)765 (98)658 (84)613 (193)536 (177)994 (289)821 (255)
Number of unique genes460 (65)385 (56)753 (98)647 (84)563 (189)491 (174)903 (277)746 (244)
Number of unique SNPs1,432 (234)1,273 (230)2,315 (367)2,035 (345)1,743 (628)1,601 (585)2,826 (1009)2,464 (920)

Numbers within brackets denote the number of cis-eQTLs, probes or genes that are potentially due to SNPs that map within the probe and affect the hybridization efficacies of these probes.

Summary of cis-eQTL findings from celiac peripheral blood and HapMap B cell line data sets Numbers within brackets denote the number of cis-eQTLs, probes or genes that are potentially due to SNPs that map within the probe and affect the hybridization efficacies of these probes. An identical analysis was performed on publicly available EBV-transformed B cell line expression and genotype data for 90 CEU HapMap individuals [3]. Using this dataset and a SNP-probe distance of 500 kb and FDR of 0.01 or 0.05, 1,799 and 2,839 different SNP-probe effects were detected, respectively. These reflect 536 and 821 unique probes, and 1,601 and 2,464 unique SNPs, respectively (see Table 1). Details of all detected individual cis-eQTLs are available in Additional file 1.

Distance between SNPs and probes that constitute the cis-eQTLs

Our analysis was initially confined to probes that had a midpoint distance to the tested SNPs less than 500 kb. Analysis of the significant cis-eQTLs SNP-probe distances (Figure 1) suggests few cis-eQTLs have been missed by imposing this threshold, as in both datasets for 95% of the cis-eQTLs, the SNPs map within 250 kb of the probes. As such it is expected that an increase in statistical power can be achieved by reducing the distance to 250 kb as less tests will be performed. Indeed in the celiac peripheral blood dataset more cis-eQTLs were identified (in total 2,487 different SNP-probe effects were observed, compared to 2,178 in the 500 kb analysis, based on an FDR of 0.05), reflecting 765 unique probes and 2,315 unique SNPs. For the HapMap B cell line dataset 3,226 different SNP-probe effects were identified (compared to 2,839 in the 500 kb analysis, based on an FDR of 0.05), reflecting 994 unique probes and 2,826 unique SNPs (see Table 1). Subsequent analyses were confined to cis-eQTLs that had been detected using this 250 kb window analysis.
Figure 1

Cumulative genomic distance distribution between SNP and probe midpoint for significant .

Cumulative genomic distance distribution between SNP and probe midpoint for significant .

Meta-analysis celiac peripheral blood and HapMap B cell lines data sets

4,681 Illumina expression probes had oligonucleotide sequences that are shared between the two different oligonucleotide arrays used (Human Ref8 v2 & Human WG-1 v1). By limiting the analysis to a window size of 250 kb and only to SNPs that had been successfully genotyped in both studies, 576 different SNP-probe effects in the celiac peripheral blood data at a FDR = 0.05 (338 at an FDR = 0.01) were detected. These reflect 573 different SNPs and 197 different probes. In the HapMap B cell line data, 573 different SNP-probe effects were identified at an FDR = 0.05 (290 at an FDR = 0.01), reflecting 573 different SNPs and 189 different probes. A combined meta-analysis of both cohorts (weighted-Z method) identified 1,133 different SNP-probe associations at an FDR = 0.05 (428 at an FDR = 0.01) (see Figure 2). These reflect 1,120 SNPs and 328 unique probes. 440 SNP-probe pairs, reflecting 217 different probes and 439 different SNPs (FDR = 0.05) were not detected when either dataset was analyzed separately.
Figure 2

Summary of meta-analysis of 4,681 identical probes between the celiac peripheral blood and HapMap B cell line data sets (at an FDR = 0.05, SNP-probe midpoint distance < 250 kb).

Summary of meta-analysis of 4,681 identical probes between the celiac peripheral blood and HapMap B cell line data sets (at an FDR = 0.05, SNP-probe midpoint distance < 250 kb). 135 identical SNP-Probe effects (reflecting 135 different SNPs and 51 different probes) had been identified in both the 110 celiac disease samples and in the 90 HapMap B cell line samples (FDR = 0.05). In all cases, the combined meta-P-Value for each of these shared cis-eQTLs was more significant due to the larger sample size, indicating allelic effects in the same direction. A comparison between cis-eQTLs that had been detected in a large scale linkage based genetical genomics study using RNA obtained from peripheral blood mononuclear cells [2], and the cis-eQTLs we detected, indicates that 57.3% (distance 250 kb, 0.05FDR) of probes, displaying a genotypic effect in the celiac peripheral blood dataset, also show a linkage cis-eQTL signal. Of note is that CCR3 and IL18RAP are both identified as cis-eQTLs in both these datasets.

Primer Polymorphisms

While it can be assumed that for most of the detected cis-eQTLs indeed the probe expression is affected by genetic variation, it can also be that SNPs, mapping to regions to which the probe hybridizes, may affect hybridization efficacies and result in cis-eQTLs [32] that are not due to expression differences. For the celiac peripheral blood dataset, 10.0% of all Illumina HumanRef-8 v2 probes map to regions that contain known dbSNP polymorphisms. For the HapMap B cell line dataset, 20.5% of all Illumina HumanRef-6 v1 probes map to known SNPs. For the probes that make up the identified cis-eQTLs (distance 250 kb, FDR = 0.05) this percentage is significantly higher for both the celiac peripheral blood analysis (12,5%, Fisher's Exact test P = 0.02) and even more pronounced for the HapMap B cell line dataset (29.1%, P = 7.76 × 10-11). Numbers within brackets in table 1 denote the number of cis-eQTLs, probes or genes that are potentially due to these primer polymorphisms. If these primer polymorphisms are responsible for different hybridization efficacies, these SNPs should be in LD with the SNPs that make up the cis-eQTLs. We could assess this for those cis-eQTLs where genotype data was available within HapMap Phase II for both the SNP that makes up the eQTL and for the SNPs that map within the probe that constitutes this particular eQTL. We compared the resulting distribution of r2 values to a null distribution, generated by calculating the LD between the SNP that makes up an eQTL and the SNP that mapped immediately adjacent to the primer polymorphism SNP. While the mean LD (mean r2 = 0.27) in the celiac peripheral blood was not significantly higher than the mean LD in the null distribution (mean r2 = 0.25, Wilcoxon Mann-Whitney P-Value = 0.86), this was observed for the HapMap B cell line dataset (mean r2 = 0.46, mean r2 of null distribution = 0.30, Wilcoxon Mann-Whitney P-Value = 6.3 × 10-8).

Differential gene expression between tissue sample types influences cis-eQTL detection

To enable a comparison of gene expression levels between the HapMap B cell line samples and the celiac peripheral blood samples, we limited our analysis to 12,401 transcripts that map to identical exons. The expression levels for these transcripts were quantile normalized. cis-eQTLs that have only been significantly detected in the celiac peripheral blood dataset, reflect probes that on average show higher expression in the celiac peripheral blood samples compared to HapMap B cell line samples (Wilcoxon Signed-Ranks test, P = 7.6 × 10-7) (see Figure 3). Conversely, probes that comprised HapMap B cell line specific cis-eQTLs were higher expressed than within the celiac peripheral blood dataset (P = 2.0 × 10-4). As expected, probes that comprised cis-eQTLs that were common to both data sets did not show differences in expression (P = 0.30). Noteworthy is that IL18RAP and CCR3, both recently identified as being genetically associated with celiac disease [21], exhibited significant cis-regulation in the celiac peripheral blood dataset but not in the HapMap B cell line dataset. Cis-effects for these two genes could be detected as these two genes showed markedly upregulated expression in the celiac peripheral blood dataset (expression rank order 12,072 vs. 6,431 and 11,104 vs. 3,602, respectively). These results indicate that the analysis of different cell types is valuable to gain insight in the potential functional consequences of disease associated genetic variants.
Figure 3

Differential Gene Expression between tissue types results in differential . a) Density plot of log fold change for cis-eQTLs detected in celiac dataset but not in HapMap samples (FDR = 0.01, SNP-probe midpoint distance < 250 kb). b) Density plot of log fold change for cis-eQTLs detected in HapMap dataset but not in the celiac dataset, c) Density plot of log fold change for cis-eQTLs detected in both data sets. P values derived from a Wilcoxon Signed-Ranks Test.

Differential Gene Expression between tissue types results in differential . a) Density plot of log fold change for cis-eQTLs detected in celiac dataset but not in HapMap samples (FDR = 0.01, SNP-probe midpoint distance < 250 kb). b) Density plot of log fold change for cis-eQTLs detected in HapMap dataset but not in the celiac dataset, c) Density plot of log fold change for cis-eQTLs detected in both data sets. P values derived from a Wilcoxon Signed-Ranks Test.

Overrepresented biological pathways

The genes, comprising the significant cis-eQTLs (distance 250 kb, FDR = 0.05), showed an overrepresentation of hydrolase and transferase functions for both datasets, but 'immunity and defense' cis-eQTLs genes were more predominantly detected in the celiac disease peripheral blood dataset than in the HapMap B cell line dataset (see Additional file 2), as might be expected from the differential RNA profiles of the cells under investigation in each dataset.

Significant opposite allelic directions for probes, mapping within the same genes

The observed concordance in allelic direction for the 135 significant SNP-probe effects, detected in both the 110 celiac peripheral blood and the 90 HapMap B cell line samples was perfect when probe sequences were identical. We also assessed whether the allelic directions for cis-eQTLs probes was the same when multiple significant cis-eQTL probes mapped within the same genes. We identified 14 genes (see Table 2) where this was not the case: significant cis-eQTL probes with different oligonucleotide sequences mapped to the same genes but showed opposite allelic directions. For three of the genes, MRPL43, OAS1 and TIPRL, we re-sequenced the probe regions in four, four and three individuals with different genotypes, respectively, and did not discover any polymorphisms, confirming that these cis-eQTLs do not result from artificial hybridization differences.
Table 2

Genes containing multiple probes that are affected by SNPs that also affect other probes in the same gene, but with opposite allelic directions

SourceSNPHUGOSpearmanProbeUnique Illumina Identifier*Probe SequenceNumber of dbSNP polymorphisms known within probe region
Celiacrs1131383POLR2J-0.48GI_62422568ILMN_16573170
HapMaprs1131383POLR2J0.57GI_2170427542107310
Celiacrs1901198IRF5-0.6GI_38683857ILMN_16705760
HapMaprs1901198IRF50.5GI_3868385817703580
Celiacrs6565724LOC4005660.67GI_62177143ILMN_17138030
HapMaprs6565724LOC400566-0.72GI_3754459360400080
HapMaprs6565724LOC400566-0.74GI_426612831031704030
Celiacrs2863095MRPL430.45GI_28872731ILMN_16521470
Celiacrs2863095MRPL43-0.49GI_28872733ILMN_17004770
HapMaprs2863095MRPL430.58GI_2887273139404650
HapMaprs2863095MRPL43-0.61GI_2887273359004870
HapMaprs4768933DIP2B0.42GI_174573881017807350
HapMaprs4768933DIP2B-0.44GI_3993039039402780
Celiacrs10774679OAS1-0.54GI_74229010ILMN_16582470
Celiacrs10774679OAS10.51GI_74229012ILMN_16726060
HapMaprs3177979OAS10.63GI_805162021001540
HapMaprs3177979OAS1-0.72GI_805162221000480
Celiacrs1040404TIPRL-0.47GI_73088904ILMN_17794320
Celiacrs1040404TIPRL0.5GI_73088933ILMN_17814570
HapMaprs222851C17orf81-0.72GI_4466282563801290
HapMaprs222851C17orf810.64GI_4466282927603010
Celiacrs34374PAM0.43GI_21070979ILMN_1788631GGCTACAGTCGAAAAGGGTTTGACCGGCTTAGCACTGAGGGCAGTGACCA0
HapMaprs34374PAM-0.41GI_2107097960056GCCAGTGTCTTTCTTTGGTGCCTTTCCTGTTCAGCATTCTTAGCCTGTGG0
HapMaprs2838859POFUT2-0.48GI_3414748659003411
HapMaprs2838859POFUT20.45Hs.3007361061100590
Celiacrs7084722PTER-0.43GI_47933342ILMN_17953360
HapMaprs7084722PTER0.48GI_2007018555700401
Celiacrs10503170MYOM20.43GI_4505314ILMN_1716733ATTTTCACGGGTGTGGGCACATGGGTGTGGCACCTGGACGTGTGCAGCAT1
HapMaprs10503170MYOM2-0.42GI_45053146620154TTTACACGAGGGTAGACGGCAGATGCCTGACAGAGAGTGGGTTGGCAGAC1
Celiacrs11680305ADI1-0.42GI_8922761ILMN_1795671CCGGTGGTGTGATGATGCCATATACCGCAGGGCTTGCTTCTGTCAAGTGT1
HapMaprs11680305ADI10.41GI_89227614730070GAGCTCCCACCCTAAGGGGCACACACTGAGTTGCTTATGCCACTTCCTTG0
Celiacrs2395185HLA-DRB5-0.41GI_26665892ILMN_1697499GGCTCTTATTCTTCCACAAGAGAGGACTTTCTCAGGCCCTGGTTGCTACC3
HapMaprs2395185HLA-DRB50.47GI_26665892450332ACGGCCTCCCATGCATCTGTACTCCCCCTGTGTGCCACAAATGCACTACG8

Meta-analysis of different significant but opposite allelic effects of SNPs (FDR = 0.05, SNP-probe midpoint distance < 250 kb) in the celiac peripheral blood dataset and HapMap B cell line data set. Shown are 14 genes that contain probes for either of the two platforms but that show opposite allelic directions. If probes have identical identifiers on the two platforms, but different oligonucleotide sequences, the oligonucleotide sequences are indicated.

*As provided by Illumina platform record.

Genes containing multiple probes that are affected by SNPs that also affect other probes in the same gene, but with opposite allelic directions Meta-analysis of different significant but opposite allelic effects of SNPs (FDR = 0.05, SNP-probe midpoint distance < 250 kb) in the celiac peripheral blood dataset and HapMap B cell line data set. Shown are 14 genes that contain probes for either of the two platforms but that show opposite allelic directions. If probes have identical identifiers on the two platforms, but different oligonucleotide sequences, the oligonucleotide sequences are indicated. *As provided by Illumina platform record.

Identification of trans effects and conditioned co-expression analysis

We used the celiac disease dataset to assess whether significant trans-eQTLs could be identified (Spearman's rank correlation test P-Value < 1 × 10-12, FDR = 0.05). Two trans-eQTLs were detected: rs2318331 (mapping within COL22A1) with GI_48093066 (NBPF3) and rs12634559 (mapping downstream of IL1RAP) with GI_16306505 (CASP8AP2). However, these trans-eQTLs were not detected in the HapMap B cell line dataset, even after relaxing the nominal Spearman rank correlation P-value to 0.05. As trans-eQTLs implicate (direct or indirect) biological relationships, it is somewhat disappointing that none were clearly identified. An alternative way to identify biological relationships is by using co-expression analysis (although such an analysis does not have the aim to identify or explain trans-eQTLs). We conducted such a co-expression analysis and compared it to a co-expression analysis that also takes the genetic variation into account. First, co-expression was assessed between each of the probes that constitute cis-eQTLs (SNP-probe midpoint distance < 250 kb, FDR = 0.05) and all other probes (Figure 4a) that do not map to the same chromosome, by calculating all pair-wise Spearman correlation coefficients. This resulted in 13,003,505 and 42,270,807 pair-wise tests for the celiac peripheral blood dataset (using 765 different cis-probes) and the HapMap B cell line dataset (using 994 different cis-probes), respectively. Through Bonferroni correction we accounted for multiple testing, resulting in the identification of 50,821 and 168,292 significantly correlated transcript-pairs respectively (Spearman's correlation test P-Value < 0.05 after multiple testing correction).
Figure 4

. c) An example for UBA52 indicates that a conditioned co-expression analysis can help to identify meaningful biological relationships: Within the conditioned co-expression distribution (indicated in dark blue) there are more strongly co-expressed genes, opposed to an unconditioned analysis (indicated in red). This is supported by an analysis of 156 known interacting genes for UBA52: In the conditioned co-expression analysis (indicated in light blue) co-expression is generally much stronger than in the unconditioned analysis (indicated in yellow).

. c) An example for UBA52 indicates that a conditioned co-expression analysis can help to identify meaningful biological relationships: Within the conditioned co-expression distribution (indicated in dark blue) there are more strongly co-expressed genes, opposed to an unconditioned analysis (indicated in red). This is supported by an analysis of 156 known interacting genes for UBA52: In the conditioned co-expression analysis (indicated in light blue) co-expression is generally much stronger than in the unconditioned analysis (indicated in yellow). Subsequently, we redid this co-expression analysis after having removed the genotypic effect on the probe intensity level for the probes that constitute the significant cis-eQTLs (Figure 4b). We observed a considerably increased number of significantly correlated transcript-pairs: 54,773 (an increase of 7.8%) and 258,874 (an increase of 53.8%), respectively. To validate whether this increase in co-expressed transcripts reflects known biology, we assessed a collection of 80,350 known biological interactions (derived on 17 April 2007 from KEGG [33], BioGrid [34], Reactome [35], BIND [36], HPRD [37] and IntAct [38]). For the celiac peripheral blood dataset 91 of the 50,821 (0.179%) significantly correlated transcript-pairs that had been identified in the unconditioned co-expression analysis are known to interact. In the conditioned co-expression analysis 106 of the 54,773 (0.194%) transcript pairs were known to interact, indicating an 8.0% increase in the proportion of known biological interactions that was identified. Comparable results were obtained for the HapMap B cell line dataset: For the unconditioned co-expression analysis 334 of the 168,292 (0.195%) significant transcript-pairs are known to interact. In the conditioned co-expression analysis 634 of the 258.874 (0.245%) significant transcript-pairs are known to biologically interact, representing an increase of 23.4% in the proportion of known interactions among the most significantly co-expressed transcript-pairs. Figure 4c provides an example for Ubiquitin (UBA52), for which a strong cis-eQTL was observed for probe GI_15451941 with rs2314664 (Spearman's correlation coefficient P-Value = 1.11 × 10-16) in the HapMap B cell line dataset. Within this probe three other polymorphism map (rs6554, rs34040670 and rs3209501), of which rs6554 is in near perfect LD with rs2314664 (r2 = 0.98, HapMap CEU population), suggesting this cis-eQTL does not reflect a real expression difference, but rather a difference in hybridization efficacy. Co-expression with other genes is already present in an unconditioned co-expression analysis, because the distribution of co-expression for UBA52 with all other genes differs significantly from a theoretical null-distribution (Wilcoxon-Mann Whitney P-Value < 1 × 10-12). However, in the conditioned co-expression analysis, this difference is more pronounced (see figure 4c). When confining this analysis to a set of 156 known interactions for UBA52, overall co-expression was significantly stronger for these pairs of genes in the conditioned (mean absolute Spearman's correlation coefficient = 0.39) than in the unconditioned co-expression analysis (mean absolute Spearman's correlation coefficient = 0.20, Wilcoxon-Mann Whitney P-Value < 10-50).

Discussion

We have demonstrated the use of peripheral blood RNA samples for the detection of cis-eQTLs and have shown that there is strong allelic concordance with cis-eQTLs that also had been detected in a HapMap B cell line dataset. These results indicate that a meta-analysis with larger sample size and hence statistical power results in a considerable increase in the detected cis-eQTL even though the arrays that had been used were different. Some cis-eQTLs can be observed across multiple tissue types, in all cases in the same allelic direction, suggesting a major conserved function. However most of the detected cis-eQTLs in these datasets were only detected in one of the two tissues, suggesting that more insight can be gained in the functional consequences of genetic variation by performing genetical genomics studies using different types of cells and tissues. This point is particularly relevant for identifying the function of risk variants for common diseases: for example study immune tissues for immune-mediated diseases, adipose tissue for obesity traits [8], brain tissue for neurological traits [10]. Genetical genomics experiments performed in outbred human populations are additionally complex and in addition to tissue specific RNA profiles, care should be taken to investigate alternative RNA sequence events, primer polymorphisms that generate false eQTLs and strong cis-eQTLs that obscure weaker trans effects. It is attractive to assume most of the observed cis-eQTLs reflect overall gene expression level alterations. However, we did observe 14 genes (Table 2) where different probes showed significant opposite allelic effects. For at least five out of the 14 genes (POFUT2, PTER, MYOM2, ADI1 and HLA-DRB5) polymorphisms within the probe are known to exist in dbSNP. For each of these genes, it may be that one cis-eQTL for instance reflects a real expression difference, whereas the other reflects a hybridization effect due to the presence of these SNPs within the probe. However, recently Kwan et al [39] provided another potential explanation, as they showed for three of the 14 genes (IRF5, MRPL43 and PTER), using Affymetrix GeneChip Human Exon 1.0 ST Arrays, that different genetic variants can result in premature 3' termination events. The SNPs that make up these cis-eQTLs are all in strong linkage disequilibrium with the SNPs Kwan et al described (IRF5: rs7808907 and rs6969930: D' = 1, R2 = 0.74 rs2863095, MRPL43: rs2863095 and rs12241232: D' = 0.89, R2 = 0.75, PTER: rs7909832 and rs1055340: D' = 1, R2 = 1), supporting our observations. Kwan et al examined different exonic effects through an independent validation using quantitative RT-PCR (out of a total of 25 validated genes) and estimated that only 39% of the detected cis-eQTLs influence overall gene expression levels. For the remaining cis-eQTLs genetic variation results in preliminary terminated transcripts (18%), not initiated transcripts (11%), transcripts that are spliced differentially (26%) or a combination of these (6%). We acknowledge that our use of oligonucleotide arrays, predominantly targeting the 3' end of genes, gives a more limited picture of splicing since only 3' termination events can be seen. However, the data presented above and alluded to by Kwan et al [39] suggest that SNP genotypes can have a significant effect on alternatively spliced transcript isoforms. This additional layer of complexity should be examined in future genetical genomics experiments to fully elucidate the genotypic effects on RNA. Although the two trans-effects identified (IL1RAP and CASP8AP2) are interestingly both involved in an apoptosis pathway, these trans-eQTLs were not detected in the HapMap B cell line dataset. These trans-eQTL results are in strong contrast to the number of cis-eQTLs we detected within the two data sets. This results primarily from the limited statistical power to detect trans-eQTLs, due to the number of statistical tests that need to be performed. Additionally, in order to cause a trans-effect, more intermediate biological steps are required that introduces additional biological noise, lessening the correlation between genotype and trans-gene expression. Cis-eQTLs can have a significant impact on the co-expression of genes; this genotypic variation may hinder the identification of significant trans effects. This adds an additional layer of complexity to the analysis of genotypic effects on gene expression in outbred populations. As it has been one of the goals of genetical genomics to identify biological relationships we suggest that the conditioned co-expression analysis we carried out here, might help to uncover these: We have shown that more known biological relationships can be identified when using genetical genomics to perform co-expression analyses that have been conditioned on genotype. An explanation for this observation is that cis-eQTLs sometimes convolute co-expression, as is exemplified for UBA52, where primer polymorphisms probably affect hybridization characteristics.

Conclusion

This study shows that PAXgene isolated peripheral blood RNA is a powerful resource for investigating functional consequences of genetic variation. We have shown that for some of the cis-eQTLs the functional consequences are more complex than previously assumed. Additionally, these findings imply that biological relationships can be extracted in outbred populations, although in a somewhat different manner than what is commonly used to detect biological relationships through trans-eQTLs in inbred model organisms. As this study has only combined genetics with genomics, we envision more extensive integrative approaches, incorporating e.g. epigenetics and proteomics, will help to improve the detection of previously unknown biological pathways.

Abbreviations

SNP: Single Nucleotide Polymorphisms; eQTL: Expression Quantitative Trait Locus

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

LCD and DVH collected patient samples. GH extracted RNA and performed microarray experiments assisted by MB. LF, RCJ, GH, CW and DVH designed the experiment. KH undertook the genotyping. LF and GH performed the data analysis assisted by RCJ and MAS. Sequencing was carried out by GT. GH and LF drafted the manuscript. All authors read and approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here:

Additional file 1

All detected Click here for file

Additional File 2

Biological processes and functions for the detected . Listed are significantly over- and underrepresented biological processes and functions for the detected cis-eQTLs (FDR = 0.05, SNP-probe midpoint distance < 250 kb), derived through the Panther Classification System (Binomial test, Bonferroni corrected) Click here for file
  38 in total

1.  Genetical genomics: the added value from segregation.

Authors:  R C Jansen; J P Nap
Journal:  Trends Genet       Date:  2001-07       Impact factor: 11.639

2.  A genome-wide association study of global gene expression.

Authors:  Anna L Dixon; Liming Liang; Miriam F Moffatt; Wei Chen; Simon Heath; Kenny C C Wong; Jenny Taylor; Edward Burnett; Ivo Gut; Martin Farrall; G Mark Lathrop; Gonçalo R Abecasis; William O C Cookson
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

3.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Authors:  Ewan Birney; John A Stamatoyannopoulos; Anindya Dutta; Roderic Guigó; Thomas R Gingeras; Elliott H Margulies; Zhiping Weng; Michael Snyder; Emmanouil T Dermitzakis; Robert E Thurman; Michael S Kuehn; Christopher M Taylor; Shane Neph; Christoph M Koch; Saurabh Asthana; Ankit Malhotra; Ivan Adzhubei; Jason A Greenbaum; Robert M Andrews; Paul Flicek; Patrick J Boyle; Hua Cao; Nigel P Carter; Gayle K Clelland; Sean Davis; Nathan Day; Pawandeep Dhami; Shane C Dillon; Michael O Dorschner; Heike Fiegler; Paul G Giresi; Jeff Goldy; Michael Hawrylycz; Andrew Haydock; Richard Humbert; Keith D James; Brett E Johnson; Ericka M Johnson; Tristan T Frum; Elizabeth R Rosenzweig; Neerja Karnani; Kirsten Lee; Gregory C Lefebvre; Patrick A Navas; Fidencio Neri; Stephen C J Parker; Peter J Sabo; Richard Sandstrom; Anthony Shafer; David Vetrie; Molly Weaver; Sarah Wilcox; Man Yu; Francis S Collins; Job Dekker; Jason D Lieb; Thomas D Tullius; Gregory E Crawford; Shamil Sunyaev; William S Noble; Ian Dunham; France Denoeud; Alexandre Reymond; Philipp Kapranov; Joel Rozowsky; Deyou Zheng; Robert Castelo; Adam Frankish; Jennifer Harrow; Srinka Ghosh; Albin Sandelin; Ivo L Hofacker; Robert Baertsch; Damian Keefe; Sujit Dike; Jill Cheng; Heather A Hirsch; Edward A Sekinger; Julien Lagarde; Josep F Abril; Atif Shahab; Christoph Flamm; Claudia Fried; Jörg Hackermüller; Jana Hertel; Manja Lindemeyer; Kristin Missal; Andrea Tanzer; Stefan Washietl; Jan Korbel; Olof Emanuelsson; Jakob S Pedersen; Nancy Holroyd; Ruth Taylor; David Swarbreck; Nicholas Matthews; Mark C Dickson; Daryl J Thomas; Matthew T Weirauch; James Gilbert; Jorg Drenkow; Ian Bell; XiaoDong Zhao; K G Srinivasan; Wing-Kin Sung; Hong Sain Ooi; Kuo Ping Chiu; Sylvain Foissac; Tyler Alioto; Michael Brent; Lior Pachter; Michael L Tress; Alfonso Valencia; Siew Woh Choo; Chiou Yu Choo; Catherine Ucla; Caroline Manzano; Carine Wyss; Evelyn Cheung; Taane G Clark; James B Brown; Madhavan Ganesh; Sandeep Patel; Hari Tammana; Jacqueline Chrast; Charlotte N Henrichsen; Chikatoshi Kai; Jun Kawai; Ugrappa Nagalakshmi; Jiaqian Wu; Zheng Lian; Jin Lian; Peter Newburger; Xueqing Zhang; Peter Bickel; John S Mattick; Piero Carninci; Yoshihide Hayashizaki; Sherman Weissman; Tim Hubbard; Richard M Myers; Jane Rogers; Peter F Stadler; Todd M Lowe; Chia-Lin Wei; Yijun Ruan; Kevin Struhl; Mark Gerstein; Stylianos E Antonarakis; Yutao Fu; Eric D Green; Ulaş Karaöz; Adam Siepel; James Taylor; Laura A Liefer; Kris A Wetterstrand; Peter J Good; Elise A Feingold; Mark S Guyer; Gregory M Cooper; George Asimenos; Colin N Dewey; Minmei Hou; Sergey Nikolaev; Juan I Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; Haiyan Huang; Nancy R Zhang; Ian Holmes; James C Mullikin; Abel Ureta-Vidal; Benedict Paten; Michael Seringhaus; Deanna Church; Kate Rosenbloom; W James Kent; Eric A Stone; Serafim Batzoglou; Nick Goldman; Ross C Hardison; David Haussler; Webb Miller; Arend Sidow; Nathan D Trinklein; Zhengdong D Zhang; Leah Barrera; Rhona Stuart; David C King; Adam Ameur; Stefan Enroth; Mark C Bieda; Jonghwan Kim; Akshay A Bhinge; Nan Jiang; Jun Liu; Fei Yao; Vinsensius B Vega; Charlie W H Lee; Patrick Ng; Atif Shahab; Annie Yang; Zarmik Moqtaderi; Zhou Zhu; Xiaoqin Xu; Sharon Squazzo; Matthew J Oberley; David Inman; Michael A Singer; Todd A Richmond; Kyle J Munn; Alvaro Rada-Iglesias; Ola Wallerman; Jan Komorowski; Joanna C Fowler; Phillippe Couttet; Alexander W Bruce; Oliver M Dovey; Peter D Ellis; Cordelia F Langford; David A Nix; Ghia Euskirchen; Stephen Hartman; Alexander E Urban; Peter Kraus; Sara Van Calcar; Nate Heintzman; Tae Hoon Kim; Kun Wang; Chunxu Qu; Gary Hon; Rosa Luna; Christopher K Glass; M Geoff Rosenfeld; Shelley Force Aldred; Sara J Cooper; Anason Halees; Jane M Lin; Hennady P Shulha; Xiaoling Zhang; Mousheng Xu; Jaafar N S Haidar; Yong Yu; Yijun Ruan; Vishwanath R Iyer; Roland D Green; Claes Wadelius; Peggy J Farnham; Bing Ren; Rachel A Harte; Angie S Hinrichs; Heather Trumbower; Hiram Clawson; Jennifer Hillman-Jackson; Ann S Zweig; Kayla Smith; Archana Thakkapallayil; Galt Barber; Robert M Kuhn; Donna Karolchik; Lluis Armengol; Christine P Bird; Paul I W de Bakker; Andrew D Kern; Nuria Lopez-Bigas; Joel D Martin; Barbara E Stranger; Abigail Woodroffe; Eugene Davydov; Antigone Dimas; Eduardo Eyras; Ingileif B Hallgrímsdóttir; Julian Huppert; Michael C Zody; Gonçalo R Abecasis; Xavier Estivill; Gerard G Bouffard; Xiaobin Guan; Nancy F Hansen; Jacquelyn R Idol; Valerie V B Maduro; Baishali Maskeri; Jennifer C McDowell; Morgan Park; Pamela J Thomas; Alice C Young; Robert W Blakesley; Donna M Muzny; Erica Sodergren; David A Wheeler; Kim C Worley; Huaiyang Jiang; George M Weinstock; Richard A Gibbs; Tina Graves; Robert Fulton; Elaine R Mardis; Richard K Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B Jaffe; Jean L Chang; Kerstin Lindblad-Toh; Eric S Lander; Maxim Koriabine; Mikhail Nefedov; Kazutoyo Osoegawa; Yuko Yoshinaga; Baoli Zhu; Pieter J de Jong
Journal:  Nature       Date:  2007-06-14       Impact factor: 49.962

4.  Discovery of expression QTLs using large-scale transcriptional profiling in human lymphocytes.

Authors:  Harald H H Göring; Joanne E Curran; Matthew P Johnson; Thomas D Dyer; Jac Charlesworth; Shelley A Cole; Jeremy B M Jowett; Lawrence J Abraham; David L Rainwater; Anthony G Comuzzie; Michael C Mahaney; Laura Almasy; Jean W MacCluer; Ahmed H Kissebah; Gregory R Collier; Eric K Moses; John Blangero
Journal:  Nat Genet       Date:  2007-09-16       Impact factor: 38.330

5.  A survey of genetic human cortical gene expression.

Authors:  Amanda J Myers; J Raphael Gibbs; Jennifer A Webster; Kristen Rohrer; Alice Zhao; Lauren Marlowe; Mona Kaleem; Doris Leung; Leslie Bryden; Priti Nath; Victoria L Zismann; Keta Joshipura; Matthew J Huentelman; Diane Hu-Lince; Keith D Coon; David W Craig; John V Pearson; Peter Holmans; Christopher B Heward; Eric M Reiman; Dietrich Stephan; John Hardy
Journal:  Nat Genet       Date:  2007-11-04       Impact factor: 38.330

6.  Relative impact of nucleotide and copy number variation on gene expression phenotypes.

Authors:  Barbara E Stranger; Matthew S Forrest; Mark Dunning; Catherine E Ingle; Claude Beazley; Natalie Thorne; Richard Redon; Christine P Bird; Anna de Grassi; Charles Lee; Chris Tyler-Smith; Nigel Carter; Stephen W Scherer; Simon Tavaré; Panagiotis Deloukas; Matthew E Hurles; Emmanouil T Dermitzakis
Journal:  Science       Date:  2007-02-09       Impact factor: 47.728

7.  Gene expression quantitative trait locus analysis of 16 000 barley genes reveals a complex pattern of genome-wide transcriptional regulation.

Authors:  Elena Potokina; Arnis Druka; Zewei Luo; Roger Wise; Robbie Waugh; Mike Kearsey
Journal:  Plant J       Date:  2007-10-17       Impact factor: 6.417

8.  A genome-wide association study for celiac disease identifies risk variants in the region harboring IL2 and IL21.

Authors:  David A van Heel; Lude Franke; Karen A Hunt; Rhian Gwilliam; Alexandra Zhernakova; Mike Inouye; Martin C Wapenaar; Martin C N M Barnardo; Graeme Bethel; Geoffrey K T Holmes; Con Feighery; Derek Jewell; Dermot Kelleher; Parveen Kumar; Simon Travis; Julian R F Walters; David S Sanders; Peter Howdle; Jill Swift; Raymond J Playford; William M McLaren; M Luisa Mearin; Chris J Mulder; Ross McManus; Ralph McGinnis; Lon R Cardon; Panos Deloukas; Cisca Wijmenga
Journal:  Nat Genet       Date:  2007-06-10       Impact factor: 38.330

9.  Reactome: a knowledge base of biologic pathways and processes.

Authors:  Imre Vastrik; Peter D'Eustachio; Esther Schmidt; Geeta Joshi-Tope; Gopal Gopinath; David Croft; Bernard de Bono; Marc Gillespie; Bijay Jassal; Suzanna Lewis; Lisa Matthews; Guanming Wu; Ewan Birney; Lincoln Stein
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

10.  Sequence polymorphisms cause many false cis eQTLs.

Authors:  Rudi Alberts; Peter Terpstra; Yang Li; Rainer Breitling; Jan-Peter Nap; Ritsert C Jansen
Journal:  PLoS One       Date:  2007-07-18       Impact factor: 3.240

View more
  45 in total

1.  eQTL analysis links inflammatory bowel disease associated 1q21 locus to ECM1 gene.

Authors:  Katja Repnik; Uroš Potočnik
Journal:  J Appl Genet       Date:  2016-01-06       Impact factor: 3.240

2.  Gene expression and genetic variation data implicate PCLO in bipolar disorder.

Authors:  Kwang H Choi; Brandon W Higgs; Jens R Wendland; Jonathan Song; Francis J McMahon; Maree J Webster
Journal:  Biol Psychiatry       Date:  2010-12-24       Impact factor: 13.382

3.  Haplotype in the IBD5 region is associated with refractory Crohn's disease in Slovenian patients and modulates expression of the SLC22A5 gene.

Authors:  Katja Repnik; Uroš Potočnik
Journal:  J Gastroenterol       Date:  2011-06-22       Impact factor: 7.527

4.  Integration of genome-wide association studies with biological knowledge identifies six novel genes related to kidney function.

Authors:  Daniel I Chasman; Christian Fuchsberger; Cristian Pattaro; Alexander Teumer; Carsten A Böger; Karlhans Endlich; Matthias Olden; Ming-Huei Chen; Adrienne Tin; Daniel Taliun; Man Li; Xiaoyi Gao; Mathias Gorski; Qiong Yang; Claudia Hundertmark; Meredith C Foster; Conall M O'Seaghdha; Nicole Glazer; Aaron Isaacs; Ching-Ti Liu; Albert V Smith; Jeffrey R O'Connell; Maksim Struchalin; Toshiko Tanaka; Guo Li; Andrew D Johnson; Hinco J Gierman; Mary F Feitosa; Shih-Jen Hwang; Elizabeth J Atkinson; Kurt Lohman; Marilyn C Cornelis; Asa Johansson; Anke Tönjes; Abbas Dehghan; Jean-Charles Lambert; Elizabeth G Holliday; Rossella Sorice; Zoltan Kutalik; Terho Lehtimäki; Tõnu Esko; Harshal Deshmukh; Sheila Ulivi; Audrey Y Chu; Federico Murgia; Stella Trompet; Medea Imboden; Stefan Coassin; Giorgio Pistis; Tamara B Harris; Lenore J Launer; Thor Aspelund; Gudny Eiriksdottir; Braxton D Mitchell; Eric Boerwinkle; Helena Schmidt; Margherita Cavalieri; Madhumathi Rao; Frank Hu; Ayse Demirkan; Ben A Oostra; Mariza de Andrade; Stephen T Turner; Jingzhong Ding; Jeanette S Andrews; Barry I Freedman; Franco Giulianini; Wolfgang Koenig; Thomas Illig; Christa Meisinger; Christian Gieger; Lina Zgaga; Tatijana Zemunik; Mladen Boban; Cosetta Minelli; Heather E Wheeler; Wilmar Igl; Ghazal Zaboli; Sarah H Wild; Alan F Wright; Harry Campbell; David Ellinghaus; Ute Nöthlings; Gunnar Jacobs; Reiner Biffar; Florian Ernst; Georg Homuth; Heyo K Kroemer; Matthias Nauck; Sylvia Stracke; Uwe Völker; Henry Völzke; Peter Kovacs; Michael Stumvoll; Reedik Mägi; Albert Hofman; Andre G Uitterlinden; Fernando Rivadeneira; Yurii S Aulchenko; Ozren Polasek; Nick Hastie; Veronique Vitart; Catherine Helmer; Jie Jin Wang; Bénédicte Stengel; Daniela Ruggiero; Sven Bergmann; Mika Kähönen; Jorma Viikari; Tiit Nikopensius; Michael Province; Shamika Ketkar; Helen Colhoun; Alex Doney; Antonietta Robino; Bernhard K Krämer; Laura Portas; Ian Ford; Brendan M Buckley; Martin Adam; Gian-Andri Thun; Bernhard Paulweber; Margot Haun; Cinzia Sala; Paul Mitchell; Marina Ciullo; Stuart K Kim; Peter Vollenweider; Olli Raitakari; Andres Metspalu; Colin Palmer; Paolo Gasparini; Mario Pirastu; J Wouter Jukema; Nicole M Probst-Hensch; Florian Kronenberg; Daniela Toniolo; Vilmundur Gudnason; Alan R Shuldiner; Josef Coresh; Reinhold Schmidt; Luigi Ferrucci; David S Siscovick; Cornelia M van Duijn; Ingrid B Borecki; Sharon L R Kardia; Yongmei Liu; Gary C Curhan; Igor Rudan; Ulf Gyllensten; James F Wilson; Andre Franke; Peter P Pramstaller; Rainer Rettig; Inga Prokopenko; Jacqueline Witteman; Caroline Hayward; Paul M Ridker; Afshin Parsa; Murielle Bochud; Iris M Heid; W H Linda Kao; Caroline S Fox; Anna Köttgen
Journal:  Hum Mol Genet       Date:  2012-09-08       Impact factor: 6.150

Review 5.  Systems genetics in "-omics" era: current and future development.

Authors:  Hong Li
Journal:  Theory Biosci       Date:  2012-11-09       Impact factor: 1.919

6.  Imputation of exome sequence variants into population- based samples and blood-cell-trait-associated loci in African Americans: NHLBI GO Exome Sequencing Project.

Authors:  Paul L Auer; Jill M Johnsen; Andrew D Johnson; Benjamin A Logsdon; Leslie A Lange; Michael A Nalls; Guosheng Zhang; Nora Franceschini; Keolu Fox; Ethan M Lange; Stephen S Rich; Christopher J O'Donnell; Rebecca D Jackson; Robert B Wallace; Zhao Chen; Timothy A Graubert; James G Wilson; Hua Tang; Guillaume Lettre; Alex P Reiner; Santhi K Ganesh; Yun Li
Journal:  Am J Hum Genet       Date:  2012-10-25       Impact factor: 11.025

7.  Population differences in transcript-regulator expression quantitative trait loci.

Authors:  Pierre R Bushel; Ray McGovern; Liwen Liu; Oliver Hofmann; Ahsan Huda; Jun Lu; Winston Hide; Xihong Lin
Journal:  PLoS One       Date:  2012-03-27       Impact factor: 3.240

8.  Multiple common variants for celiac disease influencing immune gene expression.

Authors:  Patrick C A Dubois; Gosia Trynka; Lude Franke; Karen A Hunt; Jihane Romanos; Alessandra Curtotti; Alexandra Zhernakova; Graham A R Heap; Róza Adány; Arpo Aromaa; Maria Teresa Bardella; Leonard H van den Berg; Nicholas A Bockett; Emilio G de la Concha; Bárbara Dema; Rudolf S N Fehrmann; Miguel Fernández-Arquero; Szilvia Fiatal; Elvira Grandone; Peter M Green; Harry J M Groen; Rhian Gwilliam; Roderick H J Houwen; Sarah E Hunt; Katri Kaukinen; Dermot Kelleher; Ilma Korponay-Szabo; Kalle Kurppa; Padraic MacMathuna; Markku Mäki; Maria Cristina Mazzilli; Owen T McCann; M Luisa Mearin; Charles A Mein; Muddassar M Mirza; Vanisha Mistry; Barbara Mora; Katherine I Morley; Chris J Mulder; Joseph A Murray; Concepción Núñez; Elvira Oosterom; Roel A Ophoff; Isabel Polanco; Leena Peltonen; Mathieu Platteel; Anna Rybak; Veikko Salomaa; Joachim J Schweizer; Maria Pia Sperandeo; Greetje J Tack; Graham Turner; Jan H Veldink; Wieke H M Verbeek; Rinse K Weersma; Victorien M Wolters; Elena Urcelay; Bozena Cukrowska; Luigi Greco; Susan L Neuhausen; Ross McManus; Donatella Barisani; Panos Deloukas; Jeffrey C Barrett; Paivi Saavalainen; Cisca Wijmenga; David A van Heel
Journal:  Nat Genet       Date:  2010-02-28       Impact factor: 38.330

9.  Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing.

Authors:  Graham A Heap; Jennie H M Yang; Kate Downes; Barry C Healy; Karen A Hunt; Nicholas Bockett; Lude Franke; Patrick C Dubois; Charles A Mein; Richard J Dobson; Thomas J Albert; Matthew J Rodesch; David G Clayton; John A Todd; David A van Heel; Vincent Plagnol
Journal:  Hum Mol Genet       Date:  2010-01-01       Impact factor: 6.150

10.  Geographical genomics of human leukocyte gene expression variation in southern Morocco.

Authors:  Youssef Idaghdour; Wendy Czika; Kevin V Shianna; Sang H Lee; Peter M Visscher; Hilary C Martin; Kelci Miclaus; Sami J Jadallah; David B Goldstein; Russell D Wolfinger; Greg Gibson
Journal:  Nat Genet       Date:  2009-12-06       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.