| Literature DB >> 29650998 |
Tobias Strunz1,2, Felix Grassmann2, Javier Gayán1, Satu Nahkuri1, Debora Souza-Costa1, Cyrille Maugeais1, Sascha Fauser1, Everson Nogoceke1, Bernhard H F Weber3.
Abstract
Genome-wide association studies (GWAS) have identified numerous genetic variants in the human genome associated with diseases and traits. Nevertheless, for most loci the causative variant is still unknown. Expression quantitative trait loci (eQTL) in disease relevant tissues is an excellent approach to correlate genetic association with gene expression. While liver is the primary site of gene transcription for two pathways relevant to age-related macular degeneration (AMD), namely the complement system and cholesterol metabolism, we explored the contribution of AMD associated variants to modulate liver gene expression. We extracted publicly available data and computed the largest eQTL data set for liver tissue to date. Genotypes and expression data from all studies underwent rigorous quality control. Subsequently, Matrix eQTL was used to identify significant local eQTL. In total, liver samples from 588 individuals revealed 202,489 significant eQTL variants affecting 1,959 genes (Q-Value < 0.001). In addition, a further 101 independent eQTL signals were identified in 93 of the 1,959 eQTL genes. Importantly, our results independently reinforce the notion that high density lipoprotein metabolism plays a role in AMD pathogenesis. Taken together, our study generated a first comprehensive map reflecting the genetic regulatory landscape of gene expression in liver.Entities:
Mesh:
Year: 2018 PMID: 29650998 PMCID: PMC5897392 DOI: 10.1038/s41598-018-24219-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Study and sample summary
| Study | Schadt | Schroeder | Innocenti | GTEx Start/Mida | Meta-analysis | Mega-analysis |
|---|---|---|---|---|---|---|
| Sample size before/after QC | 178/178 | 149/149 | 208/178 | 97/83 | 588 | 588 |
| Origin of liver tissue | Post-mortem tissue and resections from donor livers | Normal tissue resected during surgery for liver cancer | Post mortem tissue and resections from donor livers | Post mortem tissue | — | — |
| Transcriptome profiling platform | Agilent Custom 44k | Illumina Human WG-6v2.0 | Agilent 4 × 44 k | RNA-seq (Illumina HiSeq2000) | — | — |
| Probes/genes before QC | 40,638 | 48,701 | 45,015 | 56,318 | — | — |
| Genes after QC | 24,123 | 24,123 | 24,123 | 24,123 | 24,123 | 24,123 |
| Genotyping platform | Affymetrix 500k; Illumina 650 Y | Illumina HumanHap300 | Illumina 610 Quad | Illumina Omni 5 M/2.5 Ma | — | — |
| Variants before QC | 449,699 | 318,237 | 620,901 | 2,526,494/2,378,075a | — | — |
| Variants after QC | 383,719 | 296,718 | 545,886 | 2,389,798/2,119,410a | — | — |
| Variants merged before imputationb | 861,575 | 861,575 | 861,575 | 861,575 | 861,575 | 861,575 |
| Variants after imputation and QC | 6,256,941 | 6,256,941 | 6,256,941 | 6,256,941 | 6,256,941 | 6,256,941 |
| eQTL variants (Q-Value < 1 × 10−3) | 29,546 | 71,423 | 52,565 | 19,802 | 101,148 | 202,489 |
| eQTL variants (Q-Value < 1 × 10−3, unique) | 27,689 | 69,292 | 49,594 | 16,953 | 95,257 | 183,872 |
| eQTL genes (Q-Value < 1 × 10−3, unique) | 363 | 913 | 670 | 387 | 1,313 | 1,959 |
| Overlapping eQTL genes with meta-analysis (Q-Value < 1 × 10−3) | 215 (59.23%) | 491 (53.78%) | 408 (60.9%) | 149 (38.5%) | 1,313 (100%) | 1,260 (64.32%) |
| Overlapping eQTL genes with mega-analysis (Q-Value < 1 × 10−3) | 288 (79.34%) | 688 (75.36%) | 537 (80.15%) | 207 (53.49%) | 1,260 (95.96%) | 1,959 (100%) |
| Independent signals (P-Value < 1 × 10−6) | — | — | — | — | — | 2,060 |
QC = quality control; aOmni 2.5 M for the first data release (GTEx start) and Omni 5 M for the mid-point release (GTEx mid). bAfter quality control the genotype files of the four studies were merged into a single file and variants, which did not overlap in-between datasets, were assigned missing. We only kept variants which were genotyped in at least 100 samples.
Figure 1Manhattan plot of the eQTL mega-analysis in liver. A mega-analysis was conducted including 588 samples from four independent studies measuring eQTL variants in liver tissue. The Manhattan plot shows the −log10 Q-Values of the most significant variant for each of the 24,123 analysed autosomal genes. Additionally, 101 independent secondary signals were identified and are highlighted in red. The blue line depicts the threshold for significance at 1 × 10−3.
Figure 2Characterisation of independent signal eQTL variants based on their genomic localisation. The distance to the nearest transcription start site (TSS) is plotted against the −log10 P-Values of the most significant variant at each eQTL gene, including secondary signals (independent hits). Negative/positive distances denote that the variant is located upstream/downstream of the TSS with regard to the direction of transcription.
Figure 3Functional annotations and predicted consequences of local eQTL-variants. Three sets of variants were evaluated by employing two different databases. Set one (control) includes random variants of the imputed genotype file, which are located next to at least one gene within a distance of a maximum of 1 Mb. Set two (mega-analysis) consists of all significant mega-analysis (Q-Value < 1 × 10−3) eQTL variants while the third group comprises the most significant variant of each independent hit (including the independent secondary signal variants). (A) The chart depicts the percentage of variants per variant set categorised into seven groups by RegulomeDB. The seven-level functional score is based on a synthesis of data derived from various sources: category 1 variants are very likely to affect binding and are linked to gene expression of a target gene (i.e. are known eQTL variants); categories 2 and 3 are likely to affect at least transcription factor binding and several other regulatory effects; categories 4–6 show minimal functional indication while category 7 variants lack evidence for any functional relevance. (B) The chart shows the percentage of variants classified into ten classes of consequences according to the Ensembl Variant Effect Predictor (VEP). For variant set two (mega-analysis) and three (independent hits) we only included the predicted consequence affecting the identified eQTL gene. For the control group, one random gene within a variant–gene distance of a maximum of 1 Mb was chosen. We selected the most severe effect, if the variant had different effects on transcripts of the same gene. ***P-Value for difference between groups <0.001.
eQTL variants overlapping with genome-wide significant AMD variants.
| IH* | dbSNP ID | CHR | Position [hg19] | Gene ID (ENSG) | Gene Symbol | P-Value | Q-Value | Effect Size** | SE | Non-risk allele | Risk allele | Frequency of risk allele | Distance to TSS |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1.2 | rs570618 | chr1 | 196,657,064 | ENSG00000244414 | CFHR1 | 2.15E-12 | 4.34E-10 | 0.711 | 0.099 | G | T | 0.360307 | −131822 |
| 1.1 | rs10922109 | chr1 | 196,704,632 | ENSG00000134365 | CFHR4 | 3.29E-24 | 1.66E-21 | 1.118 | 0.105 | A | C | 0.554124 | −114738 |
| 1.1 | rs10922109 | chr1 | 196,704,632 | ENSG00000244414 | CFHR1 | 7.56E-24 | 2.54E-21 | 0.992 | 0.094 | A | C | 0.554124 | −84254 |
| 1.1 | rs10922109 | chr1 | 196,704,632 | ENSG00000116785 | CFHR3 | 8.38E-17 | 2.11E-14 | 0.923 | 0.107 | A | C | 0.554124 | −39292 |
| 1.1 | rs10922109 | chr1 | 196,704,632 | ENSG00000143278 | F13B | 0.0002 | 0.012 | 0.216 | 0.057 | A | C | 0.554124 | −303688 |
| 1.1 | rs10922109 | chr1 | 196,704,632 | ENSG00000000971 | CFH | 0.0004 | 0.025 | 0.338 | 0.095 | A | C | 0.554124 | 83625 |
| 1.6 | rs61818925 | chr1 | 196,815,450 | ENSG00000116785 | CFHR3 | 1.38E-08 | 1.55E-06 | 0.649 | 0.113 | G | T | 0.417647 | 71526 |
| 1.6 | rs61818925 | chr1 | 196,815,450 | ENSG00000244414 | CFHR1 | 5.97E-05 | 0.006 | 0.416 | 0.103 | G | T | 0.417647 | 26564 |
| 1.6 | rs61818925 | chr1 | 196,815,450 | ENSG00000134389 | CFHR5 | 0.0001 | 0.011 | −0.371 | 0.096 | G | T | 0.417647 | −131216 |
| 11 | rs7803454 | chr7 | 99,991,548 | ENSG00000121716 | PILRB | 5.67E-27 | 5.72E-24 | 0.251 | 0.022 | C | T | 0.188567 | 57812 |
| 11 | rs7803454 | chr7 | 99,991,548 | ENSG00000085514 | PILRA | 6.16E-11 | 1.04E-08 | 0.372 | 0.056 | C | T | 0.188567 | 26396 |
| 23.1 | rs2043085 | chr15 | 58,680,954 | ENSG00000128918 | ALDH1A2 | 0.0002 | 0.016 | 0.207 | 0.056 | T | C | 0.667257 | 435333 |
| 23.2 | rs2070895 | chr15 | 58,723,939 | ENSG00000166035 | LIPC | 5.45E-09 | 6.88E-07 | 0.561 | 0.095 | A | G | 0.80531 | 21172 |
| 23.2 | rs2070895 | chr15 | 58,723,939 | ENSG00000137845 | ADAM10 | 0.0003 | 0.021 | −0.217 | 0.06 | A | G | 0.80531 | −163463 |
| 24.2 | rs17231506 | chr16 | 56,994,528 | ENSG00000087237 | CETP | 8.48E-05 | 0.008 | −0.216 | 0.055 | C | T | 0.327434 | −1233 |
| 27 | rs6565597 | chr17 | 79,526,821 | ENSG00000182612 | TSPAN10 | 1.70E-09 | 2.46E-07 | −0.526 | 0.086 | C | T | 0.383459 | −77375 |
| 27 | rs6565597 | chr17 | 79,526,821 | ENSG00000184009 | ACTG1 | 0.0002 | 0.016 | 0.312 | 0.084 | C | T | 0.383459 | 49825 |
| 27 | rs6565597 | chr17 | 79,526,821 | ENSG00000141552 | ANAPC11 | 0.0006 | 0.036 | −0.171 | 0.05 | C | T | 0.383459 | −321844 |
CHR: chromosome; TSS: transcription start site; SE: standard error of the effect size.
*IH: independent hit according to Fritsche et al.[3].
**Effect size (beta) of a single AMD risk increasing allele.