| Literature DB >> 32895400 |
Lyubov E Salnikova1,2, Maryam B Khadzhieva3,4, Dmitry S Kolobkov3,5,6, Alesya S Gracheva3,4, Artem N Kuzovlev4, Serikbay K Abilev3.
Abstract
Dysregulation in cytokine production has been linked to the pathogenesis of various immune-mediated traits, in which genetic variability contributes to the etiopathogenesis. GWA studies have identified many genetic variants in or near cytokine genes, nonetheless, the translation of these findings into knowledge of functional determinants of complex traits remains a fundamental challenge. In this study we aimed at collection, analysis and interpretation of data on cytokines focused on their tissue-specific expression, eQTLs and GWAS traits. Using GO annotations, we generated a list of 314 cytokines and analyzed them with the GTEx resource. Cytokines were highly tissue-specific, 82.3% of cytokines had Tau expression metrics ≥ 0.8. In total, 3077 associations for 1760 unique SNPs in or near 244 cytokines were mapped in the NHGRI-EBI GWAS Catalog. According to the Experimental Factor Ontology resource, the largest numbers of disease associations were related to 'Inflammatory disease', 'Immune system disease' and 'Asthma'. The GTEx-based analysis revealed that among GWAS SNPs, 1142 SNPs had eQTL effects and influenced expression levels of 999 eGenes, among them 178 cytokines. Several types of enrichment analysis showed that it was cytokines expression variability that fundamentally contributed to the molecular origins of considered immune-mediated conditions.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32895400 PMCID: PMC7477549 DOI: 10.1038/s41598-020-71018-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of the study design.
Genes regulating proteins with cytokine activity (GO:0005125), chemokine activity (GO:0008009), cytokine receptor activity (GO:0004896) and chemokine receptor activity (GO:0004950).
| Cytokine activity (n = 170) |
| Cytokine and chemokine activity (n = 46) |
| Chemokine activity (n = 4) |
| Cytokine receptor activity (n = 65) |
| Cytokine and cytokine receptor activity (n = 3) |
| Chemokine receptor activity (n = 24) |
| Chemokine and cytokine receptor activity (n = 2) |
*GO:0008083, growth factor activity.
Figure 2Expression profiles and genome-wide eQTL data for cytokines. The tissues are shown by abbreviations (Supplementary Table S2) and grouped by tissue categories (used throughout). (a) Gene expression profiles, built from TPM (Transcripts Per Million). (b) Spearman’s rank correlation matrix of tissue gene expression (heatmap). (c) Distribution of Tau (tissue specificity) scores. (d) Tissue-specific genes (Tau ≥ 0.8, TSI (Tissue Specificity Index) ≥ 0.3) with the highest levels of expression in corresponding tissues. Circles colored in accordance with the legend for tissue categories show TPM value for the tissue(s) with the highest level of expression for a given gene. Mean ± SD for TPM values across all tissues for a gene of interest is colored in red. Grey circles represent TPM values for a given gene in other tissues. Thirteen tissues from different brain regions are signed with a common label ‘BRAIN’. (e) Manhattan plot with the number of associations for eQTLs (Y-axis) targeting cytokines. Top ten genes are signed (upper panel). Manhattan plot with the number of eQTLs (Y-axis) for cytokines. Top ten genes are signed (bottom panel). (f) A density plot showing the distribution of the number of eQTLs per gene in the set of tissue-specific genes and in the set of ‘other’ genes.
Figure 3Direction of eQTL-gene pair effects. eQTL-gene pair effects are presented if the number of unidirectional or bidirectional associations for a given gene pair were ≥ 100. Genes are listed according their chromosome location. Bidirectional (left) or unidirectional (right) effects of eQTL-gene pairs are visualized by color. The effect information includes the number of eQTL-gene pairs (circles), the number of relevant tissues (squares) and the eQTLs distance (triangles).
Figure 4Representation of cytokine gene associations in the NHGRI-EBI GWAS Catalog. Circos plots show the proportions of top-ranked EFO (Experimental Factor Ontology) classifications for cytokines associations found in the NHGRI-EBI GWAS Catalog. These classifications were categorized by: (a) disease type, (b) disease by anatomical system (for non-oncological diseases), (c) type of measurement, (d) disease. The majority of the associations were described by several classification units, illustrated by the individual colored ribbons. Several associations were found in multi-trait studies (Supplementary Table S5). In figure panel (d), ribbons indicate associations that were both uniquely mapped and were studied together within the same framework. (e) Graph with top 20 genes by the number of corresponding associations. The numbers of unique SNPs, mapped traits and PubMed papers are also indicated.
Figure 5Population distribution and functional characterization of GWAS and LD SNPs. (a) GWAS-identified SNPs (left panel) and LD SNPs (right panel) by population are shown in a Venn diagram. In the population where GWAS (index) SNPs were detected, we selected SNPs in LD (linkage disequilibrium) with index SNPs using a threshold of r2 > 0.8. Only index SNPs were analyzed in mixed populations with different ethnicity or in populations, which could not be assigned to one of the four populations: EUR (European), ASN (East Asians), AFR (Africans including African Americans) and AMR (Admixed American). (b) Proportions of index and LD SNPs in the four populations. Asterisks indicate significant differences between EUR and ASN (P = 0.004) in the proportion of index and LD SNPs. (c) Density and box plots for functional IW-scores (K10) for index and LD SNPs. IW-scores (K10) were obtained from ten different scoring systems via IW-Scoring tool: an integrative weighted scoring framework to annotate and prioritize noncoding variations. (d) Distribution of genomic region annotations for index and LD SNPs (INT, intronic variant; NSM, non-synonymous missense variant; SYN, synonymous variant; U3, 3′untranslated region variant; U5, 5′untranslated region variant). (e) Distribution of potential regulatory sequences for index and LD SNPs. (f) Distribution of additional functional annotations for index and LD SNPs. SIFT and PolyPhen prediction scores were used to predict pathogenicity of amino acid substitutions. (d, e) Data were obtained from HaploReg v4; (f) Data were obtained from SNPnexus.
Figure 6Evidence of natural selection for the GWAS SNPs in the GDF5 and IL1RL1/IL18R1 genes. Data were extracted from the 1000 Genomes Selection Browser 1.0 for two natural selection tests: global Fst and integrated haplotype score (iHS). (a) Global Fst values for the SNP rs143384 (Fst = 0.649) and rs224333 (Fst = 0.714) in the GDF5 gene. GWAS signals for the SNPs rs143384 and rs224333 were reported for European and mixed ancestry populations. (b) iHS signals for the SNPs rs2001461 (iHS = 4.54), rs1420103 (iHS = 3.46) and rs6419573 (iHS = 3.34) in the IL18R1/IL1RL1 locus in CEU population. GWAS signals for these SNPs were reported for European ancestry individuals.
Figure 7eQTL analysis of GWAS Catalog SNPs in cytokine genes. (a) Distribution of eSNPs and non-eSNPs by genomic region (upper panel). Number of GTEx associations per eSNP by genomic region (middle panel). IW-scores (K10) by genomic region (bottom panel). (b) Circos plot illustrating by chromosomal region distribution of the numbers of GWAS Catalog associations, unique GWAS Catalog SNPs, unique eSNPs, eSNP associations in the GTEx v.8 database and target genes (from the periphery to the center). (c) Spectrum of GWAS diseases associated with eSNPs in cytokine genes (frequency of association occurrence ≥ 5). (d) Results of ARCHS4 Tissues enrichment analysis for 655 target genes encoding proteins involved in protein interactions. (e) Top ten unique GO terms returned by GO enrichment analysis for the whole set of target genes. (f) Jaccard tissue similarity matrix for eQTL profiles based on matching eQTLs with their target genes and NES (Normalized effect size) direction: for the whole set of target genes (above the diagonal) and for the subset of target cytokines (under the diagonal). *(c) Ankylosing spondylitis, psoriasis, ulcerative colitis, Crohn's disease, sclerosing cholangitis; (e) positive regulation of phosphate metabolic process.