| Literature DB >> 33024230 |
Vasilis F Ntasis1, Nikolaos I Panousis2,3,4,5, Maria G Tektonidou6,7, Emmanouil T Dermitzakis2,3,4,8, Dimitrios T Boumpas7,8,9,10, George K Bertsias11,12, Christoforos Nikolaou13,14,15.
Abstract
Systemic Lupus Erythematosus (SLE) is the prototype of autoimmune diseases, characterized by extensive gene expression perturbations in peripheral blood immune cells. Circumstantial evidence suggests that these perturbations may be due to altered epigenetic profiles and chromatin accessibility but the relationship between transcriptional deregulation and genome organization remains largely unstudied. In this work we propose a genomic approach that leverages patterns of gene coexpression from genome-wide transcriptome profiles in order to identify statistically robust Domains of Co-ordinated gene Expression (DCEs). Application of this method on a large transcriptome profiling dataset of 148 SLE patients and 52 healthy individuals enabled the identification of significant disease-associated alterations in gene co-regulation patterns, which also correlate with SLE activity status. Low disease activity patient genomes are characterized by extensive fragmentation leading to overall fewer DCEs of smaller size. High disease activity genomes display extensive redistribution of co-expression domains with expanded and newly-appearing (emerged) DCEs. The dynamics of domain fragmentation and redistribution are associated with SLE clinical endophenotypes, with genes of the interferon pathway being highly enriched in DCEs that become disrupted and with functions associated to more generalized symptoms, being located in domains that emerge anew in high disease activity genomes. Our results suggest strong links between the SLE phenotype and the underlying genome structure and underline an important role for genome organization in shaping gene expression in SLE.Entities:
Mesh:
Year: 2020 PMID: 33024230 PMCID: PMC7539002 DOI: 10.1038/s41598-020-73654-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Differential patterns of domains of co-ordinated expression (DCEs) in healthy and patient groups. (a) The DCE detection pipeline is represented as a series of ‘transformations’ applied to the expression data. We start by calculating the expression profile of each chromosomal bin using the expression profile of the encompassed genes (i). We then calculate the correlation coefficients between the bins located on the same chromosome (ii). Next, the correlation profile of each chromosome is transformed into a one-dimensional binsignal profile (iii). We analyze that profile, detecting local minima and maxima in order to determine the borders of the domains. Finally, a statistical evaluation of those borders results in the final DCE coordinates (iv). (b) Domainograms depicting the distribution of DCEs for the healthy and the three patient groups studied. The color of DCEs represent the respective average binsignal of the chromosomal bins encompassed. (c) Violin plots illustrating the estimated distribution of DCE sizes in each group. Classic boxplots are included. The scale of the y axis is logarithmic (log(bps)). (d) Average bin signal (co-expression score) for each group. (E) Violin plots representing, for each chromosome, the percentage of chromosomal bins that contain genes, with non-zero expression value, and form DCEs. (c, e). The results of Mann–Whitney–Wilcoxon tests comparing each patient group to the healthy group are demonstrated by the significance level indicators.
Figure 2DCEs are extensively fragmented and redistributed in SLE patients and correlate with functional signatures and epigenetic marks. (a) Heatmap presenting the different types of DCE reorganization. Numbers inside cells indicate the ratio of the number of DCEs, of the respective type, over the total number of DCEs for each patient group. Colour code is corresponding to column z-score of ratios. (b) Heatmap depicting the results of an enrichment test for DCEs in the functionally annotated WGCNA modules. (c) Heatmap depicting the results of an enrichment test for DCEs in different genome subcompartments. (a–c) Scaling and centering has been performed per column. Trees are illustrating the outcome of hierarchical clustering performed on the data. (b, c) Symbols inside cells demonstrate the significance level of the outcome of each test (*:0.05; **:0.01; ***:0.001). Significance has been assessed by a non-parametric, permutation-based test.
Figure 3Functional analysis of the disruption events. (a) Enrichment analysis of ‘Disruptors’ in genes that are commonly regulated (suggested by the mutual regulatory motif matches—TRANSFAC database) by transcription factors indicated on y axis. The overlap between the query gene set and the corresponding Pathway members or TF-target genes are displayed on the x axis. The color of each bar illustrates the corrected p value of the corresponding enrichment test. (b) Average positional enrichments of susceptibility and severity genes[6] against different types of DCEs. Significance levels of one hundred permutations (*:0.05; **:0.01). (c) Protein interaction networks for susceptibility signature genes that are found to be differentially expressed and overlapping split DCE boundaries, as obtained from STRING-DB[35]. Genes are grouped on the basis of a modularity analysis. Modules are shown with coloured polygons around genes (red: interferon signature genes, cyan: DAP12 signaling, lime: neutrophil module, green: B-cell module). (d) Pathway enrichment analysis of genes which correspond to enhancer-TSS links (CD4+ cells—Enhancer Atlas)[37], that are nested in healthy group DCEs but disrupted in SLE. The top 20 most significant KEGG or/and REACTOME pathways are presented.
Figure 4Examples of alterations in the co-expression profile. Heatmaps of expression correlation for selected loci of characteristic cases of disrupted (top), expanded (middle), deleted (bottom left) and emerged DCEs (bottom right). Heatmaps were created with the Sushi package from Bioconductor (https://bioconductor.org/packages/release/bioc/html/Sushi.html). Values in heatmaps correspond to bin signal, while the tracks below them show (from top to bottom) gene positions colour-coded for differential expression as log2(fold-change), DCE coordinates and enhancer-promoter associations that are entirely included in the same DCE (in blue) or not (in red). Names of differentially expressed genes in each locus are shown on the side of each panel.
Figure 5Patterns of gene co-expression in SLE. Graphical representation of the most prominent characteristics of gene co-expression patterns. Healthy genomes have extended domains of co-ordinated gene expression (DCE), but these become shorter and more fragmented in the genomes of patients with low disease activity. A significant number of differentially expressed genes (DEGs) in SLE low-activity genomes are associated with a disease “susceptibility” signature, located in split DCEs, linked to interferon and other signaling pathways and enriched in regions of low chromatin accessibility. In high disease activity genomes DCEs are re-distributed in new regions, where genes linked a SLE “severity” signature and more generalized manifestations of the disease (e.g. nephritis) localize in areas of DCE contraction, depletion and overall increased chromatin accessibility.