| Literature DB >> 30806704 |
Zhi-Wei Guo1, Chen Xie2, Kun Li1, Xiang-Ming Zhai1, Geng-Xi Cai3, Xue-Xi Yang1, Ying-Song Wu1.
Abstract
Super-enhancers (SEs) are enriched with a cluster of mediator binding sites, which are major contributors to cell-type-specific gene expression. Currently, a large quantity of long non-coding RNAs has been found to be transcribed from or to interact with SEs, which constitute super-enhancer associated long non-coding RNAs (SE-lncRNAs). These SE-lncRNAs play essential roles in transcriptional regulation through controlling SEs activity to regulate a broad range of physiological and pathological processes, especially tumorigenesis. However, the pathological functions of SE-lncRNAs in tumorigenesis are still obscure. In this paper, we characterized 5056 SE-lncRNAs and their associated genes by analysing 102 SE data sets. Then, we analysed their expression profiles and prognostic information derived from 19 cancer types to identify cancer-related SE-lncRNAs and to explore their potential functions. In total, 436 significantly differentially expressed SE-lncRNAs and 2035 SE-lncRNAs with high prognostic values were identified. Additionally, 3935 significant correlations between SE-lncRNAs and their regulatory genes were further validated by calculating their correlation coefficients in each cancer type. Finally, the SELER database incorporating the aforementioned data was provided for users to explore their physiological and pathological functions to comprehensively understand the blocks of living systems.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30806704 PMCID: PMC6390648 DOI: 10.1093/database/baz027
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1System overview of cancer-related SE-lncRNAs database construction. The workflow of cancer-related SE-lncRNA database construction mainly consisted of the following three sections: SE-lncRNA identification, cancer-related SE-lncRNA annotation and database construction. We first identified trans-acting and cis-acting SE-lncRNAs according to their regulatory mechanisms (left part of Figure 1). To explore cancer-related SE-lncRNAs, we identified significantly differentially expressed SE-lncRNAs and SE-lncRNAs with high prognostic values (right part of Figure 1). Moreover, we calculated the correlation coefficient along with the regulated genes of each cancer type to identify their truly regulatory relationships. Finally, the SELER database was built.
Data statistics of SE-lncRNAs and their regulated genes
| Mechanism | Super enhancer | lncRNA gene | lncRNA transcript | Gene | Gene transcript | Regulatory relationship |
|---|---|---|---|---|---|---|
| Cis-acting | 24 697 | 4996 | 8821 | 8171 | 57 843 | 16 272 |
| Trans-acting | 4629 | 123 | 179 | 4577 | 32 676 | 11 214 |
| Total | 27 029 | 5056 | 8908 | 9491 | 67 899 | 27 481 |
Gene = regulated genes of SE-lncRNAs. Gene transcript = regulated gene transcripts of SE-lncRNAs. Regulatory relationship = regulatory relations between SE-lncRNA and their regulated genes.
Data statistics of cancer-related SE-lncRNAs
| Type | DF | PV | DF&PV | Significant relationship |
|---|---|---|---|---|
| Cis-acting | 430 | 2032 | 347 | 3622 |
| Trans-acting | 13 | 19 | 7 | 401 |
| Total | 436 | 2035 | 349 | 3935 |
DF = significantly differentially expressed SE-lncRNAs. PV = SE-lncRNAs with a high prognostic value. DF&PV = significantly differentially expressed SE-lncRNAs with a high prognostic value. Significant relationship = significant regulatory relationship with P-value≤0.05 in Cox’s proportional hazard analysis.
Figure 2Number of different cancer type with identical SE-lncRNAs. (A) Significantly differentially expressed SE-lncRNAs. (B) SE-lncRNAs with high prognostic values. Number means the number of cancer types with identical SE-lncRNAs.
Figure 3Sample output diagram for the result of the SE-lncRNA section. (A) Information about super enhancers, SE-lncRNAs and their associated genes. (B) Cancer-related information about SE-lncRNAs, including their expression profiles, prognostic information and significantly associated genes in cancers.