| Literature DB >> 33207694 |
Vanessa M Conn1,2, Marta Gabryelska1,2, Shashikanth Marri1,2, Brett W Stringer1,2,3, Rebecca J Ormsby1,2, Timothy Penn1,2, Santosh Poonnoose2,4, Ganessan Kichenadasse2,5, Simon J Conn1,2.
Abstract
High-throughput RNA sequencing (RNA-seq) and dedicated bioinformatics pipelines have synergized to identify an expansive repertoire of unique circular RNAs (circRNAs), exceeding 100,000 variants. While the vast majority of these circRNAs comprise canonical exonic and intronic sequences, microexons (MEs)-which occur in 30% of functional mRNA transcripts-have been entirely overlooked. CircRNAs which contain these known MEs (ME-circRNAs) could be identified with commonly utilized circRNA prediction pipelines, CIRCexplorer2 and CIRI2, but were not previously recognized as ME-circRNAs. In addition, when employing a bespoke bioinformatics pipeline for identifying RNA chimeras, called Hyb, we could also identify over 2000 ME-circRNAs which contain novel MEs at their backsplice junctions, that are uncalled by either CIRCexplorer2 or CIRI2. Analysis of circRNA-seq datasets from gliomas of varying clinical grades compared with matched control tissue has shown circRNAs have potential as prognostic markers for stratifying tumor from healthy tissue. Furthermore, the abundance of microexon-containing circRNAs (ME-circRNAs) between tumor and normal tissues is correlated with the expression of a splicing associated factor, Serine/arginine repetitive matrix 4 (SRRM4). Overexpressing SRRM4, known for regulating ME inclusion in mRNAs critical for neural differentiation, in human HEK293 cells resulted in the biogenesis of over 2000 novel ME-circRNAs, including ME-circEIF4G3, and changes in the abundance of many canonical circRNAs, including circSETDB2 and circLBRA. This shows SRRM4, in which its expression is correlated with poor prognosis in gliomas, acts as a bona fide circRNA biogenesis factor. Given the known roles of MEs and circRNAs in oncogenesis, the identification of these previously unrecognized ME-circRNAs further increases the complexity and functional purview of this non-coding RNA family.Entities:
Keywords: SRRM4; alternative splicing; circular RNAs; glioblastoma; microexons; splicing factors
Mesh:
Substances:
Year: 2020 PMID: 33207694 PMCID: PMC7697094 DOI: 10.3390/cells9112488
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1Circular RNA (circRNA) profiling of human gliomas can stratify glioblastoma (GBM) tumors. (a) Canonical linear splicing versus non-canonical, back-splicing producing mRNA and circRNA, respectively. Arc highlights the location of the backsplice junction. (b) Scaled Venn diagram showing total number of circRNAs identified from all 19 brain, circRNA-seq libraries using three different circRNA identification pipelines, CIRCexplorer2, CIRI, and Hyb. (c) Two-dimensional PCA plot utilizing circRNA-seq output from CIRCexplorer2 on 19 samples across healthy, grade II, III, and IV (GBM) gliomas. Ellipses included to highlight stratification of healthy tissue (purple) and grade IV tumor (blue) for each of the three circRNA prediction pipelines. (d) Expanded volcano plot showing circRNAs regulated between healthy tissue and grade II (left panel), grade III (middle panel) and grade IV, GBM (right panel). (e) Heat map showing representative examples of circRNAs specific to healthy brain tissue, grade-specific (grade II and grade IV (GBM)) using CIRCexplorer2. The color scale bar represents the log-transformed normalized reads.
Sample ID, tumor type and grade, RNA sequencing read and alignment statistics. Tumor sample ID #267 (colored in red) was excluded from RNA seq analysis due to low percentage mapped reads.
| Sample ID | Tumor Type | Grade | # Input Reads | # Uniquely Mapped Reads |
|---|---|---|---|---|
| 158 | Astrocytoma | II | 26,343,021 | 89.1% |
| 190 | Oligodendroglioma | II | 27,302,075 | 88.8% |
| 217 | Oligodendroglioma | II | 25,241,413 | 84.9% |
| 207 | Diffuse glioma | II | 23,492,240 | 88.4% |
| 267 | Oligodendroglioma | II | 15,437,385 | 39.1% |
| 150 | Anaplastic astrocytoma | III | 21,617,416 | 86.6% |
| 154 | Anaplastic oligodendroglioma | III | 29,164,440 | 90.8% |
| 172 | Anaplastic oligodendroglioma | III | 42,012,355 | 66.5% |
| 213 | Anaplastic astrocytoma | III | 25,758,766 | 87.6% |
| 214 | Anaplastic astrocytoma | III | 28,412,184 | 85.7% |
| 169 | Glioblastoma | IV | 28,268,617 | 87.6% |
| 170 | Glioblastoma | IV | 27,580,126 | 82.7% |
| 203 | Glioblastoma | IV | 27,106,465 | 88.4% |
| 61 | Glioblastoma | IV | 29,197,512 | 83.0% |
| 160 | Glioblastoma | IV | 23,140,292 | 86.4% |
| 179 | Control | 22,852,102 | 80.3% | |
| 61_MC_NOR | Matched Control | 29,806,403 | 87.3% | |
| 158_MC_NOR | Matched Control | 28,019,827 | 88.2% | |
| 170_MC_NOR | Matched Control | 26,038,920 | 89.1% | |
| 267_MC_NOR | Matched Control | 24,372,321 | 73.3% |
Figure 2Microexons are included in circRNAs. (a) Incorporation of microexons (ME), which are defined here as 3-30 nt sequences into mRNA and circRNA. Note that MEs in circRNAs can fall into the backsplice junction (unfilled exon) or into the body (filled exon) of the circRNA. (b) circRNAs overlapping known microexon sequences for three pipelines. (c) Trimmed reads mapped to EIF4G3 which cover the circRNA backsplice junction across exon 9 and exon 6 (circEIF4G3, 16 counts) and the same junction with a 21nt ME (ME-circEIF4G3, 17 counts) from healthy brain tissue from patient #170. (d) De novo identification of ME-circRNAs using Hyb. Pie chart showing proportion of ME-circRNAs with matched cognate circRNA (637/2558; 24.9%) and orphan ME-circRNAs (1921/2558; 75.1%). (e) Sequence motif enrichment using WebLogo (ver2.8.2) around splice sites of predicted (upper) and known (lower) MEs in ME-circRNAs. The lines under each graph indicates the 5 nt of the intron side and the exon side of the ME splice site with the upstream acceptor splice site on the left and the downstream donor splice site on the right. (f) Upper, Agarose gel showing RT-PCR amplification of ME-circEIF4G3 in RNA from tumor samples #158, #190 and #217 and absence in non-template control (NTC). Lower, Sanger sequencing of PCR product across backsplice junction between the ME (blue shading, exon 6a) and exon 9 (green shading).
Figure 3Serine/arginine repetitive matrix 4 (SRRM4) is a bona fide circRNA and ME-circRNA biogenesis factor. (a) qRT-PCR comparing relative expression of SRRM transcripts between healthy brain tissue (H) and GBM tumor tissue (T). Normalized to GAPDH, n = 4–5 tissue samples. One-way ANOVA, Mann–Whitney test was performed to assess significance. n.s.: no significant difference (p > 0.05) (b) Western blot using FLAG antibody on cell protein lysates from HEK293 cells transiently transfected with pcDNA3.1 (EV), pcDNA3.1::SRRM3-FLAG (SRRM3) or pcDNA3.1::SRRM4-FLAG (SRRM4). Bands of expected size for SRRM3 (70 kDa) and SRRM4 (65 kDa). (c) RT-PCR for ACVR2A linear (ACVR2A), and SRRM4-specific ACVR2A RNA with ME, circACVR2A (comprising exons 3-8) and increase in MEF2C linear RNA with ME. (d) RT-PCR on 3 biological replicates of HEK293 cells with EV, or SRRM4 overexpression for SRRM4, REST (2 bands, with upper band representing the inclusion of ME in REST4), ACVR2A (ME) and GAPDH used as loading control. (e) Scaled Venn diagram showing total number of circRNAs identified across the three replicate EV and SRRM4 overexpression circRNA-seq libraries using three different circRNA identification pipelines, CIRCexplorer2, CIRI and Hyb. (f) RT-PCR for two novel SRRM4-driven circRNAs showing absence in HEK293 EV cells. (g) Volcano plot comparing circRNA expression between EV and SRRM4 overexpression. (h) Scaled Venn diagram showing ME-circRNAs, with the ME present at the backsplice junction, identified using Hyb. (i) RT-PCR showing the dependence on SRRM4 for expression of ME-circEIF4G3 from transgenic circRNA overexpression construct. Top, HeLa cells (with high levels of SRRM4 expression) could produce ME-circEIF4G3 with and without SRRM4 overexpression 24 h post-transfection. Bottom, HEK293 cells (lacking SRRM4 expression) could only produce the circRNA if SRRM4 was co-transfected. * are non-specific RT-PCR products.