| Literature DB >> 34356060 |
Alessia Buratin1,2, Enrico Gaffo2, Anna Dal Molin2, Stefania Bortoluzzi2,3.
Abstract
Circular RNAs (circRNAs) are transcripts generated by back-splicing. CircRNAs might regulate cellular processes by different mechanisms, including interaction with miRNAs and RNA-binding proteins. CircRNAs are pleiotropic molecules whose dysregulation has been linked to human diseases and can drive cancer by impacting gene expression and signaling pathways. The detection of circRNAs aberrantly expressed in disease conditions calls for the investigation of their functions. Here, we propose CircIMPACT, a bioinformatics tool for the integrative analysis of circRNA and gene expression data to facilitate the identification and visualization of the genes whose expression varies according to circRNA expression changes. This tool can highlight regulatory axes potentially governed by circRNAs, which can be prioritized for further experimental study. The usefulness of CircIMPACT is exemplified by a case study analysis of bladder cancer RNA-seq data. The link between circHIPK3 and heparanase (HPSE) expression, due to the circHIPK3-miR558-HPSE regulatory axis previously determined by experimental studies on cell lines, was successfully detected. CircIMPACT is freely available at GitHub.Entities:
Keywords: circular RNA; gene expression; pathways; regulatory axes
Mesh:
Substances:
Year: 2021 PMID: 34356060 PMCID: PMC8308052 DOI: 10.3390/genes12071044
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1CircRNAs regulate cell behavior with different mechanisms. The integrated analysis of circRNA and linear gene expression profiles can predict circRNA functionsby identifying the biological processes and pathways impacted by circRNA expression variation.
Figure 2The CircIMPACT workflow.
The functions implemented in CircIMPACT.
| Function | Description |
|---|---|
|
| Defines the discriminant circRNAs and sample grouping according to circRNA expression patterns |
|
| Performs differential gene expression tests between the sample groups defined by circRNA expression |
|
| Normalization, feature selection, and classification of sample groups defined by circRNA expression using gene expression |
Figure 3CircRNA analysis of bladder cancer RNA-seq data. (A) Fragment of the table produced by the circ.marker function, indicating circRNAs equally or differentially expressed among sample groups. (B) Density plot for circHIPK3 expression in the sample groups defined by circHIPK3 median expression (g1 corresponds to bladder cancer, g2 to control samples). (C) Principal component analysis (PCA) of the sample separation using the top 25 circRNAs mainly contributing to sample separation, which include circHIPK3.
Top 25 circRNAs discriminating better among sample groups. For each circRNA, identified by the circRNA host-gene name and the back-splice junction genomic coordinates (BSJ), the average expression in the two sample groups (g1 and g2) are indicated along with the circRNA fold change in base-2 logarithm (Log2FC) and the adjusted p-value (p.adj) of differential expression.
| CircRNA Host Gene 1 | Back-Splice Coordinates | Log2FC | p.adj | Mean g1 | Mean g2 |
|---|---|---|---|---|---|
| SVIL | 10:29512734-29531288 | 7.3 | 0.0000 | 0.5 | 47.5 |
| AMY2B | 1:103565434-103575540 | 4.5 | 0.0000 | 2.3 | 27.1 |
| HIPK3 | 11:33286412-33287511 | 2.5 | 0.0979 | 37.0 | 213.8 |
| SLC43A1 | 11:57491223-57491862 | 3.2 | 0.0000 | 6.4 | 43.0 |
| TRAF5 | 1:211353238-211354467 | 3.1 | 0.0000 | 5.1 | 44.3 |
| ITGA7 | 12:55700898-55701154 | 5.3 | 0.0007 | 22.5 | 903.0 |
| SNHG12 | 1:28580559-28581229 | 6.2 | 0.0000 | 1.0 | 25.9 |
| RPPH1 | 14:20343123-20343272 | 3.2 | 0.0000 | 41.4 | 359.3 |
| RPPH1 | 14:20343123-20343277 | 5.1 | 0.0008 | 4.3 | 122.8 |
| RPPH1 | 14:20343128-20343277 | 3.3 | 0.0000 | 39.0 | 373.9 |
| MYOCD | 17:12705127-12723008 | 6.8 | 0.0000 | 0.5 | 32.8 |
| CircClust | 17:79644136-79646334 | 5.4 | 0.0003 | 2.2 | 83.4 |
| ZNF208 | 19:21974728-21988909 | 3.0 | 0.0000 | 52.7 | 419.3 |
| PPP1R13L | 19:45398004-45398339 | 4.2 | 0.0215 | 3.8 | 55.9 |
| FER1L4 | 20:35595476-35595754 | 3.2 | 0.0097 | 4.5 | 36.0 |
| AF165147.1 | 21:28377945-28417158 | 6.4 | 0.0000 | 1.8 | 145.7 |
| SMTN | 22:31097268-31104387 | 6.3 | 0.0001 | 1.2 | 71.9 |
| SLC8A1 | 2:40428472-40430304 | 3.3 | 0.0007 | 7.0 | 59.2 |
| GRHL1 | 2:9995878-9999029 | 4.5 | 0.0000 | 2.4 | 30.3 |
| ABTB1 | 3:127674390-127674600 | 3.8 | 0.0000 | 3.7 | 39.3 |
| FNDC3B | 3:172112451-172133546 | 3.5 | 0.0000 | 8.3 | 86.1 |
| KCNN2 | 5:114404437-114404856 | 4.7 | 0.0008 | 2.2 | 56.6 |
| CD2AP | 6:47503279-47554766 | 4.5 | 0.0153 | 3.1 | 62.3 |
| RAB23 | 6:57193841-57210445 | 5.4 | 0.0001 | 1.5 | 53.8 |
| PGM5 | 9:68378198-68392473 | 4.5 | 0.0009 | 1.5 | 30.5 |
1 CircRNA annotation was based on the Ensembl GRCh38 human genome and annotation v93. Genomic regions without annotated genes but expressing one circRNA or more circRNAs (overlapping or not more than 5000 nt apart) defined new loci, called “CircClust”.
Figure 4Putative impact of circHIPK3 in gene expression, gene ontology, and pathways. (A) Volcano plot of the differentially expressed genes in sample groups defined by circHIPK3 expression. (B) CircHIPK3 and the HPSE gene are respectively up- and downregulated in bladder cancer. (C) Top 10 activated and suppressed GO enriched in differentially expressed genes in the group comparisons (D) Pathways most enriched in deregulated genes.
Figure 5The most important genes in the classification analysis of sample groups, defined by circHIPK3 expression. (A) Variable importance plot of the most relevant genes used in the random forest model; (B) top 10 activated and suppressed enriched GO using most important genes in the group classification.