Literature DB >> 30159434

Circular RNAs in the cardiovascular system.

Clarissa P C Gomes1, Antonio Salgado-Somoza1, Esther E Creemers2, Christoph Dieterich3, Mitja Lustrek4, Yvan Devaux1.   

Abstract

Until recently considered as rare, circular RNAs (circRNAs) are emerging as important regulators of gene expression. They are ubiquitously expressed and represent a novel branch of the family of non-coding RNAs. Recent investigations showed that circRNAs are regulated in the cardiovascular system and participate in its physiological and pathological development. In this review article, we will provide an overview of the role of circRNAs in cardiovascular health and disease. After a description of the biogenesis of circRNAs, we will summarize what is known of the expression, regulation and function of circRNAs in the cardiovascular system. We will then address some technical aspects of circRNAs research, discussing how artificial intelligence may aid in circRNAs research. Finally, the potential of circRNAs as biomarkers of cardiovascular disease will be addressed and directions for future research will be proposed.

Entities:  

Keywords:  Artificial intelligence; Biomarker; CRISPR, clustered regularly interspaced short palindromic repeats; CV, cardiovascular; Cardiovascular disease; Cardiovascular system; Circular RNAs; DCM, dilated cardiomyopathy; EMT, epithelial-mesenchymal transition; Non-coding RNAs; RNA-seq, RNA sequencing; RPAD, RNase R treatment followed by polyadenylation and poly(A)+ RNA depletion; RT-qPCR, reverse transcription quantitative polymerase chain reaction; circRNAs, circular RNAs; lncRNAs, long non-coding RNAs; miRNAs, microRNAs; ncRNAs, non-coding RNAs

Year:  2018        PMID: 30159434      PMCID: PMC6084836          DOI: 10.1016/j.ncrna.2018.02.002

Source DB:  PubMed          Journal:  Noncoding RNA Res        ISSN: 2468-0540


Introduction

February 2001 marked the completion of the sequencing of the human genome [1,2]. Eleven years later, researchers from the Encode Project Consortium revealed that, while 80% of our genome is transcribed, only 2% of expressed RNAs encode proteins [3]. This landmark discovery implied that the vast majority of our genes are transcribed into non-protein coding RNAs, more simply named non-coding RNAs (ncRNAs). Multiple categories of ncRNAs have been characterized and classified. The most widely used classification is based on the size of ncRNAs, the shorter ones (<200 nucleotides) being classified as small ncRNAs, while those exceeding 200 nucleotides are grouped as long non-coding RNAs (lncRNAs) (Fig. 1). Inside the lncRNAs family, both linear and circular forms of RNAs can be found. Although circular RNAs (circRNAs) were previously thought to be rare as compared to linear messenger RNAs, recent work showed that circRNAs can be relatively abundant and be the predominant transcripts of hundreds of genes [4]. These circular molecules are ubiquitously expressed in human and more than 30,000 different circRNAs have been identified so far [5]. They are regulated in several disease states and contribute to disease development and progression. The role of circRNAs in the cardiovascular system in only emerging.
Fig. 1

The non-coding RNA family. Non-coding RNAs are classified depending on their size with an arbitrary cut-off of 200 nucleotides. Abbreviations: ncRNAs: noncoding RNAs; lncRNA: long noncoding RNA; eRNA: enhancer RNA; circRNA: circular RNA; rRNA: ribosomal RNA; snRNA: small nuclear RNA; scaRNA: small cajal body-specific RNA; snoRNA: small nucleolar RNA; tiRNA: tRNA-derived stress-induced small RNA; tRNA: transfer RNA; RNAi: RNA interference; siRNA: small interfering RNA; piRNA: piwi-interacting RNA; miRNA: microRNA; rasiRNA: repeat associated small interfering RNA.

The non-coding RNA family. Non-coding RNAs are classified depending on their size with an arbitrary cut-off of 200 nucleotides. Abbreviations: ncRNAs: noncoding RNAs; lncRNA: long noncoding RNA; eRNA: enhancer RNA; circRNA: circular RNA; rRNA: ribosomal RNA; snRNA: small nuclear RNA; scaRNA: small cajal body-specific RNA; snoRNA: small nucleolar RNA; tiRNA: tRNA-derived stress-induced small RNA; tRNA: transfer RNA; RNAi: RNA interference; siRNA: small interfering RNA; piRNA: piwi-interacting RNA; miRNA: microRNA; rasiRNA: repeat associated small interfering RNA. Cardiovascular (CV) disease is still the main cause of death and disabilities worldwide [6,7]. While research has made enormous progress in terms of healthcare and treatment of CV disease, the use of RNAs for its treatment and diagnosis is facing technical challenges that slow down their clinical application. Recent developments revealed the potential of microRNAs (miRNAs) for the management of CV disease patients [8,9]. LncRNAs have also emerged as potential diagnostic and therapeutic targets of CV disease [10,11]. Yet, the knowledge of the role of circRNAs in the diseased heart and their value as CV disease markers and therapeutic targets are still in their infancy. This review article is part of a Special Issue on “Non-coding RNAs in Cardiovascular Health and Disease” and will focus on circRNAs in the CV system. After a description of the mechanisms leading to circRNA biogenesis, we will gather the knowledge of the expression, regulation and function of circRNAs in the CV system. We will then address some methodological aspects of circRNAs research and we will provide some considerations of the potential of artificial intelligence to aid in circRNAs research. The value of circRNAs as biomarkers of CV disease will be delineated and we will finally propose directions for future research.

Biogenesis of circRNAs

CircRNAs are produced by the spliceosome machinery by direct back-splicing of exons or introns within pre-mRNA (Fig. 2) [12] or by exon skipping [13]. As a result of a direct back-splicing event, an exon is not associated with an adjacent downstream exon, as seen in linear splicing, but with an upstream exon or intron. This gives rise to circular RNA molecules harboring exons or introns that are out-of-order from the genomic context [14,15]. Exon skipping involves a lariat that undergoes internal splicing, removing the intron and generating a circRNA. Both proposed mechanisms involve the spliceosome machinery, but the mechanism by which the spliceosome selects only certain exons to circularize is largely unknown. It is however known that many circRNAs in mammals are processed from exons with long flanking introns, containing reverse complementary sequences [16,17]. These sequences can be inverted repeats, such as very abundant Alu elements (∼300 base pairs repeats), or non-repetitive complementary sequences [16]. The common feature is that the intronic sequences base-pair to one another, thereby bringing the splice sites into close proximity of each other, allowing the spliceosome to perform a back-splicing reaction. An alternative mechanism to bridge flanking introns is through interaction with RNA binding proteins that bind cis-regulatory regions in these introns.
Fig. 2

Biogenesis of circRNAs. The spliceosome machinery, which normally catalyses linear splicing of pre-mRNA, can also perform a back-splicing reaction between two exons, resulting in the formation of a circRNA. Back-splicing uses the same canonical splicing machinery and canonical splice sites as needed for linear splicing. Mechanistically, back-splicing requires that the donor and acceptor site of the back-spliced exons are brought in close proximity to each other, which can be accomplished by direct RNA base-pairing of reverse complementary sequences in the introns flanking the back-spliced exons, or by the interaction of RNA binding proteins (RBPs) that dock on these flanking introns.

Biogenesis of circRNAs. The spliceosome machinery, which normally catalyses linear splicing of pre-mRNA, can also perform a back-splicing reaction between two exons, resulting in the formation of a circRNA. Back-splicing uses the same canonical splicing machinery and canonical splice sites as needed for linear splicing. Mechanistically, back-splicing requires that the donor and acceptor site of the back-spliced exons are brought in close proximity to each other, which can be accomplished by direct RNA base-pairing of reverse complementary sequences in the introns flanking the back-spliced exons, or by the interaction of RNA binding proteins (RBPs) that dock on these flanking introns. Among the hundreds of RNA binding proteins characterized so far [18], the first one identified as a regulator of circRNA production was Muscleblind (MBL), a zinc finger protein which is known to regulate alternative splicing in muscle [14] and for which dysregulation can lead to pathologies, particularly myotonic dystrophies (reviewed in Ref. [19]). Interestingly, MBL regulates circRNA production from its own pre-mRNA. Numerous MBL-binding sites are present in the introns flanking the back-spliced junction of circMBL, which turned out to be essential for the biogenesis of circMBL. This raises the possibility of a sophisticated control mechanism for MBL protein levels. When MBL protein levels are high, it decreases the production of its own mRNA by promoting circMBL production. The circRNA can subsequently sponge out the excess MBL protein by binding to it [14] (more details about this function in the next section). The second RNA binding protein shown to regulate circRNA production by pre-mRNA binding was Quaking (QKI) [20], a splicing factor that has been implicated in diseases including ataxia, schizophrenia, and glioblastoma [21]. The insertion of synthetic QKI-binding sites into introns was found to be sufficient to produce circRNAs. Mechanistically it was shown that QKI binds to two flanking introns and subsequently dimerizes to bridge the circularizing exons. During epithelial-mesenchymal transition (EMT) it was shown that about one-third of the circRNAs were regulated by QKI [20]. Whether QKI is also required for circRNA formation in the CV system is currently unknown. Among the post-transcriptional modifications that increase molecular diversity and can lead to alternative splicing and changes in gene expression levels, called RNA editing, the prevailing modification in higher eukaryotes is the single base substitution of adenosine to inosine mediated by the adenosine deaminase acting on RNA (ADAR) enzyme family [22,23]. A recent study revealed that the RNA-editing enzyme ADAR1 functions as an antagonist of circRNA production [24,25]. Circularized exons are normally between long introns, often termed bracketing introns. It was found that introns bracketing circRNAs are enriched for RNA editing events and that knockdown of ADAR1 significantly enhanced circRNA expression in HEK293 cells. Repression of RNA circularization is probably not a general function of ADAR1, as only a few circRNAs were affected by ADAR1 knockdown [25]. RNA Binding Motif protein 20 (RBM20) is yet another splicing factor that has recently been implicated in circRNA production, specifically from the titin gene (TTN) [26].

Function of circRNAs in the CV system

In the following lines, we will summarize the current knowledge of the roles of circRNAs in CV pathophysiology, focusing on their mechanisms of action and interaction with partner molecules. A schematic representation of the different functions of circRNAs is provided in Fig. 3.
Fig. 3

Functions of circRNAs. a) In the presence of internal ribosomal enter sites and a suitable open reading frame, circRNAs could be translated into proteins. b) Generation of circRNAs by back-splicing of the precursor RNA can lead to a regulation of the linear/circular ratio of the hosting gene. c) CircRNAs can sponge microRNAs to act as a cytoplasmic reservoir of microRNAs. The scavenging of microRNAs removes the repression of target RNAs leading to an increase in their translation (if mRNAs) or activity (lncRNAs). d) CircRNAs can act as a scaffold for cytoplasmic proteins, retain certain transcription factors in the cytoplasm or serve as a vehicle for the transport of these molecules. e) CircRNAs could undergo degradation after being targeted by microRNAs. f) CircRNAs could function as a scaffold for transcription factors leading them to specific locations of the genome or directly interact with the DNA to regulate transcription.

Functions of circRNAs. a) In the presence of internal ribosomal enter sites and a suitable open reading frame, circRNAs could be translated into proteins. b) Generation of circRNAs by back-splicing of the precursor RNA can lead to a regulation of the linear/circular ratio of the hosting gene. c) CircRNAs can sponge microRNAs to act as a cytoplasmic reservoir of microRNAs. The scavenging of microRNAs removes the repression of target RNAs leading to an increase in their translation (if mRNAs) or activity (lncRNAs). d) CircRNAs can act as a scaffold for cytoplasmic proteins, retain certain transcription factors in the cytoplasm or serve as a vehicle for the transport of these molecules. e) CircRNAs could undergo degradation after being targeted by microRNAs. f) CircRNAs could function as a scaffold for transcription factors leading them to specific locations of the genome or directly interact with the DNA to regulate transcription.

circRNAs: coding or non-coding?

First, we would like to discuss the “non-coding RNA” feature of circRNAs. While circRNAs are widely considered as lacking protein-coding potential, recent reports questioned this concept. Translation of circRNAs (Fig. 3a) was suggested by ribosome footprinting data in several animal models (from flies to murine ones), which revealed that a number of circRNAs were consistently found to be associated with translating ribosomes [27]. In another study in murine and human myoblasts, cZNF609 (also known as myocardial infarction-associated circRNA – MICRA [28]) was reported to contain an open reading frame allowing for protein production after overexpression [29]. However, the protein-coding potential of circRNAs in naïve conditions may differ from artificial up-regulation and thus their capability to encode proteins is still a matter of debate.

Expression level

As discussed in section 2, splicing events can determine the expression levels of circRNAs (Fig. 3b). Thus, the balance between circular and linear isoforms originated from the same hosting gene is modified by general splicing factors and the regulation of back splicing mechanisms [14].

Sponge for miRNAs

More attention has been brought to the ability of circRNAs to bind miRNAs (Fig. 3c), ∼22 nucleotide ncRNAs that regulate transcription [8,9]. The circRNA antisense of the cerebellar degeneration-related protein 1 (Cdr1as, also known as ciRS-7) harbours more than 60 potential binding sites for miR-7 [30,31]. The sequestration of miR-7 by Cdr1as, a process termed “sponge effect”, relates to insulin production and secretion by beta cells [32]. Furthermore, Cdr1as is elevated after myocardial infarction in mice and its effect on apoptosis seems to be regulated via miR-7 [33]. Also in mice, the heart-related circRNA (Hrcr) scavenges the pro-hypertrophic miR-223 and thus blocks the isoproterenol-induced cardiomyocyte hypertrophy [34]. Although miR-223 is up-regulated in human failing hearts [34], whether it is sequestered by HRCR in human is still unknown. In serum samples from patients with carotid artery dissection, the ratio between the circRNA circR-284 and miR-221 is altered due to an increase of circR-284 and a decrease of miR-221 [35]. In thoracic aortic dissection patients, an increase of matrix metalloproteinase 9 expression is observed and bioinformatics tools predicted that hsa_circ_101283 could contribute to it by binding three miRNAs (mir-320a, miR-583-5p and miR-138-5p) [36]. In human aortic smooth muscle cells, circACTA2 regulates alpha-actin expression by interacting with miR-548f-5p [37]. In mouse cardiac fibroblasts, circRNA_000203 increases collagen production by binding miR-26b-5p [38]. The mitochondrial fission and apoptosis-related circRNA (MFACR) sponges miR-652-3p, upregulating its target, MTP18, consequently regulating mitochondrial fission and cardiomyocyte apoptosis in mice [39]. Finally, cZNF609 appears to regulate vascular endothelial dysfunction by sponging miR-615-5p [40]. The therapeutic potential of these findings remains to be demonstrated.

Scaffold

CircRNAs could also behave as scaffolds, either drive other elements in the cell towards their targets or block them to prevent their action (Fig. 3d). As an example of the latter, circ-Foxo3 sequesters proteins like Id1, E2f1, Fak, and Hif1a in the cytoplasm of cardiomyocytes and cardiac fibroblasts, hence avoiding their translocation to the nucleus and contributing to the ageing of those cells [41]. In a mouse cell line, circ-Foxo3 showed an affinity for cell cycle-related proteins such as Cdk2 and p21, and to a lesser extent Cdk6, p16 and p27, also suggesting a role of this circRNA in cell proliferation [42].

Other putative functions of circRNAs

The miRNA-sponge function of circRNAs described before is typically presumed for cytoplasmic circRNAs. Of note, although cZNF292 is cytoplasmic, a miRNA-sponge effect could not be demonstrated using high-throughput sequencing of RNA isolated by cross-linking and immunoprecipitation [43]. In addition to the sequestration of miRNAs by circRNAs, it can be foreseen that miRNAs could bind circRNAs to induce their degradation (Fig. 3e), similar to the down-regulation of linear RNAs. However, this needs further investigation. Some RNA molecules are able to target genomic DNA in a sequence-specific manner. This characteristic is the basis of the gene editing mechanism named Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), where a small guide RNA places the CRISPR associated protein (Cas9) at a specific genomic location [44]. Together with their scaffold ability, one could speculate that circRNAs can act as transcription regulators, either alone or in combination with other RNA or protein partners (Fig. 3f). Therefore, circRNAs could target specific DNA sequences and modulate their transcription, as it was previously described for lncRNAs [10,45].

Expression and regulation of circRNAs in the CV system

Although circRNAs have been described at least as early as 1976 as viroids [46] and then in the 1990s in eukaryotes [47,48], interest in their function and application has been recently renewed [30,[49], [50], [51]]. RNA sequencing data of non-polyadenylated transcripts allowed the realization that circRNA expression is widespread [16,24,30,[50], [51], [52]]. Studies revealed that at least 14% of expressed genes in human fibroblasts yield over 25,000 circRNAs [51]. CircRNAs are abundant and conserved in eukaryotes, have been detected in different developmental stages and cell types, and their expression can also be highly cell-type dependent [30,50,51,53]. These observations together with their specific changes in pathological states (reviewed in Ref. [54]) suggest their expression is a regulated process [4]. Investigations of circRNA expression during human development revealed dynamic changes [55,56], particularly a trend in upregulation [56]. Interestingly, the most expressed gene during foetal development was the circular isoform of NCX1 in the heart, a calcium transport gene essential for cardiac development. In this study, several other circRNAs were more expressed than their linear mRNAs [56]. This observation is consistent with other reports that indicate that the circRNA expression is not necessarily correlated with the expression of its linear mRNA from which it derives, varying according to the cell type [25,51,53,56]. Two contributions towards a better understanding of the potential role of circRNAs in the CV system were the characterization of these molecules' expression in both developing and adult hearts [57,58]. One study detected over 9,000 candidate circRNAs for each species using RNA sequencing (RNA-seq) of heart tissue from humans (failing and non-failing), mice (sham or after transverse aortic constriction) and rats (neonatal and adult) [57]. The other study identified also by RNA-seq over 15,000 circRNAs in humans and over 3,000 in mice. The group analysed tissue from failing and non-failing hearts from both species and from human embryonic stem cell-derived cardiomyocytes across different stages of differentiation [58]. The most expressed circRNAs, which were more abundant than their linear counterparts, were produced from critical cardiac genes, such as TTN, ryanodine receptor 2 (RYR2 – encoding a protein found primarily in cardiac muscle and involved in calcium release from the cytoplasm), and dystrophin (DMD – encoding a component of the dystrophin-glycoprotein complex which links the cytoskeleton to the extracellular matrix and which genetic alterations cause Duchenne muscular dystrophy) [58]. The titin gene encodes titin (TTN), a giant protein involved in muscle contraction. RBM20, involved in circRNA biogenesis, is expressed in the heart and skeletal muscle, where it regulates alternative splicing of multiple genes, including TTN [59]. Mutations in the RBM20 gene lead to a severe and early onset dilated cardiomyopathy (DCM), and aberrant splicing of TTN is believed to be the main reason why RBM20 mutation carriers develop DCM [[59], [60], [61]]. However, a recent study showed that RBM20 is also required for the production of a large set of circRNAs that arise from the TTN gene [26]. The authors profiled circRNAs of human left ventricle samples and control from hypertrophic or dilated cardiomyopathy patients, finding 826 back-splice junctions in common between the groups, where 80 were from the titin transcript. The RBM20-binding sites were found to be enriched in the introns flanking the titin back-splice junctions. These RBM20-dependent TTN circRNAs specifically arise from a region within TTN transcript: the so-called I-band, which is known to undergo extensive alternative splicing. Interestingly, in the hearts of Rbm20 knockout mice, and also in human RBM20 mutation carriers, circRNAs from TTN's I-band were completely absent, while at the same time, the corresponding exons were abundantly included in the linear TTN transcript. This implies a mechanism in which exons that are spliced out of the TTN pre-mRNA can serve as a substrate for the production of circRNAs [26]. In conclusion, this study shows that TTN circRNA production is intimately connected to alternative splicing of TTN, and this is in line with other studies demonstrating that the more an exon is circularized, the less it is represented in the linearly processed mRNA [14,62]. A microarray study contributed to the identification of 63 differentially expressed circRNAs from 1,163 circRNA candidates in murine tissues from normal and failing hearts due to myocardial infarction [63]. Twelve of these differentially expressed circRNAs were validated by quantitative PCR, including Cdr1as. The authors observed an upregulation in failing compared to normal hearts, a result consistent with another report showing that Cdr1as was overexpressed in mice hearts with increased infarct size and cardiomyocytes under hypoxia. Additionally, Cdr1as acted as a sponge to miR-7a in myocardial cells, suppressing its activity [33]. Few other circRNAs have been reported correlated with cardiovascular disease. In an analysis of plasma from myocardial infarction patients, the circRNA circRNA_081881 was down-regulated more than ten-fold compared to plasma from healthy volunteers and was predicted to bind miR-548, which targets the heart-protective gene PPARγ [64]. The circRNA produced from the forkhead family of transcription factors Foxo3, circ-Foxo3, was shown to promote cardiac senescence in hearts of aged patients and mice, where it was highly expressed. Silencing of the circRNA inhibited senescence of mouse embryonic fibroblasts inhibited senescence suppressing cardiomyopathy symptoms [41]. As diabetes mellitus is a prime risk factor for cardiovascular disease and cardiac fibrosis is one of the main pathologies of diabetic cardiomyopathies, studies investigating the role of circRNAs in such conditions are emerging. A report with diabetic mice hearts and Angiotensin II-induced cardiac fibrosis investigated the modulation of circRNA_000203 on fibrosis-associated genes expression. This circRNA was found to be upregulated compared to controls and to interact with the anti-fibrotic miR-26b-5p, enhancing the proliferation of cardiac fibroblast [65]. High levels of another circRNA_010567 were found in diabetic mice myocardium and cardiac fibroblasts, negatively regulating miR-141, and thus upregulating its target, TGF-β1. These three molecules play a role in regulating the pathogenesis of myocardial fibrosis in the studied model [66]. In vascular endothelial cells, computational analysis of RNA-seq data resulted in the identification of over 7,000 circRNAs, among which cZNF292 was shown to be highly expressed and up-regulated under hypoxic stress [43]. Importantly, cZNF292 was not a mere byproduct of RNA splicing but was functionally involved in angiogenic sprouting [43].

Methodological aspects of circRNAs research

Most circRNAs escaped identification for decades because they are not easily separated from other RNAs by size (as miRNAs can be), they do not have polyadenylation or free 3′ or 5′ ends, and they have exons arranged in a different order (back-splice), thus being considered as artefacts [12]. However, technological advances in RNA analyses and bioinformatic tools enabled circRNAs to be appreciated as an important type of ncRNAs and potential regulators of gene expression. The events that give rise to circRNAs specify certain properties of these molecules, such as the presence of a non-linear splice junction, the absence of a polyA tail and the resistance to exonuclease treatment, which distinguish them from linear mRNAs. These characteristics can be used to detect novel circRNAs.

Detection of back-splicing events

Back-splicing junctions can be readily detected from ribosomal RNA (rRNA) depleted whole transcriptome sequencing data as opposed to conventional polyA + sequencing data (Fig. 4). The detection of circRNAs is a multi-stage bioinformatics process. Briefly, short sequencing reads, which are indicative of back-splicing events and, hence, do not align co-linearly to their genomic locus of origin, are identified in rRNA-depleted whole transcriptome sequencing data. Depending on tissue and condition, this will lead to a list of potential back-splicing junction candidates, which may represent hundreds to thousands of potential circRNAs. The quality of these predictions is typically assessed by their absence of polyA sequencing data (i.e. false positives) and by their relative enrichment in RNase R treated over mock-treated rRNA-depleted whole transcriptome sequencing data (Fig. 4) [67]. An alternative RNase R-based approach, which is generally applicable for all species where no rRNA removal strategy exists, has been proposed and termed RPAD (RNase R treatment followed by polyadenylation and poly(A)+ RNA depletion) [68]. Briefly, total RNA is digested by RNase R and remaining linear RNAs receive a polyA tail in a second step, which is then used to deplete them from the RNA pool. This method is more efficient to eliminate linear RNAs than treatment with RNase R alone, facilitating the detection of novel circRNAs [68].
Fig. 4

Different sequencing strategies to detect back-splicing junctions and to recover the internal structure of circular RNAs.

Different sequencing strategies to detect back-splicing junctions and to recover the internal structure of circular RNAs. A number of software tools exist for circRNA recognition through the detection of back-splicing junctions (Table 1). A comparison of the benefits and drawbacks of some of these tools has been recently reported [69]. However, publicly available downstream analysis tools for the reconstruction of circRNA species and additional in-depth analyses are rare.
Table 1

Prediction tools for the discovery of circRNAs from RNA sequencing (RNA-seq) datasets.

Tool nameWebsiteDescriptionAlignerReference
Acfshttps://github.com/arthuryxt/acfsIdentifies back-splicing junctions from RNA-seq dataBWA-MEM[84]
circExplorerhttp://yanglab.github.io/CIRCexplorer/Identifies junction reads from back-spliced exons and intron lariatsTopHat Fusion[16]
circRNA_finderhttps://github.com/bioxfu/circRNAFinderPipeline to find circRNAs from RNA-seq dataSTAR[85]
CIRI2https://sourceforge.net/projects/ciri/files/CIRI2/Detects back-splicing junction reads using efficient maximum likelihood estimation based on multiple seed matchingBWA-MEM[86]
DCChttps://github.com/dieterich-lab/Detects back-splicing junctions from RNA-seq data and applies filters to integrate data across replicate setsSTAR[67]
find_circhttps://github.com/marvin-jens/find_circDetects back-spliced sequencing reads from RNA-seq databowtie2[30]
KNIFEhttps://github.com/lindaszabo/KNIFEStatistically-based splicing detection for circular and linear isoforms from RNA-Seq databowtie and bowtie2[56]
MapeSplicehttp://www.netlab.uky.edu/p/bioinfo/MapSplice2Maps RNA-seq data to reference genome to discover splice junctionsbowtie[87]
NCLScanhttps://github.com/TreesLab/NCLscanPipeline to identify intra and intergenic non-colinear transcripts (including circRNA) from paired-end RNA-seq dataBWA[88]
PTESFinderhttps://sourceforge.net/projects/ptesfinder-v1/Pipeline for identifying post-transcriptional exon shuffling events from high-throughput RNA-seq dataBowtie[89]
Sailfish-cirhttps://github.com/zerodel/sailfish-cirEstimates the relative abundance of circRNAs from RNA-seq dataRapMap[90]
Segemehlhttp://www.bioinf.uni-leipzig.de/Software/segemehl/Maps short sequencer reads to reference genomes, detecting mismatches, insertions and deletionsSegemehl[91]
SUPeR-seqhttps://www.illumina.com/science/sequencing-method-explorer/kits-and-arrays/super-seq.htmlMethod to sequence both polyadenylated and non-polyadenylated RNAs from individual cellsbowtie2[52]
UROBORUShttps://github.com/WGLab/UROBORUSPipeline to identify circRNA from total RNA-seq data without RNase R treatmentTopHat Bowtie[92]
Prediction tools for the discovery of circRNAs from RNA sequencing (RNA-seq) datasets. For example, Circtools (https://github.com/dieterich-lab/circtools), a modular, Python/R-based framework for circRNA-related tools that unifies several essential functions in a one-step software solution, aims to address the issue of circRNA detection and quantification. Circtools' base modules cover detection of circRNAs [67], a beta-binomial model for testing relative abundance changes of circular vs. linear RNA species from the same host gene (CircTest), and to fully reconstruct the exon-intron structure of circRNAs from long sequencing reads (FUCHS) [70]. This framework was used to identify circRNAs and host gene dynamics in cardiac development and disease [55].

Alternative splicing analysis

Oftentimes, several alternative circRNAs emerge from the same gene locus. Similar to alternative linear splicing, circRNAs could share the same back-splicing site, yet differ in their internal exon composition, that is, by alternative exon skipping. Moreover, alternative back-splicing may occur within the same host gene by using alternative splice site acceptors or donors. Two software packages are specifically designed for resolving this question, namely FUCHS [70] and CIRI-AS [71].

Assessment of circRNAs by RT-qPCR

Often used to validate RNA-seq data, reverse transcription quantitative polymerase chain reaction (RT-qPCR) has been adapted to specifically assess the expression of circRNAs. Primer design is a necessary first step in validating and quantitating circRNAs. Having considered alternative circRNA splicing, one needs to keep in mind that sequences flanking a particular back-splicing junction may represent one or more circRNAs. A comparison to the expression of the host gene could become a complex endeavour too if multiple isoforms exist. Herein, Circtools supports the experimentalist by its primer design module.

Artificial intelligence for circRNA research

Artificial intelligence methods, particularly machine learning, are increasingly used to analyse biological data. The large amount of experimental data generated by high-throughput methods makes this both possible and necessary, since machine learning requires a lot of training data, but is also often the most effective approach for finding interesting hidden relationships. So far there have been few applications of artificial intelligence to circRNAs since they have not been researched as intensely as other molecules such as linear RNAs or proteins. However, we expect new utilisations of artificial intelligence methods to arise following the continuous improvement of the knowledge related to the role of circRNAs in biology and disease. Both existing and potential applications of artificial intelligence to circRNA are discussed in this chapter.

CircRNA recognition

The first problem that requires sophisticated computational methods is circRNA recognition as discussed in section 5. This is usually tackled with heuristic methods gathered in Table 1. These tools take RNA-seq data as input and use two main approaches to recognize circRNAs: one that is based on detecting back-splices in RNA-seq data, and another that requires genomic information (explained in Ref. [12]). With increasing numbers of experimentally verified circRNAs, it is becoming feasible to develop machine-learning methods to recognize them. A method called PredcircRNA, as well as the web tool WebCircRNA that implements a version of this method distinguishes circRNAs from other lncRNAs [72,73]. The study describing it started with a dataset of 14,084 circRNAs and 19,722 linear lncRNAs. From these, 188 features were extracted: graph describing RNA structure and sequence, conservation score, nucleotide composition, and other features such as single nucleotide polymorphism density at miRNA binding sites. Each group of features was transformed via a Gaussian kernel, which was fused using lp-norm multiple kernel learning to build a model to recognize circRNAs. The evaluation showed an accuracy of around 80% (depending on the dataset). For a general overview of machine learning from RNA-seq data (of which PredcircRNA is an example), refer to [74].

Binding site recognition

A possible application of artificial intelligence to circRNAs is to recognize miRNAs or protein binding sites [75]. Here again, to our knowledge, no applications of artificial intelligence to this problem has been reported but computational methods have been used to recognize RNA-binding proteins and binding sites in other contexts [76]. These methods largely rely on machine learning from protein sequence and structure. It is so far still unclear to what degree they apply to circRNA and whether they can be used to recognize binding to circular as opposed to linear RNAs. Unlike most work on RNA binding, which deals with proteins and their binding sites, one framework detects binding sites on RNA. The novelty of this approach is that it uses not only the primary RNA sequence and secondary structure but also tertiary structure [77]. In addition, instead of relying on traditional machine learning algorithms, the framework uses deep learning, an approach that builds deep artificial neural networks and has recently proven superior to traditional machine learning for many problems. It seems likely that such an approach could be used to recognize binding sites on circRNAs as well.

Biomarker value of circRNAs

Any newly discovered type of RNA molecules has been evaluated for its biomarker potential. Bioinformatics and computational biology approaches based on the concepts of systems biology allowed the discovery of messenger RNAs [78], miRNAs [8,9,79] and lncRNAs [10,11,45] as biomarkers of CV disease. The blood transcriptome has been revealed as a rich reservoir of potential biomarkers of CV diseases [80]. Yet, the biomarker value of circRNAs for CV disease has only recently been suggested. A study in patients with acute myocardial infarction identified the circRNA MICRA which showed an association with the development of heart failure in the few months following the event [28]. This association was validated in two independent patient cohorts. Importantly, MICRA provided a significant incremental value to predict heart failure development over existing risk factors and biomarkers [28,81]. Even if, so far, no other circRNA has been reported to have a diagnostic or a prognostic value for CV disease, it is expected that the resistance of circRNAs to exonuclease degradation confers them with an interesting biomarker potential that needs to be further explored.

Considerations for future work

The knowledge of the role of circRNAs in CV health and disease is still scarce and more research is needed to reach a better knowledge of their potential as therapeutic targets and biomarkers of CV disease. The following considerations may be pondered when designing future research projects. Discovery of circRNAs. Different prediction algorithms available for free download from several databases have been developed for the discovery of circRNAs from RNA-seq datasets (Table 1). Use of RNase R-digested total RNA as input for RNA-seq allows targeting circRNAs for increased specificity and sensitivity of detection. Technical aspects of circRNAs detection. Since circRNAs are produced as an alternative of linear RNA transcripts, it is important to ensure that what is thought of as a circRNA is effectively in a circular form. This verification can be achieved using RNase R digestion and divergent primers or probes spanning the junction site in RT-qPCR experiments. Biogenesis of circRNAs. A thorough characterization of the mechanisms responsible for circRNAs biogenesis will help to understand their regulation in disease states and may lead to the identification of novel therapeutic targets. Role of RNA binding proteins in circRNA biogenesis. Since there are hundreds of RNA binding proteins known, an important challenge for the future will be to identify those proteins specifically involved in circRNA biogenesis. Function of circRNAs. Several databases are available to search for a potential function of circRNAs (Table 2). One of the most interesting, since it links circRNAs to disease states, might be the Circ2Traits database.
Table 2

Databases to search for a potential function of circRNAs.

Database nameWebsiteDescriptionReference
circBasehttp://circbase.org/Merged and unified public datasets of circRNAs[93]
circ2Traitshttp://gyanxet-beta.com/circdb/Disease-circRNA associations according to a network of predicted interactions between disease-associated miRNAs and protein coding, long non-coding and circRNA genes, as well as the disease-and traits-associated variations in circRNA loci[94]
CircNethttp://circnet.mbc.nctu.edu.tw/Tissue-specific circRNA expression profiles and circRNA-miRNA-gene regulatory networks[95]
starBasehttp://starbase.sysu.edu.cn/mirCircRNA.phpPredicted miRNA-circRNA interactions overlapped with CLIP-Seq data[96]

CLIP: cross-linking immunoprecipitation.

Databases to search for a potential function of circRNAs. CLIP: cross-linking immunoprecipitation. Mechanisms of action. Apart from their presumed miRNA sponge effect, the mechanisms of action of circRNAs are only partly understood and deserve further molecular studies. Protein-coding potential. Further investigations are required to clarify the potential of circRNAs to encode proteins, as recently suggested in several reports [27,29,82,83]. Some bioinformatics tools have been developed to evaluate the protein-coding potential of circRNAs (Table 3), which may be helpful for future research.
Table 3

Bioinformatic tools to evaluate the protein-coding potential of circRNAs.

Tool nameWebsiteAimReference
CircProhttp://bis.zju.edu.cn/CircPro/Predicts the protein-coding potential of circRNAs and discovers junction reads from Ribo-Seq data[97]
circRNADBhttp://202.195.183.4:8000/circrnadb/circRNADb.phpA comprehensive database for human circRNAs with protein-coding annotations (IRES, ORF, protein domains)[5]
CPAThttp://lilab.research.bcm.edu/cpat/Imputes the probability of an RNA to encode a peptide[98]
IRESitehttp://iresite.org/Aligns validated IRES to circRNA sequences[99]
ORF Finderhttps://www.ncbi.nlm.nih.gov/orffinder/Searches for open reading frames in DNA sequences
Pfamhttp://pfam.xfam.org/Collects protein families with sequence alignments[100]
PhyloCSFhttps://github.com/mlin/PhyloCSF/wiki/Determines whether a nucleotide sequence represents a protein-coding region[101]

CAPT: coding-potential assessment tool; IRES: internal ribosomal entry site; ORF: open reading frame.

Bioinformatic tools to evaluate the protein-coding potential of circRNAs. CAPT: coding-potential assessment tool; IRES: internal ribosomal entry site; ORF: open reading frame. Artificial intelligence. As research on circRNAs advance, we envision that new solutions based on artificial intelligence methods will help to identify circRNAs, their interacting molecules, their biological function, and their role in disease. Biomarker value of circRNAs. The potential of circRNAs to be used as biomarkers is highly related to their relative stability in body fluids due to resistance to exonucleases. However, as any novel biomarker, circRNAs will have to satisfy stringent criteria to be clinically useful: reliable detection, automatable detection, robust predictive value (specificity and sensitivity), incremental value to existing markers, extensive validation in independent and large patient cohorts, and cost-effectiveness.
  97 in total

1.  Scrambled exons.

Authors:  J M Nigro; K R Cho; E R Fearon; S E Kern; J M Ruppert; J D Oliner; K W Kinzler; B Vogelstein
Journal:  Cell       Date:  1991-02-08       Impact factor: 41.582

2.  MicroRNA-150: a novel marker of left ventricular remodeling after acute myocardial infarction.

Authors:  Yvan Devaux; Melanie Vausort; Gerry P McCann; Jennifer Zangrando; Dominic Kelly; Naveed Razvi; Lu Zhang; Leong L Ng; Daniel R Wagner; Iain B Squire
Journal:  Circ Cardiovasc Genet       Date:  2013-04-01

Review 3.  Functions and regulation of RNA editing by ADAR deaminases.

Authors:  Kazuko Nishikura
Journal:  Annu Rev Biochem       Date:  2010       Impact factor: 23.643

4.  A novel identified circular RNA, circRNA_010567, promotes myocardial fibrosis via suppressing miR-141 by targeting TGF-β1.

Authors:  Bing Zhou; Jian-Wu Yu
Journal:  Biochem Biophys Res Commun       Date:  2017-04-12       Impact factor: 3.575

5.  Detecting and characterizing circular RNAs.

Authors:  William R Jeck; Norman E Sharpless
Journal:  Nat Biotechnol       Date:  2014-05       Impact factor: 54.908

6.  A circular RNA protects the heart from pathological hypertrophy and heart failure by targeting miR-223.

Authors:  Kun Wang; Bo Long; Fang Liu; Jian-Xun Wang; Cui-Yun Liu; Bing Zhao; Lu-Yu Zhou; Teng Sun; Man Wang; Tao Yu; Ying Gong; Jia Liu; Yan-Han Dong; Na Li; Pei-Feng Li
Journal:  Eur Heart J       Date:  2016-01-21       Impact factor: 29.983

7.  NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision.

Authors:  Trees-Juen Chuang; Chan-Shuo Wu; Chia-Ying Chen; Li-Yuan Hung; Tai-Wei Chiang; Min-Yu Yang
Journal:  Nucleic Acids Res       Date:  2015-10-05       Impact factor: 16.971

8.  Single-cell RNA-seq transcriptome analysis of linear and circular RNAs in mouse preimplantation embryos.

Authors:  Xiaoying Fan; Xiannian Zhang; Xinglong Wu; Hongshan Guo; Yuqiong Hu; Fuchou Tang; Yanyi Huang
Journal:  Genome Biol       Date:  2015-07-23       Impact factor: 13.583

9.  IRESite--a tool for the examination of viral and cellular internal ribosome entry sites.

Authors:  Martin Mokrejs; Tomás Masek; Václav Vopálensky; Petr Hlubucek; Philippe Delbos; Martin Pospísek
Journal:  Nucleic Acids Res       Date:  2009-11-16       Impact factor: 16.971

10.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

View more
  13 in total

Review 1.  Minor and major circRNAs in virus and host genomes.

Authors:  Zhihao Lou; Rui Zhou; Yinghua Su; Chun Liu; Wenting Ruan; Che Ok Jeon; Xiao Han; Chun Lin; Baolei Jia
Journal:  J Microbiol       Date:  2021-02-23       Impact factor: 3.422

Review 2.  Role of circular RNAs in cardiovascular diseases.

Authors:  Xue Gong; Gengze Wu; Chunyu Zeng
Journal:  Exp Biol Med (Maywood)       Date:  2019-01-17

3.  [CircRNA_005647 inhibits expressions of fibrosis-related genes in mouse cardiac fibroblasts via sponging miR-27b-3p].

Authors:  Shujing Yuan; Jingnan Liang; Ming Zhang; Jiening Zhu; Rong Pan; Hui Li; Ni Zeng; Yihong Wen; Zhiyao Yi; Zhixin Shan
Journal:  Nan Fang Yi Ke Da Xue Xue Bao       Date:  2019-11-30

Review 4.  The long non-coding road to endogenous cardiac regeneration.

Authors:  Abdel Rahman Yousry Afify
Journal:  Heart Fail Rev       Date:  2019-07       Impact factor: 4.214

Review 5.  Extracellular Vesicle-Mediated Vascular Cell Communications in Hypertension: Mechanism Insights and Therapeutic Potential of ncRNAs.

Authors:  Ji-Ru Zhang; Hai-Jian Sun
Journal:  Cardiovasc Drugs Ther       Date:  2020-09-22       Impact factor: 3.727

6.  CircPCMTD1 Acts as the Sponge of miR-224-5p to Promote Glioma Progression.

Authors:  Si-Qi Zheng; Yue Qi; Jun Wu; Fen-Li Zhou; Hao Yu; Lu Li; Bo Yu; Xiao-Fan Chen; Wei Zhang
Journal:  Front Oncol       Date:  2019-05-22       Impact factor: 6.244

Review 7.  Circular RNAs and their roles in head and neck cancers.

Authors:  Yang Guo; Jiechao Yang; Qiang Huang; Chiyao Hsueh; Juan Zheng; Chunping Wu; Hui Chen; Liang Zhou
Journal:  Mol Cancer       Date:  2019-03-21       Impact factor: 27.401

Review 8.  Noncoding RNAs in Hypertension.

Authors:  Amela Jusic; Yvan Devaux
Journal:  Hypertension       Date:  2019-07-29       Impact factor: 10.190

Review 9.  Non-coding RNA involvement in the pathogenesis of diabetic cardiomyopathy.

Authors:  Wei Zhang; Weiting Xu; Yu Feng; Xiang Zhou
Journal:  J Cell Mol Med       Date:  2019-06-26       Impact factor: 5.310

10.  Increasing Upstream Chromatin Long-Range Interactions May Favor Induction of Circular RNAs in LysoPC-Activated Human Aortic Endothelial Cells.

Authors:  Angus Li; Yu Sun; Charles Drummer; Yifan Lu; Daohai Yu; Yan Zhou; Xinyuan Li; Simone J Pearson; Candice Johnson; Catherine Yu; William Y Yang; Kevin Mastascusa; Xiaohua Jiang; Jianxin Sun; Thomas Rogers; Wenhui Hu; Hong Wang; Xiaofeng Yang
Journal:  Front Physiol       Date:  2019-04-18       Impact factor: 4.566

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.