| Literature DB >> 32211235 |
Artem Nedoluzhko1, Fedor Sharko2,3, Md Golam Rbbani1, Anton Teslyuk2, Ioannis Konstantinidis1, Jorge M O Fernandes1.
Abstract
Circular RNAs (circRNAs) are long noncoding RNAs that play a significant role in various biological processes, including embryonic development and stress responses. These regulatory molecules can modulate microRNA activity and are involved in different molecular pathways as indirect regulators of gene expression. Thousands of circRNAs have been described in diverse taxa due to the recent advances in high throughput sequencing technologies, which led to a huge variety of total RNA sequencing being publicly available. A number of circRNA de novo and host gene prediction tools are available to date, but their ability to accurately predict circRNA host genes is limited in the case of low-quality genome assemblies or annotations. Here, we present CircParser, a simple and fast Unix/Linux pipeline that uses the outputs from the most common circular RNAs in silico prediction tools (CIRI, CIRI2, CircExplorer2, find_circ, and circFinder) to annotate circular RNAs, assigning presumptive host genes from local or public databases such as National Center for Biotechnology Information (NCBI). Also, this pipeline can discriminate circular RNAs based on their structural components (exonic, intronic, exon-intronic or intergenic) using a genome annotation file. ©2020 Nedoluzhko et al.Entities:
Keywords: Annotation; Circular RNAs; Host gene; Prediction; Structural components
Year: 2020 PMID: 32211235 PMCID: PMC7081776 DOI: 10.7717/peerj.8757
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
CircParser.pl usage. Required and optional parameters.
| Parameter | Parameter description |
|---|---|
| -h, –help | Show this help message and exit |
| -b | CircRNA input file (required) |
| -g, –genome | Reference genome file (required) |
| -t, –tax | NCBI TaxID (optional) |
| -a | Genome annotation file, gff/gff3 file (optional) |
| –np | Prohibition for coordinate merging (optional) |
| -c, –ciri | Input circRNA from CIRI—CIRI2 |
| –threads | Number of threads (CPUs) for BLAST search (optional) |
| -v, –version | Current CircParser version |
Figure 1An overview of the CircParser pipeline.
(A) The pipeline includes merging of the circRNAs with overlapping genome coordinates and presents the number of different circRNAs originating from one host gene. (B) CircParser includes the prediction of circRNA structural components using a genome annotation gff/gff3 file.
Figure 2Number of circular RNAs that have been predicted by CIRI, CIRI2, CircExplorer2, find_circ, circFinder, and that are common between all prediction algorithms.
Figure 3CircParser capacity: number of host genes that were predicted by CircExplorer2 and CircParser.