Literature DB >> 27493192

PcircRNA_finder: a software for circRNA prediction in plants.

Li Chen¹, Yongyi Yu¹, Xinchen Zhang¹, Chen Liu¹, Chuyu Ye¹, Longjiang Fan^1,2.

Abstract

MOTIVATION: Recent studies reveal an important role of non-coding circular RNA (circRNA) in the control of cellular processes. Because of differences in the organization of plant and mammal genomes, the sensitivity and accuracy of circRNA prediction programs using algorithms developed for animals and humans perform poorly for plants.
RESULTS: A circRNA prediction software for plants (termed PcircRNA_finder) was developed that is more sensitive in detecting circRNAs than other frequently used programs (such as find_circ and CIRCexplorer), Based on analysis of simulated and real rRNA-/RNAase R RNA-Seq data from Arabidopsis thaliana and rice PcircRNA_finder provides a more comprehensive sensitive, precise prediction method for plants circRNAs.
AVAILABILITY AND IMPLEMENTATION: http://ibi.zju.edu.cn/bioinplant/tools/manual.htm CONTACT: fanlj@zju.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online.

Entities: Gene Species

Mesh：

Substances：

Year: 2016 PMID： 27493192 PMCID： PMC5181569 DOI： 10.1093/bioinformatics/btw496

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 Introduction

Non-coding circular RNA (circRNA) is a covalently continuous closed loop that usually originates from exonic regions (named exonic circRNA), but can also arise from intronic and intergenic regions. CircRNAs can function as a miRNA sponge (Hansen ; Memczak ) and have the potential to enhance transcription of their host genes (Li ; Zhang et al., 2013). The emergence of rRNA-depleted high-throughput RNA-Seq technology provides a revolutionary approach for the systematic discovery of circRNAs in various species, including human, mouse, Arabidopsis and rice (Lu ; Ye ). A robust method for circRNA identification is an important tool for investigating the role of these molecules. The available circRNA prediction methods (e.g. find_circ and CIRCexplorer) were primarily developed for use with human or animal datasets (Memczak et al., 2013; Pan and Xiong, 2015; Salzman ; Szabo ; Zhang ). There are large differences between mammal and plant genomes and therefore the prediction accuracy and sensitivity of detecting circRNAs in plants using the currently available methods are relatively low (Ye ). In this study, we developed a software (termed PcircRNA_finder) that shows a more comprehensive ability and greater sensitivity and precision in predicting circRNAs in plants.

2 Materials and Methods

PcircRNA_finder is mainly designed for exonic circRNA prediction and consists of three modules as shown in Figure 1. These modules are: (i) Catcher, which is used to collect all backsplice sites by chiastic clipping mapping of PE reads based on available main fusion detection methods, including Tophat-Fusion (Kim and Salzberg, 2011), STAR-Fusion (Dobin, et al., 2013), find_circ (Memczak ), Mapsplice (Wang ) and segemehl (Hoffmann ). Among these candidate backsplice sites, false positive sites will be filtered out in the Filter module. The increased read mapping accuracy in our program excludes some false predictions dues to the high copy number of genes in plants (Supplementary Data). (ii) Annotator, that can be used to annotate the candidate exonic backsplice sites based on available gene annotation. Recent studies have demonstrated that circRNA's backsplicing site is flexible and alternative splicing of circRNAs is prevalent (Starke ; Szabo ). Much of the alternative splicing of circRNAs occurred near by canonical splicing sites (Szabo; Starke ) and therefore, 5-bp flanking the two canonical backsplice sites (acceptor and donor) were allowed for our candidate backsplice sites and (iii) Filter, which is a quality control module for the above candidate circRNAs. It creates a pseudoRef file with the flanking sequences of chiastic backsplice sites and then maps raw reads to it and confirms the backsplice sites. It also requires that the candidate circRNAs contain at least one of two kinds of splicing signals, either a U2 based spliceosome (usually with a consensus sequence of GT-AG and GC-AG) and a U12-based minor spliceosome (usually with a consensus sequence of AT-AC) (Reddy ; Staiger and Brown, 2013).

Fig. 1.

The flowchart of PcircRNA_finder for circRNA prediction. It consists of three modules (stages)

3 Benchmark

To test the performance of PcircRNA_finder, we first compare it with two popular circRNA finding algorithms (find_circ and CIRCexplorer) using a simulation dataset for the analysis. Simulated RNA-Seq data (paired end reads, 100 ;bp and 6000 backsplicing reads for each sample) were generated by randomly choosing 200 chiastic transcripts based upon the Arabidopsis thaliana and rice genome annotations, respectively (Supplementary Data). The sensitivity, precision and sensitivity ;+ ;precision (a comprehensive value) (Chuang ) was used to evaluate the performance of the three methods. The results indicate that PcircRNA_finder has a higher sensitivity (74–88%) than either find_circ or CIRCexplorer (each about 20%) and better precision (63–67%) compared to find_circ and CIRCexplorer, (72 and 100%, respectively) in the two test genomes (Supplementary Data). Finally, PcircRNA_finder obtained a significantly higher comprehensive value in the two test plant species (68–76%), compared to the other two methods (each ;<35%). Transcriptomic data were generated from three RNA-Seq libraries (‘RNAase R’, ‘rRNA-’ and ‘polyA’) of rice seedlings (Supplementary Data). ‘RNAase R’ refers to linear mRNAs isolated from the rice seedlings that were degraded by RNAase R treatment (Circle-Seq, Jeck and Sharpless, 2014). CircRNAs in the various samples were predicted using all three circRNA prediction methods. Using PcircRNA_finder, we found 1,113 circRNAs in the RNAase R sample compared to 915 and 933 predicted by find_circ and CIRCexplorer, respectively. Of the circRNAs detected by PcircRNA_finder, 567 were not found using the other prediction programs. We define high-confidence circRNAs as those predicted circRNAs found in common between the ‘RNAase R’ and ‘rRNA-’ libraries, but not present in the ‘polyA’ library. Based on this definition, PcircRNA_finder predicted more high-confidence circRNAs from the rice RNA-Seq data sample (117) than either of the other two methods (104 and 74) (Supplementary Data).

19 in total

1. Exon circularization requires canonical splice signals.

Authors: Stefan Starke; Isabelle Jost; Oliver Rossbach; Tim Schneider; Silke Schreiner; Lee-Hsueh Hung; Albrecht Bindereif
Journal: Cell Rep Date: 2014-12-24 Impact factor: 9.423

2. Exon-intron circular RNAs regulate transcription in the nucleus.

Authors: Zhaoyong Li; Chuan Huang; Chun Bao; Liang Chen; Mei Lin; Xiaolin Wang; Guolin Zhong; Bin Yu; Wanchen Hu; Limin Dai; Pengfei Zhu; Zhaoxia Chang; Qingfa Wu; Yi Zhao; Ya Jia; Ping Xu; Huijie Liu; Ge Shan
Journal: Nat Struct Mol Biol Date: 2015-02-09 Impact factor: 15.369

3. STAR: ultrafast universal RNA-seq aligner.

Authors: Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal: Bioinformatics Date: 2012-10-25 Impact factor: 6.937

4. Detecting and characterizing circular RNAs.

Authors: William R Jeck; Norman E Sharpless
Journal: Nat Biotechnol Date: 2014-05 Impact factor: 54.908

5. Circular RNAs are a large class of animal RNAs with regulatory potency.

Authors: Sebastian Memczak; Marvin Jens; Antigoni Elefsinioti; Francesca Torti; Janna Krueger; Agnieszka Rybak; Luisa Maier; Sebastian D Mackowiak; Lea H Gregersen; Mathias Munschauer; Alexander Loewer; Ulrike Ziebold; Markus Landthaler; Christine Kocks; Ferdinand le Noble; Nikolaus Rajewsky
Journal: Nature Date: 2013-02-27 Impact factor: 49.962

6. Complementary sequence-mediated exon circularization.

Authors: Xiao-Ou Zhang; Hai-Bin Wang; Yang Zhang; Xuhua Lu; Ling-Ling Chen; Li Yang
Journal: Cell Date: 2014-09-18 Impact factor: 41.582

7. NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision.

Authors: Trees-Juen Chuang; Chan-Shuo Wu; Chia-Ying Chen; Li-Yuan Hung; Tai-Wei Chiang; Min-Yu Yang
Journal: Nucleic Acids Res Date: 2015-10-05 Impact factor: 16.971

8. Circular intronic long noncoding RNAs.

Authors: Yang Zhang; Xiao-Ou Zhang; Tian Chen; Jian-Feng Xiang; Qing-Fei Yin; Yu-Hang Xing; Shanshan Zhu; Li Yang; Ling-Ling Chen
Journal: Mol Cell Date: 2013-09-12 Impact factor: 17.970

9. TopHat-Fusion: an algorithm for discovery of novel fusion transcripts.

Authors: Daehwan Kim; Steven L Salzberg
Journal: Genome Biol Date: 2011-08-11 Impact factor: 13.583

10. Transcriptome-wide investigation of circular RNAs in rice.

Authors: Tingting Lu; Lingling Cui; Yan Zhou; Chuanrang Zhu; Danlin Fan; Hao Gong; Qiang Zhao; Congcong Zhou; Yan Zhao; Danfeng Lu; Jianghong Luo; Yongchun Wang; Qilin Tian; Qi Feng; Tao Huang; Bin Han
Journal: RNA Date: 2015-10-13 Impact factor: 4.942

20 in total

1. Characterization and Cloning of Grape Circular RNAs Identified the Cold Resistance-Related Vv-circATS1.

Authors: Zhen Gao; Jing Li; Meng Luo; Hui Li; Qiuju Chen; Lei Wang; Shiren Song; Liping Zhao; Wenping Xu; Caixi Zhang; Shiping Wang; Chao Ma
Journal: Plant Physiol Date: 2019-04-08 Impact factor: 8.340

Review 2. A narrative review of circular RNAs as potential biomarkers and therapeutic targets for cardiovascular diseases.

Authors: Chi Liu; Nan Li; Guifeng Dai; Omer Cavdar; Hong Fang
Journal: Ann Transl Med Date: 2021-04

3. An Antisense Circular RNA Regulates Expression of RuBisCO Small Subunit Genes in Arabidopsis.

Authors: He Zhang; Shuai Liu; Xinyu Li; Lijuan Yao; Hongyang Wu; František Baluška; Yinglang Wan
Journal: Front Plant Sci Date: 2021-05-24 Impact factor: 5.753

4. Efficient deletion of multiple circle RNA loci by CRISPR-Cas9 reveals Os06circ02797 as a putative sponge for OsMIR408 in rice.

Authors: Jianping Zhou; Mingzhu Yuan; Yuxin Zhao; Quan Quan; Dong Yu; Han Yang; Xu Tang; Xuhui Xin; Guangze Cai; Qian Qian; Yiping Qi; Yong Zhang
Journal: Plant Biotechnol J Date: 2021-01-28 Impact factor: 9.803

5. Identification of Circular RNAs by Multiple Displacement Amplification and Their Involvement in Plant Development.

Authors: Ashirbad Guria; Priyanka Sharma; Sankar Natesan; Gopal Pandi
Journal: Methods Mol Biol Date: 2021

6. Computational Analysis of Transposable Elements and CircRNAs in Plants.

Authors: Liliane Santana Oliveira; Andressa Caroline Patera; Douglas Silva Domingues; Danilo Sipoli Sanches; Fabricio Martins Lopes; Pedro Henrique Bugatti; Priscila Tiemi Maeda Saito; Vinicius Maracaja-Coutinho; Alan Mitchell Durham; Alexandre Rossi Paschoal
Journal: Methods Mol Biol Date: 2021

7. Transcriptome-wide identification and functional prediction of novel and flowering-related circular RNAs from trifoliate orange (Poncirus trifoliata L. Raf.).

Authors: Ren-Fang Zeng; Jing-Jing Zhou; Chun-Gen Hu; Jin-Zhi Zhang
Journal: Planta Date: 2018-02-07 Impact factor: 4.116