Literature DB >> 30462173

circtools-a one-stop software solution for circular RNA research.

Tobias Jakobi1,2, Alexey Uvarovskii1,2, Christoph Dieterich1,2.   

Abstract

MOTIVATION: Circular RNAs (circRNAs) originate through back-splicing events from linear primary transcripts, are resistant to exonucleases, are not polyadenylated and have been shown to be highly specific for cell type and developmental stage. CircRNA detection starts from high-throughput sequencing data and is a multi-stage bioinformatics process yielding sets of potential circRNA candidates that require further analyses. While a number of tools for the prediction process already exist, publicly available analysis tools for further characterization are rare. Our work provides researchers with a harmonized workflow that covers different stages of in silico circRNA analyses, from prediction to first functional insights.
RESULTS: Here, we present circtools, a modular, Python-based framework for computational circRNA analyses. The software includes modules for circRNA detection, internal sequence reconstruction, quality checking, statistical testing, screening for enrichment of RBP binding sites, differential exon RNase R resistance and circRNA-specific primer design. circtools supports researchers with visualization options and data export into commonly used formats.
AVAILABILITY AND IMPLEMENTATION: circtools is available via https://github.com/dieterich-lab/circtools and http://circ.tools under GPLv3.0. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30462173      PMCID: PMC6596886          DOI: 10.1093/bioinformatics/bty948

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Circular RNAs (circRNAs) were initially discovered in the 1990s, but the novel class of RNAs was first described as ‘scrambled exons’ (Nigro ). Two decades later, new studies employing next generation sequencing discovered a large repertoire of circRNAs in different cell types and provided first hints of potential regulatory functions (Hansen ). CircRNAs originate through back-splicing events from linear primary transcripts, are resistant to exonucleases, typically not polyadenylated and are highly specific for cell type and developmental stage. Detection of circRNAs is usually based on the existence of chimeric reads that cover the back splice junction (BSJ) of the circRNA, i.e. the position where the 3’ tail of the linear RNA molecule is fused with the 5’ head to form a circle. To increase the sensitivity for circRNAs in RNA-seq experiments, the CircleSeq protocol was developed based on the circRNAs resistance to exonuclease (RNase R) treatment (Jeck ). While circRNA detection tools are available (Gao and Zhao, 2018), workflows for automated analyses including functionality, such as structural analyses or first functional predictions, are still rare.

2 Materials and methods

circtools currently offers seven modules shown in Figure 1A. Detection of circRNAs from RNA-seq reads mapped by STAR (Dobin ) is based on the DCC software [detect, Cheng ] and generally the first step in the analysis workflow since the TSV-formatted data files generated here are required for subsequent analyses. Briefly, the detection step produces raw counts for circRNAs by exploiting reads that cover the BSJ and additionally generates count tables for the linear host genes. In a second step, these raw counts can be combined with log files of the STAR aligner within the quickcheck module. The diagnostic diagrams generated by the module also show the dramatic effect of the CircleSeq protocol on the number of detected circRNAs in HepG2 and K562 cells (Fig. 1B). Depending on the experimental setup and cell type, the initial detection step usually yields several hundreds to thousands of potential circRNA candidates. This set of candidates should be filtered for subsequent analysis steps. The circtest module of circtools employs a beta-binomial model to model changes in circRNA expression relative to that of the host gene (Cheng ). This statistical framework may also be used to test the set of circRNAs for candidates that show a clear enrichment (P < 0.05) in the RNase R-treated samples compared to the untreated samples (Fig. 1C). One crucial point for functional predictions of circRNAs is knowledge of the actual internal RNA sequence, i.e. which exons or introns are part of the processed circRNA. The reconstruct module, based on the FUCHS software (Metge ), is able to deliver this information by employing longer RNA-seq reads (e.g. 2 × 150 bp paired-end reads). Results of the module include a global analysis of circRNAs as well as per circRNA level information (exon structure shown in Fig. 1D, bottom, light green and purple for HepG2 and K562). Complementary to the reconstruction of circRNAs, circtools also offers the exon module to identify individual exons, which are resistant to RNase R digestion (i.e. CircleSeq experiments, Fig. 1D, middle, orange bars). Given the RNA sequence of circRNA candidates, further analyses are possible. The enrich module of circtools can be used to screen a set of circRNAs for significant enrichment of selected sequence features (provided via BED file). One typical use case is an enrichment screen for eCLIP peaks in circRNAs (Fig. 1E, Supplementary Fig. S4), which could be indicative of RNA-protein binding events, which are relevant for circRNA function. Prioritized in silico circRNA candidates should be verified by experiment. A quantitative validation experiment is typically performed using qRT-PCR. Herein, specific primer pairs are required to ensure that only circular target transcripts get amplified. While most primer design tools are not intended for this specific use case, circtools incorporates a primer design tool able to retrieve well-suited, circRNA-specific primers and additionally generates a graphical representation of the circRNA, the designed primers and the expected product (Fig. 1F, Supplementary Fig. S5). circtools is implemented in Python 3 (tested with 3.4, 3.5 and 3.6) and has been tested on major Linux distributions. The software is straightforward to install via pip3 install circtools and only requires a working R installation. Required Python and R packages are automatically installed. While much of circtools core functionality is implemented in Python, most plotting functions have been implemented in R. New modules can be easily added to the software by extending the circtools Python base class. We intend to add more functionality in the future in order to provide a comprehensive bioinformatics toolbox and we also encourage researches to contribute modules to circtools.
Fig. 1.

Overview of the circtools software showing different output visualizations. (A) General workflow of circtools. (B) Initial quality check after read mapping and circRNA detection. (C) Relative enrichment of circN4BP2L2 in four samples. (D) Visualization of BED tracks produced by circtools; circRNA predictions (green), differentially spliced exons for K562/HepG2 cell lines (orange), reconstructed circRNAs for both cell lines (light green, purple). (E) Enrichment of different RBP binding sites within circN4BP2L2. (F) Visualization of a automatically designed primer pair (green) bracketing the back-splice junction (black line) that separates the two fused exons (orange, red)

Overview of the circtools software showing different output visualizations. (A) General workflow of circtools. (B) Initial quality check after read mapping and circRNA detection. (C) Relative enrichment of circN4BP2L2 in four samples. (D) Visualization of BED tracks produced by circtools; circRNA predictions (green), differentially spliced exons for K562/HepG2 cell lines (orange), reconstructed circRNAs for both cell lines (light green, purple). (E) Enrichment of different RBP binding sites within circN4BP2L2. (F) Visualization of a automatically designed primer pair (green) bracketing the back-splice junction (black line) that separates the two fused exons (orange, red)

3 Discussion

circtools provides a well-tested, harmonized workflow for state-of-the-art circRNA research. The software covers different aspects in this endeavor: It performs initial quality checks, detects and reconstructs circRNAs, tests for host gene-independent expression, screens for enriched sequence features (i.e. RBP sites), supports the design of primers for qRT-PCR verification and visualizes and exports all analyses results. A complete experimental workflow and detailed methods are described in the online documentation and the Supplementary Material. Click here for additional data file.
  7 in total

1.  Scrambled exons.

Authors:  J M Nigro; K R Cho; E R Fearon; S E Kern; J M Ruppert; J D Oliner; K W Kinzler; B Vogelstein
Journal:  Cell       Date:  1991-02-08       Impact factor: 41.582

2.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

3.  Specific identification and quantification of circular RNAs from sequencing data.

Authors:  Jun Cheng; Franziska Metge; Christoph Dieterich
Journal:  Bioinformatics       Date:  2015-11-09       Impact factor: 6.937

Review 4.  Computational Strategies for Exploring Circular RNAs.

Authors:  Yuan Gao; Fangqing Zhao
Journal:  Trends Genet       Date:  2018-01-12       Impact factor: 11.639

5.  Circular RNAs are abundant, conserved, and associated with ALU repeats.

Authors:  William R Jeck; Jessica A Sorrentino; Kai Wang; Michael K Slevin; Christin E Burd; Jinze Liu; William F Marzluff; Norman E Sharpless
Journal:  RNA       Date:  2012-12-18       Impact factor: 4.942

6.  Natural RNA circles function as efficient microRNA sponges.

Authors:  Thomas B Hansen; Trine I Jensen; Bettina H Clausen; Jesper B Bramsen; Bente Finsen; Christian K Damgaard; Jørgen Kjems
Journal:  Nature       Date:  2013-02-27       Impact factor: 49.962

7.  FUCHS-towards full circular RNA characterization using RNAseq.

Authors:  Franziska Metge; Lisa F Czaja-Hasse; Richard Reinhardt; Chistoph Dieterich
Journal:  PeerJ       Date:  2017-02-28       Impact factor: 2.984

  7 in total
  15 in total

1.  Bioinformatic Analysis of Circular RNA Expression.

Authors:  Enrico Gaffo; Alessia Buratin; Anna Dal Molin; Stefania Bortoluzzi
Journal:  Methods Mol Biol       Date:  2021

Review 2.  The expanding regulatory mechanisms and cellular functions of circular RNAs.

Authors:  Ling-Ling Chen
Journal:  Nat Rev Mol Cell Biol       Date:  2020-05-04       Impact factor: 94.444

Review 3.  Insights into circular RNAs: their biogenesis, detection, and emerging role in cardiovascular disease.

Authors:  Zoe Ward; John Pearson; Sebastian Schmeier; Vicky Cameron; Anna Pilbrow
Journal:  RNA Biol       Date:  2021-03-28       Impact factor: 4.652

Review 4.  Non-Coding RNAs in Tuberculosis Epidemiology: Platforms and Approaches for Investigating the Genome's Dark Matter.

Authors:  Ahmad Almatroudi
Journal:  Int J Mol Sci       Date:  2022-04-17       Impact factor: 6.208

Review 5.  Disease-Associated Circular RNAs: From Biology to Computational Identification.

Authors:  Min Tang; Ling Kui; Guanyi Lu; Wenqiang Chen
Journal:  Biomed Res Int       Date:  2020-08-17       Impact factor: 3.411

6.  CircCode: A Powerful Tool for Identifying circRNA Coding Ability.

Authors:  Peisen Sun; Guanglin Li
Journal:  Front Genet       Date:  2019-10-10       Impact factor: 4.599

7.  Ularcirc: visualization and enhanced analysis of circular RNAs via back and canonical forward splicing.

Authors:  David T Humphreys; Nicolas Fossat; Madeleine Demuth; Patrick P L Tam; Joshua W K Ho
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

8.  circRNAprofiler: an R-based computational framework for the downstream analysis of circular RNAs.

Authors:  Simona Aufiero; Yolan J Reckman; Anke J Tijsen; Yigal M Pinto; Esther E Creemers
Journal:  BMC Bioinformatics       Date:  2020-04-29       Impact factor: 3.169

9.  CIRCexplorer3: A CLEAR Pipeline for Direct Comparison of Circular and Linear RNA Expression.

Authors:  Xu-Kai Ma; Meng-Ran Wang; Chu-Xiao Liu; Rui Dong; Gordon G Carmichael; Ling-Ling Chen; Li Yang
Journal:  Genomics Proteomics Bioinformatics       Date:  2020-01-03       Impact factor: 7.691

10.  Docker4Circ: A Framework for the Reproducible Characterization of circRNAs from RNA-Seq Data.

Authors:  Giulio Ferrero; Nicola Licheri; Lucia Coscujuela Tarrero; Carlo De Intinis; Valentina Miano; Raffaele Adolfo Calogero; Francesca Cordero; Michele De Bortoli; Marco Beccuti
Journal:  Int J Mol Sci       Date:  2019-12-31       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.