Li Chen1, Feng Wang2, Emily C Bruggeman2, Chao Li1, Bing Yao2. 1. Department of Health Outcomes Research and Policy, Auburn University, Auburn, AL 36849, USA. 2. Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA.
Abstract
MOTIVATION: Circular RNAs (circRNAs), a class of non-coding RNAs generated from non-canonical back-splicing events, have emerged to play key roles in many biological processes. Though numerous tools have been developed to detect circRNAs from rRNA-depleted RNA-seq data based on back-splicing junction-spanning reads, computational tools to identify critical genomic features regulating circRNA biogenesis are still lacking. In addition, rigorous statistical methods to perform differential expression (DE) analysis of circRNAs remain under-developed. RESULTS: We present circMeta, a unified computational framework for circRNA analyses. circMeta has three primary functional modules: (i) a pipeline for comprehensive genomic feature annotation related to circRNA biogenesis, including length of introns flanking circularized exons, repetitive elements such as Alu elements and SINEs, competition score for forming circulation and RNA editing in back-splicing flanking introns; (ii) a two-stage DE approach of circRNAs based on circular junction reads to quantitatively compare circRNA levels and (iii) a Bayesian hierarchical model for DE analysis of circRNAs based on the ratio of circular reads to linear reads in back-splicing sites to study spatial and temporal regulation of circRNA production. Both proposed DE methods without and with considering host genes outperform existing methods by obtaining better control of false discovery rate and comparable statistical power. Moreover, the identified DE circRNAs by the proposed two-stage DE approach display potential biological functions in Gene Ontology and circRNA-miRNA-mRNA networks that are not able to be detected using existing mRNA DE methods. Furthermore, top DE circRNAs have been further validated by RT-qPCR using divergent primers spanning back-splicing junctions. AVAILABILITY AND IMPLEMENTATION: The software circMeta is freely available at https://github.com/lichen-lab/circMeta. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Circular RNAs (circRNAs), a class of non-coding RNAs generated from non-canonical back-splicing events, have emerged to play key roles in many biological processes. Though numerous tools have been developed to detect circRNAs from rRNA-depleted RNA-seq data based on back-splicing junction-spanning reads, computational tools to identify critical genomic features regulating circRNA biogenesis are still lacking. In addition, rigorous statistical methods to perform differential expression (DE) analysis of circRNAs remain under-developed. RESULTS: We present circMeta, a unified computational framework for circRNA analyses. circMeta has three primary functional modules: (i) a pipeline for comprehensive genomic feature annotation related to circRNA biogenesis, including length of introns flanking circularized exons, repetitive elements such as Alu elements and SINEs, competition score for forming circulation and RNA editing in back-splicing flanking introns; (ii) a two-stage DE approach of circRNAs based on circular junction reads to quantitatively compare circRNA levels and (iii) a Bayesian hierarchical model for DE analysis of circRNAs based on the ratio of circular reads to linear reads in back-splicing sites to study spatial and temporal regulation of circRNA production. Both proposed DE methods without and with considering host genes outperform existing methods by obtaining better control of false discovery rate and comparable statistical power. Moreover, the identified DE circRNAs by the proposed two-stage DE approach display potential biological functions in Gene Ontology and circRNA-miRNA-mRNA networks that are not able to be detected using existing mRNA DE methods. Furthermore, top DE circRNAs have been further validated by RT-qPCR using divergent primers spanning back-splicing junctions. AVAILABILITY AND IMPLEMENTATION: The software circMeta is freely available at https://github.com/lichen-lab/circMeta. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: J M Nigro; K R Cho; E R Fearon; S E Kern; J M Ruppert; J D Oliner; K W Kinzler; B Vogelstein Journal: Cell Date: 1991-02-08 Impact factor: 41.582
Authors: Josh N Vo; Marcin Cieslik; Yajia Zhang; Sudhanshu Shukla; Lanbo Xiao; Yuping Zhang; Yi-Mi Wu; Saravana M Dhanasekaran; Carl G Engelke; Xuhong Cao; Dan R Robinson; Alexey I Nesvizhskii; Arul M Chinnaiyan Journal: Cell Date: 2019-02-07 Impact factor: 41.582
Authors: Daryl J Thomas; Kate R Rosenbloom; Hiram Clawson; Angie S Hinrichs; Heather Trumbower; Brian J Raney; Donna Karolchik; Galt P Barber; Rachel A Harte; Jennifer Hillman-Jackson; Robert M Kuhn; Brooke L Rhead; Kayla E Smith; Archana Thakkapallayil; Ann S Zweig; David Haussler; W James Kent Journal: Nucleic Acids Res Date: 2006-12-13 Impact factor: 16.971