| Literature DB >> 22140119 |
Miri Danan1, Schraga Schwartz, Sarit Edelheit, Rotem Sorek.
Abstract
Circular RNA forms had been described in all domains of life. Such RNAs were shown to have diverse biological functions, including roles in the life cycle of viral and viroid genomes, and in maturation of permuted tRNA genes. Despite their potentially important biological roles, discovery of circular RNAs has so far been mostly serendipitous. We have developed circRNA-seq, a combined experimental/computational approach that enriches for circular RNAs and allows profiling their prevalence in a whole-genome, unbiased manner. Application of this approach to the archaeon Sulfolobus solfataricus P2 revealed multiple circular transcripts, a subset of which was further validated independently. The identified circular RNAs included expected forms, such as excised tRNA introns and rRNA processing intermediates, but were also enriched with non-coding RNAs, including C/D box RNAs and RNase P, as well as circular RNAs of unknown function. Many of the identified circles were conserved in Sulfolobus acidocaldarius, further supporting their functional significance. Our results suggest that circular RNAs, and particularly circular non-coding RNAs, are more prevalent in archaea than previously recognized, and might have yet unidentified biological roles. Our study establishes a specific and sensitive approach for identification of circular RNAs using RNA-seq, and can readily be applied to other organisms.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22140119 PMCID: PMC3326292 DOI: 10.1093/nar/gkr1009
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Identification of circular RNA products in RNA-seq data. (A) An RNA-seq cDNA read that maps to the reference DNA in a non-linear, chiastic manner, is a hallmark of circular RNA. (B) Schematic representation of the Sulfolobus solfataricus tRNATrp, which contains a 65 b intron that is cleaved in the process of tRNA maturation and becomes a stable RNA circle. (C) Reads derived from the region around the circularization junction of the excised tRNATrp intron show chiastic mapping to the genome of S. solfataricus P2, exemplifying the power of RNA-seq in circular RNA discovery. Multiple different reads spanning the circularization junction confirm that the observed circular junction is not an amplification artifact. Numbers denote position of the S. solfataricus P2 genome.
Identified circular RNAs in S. solfataricus P2
| Gene | Circle start | Circle end | Circle size | No. of reads RNA- seq | No. of reads RNase R-seq | Norm. RNase R enrich. | Other cRNAs in gene | Conserved in | RT-PCR validated |
|---|---|---|---|---|---|---|---|---|---|
| (I) rRNAs and tRNA introns | |||||||||
| 5S rRNA/SSOr02 | 7 945 | 78 067 | 123 | 95 | 312 | 7.1 | 4 | Yes | No |
| 16S rRNA/SSOr03 | 871 658 | 873 216 | 1559 | 4124 | 4957 | 1.6 | Yes | ||
| 23S rRNA/SSOr04 | 873 334 | 876 429 | 3096 | 2629 | 1605 | 0.8 | Yes | ||
| tRNA-Trp/SSOt04 | 72 767 | 72 831 | 65 | 1148 | 1694 | 0.7 | 1 | Yes | Yes |
| tRNA-Lys/SSOt07 | 138 386 | 138 407 | 22 | 211 | 825 | 91.5 | |||
| tRNA-Met/SSOt11 | 184 817 | 184 841 | 25 | 25 | 240 | 20.9 | |||
| tRNA-Pro/SSOt42 | 898 313 | 898 333 | 21 | 550 | 1482 | 4.1 | |||
| tRNA-Ser/SSOt33 | 640 978 | 641 001 | 24 | 4 | NA | Yes | |||
| (II) Other non-coding RNAs | |||||||||
| C/D box sR106 | 285 707 | 285 760 | 54 | 2 | 17 | 271.5 | 1 | Yes | |
| C/D box Sso-180 | 362 308 | 362 369 | 62 | 4 | 1 | 46.2 | |||
| C/D box sR133 | 442 392 | 442 417 | 26 | 13 | 9 | 1.0 | |||
| C/D box sR102 | 563 241 | 563 296 | 56 | 2 | 3 | 31.1 | Yes | Yes | |
| C/D box Sso-sR8 | 647 783 | 647 833 | 51 | 10 | 9 | 0.9 | |||
| C/D box Sso-sR4 | 666 143 | 666 186 | 44 | 56 | 277 | 4.1 | 5 | Yes | |
| C/D box Sso-sR10 | 794 186 | 794 240 | 55 | 1 | 1 | 1223.5 | Yes | ||
| C/D box Sso-207 | 816 021 | 816 075 | 55 | 1 | 2 | 68.6 | |||
| C/D box SSOs02 | 829 352 | 829 405 | 54 | 2 | 2 | 52.8 | |||
| C/D box Sso-sR12 | 2 189 397 | 2 189 456 | 60 | 2 | 1 | 11.1 | Yes | ||
| C/D box sR105 | 2 237 915 | 2 237 962 | 48 | 2 | 1 | 7.4 | Yes | ||
| H/ACA box sR109 | 595 510 | 595 579 | 70 | 2 | 2 | 15.7 | 1 | Yes | Yes |
| ncRNA | 442 786 | 442 854 | 69 | 3 | 18 | 160.0 | 1 | Yes | |
| ncRNA (within ISC1476) | 722 538 | 722 578 | 41 | 6 | 8 | 1.4 | 2 | ||
| Sso-117 (antisense to ISC1225) | 1 576 633 | 1 576 671 | 39 | 8 | 1 | 0.0 | 1 | ||
| Sso-109 (antisense to ISC1225) | 1 927 228 | 1 927 258 | 31 | 5 | 1 | 0.1 | |||
| 7S rRNA/SSOr01 | 49 977 | 50 023 | 47 | 3 | 5 | 4.5 | Yes | ||
| Sso-214 | 105 148 | 105 181 | 34 | 13 | 2 | 0.2 | |||
| RNase P | 224 732 | 224 765 | 34 | 40 | 8 | 0.2 | 6 | Yes | Yes |
| Sso-83 | 581 818 | 581 860 | 43 | 23 | 2 | 2.3 | 1 | ||
| ncRNA | 1 275 500 | 1 275 567 | 68 | 14 | 66 | 20.9 | 3 | Yes | |
| (III) Possible processing errors | |||||||||
| tRNA pseudouridine synthase/SSO0393 | 343 138 | 343 264 | 127 | 1 | 1 | 167.1 | 6 | ||
| Intergenic region upstream rRNA operon | 871 573 | 871 657 | 85 | 4 | 2 | 1.0 | |||
| Intergenic region between 16S and 23S rRNA operon | 873 215 | 873 331 | 117 | 7 | 8 | 8.9 | Yes | ||
| (IV) Inside ORFs | |||||||||
| hypothetical protein/SSO0389 | 335 563 | 335 635 | 73 | 7 | 2 | 128.8 | |||
| hypothetical protein/SSO0845 | 725 923 | 726 085 | 163 | 2 | 2 | 33.1 | |||
| succinate dehydrogenase subunit/SSO2359 | 2 154 297 | 2 154 322 | 26 | 4 | 1 | 2.5 | |||
| peptide ABC transporter/SSO2619 | 2 385 872 | 2 385 901 | 30 | 10 | 1 | 80.4 | |||
| rubrerythrin (rr)/SSO2642 | 2 404 146 | 2 404 203 | 58 | 1 | 1 | 749.7 |
aGene containing the identified cRNA. Annotation taken either from NCBI accession NC_002754 (51) and from the following refs (25,34,36).
bIn cases of several cRNAs within the same gene, coordinates for dominant cRNA (one supported by most reads) are shown.
cNumber of reads supporting the circularization junction in the total RNA sample.
dNumber of reads supporting the circularization junction in the RNase R treated sample.
eEnrichment in the RNase R sample (Methods).
fNumber of additional cRNAs overlapping the gene, in case of multiple cRNAs per gene.
*Detected in RNase R sample only.
**The cRNA taken for validation was a variant longer than the dominant one, due to the short size (34 bp) of the dominant cRNA (Supplementary Table S4).
Figure 2.Enrichment for circular transcripts in RNA-seq data from RNase R-treated sample. In both panels, top graph presents RNA-seq data from non-treated sample, and bottom graphs present data from RNase R-treated sample, where linear RNAs are depleted. X axis denotes the position on the S. solfataricus P2 genome, with brown arrows denoting genes (right- and left-pointing arrows correspond to genes encoded on the forward and reverse strands, respectively). Red box marks a cRNA region also supported by permuted reads. Y axis denotes RNA-seq coverage of reads fully mapping to the genome. (A) The C/D box RNA sR106 (34) is enriched in the RNase R-treated sample, and its circular formation is independently supported by 2 and 17 permuted reads in total RNA and RNase R-treated samples, respectively. (B) The C/D box RNA sR102 (34) is enriched in the RNase R-treated sample, and its circular formation is independently supported by two and three permuted reads in total RNA and RNase R-treated samples, respectively.
Figure 3.Functional characteristics of genes encompassing circular transcripts identified in this study.
Figure 4.Experimental verification of cRNAs by RT-PCR. (A) Left, RT–PCR results of amplification with outward-directed primers, designed to amplify cRNA; right, RT–PCR results of amplification with inward-directed primers, expected to amplify both linear and cRNA form. Single and multiple arrowheads represent single or double/triple size products, respectively. N/S, non-specific amplification (as verified by direct sequencing). RT–PCR with each primer set was performed on total RNA sample, RNase R-treated sample, and DNA sample, all extracted from S. solfataricus grown to stationary phase on organotrophic medium. (B) RT–PCR for verification of circular RNAs. Arrows indicate outward facing primers (top) and inward facing primers (bottom). Purple line/circle denotes the RNA template, pink line denotes expected PCR product. (C) Double and triple sized products can stem from multiple rounds of RT around a circular RNA template, followed by PCR amplification. Arrows mark illustrative PCR primers. (D) Northern blot analyses of two ncRNAs with circular forms: (M) Size marker, (1) ncRNA found in genomic location 1 275 500–1 275 567, (2) ncRNA found in genomic location 442 786–442 854. Circular forms are indicated by ‘C’, and linear forms by ‘L’.