| Literature DB >> 17997596 |
Doron Betel1, Robert Sheridan, Debora S Marks, Chris Sander.
Abstract
The recent discovery of a new class of 30-nucleotide long RNAs in mammalian testes, called PIWI-interacting RNA (piRNA), with similarities to microRNAs and repeat-associated small interfering RNAs (rasiRNAs), has raised puzzling questions regarding their biogenesis and function. We report a comparative analysis of currently available piRNA sequence data from the pachytene stage of mouse spermatogenesis that sheds light on their sequence diversity and mechanism of biogenesis. We conclude that (i) there are at least four times as many piRNAs in mouse testes than currently known; (ii) piRNAs, which originate from long precursor transcripts, are generated by quasi-random enzymatic processing that is guided by a weak sequence signature at the piRNA 5'ends resulting in a large number of distinct sequences; and (iii) many of the piRNA clusters contain inverted repeats segments capable of forming double-strand RNA fold-back segments that may initiate piRNA processing analogous to transposon silencing.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17997596 PMCID: PMC2065894 DOI: 10.1371/journal.pcbi.0030222
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Sequence and Cluster Overlaps between Datasets A, B, and C
Although the three studies identified the same piRNA clusters, they are distinct at the level of piRNA sequences.
(A) View of Chromosome 5 piRNA sequences and clusters from datasets A, B, C. Top panel (1) is the karyotype view with cluster positions of the datasets: A (green lines), B (top purple triangles), and C (bottom yellow triangles). Lower panels (2–5) are magnified views of the sequences and cluster locations from the three datasets. Top three tracks in each panel are the sequence locations from datasets C (yellow), B (purple), and A (green), and lower three tracks are the cluster positions in the same color scheme. The Venn diagram of the cluster overlaps (B) shows a good agreement between the datasets while sequence overlaps, using 95% identity measure, are small (C). Note that the number of piRNAs used in this comparison is different from the number of sequences reported in the original studies (see Methods and Table S1).
Comparison of microRNAs and piRNAs
Figure 2Currently Known and Estimated Total Number of piRNAs
We estimate that the total number of piRNAs in mouse testes is ∼2 × 105 (red), roughly four times the number of currently known piRNAs (blue). The estimated number of piRNAs corresponds to ∼23% of all “U” positions (green) or 5%–6% of all nucleotides (yellow) in piRNA clusters.
Figure 3A Model of piRNA Biogenesis
piRNAs originate from long RNA precursors transcribed from a small number of genomic regions (A). Some clusters contain inverted repeats that can potentially form dsRNA fold-back structures. In this genomic view of a cluster chr2: 150870000–150910000 (B), the inverted repeats are represented as linked colored bars. These inverted repeats originate from inverted LINE transposable elements that flank the piRNA cluster (red and blue bars in the LINE track). A long transcript containing the pair of inverted LINE elements can potentially form a precursor with a dsRNA segment (C). piRNAs are processed by a quasi-random mechanism with a weak sequence preference near the 5′ U that is most pronounced in frequent clones (D).