| Literature DB >> 30271890 |
Juan Pablo Tosar1,2, Carlos Rovira3, Alfonso Cayota4,5.
Abstract
PIWI-interacting RNAs (piRNAs) are regarded as the guardians of the genome because they tackle genome stability-threatening transposable elements in the germline. Recently, piRNAs were also reported in other types of cells, including mouse brain, malignant and non-malignant somatic tissues, and human plasma. This suggests that piRNA function might be broader than previously expected. Here, we show that different piRNA databases contain a subset of sequences that correspond to piRNA-sized fragments of ncRNAs (rRNAs, tRNAs, YRNAs, snRNAs, and snoRNAs) and intermediates of miRNA biogenesis. We discuss that the biogenesis of these sequences is probably independent of the PIWI pathway, and can therefore be considered contaminants in piRNA databases. Although a minority of annotated piRNAs falls in this category, they account for the vast majority of piRNA expression in somatic non-gonadal tissues. Since ncRNA fragments are ubiquitous and abundant, their confusion with piRNAs strongly impacts the estimation of piRNA expression outside of mammalian gonads.Entities:
Year: 2018 PMID: 30271890 PMCID: PMC6052916 DOI: 10.1038/s42003-017-0001-7
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1piRNAs in human plasma are non-coding RNA fragments. a Sequence logo showing 1 U bias in piRNA sequences in the RNAdb 2.0 database, and absence of such signature in the subset of sequences matching other known ncRNAs. For simplicity, only the first 15 bases are shown. b Read coverage of YRNAs (RNY4 and RNY1) in human plasma (SRR2496797) and human testis (ERA246774). The red and green bars represent the most and second-most abundant piRNAs found in human plasma in Freedman et al.[18]. The predicted secondary structure of RNY4 is shown, with the RNY4-derived piRNA highlighted in red. c Venn diagrams showing the 171,551 piRNAs in RNAdb 2.0, classified by either their 5′ start base or their identity to other ncRNAs. The piRNAs described in human plasma (blue) and human testis (orange) are overimposed, and enrichment scores (ES) were calculated as described in methods
Overlap between annotated piRNAs in piRNA databases (RNAdb 2.0 and piRBase) and non-coding RNAs
| Database | # piRNAs | rRNA | tRNA | miRNA | snRNA | snoRNA | YRNA | m_tRNA | m_rRNA | TOTAL | % |
|---|---|---|---|---|---|---|---|---|---|---|---|
| MM = 0 | |||||||||||
| RNAdb 2.0 | 171,551 | 111 | 132 | 18 | 5 | 97 | 2 | 13 | 14 |
|
|
| piRBase | 32,826 | 42 | 85 | 13 | 0 | 104 | 8 | 12 | 14 |
|
|
| MM = 1 | |||||||||||
| RNAdb 2.0 | 171,551 | 132 | 478 | 34 | 6 | 114 | 3 | 17 | 17 | 801 | 0.47 |
| piRBase | 32,826 | 47 | 156 | 19 | 0 | 117 | 15 | 14 | 17 | 385 | 1.18 |
MM mismatch allowance, m_tRNA mitochondrial tRNAs, m_rRNA mitochondrial rRNAs
piRNAs present in the hippocampus of mouse brain are mostly ncRNA fragments
| piRNA | Reads | Alternative | Sequence | Reads: MIWI-IP |
|---|---|---|---|---|
| DQ541777 | 16,130 | RNY1 | GGCTGGTCCGAAGGTAGTGAGTTATCTCAA | 1 |
| DQ705026 | 6,257 | snoRNA 2 | CTGAAATGAAGAGAATACTCTTGCTGATC | 0 |
| DQ555094 | 3,439 | rRNA 28S | TGGGGGGCCCAAGTCCTTCTGATCGAGGCCCA | 0 |
| DQ719597 | 2,459 | snoRNA 85 | GGTCGATGATGAGAGCTTTGTTCTGAGC | 0 |
| DQ689086 | 1,514 | snoRNA 27 | TGCAATGATGTCATCTTACTACTGAAA | 0 |
| DQ540285* | 1,433 | rRNA 18S | ATCGATGTGGTGCTCCGGAGTTCTCTTCGGGC | 0 |
| DQ540981 | 1,360 | rRNA 28S | CGGGCCGCCGGTGAAATACCACTACTCTCA | 0 |
| DQ720186 | 849 | miR-3102-3p | 0 | |
| DQ555093 | 775 | rRNA 28S | TGGGGGGCCCAAGTCCTTCTGATCGAGGC | 1 |
| DQ540862 | 639 | snoRNA Z12 | CCGGGTGATGCGAATCGTAATCTGAGCCG | 0 |
| DQ540284* | 635 | rRNA 18S | ATCGATGTGGTGCTCCGGAGTTCTCTTCGGG | 0 |
| DQ541506* | 580 | rRNA 18S | GATCGATGTGGTGCTCCGGAGTTCTCTT | 0 |
| DQ539915 | 304 | CDS: MT_CO1 | AACATTTCCTGGGCCTTTCAGGAATACCACGA | 0 |
| DQ540861 | 252 | snoRNA 104 | CCGGGTGATGCGAATCGTAATCTGAGC | 0 |
| DQ715526 | 207 | snoRNA 17 | CACCAAGATGAGTGGTGCAAATCTGATC | 0 |
| DQ543676* | 182 | rRNA 18S | TCGATGTGGTGCTCCGGAGTTCTCTTCGGGC | 0 |
| DQ722288 | 175 | snoRNA D81 | TTACTTGATGATAGTAAAAGATCTGATG | 0 |
| DQ551351 | 168 | CDS: Fth1 | TGCTTCAACAGTGCTTGAACGGAACCCGGT | 1 |
| DQ550765 | 118 | snRNA U12 | TGCGGGATGCCTGGGTGACGCGATCTGCCCG | 0 |
| DQ708131 | 115 | snoRNA 25 | TATCTGTGAGGATAAGTAACTCTGAGG | 0 |
The analysis corresponds to every mouse piRNA presented in Table 1 of Lee et al.[15]. piRNA accession numbers correspond to the NCBI database. Asterisks denote four piRNAs, which were described as belonging to a large piRNA cluster in chromosome 17 in the referenced report. Our alternative annotation is shown in column 3. Underlined bases correspond to mismatches according to our annotation. Column 5 shows the number of reads corresponding to each piRNA in a Miwi RIP-seq study performed in wild-type adult testis (GEO: GSM822760)
human plasma piRNAs are mostly ncRNA fragments
| piRNA | RPM (mean) | Alternative | Start/type | Sequence | |
|---|---|---|---|---|---|
| PIR54042 27 | 40 (100) | 2,295.39 | RNAY4 | 67 | CCCCCCACTGCTAAATTTGACTGGCT |
| PIR2888 30 | 40 (100) | 1,684.62 | RNAY1 | 1 | GGCTGGTCCGAAGGTAGTGAGTTATCTCA |
| PIR58596 31 | 40 (100) | 1,604.31 | MT_rRNA 16S | 1 | GCTAAACCTAGCCCCAAACCCACTCCACCC |
| PIR43376 32 | 40 (100) | 327.78 | MT_tRNA-Val | 1 | CAGAGTGTAGCTTAACACAAAGCACCCAACT |
| PIR57581 31 | 39 (98) | 312.29 | MT_tRNA-Ser | 1 | GAGAAAGCTCACAAGAACTGCTAACTCATG |
| PIR54043 26 | 40 (100) | 59.41 | RNAY4 | 67 | CCCCCCACTGCTAAATTTGACTGG |
| PIR59288 32 | 33 (82) | 57.53 | tRNA-AlaCGC | 1 | |
| PIR40304 32 | 40 (100) | 44.88 | MT_tRNA-His | 32 | TGAATCTGACAACAGAGGCTTACGACCCCTT |
| PIR41574 31 | 40 (100) | 41.07 | piRNA | Cluster Chr5 | TGAGATGCGGGAGCTCCGGCGCACACACTC |
| PIR227919 21 | 40 (100) | 34.69 | MT_tRNA-Met | 1 | AGTAAGGTCAGCTAA |
| PIR75448 31 | 40 (100) | 33.04 | tRNA-IleAAT | 1 | GGCCGGTTAGCTCAGT |
| PIR45809 31 | 34 (85) | 28.77 | MT_tRNA-Ser | 29 | TGCCCCCATGTCTAACAACATGGCTTTCTC |
| PIR57849 32 | 20 (50) | 26.67 | CDS: MT_CO2 | Antisense | GAGGGCGTGATCATGAAAGGTGATAAGCTCT |
| PIR57322 27 | 36 (90) | 20.9 | CDS: PPP1R3E | Sense, exon | GACAACAACGGCGGCCGTGACTATGC |
| PIR59786 31 | 30 (75) | 18.06 | MT_tRNA-Phe | 3′ END | GTTTAGACGGGCTCACATCACCCCATAAAC |
| PIR37665 28 | 23 (58) | 17.91 | piRNA | Cluster Chr11 | TCCTGTATTTGCCGAATTGTGGTGTTT |
| PIR52755 30 | 35 (88) | 17.16 | CDS: SPATA31D1 | Sense, exon | TGTGCAGAATATTGGTCGAGTTATAAGAG |
| PIR55478 30 | 18 (45) | 17.05 | CDS: C6orf89 | Sense, exon | TTCCAGTGCCGAAGACATTGTCAGTCTGT |
| PIR33872 31 | 33 (82) | 16.74 | CDS: VKORC1L1 | Sense, exon | TCAAGGCTAAATCTGCTCATGTCGCCACTG |
| PIR31112 30 | 34 (85) | 15.99 | MT_tRNA-Met | 3′ END | AAATGTTGGTTATACCCTTCCCGTACTA |
| PIR49916 28 | 31 (78) | 15.85 | piRNA | Cluster Chr6 | TGGGAGTGAAATCAGTGTTTAGGACTA |
| PIR59752 31 | 33 (82) | 13.49 | MT_tRNA-Leu | 1 | GTTAAGATGGCAGAGCCCGGTAATCGCATA |
| PIR59421 32 | 27 (68) | 12.25 | rRNA_28S | 4549 | GGTTAGTTTTACCCTACTGATGATGTGTTGT |
| PIR51124 27 | 20 (50) | 11.73 | MT_tRNA-Glu | 41 | TGGTCGTGGTTGTAGTCCGTGCGAGA |
| PIR1340 31 | 33 (82) | 10.51 | MT_tRNA-Met | 5 | AGGTCAGCTAA |
Mean abundance (RPM) and number of patients (N = 40) in which each piRNA was sequenced were extracted from Freedman et al.[18]. The analysis includes every piRNA in the cited study with an abundance ≥ 10 RPM. piRNA accession numbers correspond to the RNAdb 2.0 database. Our alternative annotation is shown in columns 4-5. Underlined bases correspond to mismatches according to our annotation
The top 20 (most abundant) piRNAs found in human tumors, after analysis of The Cancer Genome Atlas, are miRNAs or ncRNA fragments
| piRNA | ∑ RPM cancer | Alternative | Sequence |
|---|---|---|---|
| FR072386 | 38,911,332 | miR-let-7a-1 | TGAGGTAGTAGGTTGTATAGTTTTAGGGTC |
| FR182987 | 3,814,379 | miR-532-5p | CATGCCTTGAGTGTAGGACCGT |
| FR074386 | 1,093,572 | snoRNA 98 | ATGCAGTGTGGAACACAATGAACTGAAC |
| FR140858 | 1,177,195 | miR-106b | TAAAGTGCTGACAGTGCAGATAGTGGTCCTC |
| FR114004 | 823,121 | snoRNA 1B | TTTCTGTGTGGAATTTGAATATCTGAAA |
| FR075316 | 744,575 | snoRNA 82 | ACCTGATGTTACATTGTAGTGTGCTGATG |
| FR132879 | 635,984 | snoRNA 58A | CTGCAGTGATGACTTTCTTAGGACACCTTTG |
| FR064000 | 407,046 | snoRNA 58B | CTGCGATGATGGCATTTCTTAGGACACCTTTG |
| FR163199 | 301,387 | snoRNA 138 | CATGATACTGTAAACGCTTTCTGATG |
| FR043670 | 268,790 | MT_tRNA-Glu | TGGTCGTGGTTGTAGTCCGTGCGAGAA |
| FR091055 | 185,930 | miR-744-5p | TGCGGGGCTAGGGCTAACAGCA |
| FR190827 | 201,282 | snoRNA 6 | AGGGGCTGAATGAAAATGGCCTTTCTGAAC |
| FR090905 | 168,436 | MT_rRNA 16S | GCTAAACCTAGCCCCAAACCCACTCCA |
| FR016773 | 172,895 | snoRNA 42B | ACTTGTGATGTCTTCAAAGGAACCACTGATG |
| FR136216 | 162,693 | snoRNA 62A | GGGAGATGAAGAGGACAGTGACTGAGAGA |
| FR004819 | 123,249 | snoRNA 114-1 | GACGGTGAATACAGGTCTGGAAGTCTGAGGT |
| FR103462 | 96,101 | snoRNA 98 | AAATGCAGTGTGGAACACAATGAACTGAAC |
| FR089006 | 95,184 | tRNA-Gly | |
| FR026913 | 74,640 | snoRNA 89 | GAGGAATGATGACAAGAAAAGGCCGAA |
| FR019019 | 81,678 | snoRNA 107 | GTTCATGATGACACAGGACCTTGTCTGAAC |
piRNA accession numbers correspond to the Functional RNA database (fRNAdb), as reported by Martinez et al.[17]. Our alternative annotation is shown in column 3. Underlined bases correspond to mismatches according to our annotation
Fig. 2Presence of ncRNA fragments in piRNA databases accounts for piRNA expression in cells where the piRNA pathway is off. A gonadal somatic or germinal cell (blue) produces PIWI pathway-dependent piRNAs (blue RNAs; only the primary piRNA pathway is represented), which are allocated in the 5′ binding pocket of a PIWI-clade protein (orange). ncRNA fragments (red, green) are also present in PIWI immunoprecipitates, but they are not necessarily bona fide piRNAs. When sequencing data are mapped against a piRNA database, ncRNA fragments will be thought as piRNAs, even in the absence of a functional piRNA pathway