| Literature DB >> 25425034 |
Peng Zhang1, Xiaohui Si2, Geir Skogerbø1, Jiajia Wang1, Dongya Cui1, Yongxing Li1, Xubin Sun1, Li Liu1, Baofa Sun1, Runsheng Chen3, Shunmin He3, Da-Wei Huang4.
Abstract
piRNAs are a class of small RNAs that is most abundantly expressed in the animal germ line. Presently, substantial research is going on to reveal the functions of piRNAs in the epigenetic and post-transcriptional regulation of transposons and genes. A piRNA database for collection, annotation and structuring of these data will be a valuable contribution to the field, and we have therefore developed the piRBase platform which integrates various piRNA-related high-throughput data. piRBase has the largest collection of piRNAs among existing databases, and contains at present 77 million piRNA sequences from nine organisms. Repeat-derived and gene-derived piRNAs, which possibly participate in the regulation of the corresponding elements, have been given particular attention. Furthermore, epigenetic data and reported piRNA targets were also collected. To our knowledge, this is the first piRNA database that systematically integrates epigenetic and post-transcriptional regulation data to support piRNA functional analysis. We believe that piRBase will contribute to a better understanding of the piRNA functions. Database URL: http://www.regulatoryrna.org/database/piRNA/Entities:
Mesh:
Substances:
Year: 2014 PMID: 25425034 PMCID: PMC4243270 DOI: 10.1093/database/bau110
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Current numbers of unique piRNA sequences in piRBase and other piRNA databases
| Species | piRNABank | piRNAQuest | piRBase |
|---|---|---|---|
| Human | 32194 | 41749 | 32826 |
| Mouse | 72878 | 890078 | 51664769 |
| Rat | 62713 | 66758 | 63182 |
| 44417 | 0 | 21027419 | |
| 0 | 0 | 28219 | |
| Zebrafish | 356550 | 0 | 1330692 |
| Chicken | 0 | 0 | 508437 |
| 0 | 0 | 6142904 | |
| Silkworm | 0 | 0 | 1174963 |
| Platypus | 147 | 0 | 0 |
aThe number of unique piRNAs may be less than shown.
Figure 1.Overview of piRNA sequences in piRBase. (A) The percentage of unique piRNAs from each species in piRBase. (B) The amount of piRNA sequences obtained by different experimental methods. (C) Length distribution of unique piRNA sequences in piRBase. Caenorhabditis elegans piRNAs are not included as all reported sequences are 21 nt long.
Figure 2.Database construction pipeline. The database was constructed in three major steps: manual literature mining, data annotation and analysis and data storage in a MySQL relational database with a Web interface.
List of data supporting the functional analysis in piRBase
| Species | Repeat-/gene-derived piRNAs | Post-transcriptional piRNA target | DNA methylation data | H3K9me3 data |
|---|---|---|---|---|
| Human | Testis | Testis, brain | ||
| Mouse | Testis, germ cell | Germ cell | Testis, germ cell, brain | Germ cell |
| Rat | Testis | |||
| Testis, ovary | Embryo | |||
| Whole worm | ||||
| Zebrafish | Ovary, testis | Testis | ||
| Chicken | Embryo | Testis | ||
| Egg, gastrula | Testis | |||
| Silkworm |
Figure 3.Screenshots of the browse and search pages. (A) The ‘Browse piRNAs’ page. (1) The drop down list box enables the users to browse piRNAs of a specific organism. (2 and 3) Clicking on the links delivers the detailed information page of the corresponding piRNA in piRBase and NCBI, respectively. (B) The ‘Browse Datasets’ page. The datasets can be filtered according to organism (4) and users can also browse piRNAs in a particular dataset (5). To learn more about the datasets, external links are provided (6). (C) In the piRNA ‘Detailed Information’ page, links to piRNA loci in the Genome Browser (7) and the annotations for these positions (8) are given. An online Bowtie tool (9) and a link to UCSC Blat (10) are also available for more alignment results. (D) In the piRNA target search result page, links to the piRNA ‘Detailed Information’ page of piRBase (11) and the target location in the Genome Browser (12) are available. (E and F) The locus information of the piRNAs, H3K9me3 marks, DNA methylation and piRNA target sites are shown in the Genome Browser. RefSeq genes and RepeatMasker annotations are also displayed. Screenshots of the Genome Browser show the piRNA target sites (13) and the H3K9me3 levels at a LINE1 locus (chr3:123735167-123741052 of mm9 genome) in Miwi2 HET and Miwi2 KO spermatogonia (14).