Literature DB >> 30371818

piRBase: a comprehensive database of piRNA sequences.

Jiajia Wang1, Peng Zhang1, Yiping Lu2, Yanyan Li1,3, Yu Zheng1,3, Yunchao Kan4, Runsheng Chen1, Shunmin He1.   

Abstract

PIWI-interacting RNAs are a class of small RNAs that is most abundantly expressed in animal germline. Substantial research is going on to reveal the functions of piRNAs in the epigenetic and post-transcriptional regulation of transposons and genes. To collect and annotate these data, we developed piRBase, a database assisting piRNA functional study. Since its launch in 2014, piRBase has integrated 264 data sets from 21 organisms, and the number of collected piRNAs has reached 173 million. The latest piRBase release (v2.0, 2018) was more focused on the comprehensive annotation of piRNA sequences, as well as the increasing number of piRNAs. In addition, piRBase release v2.0 also contained the potential information of piRNA targets and disease related piRNA. All datasets in piRBase is free to access, and available for browse, search and bulk downloads at http://www.regulatoryrna.org/database/piRNA/.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30371818      PMCID: PMC6323959          DOI: 10.1093/nar/gky1043

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Up to now, various small RNAs can be classified into three types: microRNA (miRNA), endogenous small interfering RNA (siRNA), and PIWI-interacting RNA (piRNA), based on their biogenesis and associated proteins (1). piRNAs are mainly expressed in mammalian germline (2–5), and are generally 24–31 nucleotides in length with 2′-O-methylation at their 3′ ends in most animals (6–8). Compared with miRNAs and siRNAs, piRNAs are much more diverse in sequences. PIWI proteins, which are associated with piRNAs, are also enriched in germline (9–11). piRNAs play various functions in germline and somatic tissues. PIWI/piRNA complex can repress the activity of transposon, which has a high risk of damaging the genome, through post-transcriptional silencing or heterochromatin formation (6,12–20). However, many piRNAs do not match transposon sequences, which implies additional functions of piRNAs (21–23). Recent studies have found that piRNAs can target mRNA by base pairing in mouse (24,25), Drosophila (26,27) and Caenorhabditis elegans (28–32). In silkworm (33) and C. elegans (31), piRNAs are reported to be involved in sex determination through down-regulation of target genes. Besides transposon and mRNA, a large number of lncRNAs are revealed to be mediated by retrotransposon derived piRNAs in mouse late spermatocytes (34). As piRNAs are implicated in transposon and gene regulation, there has been a budding interest in defining the role of piRNAs in human disease (35,36). piRNAs are garnering more and more attention in a variety of cancers. For example, 106 piRNAs are up-regulated and 91 are down-regulated in bladder cancer (37). The abnormal expression of piRNAs are also demonstrated in other types of cancers such as breast cancer (38,39) and gastric cancer (36,40,41). In light of the rapidly increasing studies on piRNA, several piRNA related databases, such as piRNABank (42), piRNAQuest (43), piRNA cluster database (44) and IsopiRBank (45), are generated. However, these databases only contain limited amounts of piRNAs from limited species, and the information of piRNA functions are rarely included. piRBase is the expert database about piRNA included by RNAcentral (46) and is the first database that systematically integrates various piRNA associated data to support piRNA functional analysis. In the latest piRBase release, the number of unique piRNA sequences increased to 173 million, including 21 species. In addition, the piRNA target mRNA records were expanded and potential piRNA target lncRNAs were added in piRBase release v2.0. The information about eight types of cancers (breast, bladder, pancreas, gastric, liver, kidney, myeloma and colorectal cancer) related piRNAs was also added to the new version. We also provided new web tools and improved user interface in piRBase release v2.0.

DATA COLLECTION AND PROCESSING

Based on the first release of piRBase database (47), new datasets and processed piRNA sequences from the literature, supplementary files, and NCBI GEO were collected. We only included sequences that were taken as piRNA or piRNA candidates in corresponding papers. Other piRNA related information was manually extracted from the relevant articles. The contents in piRBase release v2.0 are shown in Figure 1.
Figure 1.

The contents in piRBase release v2.0. piRNA loci, piRNA target sites (mRNAs and lncRNAs) and piRNA related epigenetic data are visualized by UCSC genome browser. Blue boxes: the main updated modules; Red boxes: new modules.

The contents in piRBase release v2.0. piRNA loci, piRNA target sites (mRNAs and lncRNAs) and piRNA related epigenetic data are visualized by UCSC genome browser. Blue boxes: the main updated modules; Red boxes: new modules. Each piRNA was shown with a piRBase name and other related information such as NCBI accession number, organism of origin, sequence, length, the number of papers reporting the piRNA and the number of different methods supporting the piRNA. In the detailed information page, we also provided additional information about the piRNA, such as aliases (NCBI and RNAdb piRNA name), related datasets with tissue and method, genome location, targets and associated cancer. In piRBase, if a piRNA sequence is a subsequence of another piRNA, both of them were considered as different sequences and were assigned distinct piRBase names. As described in the first release of piRBase, all piRNA sequences were mapped to its latest genome using Bowtie (48) with parameter ‘-v 1 -a --best --strata’ in order to obtain the potential origin of every piRNA. piRNAs were referred to as gene- or repeat-related according to the overlapping of piRNA genome loci with RefSeq genes (49) or repeat elements.

DATABASE CONTENT

piRBase aims to provide comprehensive piRNA sequence data, annotation and targets assisting piRNA study. The first release in 2014 contains 77 million piRNA sequences from nine organisms. piRNAs in piRBase are marked as repeat- and gene-related according to their loci in genome, which imply involvement in the regulation of the corresponding elements. In additional, reported piRNA targets and piRNA related epigenetic data were collected. The information of piRNA loci, piRNA target sites, piRNA related epigenetic data, and some basic annotations like RepeatMasker annotations (50) and RefSeq genes are visualized in the UCSC genome browser (50,51) in piRBase release v1.0. Since piRBase release in 2014, we collected more piRNA related data and released in v2.0.

New piRNA records

In present release of piRBase, the number of unique piRNA sequences reaches 173 million, including 264 datasets from 21 species. The number is greatly increased in this version compared with the first release (47) and other piRNA related databases (Table 1). In order to assess credibility of piRNAs in piRBase, we also offered two evaluation score. One score is the number of papers which detected certain piRNA, and the other score is the number of different methods used to detect the sequence. These methods including protein IP, protein CLIP, chromatography, oxidization and small RNA-Seq.
Table 1.

The comparison of piRBase and other piRNA databases

piRBase
SpeciespiRNABankpiRNAQuestIsopiRBankcRelease v1.0Release v2.0
Human35 35641 7494 564 08032 8268 438 265
Mouse55 359890 07838 093 00351 664 76968 054 594
Rat46 44466 75863 1824 081 625
D. melanogaster 44 417a15 058 09321 027 41941 950 613
C. elegans 28 21928 219
Zebrafish6 367 8111 330 6921 330 692
Chicken508 4371 716 795
X. tropicalis 6 142 9046 142 904
Silkworm1 174 9631 253 987
Starlet sea anemone11 811
Cow19 528 252
Crab-eating macaque5 819 946
Rhesus6 514
Marmoset8 311 210
Sea hare499
Tree shrew65 810
Pig1 301 866
D.erecta 1 091 318
D.yakuba 1 069 918
D.virilis 907 289
Rabbit2 816 864
Platypus51b

aThe number of unique piRNAs may be less than shown.

bThe number of unique piRNAs comes from statistic of download file.

cThe number of canonical piRNAs is shown.

The comparison of piRBase and other piRNA databases aThe number of unique piRNAs may be less than shown. bThe number of unique piRNAs comes from statistic of download file. cThe number of canonical piRNAs is shown.

piRNA targets

Many studies have found that piRNAs can regulate the expression of mRNAs and lncRNAs. In the previous version, mRNAs regulated by piRNAs in fly and mouse were included. In this version, new mRNA targets (for silkworm and C. elegans) extracted from the literature were added. Moreover, we predicted piRNA target lncRNAs using the same way as predicting piRNA target mRNA cleavage (25) in mouse testis. In the web page of piRNA target table, we provided the piRBase name linking to detailed information of piRNA, target gene/transcript ID, target site if provided, and other related information.

piRNA and cancer

From piRNA and cancer related literature, we manually collected cancer related piRNAs. In the piRBase release v2.0, eight types of cancers (breast, bladder, pancreas, gastric, liver, kidney, myeloma and colorectal cancer) related piRNAs were included with the information of piRNA name, piRNA expression (up-regulated, down-regulated and fold change) and functional description in cancer tissues or cell lines.

New tools

piRBase release v2.0 added ‘Tools’ menu in navigation bar. We incorporated pirnaPre (52) for users to predict piRNA targets. In addition, some online tools were integrated under the ‘Tools’ menu. ‘Name Convert’ enabled users to convert a list of NCBI or RNAdb piRNA names and accession numbers to piRBase names. The users can also retrieve piRBase names by a list of piRNA sequences using ‘Seq To Name’. Other existing tools such as ‘Reverse Complement’ and ‘Bowtie Online’ tool are also incorporated in the ‘Tools’ menu. Beyond that, we also included the links of others tools for predicting piRNA-targeting sites or identifying piRNAs, such as pirScan (53) and 2L-piRNA (54).

Visualization using UCSC genome browser

In the current piRBase, the latest genome versions such as hg38 and mm10 were added in the UCSC genome browser. Many of the information in the present piRBase release v2.0, such as piRNA loci, piRNA target sites (mRNAs and lncRNAs) and piRNA related epigenetic data, have be visualized using UCSC genome browser. Moreover, we have added about 400 tracks of all related data from current version of piRBase in UCSC genome browser and users can choose to display content they are interested in.

Interface improvement

The web interface of the present piRBase database has been improved for better browse and search experience. The piRNA targets and related cancers are added in piRNA detailed information page. In the piRNA target page, users can browse and search the information by selecting organism, gene symbol and transcript ID. In piRNA search page, two more search options are added in piRNA sequence search. ‘SubStr’ can search all sub-sequences of input sequence. ‘Extend 3′ can search 3′ extended piRNAs. On the basis of download page in the first release, new version data were included. We provided all piRBase data in both release v1.0 and v2.0 for downloading. In addition, users can also download the related data through the ‘Export’ button in corresponding pages.

CONCLUSION

piRBase is the first database that systematically integrates various types of data to support piRNA functional analysis. Compared with the first release in 2014, piRBase release v2.0 is more comprehensive. The total numbers of unique piRNAs and species have been increased. Besides piRNA target mRNAs, the current piRBase also included target lncRNAs. In light of piRNAs as potential biomarkers in cancers, the information of piRNA and cancer was added to the current release. More piRNA related epigenetic data were also added in piRBase UCSC genome browser. In addition, piRBase release v2.0 added new web tools and improved the user interface. As piRNA studies expanded rapidly, we will update piRBase with more related information supporting piRNA functional analysis.
  54 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

Review 2.  The Argonaute family: tentacles that reach into RNAi, developmental control, stem cell maintenance, and tumorigenesis.

Authors:  Michelle A Carmell; Zhenyu Xuan; Michael Q Zhang; Gregory J Hannon
Journal:  Genes Dev       Date:  2002-11-01       Impact factor: 11.361

3.  A novel class of small RNAs bind to MILI protein in mouse testes.

Authors:  Alexei Aravin; Dimos Gaidatzis; Sébastien Pfeffer; Mariana Lagos-Quintana; Pablo Landgraf; Nicola Iovino; Patricia Morris; Michael J Brownstein; Satomi Kuramochi-Miyagawa; Toru Nakano; Minchen Chien; James J Russo; Jingyue Ju; Robert Sheridan; Chris Sander; Mihaela Zavolan; Thomas Tuschl
Journal:  Nature       Date:  2006-06-04       Impact factor: 49.962

4.  A germline-specific class of small RNAs binds mammalian Piwi proteins.

Authors:  Angélique Girard; Ravi Sachidanandam; Gregory J Hannon; Michelle A Carmell
Journal:  Nature       Date:  2006-06-04       Impact factor: 49.962

5.  A distinct small RNA pathway silences selfish genetic elements in the germline.

Authors:  Vasily V Vagin; Alla Sigova; Chengjian Li; Hervé Seitz; Vladimir Gvozdev; Phillip D Zamore
Journal:  Science       Date:  2006-06-29       Impact factor: 47.728

6.  A novel class of small RNAs in mouse spermatogenic cells.

Authors:  Shane T Grivna; Ergin Beyret; Zhong Wang; Haifan Lin
Journal:  Genes Dev       Date:  2006-06-09       Impact factor: 11.361

7.  Characterization of the piRNA complex from rat testes.

Authors:  Nelson C Lau; Anita G Seto; Jinkuk Kim; Satomi Kuramochi-Miyagawa; Toru Nakano; David P Bartel; Robert E Kingston
Journal:  Science       Date:  2006-06-15       Impact factor: 47.728

8.  Specific association of Piwi with rasiRNAs derived from retrotransposon and heterochromatic regions in the Drosophila genome.

Authors:  Kuniaki Saito; Kazumichi M Nishida; Tomoko Mori; Yoshinori Kawamura; Keita Miyoshi; Tomoko Nagami; Haruhiko Siomi; Mikiko C Siomi
Journal:  Genes Dev       Date:  2006-08-01       Impact factor: 11.361

9.  Mili, a mammalian member of piwi family gene, is essential for spermatogenesis.

Authors:  Satomi Kuramochi-Miyagawa; Tohru Kimura; Takashi W Ijiri; Taku Isobe; Noriko Asada; Yukiko Fujita; Masahito Ikawa; Naomi Iwai; Masaru Okabe; Wei Deng; Haifan Lin; Yoichi Matsuda; Toru Nakano
Journal:  Development       Date:  2004-01-21       Impact factor: 6.868

10.  miwi, a murine homolog of piwi, encodes a cytoplasmic protein essential for spermatogenesis.

Authors:  Wei Deng; Haifan Lin
Journal:  Dev Cell       Date:  2002-06       Impact factor: 12.270

View more
  69 in total

1.  RNAdetector: a free user-friendly stand-alone and cloud-based system for RNA-Seq data analysis.

Authors:  Alessandro La Ferlita; Salvatore Alaimo; Sebastiano Di Bella; Emanuele Martorana; Georgios I Laliotis; Francesco Bertoni; Luciano Cascione; Philip N Tsichlis; Alfredo Ferro; Roberta Bosotti; Alfredo Pulvirenti
Journal:  BMC Bioinformatics       Date:  2021-06-03       Impact factor: 3.169

Review 2.  Short RNA regulators: the past, the present, the future, and implications for precision medicine and health disparities.

Authors:  Isidore Rigoutsos; Eric Londin; Yohei Kirino
Journal:  Curr Opin Biotechnol       Date:  2019-07-16       Impact factor: 9.740

Review 3.  Computational Methods and Online Resources for Identification of piRNA-Related Molecules.

Authors:  Yajun Liu; Aimin Li; Guo Xie; Guangming Liu; Xinhong Hei
Journal:  Interdiscip Sci       Date:  2021-04-22       Impact factor: 2.233

4.  Database Resources of the National Genomics Data Center in 2020.

Authors: 
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

5.  Organ-specific small non-coding RNA responses in domestic (Sudani) ducks experimentally infected with highly pathogenic avian influenza virus (H5N1).

Authors:  Mohamed Samir; Ramon O Vidal; Fatma Abdallah; Vincenzo Capece; Frauke Seehusen; Robert Geffers; Ashraf Hussein; Ahmed A H Ali; Stefan Bonn; Frank Pessler
Journal:  RNA Biol       Date:  2019-10-04       Impact factor: 4.652

6.  Diversification of piRNAs expressed in PGCs and somatic cells during embryonic gonadal development.

Authors:  Odei Barreñada; Daniel Fernández-Pérez; Eduardo Larriba; Miguel Brieño-Enriquez; Jesús Del Mazo
Journal:  RNA Biol       Date:  2020-05-06       Impact factor: 4.652

7.  Specific PIWI-Interacting RNAs and Related Small Noncoding RNAs Are Associated With Ovarian Aging in Ames Dwarf (df/df) Mice.

Authors:  Joseph M Dhahbi; Joe W Chen; Supriya Bhupathy; Hani Atamna; Marcelo B Cavalcante; Tatiana D Saccon; Allancer D C Nunes; Jeffrey B Mason; Augusto Schneider; Michal M Masternak
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2021-08-13       Impact factor: 6.053

8.  Promoter hypermethylation of PIWI/piRNA pathway genes associated with diminished pachytene piRNA production in bovine hybrid male sterility.

Authors:  Gong-Wei Zhang; Ling Wang; Huiyou Chen; Jiuqiang Guan; Yuhui Wu; Jianjun Zhao; Zonggang Luo; Wenming Huang; Fuyuan Zuo
Journal:  Epigenetics       Date:  2020-03-06       Impact factor: 4.528

Review 9.  Piwi-interacting RNAs (piRNAs) and cancer: Emerging biological concepts and potential clinical implications.

Authors:  Wenhao Weng; Hanhua Li; Ajay Goel
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2018-12-30       Impact factor: 10.680

10.  Small Non-coding RNAs Are Dysregulated in Huntington's Disease Transgenic Mice Independently of the Therapeutic Effects of an Environmental Intervention.

Authors:  Celine Dubois; Geraldine Kong; Harvey Tran; Shanshan Li; Terence Y Pang; Anthony J Hannan; Thibault Renoir
Journal:  Mol Neurobiol       Date:  2021-03-06       Impact factor: 5.590

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.