| Literature DB >> 30335166 |
Hongwei Wang1, Ludong Yang1, Yan Wang1, Leshi Chen2, Huihui Li1, Zhi Xie1.
Abstract
RPFdb (http://www.rpfdb.org or http://sysbio.sysu.edu.cn/rpfdb) is a public database for hosting, analyzing and visualizing ribosome profiling (ribo-seq) data. Since its initial release in 2015, the amount of new ribo-seq data has been considerably enlarged with the increasing popularity of ribo-seq technique. Here, we describe an updated version, RPFdb v2.0, which brings significant data expansion, feature improvements, and functionality optimization: (i) RPFdb v2.0 currently hosts 2884 ribo-seq datasets from 293 studies, covering 29 different species, in comparison with 777 datasets from 82 studies and 8 species in the previous version; (ii) A refined analysis pipeline with multi-step quality controls has been applied to improve the pre-processing and alignment of ribo-seq data; (iii) New functional modules have been added to provide actively translated open reading frames (ORFs) information for each ribo-seq data; (iv) More features have been made available to increase database usability. With these additions and enhancements, RPFdb v2.0 will represent a more valuable and comprehensive database for the gene regulation community.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30335166 PMCID: PMC6324049 DOI: 10.1093/nar/gky978
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison between RPFdb v2.0 and v1.0
| RPFdb v1.0 | RPFdb v2.0 | |
|---|---|---|
|
| ||
| Data source | SRA | SRA, ENA, DDBJ |
| No. of datasets | 777 | 2884 |
| No. of studies | 82 | 293 |
| No. of species | 8 | 29 |
|
| ||
|
| • Quality control | • Quality control |
| • Keep the first 26 nucleotides of each sequencing read | • Adapter removal | |
| • Low-quality sequence trimming | ||
| • rRNA and tRNA filtration | ||
| • Read-length selection (25–34 nt) | ||
|
| ||
| Browse | • Study browser(overview of dataset-meta description and summary statistics) | • Study browser (overview of dataset-meta description, summary statistics, and quality control assessment) |
| • ORF browser (overview and detailed annotation information on actively translated ORFs) | ||
| Search and visualization | • RPKM values | • RPKM values |
| • Footprint coverage at different genomic regions | • ORF entry | |
| • Footprint coverage at different genomic regions | ||
| Download | • RPKM table | • Raw read count table |
| • RPKM table | ||
| • ORF annotation table | ||
|
| ||
| Applicability | • Desktop computers | • Desktop computers |
| • Mobile devices |
Figure 1.Overview of RPFdb v2.0. (A) Functional modules in RPFdb v2.0. (B) The number of ribo-seq datasets in species. (C) The number of studies in species.