| Literature DB >> 25352543 |
Anton I Petrov, Simon J E Kay, Richard Gibson, Eugene Kulesha, Dan Staines, Elspeth A Bruford, Mathew W Wright, Sarah Burge, Robert D Finn, Paul J Kersey, Guy Cochrane, Alex Bateman, Sam Griffiths-Jones, Jennifer Harrow, Patricia P Chan, Todd M Lowe, Christian W Zwieb, Jacek Wower, Kelly P Williams, Corey M Hudson, Robin Gutell, Michael B Clark, Marcel Dinger, Xiu Cheng Quek, Janusz M Bujnicki, Nam-Hai Chua, Jun Liu, Huan Wang, Geir Skogerbø, Yi Zhao, Runsheng Chen, Weimin Zhu, James R Cole, Benli Chai, Hsien-Da Huang, His-Yuan Huang, J Michael Cherry, Artemis Hatzigeorgiou, Kim D Pruitt.
Abstract
The field of non-coding RNA biology has been hampered by the lack of availability of a comprehensive, up-to-date collection of accessioned RNA sequences. Here we present the first release of RNAcentral, a database that collates and integrates information from an international consortium of established RNA sequence databases. The initial release contains over 8.1 million sequences, including representatives of all major functional classes. A web portal (http://rnacentral.org) provides free access to data, search functionality, cross-references, source code and an integrated genome browser for selected species.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25352543 PMCID: PMC4384043 DOI: 10.1093/nar/gku991
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Expert Databases from which sequence data are already incorporated into RNAcentral
| Database name | Description | URL |
|---|---|---|
| ENA | European Nucleotide Archive; provides the
complete set of ncRNA sequence data reported by the scientific community to the
databases of the International Nucleotide Sequence Database Collaboration (INSDC;
( |
|
| Rfam | Database of ncRNA families and
|
|
| RefSeq | A comprehensive, non-redundant,
well-annotated set of reference sequences including genes and transcripts ( |
|
| VEGA | Database of vertebrate gene annotation that
provides a high-quality set of lncRNAs produced by manual annotation ( |
|
| gtRNAdb | Contains tRNA gene predictions on complete
or nearly complete genomes from a broad range of species ( |
|
| miRBase | miRBase is a database of published microRNA
sequences and annotations ( |
|
| RDP | Provides a quality-controlled, aligned and
annotated set of small ribosomal subunit RNA sequences ( |
|
| tmRNA Website | Provides information about the tmRNA
molecule found in bacteria and some organelles ( |
|
| SRPDB | Provides information about the signal
recognition particle RNA molecule ( |
|
| lncRNAdb | Provides comprehensive information about
experimentally characterized lncRNA molecules ( |
|
Figure 1.RNAcentral architecture. Block arrows indicate data flow, while solid arrows indicate web traffic and quality control.
Figure 2.An RNAcentral entry web page for an lncRNA showing the four sections: (1) Overview and description of the RNA sequence, (2) Annotations and cross-references to Expert Databases, (3) Genome browser for mapped sequences, (4) Sequence data.
Figure 3.The species distribution of sequences in RNAcentral.