Literature DB >> 27543076

RNALocate: a resource for RNA subcellular localizations.

Ting Zhang1, Puwen Tan1, Liqiang Wang1, Nana Jin1, Yana Li1, Lin Zhang1, Huan Yang2, Zhenyu Hu1, Lining Zhang1, Chunyu Hu1, Chunhua Li1, Kun Qian1, Changjian Zhang2, Yan Huang1, Kongning Li3, Hao Lin4, Dong Wang5,6.   

Abstract

Increasing evidence has revealed that RNA subcellular localization is a very important feature for deeply understanding RNA's biological functions after being transported into intra- or extra-cellular regions. RNALocate is a web-accessible database that aims to provide a high-quality RNA subcellular localization resource and facilitate future researches on RNA function or structure. The current version of RNALocate documents more than 37 700 manually curated RNA subcellular localization entries with experimental evidence, involving more than 21 800 RNAs with 42 subcellular localizations in 65 species, mainly including Homo sapiens, Mus musculus and Saccharomyces cerevisiae etc. Besides, RNA homology, sequence and interaction data have also been integrated into RNALocate. Users can access these data through online search, browse, blast and visualization tools. In conclusion, RNALocate will be of help in elucidating the entirety of RNA subcellular localization, and developing new prediction methods. The database is available at http://www.rna-society.org/rnalocate/.
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27543076      PMCID: PMC5210605          DOI: 10.1093/nar/gkw728

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Biological functions of RNAs, including translation of genetic information, cellular signal transduction and transcriptional regulation etc., are determined by their location in cell (1,2). A cell is divided into different compartments that are related to different biological processes (3). For example, the RNA localized in nuclear usually participates in gene expression regulation or mitosis etc (4). Thus, the cellular role of the RNA after synthesis could be inferred from its subcellular localization information. Based on this, subcellular localization for RNAs plays an important role in studying biological function of RNAs. Therefore, it is urgent to construct a database of RNA subcellular localization to integrate, analyze and predict RNA subcellular localization for speeding up RNA structural and functional research. To complement with related research in RNA subcellular localization, we developed a web-accessible database (RNALocate, http://www.rna-society.org/rnalocate/), aimed to collect expanding catalog of diverse species’ RNA subcellular localization in multiple biological processes by manually curating the literature. The first release of RNALocate has contained more than 37 700 manually curated RNA subcellular localization entries with experimental evidence, involving 65 organisms (such as Homo sapiens, Musmusculus and Saccharomyces cerevisiae), 42 subcellular localizations (such as Cytoplasm, Nucleus, Endoplasmic reticulum, Ribosome) and 9 RNA categories (such as mRNA, miRNA, lncRNA). Hence, RNALocate provides a more specific subcellular localization resource in which to efficiently investigate, browse and analyze a particular RNA, and even provides insight into the functions of hypothetical or novel RNAs. The whole data set can be easily queried and downloaded through the webpage, and visualization tools for interactively browsing and analyzing the data set are provided. In addition, RNALocate also allows researchers to submit new RNA subcellular localization.

DATA SOURCES AND IMPLEMENTATION

In order to collect all available RNAs, RNALocate integrates all types of RNA symbols, mainly including microRNA symbols from the miRBase database (5), long non-coding RNA (lncRNA) and mRNA symbols from NCBI Gene and Ensemble genome database (6,7). Other ncRNA category names are also included, such as transfer RNA and small nuclear RNA from NCBI Gene and Ensemble genome database (6,7). The list of subcellular localization names was collected according to the Gene Ontology (GO) (8). We have written a simple script to screen all abstracts and articles in the PubMed database using the following keyword combinations: (each RNA symbol or RNA category name) and/or (each subcellular localization). The relevant hits were further inspected manually. Besides, we also retrieved several thousand subcellular localization entries from lncRNAdb (9), PomBase (10), FlyBase (11), TAIR (12) and DOT (13) databases (Figure 1).
Figure 1.

The overview of the RNALocate database.

The overview of the RNALocate database. The RNALocate database is implemented using HTML and PHP languages with MySQL server. The interface component consists of web pages designed and implemented in HTML/CSS. It has been tested in the Google Chrome, Firefox and Internet Explorer web browsers.

DATABASE CONTENT

RNA subcellular localization information was manually obtained from articles published in the PubMed database before May 2016. In current version, RNALocate documents 37 772 RNA subcellular localization entries with experimental evidence from 65 organisms, involving 42 subcellular localizations (Figure 2) and 9 RNA categories (including csRNA, lncRNA, mRNA, miRNA, piRNA, snRNA, rRNA, snoRNA and tRNA) (Figure 3). Among these, more than 1400 entries were collected from lncRNAdb, PomBase, FlyBase, TAIR and DOT databases. Each subcellular localization entry contains detailed information on RNA symbol, RNA category, alias, organism, sequence, homology, subcellular localization, tissue, validation method, PubMed ID, detailed description and network.
Figure 2.

The hierarchical organization and statistics of RNA subcellular localization.

Figure 3.

The statistics of RNA category and organism. (A) The percentage of 9 RNA categories in RNALocate database (B) The entry number of 65 organism in RNALocate database, only the organisms with ≥100 entries have been listed, respectively.

The hierarchical organization and statistics of RNA subcellular localization. The statistics of RNA category and organism. (A) The percentage of 9 RNA categories in RNALocate database (B) The entry number of 65 organism in RNALocate database, only the organisms with ≥100 entries have been listed, respectively. In ‘Submit’ page, RNALocate invites users to upload novel RNA subcellular localization data, and in ‘Blast’ page, sequence alignment can be done after parameter selection. Except these, the whole data set could be downloaded through two approaches: ‘Basic Download’ and ‘API’ (application programming interface). In ‘Basic Download’ page, the whole data are saved in Microsoft Excel and TXT formats, users can get them by clicking the download button. In ‘API’ page, users can access part of RNALocate data by using script. RNALocate also provides three options in ‘Help’ page to supply instructions for using it, including ‘Statistics’ (detailed statistical tables), ‘Tutorial’ (procedure and illustrations of RNALocate) and ‘Sister Databases’.

DATA QUERYING, SEARCHING AND BROWSING

RNALocate provides an interface for convenient retrieval of all RNA subcellular localizations. Users can query each entry through ‘Keyword Search’ in ‘Search’ page. In ‘Keyword Search’, 5 paths and relevant examples have been provided, including ‘RNA Symbol’, ‘RNA Category’, ‘Subcelllular localization’, ‘Organism’ and ‘Other ID (miRBase ID/Entrez ID)’. RNALocate provides brief details of search results as a table in the ‘Search Result’ page, while more detailed descriptions such as PubMed ID and description of the reference are displayed in ‘Detail’ page reached by selecting ‘more’. When selecting the specific RNA symbol in ‘Search Result’ page, the ‘Detail’ page presents more associated information of the RNA, such as organism, subcellular localization, alias, sequence, homology and validated method. More than 9200 RNAs with orthology/paralogy from miRBase and Homologene database have been provided in RNALocate for investigation on RNA subcellular localizations conservation. To further understand the interaction information between different RNAs in various types of subcellular localizations online, a ‘Network’ option has also been provided to visualize RNA interaction network with subcellular localization and organism (14,15). In ‘Browse’ page, users can access RNALocate in three different paths: ‘By Localization’, ‘By RNA Category’ and ‘By Organism’. A treeview and figure have been displayed in the three pages, respectively. Users could get browse results by clicking the node on the tree or the associated name in the figure. For convenience, the data in RNALocate are organized using a hierarchical structure of subcellular localization, according to the cellular component annotations documented in GO (8).

DISCUSSION AND FUTURE PROSPECTS

Several subcellular localization databases focused on proteins have been previously constructed, such as DBSubLoc, Organelle DB, eSLDB, LOCATE, SUBA, LocDB and PSORTdb databases (3,16–21). They had led to a more comprehensive understanding of the biological functions in proteins. However, recent development has indicated that protein subcellular localization are perhaps only half of the story in cells, since an expanding catalog of diverse RNAs is actively involved in multiple biological processes in different subcellular localization. To complement with this absence, we developed the RNALocate database by organizing and presenting RNA subcellular localization data for 65 organisms across 9 RNA categories. To our knowledge, this is the first database comprehensively focusing on RNA subcellular localization. We hope this resource will bridge the gap in RNAs and subcellular localization research, and stimulate further interesting elucidating the entirety of RNA subcellular localization, and developing new prediction methods. In the future, we will continuously collate RNA subcellular localization reference data and update RNALocate.
  21 in total

1.  DBSubLoc: database of protein subcellular localization.

Authors:  Tao Guo; Sujun Hua; Xinglai Ji; Zhirong Sun
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  The Gene Ontology (GO) database and informatics resource.

Authors:  M A Harris; J Clark; A Ireland; J Lomax; M Ashburner; R Foulger; K Eilbeck; S Lewis; B Marshall; C Mungall; J Richter; G M Rubin; J A Blake; C Bult; M Dolan; H Drabkin; J T Eppig; D P Hill; L Ni; M Ringwald; R Balakrishnan; J M Cherry; K R Christie; M C Costanzo; S S Dwight; S Engel; D G Fisk; J E Hirschman; E L Hong; R S Nash; A Sethuraman; C L Theesfeld; D Botstein; K Dolinski; B Feierbach; T Berardini; S Mundodi; S Y Rhee; R Apweiler; D Barrell; E Camon; E Dimmer; V Lee; R Chisholm; P Gaudet; W Kibbe; R Kishore; E M Schwarz; P Sternberg; M Gwinn; L Hannick; J Wortman; M Berriman; V Wood; N de la Cruz; P Tonellato; P Jaiswal; T Seigfried; R White
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 3.  mRNA localization: gene expression in the spatial dimension.

Authors:  Kelsey C Martin; Anne Ephrussi
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

4.  The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools.

Authors:  Philippe Lamesch; Tanya Z Berardini; Donghui Li; David Swarbreck; Christopher Wilks; Rajkumar Sasidharan; Robert Muller; Kate Dreher; Debbie L Alexander; Margarita Garcia-Hernandez; Athikkattuvalasu S Karthikeyan; Cynthia H Lee; William D Nelson; Larry Ploetz; Shanker Singh; April Wensel; Eva Huala
Journal:  Nucleic Acids Res       Date:  2011-12-02       Impact factor: 16.971

5.  Organelle DB: an updated resource of eukaryotic protein localization and function.

Authors:  Nuwee Wiwatwattana; Christopher M Landau; G Jamie Cope; Gabriel A Harp; Anuj Kumar
Journal:  Nucleic Acids Res       Date:  2006-11-27       Impact factor: 16.971

6.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

7.  lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs.

Authors:  Xiu Cheng Quek; Daniel W Thomson; Jesper L V Maag; Nenad Bartonicek; Bethany Signal; Michael B Clark; Brian S Gloss; Marcel E Dinger
Journal:  Nucleic Acids Res       Date:  2014-10-20       Impact factor: 16.971

8.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

9.  miRBase: annotating high confidence microRNAs using deep sequencing data.

Authors:  Ana Kozomara; Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2013-11-25       Impact factor: 16.971

10.  PSORTdb: expanding the bacteria and archaea protein subcellular localization database to better reflect diversity in cell envelope structures.

Authors:  Michael A Peabody; Matthew R Laird; Caitlyn Vlasschaert; Raymond Lo; Fiona S L Brinkman
Journal:  Nucleic Acids Res       Date:  2015-11-23       Impact factor: 16.971

View more
  66 in total

1.  lncSLdb: a resource for long non-coding RNA subcellular localization.

Authors:  Xiao Wen; Lin Gao; Xingli Guo; Xing Li; Xiaotai Huang; Ying Wang; Haifu Xu; Ruijie He; Chenglong Jia; Feixiang Liang
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

2.  DM3Loc: multi-label mRNA subcellular localization prediction and analysis based on multi-head self-attention mechanism.

Authors:  Duolin Wang; Zhaoyue Zhang; Yuexu Jiang; Ziting Mao; Dong Wang; Hao Lin; Dong Xu
Journal:  Nucleic Acids Res       Date:  2021-05-07       Impact factor: 16.971

3.  Illuminating lncRNA Function Through Target Prediction.

Authors:  Hua-Sheng Chiu; Sonal Somvanshi; Ting-Wen Chen; Pavel Sumazin
Journal:  Methods Mol Biol       Date:  2021

4.  TACOS: a novel approach for accurate prediction of cell-specific long noncoding RNAs subcellular localization.

Authors:  Young-Jun Jeon; Md Mehedi Hasan; Hyun Woo Park; Ki Wook Lee; Balachandran Manavalan
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

Review 5.  Illuminating RNA biology through imaging.

Authors:  Phuong Le; Noorsher Ahmed; Gene W Yeo
Journal:  Nat Cell Biol       Date:  2022-06-13       Impact factor: 28.213

6.  LncSEA: a platform for long non-coding RNA related sets and enrichment analysis.

Authors:  Jiaxin Chen; Jian Zhang; Yu Gao; Yanyu Li; Chenchen Feng; Chao Song; Ziyu Ning; Xinyuan Zhou; Jianmei Zhao; Minghong Feng; Yuexin Zhang; Ling Wei; Qi Pan; Yong Jiang; Fengcui Qian; Junwei Han; Yongsan Yang; Qiuyu Wang; Chunquan Li
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

7.  miEAA 2.0: integrating multi-species microRNA enrichment analysis and workflow management systems.

Authors:  Fabian Kern; Tobias Fehlmann; Jeffrey Solomon; Louisa Schwed; Nadja Grammes; Christina Backes; Kendall Van Keuren-Jensen; David Wesley Craig; Eckart Meese; Andreas Keller
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

8.  Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Authors:  Ying Jing; Yi Zhang; Hui Zhu; Ke Zhang; Mei-Chun Cai; Pengfei Ma; Peiye Shen; Zhenfeng Zhang; Minghui Shao; Jing Wang; Minhua Yu; Xia Yin; Meiying Zhang; Yuan Hu; Danni Chen; Wen Di; Xiaojie Wang; Guanglei Zhuang
Journal:  Oncogene       Date:  2019-01-07       Impact factor: 9.867

9.  Guidelines for the Optimization and Validation of In Situ Hybridization.

Authors:  Julia Jones; William J Howat
Journal:  Methods Mol Biol       Date:  2020

10.  mLoc-mRNA: predicting multiple sub-cellular localization of mRNAs using random forest algorithm coupled with feature selection via elastic net.

Authors:  Prabina Kumar Meher; Anil Rai; Atmakuri Ramakrishna Rao
Journal:  BMC Bioinformatics       Date:  2021-06-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.