Literature DB >> 24803509

RAID: a comprehensive resource for human RNA-associated (RNA-RNA/RNA-protein) interaction.

Xiaomeng Zhang1, Deng Wu1, Liqun Chen1, Xiang Li1, Jinxurong Yang1, Dandan Fan1, Tingting Dong1, Mingyue Liu1, Puwen Tan1, Jintian Xu1, Ying Yi1, Yuting Wang1, Hua Zou1, Yongfei Hu1, Kaili Fan1, Juanjuan Kang1, Yan Huang1, Zhengqiang Miao1, Miaoman Bi1, Nana Jin1, Kongning Li1, Xia Li1, Jianzhen Xu2, Dong Wang1.   

Abstract

Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA-RNA/RNA-protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA-RNA interactions and 1619 RNA-protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA-RNA/RNA-protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA-RNA/RNA-protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network.
© 2014 Zhang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

Entities:  

Keywords:  RNA–RNA; RNA–protein; database; interaction

Mesh:

Substances:

Year:  2014        PMID: 24803509      PMCID: PMC4114696          DOI: 10.1261/rna.044776.114

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


INTRODUCTION

In the past decades, systematic human protein interaction screens have provided a valuable platform to explore the functional organization of the cells (Bossi and Lehner 2009; Vidal et al. 2011). Consequently, text mining-based annotations of this huge number of protein–protein interactions (PPIs) have been established and lead to a more comprehensive understanding of protein function and cellular processes (STRING) (Franceschini et al. 2013), eggnog (Powell et al. 2012). However, recent development has indicated that PPIs are perhaps only half of the story in cells, since an expanding catalog of noncoding RNAs (ncRNAs) are actively involved in multiple biological processes such as cell death, developmental timing, and fat metabolism (Guttman and Rinn 2012; Xu et al. 2012; Li et al. 2013). The cross-talks within ncRNAs and among RNA–protein are far more intricate and dynamic (Konig et al. 2011; Bernstein et al. 2012; Muller-McNicoll and Neugebauer 2013). For example, experimental evidences indicated that metastasis-associated lung adenocarcinoma transcript 1 (malat1), one of up-regulated long noncoding RNAs (lncRNAs) in many malignant tumors, can stimulate cancer invasion and promote tumorigenicity via binding to several key tumor-suppressor proteins, such as BCL2 and BCLXL1 (Li et al. 2009; Guo et al. 2010). Similarly, recent investigations discovered a novel regulatory RNA circuit, in which RNAs cross-regulate each other by competing for shared ncRNAs (Salmena et al. 2011; Sumazin et al. 2011). For instance, linc-MD1 can sponge miR-133 and miR-135 to modulate the expression of MAML1 and MEF2C, thus act as a competing endogenous RNA (ceRNA) to govern the time of muscle differentiation in mouse and human myoblasts (Cesana et al. 2011). Hence, considerable attention should be focused on the expanding RNA-associated (RNA–RNA/RNA–protein) interaction. Since the comprehensive regulating cross-talk between diverse RNAs and protein still remains ambiguous, we have developed an RNA-associated interaction database (RAID, http://www.rna-society.org/raid) by integrating experimental evidence from tens of thousands of references. The current version of RAID documents over 6100 human RNA-associated (RNA–RNA/RNA–protein) interactions that are extracted from more than 2100 published papers. RAID provides a valuable resource to manipulate, visualize, and analyze human RNA–RNA/RNA–protein interactions. By integrating the RNA–RNA and RNA–protein interactions into a global network, users can follow RNA-associated (RNA–RNA/RNA–protein) interaction trajectory and determine their functional significance in the whole RNA-associated interaction network.

DATA SOURCES AND IMPLEMENTATION

In order to collect all available RNA and Protein symbols, we have downloaded and integrated all types of RNA and protein symbols including approved symbols, approved names, previous symbols, and names and synonyms in the HGNC database (Gray et al. 2013). Because the research for some ncRNAs is still in its infancy, such as promoter-associated small RNAs (PASRs), PIWI-interacting RNAs (piRNA), promoter upstream transcripts (PROMPTs), transcription initiation RNAs (tiRNA), and TSS-associated RNAs (TSSa-RNAs), etc. (Esteller 2011), and there are not-unified nomenclatures, we instead searched the PubMed database by using these ncRNA category names to replace specific ncRNA symbols. In order to reduce the great challenge of manual curation, we have written scripts to screen in advance all abstracts and full-text articles in the PubMed database for the following keywords combinations: (1) RNA–RNA interactions: (RNA symbols or RNA category names) and/or (RNA symbols or RNA category names) and/or (“interaction” or “binding,” etc.); (2) RNA–protein interactions: (RNA symbols or RNA category names) and/or (protein symbols) and/or (interaction or binding, etc.). The scripts mainly consist of two steps: (1) extract PMC and Pubmed IDs from NCBI through Entrez Programming Utilities (eUtils) based on the combination of keywords; (2) download of the matched abstracts or full articles from NCBI. Then, these screened results were further revised manually. The functional information such as RNA/protein interactions, validated methods, and expressing tissues were extracted. At the same time, the interactions predicted in silico were discarded. This manual checking process ensured the high reliability of data. In addition, RAID also integrated the miRNA-associated interactions collected in some focused databases such as miRTarBase, miRDeathDB, MNDR (Mammalian ncRNA-disease repository) (Xu and Li 2012; Wang et al. 2013), and other resources (NPInter and PRD) (Wu et al. 2006; Hsu et al. 2011; Fujimori et al. 2012). All of the above third-party data contain a manual collection of RNA regulation interactions usually produced from precise experiments (Xu et al. 2012; Wang et al. 2013; Hsu et al. 2014). On the other hand, some data sets generated by high-throughput techniques or outdated data such as those collected in dorina, Tarbase, and starbase (Yang et al. 2011; Anders et al. 2012; Vergoulis et al. 2012; Li et al. 2014) haven't been integrated into RAID because of the possible higher false-positive targets. The RAID database is implemented using HTML and PHP language in a window environment connected to the MySQL server, and the interface component consists of the web pages designed and implemented in HTML/CSS. It has been tested in Google Chrome, Safari, Mozilla Firefox, and Internet Explorer web browsers.

CONTENT OF THE DATABASE

According to the PubMed database, we collected the references published before April 2013. Based on keyword combinations, we have automatically screened tens of thousands of abstracts and full-text articles by in-house scripts. In total, more than 2100 literatures were documented and 4493 RNA–RNA interactions and 1619 RNA–protein interaction entries for a total of 6112 curated entries were documented. Among these RNA-associated (RNA–RNA/RNA–protein) interaction entries, there were 2070 nonredundancy RNA symbols and 395 nonredundancy protein symbols. In the current version of RAID, each entry contains detailed information on an RNA-associated (RNA–RNA/RNA–protein) interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, a literature reference (Pubmed ID), and detailed functional description (Fig. 1). To facilitate researchers in accessing information from external resources, we linked RNA and protein symbols to the HGNC database (Gray et al. 2013), which can efficiently retrieve plenty of genomic-associated data from external resources. In addition, RAID also welcomes researchers to submit experimentally identified novel RNA-associated (RNA–RNA/RNA–protein) interaction. All of the RNA-associated interactions can be downloaded directly in the Excel format, and RAID provides a publicly available interface (API) for automatic data retrieval in the Download and API page.
FIGURE 1.

The overview of RAID database.

SEARCHING PATHS AND BROWSING

In the search page, RAID provides an interface for convenient retrieval of RNA-associated (RNA–RNA/RNA–protein) interactions. Users can browse and obtain any RNA-associated (RNA–RNA/RNA–protein) interaction through four paths (Fig. 2A). Path 1 (by keyword): browsing the RNA-associated interactions by inputting the keywords (any RNA and protein symbol) with fuzzy search supported. Users can obtain a list of RNA-associated (RNA–RNA/RNA–protein) interactions for any keywords. Path 2 (by RNA/protein category): Users can search all RNA-associated interactions between two defined RNA/protein categories. Similarly, users can retrieve all interactions between two defined RNA/protein symbols in Path 3 (by RNA/protein symbol). Path 4 (by validated method): browsing the RNA-associated interactions by experimental validated methods with multiple selection supported. The main table of results contains RAID ID, RNA/protein symbol 1 and category 1, RNA/protein symbol 2 and category 2, and detail “More” (Fig. 2B). When clicking the “More” link in each record, users can have access to more specific information such as RAID ID, RNA/protein symbol, RNA/protein category, validated method, expression tissue, a literature reference (Pubmed ID), and detailed functional description (Fig. 2C). Similarly, in the browser page, users can also browse any RNA-associated interaction by interaction type (RNA–RNA or RNA–protein), such as lncRNA-associated RNA–RNA interactions (229 entries).
FIGURE 2.

A flowchart for retrieving RNA-associated interaction entry. (A) Four searching paths for retrieving the RNA-associated interaction. (B) The result of a representative database entry. (C) The detailed information for an RNA-associated interaction. In the result and detail pages, RAID linked each RNA/protein symbol and PMID to their corresponding databases.

The overview of RAID database. A flowchart for retrieving RNA-associated interaction entry. (A) Four searching paths for retrieving the RNA-associated interaction. (B) The result of a representative database entry. (C) The detailed information for an RNA-associated interaction. In the result and detail pages, RAID linked each RNA/protein symbol and PMID to their corresponding databases.

THE PREDICTED BINDING SITES AND NETWORK VISUALIZATION

In addition to archive RNA-associated interaction, RAID also intends to integrate a variety of useful tools to analyze these data. Because the identification of RNA–RNA/RNA–protein binding sites can provide valuable insights for underlying the detailed regulating mechanism of the various RNAs, RAID also contains the predicted binding sites for RNA-associated interaction. Specifically, RAID adopts the predicted binding sites and scores by miRanda for a miRNA and its targets (John et al. 2004), while containing the predicted binding sites and score by RIsearch for the RNA–RNA interactions (Fig. 3; Wenzel et al. 2012). For RNA–protein interactions, bindN (Wang and Brown 2006), bindN+ (Wang et al. 2010), Pprint (Kumar et al. 2008), and RNAbindR (Terribilini et al. 2007) are commonly used tools to predict RNA-binding residues in proteins (Puton et al. 2012). Similarly, RAID also merges the predicted RNA-binding residues and scores from these tools. Additionally, RAID also integrates the experimentally verified RNA-binding sites in proteins documented in the RBPBD (Cook et al. 2011) and RsiteDB (Shulman-Peleg et al. 2009) databases. The parameters used by these predictive tools were documented in the Parameter of Help Page.
FIGURE 3.

Representative screenshots of the Binding and Network pages. (A) The Binding page: representing the predicted binding sites and/or constants. (B) The Network page: representing the interaction subnetwork of interacting RNA/protein.

Representative screenshots of the Binding and Network pages. (A) The Binding page: representing the predicted binding sites and/or constants. (B) The Network page: representing the interaction subnetwork of interacting RNA/protein. Besides the detailed analysis of RNA interaction sites, RAID also supports the users to globally observe the RNA-associated (RNA–RNA/RNA–protein) interaction network. Cytoscape Web (cytoscapeweb.cytoscape.org/) is a visualization tool that is suitable for displaying small to medium sized networks in a web-based manner (Lopes et al. 2010). In the visualization option at Network Page (Fig. 3B), RNA-associated interaction subnetworks can be rapidly and independently represented by embedding interactive networks with the Cytoscape Web. The “First Node” or “Second Node” option represents the subnetwork of interacting RNA/protein with the first or second interaction RNA/protein, the “Both the Nodes” option represents the subnetwork of interacting RNA/protein with both interaction nodes. The “First Neighbour” represents the subnetwork of direct interacting with the center node, the “Second Neighbour” represents the subnetwork of direct and second-step interacting with the center node. Interaction of a subnetwork based on the two nodes of this interaction may help the researchers represent all interacting partners immediately. Thus, multiple RNA/protein data resources can be combined in a single visualization for each RNA/protein with its interaction partner. Since the compelling visualization architecture is pan-and-zoom, users can observe specific RNA/protein within the RNA-associated interaction network and the “Selection of the Layout” option can provide the different layout types for this subnetwork.

DISCUSSION AND FUTURE DIRECTIONS

High-throughput proteomics and protein–protein interaction screens have enabled rapid progress in mapping the protein interactome (Bossi et al. 2009; Vidal et al. 2011). However, the RNA-associated interactome is likely to be much larger and more complex due to the huge numbers of transcripts identified by global analyses (Konig et al. 2011; Bernstein et al. 2012; Derrien et al. 2012; Frazer 2012; Muller-McNicoll et al. 2013). Recent investigations indicated that there are complex regulations among diverse ncRNAs and protein-coding genes (Konig et al. 2011; Bernstein et al. 2012; Derrien et al. 2012; Frazer 2012; Muller-McNicoll et al. 2013). Consequently, we systematically collect experimentally verified human RNA-associated (RNA–RNA/RNA–protein) interactions and established the first database centering on the interaction network between diverse RNAs and RNAs/Proteins. RAID will be of particular interest to the life-science community and facilitates the biologists to unravel the role of RNAs/proteins in a variety of biological processes. In the future, we will continuously curate and update the reference data. Complemented with the successful PPI databases, RAID will provide a valuable skeleton for a better understanding of the functional organization of the cell.
  39 in total

1.  Connect the dots: a systems level approach for analyzing the miRNA-mediated cell death network.

Authors:  Yifei Li; Liwei Zhuang; Yuting Wang; Yongfei Hu; Yun Wu; Dong Wang; Jianzhen Xu
Journal:  Autophagy       Date:  2013-01-15       Impact factor: 16.016

Review 2.  How cells get the message: dynamic assembly and function of mRNA-protein complexes.

Authors:  Michaela Müller-McNicoll; Karla M Neugebauer
Journal:  Nat Rev Genet       Date:  2013-03-12       Impact factor: 53.242

3.  BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features.

Authors:  Liangjiang Wang; Caiyan Huang; Mary Qu Yang; Jack Y Yang
Journal:  BMC Syst Biol       Date:  2010-05-28

Review 4.  Protein-RNA interactions: new genomic technologies and perspectives.

Authors:  Julian König; Kathi Zarnack; Nicholas M Luscombe; Jernej Ule
Journal:  Nat Rev Genet       Date:  2012-01-18       Impact factor: 53.242

5.  PRD: A protein-RNA interaction database.

Authors:  Shigeo Fujimori; Katsuya Hino; Ayumu Saito; Satoru Miyano; Etsuko Miyamoto-Sato
Journal:  Bioinformation       Date:  2012-08-03

6.  The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.

Authors:  Thomas Derrien; Rory Johnson; Giovanni Bussotti; Andrea Tanzer; Sarah Djebali; Hagen Tilgner; Gregory Guernec; David Martin; Angelika Merkel; David G Knowles; Julien Lagarde; Lavanya Veeravalli; Xiaoan Ruan; Yijun Ruan; Timo Lassmann; Piero Carninci; James B Brown; Leonard Lipovich; Jose M Gonzalez; Mark Thomas; Carrie A Davis; Ramin Shiekhattar; Thomas R Gingeras; Tim J Hubbard; Cedric Notredame; Jennifer Harrow; Roderic Guigó
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

7.  starBase: a database for exploring microRNA-mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq data.

Authors:  Jian-Hua Yang; Jun-Hao Li; Peng Shao; Hui Zhou; Yue-Qin Chen; Liang-Hu Qu
Journal:  Nucleic Acids Res       Date:  2010-10-30       Impact factor: 16.971

8.  doRiNA: a database of RNA interactions in post-transcriptional regulation.

Authors:  Gerd Anders; Sebastian D Mackowiak; Marvin Jens; Jonas Maaskola; Andreas Kuntzagk; Nikolaus Rajewsky; Markus Landthaler; Christoph Dieterich
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

9.  RsiteDB: a database of protein binding pockets that interact with RNA nucleotide bases.

Authors:  Alexandra Shulman-Peleg; Ruth Nussinov; Haim J Wolfson
Journal:  Nucleic Acids Res       Date:  2008-10-25       Impact factor: 16.971

10.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

View more
  32 in total

1.  ncRDeathDB: A comprehensive bioinformatics resource for deciphering network organization of the ncRNA-mediated cell death system.

Authors:  Deng Wu; Yan Huang; Juanjuan Kang; Kongning Li; Xiaoman Bi; Ting Zhang; Nana Jin; Yongfei Hu; Puwen Tan; Lu Zhang; Ying Yi; Wenjun Shen; Jian Huang; Xiaobo Li; Xia Li; Jianzhen Xu; Dong Wang
Journal:  Autophagy       Date:  2015       Impact factor: 16.016

2.  P2RX7-V3 is a novel oncogene that promotes tumorigenesis in uveal melanoma.

Authors:  Hui Pan; Hongyan Ni; LeiLei Zhang; Yue Xing; Jiayan Fan; Peng Li; Tianyuan Li; Renbing Jia; Shengfang Ge; He Zhang; Xianqun Fan
Journal:  Tumour Biol       Date:  2016-07-28

3.  RAID v2.0: an updated resource of RNA-associated interactions across organisms.

Authors:  Ying Yi; Yue Zhao; Chunhua Li; Lin Zhang; Huiying Huang; Yana Li; Lanlan Liu; Ping Hou; Tianyu Cui; Puwen Tan; Yongfei Hu; Ting Zhang; Yan Huang; Xiaobo Li; Jia Yu; Dong Wang
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

Review 4.  Multimodal Long Noncoding RNA Interaction Networks: Control Panels for Cell Fate Specification.

Authors:  Keriayn N Smith; Sarah C Miller; Gabriele Varani; J Mauro Calabrese; Terry Magnuson
Journal:  Genetics       Date:  2019-12       Impact factor: 4.562

5.  Suppression of long noncoding RNA MALAT1 inhibits the development of uveal melanoma via microRNA-608-mediated inhibition of HOXC4.

Authors:  Shuai Wu; Han Chen; Ling Zuo; Hai Jiang; Hongtao Yan
Journal:  Am J Physiol Cell Physiol       Date:  2020-01-08       Impact factor: 4.249

6.  RNAInter v4.0: RNA interactome repository with redefined confidence scoring system and improved accessibility.

Authors:  Juanjuan Kang; Qiang Tang; Jun He; Le Li; Nianling Yang; Shuiyan Yu; Mengyao Wang; Yuchen Zhang; Jiahao Lin; Tianyu Cui; Yongfei Hu; Puwen Tan; Jun Cheng; Hailong Zheng; Dong Wang; Xi Su; Wei Chen; Yan Huang
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

7.  LncReg: a reference resource for lncRNA-associated regulatory networks.

Authors:  Zhong Zhou; Yi Shen; Muhammad Riaz Khan; Ao Li
Journal:  Database (Oxford)       Date:  2015-09-10       Impact factor: 3.451

8.  Introduction to Bioinformatics Resources for Post-transcriptional Regulation of Gene Expression.

Authors:  Eliana Destefanis; Erik Dassi
Journal:  Methods Mol Biol       Date:  2022

Review 9.  De-repressing LncRNA-Targeted Genes to Upregulate Gene Expression: Focus on Small Molecule Therapeutics.

Authors:  Roya Pedram Fatemi; Dmitry Velmeshev; Mohammad Ali Faghihi
Journal:  Mol Ther Nucleic Acids       Date:  2014-11-18       Impact factor: 10.183

10.  Genome wide discovery of long intergenic non-coding RNAs in Diamondback moth (Plutella xylostella) and their expression in insecticide resistant strains.

Authors:  Kayvan Etebari; Michael J Furlong; Sassan Asgari
Journal:  Sci Rep       Date:  2015-09-28       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.