| Literature DB >> 29040625 |
Jing Gong1,2, Di Shao1,2, Kui Xu1,2, Zhipeng Lu3, Zhi John Lu1, Yucheng T Yang4, Qiangfeng Cliff Zhang1,2.
Abstract
We present RISE (http://rise.zhanglab.net), a database of RNA Interactome from Sequencing Experiments. RNA-RNA interactions (RRIs) are essential for RNA regulation and function. RISE provides a comprehensive collection of RRIs that mainly come from recent transcriptome-wide sequencing-based experiments like PARIS, SPLASH, LIGR-seq, and MARIO, as well as targeted studies like RIA-seq, RAP-RNA and CLASH. It also includes interactions aggregated from other primary databases and publications. The RISE database currently contains 328,811 RNA-RNA interactions mainly in human, mouse and yeast. While most existing RNA databases mainly contain interactions of miRNA targeting, notably, more than half of the RRIs in RISE are among mRNA and long non-coding RNAs. We compared different RRI datasets in RISE and found limited overlaps in interactions resolved by different techniques and in different cell lines. It may suggest technology preference and also dynamic natures of RRIs. We also analyzed the basic features of the human and mouse RRI networks and found that they tend to be scale-free, small-world, hierarchical and modular. The analysis may nominate important RNAs or RRIs for further investigation. Finally, RISE provides a Circos plot and several table views for integrative visualization, with extensive molecular and functional annotations to facilitate exploration of biological functions for any RRI of interest.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29040625 PMCID: PMC5753368 DOI: 10.1093/nar/gkx864
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Framework to construct the RISE database. We collected RRIs from transcriptome-wide and targeted sequencing experiments, and other databases and publications. We performed quality control to obtain non-redundant intermolecular RRI entries. We then annotated RRIs with extensive molecular and functional information, including (i) RBP binding sites, (ii) RNA editing and modification sites, (iii) SNPs and pan-cancer mutations, and (iv) gene expression levels from various cell and tissue types. Finally, RISE provides integrative Circos plot visualization and table views for the search results.
Overview of data collected in the RISE database
| Category | Method/Resource | Species | Cell line | Number of interactions | Number of involved genes |
|---|---|---|---|---|---|
| Transcriptome-wide studies | PARIS | Human | HEK293T | 25 824 | 16 192 |
| Human | Hela (high RNase) | 25 552 | 19 335 | ||
| Human | Hela (low RNase) | 20 330 | 17 009 | ||
| Mouse | mESC | 29 514 | 12 625 | ||
| MARIO | Mouse | MEF | 7 167 | 2 936 | |
| Mouse | mESCa | 99 290 | 15 309 | ||
| Mouse | mESCb | 37 441 | 9 715 | ||
| SPLASH | Human | hESC | 3 345 | 971 | |
| Human | HeLa | 5 799 | 1 649 | ||
| Human | LCL | 4 213 | 429 | ||
| Human | hESC (RA treated) | 1 770 | 671 | ||
| LIGR-seq | Human | HEK293T | 641 | 749 | |
| Targeted studies | RIA-seq | Human | Keratinocytes (TINCR) | 3 609 | 1 815 |
| RAP-RNA | Mouse | mESC (Malat1) | 495 | 489 | |
| Mouse | mESC (U1 snRNA) | 12 278 | 8 635 | ||
| CLASH | Human | HEK293 (miRNAs) | 18 508 | 7260 | |
| Yeast | BY4741 (miRNAs) | 253 | 47 | ||
| From other databases/dataset | NPInter v3.0 | Human | − | 3 691 | 2 525 |
| Mouse | − | 52 | 83 | ||
| RAID v2.0 | Human | − | 22 521 | 6 262 | |
| Mouse | − | 3 440 | 2 130 | ||
| RAIN | Human | − | 2 881 | 1 189 | |
| Mouse | − | 36 | 31 | ||
| PMID 26673718 | E. coli | − | 64 | 68 | |
| S. enterica | − | 45 | 49 | ||
| Yeast | − | 52 | 45 | ||
| Total | − | − | − | 328 811 | 56 295 |
aThis experiment uses UV-crosslinking to detect RRIs mediated by one protein.
bThis experiment uses chemical crosslinking to detect RRIs mediated by multiple proteins.
Figure 2.Distribution of RRIs by RNA types and comparison of RRIs from different studies in the RISE database. (A) Circos plot showing RRIs between different types of interacting RNAs in human. (B) Overlap of RRIs in different cell lines and experimental conditions detected by the PARIS method. (C) Overlap of RRIs detected by the PARIS and the SPLASH methods in the HeLa cell line.The comparison in (B, C) is counted on the RNA molecular level regardless of the precise interacting regions. And we used the same transcriptome from the SPLASH study as the mapping reference of PARIS in (C) for cross-technology comparison (which explains the RRI number differences between B and C).
Figure 3.Characteristics of the human RNA interactome in RISE. (A) Degree distribution of the RNAs. (B) Distribution of the shortest path between pairs of RNAs. (C) Degree distribution of the average clustering coefficients of the RNAs. (D) Degree distribution of the average neighborhood connectivity. (E) Degree distribution of the average betweenness centrality of the RNAs. (F) Degree distribution of the average closeness centrality of the RNAs. The blue lines show the regression in log-space.
Figure 4.Example of database usage: RBFOX2 in human as an example. (A) The search box of RISE. (B) The output of the search result showing: (1) gene information, (2) integrative view of RRIs and associated annotations in a Circos plot, (3) detailed information of RRIs, (4) associated molecular annotations.