| Literature DB >> 30445601 |
Benjamin Lang1, Alexandros Armaos1, Gian G Tartaglia1,2,3,4.
Abstract
Protein-RNA interactions are implicated in a number of physiological roles as well as diseases, with molecular mechanisms ranging from defects in RNA splicing, localization and translation to the formation of aggregates. Currently, ∼1400 human proteins have experimental evidence of RNA-binding activity. However, only ∼250 of these proteins currently have experimental data on their target RNAs from various sequencing-based methods such as eCLIP. To bridge this gap, we used an established, computationally expensive protein-RNA interaction prediction method, catRAPID, to populate a large database, RNAct. RNAct allows easy lookup of known and predicted interactions and enables global views of the human, mouse and yeast protein-RNA interactomes, expanding them in a genome-wide manner far beyond experimental data (http://rnact.crg.eu).Entities:
Mesh:
Substances:
Year: 2019 PMID: 30445601 PMCID: PMC6324028 DOI: 10.1093/nar/gky967
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) Interaction propensity scores for the background (sampled from slightly over 2 billion human protein–RNA pairs; light red) and positive set (212 256 high-confidence protein–RNA interactions revealed by eCLIP; cyan). The z-score reported in the results pages is computed on the right-skewed blue distribution, with the solid cyan line indicating the mean and the dashed line indicating a z-score of 1 (one standard deviation above the mean). (B) The area under the ROC curve of 0.78 (0.72 upon length normalization) indicates the predictive performance of the catRAPID method on recent high-confidence experimental eCLIP data from the ENCODE project.
Figure 2.Search results (disambiguation page). This page allows selection of the protein or RNA of interest across the 3 species currently in RNAct.
Examples of realistic search terms successfully resolved by RNAct
| Real-world search term | Retrieved gene symbol(s) | Retrieved description | Retrieved via |
|---|---|---|---|
| ‘annexin 11’ | ANXA11 | Annexin A11 | Partial description match |
| ‘ews’ | EWSR1 | RNA-binding protein EWS | Gene symbol alias |
| ENSG00000089280 | FUS | RNA-binding protein FUS | Ensembl gene identifier |
| FUS_MOUSE | FUS | RNA-binding protein FUS | UniProt identifier |
| P35637 | FUS | RNA-binding protein FUS | UniProt accession |
| ‘pur α’ | PURA | Transcriptional activator protein Pur-α | Partial description match |
| ‘smn’ | SMN1 | Survival motor neuron protein | Partial symbol match |
| SMNDC1 | Survival of motor neuron-related-splicing factor 30 | ||
| ‘tdp43’ | TARDBP | TAR DNA-binding protein 43 | Gene symbol alias, ignoring punctuation (via TDP-43) |
Figure 3.The Protein view. This page shows a list of potential RNA interaction partners prioritized by catRAPID length-normalized prediction score. Alternatively, the page can be sorted by eCLIP experimental results by clicking on the ‘P-value’ or ‘fold change’ columns. Useful information on the protein of interest, such as whether it is a known or predicted RBP and whether experimental interaction data (e.g. from eCLIP experiments) exists for it is shown at the top of this view, and transcript annotation and quality information are shown as badges for each RNA. Links out to Ensembl and UniProt are provided. Other links lead to the protein’s or RNA’s view within RNAct.