| Literature DB >> 29463232 |
Yaron Orenstein1, Uwe Ohler2, Bonnie Berger3,4.
Abstract
BACKGROUND: RNA-binding proteins (RBPs) play vital roles in many processes in the cell. Different RBPs bind RNA with different sequence and structure specificities. While sequence specificities for a large set of 205 RBPs have been reported through the RNAcompete compendium, structure specificities are known for only a small fraction. The main limitation lies in the design of the RNAcompete technology, which tests RBP binding against unstructured RNA probes, making it difficult to infer structural preferences from these data. We recently developed RCK, an algorithm to infer sequence and structural binding models from RNAcompete data. The set of binding models enables, for the first time, a large-scale assessment of RNA structure in the RBPome.Entities:
Keywords: RBP; RBPome; RNA structure; RNA-binding proteins; RNAcompete; eCLIP
Mesh:
Substances:
Year: 2018 PMID: 29463232 PMCID: PMC5819699 DOI: 10.1186/s12864-018-4540-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Structural variability exists in unstructured RNA probes and correlates with protein RNA-binding. a) Distribution of standard deviation of average 5-mer base-pairing probabilities. While in random probes there is more variability, unstructured probes are still quite variable. b) Correlation of probes containing UUUUU HuR-binding intensities and UUUUU’s average base-pairing probabilities. The negative correlation shows that HuR prefers to bind unpaired regions
Fig. 2Analysis of RNA structural binding preferences for a large compendium of RBPs. a) Distribution of paired to unpaired binding preference log ratios for 205 RBPs shows almost all proteins in this dataset prefer to bind unpaired regions. b) Distribution of loop to external binding preference log ratios for 205 RBPs reveals more proteins in this dataset prefer to bind loop regions, and many may bind both. c) RNA structural binding preference improves in vitro binding prediction. Correlation results over 488 paired experiments uncovers that RNA structure plays a significant role in protein-RNA interactions. d) RNA structural binding preference improves in vivo binding prediction. AUC results of 96 paired eCLIP and RNAcompete experiments over 21 joint proteins demonstrate that RNA structural binding preferences learned from in vitro data correlate well with protein-RNA interactions measured in vivo
Fig. 3Structure-based models improve in vivo binding prediction. a) HRHNPK in vivo binding prediction improves AUC from 0.75 to 0.81. Sequence preferences agree with a previous study, but no structural preferences were previously known. b) PUM2 in vivo binding prediction improves AUC from 0.52 to 0.59. Sequence and structural preferences agree with previously-published preferences based on PAR-CLIP data inferred by GraphProt algorithm. Structural contexts letter: S = stem / paired, H = hairpin loop, I = inner loop, M = multi loop, E = external region
Fig. 4Improved binding prediction from amino acid sequence by utilizing RNA structure. a) When we add RNA structural features to the sequence k-mer space of AffinityRegression, we predict binding significantly better than using sequence features alone. b) When we add RNA structural features to the sequence k-mer space of AffinityRegression, we predict the top-bound probes as compared to unbound probes significantly better than using sequence features alone