| Literature DB >> 30517751 |
Neelanjan Mukherjee1,2, Hans-Hermann Wessels2,3, Svetlana Lebedeva2, Marcin Sajek4,5, Mahsa Ghanbari2, Aitor Garzia5, Alina Munteanu2,6, Dilmurat Yusuf2, Thalia Farazi5, Jessica I Hoell5,7, Kemal M Akat5, Altuna Akalin2, Thomas Tuschl5, Uwe Ohler2,3,6.
Abstract
RNA-binding proteins (RBPs) control and coordinate each stage in the life cycle of RNAs. Although in vivo binding sites of RBPs can now be determined genome-wide, most studies typically focused on individual RBPs. Here, we examined a large compendium of 114 high-quality transcriptome-wide in vivo RBP-RNA cross-linking interaction datasets generated by the same protocol in the same cell line and representing 64 distinct RBPs. Comparative analysis of categories of target RNA binding preference, sequence preference, and transcript region specificity was performed, and identified potential posttranscriptional regulatory modules, i.e. specific combinations of RBPs that bind to specific sets of RNAs and targeted regions. These regulatory modules represented functionally related proteins and exhibited distinct differences in RNA metabolism, expression variance, as well as subcellular localization. This integrative investigation of experimental RBP-RNA interaction evidence and RBP regulatory function in a human cell line will be a valuable resource for understanding the complexity of post-transcriptional regulation.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30517751 PMCID: PMC6344852 DOI: 10.1093/nar/gky1185
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.RBP analyzed and binding preferences by RNA category. Heatmap of reference normalized annotation category preference for each RBP clustered into 8 branches by color (left) and subcellular localization (Supplementary Table S5). The heatmap represents the difference in the proportion of sites for a given annotation category in the RBP library versus the reference library. Heatmap of the reference library normalized relative positional binding preference of the 55 RBPs with enriched binding in at least one mRNA-relevant annotation category per branch (right). RBP-specific binding preferences were averaged across selected transcripts (see methods). The relative spatial proportion of 5′UTR, coding regions and 3′UTR were averaged across all selected transcript isoforms. For TES (regions beyond transcription end site), 5′ splice site, and 3′ splice site, we chose fixed windows (250nt for TES and 500nt for splice sites). For each RBP, meta-coverage was scaled between 5′UTR to TES. The 5′ and 3′ intronic splice site coverage was scaled separately from other regions but relative to each other.
Figure 2.RBP binding sequence specificity and elements. (A) Heatmap of reference normalized 6-mer enrichment for top 5 enriched 6-mers for each RBP in the set of RBPs exhibiting more sequence specificity than the reference. (B) Top 3 SSMART motif results using all binding sites found in mRNA-derived annotation categories ranked by the library size normalized enrichment over reference library.
Figure 3.RNA regulatory modules. (A) Factor analysis of target RNA encoding genes binding normalized by the reference library and expression for the 55 RBPs binding to mRNAs and lncRNAs for 13 299 genes (see ‘factor analysis’ section in methods for details). Spring-embedded graph of the factor loading matrix, indicating the association between each of the 55 RBPs and one of the 10 factors. Nodes color-coded by RNA annotation category preference cluster membership from Figure 1. Edge width scales with factor loadings (thicker edge = higher factor loading = stronger association). Only edges with a factor loading > 0.2 (positive values in black) or <−0.2 (negative values in green) depicted. (B) Dot plot of the communality, or the variance in a given RBP cumulatively explained by the all factors color-coded by RNA annotation category preference.
Figure 4.Functional characterization of RNA regulatory modules. (A) The difference in either (A) primary or (B) mature RNA expression TPM (transcripts per million) upon ELAVL1 knockdown by siRNA treatment (y-axis), for each set of factor-associated RNAs. *P < 0.05 Mann–Whitney U-test. (C) Heatmap of the median value of synthesis rate, processing rates, degradation rates, cytoplasmic versus nuclear localization, polyribosomal versus cytoplasmic localization, and translational status from ribosome profiling data for each gene set (top). Heatmap of the odds-ratio of the overlap between factor associated gene sets with annotation (bottom). (D) Plot of the confidence intervals for P-body localization (y-axis from (35)) and stress granule localization (x-axis from (36)) for each set of factor-associated RNAs. Dashed lines represent median enrichment of RNA in P-bodies (y-axis) and stress granules (x-axis).