| Literature DB >> 35551426 |
Damian Wollny1,2,3, Benjamin Vernot4, Jie Wang5, Maria Hondele6,7, Aram Safrastyan8,9, Franziska Aron8,9, Julia Micheel8,9, Zhisong He7, Anthony Hyman5, Karsten Weis6, J Gray Camp10,11, T-Y Dora Tang12,13, Barbara Treutlein14,15.
Abstract
Condensates formed by complex coacervation are hypothesized to have played a crucial part during the origin-of-life. In living cells, condensation organizes biomolecules into a wide range of membraneless compartments. Although RNA is a key component of biological condensates and the central component of the RNA world hypothesis, little is known about what determines RNA accumulation in condensates and to which extend single condensates differ in their RNA composition. To address this, we developed an approach to read the RNA content from single synthetic and protein-based condensates using high-throughput sequencing. We find that certain RNAs efficiently accumulate in condensates. These RNAs are strongly enriched in sequence motifs which show high sequence similarity to short interspersed elements (SINEs). We observe similar results for protein-derived condensates, demonstrating applicability across different in vitro reconstituted membraneless organelles. Thus, our results provide a new inroad to explore the RNA content of phase-separated droplets at single condensate resolution.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35551426 PMCID: PMC9098875 DOI: 10.1038/s41467-022-30158-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Sequencing RNA of single phase-separated coacervates.
a Schematic of coacervate generation and single-coacervate sequencing strategy. Coacervates were generated mixing charged polyelectrolytes Carboxymethyldextran with PDDA (CM-Dex:PDDA). Scale bar = 100 µm. Total RNA isolated from iPS was used as RNA input. Single coacervates were sorted into 96-well plates using fluorescence-activated cell sorting (FACS). RNA was extracted from each coacervate and mRNA was converted to cDNA and sequenced upon library preparation. RNAs present in each sequenced coacervate were computationally identified and quantified. b Schematic illustration of cross comparisons of several parameters (RNA length, coacervate size and complexity of RNA pool) from hundreds of individual coacervates in a single assay. c Relationship between the size of single coacervates, the number of different RNA transcripts and the average length of all RNA transcripts in each coacervate. Each dot represents a sequenced coacervate. Coacervate size was measured by the FACS forward scatter (FSC).
Fig. 2Comparison of experiment-to-experiment variability of RNA detection in coacervates.
a Quantification of the efficiency of RNA assembly into CM-Dex:PDDA coacervates across independent experiments. Each dot represents an RNA transcript. Venn diagram: overlapping transcripts across experiments that were found in 90–100% of coacervates. b Experiment-to-experiment variation of the average abundance of each RNA transcript across all coacervates in which it was detected. RNA abundance for each transcript is calculated as transcripts per kilobase million (log2(TPM)) enabling comparison of transcript abundances across coacervates. Red line indicates perfect correlation (x = y). Pearson correlation coefficient = r. c Pie chart demonstrating how frequently each input RNA transcript was detected in coacervates.
Fig. 3Properties of RNA found within coacervates.
a Correlation between input RNA amount and the frequency with which each transcript is detected in CM-Dex:PDDA coacervates. Transcripts that are enriched in coacervates (defined as residuals >30 for generalized additive model) are labeled red. b Analysis of sequence motifs that are detected within enriched transcripts (as defined in a) or randomly selected non-enriched transcripts. Motif enrichment corresponds to E-value derived from MEME suite. Among enriched transcripts, the two most abundant sequence motifs (Motifs 1 and 2) display sequence complementarity. c Frequency of transcript detection in coacervates conditional on if the transcripts contain either Motif 1, Motif 2, both motifs or none. d Analysis of sequence complementarity among different transcripts present in the pool of enriched or randomly selected transcripts. Sequence complementarity was determined using local-pairwise alignment (Smith–Waterman) scores. Dotted line indicates the maximum complementarity score that was detected outside the enriched vs. enriched comparisons (gray bars). e Comparison of sequence similarity of enriched motifs to known genomic elements. Heatmap represents pairwise alignment (Smith–Waterman) of enriched motifs with sequences of short interspersed elements (SINEs). Color intensity represents alignment score.
Fig. 4Comparison of RNA content across different coacervate and condensate types.
a Schematic representation of condensate types. Phase separation of synthetic condensates (CM-Dex:PDDA, CM-Dex:pLys) was induced through addition of carboxlymethyldextran (CM-Dex). b Scatter plots and corresponding Pearson correlations comparing how frequently each transcript is detected in different condensate types. Color represents magnitude of correlation.