| Literature DB >> 27851933 |
Alexander Borodavka1, Surendra W Singaram2, Peter G Stockley1, William M Gelbart3, Avinoam Ben-Shaul4, Roman Tuma5.
Abstract
Long RNA molecules are at the core of gene regulation across all kingdoms of life, while also serving as genomes in RNA viruses. Few studies have addressed the basic physical properties of long single-stranded RNAs. Long RNAs with nonrepeating sequences usually adopt highly ramified secondary structures and are better described as branched polymers. To test whether a branched polymer model can estimate the overall sizes of large RNAs, we employed fluorescence correlation spectroscopy to examine the hydrodynamic radii of a broad spectrum of biologically important RNAs, ranging from viral genomes to long noncoding regulatory RNAs. The relative sizes of long RNAs measured at low ionic strength correspond well to those predicted by two theoretical approaches that treat the effective branching associated with secondary structure formation-one employing the Kramers theorem for calculating radii of gyration, and the other featuring the metric of maximum ladder distance. Upon addition of multivalent cations, most RNAs are found to be compacted as compared with their original, low ionic-strength sizes. These results suggest that sizes of long RNA molecules are determined by the branching pattern of their secondary structures. We also experimentally validate the proposed computational approaches for estimating hydrodynamic radii of single-stranded RNAs, which use generic RNA structure prediction tools and thus can be universally applied to a wide range of long RNAs.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27851933 PMCID: PMC5113152 DOI: 10.1016/j.bpj.2016.10.014
Source DB: PubMed Journal: Biophys J ISSN: 0006-3495 Impact factor: 4.033
Figure 1Schematics of an RNA molecule as a branched polymer. (A) Minimum free energy secondary structure with the maximum ladder path highlighted in magenta and flexible joints or branch points as blue dots. (B) Tree graph representation of the secondary structure in (A), with illustration of the partitioning into two halves (L1(j) and L-L1(j)) at bond j for R computation using the Kramers theorem (see Materials and Methods). (C) An experimentally determined secondary structure of segment 11 (60) with maximum ladder path highlighted, and experimental MLDexp and predicted 〈MLD〉 compared. (D) A representative secondary structure prediction for MS2 genomic RNA and predicted 〈MLD〉. To see this figure in color, go online.
Hydrodynamic Radii Measured by FCS and Average Computed MLDs
| Number | RNA | Class | Length (kb) | % BasePaired | 〈MLD〉 (rungs) | ||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | RV s11 | ds | 0.67 | 58 | 8.2 ± 1.1 | 11.2 ± 3.5 (9.6 ± 2) | 7 ± 1.6 quenching | 114 ± 6 | 2.10 |
| 2 | RV s11 scrambled | ds | 0.67 | 56 | 6.5 ± 1.4 | — | — | 83 ± 6 | 2.1 |
| 3 | BunVS | ss | 0.96 | 65 | 10.0 ± 1.6 | 7.2 ± 2.1 | 8 ± 3.3 | 134 ± 11 | 2.23 |
| 4 | STNV | ss | 1.2 | 62 | 11.7 ± 1.0 | 8.5 ± 1.7 | 9 ± 2 | 154 ± 7 | 2.39 |
| 5 | FHV2 | ss | 1.4 | 62 | 11.9 ± 2.0 | 9.4 ± 2.6 | 8.3 ± 2 | 176 ± 24 | 2.76 |
| 6 | Ef2 | m | 1.8 | 60 | 8.8 ± 1.4 | 9.4 ± 1.6 | 9.7 ± 1.6 | 184 ± 14 | 3.12 |
| 7 | 16S rRNA | r | 1.55 | 64 | 17.5 ± 4.0 | 14 ± 4.8 | quenching | 149 ± 26 | 2.56 |
| 8 | HOTAIR | lnc | 2.4 | 61 | 16.2 ± 2.0 | 12.5 ± 2.4 | 13.4 ± 4.7 | 264 ± 19 | 3.39 |
| 9 | 5′-MS2 | ss | 2.5 | 69 | 10.7 ± 1.2 | 9.8 ± 0.6 | 10.3 ± 1.7 | 167 ± 17 | 2.74 |
| 10 | 3′-MS2 | ss | 2.6 | 69 | 13.8 ± 1.3 | 10.8 ± 0.8 | 10.5 ± 1 | 159 ± 9 | 2.68 |
| 11 | NRON | lnc | 2.6 | 58 | 17.6 ± 2.7 | 15.3 ± 3 | 13.7 ± 2.6 | 212 ± 11 | 3.14 |
| 12 | 23S rRNA | r | 2.9 | 63 | 14.2 ± 2.5 | 11.3 ± 2.2 | quenching | 252 ± 24 | 3.25 |
| 13 | FHV 1 | ss | 3.1 | 62 | 15.6 ± 2.0 | 11.7 ± 4.3 | 9.6 ± 3.4 | 224 ± 13 | 3.12 |
| 14 | RV s1 | ds | 3.3 | 58 | 18.4 ± 3.4 | 15.3 ± 2.2 | 18.1 ± 9 aggregation | 319 ± 24 | 3.66 |
| 15 | MS2 | ss | 3.6 | 69 | 12.3 ± 0.6 | 11.3 ± 1.7 | 9.2 ± 1 | 188 ± 18 | 2.92 |
| 16 | RpoB | m | 3.6 | 64 | 18.3 ± 2.7 | 12 ± 1.2 | 10.6 ± 2 | 289 ± 20 | 3.69 |
| 17 | TCV | ss | 4.5 | 63 | 16.5 ± 1.7 | 14.7 ± 4.5 | 12.4 ± 4.5 | 341 ± 21 | 3.85 |
| 18 | BunV L | ss | 6.9 | 59 | 14.7 ± 2.4 | 11.7 ± 1.8 | 12.5 ± 2 | 375 ± 17 | 4.03 |
| 19 | HCV | ss | 8.9 | 64 | 33.1 ± 5.3 | 20.1 ± 2.6 | 18.8 ± 2.8 | 567 ± 43 | 4.81 |
RV s1 and s11, human Rotavirus segment 1 and 11 precursors (single-stranded); BunVS and BunVL-Bunyamwera virus, small and large segment precursors, respectively (single-stranded); STNV, Satellite Tobacco Necrosis Virus genomic RNA; FHV1 and FHV2, Flock House Virus RNA1 and 2; Ef2 mRNA, X. laevis Ef2 gene transcript; 5′-MS2- 5′ end of MS2 phage genomic RNA (nucleotides 1–2469); 3′-MS2- 3′ end of MS2 phage genomic RNA (nucleotides 992–3569); TCV, Turnip Crinkle Virus genomic RNA; and HCV, Hepatitis C Virus genomic RNA.
ds, single-stranded precursors of dsRNA viral genomes; ss, genomes of ssRNA viruses; m, cellular mRNA; r, ribosomal RNA; and lnc, long noncoding RNA.
Percentage of basepairing averaged over 100 predictions.
Measured in 20 mM MOPS-K+, pH 7.
The values are reported as average ± SD computed from at least 10 measurements. Long RNA molecules were transcribed and 5′ (or 3′; see Materials and Methods) end-labeled with Alexa Fluor 488 (Thermo Fisher Scientific), purified and subsequently checked for integrity by denaturing agarose gel electrophoresis (Fig. S1). In a few cases, quenching or aggregation affected or prevented determination of the diffusion correlation time.
Measured in 10 mM MgCl2 in 20 mM MOPS-K+, pH 7.
Measured in 1 mM spermidine in 20 mM MOPS-K+, pH 7.
Computed by averaging over 100 predictions (± SD).
Computed using Kramers theorem.
Figure 2(A) Measured R as a function of nucleotide length (in kb). Numbering of RNAs is according to their increasing length (Table 1) and coloring is according to the class (black, single-stranded precursors of dsRNA viral genomes; red, genomes of ssRNA viruses; blue, cellular mRNAs; green, ribosomal RNA; and cyan, long noncoding RNAs. Lines and curves represent best fits to different linear polymer models: charged (red, Eq. 1, ν = 1, reduced χ2 = 35.85), simple Gaussian coil (blue, Eq. 1, ν = 0.5, reduced χ2 = 13.37), and a self-avoiding coil (green, Eq. 1, ν = 0.59, reduced χ2 = 14.85). (B) Correlation between R and R predicted from 〈MLD〉 (in bp units); solid line is the best fit with reduced χ2 = 11.28. (C) Correlation between R and R predicted from Kramers theorem (in units of the average segment length, a.u.); solid line is the best fit with reduced χ2 = 12.74. RNA color coding and numbering is the same as in (A). Error bars were omitted for clarity; see Table 1 for standard deviations. To provide directly comparable reduced χ2 values, all fitting was performed using the same nonlinear Levenberg-Marquardt algorithm in OriginPro (OriginLab, Northampton, MA). To see this figure in color, go online.
Figure 3Hydrodynamic size reduction in the presence of Mg2+ (A) or spermidine Sp3+ (B). Coloring and numbering scheme is as in Fig. 2. R values that were compromised by either quenching or possible aggregation (RV s1 and s11 in Table 1) in the presence of multivalent cations were omitted from the plot. Linear regression lines with slopes 0.77 ± 0.03, Pearson’s r = 0.89 for Mg2+ and 0.73 ± 0.04, and Pearson’s r = 0.87 for Sp3+, respectively, are shown. To see this figure in color, go online.