Literature DB >> 21133351

Fluorescence competition and optical melting measurements of RNA three-way multibranch loops provide a revised model for thermodynamic parameters.

Biao Liu1, Joshua M Diamond, David H Mathews, Douglas H Turner.   

Abstract

Three-way multibranch loops (junctions) are common in RNA secondary structures. Computer algorithms such as RNAstructure and MFOLD do not consider the identity of unpaired nucleotides in multibranch loops when predicting secondary structure. There is limited experimental data, however, to parametrize this aspect of these algorithms. In this study, UV optical melting and a fluorescence competition assay are used to measure stabilities of multibranch loops containing up to five unpaired adenosines or uridines or a loop E motif. These results provide a test of our understanding of the factors affecting multibranch loop stability and provide revised parameters for predicting stability. The results should help to improve predictions of RNA secondary structure.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21133351      PMCID: PMC3032278          DOI: 10.1021/bi101470n

Source DB:  PubMed          Journal:  Biochemistry        ISSN: 0006-2960            Impact factor:   3.162


RNA multibranch loops (junctions) are ubiquitous in known RNA secondary structures, including tRNA (1), rRNA (2,3), the HIV-1 genome (4), and ribozymes (5). They can be characterized by the number of helices forming the loop, the number of unpaired nucleotides located in the junction, the base pairs closing the junction, and other factors. Despite the prevalence, the factors that determine the stabilities of RNA multibranch loops are poorly understood. Research on DNA multibranch loops sheds some light onto factors that are likely to be important for RNA multibranch loops. The dependence of structure on the number and type of unpaired nucleotides in DNA multibranch loops has been studied extensively for loops of three, four, and five helices (6−11). DNA multibranch loops with unpaired purines in the junction have different stacking conformations than similar loops containing unpaired pyrimidines (6,7). Furthermore, the effects of unpaired nucleotides on the stability of the junction are dependent on the number of helices forming the junction (8). The presence of two unpaired nucleotides allows formation of a stable DNA three-way junction in which two of the helices coaxially stack, but unpaired nucleotides are not required to stabilize a four-way DNA junction (8−11). Approaches to estimate the free energy change of forming an RNA multibranch loop include (1) optimizing the prediction of known RNA secondary structures (12), (2) fitting experimental results (13−15), and (3) using a combination of these approaches (16). Knowledge-based statistical methods (17−19) could also be used. Coaxial stacking of helixes is observed in RNA multibranch loops (1,20−29) and can often be predicted on the basis of thermodynamics (30). RNA secondary structure prediction algorithms, such as MFOLD (12) and RNAstructure (15), use a simple model to predict multibranch loop stability (12,15). A more thorough understanding of factors contributing to RNA multibranch loop stability should improve the model used to predict stability and therefore the ability to accurately predict RNA secondary structure from sequence. The number of known RNA sequences is rapidly increasing, and it is important to understand structure−function relationships for RNA. Accurate prediction of secondary structure will accelerate discovery of these relationships and may help in assigning functions to RNA motifs and to new RNAs. This study uses a two-strand system modified from previous experiments (14) to measure by optical melting (31) and fluorescence competition assays (FCA) the influence of the number and the type of unpaired nucleotides on the stability of RNA three-way multibranch loops. A common motif in rRNAs, the loop E motif (3), was included in the investigation. Competition binding experiments with gel electrophoresis were used to determine the relative stabilities of DNA multibranch loops (8,32). FCA has been used to study nucleic acid thermodynamics, including stabilities of DNA duplexes (33) and of RNA pseudoknots (34).

Materials and Methods

Materials

Oligonucleotides 5′CCGUCaCCUGC3′, 5′CCGUCa2CCUGC3′, 5′CCGUCuCCUGC3′, and 5′CCGUCu2CCUGC3′ were synthesized on an Applied Biosystem 392 DNA/RNA synthesizer with phosphoramidite A from Glen Research and phosphoramidites G, C, and U from Proligo, Inc. Strands were treated and purified as previously described (14). Purity of products was checked by 20% polyacrylamide gel electrophoresis with 5′ end γ-ATP labeling. All other oligonucleotides were purchased from Integrated DNA Technologies, Inc. (IDT), which purified them by HPLC and tested molecular weights by mass spectroscopy. Fluorescent oligonucleotides had fluorescein attached by a linker of six carbons (6-FAM). The single strand oligonucleotides’ concentrations were calculated from 80 °C absorbance and single-strand extinction coefficients by a nearest-neighbor model (35,36).

UV Melting of Duplexes

Optical melting was performed in standard melting buffer: 1.0 M NaCl, 20 mM sodium cacodylate, and 0.5 mM Na2EDTA, pH 7.0. For each sample melted, equal molar amounts of the two strands in doubly distilled water were mixed, dried in spin vacuum, and then dissolved in standard melting buffer. The sample was annealed at 80 °C for 2 min and cooled to 20 °C at a rate of ∼5 °C/min. Absorbance versus temperature melting curves were measured at 280 nm with a heating rate of 1 °C/min on a Beckman Coulter DU 640 spectrophotometer controlled by a Beckman Coulter high-performance temperature controller cooled with water flow at 25 °C. Duplexes were melted over a 60-fold range in oligonucleotide concentration. Data were analyzed by fitting the transition to a two-state model with sloping baselines using a nonlinear least-squares program (37,38).

Fluorescence Competition Assay (FCA)

The equilibrium for the fluorescence competition assay is illustrated in Figure 1A. A reference structure was formed by a long strand and fluorescein-labeled short strand at 1:1 molar ratio in standard melting buffer. The initial concentration of duplex, G_CG_G/(F)Ca2C (see Table 1 for nomenclature), was 30 μM, and that of CgaaaCGaG/(F)CcaguaG (Table 1) was 1 μM, so that the strands are completely in duplex. The complex was titrated with a solution containing nonfluorescent short strand at high concentration and long strand and fluorescein-labeled short strand at their initial concentrations. For each titration point, the solution was annealed for 3 min at 75 °C in a water bath and then equilibrated at 37 °C for at least 15 min in a 4 × 4 mm quartz cuvette in a HORIBA Jobin Yvon Fluorolog-3 spectrofluorometer. To check for equilibrium, fluorescence intensity was measured for 1 min at intervals of 5 min. If no change within error was observed between two measurements, then the average over the last 1 min was taken as the fluorescence intensity of the titration point. Fluorescence was excited at 494 nm with a 1 nm band-pass, and emission was measured at the peak of the emission, 520 nm, with a 0.5 nm band-pass. The fluorescence intensities of the single strand fluorescent oligonucleotides, 5′FCCGUCa2CCUGC3′ (i.e., “(F)Ca2C”) at 30 μM and 5′FCCGUCcaguaGCUGC3′ (i.e., “(F)CcaguaC”) at 1 μM, were measured to provide the fluorescence intensity at saturation when they were completely displaced by the nonfluorescent oligonucleotide. Successive scans of the samples with only “(F)Ca2C” or “(F)CcaguaC” resulted in minimal change in the fluorescence intensity, indicating photobleaching was negligible.
Figure 1

(A) Illustration of fluorescence competition assay to measure the free energy of multibranch loop formation. As the competitor strand, C, is titrated into the solution of “reference” structure, RS, the competitor strand will substitute the fluorescein-labeled short strand in the “reference” structure to form the new multibranch loop, MBL. The fluorescein-labeled short strand, FS, is freed from the “reference” structure, and the fluorescence intensity of the solution changes. (B) Typical titration curve and fitting for FCA. Here the system is G_CG_G/Ca3C (Table 1). The “reference” structure concentration is 30 μM before titration, and the competitor strand is 400 μM in the titration solution.

Table 1

Nomenclature for Three-Way Multibranch Loop Systems Studied

grouplong strandashort strandaname
15′GCAGG_CGGCUUCGGCCG_GACGG3′5′FCCGUCa2CCUGC3′G_CG_G/(F)Ca2C
  5′CCGUC_CCUGC3′G_CG_G/C_C
  5′CCGUCaCCUGC3′G_CG_G/CaC
  5′CCGUCa2CCUGC3′G_CG_G/Ca2C
  5′CCGUCa3CCUGC3′G_CG_G/Ca3C
  5′CCGUCa4CCUGC3′G_CG_G/Ca4C
  5′CCGUCa5CCUGC3′G_CG_G/Ca5C
  5′CCGUCuCCUGC3′G_CG_G/CuC
  5′CCGUCu2CCUGC3′G_CG_G/Cu2C
  5′CCGUCu3CCUGC3′G_CG_G/Cu3C
  5′CCGUCu4CCUGC3′G_CG_G/Cu4C
  5′CCGUCu5CCUGC3′G_CG_G/Cu5C
25′GCAGCgaaaCGGCUUCGGCCGaGACGG3′5′FCCGUCcaguaGCUGC3′CgaaaCGaG/(F)CcaguaG
  5′CCGUCcaguaGCUGC3′CgaaaCGaG/CcaguaG
  5′CCGUCcagaGCUGC3′CgaaaCGaG/CcagaG

Nucleotides in the junction are in lower case. An underscore indicates a helix−helix interface without unpaired nucleotides.

(A) Illustration of fluorescence competition assay to measure the free energy of multibranch loop formation. As the competitor strand, C, is titrated into the solution of “reference” structure, RS, the competitor strand will substitute the fluorescein-labeled short strand in the “reference” structure to form the new multibranch loop, MBL. The fluorescein-labeled short strand, FS, is freed from the “reference” structure, and the fluorescence intensity of the solution changes. (B) Typical titration curve and fitting for FCA. Here the system is G_CG_G/Ca3C (Table 1). The “reference” structure concentration is 30 μM before titration, and the competitor strand is 400 μM in the titration solution. Nucleotides in the junction are in lower case. An underscore indicates a helix−helix interface without unpaired nucleotides.

Calculating Multibranch Loop Stability from FCA Data

Below is the derivation of the equation for fluorescence intensity as a function of solution composition for FCA titrations. Relative to fluorescein, the fluorescence from nucleotides can be neglected. The fluorescence comes from fluorescein in the free fluorescent single strand (FS) or “reference” structure (RS). So, the fluorescence intensity, F, isHere, fFS and fRS are the fluorescence efficiencies for the free fluorescent single strand and “reference” structure, respectively, [FS] is the concentration of free fluorescein single strand, and [RS] is the concentration of “reference” structure. The total concentration of fluorophore isFrom eqs 1 and 2 As the competition strand was titrated into the solution, some fluorescent short strands in the “reference” structures would be substituted by the nonfluorescent competition strand to form a nonfluorescent multibranch loop (MBL). In the solution, the competition strand is present as free competition strand ([C] for concentration) and as part of newly formed multibranch loop ([MBL] for the concentration), so The long strand RNA could be in different forms: “reference” structure, new multibranch loop, and unimolecular hairpin form ([H] for hairpin concentration). So, the total concentration of long strand RNA ([LS]T for its concentration) can be expressed as The experiments are performed under conditions in which [H] is negligible compared to [RS] + [MBL]. Also, the [LS]T equals [FS]T because the long strand and the fluorescent short strand were mixed in a ratio of 1:1 to form the “reference” structure. Therefore, eq 5 can be approximated asThe equilibrium constant, KMBL, for the multibranch loop formation can be written asRearrange the equation, substitute [C] by eq 4b, and then substitute [MBL] by eq 6The equilibrium constant, KRS, for the “reference” structure can be written asRearrange the equation and substitute [FS] by eq 2bFrom eqs 8 and 10Equation 11 can be rewritten asSolving for [RS] givesBecause [RS] cannot be larger than [FS]T or less than 0, the only valid solution isSubstituting [RS] in eq 3 by eq 14, the fluorescence intensity can be expressed as The value of fFS can be measured from the solution of free fluorescent single strand, fRS can be known from the “reference” structure solution, and [C]T and [FS]T are the concentrations added, so the titration curve was fit to KMBL by the Newton−Gaussian method as implemented in Statistical Analysis System (SAS) from SAS Institute Inc.

Fitting Experimental Data into Energetic Models

Previously published data (13,14) and results reported here for group 1 systems (see below) were fit to energetic models by linear regression analysis as implemented in the R Project for Statistical Computing (http://www.r-project.org). Only systems with ΔH°’s that differed by <30% when determined by 1/TM vs ln(CT/4) analysis and by fitting melting curves were used in the linear regression. The systems excluded from previous regression analysis (13) were also excluded from the linear regression here as were the group 2 sequences listed in Table 1 (see Supporting Information). The data from 1/TM vs ln(CT/4) analysis were used for linear regression as previously (13), except the FCA ΔG°37 was used for system G_CG_G/C_C because the optical melts were not two state (Tables 1 and 2). The p-value of the F test was used as a criterion for judging the correlation of a term to a model. If the p-value is less than 0.10, then the corresponding term is considered highly correlated to the model.
Table 2

Thermodynamics for Duplex Formation Creating a Multibranch Loop from Long Hairpin Strand and Unstructured Short Strand, As Determined by Fluorescence Competition Assay and Optical Melting, in 1 M NaCl, 20 mM Sodium Cacodylate, and 0.5 mM Na2EDTA, pH 7

 FCAa1/TM vs ln(CT/4)
average of melt curve fits
system−ΔG°37 (kcal/mol)−ΔG°37 (kcal/mol)−ΔH° (kcal/mol)−ΔS° (eu)TM (°C)−ΔG°37 (kcal/mol)−ΔH° (kcal/mol)−ΔS° (eu)TM (°C)
G_CG_G/(F)Ca2C 11.34 ± 0.2391.60 ± 4.84258.78 ± 14.9254.211.25 ± 0.5288.48 ± 11.31249.00 ± 34.9354.5
G_CG_G/C_C8.89(8.37 ± 0.13)(49.69 ± 3.71)(133.23 ± 11.75)(48.9)(9.00 ± 0.72)(75.65 ± 17.07)(214.88 ± 53.00)(47.5)
G_CG_G/CaC9.469.15 ± 0.2573.07 ± 6.84206.08 ± 21.4948.59.06 ± 0.4871.53 ± 14.68201.42 ± 45.9148.3
G_CG_G/Ca2C10.8610.50 ± 0.1684.13 ± 4.03237.39 ± 12.5052.410.51 ± 0.2583.73 ± 6.25236.10 ± 19.3952.5
G_CG_G/Ca3C10.8911.09 ± 0.2585.90 ± 5.46241.18 ± 16.8254.411.39 ± 0.6192.54 ± 12.01261.65 ± 36.9754.2
G_CG_G/Ca4C 11.28 ± 0.1387.23 ± 3.00244.90 ± 9.2654.911.67 ± 0.9893.46 ± 23.95263.71 ± 74.1155.1
G_CG_G/Ca5C 11.71 ± 0.2697.11 ± 5.74275.35 ± 17.7154.512.04 ± 0.42104.70 ± 7.76298.76 ± 23.7454.2
G_CG_G/CuC9.088.44 ± 0.1274.26 ± 5.23212.24 ± 16.6345.28.35 ± 0.1776.17 ± 14.45218.66 ± 46.5844.6
G_CG_G/Cu2C10.7210.54 ± 0.3077.63 ± 7.54216.30 ± 23.4953.910.71 ± 0.3083.28 ± 8.22233.97 ± 25.6453.4
G_CG_G/Cu3C11.0711.16 ± 0.2688.97 ± 5.75250.87 ± 17.7254.011.55 ± 0.5497.67 ± 11.90277.67 ± 36.7853.8
G_CG_G/Cu4C 11.51 ± 0.1494.10 ± 3.10266.28 ± 9.5554.311.71 ± 0.6498.43 ± 14.03279.61 ± 53.2554.2
G_CG_G/Cu5C 11.06 ± 0.1889.64 ± 4.34253.35 ± 13.4553.511.31 ± 0.6496.22 ± 15.42273.77 ± 47.7353.2
CgaaaCGaG/(F)CcaguaG 13.08 ± 0.29102.63 ± 5.15288.74 ± 15.7058.112.50 ± 0.4391.24 ± 8.69253.86 ± 26.7058.7
CgaaaCGaG/CcaguaG12.2412.46 ± 0.4097.56 ± 7.48274.40 ± 22.8757.112.23 ± 0.7991.43 ± 13.73255.38 ± 41.7757.6
CgaaaCGaG/CcagaG11.0511.27 ± 0.2984.16 ± 5.50235.04 ± 16.8955.511.24 ± 0.9581.94 ± 18.98227.98 ± 58.1855.9

The errors of FCA are from estimating free energies of reference structures because errors from fitting titration curves are small. The errors are ±0.23 and ±0.29 for systems in groups 1 and 2, respectively, which are errors for ΔG°37 of the reference structures from 1/TM vs ln(CT/4) analysis.

The errors of FCA are from estimating free energies of reference structures because errors from fitting titration curves are small. The errors are ±0.23 and ±0.29 for systems in groups 1 and 2, respectively, which are errors for ΔG°37 of the reference structures from 1/TM vs ln(CT/4) analysis.

Results

Design of the System

Systems studied here are divided into two groups (Table 1 and Figure 2). Group 1 provides insight into the influence of number and type of unpaired nucleotides on the stabilities of RNA three-way multibranch loops (Figure 1A,B). Group 2 provides insight into the thermodynamics of the loop E motif in multibranch loops (Figure 1C,D). The systems studied here (Figure 1) are based on the two-strand system used previously to study free energy increments of RNA multibranch loops (13,14). The long strand can fold into a hairpin, which is similar to the hairpin component in the multibranch loop (Figure 3) and which has a melting temperature greater than 70 °C (14). The short strands are predicted to have no self-structures by RNAstructure (15). Each intermolecular helix formed by association of the two strands contains five base pairs, rather than the four used initially (14). This increased the cooperativity of the melts so the melting is more likely to be a two-state transition. The extra stability imparted by the two additional GC pairs also increased the equilibrium constant. This allows the fluorescence competition assays to be performed at 37 °C because less than 2% of the long strand was ever single strand at the oligonucleotide concentrations used in these experiments.
Figure 2

Sequence design. Group 1 is designed to study the influence of the number and type of unpaired nucleotides on multibranch loop stability. (A) “Reference” structure for fluorescence competition assay to analyze the free energy of systems in group 1. (B) Secondary structures of systems studied. Group 2 is designed to investigate the stability of a loop E motif (5′GAA/3′AUGA) in a multibranch loop. (C) “Reference” structure for fluorescence competition assay to analyze the free energy of systems in group 2. The noncanonical base pairing and some tertiary interactions in the loop E motif are shown (78). (D) A variant of loop E motif in multibranch loop.

Figure 3

Optical melting of three-way multibranch loops G_CG_G/CaC at 44.5 μM (black squares), G_CG_G/Ca2C at 25.0 μM (red diamond), and CgaaaCGaG/CcaguaG at 23.5 μM (green triangle).

Sequence design. Group 1 is designed to study the influence of the number and type of unpaired nucleotides on multibranch loop stability. (A) “Reference” structure for fluorescence competition assay to analyze the free energy of systems in group 1. (B) Secondary structures of systems studied. Group 2 is designed to investigate the stability of a loop E motif (5′GAA/3′AUGA) in a multibranch loop. (C) “Reference” structure for fluorescence competition assay to analyze the free energy of systems in group 2. The noncanonical base pairing and some tertiary interactions in the loop E motif are shown (78). (D) A variant of loop E motif in multibranch loop. Optical melting of three-way multibranch loops G_CG_G/CaC at 44.5 μM (black squares), G_CG_G/Ca2C at 25.0 μM (red diamond), and CgaaaCGaG/CcaguaG at 23.5 μM (green triangle). In each group of systems, the short strands (Table 1) are interchangeable without affecting the helices. Therefore, only a single short strand is required to be fluorescently labeled to form a reference structure with the long strand (Figure 2A,C) for the FCA. By competition, the effect of changing the nucleotide sequence on the short strand in the junction could always be compared to the same reference structure. Therefore, the free energy changes at 37 °C could be determined by FCA and compared with that determined by optical melting. In group 1 systems, unpaired nucleotides were added between the right- and left-hand helices. The predicted preferred stacking arrangement of helixes is the same for all of the systems studied; viz., the left-hand helix stacks on the hairpin stem. Therefore, differences in coaxial stacking of helixes do not have to be considered as more nucleotides are added. The number of unpaired nucleotides was systematically changed from one to five adenosines or uridines to search for trends associated with number and type of unpaired nucleotide. Unpaired G’s and C’s were not used because secondary structure predictions indicated that such constructs would have a large number of stable structures. Group 2 systems (Table 1 and Figure 2C,D) provide thermodynamic properties for the loop E motif (nucleotides in green in the dashed box in Figure 2C) and a variation in a three-way multibranch loop. Besides the loop E motif, three other nucleotides (A9, A22, and C33) were included in the junction so the system has enough flexibility to keep the loop E motif in its native conformation. Actually, 5′gaaa/3′augac (Figure 2C, nucleotides in green) is common in rRNA (3). For a comparison, system CgaaaCGaG/CcagaG in which U36 was deleted from system CgaaaCGaG/CcaguaG was measured. Because U36 forms a trans-Hoogsteen base pair with A7 and a hydrogen bond to the G6 ribose, the loop E motif is broken. In group 2, the right-hand strand and the hairpin strand (Figure 2C) are predicted to coaxially stack onto each other with an intervening noncanonical pair, and the loop E motif will stack onto the left helix.

Thermodynamics Determined by Optical Melting

As predicted with RNAstructure (15), absorbance detected melts of the individual short strands in Table 1 revealed no self-structure. Melts of the long strand (Table 1) also revealed no self-structure other than that of the hairpin, which melted at temperatures above 70 °C. The inter- and intramolecular transitions are sufficiently separate that the intermolecular transition can be analyzed independently from the hairpin transition. Typical melting curves for systems G_CG_G/CaC, G_CG_G/Ca2C, and CgaaaCGaG/CcaguaG are shown in Figure 3. The results of the melts are presented in Table 2. Except for system G_CG_G/C_C, which contains no unpaired nucleotides, the enthalpy changes determined by the average of the melts’ curve fits were within 15% of those determined by the TM−1 vs ln(CT/4) fits. This is consistent with the two-state model. For system G_CG_G/C_C, there is little predicted difference in the free energies of coaxial stacking for all three possible arrangements. This may account for the non-two-state behavior. TM−1 vs ln(CT/4) analysis of the melts of systems G_CG_G/(F)Ca2C and CgaaaCGaG/(F)CcaguaG revealed 0.84 and 0.62 kcal/mol more favorable free energy difference relative to that of system G_CG_G/Ca2C and CgaaaCGaG/CcaguaG, respectively (Table 2). The free energy increments for 5′ fluorescein linking to 5′C/G3′ from these two measurements agree with each other and with our previous result, 0.82 kcal/mol (34), within experimental error. The presence of 5′ fluorescein does not perturb the RNA structure (34). Therefore, it was assumed that this enhancement in stability is due to the free energy bonus of a stacked fluorescein on a 5′ C in a CG pair and/or a decrease in the number of conformations available to the single strand due to volume exclusion. The free energies listed in Table 2 are for the association of a long strand containing a hairpin and a short strand to form a multibranch loop, ΔG°37,bimol. From TM−1 vs ln(CT/4) analysis, in systems n in group 1 (Table 1 and Figures 1A and 2B) as m = 2, 3, or 4, ΔG°37,bimol’s for A or U are the same within experimental error. As m = 1 or 5, ΔG°37,bimol’s for G_CG_G/CamC are more favorable by about 0.7 kcal/mol than that for G_CG_G/CumC. As unpaired nucleotides increase, systems G_CG_G/CamC get more stable or stay the same within experimental error, and systems G_CG_G/CumC have the same trend except for system G_CG_G/Cu5C, which is 0.45 kcal/mol less stable than G_CG_G/Cu4C.

Fluorescence Competition Assays

A typical FCA titration curve is shown in Figure 1B. Systems G_CG_G/(F)Ca2C and CgaaaCGaG/(F)CcaguaG (Figure 2A.C) are the reference structures for systems in groups 1 and 2, respectively. Their ΔG°37’s determined by optical melting were used as the standard for analysis of FCA for systems in each group. Upon titration with competing strand, the fluorescence increased roughly 1−2-fold as the fluorescein-labeled strand was released from the multibranch loop. The results of competition assays are listed in Table 2. When all sources of experimental error are considered, the free energy changes at 37 °C measured by fluorescence competition assays and by optical melting are within experimental error.

Free Energy Increments for Multibranch Loops

Optical melting experiments measure the thermodynamics for dissociation of multibranch loops (Figure 4). The fluorescence competition assay here relies on the optical melting of reference structures. The free energy increment for the multibranch loop, ΔG°37,MBL, can be determined with a nearest-neighbor model as previously described (13,14):Here, ΔG°37,helix’s are for forming the base pairs in the two helices, ΔG°37,bimol init is the penalty for bimolecular initiation, and ΔG°37,prod mm is for possible stacking of mismatches formed on the helix in the product hairpin strand (Figure 4). Previously (13), ΔG°37,prod mm was assumed to include only the increment for the first terminal mismatch of the product hairpin (Figure 4, dashed box). More recent experiments revealed that up to the second and third 3′ dangling ends (Figure 4, dashed and dark box) may affect the stability of RNA helices (39−41). Counting these dangling ends may change ΔG°37,prod mm and therefore the reported ΔG°37,MBL[13,14]. For example, in ref (13), as the three-way multibranch loop systems melt, the newly formed hairpin 5′GGCAG_GCGCUUCGGCGC_GGAGG3′ has a string of purines dangling at the 3′ end. The new parameters (39) predict the second and third dangling purines will make the hairpin 0.8 kcal/mol more favorable than the previous prediction (13). For the three-way multibranch loop system, GaCG_G/CauaC in ref (14), the hairpin has a dangling end of 5′GCA3′, which leads to a 0.6 kcal/mol more favorable folding (39) than assumed previously. These effects were used to recalculate ΔG°37,MBL from previously published experiments (see Supporting Information). For four-way multibranch loops measured previously (13), the newly formed hairpins will stack onto each other so the dangling ends do not stabilize the long strand product. Some minor calculation errors for four-way multibranch loops were corrected, however (see Supporting Information).
Figure 4

Diagram of multibranch loop dissociation in melting. As the multibranch loop dissociates, the newly released nucleotides from the left and right helices stack onto the hairpin stem, so that the first mismatch and second and third 3′ dangling nucleotides stabilize the hairpin.

Diagram of multibranch loop dissociation in melting. As the multibranch loop dissociates, the newly released nucleotides from the left and right helices stack onto the hairpin stem, so that the first mismatch and second and third 3′ dangling nucleotides stabilize the hairpin.

Free Energy Increments for Multibranch Loop Initiation

ΔG°37,MBL was modeled to be the sum of the initiation penalty for multibranch loop formation, ΔG°37,MBL init, and the favorable interactions in the multibranch loop, ΔG°37,MBL stacking, which include the dangling ends, coaxial stacking, and mismatches in the multibranch loop (13,14): For systems in group 1, the calculations are simplified by the fact that all systems are predicted to stack the left helix on the hairpin stem (Figures 1 and 2B). For systems in group 2, the right helix is predicted to stack on the hairpin helix with an intervening AC pair between two GC pairs closing the multibranch loop (Figure 2C,D). The coaxial stacking can be decomposed into two nearest neighbors (12,42,43):Unlike the dangling ends in the hairpin product, which are unconstrained, the unpaired regions in multibranch loops are fixed at both ends and fit into limited space. Two multibranch loops with equal numbers and distributions of unpaired nucleotides could be in different conformations (see Supporting Information). Also, noncanonical interactions are common in and with multibranch loops but are hard to predict. Examination of three-way multibranch loops in 3D structures reveals that usually only the first dangling end stacks on the closing base pair (see Supporting Information). Therefore, only single dangling end stacking is included in eq 17 as stabilizing the multibranch loop. This is consistent with the previous model (13,14). ΔG°37,MBL, ΔG°37,MBL stacking, and ΔG°37,MBL init are listed in Table 3 as derived from TM−1 vs ln(CT/4) plots of optical melting data, from fluorescence competition assays, and from nearest-neighbor parameters. For most cases, ΔG°37,MBL and ΔG°37,MBL init become more favorable or do not change within experimental error as the number of unpaired nucleotides increases. ΔG°37,MBL’s for the same number of unpaired adenosines or uridines are the same, and so are the ΔG°37,MBL init’s. The results for ΔG°37,MBL init are plotted in Figure 5.
Table 3

Free Energies for Three-Way Multibranch Loopsa

    predicted ΔG°37,MBL init (kcal/mol)
systemΔG°37,MBL (kcal/mol)ΔG°37,MBL stackingb (kcal/mol)ΔG°37,MBL init (kcal/mol)eq 24ceq 25d
G_CG_G/C_C6.42 (5.90)−3.429.84 (9.32)9.8710.00
G_CG_G/CaC5.64 (5.33)−5.1210.76 (10.45)10.3610.28
G_CG_G/Ca2C4.29 (3.93)−5.129.41 (9.05)8.148.60
G_CG_G/Ca3C3.70 (3.90)−5.128.82 (9.02)8.638.88
G_CG_G/Ca4C3.51−5.128.638.528.60
G_CG_G/Ca5C3.08−5.128.208.418.32
G_CG_G/CuC6.35 (5.71)−4.6210.97 (10.33)10.3610.28
G_CG_G/Cu2C4.25 (4.07)−4.628.87 (8.69)8.148.60
G_CG_G/Cu3C3.63 (3.72)−4.628.25 (8.34)8.638.88
G_CG_G/Cu4C3.28−4.627.908.528.60
G_CG_G/Cu5C3.73−4.628.358.418.32
CgaaaCGaG/CcaguaG2.79 (3.01)−4.467.25 (7.47)7.866.92
CgaaaCGaG/CcagaG3.98 (4.20)−4.468.44 (8.66)7.977.20

Values of ΔG°37,MBL and ΔG°37,MBL init not in parentheses were calculated from free energy changes determined from TM−1 vs ln(CT/4) plots of optical melting data. Values in parentheses were calculated from free energy changes determined by fluorescence competition assay. Equations 16 and 17, respectively, were used to calculate ΔG°37,MBL and ΔG°37,MBL init.

Single nucleotide dangling end or noncanonical pair was considered, and parameters are from ref (42). For systems with a or u loops (group 1), the left helix and hairpin strand (Figure 2A) are predicted to stack on each other, with parameters from ref (80), to update those from ref (81). For bottom two systems (group 2), the right helix and hairpin strand (Figure 2C) are predicted to stack on each other with an intervening noncanonical pair (12).

Predictions are by eq 24, with aG, bG, cG, dG, and ΔG°37,strain equal to 6.39, −0.14, 0.25, 1.05, and 2.69 kcal/mol, respectively.

Predictions are by eq 25, with aG′, bG′, dG′, and ΔG°37,strain′ equal to 8.06, −0.28, 0.83, and 1.94 kcal/mol, respectively.

Figure 5

ΔG°37,MBL init plots for G_CG_G/CnmC. Black rectangles are for G_CG_G/CamC measured by UV melting, red circles are for G_CG_G/CumC measured by UV melting, green triangles are for G_CG_G/CamC measured by FCA, blue upside-down triangles are for G_CG_G/CumC measured by FCA, purple left-facing triangles are for CgaaaCGaG/CcaguaG and CgaaaCGaG/CcagaG measured by UV melting, burgundy right-facing triangles are for CgaaaCGaG/CcaguaG and CgaaaCGaG/CcagaG measured by FCA, and cyan diamonds are for G_CG_G/CnmC predicted by eq 24.

Values of ΔG°37,MBL and ΔG°37,MBL init not in parentheses were calculated from free energy changes determined from TM−1 vs ln(CT/4) plots of optical melting data. Values in parentheses were calculated from free energy changes determined by fluorescence competition assay. Equations 16 and 17, respectively, were used to calculate ΔG°37,MBL and ΔG°37,MBL init. Single nucleotide dangling end or noncanonical pair was considered, and parameters are from ref (42). For systems with a or u loops (group 1), the left helix and hairpin strand (Figure 2A) are predicted to stack on each other, with parameters from ref (80), to update those from ref (81). For bottom two systems (group 2), the right helix and hairpin strand (Figure 2C) are predicted to stack on each other with an intervening noncanonical pair (12). Predictions are by eq 24, with aG, bG, cG, dG, and ΔG°37,strain equal to 6.39, −0.14, 0.25, 1.05, and 2.69 kcal/mol, respectively. Predictions are by eq 25, with aG′, bG′, dG′, and ΔG°37,strain′ equal to 8.06, −0.28, 0.83, and 1.94 kcal/mol, respectively.

Enthalpy Changes for Three-Way Multibranch Loops and Their Initiation

Similar to ΔG°37,MBL, ΔH°MBL can be estimated from optical melting measurements with a nearest-neighbor model:Here, ΔH°bimol is the measured ΔH° for duplex formation (Table 2), ΔH°helix’s are for the forming the base pairs in the two helices, ΔH°bimol init is the penalty for bimolecular initiation, and ΔH°prod mm is for any possible stacking of mismatches formed on the helix in the product hairpin strand. In the previous model (44), only the first mismatch is predicted to stabilize the product hairpin. Recent results reveal the second 3′ nucleotide overhang also contributes to the thermodynamics of the system (39). The enthalpy of multibranch loops, ΔH°MBL, was modeled as the sum of initiation and stacking (44): The stacking properties of the second and third nucleotide overhangs in a multibranch loop are not clear. Meanwhile, the enthalpy measurement for multibranch loop bimolecular dissociation usually has an error above ±5 kcal/mol (13), while the second overhangs usually affect the enthalpy within 2 kcal/mol (39). So the stacking effect from the second and later overhangs in a multibranch loop is neglected. The available loop enthalpies are listed in Supporting Information. Calorimetric experiments have shown that formation of multibranch loops can be associated with an unfavorable ΔH° (45). Our optical melting results agree with the expectation from calorimetric experiments.

Discussion

Rapid and widespread genomic sequencing has dramatically increased the number of known RNA sequences (46,47). Many of them have important cellular functions, and often functions are directly related to secondary and tertiary structure. RNA folding in vitro often traps structures that are not in the lowest free energy state (48). In vivo, however, RNA chaperones facilitate equilibration with the minimum free energy structure (49). Thus, computer prediction of secondary structure through free energy minimization can be a powerful tool for deducing structure (12,15,50−52) and function for a sequence (53). Improving the accuracy of secondary structure predictions is therefore important for future genomic studies. Multibranch loops remain one of the few RNA motifs not extensively studied. The diversity of their known sequences (54−59) and structures (60−62) suggests they will be involved in many different functions.

Effects of Loop Sequence

Adding an unpaired A or U to a multibranch loop is not always equivalent, but the difference is very subtle when considered on a per nucleotide basis (Table 3 and Figure 5). On the basis of optical melting, systems G_CG_G/CaC and G_CG_G/Ca5C are about 0.7 kcal/mol more stable than systems G_CG_G/CuC and G_CG_G/Cu5C. This is within experimental error of the more favorable stacking by 0.5 kcal/mol of an unpaired A on a CG pair (5′CA/3′G) compared to that of an unpaired U (5′CU/3′G). For m between 2 and 4, the additional unpaired A’s or U’s add essentially identical increments to the stability of a multibranch loop (Table 3). This differs from the observation that poly(A) and poly(U) have different stacking properties (63,64), and the sequence dependence of terminal dangling end stacking where enhancement from multiple A’s is different from that of multiple U’s (39). This may reflect the limited space in the junction which may not allow optimal stacking of the unpaired nucleotides when m = 2, 3, or 4. The more favorable stability of G_CG_G/Ca5C relative to G_CG_G/Cu5C may reflect more flexibility or potential for tertiary interactions for larger loops. In the nearest-neighbor model for predicting stability (65), all of the unpaired nucleotides and half of the nucleotides in closing base pairs are considered to be part of the loop. Thus, on the basis of optical melting data, the largest difference per nucleotide between G_CG_G/CamC and G_CG_G/CumC loops is for G_CG_G/CaC and G_CG_G/CuC systems: (9.15 − 8.44)/(1 + 6/2) = 0.18 kcal/mol per nucleotide. The differences for the others are negligible (less than 0.1 kcal/mol). For enthalpy and entropy (Table 2), the different unpaired nucleotides in the junction did not show obvious differences. ΔG°37,MBL init plots for G_CG_G/CnmC. Black rectangles are for G_CG_G/CamC measured by UV melting, red circles are for G_CG_G/CumC measured by UV melting, green triangles are for G_CG_G/CamC measured by FCA, blue upside-down triangles are for G_CG_G/CumC measured by FCA, purple left-facing triangles are for CgaaaCGaG/CcaguaG and CgaaaCGaG/CcagaG measured by UV melting, burgundy right-facing triangles are for CgaaaCGaG/CcaguaG and CgaaaCGaG/CcagaG measured by FCA, and cyan diamonds are for G_CG_G/CnmC predicted by eq 24. Systems G_GC_G/CamC (m = 0−4) were measured previously (13) and only differ from the G_CG_G/CamC systems by a closing base pair. Excluding the different stacking properties, ΔG°MBL init’s (eq 17) for these two groups agree with each other within experimental error.

Free Energy Models for RNA Multibranch Loop Initiation

The free energies for formation of multibranch loops were fit to three models. One model (12) is represented byHere, aG, bG, and cG are parameters, n is the number of unpaired nucleotides, and h is the number of helices. On the basis of optical melting experiments on a series of three- and four-way multibranch loops, it was suggested that eq 22 is a better approximation (13):Here avg asym is the average asymmetry, which reflects the distribution of unpaired nucleotides in the loop, as defined byIn eq 22, ΔG°strain is a penalty for three-way multibranch loops with zero or one unpaired nucleotide. While eq 22 fits the data better than eq 21, the avg asym term has not been included in dynamic programing algorithms. The data can also be fit to a third model that adds the number of unpaired nucleotides to eq 22: The measurements reported here and 3D structures indicate that it is a good approximation to consider only the stacking of the first dangling end or noncanonical pair when predicting ΔG°MBL, as assumed previously (13). The second and third dangling ends or noncanonical pairs will affect the energetics of the long strand hairpin stem (39−41), however, and thus affect the calculation of ΔG°37,prod mm in eq 16 and therefore ΔG°37,MBL. The adjustment for ΔG°37,MBL by adding the second and third dangling end to hairpin stability was made for available data (13,14), and the revised values are available in Supporting Information. Linear regression was used to derive the parameters for eqs 21, 22, and 24. Previous comparisons of melting and isothermal titration calorimetry results show that 1/TM vs ln(CT/4) plots give reasonably accurate ΔG° values even when melting is not strictly a two-state process (14). Furthermore, for the G_CG_G/C_C system, the ΔH°’s differed by 34% when determined by 1/TM vs ln(CT/4) and by fitting melting curves, but ΔG°37 of −8.89 kcal/mol from FCA was close to the values of −8.37 and −9.00 kcal/mol from melting curves (Table 2). Therefore, systems with ΔH°’s differing by <30% between the two methods of analysis were included in the linear regression. Group 2 systems were not included in the linear regression, because of the more complex sequence in these multibranch loops. Little is known about the sequence dependence of interactions in multibranch loops so they are not included in current energetic models. The parameters from the fit for eq 24 are listed in Table 4, and those for eqs 21 and 22 are listed in Supporting Information. The coefficients of determination, R2, and the p-value of the F test are also listed. Parameters changed little if all the available systems except the previously excluded ones (13) were included in the fit. The predictions from eqs 21, 22, and 24 for measured multibranch loops are listed in Supporting Information.
Table 4

Nearest-Neighbor Free Energy Parameters for Multibranch Loop Initiation at 37 °C

modelparametervalue (kcal/mol)error (kcal/mol)p-value
eq 24 universal modelaG6.390.863.48 × 10−10
 bG−0.140.060.0216
 cG0.250.220.2659
 dG1.050.177.23 × 10−8
 ΔG°37,strain2.690.381.88 × 10−9
  R2 = 0.6007, p-value = 3.523 × 10−12
eq 25 model for three-way MBLaG8.060.39<2 × 10−16
 bG−0.280.062.75 × 10−5
 dG0.830.171.57 × 10−5
 ΔG°37,strain1.940.385.32 × 10−6
  R2 = 0.7083, p = 3.737 × 10−13
eq 26 model for four-way MBLaG′′6.480.459.97 × 10−10
 dG′′1.360.340.0015
  R2 = 0.5248, p = 0.001503
The p-values for parameter cG in eqs 21, 22, and 24 are all >0.1, which indicate the number of helixes from the loop is not a highly correlated term in these models. When the data for ΔH° and ΔS° are fit to the equivalent of eq 24, however, the number of helixes is a reasonably well correlated term (see below). Thus we recommend retaining the cGh term in algorithms for predicting secondary structure so that they can be used for temperatures other than 37 °C. While average asymmetry is not included in dynamic programming algorithms (15), eq 24 could be used for calculating the free energies of predicted structures, so the predicted structures could be reordered. The ability of eq 24 to approximate the experimental results for ΔG°37,MBL init is shown in Table 3 and Figure 5 for the loops reported here and in Supporting Information for all loops. The data can also be fit to separate energetic models for three- and four-way multibranch loops. The ΔG°37,MBL init’s for three-way multibranch loops are fit to For four-way multibranch loops, statistical tests of free energy and enthalpy models with a term for unpaired nucleotides, n, show n is not a highly correlated term. Thus, the ΔG°37,MBL init’s for four-way multibranch loops fit well to a simpler model: The parameters from the fits to eqs 25 and 26 are listed in Table 4. The coefficients of determination, R2, and the p-value of the F test for each term are also listed. The ability of eqs 25 and 26 to approximate the experimental results for ΔG°37,MBL init is shown in Supporting Information.

Comparison with Polymer Model for Loop Initiation

The results in Table 3 and for other three-way multibranch loops (13,14) can be compared with predictions from the two-length-scale freely jointed chain (FJC) polymer model of Aalberts and Nandagopal (66). The predictions from the FJC model and eq 24 differ by as much as 4.2 kcal/mol at 37 °C for a single unpaired nucleotide but converge as the number of unpaired nucleotides increases so that the difference is only 0.9 kcal/mol for 10 unpaired nucleotides (see Supporting Information). The latter is within the propagated experimental error for eq 24. The FJC model predicts more unfavorable free energy for loop initiation, ΔG°37,FJC, as the number of unpaired nucleotides increases. In contrast, the experiments indicate that loop initiations become more favorable as the number of unpaired nucleotides increases in the three-way multibranch loops studied (Table 3, Figure 5, Supporting Information, and ref (13)). This may also reflect unfavorable entropy due to conformational constraints in the limited junction space, as is suggested by identical stacking properties of unpaired A or U as m increases up to 5. For four-way multibranch loops studied previously (13), however, added unpaired nucleotides in the loop destabilize the loop. For the available database, the average asymmetry term in eq 26 provides good fits to the data (Supporting Information). Replacing the average asymmetry term with ΔG°37,FJC or adding ΔG°37,FJC to eq 26 and refitting parameters give poorer fits. Here, ΔG°37,FJC is the loop entropy term calculated from the FJC model (66) for each multibranch loop (see Supporting Information). Experimental data and predictions from eqs 25 and 26 for RNA multibranch loops are consistent with previous observations in DNA that unpaired nucleotides stabilize three-way multibranch loops but destabilize four-way multibranch loops (8). In DNA five-way multibranch loops, unpaired nucleotides stabilize the loop (8). Evidently, in addition to polymer theory, conformational constraints or other considerations for unpaired nucleotides fitting into multibranch loops are needed to fully understand the energetics, especially for multibranch loops with an odd number of helixes. The FJC model predicts a novel favorable entropy associated with coaxial stacking of helixes (66). As a further test of the model, the experimental results for three- and four-way loops were fit to eq 24 modified to also include ΔG°37,FJC: The calculation for ΔG°37,FJC and the parameters for this model are listed in Supporting Information. aG′′′ can be neglected because it has a small value of 0.12 and high p-value of 0.90. All other terms, including cG′′′, have p-values less than 0.10. With available data, eq 27 does not improve the predictions. The average absolute error for predictions with eq 27 is 0.59 kcal/mol, while that for eq 24 is 0.54 kcal/mol. Further experiments, however, are required to completely test the FJC and other models for the energetics of multibranch loops.

Model for Enthalpy Changes for Initiation of Three-Way Multibranch Loops

Previously published data for enthalpy changes for multibranch loop initiation, ΔH°MBL init, have been fit to (44)Similar to eqs 24, 25, and 26 for predicting free energy for multibranch loop initiation, the data can also be fit to a “universal” model, a model for three-way multibranch loops, and a model for four-way multibranch loops, eqs 29, 30, and 31, respectively. The adjustment for the second dangling ends in the product hairpin was made to the available data (13,14) and to our measurements. The ΔH°MBL’s and ΔH°MBL init’s from our measurements are listed in Table 5. The calculations for all data are available in Supporting Information. Linear regressions were used to derive parameters for eqs 29, 30, and 31 from the adjusted data excluding systems omitted from the regression of multibranch loop initiation free energy and using ΔH° from 1/TM vs ln(CT/4) for all sequences, including G_CG_G/C_C. The parameters, the p-values of the F test for each parameter, the coefficient of determination for each model (R2), and the p-value of the F test for the models are listed in Table 6.
Table 5

Enthalpy Changes (kcal/mol) for Three-Way Multibranch Loops

systemunpairedaverage asymmetryΔH°MBLΔH°stackingΔH°MBL initΔH°MBL init predicted by eq 29bΔH°MBL init predicted − ΔH°MBL init
G_CG_G/C_C0036.36−14.8851.2433.03−18.21
G_CG_G/CaC10.6712.98−23.8836.8638.111.25
G_CG_G/Ca2C21.331.92−23.8825.821.29−4.51
G_CG_G/Ca3C32.00.15−23.8824.0326.372.34
G_CG_G/Ca4C42.67a−1.18−23.8822.724.501.80
G_CG_G/Ca5C53.33a−11.06−23.8812.8222.639.81
G_CG_G/CuC10.6711.79−22.3834.1738.113.94
G_CG_G/Cu2C21.338.42−22.3830.821.29−9.51
G_CG_G/Cu3C32.0−2.92−22.3819.4626.376.91
G_CG_G/Cu4C42.67a−8.05−22.3814.3324.5010.17
G_CG_G/Cu5C53.33a−3.59−22.3818.7922.633.84
CgaaaCGaG/GaugacC102.67a−10.01−17.57.4913.281.59
CgaaaCGaG/GagacC92.03.39−17.520.8915.15−9.94

The average asymmetry is set to 2.0 for predicting ΔHMBL init.

Predictions are by eq 29, with aH, bH, cH, dH, and ΔH°37,strain equal to 34.47 ± 13.70, −1.87 ± 0.92, −7.75 ± 3.48, 10.38 ± 2.73, and 21.81 ± 6.11 kcal/mol, respectively.

Table 6

Enthalpy Parameters for Multibranch Loop Initiation Models

modelparametervalue (kcal/mol)error (kcal/mol)p-value
eq 29 universal modelaH34.4713.700.0144
 bH−1.870.920.0466
 cH−7.753.480.0295
 dH10.382.730.0003
 ΔH°strain21.816.110.0007
  R2 = 0.4276, p-value = 2.595 × 10−7
eq 30 model for three-way MBLaH24.507.030.0010
 bH−3.561.100.0022
 dH6.313.140.0050
 ΔG°37,strain11.286.910.1089
  R2 = 0.384, p-value = 2.528 × 10−5
eq 31 model for four-way MBLaH′′−12.254.930.0263
 dH′′17.693.760.0003
  R2 = 0.6124, p-value = 0.0003394
The average asymmetry is set to 2.0 for predicting ΔHMBL init. Predictions are by eq 29, with aH, bH, cH, dH, and ΔH°37,strain equal to 34.47 ± 13.70, −1.87 ± 0.92, −7.75 ± 3.48, 10.38 ± 2.73, and 21.81 ± 6.11 kcal/mol, respectively.

The Loop E Motif May Not Stabilize a Multibranch Loop

The loop E motif widely exists in internal, hairpin, and multibranch loops in 5S, 16S, and 23S rRNA (3,67−71). It is also in catalytic RNAs, such as RNase P (72), group I introns (73), and group II introns (74). It can be involved in protein binding (75,76) and RNA−RNA interactions (77). An NMR structure reveals that there are three noncanonical base pairs in the loop E motif (Figure 2, bottom) (78). Its thermodynamic properties were measured in an internal loop (79) but not in a multibranch loop. The free energy for forming a duplex with loop E motif as internal loop, 5′GCGaguaGGC/3′CGCaagCCG, is −8.0 kcal/mol in 1 M Na+ buffer (79), which is 1.3 kcal/mol more stable than predicted by RNAstructure. The loop E motif (Figure 2, bottom) includes an A37/G6 sheared pair, an A7/U36 trans Hoogsteen pair, and an A7/A34 parallel pair. Moreover, G35 interacts with the phosphate between A6 and A7. U36 plays a critical role in the loop E motif because it forms a hydrogen bond with its own backbone and also interacts with G6 ribose in addition to forming the A7/U36 trans Hoogsteen pair. Deletion of U36 destabilizes the multibranch loop, CgaaaCGaG/CcaguaG, by 1.2 ± 0.5 kcal/mol (Tables 2 and 3). The CgaaaCGaG/CcaguaG multibranch loop with the loop E motif has a ΔG°37,MBL init of 7.25 ± 0.26 kcal/mol by UV melting, which is within experimental error of the 6.92 ± 0.99 kcal/mol predicted by eq 25. For system CgaaaCGaG/CcagaG, in which U36 was deleted from the loop E motif, the measured ΔG°37,MBL init of 8.44 ± 0.08 kcal/mol is 1.24 kcal/mol less favorable than predicted by eq 25 (Table 3). The comparison suggests that some sequences of natural multibranch loops have evolved to provide extra stability and conserved structures have conserved stability. Extra stability from a natural motif was suggested on the basis of the stability measured for another natural sequence (14). Evidently, much more must be learned about the sequence dependence of multibranch loop stability in order to provide reasonable approximations for structure prediction algorithms.

Entropy Changes for Multibranch Loop Initiation

The entropy change for multibranch loop initiation, ΔS°MBL init, can be calculated from ΔH°MBL init and ΔG°37,MBL init, becauseThe values of aS, bS, cS, dS, and ΔS°strain for the model similar to eqs 24, 25, and 26 are listed in Table 7. Essentially identical values are obtained if the entropy data is fit separately to an equation with the same terms as eqs 24, 25, and 26.
Table 7

Entropy Parameters for Multibranch Loop Initiation

modelparametervalue (eu)
universal model corresponding to eqs 24 and 29aS90.54
 bS−5.58
 cS−25.79
 dS30.08
 ΔS°strain61.65
model for 3-way MBLaS53.00
 bS−10.58
 dS17.67
 ΔS°strain30.11
model for 4-way MBLaS′′−60.39
 dS′′52.65

Conclusion

Optical melting and fluorescence competition assays were used to measure the thermodynamics of two series of three-way multibranch loop systems. The free energies measured by these two methods agree within experimental error. The results reveal that adding up to five unpaired adenosines or four unpaired uridines into a three-way multibranch loop can stabilize the loop. Adenosines and uridines stabilize the loop similarly, even though A and U have different stacking properties on the end of a duplex. Revised parameters for predicting the thermodynamics of multibranch loops are reported. This revised model predicts the free energy of a natural loop E motif and a mutant reasonably well. Measurement of common natural motifs could further improve our understanding of factors stabilizing multibranch loop stabilities and provide better prediction of RNA secondary structure.
  76 in total

1.  A quantitative method for evaluating the stabilities of nucleic acids.

Authors:  C A Gelfand; G E Plum; S Mielewczyk; D P Remeta; K J Breslauer
Journal:  Proc Natl Acad Sci U S A       Date:  1999-05-25       Impact factor: 11.205

2.  A novel method for finding tRNA genes.

Authors:  Vickie Tsui; Tom Macke; David A Case
Journal:  RNA       Date:  2003-05       Impact factor: 4.942

3.  Thermodynamics of DNA branching.

Authors:  M Lu; Q Guo; L A Marky; N C Seeman; N R Kallenbach
Journal:  J Mol Biol       Date:  1992-02-05       Impact factor: 5.469

4.  Predicting helical coaxial stacking in RNA multibranch loops.

Authors:  Rahul Tyagi; David H Mathews
Journal:  RNA       Date:  2007-05-16       Impact factor: 4.942

5.  Sequence dependence of stability for coaxial stacking of RNA helixes with Watson-Crick base paired interfaces.

Authors:  A E Walter; D H Turner
Journal:  Biochemistry       Date:  1994-10-25       Impact factor: 3.162

6.  A common motif organizes the structure of multi-helix loops in 16 S and 23 S ribosomal RNAs.

Authors:  N B Leontis; E Westhof
Journal:  J Mol Biol       Date:  1998-10-30       Impact factor: 5.469

Review 7.  Hairpin ribozyme: current status and future prospects.

Authors:  J M Burke
Journal:  Biochem Soc Trans       Date:  1996-08       Impact factor: 5.407

8.  Thermodynamics of coaxially stacked helixes with GA and CC mismatches.

Authors:  J Kim; A E Walter; D H Turner
Journal:  Biochemistry       Date:  1996-10-29       Impact factor: 3.162

9.  Two inequivalent folding isomers of the three-way DNA junction with unpaired bases: sequence-dependence of the folded conformation.

Authors:  J B Welch; F Walter; D M Lilley
Journal:  J Mol Biol       Date:  1995-08-25       Impact factor: 5.469

10.  mRNA secondary structures fold sequentially but exchange rapidly in vivo.

Authors:  Elisabeth M Mahen; Peter Y Watson; Joseph W Cottrell; Martha J Fedor
Journal:  PLoS Biol       Date:  2010-02-09       Impact factor: 8.029

View more
  15 in total

1.  A range of complex probabilistic models for RNA secondary structure prediction that includes the nearest-neighbor model and more.

Authors:  Elena Rivas; Raymond Lang; Sean R Eddy
Journal:  RNA       Date:  2011-12-22       Impact factor: 4.942

2.  Determining parameters for non-linear models of multi-loop free energy change.

Authors:  Max Ward; Hongying Sun; Amitava Datta; Michael Wise; David H Mathews
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  Blind tests of RNA nearest-neighbor energy prediction.

Authors:  Fang-Chieh Chou; Wipapat Kladwang; Kalli Kappel; Rhiju Das
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-08       Impact factor: 11.205

4.  A sensitivity analysis of RNA folding nearest neighbor parameters identifies a subset of free energy parameters with the greatest impact on RNA secondary structure prediction.

Authors:  Jeffrey Zuber; Hongying Sun; Xiaoju Zhang; Iain McFadyen; David H Mathews
Journal:  Nucleic Acids Res       Date:  2017-06-02       Impact factor: 16.971

Review 5.  Understanding the transcriptome through RNA structure.

Authors:  Yue Wan; Michael Kertesz; Robert C Spitale; Eran Segal; Howard Y Chang
Journal:  Nat Rev Genet       Date:  2011-08-18       Impact factor: 53.242

6.  Bridging the gap between in vitro and in vivo RNA folding.

Authors:  Kathleen A Leamy; Sarah M Assmann; David H Mathews; Philip C Bevilacqua
Journal:  Q Rev Biophys       Date:  2016-06-24       Impact factor: 5.318

7.  VfoldMCPX: predicting multistrand RNA complexes.

Authors:  Sicheng Zhang; Yi Cheng; Peixuan Guo; Shi-Jie Chen
Journal:  RNA       Date:  2022-01-20       Impact factor: 4.942

8.  Vfold2D-MC: A Physics-Based Hybrid Model for Predicting RNA Secondary Structure Folding.

Authors:  Yi Cheng; Sicheng Zhang; Xiaojun Xu; Shi-Jie Chen
Journal:  J Phys Chem B       Date:  2021-09-02       Impact factor: 2.991

9.  Thermodynamically stable RNA three-way junction for constructing multifunctional nanoparticles for delivery of therapeutics.

Authors:  Dan Shu; Yi Shu; Farzin Haque; Sherine Abdelmawla; Peixuan Guo
Journal:  Nat Nanotechnol       Date:  2011-09-11       Impact factor: 39.213

10.  The four ingredients of single-sequence RNA secondary structure prediction. A unifying perspective.

Authors:  Elena Rivas
Journal:  RNA Biol       Date:  2013-05-10       Impact factor: 4.652

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.