Thirty-five RNA duplexes containing single nucleotide bulge loops were optically melted and the thermodynamic parameters for each duplex determined. The bulge loops were of the group III variety, where the bulged nucleotide is either a AG/U or CU/G, leading to ambiguity to the exact position and identity of the bulge. All possible group III bulge loops with Watson-Crick nearest-neighbors were examined. The data were used to develop a model to predict the free energy of an RNA duplex containing a group III single nucleotide bulge loop. The destabilization of the duplex by the group III bulge could be modeled so that the bulge nucleotide leads to the formation of the Watson-Crick base pair rather than the wobble base pair. The destabilization of an RNA duplex caused by the insertion of a group III bulge is primarily dependent upon non-nearest-neighbor interactions and was shown to be dependent upon the stability of second least stable stem of the duplex. In-line structure probing of group III bulge loops embedded in a hairpin indicated that the bulged nucleotide is the one positioned further from the hairpin loop irrespective of whether the resulting stem formed a Watson-Crick or wobble base pair. Fourteen RNA hairpins containing group III bulge loops, either 3' or 5' of the hairpin loop, were optically melted and the thermodynamic parameters determined. The model developed to predict the influence of group III bulge loops on the stability of duplex formation was extended to predict the influence of bulge loops on hairpin stability.
Thirty-five RNA duplexes containing single nucleotide bulge loops were optically melted and the thermodynamic parameters for each duplex determined. The bulge loops were of the group III variety, where the bulged nucleotide is either a AG/U or CU/G, leading to ambiguity to the exact position and identity of the bulge. All possible group III bulge loops with Watson-Crick nearest-neighbors were examined. The data were used to develop a model to predict the free energy of an RNA duplex containing a group III single nucleotide bulge loop. The destabilization of the duplex by the group III bulge could be modeled so that the bulge nucleotide leads to the formation of the Watson-Crick base pair rather than the wobble base pair. The destabilization of an RNA duplex caused by the insertion of a group III bulge is primarily dependent upon non-nearest-neighbor interactions and was shown to be dependent upon the stability of second least stable stem of the duplex. In-line structure probing of group III bulge loops embedded in a hairpin indicated that the bulged nucleotide is the one positioned further from the hairpin loop irrespective of whether the resulting stem formed a Watson-Crick or wobble base pair. Fourteen RNA hairpins containing group III bulge loops, either 3' or 5' of the hairpin loop, were optically melted and the thermodynamic parameters determined. The model developed to predict the influence of group III bulge loops on the stability of duplex formation was extended to predict the influence of bulge loops on hairpin stability.
RNAs are intimately involved in a wide variety of biological activities. During the process of gene expression, they serve as informational molecules and part of the decoding machinery. RNAs can function as catalysts, either alone or in conjunction with protein (Altman 1990; Cech 1990; Pace and Brown 1995; Scott 2007). Ribosomal RNA has a direct involvement in protein biosynthesis (Noller et al. 1992; Nissen et al. 2000), and new classes of noncoding RNAs continue to be discovered. The functional diversity of RNA is often predicated on the folding of complex tertiary structures, giving rise to the native, functional fold (Brion and Westhof 1997; Tinoco and Bustamante 1999). Secondary structure, defined as the formation of complementary base pairs, generally forms on a faster timescale (Zarrinkar and Williamson 1994; Woodson 2000) and tends to be more stable than tertiary contacts (Crothers et al. 1974; Banerjee et al. 1993; Onoa et al. 2003). Since tertiary structure arises from preformed secondary structure, accurately determining the secondary structure of an RNA molecule would provide clues to its function as well as facilitate prediction of the RNA tertiary fold. In addition, links between secondary structure and three-dimensional folding are beginning to be explored (Bailor et al. 2010).For these reasons, methods are being developed to model RNA secondary structure (Zuker 1989; Michel and Westhof 1990; Gautheret et al. 1993; Mathews and Turner 2002; Tahi et al. 2002), and there is increased interest in predicting both the secondary and tertiary structure of RNA from sequence. The most commonly used programs for secondary structure prediction, Mfold and RNAstructure (Mathews et al. 1999), use thermodynamic parameters to predict the most stable (and suboptimal) secondary structural folds. The ability of these programs to predict secondary structure depends upon the availability of accurate models to provide the thermodynamic parameters for the various RNA secondary structural motifs. Current structure prediction algorithms using updated thermodynamic parameters correctly predict ∼73% of known base pairs (Mathews et al. 1999, 2004). Therefore, it follows that refinement of the input thermodynamic parameters for multiple secondary structure motifs could improve predictions of secondary structures.One secondary structure motif is the single nucleotide bulge. A single nucleotide bulge occurs when an unpaired nucleotide disrupts regions of contiguous base-pairing. These unpaired nucleotides can participate in a variety of biological events, including protein binding and tertiary structure formation (Harper and Logsdon 1991; Rounseville and Kumar 1992, Rounseville et al. 1996; Klasens et al. 1999). More recently, single nucleotide bulges have been implicated in reverse transcriptase–mediated RNA displacement synthesis as single nucleotide bulges enhance synthesis through stable secondary structures (Lanciault and Champoux 2005). Only a few studies have examined the influence of bulge loops on the thermodynamics of secondary structure formation (Longfellow et al. 1990; Blose et al. 2007; McCann et al. 2011; Lim et al. 2012).Previously, single nucleotide bulges were subdivided into groups according to the identity of the bulge and adjacent base pairs (Blose et al. 2007). Group I single nucleotide bulge loops were defined as bulge loops where the position of the bulge was unambiguous with a bulged nucleotide that is not identical to either of the neighboring nucleotides. Group II single nucleotide bulge loops were defined as bulge loops where the position of the bulge is ambiguous since the bulge nucleotide is identical to at least one of its neighboring nucleotides. A model was developed to predict the influence of group I and group II single nucleotide bulge loops on the thermodynamics of duplex formation (Blose et al. 2007; McCann et al. 2011). Group III bulge loops have additional sequence and structural complexities. For example, with the oligomer 5′-CGGCC/3′GCCGG, the underlined G on the bottom strand can form a base pair with either the underlined C or U. Depending upon which pair forms, either the C or the U would be the bulged nucleotide and the parent duplex would contain either a Watson–Crick or wobble base pair having different thermodynamic stabilities (Table 1).
TABLE 1.
Example of group III loop ambiguity
Example of group III loop ambiguityIn the following study, the complete set of group III single nucleotide bulge loops with Watson–Crick nearest-neighbors has been thermodynamically characterized to improve our ability to predict the stability of RNA duplexes with bulge loops. As observed for group I and II bulges, the free energy increment for the insertion of a group III bulge loop into a duplex was found to be primarily influenced by non-nearest-neighbor interactions (Blose et al. 2007; McCann et al. 2011). The stability of the stem adjacent to the bulge has a direct effect on duplex destabilization caused by the insertion of the bulge. This simple non-nearest-neighbor model was extended to predict the insertion of a group III bulge loop into a hairpin stem.
MATERIALS AND METHODS
RNA synthesis and purification
Most oligomers were synthesized on CPG solid supports (Applied Biosystems 392 DNA/RNA Synthesizer) utilizing phosphoramidites with the 2′ hydroxyl protected as the tert-butyl dimethylsilyl ether from Glen Research (Usman et al. 1987; Wincott et al. 1995). Oligomers underwent ammonia and fluoride deprotection, and the crude sample was purified using preparative tlc (n-propanol:ammonium hydroxide:water, 55:35:10) and Sep-Pak C18 (Waters) chromatography. Some oligomers were ordered from Dharmacon or Integrated DNA Technologies, and deprotection of the oligomers was carried out using the manufacturers’ instructions. The oligomers were then purified as described above. Sample purity was determined through analytical tlc or HPLC (C-8) and was >95%.
Melting curve and data analysis
For non-self-complimentary sequences, individual strand concentrations were calculated from high-temperature single-strand absorbance at 260 and 280 nm using nearest-neighbor extinction coefficients (Borer 1975; Richards 1975). Single strands were then annealed in a 1:1 molar ratio. Optical melting experiments were performed using a Beckman DU 640 spectrophotometer and high-performance temperature controller at 260 or 280 nm. Absorbance changes for oligomers in 1 M NaCl melt buffer (1 M NaCl, 0.01–0.02 M cacodylic acid, 0.5 mM EDTA at pH 7.0) were recorded as a function of temperature from 10–90°C at a rate of 1°C/min as described previously (Serra et al. 1994). The experiment was repeated at 10 varying sample concentrations to give at least a 50-fold concentration range (10 μM–1 mM) for each sample. Absorbance versus temperature profiles were fit to a two-state model with sloping base lines using a nonlinear least squares program (McDowell and Turner 1996). Thermodynamic parameters for duplex formation were obtained by two methods: (1) enthalpy and entropy changes from the fits of the individual melting curves were averaged, and (2) plots of the reciprocal melting temperature, TM−1, versus log (C/4) gave enthalpy and entropy changes (Borer et al. 1974):
Here, C is the total concentration of oligomer. Parameters derived from the two methods generally agreed within 15%, consistent with the two-state model (Freier et al. 1986; Allawi and SantaLucia 1997). The Gibbs free energy change at 37°C was calculated as
Determination of the contribution of bulge loops to duplex thermodynamics
The free energy of duplex formation can be approximated by the nearest-neighbor model (Xia et al. 1998). The free energy contribution of each bulged nucleotide was calculated from the experimental data and the nearest-neighbor model according to Equation 3, where ΔG037(measured) is the experimentally determined value from the melts and
ΔG037(duplex) is calculated from the nearest-neighbor model for the duplex forming the Watson–Crick base pair and not containing the bulge (Table 1). ΔH0 values are calculated in a similar fashion.
Phylogenetic analysis
A database of phylogenetically determined RNA secondary structures (Cannone et al. 2002) of 305 SSU rRNAs, 169 LSU rRNAs, 16 group I introns RNAs, and seven group II intron RNAs, was searched for group III single nucleotide bulge loops. The loops were characterized by nearest-neighbor base pairs and bulge identity.
In-line probing
Samples for in-line probing were labeled at the 5′ (Metkinen) or 3′(Glen Research) end during synthesis with fluorescein or purchased from IDT. The RNAs were purified as described above except that nonfluorescent tlc plates were used and the RNA detected by the fluorescent label. The RNA was incubated in 50 mM Tris (pH 8.3), 20 mM MgCl2, and 100 mM KCl for 2–4 d at room temperature in the dark (Soukup and Breaker 1999; Regulski and Breaker 2008). Control RNAs were incubated in probing buffer but incubated at 0°C. The cleavage products were analyzed by chromatography on nonfluorescent tlc plates (n-propanol:ammonium hydroxide:water, 65:25:10). The migration of the cleavage products were compared to the migration of mixtures of chemically synthesized oligomers of appropriate lengths. The fluorescent products were analyzed with a Kodak image station 400MM Pro.
Statistical analysis
Statistical analysis of the data was done using the statistical software available with GraphPad Prism and GraphPad Instat.
RESULTS
The thermodynamics of only two duplexes with group III bulge loops have previously been measured (Znosko et al. 2002). Therefore, to determine the role of the bulge identity and the nearest-neighbors on the stability of group III bulge loops, 33 additional (eight GA, eight AG, 10 CU, and seven UC bulges) duplexes containing a group III single nucleotide bulge loop were prepared and the thermodynamics of duplex formation measured by optical melting. These oligomers, when combined with the previously measured oligomers, give measurements for the complete set of group III single nucleotide bulge loops with Watson–Crick nearest-neighbors.Thermodynamic parameters for duplex formation by these oligonucleotides are listed in Table 2. The oligonucleotides are listed by bulge sequence and then in order of decreasing free energy for the respective duplex. Residues in bold are the potential bulge nucleotide. The average deviations in thermodynamic parameter values are 11.2%, 12.5%, and 2.0% for ΔH0, ΔS0, and ΔG037, respectively. These average deviations are larger than observed for the group I and group II sequences and may indicate additional structural variability and, therefore, slight non-two-state behavior for some of these oligomers.
TABLE 2.
Thermodynamic parameters for duplex formation
Thermodynamic parameters for duplex formationThe ambiguity in the position of the group III bulge loops means that there are two potential parental duplexes (Table 1) with different thermodynamic stability. To determine the free energy contribution of the bulged nucleotide, one of the two potential parental duplexes must be selected. We first predicted the stability of all of the potential parental duplexes for each of the sequences in Table 2. The predicted stability of the duplexes, which included the Watson–Crick base pair, was always more stable than the predicted value for the duplexes with the wobble base pair. The ambiguity of the group III bulge loops can lead to a variety of analysis after trying many combinations; the best agreement between experiment and model (see Discussion) was achieved using the Watson–Crick duplexes as the parental duplex and calculating the free energy contribution of each bulged nucleotide according to Equation 3. This analysis is presented in Table 3. As previously observed (Znosko et al. 2002; Blose et al. 2007; McCann et al. 2011), all bulges destabilize the duplex. The extent of destabilization ranges between 0.5 and 4.9 kcal/mol. The group III pyrimidine single bulges are slightly less destabilizing than the group III purine bulges by, on average, 0.1 kcal/mol. The destabilization caused by the introduction of a single group I pyrimidine bulge into a duplex is 3.7 ± 1.0 kcal/mol and for a purine residue is 3.6 ± 1.0 kcal/mol. Since these values are not statistically different, they can be combined to provide a simple model to predict the stability of a duplex containing a single group III bulge nucleotide (purine or pyrimidine), which is just the average of all of the measured values: 3.6 kcal/mol.
TABLE 3.
Thermodynamic parameters and natural occurrence for group III bulges
Thermodynamic parameters and natural occurrence for group III bulgesIn-line probing takes advantage of the natural instability of RNA to detect single-stranded regions in RNA. Structural flexibility of single-stranded regions allows the RNA to examine additional structural conformations, some of which produce an in-line arrangement of the 2′ alkoxide and the 5′-O of the phosphodiester, leading to a favorable transition state and cleavage of the RNA strand at the site of flexibility. In-line probing was used to resolve the ambiguity of the group III bulge nucleotide. Because of the small size of our RNA oligomers, analytical tlc was used to resolve the products of in-line cleavage. Figure 1 displays the results of one such experiment. The two hairpins in this example have the group III bulge loop on the 3′ side of the hairpin loop, and they differ by the orientation of the bulge loop, either 5′-GA or 5′-AG. In both cases, the control reactions show no cleavage products. We previously have shown that cleavage occurs to the 3′ side of the bulged nucleotide (McCann et al. 2011); therefore, we would expect formation of either a dimer or trimer depending upon which nucleotide (A or G) was bulged. In both cases, the predominant cleavage produces the dimer product, indicative of the fact that the nucleotide further from the hairpin loop is the bulged nucleotide irrespective of whether a Watson–Crick or wobble base pair forms. Table 4 summarizes the result of in-line cleavage analysis for seven pairs of hairpins with embedded group III bulge loops; in all cases, the major cleavage site occurs after the nucleotide further from the hairpin loop. These results are consistent whether the bulge loop is on the 5′ or 3′ side of the hairpin loop.
FIGURE 1.
tlc analysis of in-line probing of group III bulge loops. (Lanes 1,4) Size markers: monomer, dimer, and trimer, each are labeled on the 3′ end with fluorescein. (Lane 2) Cleavage of group III hairpin sequence given in lower left of figure. (Lane 3) Control lane, hairpin incubated at 0°C in probing buffer. (Lane 5) Cleavage of group III hairpin, sequence given in lower right of figure. (Lane 6) Control lane, hairpin incubated at 0°C in probing buffer. Hairpin figures indicate position of bulge and cleavage site.
TABLE 4.
Summary of in-line cleavage results
tlc analysis of in-line probing of group III bulge loops. (Lanes 1,4) Size markers: monomer, dimer, and trimer, each are labeled on the 3′ end with fluorescein. (Lane 2) Cleavage of group III hairpin sequence given in lower left of figure. (Lane 3) Control lane, hairpin incubated at 0°C in probing buffer. (Lane 5) Cleavage of group III hairpin, sequence given in lower right of figure. (Lane 6) Control lane, hairpin incubated at 0°C in probing buffer. Hairpin figures indicate position of bulge and cleavage site.Summary of in-line cleavage resultsThe thermodynamics for hairpin formation for the hairpins that were analyzed by in-line probing are presented in Table 5, along with the thermodynamics for the corresponding parental hairpins. As with group III bulge loops inserted into duplexes, the insertion of the bulge loop into a hairpin stem destabilizes hairpin formation. The influence of the stability of the hairpin was determined by subtraction of the stability of the hairpin with or without the bulge loop:
With group III bulge loops, there are two potential parental stems, one with the Watson–Crick base pair and the other with the wobble base pair. The bulge loop values are presented in Table 6. The influence of the group III bulges ranges from 1.3–3.0 kcal/mol at 37°C for the Watson–Crick parental hairpins and 0.3–2.7 kcal/mol at 37°C for the wobble parental hairpins. The free energy increment for inserting a group III bulge loop does not appear to depend upon whether the bulge is on the 5′ or 3′ side of the hairpin loop. Nor does it appear to depend upon whether the bulge sequence is 5′-AG or 5′-GA.
TABLE 5.
Thermodynamic parameters for hairpin formation in 1 M NaCl
TABLE 6.
Free energy increment for bulge loops
Thermodynamic parameters for hairpin formation in 1 M NaClFree energy increment for bulge loopsThe enthalpy nearest-neighbor parameters for single nucleotide bulge loops are also presented in Table 3 for duplex and Table 5 for hairpin formation. In nearly all cases, the introduction of a bulge decreases the enthalpy of hairpin formation. The average enthalpy contribution for group III single nucleotide bulge loops is 15.2 kcal/mol for insertion of a bulge into a duplex and 13.6 kcal/mol for insertion of a bulge into a hairpin stem.
DISCUSSION
Non-nearest-neighbor influences on the thermodynamics for group III bulge loop on duplex formation
The main determinant of the thermodynamic influence of group I and II bulge loops on duplex formation was shown to be due to non-nearest-neighbor influences (Blose et al. 2007; McCann et al. 2011). Specifically, the stability of the stem affects the extent to which insertion of the bulge loop destabilized the duplex. To determine if group III bulge loops are also influenced by the stability of the stems adjacent to the bulge, we examined how the stability of the adjacent stem influences the thermodynamics of group III bulge loops insertion into a duplex. The conformational ambiguity of the group III bulge loops allows for two possible sets of duplexes (Table 1) and, therefore, four possible stems. In addition, the influence of the bulge depends upon which of the potential parental strands is selected (Table 1). Many combinations were tested, most of which gave a poor correlation between the influence of the bulge and the adjacent stem. The best correlation was observed when we graphed the free energy of the bulge determined using the Watson–Crick parental strand and the second least stable stem of the four potential stem (Table 1). Note that sometimes the second least stable stem included a wobble base pair. The results of this analysis are presented in Figure 2. The line for the group III data is not significantly different than the group I and group II data. The data from the group I, II, and III bulge loops can therefore be combined to generate a relationship to predict the thermodynamic influence of all bulge loops. That relationship is given in the equation below:
where ΔG037 less stable stem is the less stable stem for group I bulge loops and the second least stable stem for group II and group III bulge loops. While this represents a simple model to predict the influence of bulge loops on the stability of duplex formation, because of the non-nearest-neighbor influences, it cannot be incorporated directly into the current RNA secondary structure prediction programs.
FIGURE 2.
Plot of free energy change for group III bulge loop formation, ΔGo37bulge, versus free energy of second least stable duplex stem, as described in text. Group III bulge indicated by black circle. Lines are the least-squares fit of the data points (group I and II, gray solid line [Blose et al. 2007]; black group III, black dotted line; all, black solid line).
Plot of free energy change for group III bulge loop formation, ΔGo37bulge, versus free energy of second least stable duplex stem, as described in text. Group III bulge indicated by black circle. Lines are the least-squares fit of the data points (group I and II, gray solid line [Blose et al. 2007]; black group III, black dotted line; all, black solid line).
Example calculations
Using the example duplex given in Table 1, the measured free energy for the duplex is 6.4 kcal/mol at 37°C (Table 2). The stability of the parent duplex forming the Watson–Crick base pair has a predicted stability using the nearest- neighbor model of 10.4 kcal/mol at 37°C. The free energy increment for the insertion of the bulge is then taken as the difference between the measured and predicted values or 4.0 kcal/mol. This represents the measured value presented in Table 3. The predicted value for the free energy increment for insertion of the group III bulge loops on duplex formation is given by Equation 5. For this example, the second least stable stem of the four potential stems is 5′-GUCG/3′-CAGU which has a predicted stability of −5.8 kcal/mol at 37°C. Using the −5.8 kcal/mol value in Equation 5 predicts that the free energy increment for insertion of the group III bulge should be 3.8 kcal/mol at 37°C, a difference of 0.2 kcal/mol from the measured value. Similar calculations were done for all of the group III bulge loops in Table 3. The average difference between the measured and predicted values for the group III bulge loops was 0.41 kcal/mol. Twenty-two of the 36 measured group III bulge loops had predicted values within 0.5 kcal/mol of the measured values, and only two had predicted values that were >1 kcal/mol. The two were 5′-CAGUCGAGC/3′-GUCAGUCG and 5′-ACUGCAUCG/3′-UGACGUGC; in both instances, the bulge loop is near the duplex terminus. In both cases, the predicted value is less than the measured free energy increment. It is possible that with these instances, the duplexes are forming structure other than the bulge loop; the entire end of the duplex may be frayed or the ends may form alternate non-Watson–Crick base pairs. There are four other examples where the bulge is located near the terminus of the duplex and the free energy increments are predicted well; therefore, it is not obvious why the two examples above are predicted less well. Previous models to predict the free energy increment have used a constant value or 3.8 kcal/mol for the free energy increment for the insertion of a bulge loop into a duplex (Mathews et al. 1999). Using this model, the average difference between the measured and predicted values is 0.78 kcal/mol. Thus the model presented in Equation 5 improves the prediction of the free energy increment for insertion of a group III bulge loops into a duplex by, on average, 0.3 kcal/mol over the previous model.
Structural determination of the group III bulged nucleotide using in-line probing
The results of the in-line cleavage of the hairpins listed in Table 4 show that the major fluorescent product was the fragment expected if the bulged nucleotide was the one further from the hairpin loop, irrespective of whether the resulting hairpin forms a Watson–Crick or wobble base pair. These results are in agreement with those obtained for group II bulge loops, where it was observed that the bulge nucleotide was also the one further from the hairpin loop (McCann et al. 2011). Thus it appears that structural constraints are more critical to determining the bulged nucleotide than the stability of the corresponding stem. No cleavage was detected due to fraying of the ends of the hairpin stem. This may be due to the fact that while a bulge confers conformational flexibility on a duplex, the preferred conformation is favorable for in-line cleavage (Portmann et al. 1996).
Thermodynamics of group III bulge loops embedded within a hairpin stem
Table 5 lists the thermodynamics for hairpin formation of the 14 hairpins examined by in-line probing. Also included in Table 5 are the thermodynamics of hairpin formation of the parent hairpins, including both the Watson–Crick and wobble base paired stems. Some of the hairpins in Table 5 have a fluorescein tag at either the 5′ or 3′ end. We have previously shown that the fluorescein tag does not affect the thermodynamics of hairpin formation (Lim et al. 2012), so we will not consider it further in our analysis. Again, there is a multitude of ways to analyze the influence of a group III bulge on the thermodynamics of hairpin formation due to the ambiguity of the bulged nucleotide. The influence of the bulge loop on the thermodynamics of hairpin formation was determined using Equation 4 for both the Watson–Crick and wobble parental hairpins.Table 6 displays the results of this analysis. The in-line probing results show that the bulged nucleotide was the one further from the hairpin loop, so we selected this nucleotide as the bulge, irrespective of whether or not this led to the formation of a wobble base pair. For group II bulge loops imbedded in a hairpin stem, the stability of the hairpin stem distal from the hairpin loop gave the best prediction of the influence of the bulge on the thermodynamics of hairpin formation (Lim et al. 2012). Therefore, we chose a similar model, using the stability of the distal stem and Equation 5, to predict the influence of the group III bulge loops on hairpin stability. The predicted values for the insertion of a group III bulge loop into a hairpin stem also are listed in Table 6. The best agreement between the measured and predicted values is obtained using the measured values for the parental hairpin with the Watson–Crick base pair rather than the wobble parental strand and selecting the parental hairpin based upon the in-line probing results (nucleotide further from the hairpin loop was selected as the bulged base).For example, using the two hairpins shown in Figure 1, CGUCGAAAGCAGCG and CGUCGAAAGCGACG (where underlined letters represent the hairpin loop, and bold letters are the potential bulge nucleotides), the parental hairpin in both cases would be modeled as CGUCgaaaGCACG (irrespective of the in-line probing results). The stability of this hairpin is −3.2 kcal/mol at 37°C (Table 5). The measured values for the stability of the two hairpins containing the group III bulge are −0.8 and −1.5 kcal/mol at 37°C, respectively (Table 5). The measured values for the insertion of the bulge loops would be 2.4 and 1.7 kcal/mol at 37°C for the first and second hairpin, respectively (Table 6). The distal stem for both hairpins would be 5′-CG/3′-GC; therefore, the predicted value for the insertion of the bulge loop into the hairpin stem would be 2.1 kcal/mol, using Equation 5 and the stability of 5′-CG/3′-GC nearest-neighbors (−2.36 kcal/mol) (Table 6).Using this model, there is good agreement between the measured and predicted values with an average difference of 0.4 kcal/mol. Therefore, the model developed for the influence of bulge loop insertion into a duplex motif can be extended to predict the influence of the insertion of group III single nucleotide bulge loops into the stem of a hairpin, if the stability of the parental hairpin is chosen with the stem forming the Watson–Crick base pair and the ΔG0 (stem) is calculated for the distal stem.
Enthalpic contributions of group III single nucleotide bulge loops on duplex and hairpin formation
Since the free energy increment for the insertion of a group III bulge loops was dependent upon the stability of the stem adjacent to the bulge, we investigated the influence of stem stability on the enthalpy of bulge insertion. As previously observed for the group I and II bulge loops, the least-squares fit of data did not have a slope significantly different from zero (data not shown), suggesting that the average enthalpy value (15.2 kcal/mol for duplex insertion and 13.8 kcal/mol for hairpin) is a reasonable approximation for the enthalpic energy contribution for insertion of a group III bulge loop. The enthalpic contributions due to a group III bulge loops are not statistically different than the value obtained for group I or group II bulge loops (16.5 kcal/mol), so all of the enthalpic values for group I, II, and III bulge loops can be combined for an average value of 16.0 kcal/mol. The enthalpic values can be used in conjunction with the free energy parameters to determine the stability of RNA structures at temperatures other than 37°C (Lu et al. 2006).
Phylogenetic analysis of group III single nucleotide bulge loop
The database examined in this study contained 249 group III single nucleotide bulge loops. Group III bulge loops are less numerous than the group I (3520) or group II (768) bulge loops. The frequencies of occurrence of the group III single nucleotide bulge loops are listed in Table 3. Over one-third of the group III bulge loops have the sequence (CGAC/GUG). This bulge includes the highly conserved A at position 397 (Escherichia coli numbering) of the small ribosomal RNA. Since the stability of group III single nucleotide bulge loops is independent of the identity of the bulge, there is no correlation between the thermodynamic contribution of the bulge and its frequency of occurrence. Therefore, the section of naturally occurring bulge nucleotides must be related to factors other than stability.The distribution of bulge loops embedded within a hairpin stem was then examined. We selected for bulge loops embedded in the stems of hairpin loops from 3–8 nt in length and 1–6 bp from the hairpin loop. A total of 212 (or 88%) group III bulge loops match these criteria. Since >70% of the ribosomal RNAs are found in hairpin structures, it is not surprising that a high proportion of the group III bulge loops are present in hairpin stems. Of the 212 group III bulge loops, 163 of them are either AG or GA (data not shown). The bulge loops were then characterized as having the bulged base either nearer or further from the hairpin loop. The majority, 120 of the 212, had the nucleotide further from the hairpin loop as the bulged nucleotide. Almost 90% of the phylogenetically determined group III bulge loops formed Watson–Crick base pairs; only 24 of the bulge loops formed wobble base pairs. These results are interesting given that the in-line probing results suggest a marked preference for the group III bulged nucleotide to be the nucleotide further from the hairpin loop irrespective of whether the resulting hairpin formed the Watson–Crick or Wobble base pair. Factors beyond thermodynamics are clearly important in bulge loop selection and structure in native structures.
Naturally occurring group III bulge loops
An interesting bulge loop in the context of a hairpin stem is position 2402 of the large ribosomal subunit. The secondary and tertiary structures of the bulge loop are shown in Figure 3 (Schuwirth et al. 2005; Selmer et al. 2006). In Thermus thermophilus, the bulged nucleotide, a cytosine residue, is 2 bp from the hairpin loop. The analogous position in E. coli is a uracil residue, which is also located 2 bp from the hairpin loop. In both cases, the bulge loops are of the group IV type, having sequence characteristics of both group II and III bulges. That is, there is ambiguity as the bulged nucleotide could be represented by any of the 4 nt on the 5′ side of the hairpin loop. Both bulge loops are positioned in an extrahelical orientation and are pointing into an “open” pocket where there is no discernible interaction with other components of the ribosome. However, they point in opposite directions with the bulged nucleotide in T. thermophilus pointing away from the hairpin loop, while the E. coli bulge is located alongside the hairpin loop. It is possible that, in fact, the bulge loops are interacting with some unresolved structural component of the ribosome that leads to the specificity of the bulged nucleotide identity and position. Alternately, the bulged nucleotide may by directed by the wobble base pair next to the bulged position in T. thermophilus rRNA.
FIGURE 3.
Secondary and tertiary structure of naturally occurring bulge loops. Position 2402 in the large ribosomal subunit RNA, (left) T. thermophilus, (center) E. coli, and (right) position 70 in E. coli RNase P RNA. Boxed nucleotides in the secondary structure diagrams are shown in the three-dimensional presentations. The bulged nucleotide is shown in CPK color and the nearest-neighbor pairs in red (same strand as the bulge) and blue (opposite strand). Extracted from PDB files 2AW4, 2J03, and 1F7F.
Secondary and tertiary structure of naturally occurring bulge loops. Position 2402 in the large ribosomal subunit RNA, (left) T. thermophilus, (center) E. coli, and (right) position 70 in E. coli RNase P RNA. Boxed nucleotides in the secondary structure diagrams are shown in the three-dimensional presentations. The bulged nucleotide is shown in CPK color and the nearest-neighbor pairs in red (same strand as the bulge) and blue (opposite strand). Extracted from PDB files 2AW4, 2J03, and 1F7F.A second example of a naturally occurring group III bulge loop is position 70 in the RNase P RNA of E. coli. In the native RNase P, the bulge is localized in a duplex. Schmitz and Tinoco (2000) has examined the structure of the bulged loop in the context of a hairpin stem as shown in Figure 3. The bulged nucleotide is a uracil residue and represents the base located further from the hairpin loop. In this context, the bulge is positioned away from the hairpin loop. Interestingly, if the cytosine residue of the bulge is mutated to uracil, producing a group II bulge, the bulge undergoes a rearrangement, and now the base closer to the hairpin loop becomes the bulged nucleotide.These examples emphasize the structural flexibility imparted into the RNA when a bulge is introduced. Further investigation will be needed to understand the forces responsible for the selection of the correct bulged nucleotide and its orientation in a complex structural motif.
Authors: Claire V Crowther; Laura E Jones; Jessica N Morelli; Eric M Mastrogiacomo; Claire Porterfield; Jessica L Kent; Martin J Serra Journal: RNA Date: 2016-11-21 Impact factor: 4.942
Authors: Elzbieta Kierzek; Xiaoju Zhang; Richard M Watson; Scott D Kennedy; Marta Szabat; Ryszard Kierzek; David H Mathews Journal: Nat Commun Date: 2022-03-11 Impact factor: 17.694