Thermodynamic preferences to form non-native conformations are crucial for understanding how nucleic acids fold and function. However, they are difficult to measure experimentally because this requires accurately determining the population of minor low-abundance (<10%) conformations in a sea of other conformations. Here, we show that melting experiments enable facile measurements of thermodynamic preferences to adopt nonnative conformations in DNA and RNA. The key to this "delta-melt" approach is to use chemical modifications to render specific minor non-native conformations the major state. The validity and robustness of delta-melt is established for four different non-native conformations under various physiological conditions and sequence contexts through independent measurements of thermodynamic preferences using NMR. Delta-melt is faster relative to NMR, simple, and cost-effective and enables thermodynamic preferences to be measured for exceptionally low-populated conformations. Using delta-melt, we obtained rare insights into conformational cooperativity, obtaining evidence for significant cooperativity (1.0 to 2.5 kcal/mol) when simultaneously forming two adjacent Hoogsteen base pairs. We also measured the thermodynamic preferences to form G-C+ and A-T Hoogsteen and A-T base open states for nearly all 16 trinucleotide sequence contexts and found distinct sequence-specific variations on the order of 2 to 3 kcal/mol. This rich landscape of sequence-specific non-native minor conformations in the DNA double helix may help shape the sequence specificity of DNA biochemistry. Thus, melting experiments can now be used to access thermodynamic information regarding regions of the free energy landscape of biomolecules beyond the native folded and unfolded conformations.
Thermodynamic preferences to form non-native conformations are crucial for understanding how nucleic acids fold and function. However, they are difficult to measure experimentally because this requires accurately determining the population of minor low-abundance (<10%) conformations in a sea of other conformations. Here, we show that melting experiments enable facile measurements of thermodynamic preferences to adopt nonnative conformations in DNA and RNA. The key to this "delta-melt" approach is to use chemical modifications to render specific minor non-native conformations the major state. The validity and robustness of delta-melt is established for four different non-native conformations under various physiological conditions and sequence contexts through independent measurements of thermodynamic preferences using NMR. Delta-melt is faster relative to NMR, simple, and cost-effective and enables thermodynamic preferences to be measured for exceptionally low-populated conformations. Using delta-melt, we obtained rare insights into conformational cooperativity, obtaining evidence for significant cooperativity (1.0 to 2.5 kcal/mol) when simultaneously forming two adjacent Hoogsteen base pairs. We also measured the thermodynamic preferences to form G-C+ and A-T Hoogsteen and A-T base open states for nearly all 16 trinucleotide sequence contexts and found distinct sequence-specific variations on the order of 2 to 3 kcal/mol. This rich landscape of sequence-specific non-native minor conformations in the DNA double helix may help shape the sequence specificity of DNA biochemistry. Thus, melting experiments can now be used to access thermodynamic information regarding regions of the free energy landscape of biomolecules beyond the native folded and unfolded conformations.
Biomolecules do not fold into a single structure but rather form dynamic ensembles of many interconverting conformations (1–3). Low-populated non-native conformational states in the dynamic ensemble can become the major state when a molecule binds a partner molecule and forms a complex (Fig. 1 ). In these cases, energetically favorable intermolecular contacts must cover the energetic cost, or conformational penalty , associated with stabilizing the minor non-native conformation relative to the major native state (Fig. 1) (4–6). Therefore, the thermodynamic preferences to form non-native conformations can be essential determinants of binding affinity and specificity and catalytic efficiency.
Fig. 1.
Thermodynamic preferences to form minor non-native conformations in DNA can influence binding. (A) The p53-bound conformation of a DNA duplex with Hoogsteen bps (in orange) exists as a minor conformation with population pi in the unbound DNA ensemble. The free energy cost to stabilize the minor conformation on binding of p53 depends on pi and is given by , in which R is the universal gas constant (units of kilocalories per mole per degrees Kelvin, and T is the temperature in Kelvin). (B) Binding of p53 to duplex DNA (Protein Data Bank ID: 3KZ8) (9) results in a change in the conformation of bps from Watson–Crick (purple) in the free DNA to Hoogsteen (orange) in the DNA–p53 complex. The free DNA structure corresponds to an idealized B-form DNA double helix generated using 3DNA (93). (C) Schematic representation of the base open state of A-T bps, which is implicated in DNA–protein recognition and damage repair.
Thermodynamic preferences to form minor non-native conformations in DNA can influence binding. (A) The p53-bound conformation of a DNA duplex with Hoogsteen bps (in orange) exists as a minor conformation with population pi in the unbound DNA ensemble. The free energy cost to stabilize the minor conformation on binding of p53 depends on pi and is given by , in which R is the universal gas constant (units of kilocalories per mole per degrees Kelvin, and T is the temperature in Kelvin). (B) Binding of p53 to duplex DNA (Protein Data Bank ID: 3KZ8) (9) results in a change in the conformation of bps from Watson–Crick (purple) in the free DNA to Hoogsteen (orange) in the DNA–p53 complex. The free DNA structure corresponds to an idealized B-form DNA double helix generated using 3DNA (93). (C) Schematic representation of the base open state of A-T bps, which is implicated in DNA–protein recognition and damage repair.For example, in the DNA double helix, Watson–Crick G-C and A-T base pairs (bps) exist in dynamic equilibrium with a minor Hoogsteen conformation, with populations on the order of 0.1 to 1.0% (7, 8), corresponding to an energetic cost of = 2.7 to 4.0 kcal/mol. However, the Hoogsteen bp can become the major conformation when duplex DNA is bound by proteins such as p53 (9) (Fig. 1). Here, energetically favorable intermolecular contacts make up for the conformational penalty to convert the bp from Watson–Crick to the minor Hoogsteen conformation. Indeed, DNA modifications that increase the thermodynamic propensities to form Hoogsteen bps have been shown to improve p53-DNA binding affinity (4, 10). Changes in the structures of RNA and DNA are ubiquitously observed upon binding to proteins and other partners (11, 12). In all cases, the binding energetics will have to make up for the relevant conformational penalties.Conformational preferences, and therefore conformational penalties, can vary depending on sequence and structural contexts, due to epigenetic and post-transcriptional modifications as well as damage, and due to changes in physiological conditions such as pH, temperature, salt composition, and concentration. This can, in turn, help shape the specificities of biochemical processes.For example, bps in DNA can spontaneously open (Fig. 1) (13, 14) and when bound to damage repair proteins can adopt extrahelical conformations (15). The thermodynamic preferences to form the base open and extrahelical conformations have been shown to vary with base modifications and sequence context, in turn determining the binding affinity (15) and catalytic efficiency (16) of glycosylase enzymes, which excise damaged bases from DNA, as well as the processivity of helicases, which unwind DNA (17). DNA bps near mismatches have been shown to have increased propensities to form Hoogsteen bps, and Hoogsteen bps are often observed in non-canonical regions of DNA in crystal structures of DNA–protein complexes (18, 19). In addition, wobble G•T/U mismatches can form Watson–Crick-like conformations through tautomerization and ionization of the bases (20, 21). The preferences to form Watson–Crick-like conformations can vary with pH and chemical modifications, in turn shaping the probability of misincorporating G•T bps during DNA replication (20) and providing a basis for recoding translation (22–24).In RNA, sequence-specific preferences to adopt alternative conformations determine the energetics of RNA–RNA tertiary assembly (25, 26) and RNA–protein binding affinities (27). In addition, pH- and temperature-dependent preferences to adopt alternative secondary structures can help control messenger RNA (mRNA) translation (28, 29) and the efficiency of enzymatic processing of micro RNAs (30). By increasing the thermodynamic preferences to form single-stranded RNA, the epitranscriptomic modification N6-methyladenosine (31) has been shown to improve the binding of single-stranded RNA binding proteins to modified RNAs (32). Aberrant modulation of RNA conformational preferences due to mutations have also been linked to disease (reviewed in ref. 5).Measuring conformational preferences in DNA and RNA and their variation as a function of sequence, structural and chemical contexts, and physiological conditions remains a significant challenge in biophysics and structural biology. This is because measuring the energetic cost to form non-native conformations requires accurately determining their low abundance in a sea of other conformations in the apo-ensemble. A variety of approaches have been used to measure such conformational equilibria, including NMR (33), electron paramagnetic resonance (34) and infrared spectroscopy (35), fluorescence resonance energy transfer (36), cryogenic electron microscopy and X-ray crystallography (37), and chemical probing (38). However, with the exception of NMR, most of these techniques are not applicable to conformations such as Hoogsteen bps and the base open states, which have populations <1%, are short-lived with lifetimes shorter than a few milliseconds, and which involve localized bp rearrangements. Most of these approaches are also technically demanding, often requiring synthesis of molecules with specialized labels, with NMR in particular requiring large sample quantities. Thus, they do not lend themselves to comprehensively exploring the conformational preferences of nucleic acids. As a result, the thermodynamic preferences to form Hoogsteen or base open states have yet to be measured for all 16 trinucleotide sequence contexts. Therefore, there is a need for alternative approaches that can more readily measure conformational preferences in nucleic acids as well as proteins.Prior NMR studies have shown that chemical modifications and mutations can render minor non-native conformations (Fig. 2) the major state (population >90%) in proteins (39) and nucleic acids (40), thereby enabling their in-depth structural as well as functional characterization (41). Here, we show that by combining such chemical modifications (Fig. 2), with melting experiments that measure folding energetics, it is feasible to measure the thermodynamic preferences to form minor non-native conformations in RNA and DNA. We used ultraviolet (UV) spectroscopy to measure melting energetics (42, 43) (Fig. 2) given its simplicity, lack of requirement for specific labeling, and reduced sample requirements relative to other approaches (44–46).
Fig. 2.
Conceptual underpinnings of delta-melt for measuring thermodynamic preferences to adopt non-native conformational states in nucleic acids. (A) Dynamic equilibrium in the unmodified WT nucleic acid molecule involving a major and the ith minor non-native conformation [i(WT)] involving a change in the conformation of a bp (in orange). (B) Chemical modifications (red, Mod) are introduced to bias the conformational ensemble toward the ith non-native conformational state i(WT) of the WT nucleic acid. (C) Schematic diagram showing optical melting experiments on unmodified WT (black) and modified Mod (red) nucleic acids. (D) Free energy diagram of delta-melt (Left), relationship between the free energies in delta-melt (Bottom), and expected correlation between and (Right).
Conceptual underpinnings of delta-melt for measuring thermodynamic preferences to adopt non-native conformational states in nucleic acids. (A) Dynamic equilibrium in the unmodified WT nucleic acid molecule involving a major and the ith minor non-native conformation [i(WT)] involving a change in the conformation of a bp (in orange). (B) Chemical modifications (red, Mod) are introduced to bias the conformational ensemble toward the ith non-native conformational state i(WT) of the WT nucleic acid. (C) Schematic diagram showing optical melting experiments on unmodified WT (black) and modified Mod (red) nucleic acids. (D) Free energy diagram of delta-melt (Left), relationship between the free energies in delta-melt (Bottom), and expected correlation between and (Right).The approach termed “delta-melt” builds on prior studies employing differences in melting energetics to estimate the contribution of motifs (44, 47) or specific functional groups (48, 49) to nucleic acid stability. Such studies provided the basis for nearest-neighbor rules and secondary structure prediction algorithms (50, 51). However, in contrast to these prior studies, delta-melt measures the difference in melting energetics between an unmodified (wild-type, WT) molecule and a molecule chemically modified (Mod) to shift the ensemble toward the desired non-native state (Fig. 2 ). Converting differences in melting energetics into differences between the thermodynamic stability of the major and minor conformational states requires calibration against independent measurements of conformational preferences using other biophysical techniques. In this manner, delta-melt enables measurement of thermodynamic preferences using a pair of melting measurements in a facile manner, with fewer sample requirements relative to alternative biophysical techniques such as NMR. In this study, we extensively verified the robustness of delta-melt for four different non-native conformational states under a variety of physiological conditions and sequence contexts through independent measurements of thermodynamic preferences using NMR.We obtained rare insights into conformational cooperativity using delta-melt, estimating that tandem Hoogsteen bps form cooperatively by 1.0 to 2.5 kcal/mol in DNA. We also measured the thermodynamic preferences to form G-C+ and A-T Hoogsteen and A-T base open states for nearly all 16 trinucleotide sequence contexts and found distinct sequence-specific variations on the order of 2 to 3 kcal/mol, corresponding to ∼10- to 100-fold variation in the population of the minor conformational state. These results suggest that the DNA double helix codes for a rich layer of sequence-specific conformational preferences, which could in turn tune the specificity of DNA biochemical transactions. Thus, melting experiments can be used to access thermodynamic information regarding regions of the free energy landscape of biomolecules beyond the native folded and unfolded conformations, opening the door to more systematic explorations of how the ensemble varies with sequence and structural contexts, chemical modifications, and physiological conditions.
Results
Conceptual Underpinnings of Delta-Melt.
The conceptual underpinnings of delta-melt can be understood by using a free energy diagram (Fig. 2). The desired thermodynamic preference, , in the WT molecule to form the ith non-native conformational state is given by the free energy difference between the ith non-native [i(WT)] and major [Major(WT)] conformations (Fig. 2),In principle, could be obtained from the difference in the melting energetics () between the Major(WT) and i(WT) conformational states in the WT molecule:In practice however, because the minor i(WT) conformation is lowly populated, its melting energetics are difficult to measure experimentally (52, 53). To address this limitation, we introduce a chemical modification that renders i(WT) the dominant conformation in the modified (Mod) molecule, typically by destabilizing the major state relative to the minor state (Fig. 2). The degree to which the modification will affect the melting energetics relative to the WT molecule will depend on the desired thermodynamic preference to form the minor conformation and also on how the modification affects the energetics of the minor conformation and single-strand (ss) (Fig. 2). Therefore, the desired can be obtained from the difference between the melting energetics of WT and Mod molecules, , plus an offset, which takes into account the difference in free energy to modify ss relative to i(WT) (Fig. 2):Thus, can be estimated using a pair of melting experiments, provided that the offset is known. The value of can be determined experimentally provided that it does not vary across different sequence contexts and/or environmental conditions (changes in temperature, pH, and salt) of interest, by calibrating data against values independently measured via biophysical techniques such as NMR for a few experimental conditions and/or sequence contexts (Fig. 2). can be obtained as the intercept from a linear fit of versus (Fig. 2). Such calibrations with resource intensive measurements are routinely performed in other contexts such as microarray-based binding measurements, which are calibrated against isothermal titration calorimetry/electrophoretic mobility shift assay experiments to enable measurement of binding affinities in high throughput (4, 54). The calibration enables us to take into consideration the effect of the modification on the energetics of the minor and ss states. This is in contrast to prior studies employing melting experiments to estimate the energetic contribution of specific motifs to nucleic acid stability (44, 47–49, 55), in which it is typically assumed that modifications do not impact the ss energetics (i.e., ∼ 0). Following calibration to estimate , for a new sample can be estimated in a facile manner using a pair of melting experiments as the sum () of the difference in melting energetics and :One of the advantages of delta-melt is that in theory there is no limit on the population and lifetime of the minor conformation that can be studied as long as it can be stabilized using a modification. It should be applicable to conformational states that fall outside NMR detection, and we will encounter such an example in this study (see below). Likewise, in theory, there is no size limit provided that the site-specifically modified nucleic acid can be synthesized (56). However, deviations from two-state melting behavior are more likely in large or more complex nucleic acid molecules, and this can complicate analysis of the UV melting data, as observed even for some of the short duplexes studied here (see below). In such cases, the melting experiments have to be supplemented with information from other techniques to understand the structural basis behind changes in absorbance (reviewed in ref. 57). In addition, delta-melt does not provide any information regarding the kinetics of interconversion and, consequently, binding mechanisms (induced fit vs. conformational selection) (58) or structure of the non-native state. It is also assumed that the modification robustly renders the desired ith minor conformation the major state (>90%) across different contexts and experimental conditions. If the modification confers a weaker bias, delta-melt may underestimate the true value of In the subsequent sections, we apply delta-melt to a wide range of conformational transitions in nucleic acids in order to evaluate the generality and robustness of the approach and test the validity of the above assumptions underpinning the approach.
Testing Delta-Melt on A-T Hoogsteen bps in Duplex DNA.
We initially performed a set of experiments to test the underlying assumptions and general utility of delta-melt to measure conformational preferences to form four different minor non-native conformations in DNA and RNA. As a first test, we used delta-melt to measure the preferences to form A-T Hoogsteen bps (7) in duplex DNA. The thermodynamic preferences to form A-T Hoogsteen bps in duplex DNA are of interest because they are proposed to determine DNA–protein binding affinities (4, 10) and the propensity of DNA damage induction (59).To form a Hoogsteen bp, the purine base in a Watson–Crick bp has to flip 180° about the glycosidic bond to adopt a syn conformation, and the DNA backbone has to constrict by ∼2 Å to allow hydrogen bonding between the bases (Fig. 3) (60, 61). We used N1-methylated adenine (m1A+), a positively charged naturally occurring form of alkylation damage, to substantially bias (>90%) the conformation of the A-T bp toward Hoogsteen, as shown previously (7, 62). The modification destabilizes the Watson–Crick A-T bp both by disrupting a hydrogen bond and through steric collisions (Fig. 3) (7, 62). Prior NMR studies showed m1A+-T to be an excellent structural mimic of the transient A-T Hoogsteen bps in unmodified DNA (63). As m1A+-T is a good structural mimic of the transient A-T Hoogsteen bp, it may also serve as a good energetic mimic for delta-melt experiments.
Fig. 3.
Testing delta-melt on A-T Hoogsteen bps in DNA using the N1-methyladenosine mimic. (A) In duplex DNA, A-T Watson–Crick bps exist in dynamic equilibrium with minor A-T Hoogsteen bps. Population of the Hoogsteen bp was obtained as described previously (8). N1-methylation of adenine (m1A+, red) biases the equilibrium to favor the Hoogsteen bp. Steric clashes are indicated using dashed lines. (B) DNA duplexes used in UV melting experiments with and without m1A+ substitution (in bold). rA corresponds to ribo adenosine. (C) Correlation plot between for formation of A-T Hoogsteen bps () and obtained from melting measurements on m1A+-T/A-T bp containing DNA (). Points are color-coded according to duplex (in B). Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations, while open symbols denote data derived from weak RD profiles ().
Testing delta-melt on A-T Hoogsteen bps in DNA using the N1-methyladenosine mimic. (A) In duplex DNA, A-T Watson–Crick bps exist in dynamic equilibrium with minor A-T Hoogsteen bps. Population of the Hoogsteen bp was obtained as described previously (8). N1-methylation of adenine (m1A+, red) biases the equilibrium to favor the Hoogsteen bp. Steric clashes are indicated using dashed lines. (B) DNA duplexes used in UV melting experiments with and without m1A+ substitution (in bold). rA corresponds to ribo adenosine. (C) Correlation plot between for formation of A-T Hoogsteen bps () and obtained from melting measurements on m1A+-T/A-T bp containing DNA (). Points are color-coded according to duplex (in B). Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations, while open symbols denote data derived from weak RD profiles ().We analyzed sequences shown previously to form stable duplexes in solution and bp positions for which the Watson–Crick to Hoogsteen exchange had previously been characterized using NMR relaxation dispersion (RD) (7, 63, 64) (Fig. 3). This included A-T bps in two different trinucleotide sequences (5′-CAA-3′/5′-AAT-3′) embedded in three different duplexes in two different positional contexts. These data were supplemented with additional NMR RD and corresponding delta-melt experiments under varying buffer and temperature (T) conditions (pH = 4.4 to 6.9, T = 10 to 30 °C, and 25 to 150 mM NaCl) (Fig. 3 and ). The pKa for ionization of the amino group in the m1A base is 7.2 (65); thus, under the pH conditions used here, the amino group is likely to be protonated and m1A is expected to be a good mimic of the Hoogsteen bp, as shown previously (63). Furthermore, prior studies (64) have also shown that under the conditions used for the UV melts the contribution of Dimroth rearrangement of m1A to N6-methyladenosine is negligible. Additional measurements were performed when chemically modifying the Hoogsteen bp with ribo-guanosine or its neighbor using N1-methyl guanosine, both of which are known to perturb the thermodynamic propensity to form Hoogsteen bps (64) (Fig. 3). This yielded a total of 14 data points which could be used to test delta-melt.All delta-melt experiments were performed using the same buffer conditions used in the NMR RD experiments (). The “curve-fitting” approach (66, 67) was used to extract the free energy of melting () from the UV melting curves for all samples (66, 67). The approach assumes two-state melting behavior and is commonly used in the literature (66, 67). A more rigorous approach evaluates the two-state assumption using concentration-dependent melting experiments to extract thermodynamic parameters (42, 43, 68). Although concentration-dependent measurements help gauge the two-state behavior and are also required to determine the enthalpy () and entropy () of melting more accurately, this substantially increases the sample requirements and reduces the throughput of the experiment. In addition, the concentration dependence is a necessary but insufficient condition for two-state melting (44). As we were primarily interested in free energy of melting (), which can be measured accurately using curve-fitting since the errors in the enthalpy and entropy typically cancel out (42), and since we have already previously verified using NMR that some of the duplexes studied here (A2 and A6-DNA) (69) exhibit two-state melting behavior, we favored using the curve-fitting approach in the application of delta-melt. For a more elaborate discussion regarding the UV melting experiments and the assumptions used in the analysis refer to .Indeed, a reasonable correlation (r = 0.70) was observed between = − () and measured using NMR RD (Fig. 3 and ). The correlation could be linearly fit (r = 0.7; Fig. 3) with a root-mean-square error (RMSE) of 0.3 kcal/mol and with ∼0.3 kcal/mol. The quantitative agreement between the delta-melt energetics and those derived using NMR RD suggest that the offset corresponding to the difference in energetic cost to methylate adenine-N1 in the ss versus the A-T Hoogsteen bp () does not vary substantially across the sequence and chemically modified contexts and buffer conditions examined, and that any variations remain small relative to the differences in (∼2 kcal/mol). These results therefore also indicate that delta-melt can be used to measure the thermodynamic preferences to form A-T Hoogsteen bps as a function of experimental conditions.Nevertheless, the correlation was not ideal and m1A+-T is an imperfect mimic of the A-T Hoogsteen bp as the methyl group introduces a positive charge to the adenine base. We therefore also tested neutral modifications such as C-T and T-T mismatches, which were recently shown to mimic the A-T Hoogsteen bp based on NMR chemical shifts analysis (4) (Fig. 4). As in the A-T Hoogsteen bp, the two bases in C-T and T-T mismatches also have to come into closer proximity relative to Watson–Crick bps to hydrogen bond, thus mimicking this key feature of the Hoogsteen conformation (63) (Fig. 4). Mismatches also have the advantage that they can be readily installed when synthesizing DNA. These experiments also assessed the robustness of delta-melt when using different modifications to mimic the minor conformation.
Fig. 4.
Testing delta-melt on A-T Hoogsteen bps in DNA using C-T and T-T mismatches as Hoogsteen mimics. (A) C-T and T-T mismatches mimic the conformation of the A-T Hoogsteen bp. (B) Duplexes used in UV melting experiments with and without C-T and T-T mismatches (in bold). (C and D) Correlation plot between obtained from melting experiments on C-T/A-T (C) and T-T/A-T (D) bp containing DNA () and for formation of A-T Hoogsteen bps as determined using NMR RD (). Points in C and D are color-coded according to the corresponding duplex in B. Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations, while open symbols denote data derived from weak RD profiles ().
Testing delta-melt on A-T Hoogsteen bps in DNA using C-T and T-T mismatches as Hoogsteen mimics. (A) C-T and T-T mismatches mimic the conformation of the A-T Hoogsteen bp. (B) Duplexes used in UV melting experiments with and without C-T and T-T mismatches (in bold). (C and D) Correlation plot between obtained from melting experiments on C-T/A-T (C) and T-T/A-T (D) bp containing DNA () and for formation of A-T Hoogsteen bps as determined using NMR RD (). Points in C and D are color-coded according to the corresponding duplex in B. Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations, while open symbols denote data derived from weak RD profiles ().Indeed, an improved correlation (r = 0.98 and 0.93) was observed between = − for both C-T and T-T mismatches (Fig. 4 and ) and to form A-T Hoogsteen bps as measured using NMR RD (). The data could be fit with RMSE = 0.2 kcal/mol for both C-T and T-T (Fig. 4 and ). Nevertheless, the deviation seen for the A16-T9 bp near the m1G10-C15+ Hoogsteen bp in A6-DNAm1G10 (brown point in Fig. 4 ) indicates that C-T/T-T mismatches may not be good energetic mimics of A-T Hoogsteen bp when neighboring another Hoogsteen bp, possibly because they do not capture unique stacking interactions involving the adjacent syn purine base. Indeed, this deviation was not observed when using m1A+ (Fig. 3).Taken together, the above results establish m1A+-T, C-T and T-T bps to be good energetic mimics of A-T Hoogsteen bps and support the underlying assumptions used in delta-melt to measure the thermodynamic preferences to form A-T Hoogsteen bps.In an analogous manner, delta-melt could also be used to measure pH dependent thermodynamic preferences to form G-C+ Hoogsteen base pairs in duplex DNA, by using the naturally occurring N1-methyl guanosine (m1G) modification to bias the conformation of the G-C bp towards Hoogsteen ().
Testing Delta-Melt on Base Open A-T States in Duplex DNA.
One of the advantages of delta-melt is that it can potentially be applied to low-populated and short-lived states falling below detection methods of conventional biophysical methods. We therefore tested its utility on the A-T base open state (Fig. 1), which based on NMR measurements of imino proton exchange rates (14) has an exceptionally low population of ∼0.001% and short lifetime on the order of ∼0.1 μs (Fig. 5). Compared to G-C+ and A-T Hoogsteen bps, the base open state is disfavored by ∼4 kcal/mol, providing an interesting test for the general applicability of delta-melt on an exceptionally low-populated conformational state that cannot be detected even by NMR RD methods.
Fig. 5.
Testing delta-melt on the open state of A-T bps using the N3-methylthymine mimic. (A) Exchange between closed and open A-T Watson–Crick bps in DNA. The T-H3 imino proton in the open state is susceptible to solvent exchange (blue thunder). m3T is hypothesized to bias the ensemble to mimic the base open state by disrupting the A(N1)–H3-(N3)T hydrogen bond and by sterically (curved dashed lines) disfavoring the closed bp. (B) Duplexes used in UV melting experiments with and without m3T substitution (in bold). (C) Correlation plot between for formation of open A-T bps () and obtained from melting experiments on A-m3T/A-T bp containing DNA (). Points are color-coded according to the duplex that they correspond to (in B). Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations ().
Testing delta-melt on the open state of A-T bps using the N3-methylthymine mimic. (A) Exchange between closed and open A-T Watson–Crick bps in DNA. The T-H3 imino proton in the open state is susceptible to solvent exchange (blue thunder). m3T is hypothesized to bias the ensemble to mimic the base open state by disrupting the A(N1)–H3-(N3)T hydrogen bond and by sterically (curved dashed lines) disfavoring the closed bp. (B) Duplexes used in UV melting experiments with and without m3T substitution (in bold). (C) Correlation plot between for formation of open A-T bps () and obtained from melting experiments on A-m3T/A-T bp containing DNA (). Points are color-coded according to the duplex that they correspond to (in B). Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations ().Based on imino proton solvent exchange experiments (70), the base open state entails the loss of the A(N1)–H3-(N3)T hydrogen bond. Thus, we hypothesized that substituting T by N3-methyl thymine (m3T), so that the thymine imino proton is replaced by a methyl group, should disrupt the A(N1)–H3-(N3)T hydrogen bond and bias the ensemble toward the base open state (70). Indeed, NMR spectra (, Fig. S8) revealed that the modification disrupted the A(N1)–H3-(N3)T hydrogen bond in the modified bp with m3T adopting an intrahelical conformation.We analyzed sequences shown previously to form stable duplexes in solution and bp positions for which A-T base opening has been previously characterized using NMR (TBP-DNA, Fig. 5) (71). We also performed additional imino proton exchange NMR and corresponding delta-melt experiments for another duplex (A6-DNA) under different buffer and temperature conditions, yielding in total 11 points for comparison, corresponding to six different trinucleotide sequence contexts (5′-TAG-3′/5′-TAT-3′/5′-AAG-3′/5′-AAA-3′/5′-TAA-3′/5′-GAT-3′) in two different duplexes and a range of buffer conditions (pH 8.0 to 8.8, T = 15 to 25 °C, 100 mM NaCl) (Fig. 5 andUV melting experiments were used to measure the energetics of duplex melting with and without m3T substitutions at these positions under the NMR buffer conditions (). Interestingly, a very good correlation (r = 0.89) was observed between the difference in melting energetics = − and for base opening measured using NMR imino proton exchange measurements (Fig. 5 and ) (71). The correlation could be fit with RMSE of 0.3 kcal/mol and ∼2.4 kcal/mol (Fig. 5) (). These results demonstrate the utility of delta-melt to measure the thermodynamic preferences for A-T base opening across different sequence contexts and buffer conditions and also establish A-m3T as a mimic of the base open state which has been recalcitrant to experimental structural characterization.
Testing Delta-Melt on Isomerization of the N6-Methylamino Group in Duplex RNA.
We also examined the utility of delta-melt to address more subtle conformational rearrangements involving rotation of a single methylamino group. N6-methyladenosine (m6A) is a highly abundant RNA modification that plays roles in virtually all aspects of mRNA metabolism (31, 72). When paired with uridine, the methylamino group in m6A isomerizes between anti (major) and syn (minor) conformations, resulting in the loss of a hydrogen bond (Fig. 6) (73). This conformational change has been proposed to play roles slowing a variety of biochemical processes that involve duplex melting and annealing (69).
Fig. 6.
Testing delta-melt on the isomerization of the N6-methylamino group in m6A-U bps in RNA. (A) (Top) Equilibrium between major anti and minor syn conformations of the N6-methylamino group in m6A-U bps in RNA. (Bottom) m62A mimics the syn methylamino group by sterically disfavoring (black dashes) the conformation with the anti methyl. (B) Duplexes used in UV melting experiments with m6A and m62A (indicated in bold) modifications. (C) Correlation plot between obtained from melting experiments on m6A-U/m62A-U bp containing RNA () and measured using NMR RD for formation of m6A(syn)-U bps (). Points are color-coded according to the duplexes in B. Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations.
Testing delta-melt on the isomerization of the N6-methylamino group in m6A-U bps in RNA. (A) (Top) Equilibrium between major anti and minor syn conformations of the N6-methylamino group in m6A-U bps in RNA. (Bottom) m62A mimics the syn methylamino group by sterically disfavoring (black dashes) the conformation with the anti methyl. (B) Duplexes used in UV melting experiments with m6A and m62A (indicated in bold) modifications. (C) Correlation plot between obtained from melting experiments on m6A-U/m62A-U bp containing RNA () and measured using NMR RD for formation of m6A(syn)-U bps (). Points are color-coded according to the duplexes in B. Error bars for NMR and delta-melt measurements were obtained using a Monte-Carlo scheme (94) and by propagating the uncertainties from UV melts, respectively, as described in . Shown are the Pearson’s correlation coefficient (r) and RMSE (). The blue shaded region denotes the estimate of the error of linear regression obtained using Monte-Carlo simulations.Using NMR chemical exchange saturation transfer (CEST) experiments (74), it was recently shown that the minor non-native m6A(syn)-U bp transiently forms with a population of ∼1% (73). It was also shown that N6,N6-dimethyladenosine (m62A), a posttranscriptional modification in transfer RNA (tRNA) (75), structurally mimics this non-native m6A(syn)-U bp (73). This modification destabilizes the major m6A(anti)-U conformation both by disrupting the A(N6)-H6–(O4)T hydrogen bond and through steric collisions (curved dashed lines, Fig. 6).We tested delta-melt on two sequences shown previously to form stable duplexes in solution, at bp positions for which the non-native m6A(syn)-U bp has been previously characterized using NMR and for which the m62A modification has been shown to recapitulate the NMR chemical shifts of the non-native m6A(syn)-U bp (73) (). Delta-melt was used to measure temperature-dependent (pH 6.8, T = 37 to 65 °C, 25 mM NaCl) thermodynamic preferences to form the non-native m6A(syn)-U bp at two positions in two duplexes in two different trinucleotide sequence and position contexts (5′-CAA-3′/5′-GAC-3′) (Fig. 6); UV melting experiments were used to measure the energetics of duplex melting with m6A and m62A modifications at these positions under the NMR buffer conditions ().An excellent correlation (r = 0.95) was observed between = − and for N6-methylamino isomerization in m6A measured using NMR RD (Fig. 6) (73). The correlation could be fit with an RMSE of 0.1 kcal/mol and ∼0.7 kcal/mol (Fig. 6) (). Thus, the energetic destabilization due to incorporation of m62A is correlated to the energetics of methyl rotation in m6A. These results show that delta-melt can be applied to measure thermodynamic preferences involving isomerization of small chemical groups and broaden its applicability to RNA in addition to DNA.
Using Delta-Melt to Measure the Cooperativity When Forming Tandem Hoogsteen bps in Duplex DNA.
Having benchmarked delta-melt and tested its underlying assumptions, we applied the approach to gain new insights into the biophysical properties of duplex DNA, which might not be feasible to obtain using more conventional approaches. In this first application, we used delta-melt to examine the cooperativity of forming two adjacent Hoogsteen bps. Tandem Hoogsteen bps are frequently observed in crystal structures of DNA in complex with proteins and drugs and in the context of DNA lesions that bias the bp conformation to Hoogsteen (reviewed in ref. 76). Furthermore, crystal structures of certain A-T repeat sequences form helices in which all bps are Hoogsteen (77), again pointing to Hoogsteen bps forming cooperatively. Thus, the thermodynamic preference to form a Hoogsteen bp could be modified by the presence of a nearby Hoogsteen bp, in turn impacting conformational penalties integral to recognition of proteins and other molecules. Such cooperativity could extend to other states such as the base open conformation.Measuring conformational cooperativity is especially challenging as it requires accurately determining the abundance of exceptionally low-populated conformational states in which two or more bp positions are simultaneously in a minor conformation. The population of tandem Hoogsteen bps is expected to be on the order of the product of the probabilities for forming each Hoogsteen bp (∼0.01%). Moreover, such conformations must be distinguished from states in which only one bp is Hoogsteen. Owing to these unique challenges, cooperativity has not been measured to date for any bp rearrangement in nucleic acids.In theory, delta-melt can be used to measure conformational cooperativity for any arbitrary number of conformational transitions by performing experiments employing the appropriate number of modifications in a duplex. We tested this approach and measured the cooperativity of forming tandem Hoogsteen bps using the validated m1A+ and m1G substitutions to induce Hoogsteen bps at specific positions. In what follows, it will be assumed that tandem N1-methylated bps are good energetic mimics of their unmodified tandem Hoogsteen counterparts and that the offset for tandem Hoogsteen bps is small, similar to isolated Hoogsteen bps (Fig. 3 and ) () or is equal to the sum of the offsets for forming the individual Hoogsteen bps ().If tandem Hoogsteen bps do form cooperatively in duplex DNA, the energetic cost to form two Hoogsteen bps simultaneously () should be smaller than the sum of the energetic cost to form each Hoogsteen bp at each position ( + ), i.e., . Conversely, for noncooperativity and for the case of negative cooperativity.Thus, subject to the assumptions outlined above, positive cooperativity predicts that the sum of energetic destabilization of single N1-methylations ( + ) is greater than the energetic destabilization due to tandem N1-methylation () with the difference between the two quantities () providing a measure of the extent of cooperativity (Fig. 7 and ),
Fig. 7.
Measuring cooperativity of Hoogsteen bp formation in DNA. (A) Estimating the cooperativity of Hoogsteen bp formation () using delta-melt (). Red bases denote modified N1-methylated nucleotides (m1A+ and m1G) that bias the bp conformation toward Hoogsteen. (B) The value of over four different bp steps (). Bp steps are color-coded according to the duplexes shown below. Errors in were determined by propagating the uncertainties from the UV melts as described in . Buffer conditions were 15 mM sodium phosphate, 150 mM sodium chloride, and 0.1 mM ethylenediaminetetraacetic acid and pH 5.4 or 6.8 for cooperativity involving G-C and only A-T bps, respectively. All values are reported at 25 °C.
Measuring cooperativity of Hoogsteen bp formation in DNA. (A) Estimating the cooperativity of Hoogsteen bp formation () using delta-melt (). Red bases denote modified N1-methylated nucleotides (m1A+ and m1G) that bias the bp conformation toward Hoogsteen. (B) The value of over four different bp steps (). Bp steps are color-coded according to the duplexes shown below. Errors in were determined by propagating the uncertainties from the UV melts as described in . Buffer conditions were 15 mM sodium phosphate, 150 mM sodium chloride, and 0.1 mM ethylenediaminetetraacetic acid and pH 5.4 or 6.8 for cooperativity involving G-C and only A-T bps, respectively. All values are reported at 25 °C.We used delta-melt to assess the cooperativity of forming tandem T-A/G-C, C-G/G-C, T-A/T-A, and A-T/T-A Hoogsteen bps corresponding to four different bp steps (5′-TG-3′/5′-CG-3′/5′-TT-3′/5′-AT-3′) embedded within two different duplexes under different buffer conditions (pH 5.4 to 6.8, 25 °C, 150 mM NaCl) (). We used the same DNA sequences employed in the experiments used to measure thermodynamic preferences for forming single Hoogsteen bps (Fig. 3 and ). Melting experiments were performed on tandem N1- methylated DNA duplexes in which both bps are expected to be Hoogsteen and their singly methylated counterparts, in which only one bp should be Hoogsteen (Fig. 7 and ).We first performed NMR experiments on the tandem methylated duplexes and their singly methylated counterparts to verify the presence of stably formed Hoogsteen bps. As expected, the singly N1-methylated duplexes showed NMR signatures of Hoogsteen bps (7), including upfield shifted T-H3 and downfield shifted m1A+-H6 protons for A-T Hoogsteen and the downfield shifted C-H4 proton for m1G-C+ Hoogsteen bps, respectively (). The tandem N1-methylated duplexes also exhibited the expected downfield shifted purine-C8 resonances corresponding to the N1-methylated syn purine bases (7, 64) and downfield shifted m1A+-H6 resonances corresponding to the m1A+-T Hoogsteen bps. However, the T-H3 signal was not observable and was likely broadened out of detection due to rapid solvent exchange kinetics.We then measured the cooperativity of forming Hoogsteen bps using delta-melt experiments on the doubly and singly methylated duplexes (). Indeed, across all four bp steps and different buffer conditions examined, tandem Hoogsteen bps formed cooperatively with sizeable = 1.0 to 2.5 kcal/mol (Fig. 7).If tandem Hoogsteen bps do indeed form cooperatively in DNA as implied by delta-melt, and robustly in a manner independent of the methyl modification, the thermodynamic preferences to form a Hoogsteen bp at an unmodified bp should also increase when the adjacent site is a Hoogsteen bp stabilized by an N1-methylated purine modification. We tested this prediction using NMR RD experiments to directly measure the thermodynamic preferences to form G10-C15+ and A16-T9 Hoogsteen bps in A6-DNA when adjacent to pre-formed m1A16+-T9 or m1G10-C15+ Hoogsteen bps, respectively (Fig. S12 ). Indeed, the thermodynamic preference to form Hoogsteen increased by ∼0.2 to 1.0 kcal/mol when the neighboring bp was a modified preformed Hoogsteen. Similar increases were also observed recently when using T-T or C-T mismatches to mimic the Hoogsteen bp (19).To further test these delta-melt predictions of tandem Hoogsteen cooperativity in an unmodified DNA duplex, we took advantage of the anti-tumor antibiotic echinomycin (78) (Fig. S12 ). Echinomycin has previously been shown to bind duplex DNA containing two CG steps separated by a CA step, inducing tandem G-C+/A-T Hoogsteen bps at the CA step at low pH 5.3 (79). If the tandem Hoogsteen bps at the CA step form cooperatively, reducing the preference to form a Hoogsteen bp at one position should also reduce the preference to form the Hoogsteen bp at the second position, resulting in the loss of NMR signatures unique to tandem Hoogsteen bps. Indeed, reducing the thermodynamic preference to form G-C+ Hoogsteen either by increasing the pH to 6.9 or by substituting guanine with 7-deaza guanine at pH 5.3 (80) resulted in the loss of NMR signals characteristic of tandem Hoogsteen bps (Fig. S12 ). Furthermore, the loss of tandem Hoogsteen bp signatures at pH 6.9 could be rescued by using m1G to stabilize the G-C+ Hoogsteen bp (Fig. S10 ). The above results suggest that Hoogsteen bps form cooperatively in echinomycin bound DNA, which may explain why multiple intercalating drugs such as echinomycin bind cooperatively to DNA (81).Taken together, the above results strongly suggest that tandem Hoogsteen bps can form cooperatively in duplex DNA. Future experiments could independently verify tandem Hoogsteen cooperativity, for example by examining the impact of single or double Hoogsteen mimicking substitutions on DNA–protein and DNA–drug binding affinities (4), or through the development of new approaches that can directly detect tandem Hoogsteen bps in unmodified DNA.
Using Delta-Melt to Measure Sequence-Specific Signatures of Hoogsteen and Base Open States in Duplex DNA.
Many biochemical processes involving DNA including protein recognition (12) and damage induction (82) and repair (83), as well as mutations including those related to cancer (84), are all strongly dependent on DNA sequence. When biochemical processes include steps that act on minor non-native conformational states of nucleic acids, such as the Hoogsteen and base open state, the sequence-specific thermodynamic preferences to form such conformations can shape the sequence specificities of the biochemical processes as well. However, the “conformational signatures” describing the sequence-specific preferences to form alternative conformational states have not been measured, primarily because of the difficulties in performing such a large number of NMR measurements.Although the thermodynamic preferences to form minor non-native conformations such as Hoogsteen (8) and base open states (71) have been measured for various bp steps, these prior observations were confounded by additional variation in the global sequence context and position along the helix. We therefore used delta-melt to overcome these barriers and measured conformational signatures for A-T and G-C+ Hoogsteen bps, and the A-T base open state for all 16 trinucleotide sequence contexts, at a given positional context. These measurements were performed on a highly stable scaffold sequence (scaf2, Fig. 8), enabling robust melting measurements across all 16 sequence contexts, including when destabilizing modifications (m1A+, m1G, and m3T) are present.
Fig. 8.
Measuring sequence-dependent thermodynamic preferences in DNA using delta-melt. (A) DNA duplex used to measure sequence-dependent thermodynamic preferences using delta-melt. W-X and Y-Z represent either of the four Watson–Crick bps (A-T/T-A/C-G/G-C). Thermodynamic preferences to form base open or Hoogsteen conformations were measured using delta-melt for the central bp indicated using a box. (B) Relative thermodynamic preferences () to form A-T and G-C+ Hoogsteen bps and the A-T base open states measured using delta-melt at 25 °C (). Sequence contexts are given in the 5′ to 3′ direction. Thermodynamic preferences were referenced to the sequence context with the most favorable preference (5′-GAG-3′, 5′-CGT-3′, and 5′-CAG-3′ for A-T Hoogsteen, G-C+ Hoogsteen, and A-T base open states, respectively). Sequences for which reliable thermodynamic preferences could not be obtained due to non-two-state melting behavior are denoted using * (). Buffer conditions for the measurements are described in . (C) Correlation plots between the thermodynamic preferences for formation of A-T and G-C+ Hoogsteen bps and open A-T bps at 25 °C. (Inset) The Pearson correlation coefficient (r). Note that for correlating preferences for A-T and G-C+ Hoogsteen bps, the purine bases were aligned with each other; for example, the sequence context 5′-AAA-3′ for A-T Hoogsteen was correlated with 5′-AGA-3′ for G-C+ Hoogsteen. Error bars in B and C were obtained by propagating the error from the UV melts and , as described in .
Measuring sequence-dependent thermodynamic preferences in DNA using delta-melt. (A) DNA duplex used to measure sequence-dependent thermodynamic preferences using delta-melt. W-X and Y-Z represent either of the four Watson–Crick bps (A-T/T-A/C-G/G-C). Thermodynamic preferences to form base open or Hoogsteen conformations were measured using delta-melt for the central bp indicated using a box. (B) Relative thermodynamic preferences () to form A-T and G-C+ Hoogsteen bps and the A-T base open states measured using delta-melt at 25 °C (). Sequence contexts are given in the 5′ to 3′ direction. Thermodynamic preferences were referenced to the sequence context with the most favorable preference (5′-GAG-3′, 5′-CGT-3′, and 5′-CAG-3′ for A-T Hoogsteen, G-C+ Hoogsteen, and A-T base open states, respectively). Sequences for which reliable thermodynamic preferences could not be obtained due to non-two-state melting behavior are denoted using * (). Buffer conditions for the measurements are described in . (C) Correlation plots between the thermodynamic preferences for formation of A-T and G-C+ Hoogsteen bps and open A-T bps at 25 °C. (Inset) The Pearson correlation coefficient (r). Note that for correlating preferences for A-T and G-C+ Hoogsteen bps, the purine bases were aligned with each other; for example, the sequence context 5′-AAA-3′ for A-T Hoogsteen was correlated with 5′-AGA-3′ for G-C+ Hoogsteen. Error bars in B and C were obtained by propagating the error from the UV melts and , as described in .In measuring the conformational signatures using delta-melt, our analysis assumed that the offsets are independent of sequence context. Although this is supported by our benchmarks (Figs. 3–6 and ), without additional delta-melt and NMR calibration experiments, we cannot rule out different values are applicable for different sequence contexts. In addition, for some sequence contexts, the melting curves deviated from a simple two-state behavior, and the data could not be used to extract reliable thermodynamic preferences (* symbols in Fig. 8, , and ). Additional studies are needed to understand the origins of the non-two-state behavior of the melting curves and how to extract robust thermodynamic parameters from them.Delta-melt revealed large sequence-specific variations in the thermodynamic preferences to form A-T (∼1.7 to 3.5 kcal/mol) and G-C+ (0.5 to 2.2 kcal/mol) Hoogsteen bps, as well as the A-T base open state (0.3 to 1.4 kcal/mol), which are on the order of 2 to 3 kcal/mol, over the sequence contexts which could be measured reliably using delta-melt (Fig. 8 and ). This corresponds to ∼10- to 100-fold sequence-specific variations in the population of the minor conformation in the dynamic ensemble purely due to changes in the sequence of the immediate neighbor.To verify the conformational preferences obtained using delta-melt, we used NMR RD to independently measure the energetic cost to form G-C+ Hoogsteen bps in the 5′-TGC-3′ and 5′-CGT-3′ sequence contexts (Fig. 9), predicted by delta-melt to have low and high propensities (Fig. 8) to form G-C+ Hoogsteen bps, respectively. Measurements were performed under two different buffer conditions and three different temperatures to test the robustness of the predictions. One of the contexts, 5′-TGC-3′ (data point 6, Fig. 9), with low propensity is predicted by delta-melt to have little to no observable NMR RD even under low pH conditions (pH 5.4, 150 mM NaCl, 15 °C). Such behavior has never been observed previously and therefore represents a strong prediction from the delta-melt approach. On the other side, the 5′-CGT-3′ context (data point 1, Fig. 9) is predicted to have a relatively large population (∼0.7%) of the minor G-C+ Hoogsteen bp at low pH (pH 5.4, 25mM NaCl, 30 °C). Prospective comparison of six thermodynamic preferences measured using delta-melt (Fig. 8) and NMR RD (Fig. 9) reveals an average accuracy of ∼0.5 kcal/mol, which is small relative to the range of 0.5 to 2.0 kcal/mol measured across different sequence contexts. Delta-melt correctly predicted the lack of observable RD for the 5′-TGC-3′ sequence (data point 6, Fig. 9) and enabled measurements of thermodynamic preferences at this position when it was not feasible using NMR RD. Delta-melt also correctly predicted the 5′-CGT-3′ context having the highest propensity to form G-C+ Hoogsteen (data point 1, Fig. 9). Nevertheless, we also observed a relatively large deviation of ∼0.8 kcal/mol between the delta-melt and NMR-derived thermodynamic preferences for G6-C19+ Hoogsteen bp formation in scaf2_TGCGC at pH 5.4 high salt (150mM NaCl) and 30 °C (data point 4, Fig. 9). This sequence context may have a unique offset (), as only one out of the seven measurements used for establishing calibration curves contains the same 5′-TGC-3′ trinucleotide step (). These results support the utility of delta-melt to determine sequence-specific conformational signatures and to identify potential conformational hotspots.
Fig. 9.
Testing the accuracy of delta-melt–derived thermodynamic preferences to form G-C+ Hoogsteen bps using NMR. (A) Duplexes used to test delta-melt derived thermodynamic preferences. 13C/15N isotopically labeled residues are in bold. All remaining residues are unlabeled. (B, Bottom) Comparison of measured thermodynamic preferences to form G-C+ Hoogsteen bps using NMR (orange, ) and values measured using delta-melt (red, ). Numbers 1 to 6 correspond to combinations of sequences and buffer/temperature conditions for the measurements and are given in . Lower-confidence data points () obtained from weak R1ρ RD profiles are denoted using *. Data point 6 with a flat R1ρ RD profile is denoted using “x,” indicating that the minor G-C+ Hoogsteen bp in this case falls outside the NMR RD detection limit. Error bars for free energies were obtained by propagating the uncertainty from UV melts (). (B, Top) Representative examples of R1ρ RD profiles showing the measured effective transverse relaxation rate (R2 + Rex) as a function of offset (ΩOBS/2π) and color-coded spin-lock power (ω1/2π) (). Errors bars were obtained by propagating the uncertainty in R1ρ as described in .
Testing the accuracy of delta-melt–derived thermodynamic preferences to form G-C+ Hoogsteen bps using NMR. (A) Duplexes used to test delta-melt derived thermodynamic preferences. 13C/15N isotopically labeled residues are in bold. All remaining residues are unlabeled. (B, Bottom) Comparison of measured thermodynamic preferences to form G-C+ Hoogsteen bps using NMR (orange, ) and values measured using delta-melt (red, ). Numbers 1 to 6 correspond to combinations of sequences and buffer/temperature conditions for the measurements and are given in . Lower-confidence data points () obtained from weak R1ρ RD profiles are denoted using *. Data point 6 with a flat R1ρ RD profile is denoted using “x,” indicating that the minor G-C+ Hoogsteen bp in this case falls outside the NMR RD detection limit. Error bars for free energies were obtained by propagating the uncertainty from UV melts (). (B, Top) Representative examples of R1ρ RD profiles showing the measured effective transverse relaxation rate (R2 + Rex) as a function of offset (ΩOBS/2π) and color-coded spin-lock power (ω1/2π) (). Errors bars were obtained by propagating the uncertainty in R1ρ as described in .Given the robustness of the delta-melt–derived thermodynamic preferences (Fig. 9), we then proceeded to examine their sequence dependencies. The largest sequence dependence was observed for A-T Hoogsteen (1.7 to 3.5 kcal/mol) followed by G-C+ Hoogsteen ( ∼ 0.5 to 2.2 kcal/mol) and with the open A-T state showing the weakest sequence-dependence ( ∼ 0.3 to 1.4 kcal/mol), possibly because of its relatively increased flexibility. Interestingly, the delta-melt–derived conformational signatures were very different (r ∼ 0 to 0.63) for these three conformational states (Fig. 8). Therefore, those trinucleotide sequence contexts showing a preference for forming one state may not also favor the formation of another state. For example, the 5′-GAT-3′/5′-GGT-3′ trinucleotide context is the third most favorable context for forming the A-T base open state and G-C+ Hoogsteen bps but is the third least favorable context for A-T Hoogsteen bps. If different processes act on different conformational states of the DNA, these conformational signatures could translate into different sequence specificities for the distinct DNA biochemical reactions.The delta-melt data also reveal that the 5′ and 3′ neighboring bps of the purine base of a given bp influence thermodynamic preferences to form the three non-native conformational states to different extents. The 3′ neighboring bp dominates the preferences for A-T and G-C+ Hoogsteen bps, whereas for the A-T base open state both neighbors contribute equally (). In addition, substantial variations (∼1.5 to 2.0 kcal/mol) in the sequence-specific thermodynamic preferences due to the third bp are seen for a given dinucleotide step, indicating that conformational signatures must at minimum be described by a triplet code ().Another advantage of delta-melt is that it provides unique insights into the origins of these different sequence-specific conformational signatures. For example, the sequence-specific variations to form A-T Hoogsteen bps appear to be dominated by the sequence-specific variations in the melting stability of the Watson–Crick A-T bp (,
Fig. S15). Therefore, those sequences that are least stable in the unmodified DNA have the highest propensities to form A-T Hoogsteen bps. This is in line with a prior study showing that A-T Hoogsteen bps have high propensities to form in CA and TA steps, which are the least stable in the Watson–Crick form (8).Interestingly, in contrast, the sequence-specific variations to form G-C+ Hoogsteen bps appears to be dominated by the sequence-specific variations in the melting stability of the m1G-C+ Hoogsteen bp (). These variations could arise due to large sequence-specific variations in the pKa of the cytosine imino proton, possibly mediated by sequence-specific stacking. Finally, in the case of the base open state, the sequence-specific preferences are determined by both the stability of the A-T Watson–Crick and base open (or specifically A-m3T) states (Fig. S15).
Discussion
Melting experiments have been used with great success to understand factors contributing to the stability of the major folded conformations of nucleic acids relative to the unfolded state, providing the basis for the nearest-neighbor rules, which are widely used to predict nucleic acid secondary structure and other properties of interest (44, 47). These prior studies took advantage of the relatively high throughput and low sample requirements of melting experiments, which permit broad explorations of how folding thermodynamics varies across different sequence and structural motifs and experimental conditions. In this study, we showed that this well-established approach (43, 44, 47–49) can be extended to the thermodynamic stabilities of minor non-native conformational states in the ensemble, whose contribution to melting experiments are typically negligible (53).The key to delta-melt is to use chemical modifications to render minor non-native conformational states in the ensemble the major state. This then makes it possible to access information regarding thermodynamic stabilities of minor conformations. It is assumed that the modification biases the ensemble toward the major state so that its population exceeds 90% robustly across the different sequence contexts and experimental conditions examined. We have validated this assumption using NMR for all of the modifications presented here under a variety of sequence and experimental conditions (7, 61) (). However, it may be necessary to test this assumption particularly for cases which give rise to outlier behavior. Thus, in practice, we recommend independent validation of the structure of the modified molecules using biophysical techniques such as NMR spectroscopy or X-ray crystallography for use with delta-melt. Furthermore, a fundamental assumption in delta-melt is that the relative energetic cost for modifying the minor conformation and single strands does not vary with sequence contexts and conditions. Under this assumption, calibration with independent measurements of conformational preferences over different sequence contexts and conditions as done here using NMR provides a basis for exploring the energetics of other conformations in the ensemble using melting experiments. While our study supports this assumption for the conformational states studied here, it needs to be further verified, especially for different sequence contexts through additional measurements of sequence-specific thermodynamic preferences using NMR. This might reveal subgroups of sequence contexts that should be calibrated independently, thus improving the accuracy of the approach.Prior studies established the utility of modifications as structural mimics for a variety of minor non-native conformations in nucleic acids (reviewed in ref. 40). Our results further establish chemical modifications as energetic mimics of minor conformational states. Thus, one can measure thermodynamic preferences to form non-native states in a facile manner using melting experiments, in contrast to NMR RD based approaches, which are comparatively more costly and sample intensive. In addition, our study introduced m3T as a mimic of the A-T base open state. Solving high-resolution structures and dynamic ensembles (63) for m3T modified DNA duplexes could shed light on the structure of this elusive state, in a manner analogous to how m1A+-T provided structural insights into transient Hoogsteen bps (61).Delta-melt made a number of unique applications possible. It allowed us to gain rare insights into the cooperativity of forming non-native bp conformations in nucleic acids, revealing that tandem Hoogsteen bps form cooperatively in duplex DNA. This implies that N1-methylation and possibly other forms of DNA damage can increase the Hoogsteen preferences at neighboring sites, and this could in turn be exploited by damage-repair enzymes such as glycosylases during recognition and repair. Delta-melt could be used in the future to more broadly examine how Hoogsteen preferences are altered with other perturbations such as mismatches, nicks, and other lesions (19). Delta-melt can also potentially be used to measure cooperativity involving different types of bp conformational changes such Hoogsteen and base open states as well as to assess whether cooperativity can arise between bps that are not immediately adjacent to one another (85).Delta-melt enabled us to measure the preferences to form non-native conformations in DNA as a function of the trinucleotide sequence context, revealing ∼10- to 100-fold differences in the probabilities of forming A-T and G-C+ Hoogsteen and A-T base open states due to changes in sequence context alone. We observe rich behavior in the sequence-dependent preferences to form these three conformational states. Not only did the scale of the sequence dependencies vary across the three states, being weakest for the base open conformation, we also observed differences in the extent to which these sequence preferences are driven by the stability of the major versus minor state. These results strongly suggest that hidden within the DNA double helix is a rich landscape of sequence-specific minor conformations. Future studies should verify these predictions from delta-melt and examine their biological implications.In the future, delta-melt could be used to study how changes in DNA sequence beyond the trinucleotide context affect conformational preferences and examine how the conformational preferences are shaped by epigenetic modifications such as 5-methylcytosine. The sequence-dependent thermodynamic propensities obtained from delta-melt can also be cross-referenced with sequence-dependent biological processes, including over 60 mutational signatures linked to cancer (86), to potentially identify minor conformations that drive them (). For example, we observe a strong correlation between the sequence-dependent probabilities for T > A mutations in mutational signature 17A and the thermodynamic propensities for A-T Hoogsteen bp formation obtained using delta-melt, suggesting potential biological roles for Hoogsteen bps that require future investigation ().It should be possible to apply delta-melt to other minor conformational states not examined here. For example, O6-methylguanine, a naturally occurring mutagen (87), could be used to measure thermodynamic preferences to form Watson–Crick like G•T/U mismatches in DNA, RNA, and RNA–DNA hybrids, which have been proposed to evade fidelity checkpoints, contributing to misincorporation errors during DNA replication, translation, and transcription (20). Furthermore, N3-methylcytosine could be used to measure thermodynamic preferences for opening G-C bps, and 8-oxo guanine and adenine, which are commonly occurring form of oxidative damage (88), could be used to measure the thermodynamic preferences to form alternative conformations of A-G mismatches such as A(anti)+-G(syn) and A(syn)-G(anti). Delta-melt could also be used to study the impact of experimental conditions on forming non-native conformations involving secondary structural rearrangements in RNA (89). It may be possible to extend delta-melt to proteins, for which mutants stabilizing minor conformations have been reported (90, 91). The throughput of delta-melt could also be increased in the future using high-throughput melting experiments (92). Through these applications, it may be feasible to extend the nearest-neighbor model to predict the energetics of various alternative nucleic acid conformational states, an important step toward predicting the conformational ensembles of nucleic acids.
Materials and Methods
Sample Preparation.
Buffer preparation.
Sodium phosphate based buffers used for the NMR and UV experiments were prepared as described in .
Annealing and buffer exchange.
Duplex DNA and RNA samples were prepared by mixing the two complementary strands (∼1 mM) in a 1:1 ratio in water, heating to 95 °C for 5–10 min, followed by slow annealing at room temperature. Hairpin DNA and RNA samples were prepared by diluting the samples in water to concentrations < 100 μM, heating to 95 °C for 5–10 min, followed by rapid annealing on ice. Additional procedures for sample preparation and buffer exchange are described in .
Oligonucleotides.
Unlabeled and labeled oligonucleotides were purchased or synthesized in-house as described in .
NMR experiments.
The imino 1H exchange experiments were carried out on a Bruker Avance III 700 MHz spectrometer equipped with a HCN room temperature probe while the remaining NMR data were collected on Bruker Avance III 600 MHz or 700 MHz NMR spectrometers equipped with HCPN and HCN cryogenic probes, respectively. Procedures for resonance assignment and analysis of R1ρ, CEST, and imino proton exchange data are given in .
UV Melting.
Experiments and sample conditions.
Optical melting experiments were carried out on a PerkinElmer Lambda 25 UV/VIS spectrometer with an RTP 6 Peltier Temperature Programmer and a PCB 1500 Water Peltier System, and a Cary-100 UV-Vis spectrophotometer. At least three measurements were carried out for each DNA and RNA duplex using a sample volume of 400 µL in a Teflon-stoppered 1 cm path length quartz cell. The absorbance at 260 nm was monitored while the temperature was varied at a ramp rate of 1 °C/min. Procedures for data analysis are given in .
Comparison of thermodynamic preferences between NMR experiments and delta-melt.
Thermodynamic preferences for conformational changes as measured by NMR were compared to melting energetics as measured by UV melting experiments, as outlined in Fig. 2 and described in detail in .
Authors: Michael F Berger; Anthony A Philippakis; Aaron M Qureshi; Fangxue S He; Preston W Estep; Martha L Bulyk Journal: Nat Biotechnol Date: 2006-09-24 Impact factor: 54.908
Authors: Francesco Colizzi; Cibran Perez-Gonzalez; Remi Fritzen; Yaakov Levy; Malcolm F White; J Carlos Penedo; Giovanni Bussi Journal: Proc Natl Acad Sci U S A Date: 2019-10-18 Impact factor: 11.205