Literature DB >> 25254491

Single-molecule FRET studies of HIV TAR-DNA hairpin unfolding dynamics.

Jixin Chen¹, Nitesh K Poddar, Lawrence J Tauzin, David Cooper, Anatoly B Kolomeisky, Christy F Landes.

Abstract

We directly measure the dynamics of the HIV trans-activation response (TAR)-DNA hairpin with multiple loops using single-molecule Förster resonance energy transfer (smFRET) methods. Multiple FRET states are identified that correspond to intermediate melting states of the hairpin. The stability of each intermediate state is calculated from the smFRET data. The results indicate that hairpin unfolding obeys a "fraying and peeling" mechanism, and evidence for the collapse of the ends of the hairpin during folding is observed. These results suggest a possible biological function for hairpin loops serving as additional fraying centers to increase unfolding rates in otherwise stable systems. The experimental and analytical approaches developed in this article provide useful tools for studying the mechanism of multistate DNA hairpin dynamics and of other general systems with multiple parallel pathways of chemical reactions.

Entities: Chemical Disease Gene Species

Mesh：

Substances：

Year: 2014 PMID： 25254491 PMCID： PMC4207534 DOI： 10.1021/jp507067p

Source DB: PubMed Journal: J Phys Chem B ISSN： 1520-5207 Impact factor: 2.991

Introduction

The melting and annealing of DNA hairpins are essential in many biological processes such as replication, transcription, recombination, gene expression, and DNA transposition for both prokaryotic and eukaryotic systems.[1,2] Furthermore, hairpins with multiple loops are known to play specific roles in viral replication.[3] An important example is the human immunodeficiency virus-1 (HIV-1) trans-activation response region (TAR) hairpin.[1,4] The TAR sequence is remarkably well conserved among HIV isolates, indicating a strong selection pressure to maintain its structure.[5] Thus, the TAR hairpin is of therapeutic interest.[6−9] The TAR–RNA hairpin and its complement, TAR–DNA hairpin, are involved in several crucial steps in the viral life cycle.[10−12] The TAR hairpin has four bulges, which have been found to be critical to the biological function of the TAR sequence because they determine the hairpin unfolding/folding dynamics.[5,13−15] As a general topic, understanding hairpin dynamics is further motivated by the advent of therapeutics with aptamers, which are small RNA and DNA molecules that often form single or multiloop hairpin conformations.[16,17] Schematic of proposed examples of unfolding/folding routes of (a, b) model DNA hairpins and (c) the TAR–DNA hairpin with two dyes Cy3 and Cy5 labeled to the ends. Urea molecules within the solution are shown, and the double helix is not shown for easier demonstration. In order to understand the molecular-scale dynamics of DNA/RNA hairpins, hairpins have been studied using technologies such as temperature-jump,[18,19] optical trap,[20] single-molecule fluorescence resonance energy transfer (smFRET),[21−26] and combinations of spectroscopic techniques.[2,27−31] However, these hairpin structures usually have one single loop connecting a stem region of several base pairs (Figure 1a). It is generally understood that the unfolding/folding rates of such simple DNA hairpins are dependent on the binding energy of the hairpin, the diffusion rate of the two ends of the stem followed by nucleation, and the propagation of base pairing.[30,32−35] This process yields folding times that range from milliseconds to microseconds, depending on the sequence length and base composition.[22,27] smFRET is particularly suited to this study due to its wide applications in studying the single-molecule dynamics of nucleic acids.[36−38]

Figure 1

Schematic of proposed examples of unfolding/folding routes of (a, b) model DNA hairpins and (c) the TAR–DNA hairpin with two dyes Cy3 and Cy5 labeled to the ends. Urea molecules within the solution are shown, and the double helix is not shown for easier demonstration.

For DNA melting (unfolding), a “fraying and peeling mechanism” has been predicted,[39,40] and for annealing (folding) a “collapsing mechanism” has been proposed.[41] Molecular dynamics simulations of the unfolding of short double-stranded DNA have suggested that DNA is opened via untwisting and then peeling.[39] This rapid “fraying” at the end of the helix has been experimentally observed for simple model DNA molecules.[42] This mechanism suggests that the unfolding of the DNA helix starts from one end of the stem and progresses dynamically to the other end of the DNA, similar to unzipping a zipper (Figure 1a). During the folding process of DNA hairpins, end-to-end contact (collapse) has been observed using temperature-jump measurements.[41] This mechanism suggests that the unfolded DNA stalks are extremely flexible and end to end closing is common (Figure 1b).[35] This flexibility is consistent with the molecular dynamics simulations where multiple intermediate states and trap states have been observed.[39] It remains an open question as to whether the general conclusions discussed above can be extended to describe the dynamics of more complex biologically relevant DNA hairpins that include loops and bulges. We hypothesize that a possible biological function for a hairpin loop/bulge is to serve as an additional fraying center to increase unfolding rates in otherwise stable systems. This has been explained thermodynamically using a free energy penalty in hairpin pairing, and the effect of the bulges on folding/unfolding dynamics of the hairpin has been predicted.[43] However, it has been difficult to experimentally measure the stability of intermediate states for complicated structures because of the coexistence of multiple states. In this article, we carried out single-molecule FRET experiments to study the complex dynamics of HIV TAR–DNA hairpin. In order to tune the lifetime of the TAR–DNA folding/unfolding dynamics to our measuring time scale, we introduced two additives, urea and poly(ethylene glycol) (PEG), to the buffer solution. After the smFRET data were obtained, we performed a state analysis algorithm and derived a statistical analysis model to calculate the stabilities of the intermediate states. Structures of the DNA hairpins used in the smFRET studies. Predicted secondary structure of the (a) TAR–DNA with four bulges and a loop and (b) TAR–DNA mutant with the bulges removed. Cy3 and Cy5 were used as the donor and acceptor dye molecules which were coupled to the 5′-dT and 3′-dT of the DNA, respectively. The DNAs were attached to the surface via a biotin linker attached to a -dT in the hairpin loop region.

Experimental Section

Sample Preparation

Purified and labeled single-stranded DNA (ssDNA) TAR and a mutant with the bulges removed (Figure 2) were acquired from TriLink Biotechnologies. The ssDNAs were modified with functional groups: biotin was used for surface immobilization; Cy3-amidite was directly coupled to the 5′ end and Cy5-succinimidyl ester was coupled to a C6 amino linker at the 3′ end of the DNA; dT spacers were designed at the end of the sequences to reduce unwanted photophysical effects. The ssDNAs were immobilized on glass substrates using the biotin–streptavidin interaction. Briefly, plasma cleaned glass coverslips were functionalized with aminosilane (Vectabond,Vector Laboratories). The slides were then grafted in a aqueous solution of 25% (m/m, mass fraction) methoxypoly(ethylene glycol) 5000 propionic acid N-succinimidyl ester (>80%, Sigma-Aldrich), 0.25% (m/m) SUNBRIGHT BI-050TS (Biotin-PEG-COO-MAL, Mw 5000, NOF Corporation, Japan), and 0.8% (m/m) NaHCO3 (Sigma-Aldrich). Custom HybriWell chambers (Grace Bio-Labs) which had a volume of ∼15 μL, secure seal spacers (Grace Bio-Labs), tube connectors (Grace Bio-Labs), and Teflon tubing (Western Analytical Products) were used to construct a flow chamber that was attached to each biotin-PEGylated slide.[44] The biotin-PEGylated slide was incubated with 2 mg mL–1 streptavidin (Invitrogen) in 25 mM HEPES (Sigma-Aldrich) and 40 mM NaCl (Sigma) buffer solution for 10 min followed by DNA (200 pM) adsorption for 20 min. Before the DNAs were attached to the streptavidin-labeled substrates, the DNA samples were denatured at 80 °C in buffer solution for 2.5 min and annealed at 60 °C for 2.5 min, and then 2 mM MgCl2 (Ambion) was added and the solution was reannealed at 0 °C for 5 min to homogenize the samples.

Figure 2

Structures of the DNA hairpins used in the smFRET studies. Predicted secondary structure of the (a) TAR–DNA with four bulges and a loop and (b) TAR–DNA mutant with the bulges removed. Cy3 and Cy5 were used as the donor and acceptor dye molecules which were coupled to the 5′-dT and 3′-dT of the DNA, respectively. The DNAs were attached to the surface via a biotin linker attached to a -dT in the hairpin loop region.

FRET Measurements

Single-molecule images were acquired by a home-built sample scanning confocal microscope based on a Zeiss Axiovert 200 microscope. Raster scanning of the sample coverslip was achieved by a closed-loop xyz piezo stage (P-517.3CL; Physik Instrumente) with 100 × 100 × 20 μm travel range and a minimum resolution of ∼1 nm (SPM 1000; RHK Technology). A 532 nm diode-pumped solid-state laser (Coherent, Compass 315M-100 SL) was used as the excitation source. The light was expanded to overfill the back aperture of a Fluar 100× 1.3 NA oil immersion microscope objective lens (Carl Zeiss, GmbH) which focused the laser light to a spot with a full width at half-maximum (fwhm) beam radius and height of ∼125 nm and ∼1 μm, respectively. The intensity of the laser was controlled with a neutral density filter to be ∼4 μW before the objective, yielding an estimated total power density at the sample of ∼800 W cm–2. The fluorescence signal was collected and refocused by the same objective and was separated from the excitation light using a dichroic mirror (z532rdc; Chroma Technology). Scattered laser light was removed by the use of notch and emission long-pass filters (NHPF-532.0, Kaiser Optical; ET585 and ET685, Chroma Technology). The refocused signal was then further separated by a beam splitter (Chroma 640 DCXR) into donor emission and acceptor emission fluorescence and then finally directed to two avalanche photodiodes detectors (SPCM-AQR-15; PerkinElmer). The smFRET experiments were carried out at room temperature (20 ± 1 °C). Into the flow cell, a buffer solution was flowed at 1 μL min–1 for the duration of the measurements. The HEPES buffer solution containing an oxygen-scavenging system to extend the lifetime of the fluorophores was used in all experiments, and was prepared according to an established protocol:[45] 3% (w/w) β-d-(+)-glucose (Sigma-Aldrich), 0.1 mg mL–1 of glucose oxidase (Sigma), 0.02 mg mL–1 of catalase (Sigma-Aldrich), 40 mM NaCl, 25 mM HEPES buffer, and saturated Trolox solution (6-hydroxy-2,5,7,8-tetramethylchroman-2-carboxylic acid; Sigma-Aldrich). In addition, cosolute 2 mM MgCl2, urea (Sigma-Aldrich) and/or PEG-6000 (Sigma-Aldrich) were added to the solution from stock solutions of 10 M urea and 60% PEG respectively when needed.

SmFRET Analysis

All analysis programs were written in MATLAB (R2009b) except for the hidden-Markov models (HMMs) analysis methods for FRET efficiency trajectories, which were provided by the HaMMy GUI (http://bio.physics.uiuc.edu/HaMMy.html, accessed 09/2013)[46] and vbFRET (http://vbfret.sourceforge.net, accessed 09/2013).[47] The emission intensity trajectories were collected at 1 ms resolution and later binned to 10 ms time steps to improve signal-to-noise ratio. The corrected fluorescence signal trajectories were used directly to calculate the FRET efficiency (EFRET), as the fraction of the fluorescence signal of the acceptor dye over the total signal of acceptor dye and the donor dye:[17,48,49]where Iacceptor and Idonor correspond respectively to Cy5 and Cy3 fluorescence intensity with background and crosstalk correction and blinking removed.[17,48,49] The fitting processes and algorithms can be found in the original literature.[46−52] Briefly, trajectories of all the molecules are combined into a single data file without further modification, and then the file is fed to the two software packages for fitting. During the fitting, the number of states is varied and the other fitting parameters are kept at the software defaults.

Simulation of Wormlike Chain (WLC) Model

The average FRET efficiency, ⟨E⟩, within any long-enough bin time is calculated with WLC:[53,54]where r is a unitless value representing the end-to-end distance R over the maximum possible distance L of the ends-labeled polymer; R0 is the constant Förster radius; and p(r) is the probability factor:where A is a normalization constant:and t is related to another constant called persistence length Lp, a basic mechanical property quantifying the stiffness of a polymer: t = L/Lp. The established WLC model can be applied to our smFRET data of TAR–DNA. The maximum possible length of the ssDNA L = 0.63N nm, where N is the number of unpaired nucleotides (nt) between the two ends with 0.63 nm/nt length.[53] The Förster radius R0 for Cy3–Cy5 dye has been measured to be ∼6 nm when attached to DNA.[53,55] The persistence length of TAR–DNA in urea is estimated from comparing the histogram of smFRET data and simulated FRET values. The smFRET values are simulated with Metropolis Monte Carlo simulations of the time trajectory of the end-to-end distance R.[53,56] In every time step (10 ps), R is allowed to randomly walk between 0 and L with a Gaussian distributed distance step centered at 0.55 nm and a standard deviation 0.2 nm according to the above probability function (representing a 1D diffusion coefficient of ∼1.5 × 10–4 cm2 s–1 = (0.55 nm)2/2/0.01 ns).[53] At each step, the donor will be excited at a probability of 1/5 ns–1, ∼5 times slower than its fluorescence decay rate. If the donor is excited, then it has a decay lifetime of τD ∼ 1 ns into donor fluorescence or (R/R0)6τD into a nonexcited acceptor molecule. If the acceptor is excited, it has a fluorescence decay lifetime 1.3 ns as measured (1.3 ± 0.1 ns, see Supporting Information). The total simulation time for each number of nucleotide is 1 ms. The FRET efficiency is the fraction of the number of steps of acceptor emission over sum of the steps of acceptor emission and donor emission.

Results and Discussion

Photophysics of the Dye Molecules

Blinking and bleaching of the dyes, as well as the dye–DNA interaction, were confirmed to have little influence on our smFRET measurements of the hairpin dynamics. We labeled the two ends of the DNA hairpin with Cy3 and Cy5 and immobilized the hairpin on PEGylated glass slides via biotin–streptavidin interaction, as shown in Figure 2. One potential issue with smFRET experiments is that the photophysical stability of the dyes can change depending on the solution as well as the dye–DNA interaction. These conditions can affect the quantum yields of the dyes and thus affect the FRET efficiency between the donor dye and the acceptor dye.[57] When covalently attached to DNA, cyanine dyes are well-known to bend and attach to DNA basepairs with hydrophobic interactions, varying the dyes’ quantum yields via conformational confinement and charge transfer.[58] The average quantum yields of the dyes are dependent on the DNA sequences they are attached to;[57,59,60] however, the variation of single-dye quantum yield is not observed during our smFRET measurement, probably because the above-mentioned dynamics are too fast to be observed on the time scale of milliseconds to seconds common for single-molecule measurements. As each of our smFRET data points is calculated from the total photon counts of the two dyes during 10 ms, the variations at shorter time scale are time-averaged. Stable photon counts with shot noise were observed for the smFRET time trajectory of the bulge-removed mutant DNA hairpin in HEPES buffer solution (Figure 3a), for which no unfolding dynamics are expected at room temperature and a stable FRET value is expected. This stability of photon counts (representing the quantum yield) confirms that any dye–DNA interactions are (1) minimal and (2) faster than the dynamics measured in our experiments. The single-step bleaching profile confirms that we are measuring single-molecule events. The stability of the dyes are also observed in the presence of different cosolutes (Figure 3b–d), which is consistent with the unchanged lifetimes of the dyes under the different solutions (see Supporting Information for time-resolved fluorescence data). This stability of smFRET at the millisecond time scale is consistent with other smFRET studies of DNA hairpins labeled with the same two dyes.[53,61,62]

Figure 3

Representative photon trajectories show stable photon counts of Cy3 and Cy5 attached to the ends of mutant DNA hairpin in (a) HEPES buffer with 2 mM Mg2+, (b) HEPES buffer with 2 mM Mg2+ and 24% PEG, (c) HEPES buffer with 2 mM Mg2+ and 6 M urea, and (d) HEPES buffer with 6 M urea (full trajectory shown in the Supporting Information). These are raw data for typical molecules binned at 10 ms with bleaching of either dye shown as the transition point of the signals. The FRET histograms of over 50 molecules/each are shown in Figure 4a–d, respectively.

Figure 4

Global ensemble histogram of the mutant and TAR–DNA in (a, e) HEPES buffer with 2 mM Mg2+; (b, f) HEPES buffer with 2 mM Mg2+ and 24% PEG; (c, g) HEPES buffer with 2 mM Mg2+ and 6 M urea; (d, h) HEPES buffer with 6 M urea. The total counts of the histograms are normalized to unity. Insets show the mean FRET efficiency, μ, (error is the standard deviation of three independent measurements) representing the average conformational structure of the DNA; the standard deviation, σ, of the histogram that represents the variation of the FRET distribution and thus the range of conformations; and the number of single molecules measured for each sample, #.

Tuning the Folding/Unfolding Lifetime

Two challenges arise when measuring the dynamics of the TAR–DNA hairpins: the equilibrium lies strongly toward the folded state of the hairpin, and some of the kinetic processes are too fast to be observed by our millisecond time resolution. On the basis of the dynamics established from model hairpins,[22,27] we calculate that the folded-state and unfolded-state lifetimes of the states of TAR–DNA hairpin are at ∼1 ms and ∼10 μs, respectively (see Supporting Information). These values indicate that, at equilibrium, the TAR–DNA hairpin effectively remains folded at room temperature, with brief explorations of the unfolded state that are too fast to be resolved with typical smFRET experiments carried out at the 1–100 ms time scale. Therefore, our ability to characterize even two-state folding kinetics of the hairpin is limited by the fast folding rate (or unstable unfolded state). In the retroviral replication process, the unfolding/folding dynamics are altered by the nucleocapsid (NC) protein,[63,64] which destabilizes the two break points near the open end of the TAR hairpin and allows for the characterization of the protein-induced hairpin unfolding/folding dynamics of the outermost bulge by smFRET, which has a minimum time resolution at ∼1 ms level.[45,62,64−66] In order to understand the mechanism of multiloop hairpin unfolding/folding dynamics, alternative methods were pursued to shift the equilibrium toward the unfolded states and to slow down the dynamics to our experimental time resolution. To this effect, we introduced two additives, poly(ethylene glycol) (PEG) and urea, to the buffer solution. It is well-known that crowding agents such as PEG, sucrose, and glycerol increase the viscosity of aqueous solutions,[67−69] and studies have shown that PEG solutes can destabilize DNA at small weight values of PEG[70,71] but do not significantly affect the stability of DNA if the PEG is larger than 1 kDa.[72,73] Thus, PEG-6000 is used in this study. Urea, a commonly used destabilizer of DNA and proteins, was used to induce helix unfolding and to shift the hairpin folding equilibrium away from the folded state at room temperature.[21,24,74] Although a general consensus on the biological relevance of urea as a denaturant has not been reached, there has been recent evidence in support of urea to perturb conformational changes of nucleic acids and proteins.[74] This conclusion is consistent with the successful application of urea in studying human telomerase RNA pseudoknot folding/unfolding dynamics using smFRET.[24] The smFRET efficiency distribution of the TAR-DNA hairpin is broadened when PEG-6000 is added to the buffer solution (Figure 4f), as the standard deviation increases to 0.12 FRET efficiency compared to 0.02 in HEPES buffer. Under the same conditions, the standard deviation of the bulge-removed mutant only slightly increases to 0.04 (Figure 4b). We confirmed that PEG-6000 has negligible effects on the time-averaged secondary structure of DNA and the dye’s photophysical response using circular dichroism (CD) and fluorescence anisotropy decay measurements for both the standard TAR–DNA and the mutant construct in the presence and absence of PEG (see Supporting Information). Therefore, we consider PEG-6000 a suitable crowding agent to slow down the dynamics of the TAR–DNA and mutant constructs and that the broadening of the smFRET distributions depicted in Figure 4 can be attributed primarily to conformational broadening. Global ensemble histogram of the mutant and TAR–DNA in (a, e) HEPES buffer with 2 mM Mg2+; (b, f) HEPES buffer with 2 mM Mg2+ and 24% PEG; (c, g) HEPES buffer with 2 mM Mg2+ and 6 M urea; (d, h) HEPES buffer with 6 M urea. The total counts of the histograms are normalized to unity. Insets show the mean FRET efficiency, μ, (error is the standard deviation of three independent measurements) representing the average conformational structure of the DNA; the standard deviation, σ, of the histogram that represents the variation of the FRET distribution and thus the range of conformations; and the number of single molecules measured for each sample, #. Further analysis of the distribution of the FRET efficiencies of the two DNA hairpins in PEG solution suggests that the unfolding of the hairpins by thermoagitation, known as “fraying”,[42] stops after each loop, as long as there are sufficient base pairs between loops to provide a barrier to further unfolding. We compared the distribution of the FRET efficiencies (Figure 4) with previous reported distributions of end-labeled TAR–DNA.[45,62,64] The FRET efficiency distribution of TAR–DNA is consistent with the hairpin unfolding to the second bulge from the opening (Figure 2, bulge 2). This is expected because only two base pairs connect bulges 1 and 2 in TAR–DNA, making it the weakest of the bulge connecting sections. This behavior has also been observed previously in the presence of NC proteins.[45,62,64] By analyzing the dwell times for transitions between the two observed smFRET states of TAR–DNA with 24% PEG, as identified by HaMMy, we can confirm that the addition of PEG slows down the unfolding/folding dynamics of the TAR–DNA hairpin. The unfolded-state and folded-state lifetimes of the TAR–DNA hairpin are 142 and 353 ms, respectively (Supporting Information Figure S6), and are slower by 3 orders and 1 order of magnitude, respectively, when in the presence of PEG, shifting them well within the resolvable time frame of smFRET observations. Slowing down the fraying dynamics, however, does not allow us access to all of the possible open hairpin states. Thus further perturbation is required to accomplish this goal. Insets indicate the different concentrations of urea, the mean FRET efficiency (μ), the standard deviation (σ) of the FRET efficiency, and the number of molecules measured (#). By tuning the concentration of a denaturant, urea, we can shift the TAR–DNA hairpin equilibrium to more opened states to observe each of the distinct loop unfolding/folding transitions. The ensemble smFRET histograms for each condition are included in Figure 5, and short pieces of smFRET trajectories for each condition are shown in Figure 6. The trajectories in Figure 6 shift to more open states as the urea concentration is increased, referred to as S1, S2, S3, S4, and S5. Because fluorescence measurements have suggested that the photophysical properties of the dyes are not changed by the presence of urea (see Supporting Information), the primary explanation for the broadening of the FRET distribution of the FRET efficiencies in Figure 5 is urea-induced hairpin unfolding. Single-molecule time trajectories (Figure 6) suggest that the broadening is due to transitions among newly observable FRET states. In the presence of 1 and 2 M urea, the FRET efficiency distributions of the DNA hairpin (Figure 5a,b) are almost the same as the distributions of those with no urea (Figure 4), and thus only one state is observed in the time trajectories (Figure 6f). When the urea concentration increases to around 3 M, the FRET efficiency distribution (Figure 5c) becomes more broad and tails toward a FRET efficiency value of 0.8, in the direction of the state between 0.6 and 0.8 observed in the single-molecule FRET time trajectories (Figure 6g). More states are observed at successively higher urea concentrations (Figures 5d,e and 6h,i), until, in the presence of 6 M urea (Figures 5f and 6j), all states become observable, including those with FRET efficiencies at ∼0.4 and ∼0.2. Qualitatively, the smFRET data change the ensemble steady-state view of urea denaturation of DNA to a more dynamic picture. The ability of urea to control the equilibrium of the TAR–DNA hairpins is more obvious in the average values of the ensemble FRET efficiency (Figure 6k). Just like the results one would get from ensemble measurements, urea reduces the average FRET values. At the single-molecule level, however, the larger the urea concentration, the longer the dwell times of more opened states (Figure 6l).

Figure 5

Insets indicate the different concentrations of urea, the mean FRET efficiency (μ), the standard deviation (σ) of the FRET efficiency, and the number of molecules measured (#).

Figure 6

Proposed structures and smFRET trajectories with their FRET efficiencies showing five different states of TAR–DNA in its folded form S1 (a, f), 0–2 M urea (scheme showing as an example structure of the state); S2 (b, g), 3 M urea; S3 (c, h), 4 M urea; S4 (d, i), 5 M urea; and unfolded hairpin form S5 (e, j), 6 M urea in the absence of Mg2+ (full trajectories shown in the Supporting Information). (k) The mean FRET efficiency as a function of urea concentration represents the denaturing (unfolding) of the DNA. Error bars are standard deviation of three measurements, >20 molecules for each urea concentration at different days for three different samples. Relatively small error bars indicate that the number of molecules is large enough to represent ensemble average. (l) Number of states observed under our specific experimental conditions. According to Figures 4–6, as well as previous studies on smFRET of TAR–DNA,[45,64] we assigned the FRET efficiency 1.0–0.9 to state S1 of TAR–DNA; ∼0.8 to state S2; ∼0.6 to state S3; ∼0.4 to state S4; and 0.3–0.0 to the completely unfolded state S5, all associated with opening through the sequential bulge regions. The assignment is consistent with our hypothesis that a hairpin with four bulges and a loop should yield five resolvable states (Figure 7).

Figure 7

(a) Scheme of the breaking points of TAR–DNA. The four breaking points (Bp) define five states S, and each state has probability of P to be observed in smFRET experiments. Each breaking point has a closed probability F. Bp1a and Bp1b are jointed together as has been discussed earlier in the main text. (b) Scheme of smFRET trajectory with random transitions between states. The histogram summarizes the random trajectory with the bars representing the probabilities of observing the states. (c) Scheme of the microstates and the transitions between the five FRET states. Because the states are defined by the bulges that are connected with the breaking points (Bp) (Figure 7a), quantitatively, the equilibrium constant of each breaking point can be calculated from the probabilities of the states (Figure 7b). According to the ergodic principle, the probability of each state measured at the single-molecule level represents its concentration in ensemble experiments. State S1 represents the eight microstates that have Bp1 closed but can have Bp2–4 either opened or closed (Figure 7c); state S2 contains four microstates; state S3 has two microstates; and states S4 and S5 have only the one microstate. As such, a statistical approach is proposed to obtain the equilibrium constants Kclosed, from the probabilities of the five FRET states:The equilibrium constant is defined by closed probability F: Kclosed, = F/(1 – F), and the free energy can be calculated via ΔG = –RT ln Kclosed,, where R is gas constant and T is temperature. The measured probability of each state can be expressed as a function of F, and the following expressions can be written:Therefore, F can be calculated from the measured state probabilities P, and ΔG can be calculated from F. In order to obtain the probabilities of the states P under 6 M urea when all the states are observed, the hidden Markov model (HMM)[26,46,50−52,75] and the wormlike chain (WLC) model[53] were used to analyze and refine the FRET states of TAR–DNA in the next two sections. The rate constants of the transitions among different FRET states are extractable from the time trajectories, but the process is complicated by measurement noise, state-blur induced by fast transitions within each binned time, variation among molecules, the breakdown of ergodicity for individual molecules, and the complexity of the transitions among the five states. Many methods have been developed to analyze or assist in the analysis of these kinds of complicated time trajectories including the widely used HMM,[46,48,50−52,75−79] which has recently been successfully used to analyze very complicated hairpin smFRET data.[26]

Using HHM To Obtain the Probabilities of the Five States

First we use HMM to fit the trajectories for FRET states and extract the dwell-time distributions of the different states using two HMM packages HaMMy and vbFRET.[46−52] Both packages are well-established programs that use the HMM principle but implement it differently, with HaMMy using maximum likelihood (ML) and vbFRET using maximum evidence (ME) as a measurement of the goodness of fit. The ML and ME scores increase with the number of states and does not reach the maximum even at 10 states (Figure 8a). This is expected because noise and state-blur caused by the fast dynamics among the states make the data better fitted with more parameters, which is consistent with the analysis results of simulated traces with five preset states, designed to have similar noise levels and fast dynamics on the order of the bin time. The five states identified by HaMMy are 0.92, 0.76, 0.57, 0.42, and 0.33 and by vbFRET are 0.94, 0.89, 0.74, 0.54, and 0.37.

Figure 8

(a) Fitting probability as a function of number of states with HaMMy (maximum likelihood, ML) and vbFRET (Maximum evidence, ME). Five-state is picked based on the hairpin unfolding model explained in the text. (b) Average FRET efficiencies vs the number of unpaired bases between the donor dye and the acceptor dye predicted by wormlike chain model assuming 0.8, 1.2, and 1.5 nm persistence length of the ssDNA. The curves represent the simulated data (see Experimental Section). The blue dots are the states identified by HaMMy, and the magenta squares are the states identified by vbFRET where the first two points are combined together.

The fitting results from both packages are compared to the simulation results of wormlike chain (WLC) model to check which result is more consistent with our model of the hairpin dynamics. WLC model is a theory that can be used to explain the average separation distance of the two ends of a polymer chain under fast diffusion, where the most important parameter is the persistence length of the polymer that characterizes the softness of the polymer chain.[53,54] We are particularly interested in this model because smFRET of Cy3- and Cy5-labeled ssDNA has been used to measure the persistence lengths of ssDNA under specific solutions; thus, all parameters except for the persistence length have been established in the literature.[53] Because the persistence length varies with salt conditions, and is affected by pH value and urea, too,[53,80,81] different persistence lengths are used to simulate the FRET curves vs the number of the bases between the two ends of the unfolded hairpin (Figure 8b). From these curves, the five states identified by HaMMy follow the trend well, while the five states identified from vbFRET follow the trend only if the first two states are combined together. As such, the results from HaMMy are adapted for further analysis in this case. (a) Fitting probability as a function of number of states with HaMMy (maximum likelihood, ML) and vbFRET (Maximum evidence, ME). Five-state is picked based on the hairpin unfolding model explained in the text. (b) Average FRET efficiencies vs the number of unpaired bases between the donor dye and the acceptor dye predicted by wormlike chain model assuming 0.8, 1.2, and 1.5 nm persistence length of the ssDNA. The curves represent the simulated data (see Experimental Section). The blue dots are the states identified by HaMMy, and the magenta squares are the states identified by vbFRET where the first two points are combined together. The probabilities of each state are obtained from the fitted time trajectories. The time trajectories and the five-state HaMMy result are shown in Figure 9a. The distributions of these states are further extracted from the fitted data (Figure 9b). When the states are assigned to each molecule, some “inactive” molecules turn out to stay only in a single state during the entire observation period. These molecules might stay fixed due to the strong molecule–substrate interaction[25,82,83] and are thus not counted for the statistical distribution of the states to reduce the effect of immobilization. The dwell times of the first state and the last state of each molecule are also excluded from the histogram to remove the edge effect. After these treatment, the probabilities for the five states 0.92, 0.76, 0.57, 0.42, and 0.33 are 0.09, 0.27, 0.33, 0.20, and 0.11, respectively.

Figure 9

(a) Summary of the smFRET time trajectories of TAR–DNA under 6 M urea. The blue line is the binned data, and the red line is the fitted result from HaMMy at five states. (b) FRET distributions of the five states identified by HaMMy.

Free Energy of Each Break Point under 6 M Urea

Equations 6–10 can be used to calculate the free energy change of each Bp that is independent of other breaking points and the microstates, although the overall free energy of all the Bp should be 0. Our calculations produce the probability of the TAR–DNA hairpin staying folded F1 = 0.09, F2 = 0.30, F3 = 0.52, and F4 = 0.65. The equilibrium constant Kclosed, = F/(1 – F) yields Kclosed,1 = 0.10, Kclosed,2 = 0.42, Kclosed,3 = 1.1, and Kclosed,4 = 1.8 for the four Bp. As such, the free energies, ΔG= −RT ln(Kclosed,), for the base pairs in the presence of 6 M urea are 5.6, 2.1, −0.15, and –1.5 kJ mol–1 for the four breaking points at 20 °C with 6 M urea. The order of the free energies is not consistent with the order of the hybridization energy (Supporting Information Figure S2) or the number of hydrogen bonds 14, 10, 13, and 11 for Bp1 to Bp4, respectively, but rather suggests that in addition to the hydrogen binding strength (enthalpy control), the closer a base pair is to the anchoring point at the end loop, the easier it is for it to remain hydrogen bonded (entropy control).

Conclusion

In summary, our experimental observations are consistent with the hypothesis that the bulges are the fraying centers of hairpin folding/unfolding. In addition, we developed an approach to extract the equilibrium constants of the folding/unfolding of each breaking point to estimate its relative stability. Experimentally, we have successfully demonstrated the method to slow down the dynamics and to open more conformational states for the zipped hairpin by using PEG and urea as cosolutes. The quantitative data analysis is consistent with our model; however, our data analysis is based on the assumptions and models, which will certainly affect the results if they are further optimized. In addition, because the noise and the fast transitions blur our binned data, the existing methods have difficulties to fit the states without specifying the number of states. Thus, we are developing new methods hopefully to analyze the data more objectively or even in a model-free fashion. The results of this study support a “fraying and peeling” mechanism for the unfolding and “collapse” mechanism for the folding of DNA hairpins. Our quantitative analysis of the free energy of each breaking point suggests that the stability of the paired region is a function of both the pairing sequence and its distances to the anchoring/loop positions. The method developed in this paper will be very useful to study the mechanism for the inhibition of the HIV’s TAR–DNA transcription with short DNA oligomers or RNA aptamer and for studying other systems, such as TAR–RNA hairpin, with multiple interconverting states that might be otherwise unresolvable.

75 in total

1. Single-pair fluorescence resonance energy transfer on freely diffusing molecules: observation of Förster distance dependence and subpopulations.

Authors: A A Deniz; M Dahan; J R Grunwell; T Ha; A E Faulhaber; D S Chemla; S Weiss; P G Schultz
Journal: Proc Natl Acad Sci U S A Date: 1999-03-30 Impact factor: 11.205

2. Nanomechanical measurements of the sequence-dependent folding landscapes of single nucleic acid hairpins.

Authors: Michael T Woodside; William M Behnke-Parks; Kevan Larizadeh; Kevin Travers; Daniel Herschlag; Steven M Block
Journal: Proc Natl Acad Sci U S A Date: 2006-04-10 Impact factor: 11.205

3. Monitoring single-stranded DNA secondary structure formation by determining the topological state of DNA catenanes.

Authors: Xingguo Liang; Heiko Kuhn; Maxim D Frank-Kamenetskii
Journal: Biophys J Date: 2006-02-03 Impact factor: 4.033

4. COCIS: Markov processes in single molecule fluorescence.

Authors: David S Talaga
Journal: Curr Opin Colloid Interface Sci Date: 2007-12-01 Impact factor: 6.448

5. Kinetics and dynamics of DNA hybridization.

Authors: Yandong Yin; Xin Sheng Zhao
Journal: Acc Chem Res Date: 2011-06-30 Impact factor: 22.384

6. Orientation dependence in fluorescent energy transfer between Cy3 and Cy5 terminally attached to double-stranded nucleic acids.

Authors: Asif Iqbal; Sinan Arslan; Burak Okumus; Timothy J Wilson; Gerard Giraud; David G Norman; Taekjip Ha; David M J Lilley
Journal: Proc Natl Acad Sci U S A Date: 2008-08-01 Impact factor: 11.205

7. Effects of osmolytes on RNA secondary and tertiary structure stabilities and RNA-Mg2+ interactions.

Authors: Dominic Lambert; David E Draper
Journal: J Mol Biol Date: 2007-05-05 Impact factor: 5.469

8. Quantifying functional group interactions that determine urea effects on nucleic acid helix formation.

Authors: Emily J Guinn; Jeffrey J Schwinefus; Hyo Keun Cha; Joseph L McDevitt; Wolf E Merker; Ryan Ritzer; Gregory W Muth; Samuel W Engelsgjerd; Kathryn E Mangold; Perry J Thompson; Michael J Kerins; M Thomas Record
Journal: J Am Chem Soc Date: 2013-04-03 Impact factor: 15.419