Repetitive 5'GGXGG DNA segments abound in, or near, regulatory regions of the genome and may form unusual structures called G-quadruplexes. Using NMR spectroscopy, we demonstrate that a family of 5'GCGGXGGY sequences adopts a folding topology containing double-chain reversals. The topology is composed of two bistranded quadruplex monomeric units linked by formation of G:C:G:C tetrads. We provide a complete thermodynamic and kinetic analysis of 13 different sequences using absorbance spectroscopy and DSC, and compare their kinetics with a canonical tetrameric parallel-stranded quadruplex formed by TG4T. We demonstrate large differences (up to 10(5)-fold) in the association constants of these quadruplexes depending on primary sequence; the fastest samples exhibiting association rate equal or higher than the canonical TG4T quadruplex. In contrast, all sequences studied here unfold at a lower temperature than this quadruplex. Some sequences have thermodynamic stability comparable to the canonical TG4T tetramolecular quadruplex, but with faster association and dissociation. Sequence effects on the dissociation processes are discussed in light of structural data.
Repetitive 5'GGXGG DNA segments abound in, or near, regulatory regions of the genome and may form unusual structures called G-quadruplexes. Using NMR spectroscopy, we demonstrate that a family of 5'GCGGXGGY sequences adopts a folding topology containing double-chain reversals. The topology is composed of two bistranded quadruplex monomeric units linked by formation of G:C:G:C tetrads. We provide a complete thermodynamic and kinetic analysis of 13 different sequences using absorbance spectroscopy and DSC, and compare their kinetics with a canonical tetrameric parallel-stranded quadruplex formed by TG4T. We demonstrate large differences (up to 10(5)-fold) in the association constants of these quadruplexes depending on primary sequence; the fastest samples exhibiting association rate equal or higher than the canonical TG4T quadruplex. In contrast, all sequences studied here unfold at a lower temperature than this quadruplex. Some sequences have thermodynamic stability comparable to the canonical TG4T tetramolecular quadruplex, but with faster association and dissociation. Sequence effects on the dissociation processes are discussed in light of structural data.
Guanine-rich regions abound in the human genome; they have the propensity to fold into higher order DNA structures such as quadruplexes (1,2). These architectures result from the hydrophobic stacking of several quartets (3–5) and may have applications in areas ranging from supramolecular chemistry to medicinal chemistry [for a recent review see (6)]. Rules have been proposed to describe the properties of simple short segments that do not form loops such as T2G4T2 (7). However, G-quadruplexes are in fact highly polymorphic (8,9). This polymorphic nature is in part due to formation of double-chain reversals loops, which span two or more contiguous tetrads resulting in parallel stranded architectures. Formation, stabilization, dynamic behavior as well as structure–function and structure–stability relationships of these quadruplexes are of current interest (10–15). Recent reports established that d-5′GCGGGGAT (16) and d-5′GCGGGGAT (17) adopt a topology in which two bistranded quadruplex monomeric units are interlocked by formation of four-stranded G:C:G:C tetrads. A schematic representation of the four-stranded topology as well as core hydrogen bond alignments of the stem quadruplex are shown in Figure 1A–C.
Figure 1
Folding topology and pairing schemes. (A) Scheme of the general folding topology of the four-stranded bimolecular quadruplexes. The backbone tracing of the individual strands is shown by thick lines and non-guanine bases are depicted in black. All bases of the stem quadruplex adopt anti alignments. Pairing alignments for the two-stranded G:G:G:G tetrads (B) and the four-stranded G1:C2:G1:C2 tetrad (C) are also shown. The T:T mismatch present in the sequences , and (D), the two-stranded hydrogen bonding alignments for A5(G3:G6:G3:G6):A5 in , (E), T6(G3:G7:G3:G7):T6 hexad (F) and T5:A9:A9:T5 tetrad (G) in are depicted.
In this article, we demonstrate that the same global folding pattern may apply to a rather large sequence family 5′GCGGGG (where X and Y may correspond to one or two bases). We first provide a structural analysis of the GCGGGG motif, then analyze the contribution of various base substitutions on the kinetics and thermodynamics affecting the global architecture of these quadruplexes in view of structural data.
MATERIALS AND METHODS
Nomenclature, synthesis and purification of oligonucleotide sequences
Oligodeoxynucleotides have the general sequence 5′-GCGGXGGY-3′ where X = T, U, TT, UC, TC or A and Y = AT, AU, T or A. (U is a deoxyuracyl; all sugars are deoxyriboses). All oligonucleotides with X = T or U belong to the serie ‘1’ and are labeled 1, 1B, 1UA and 1UB, depending on the nature of the 3′ base(s) (Table 1). All oligonucleotides with X = TT or UU belong to the serie ‘3’ and are labelled , 3A and 3AU. Oligonucleotides with a ‘TC’ core are labeled , , , and . A single oligonucleotide (2) with a central ‘A’ is tested here. Sequences were synthesized by Eurogentec (Seraing, Belgium) or as described previously (17). Concentrations of all oligodeoxynucleotides were estimated using published sequence-dependent extinction coefficients (18); values are provided in Supplementary Table S3.
Table 1
Oligonucleotides
Name (5′→3′)
Sequence
kona (M−3 s−1)
kon(rel)b
Eonc (kcal.M−1)
T1/2d (°C) (Na+)
Gele
ΔHcal°(kcal M−1)f
TG4T
TGGGGT
1.8 × 108
1
−29 ± 2
57.4
+
−72
1
GCGGTGGAT
1.3 × 109
7
−31 ± 3
43.0
+
−96 ± 5
1B
GCGGTGGT
5.8 ×107
0.3
nd
36.0
nd
−101 ± 5
1UA
GCGGUGGAT
1.1 ×109
6
−20 ± 2
48.9
+
−94 ± 5
1UB
GCGGTGGAU
1.1 ×109
6
−26 ± 2
41.5
+
nd
2
GCGGAGGAT
2.6 ×105
0.0014
+36 ± 7
37
+/−
−61 ± 12
3
GCGGTTGGAT
1.7 ×108
0.9
−29 ± 5
33.6
+
−103 ± 5
3A
GCGGTTGGA
1.3 ×108
0.7
+6 ± 6
31.6
+
nd
3U
GCGGUUGGAT
3.6 ×108
2
−26 ± 6
37.2
+
−96 ± 5
4
GCGGTCGGAT
2.8 ×105
0.0016
+42 ± 4
30.5
+/−
nd
4U
GCGGUCGGAT
1.1 ×106
0.006
+32 ± 7
31.8
+/−
nd
4AU
GCGGUCGGA
1.5 ×105
0.0008
nd
31.8
+/−
nd
4A
GCGGTCGGA
<105
<10−4
nd
≈24
−
nd
4B
GCGGTCGGT
<105
<10−4
nd
≈22
−
nd
nd, not done.
aAssociation rate constant at 8°C, pH 7, with 0.11 M Na+, in M−3 s−1. kon is given with a ± 30% accuracy or better.
bRelative association rate constant at 8°C, as compared with TG4T.
cActivation energy of association (±SD). As the dissociation temperature found for all oligonucleotides was lower than for TG4T, it was difficult to measure kon values above 20°C in a number of cases; hence Eon values derive from association rates on a relatively narrow T-range.
d(Non-equilibrium) melting temperature of the preformed quadruplex, in (°C), in 0.11 M Na+, determined with a temperature gradient of 0.45°C/min. T1/2 is provided with a 0.5°C accuracy or better.
eEvidence for quadruplex formation on a non-denaturing gel. +, single retarded band when incubated in 0.11 M Na+. +/−, smear (dissociation of the complex in the gel; migration at low temperature mandatory to observe the complex) or trace amounts. −, no retarded band, even after a 72 h incubation at 4 or 17°C at 500 µM strand concentration).
fModel-independent enthalpies determined by DSC.
Absorbance measurements
Absorbance was recorded as described previously (19). Most melting curves recorded by heating a preformed quadruplex do not correspond to equilibrium melting curves (hysteresis phenomenon) and the ‘Tm’ deduced from these experiments depends on the heating rate. In order to distinguish it from the true thermodynamic Tm, we will call this value T1/2.
Differential scanning calorimetry
Microcalorimetry experiments were performed with a Nano DSC-II microcalorimeter (CSC) as described previously (20).
NMR spectroscopy
All experiments were performed on Varian Unity Inova NMR 500 and 600 MHz spectrometers. The datasets were processed using Varian software (VNMR) and the real matrices were transformed to the FELIX (Accelrys, Inc., San Diego, CA) format on a Silicon Graphics (Mountain View, CA) Octane workstation. Typically, assignments were based on homonuclear NOESY (21), DQF-COSY (22), TOCSY (23), and heteronuclear (1H-13C)HSQC (24), (1H-15N)HSQC (25) and (1H,13C) JR-HMBC (26) experiments. The final concentration of DNA in the sample was about 6 mM (∼1.5 mM per strand).
RESULTS
Topological architecture
Determination of the 3D solution structure of all sequences presented in Table 1 is beyond the scope of this study. Nevertheless, before comparing the kinetics and thermodynamics of these oligomers, we deemed necessary to establish that these sequences have the same global architecture. Quadruplex formation was thus analyzed by PAGE and revealed by an anomalously slow migrating band as compared with the migration pattern of the same ‘single-stranded’ oligomer (Supplementary Figure S1). Complete or near complete conversion into a lower mobility band was obtained with most sequences. However, in the case of and its variants very high concentrations (mM range) were required to obtain quadruplex formation. Furthermore, the isothermal thermal difference spectra of all these structures (Supplementary Figure S2) were in agreement with the formation of quadruplexes (27,28). However, the actual shape of these spectra was not strictly identical to that of a canonical G-quadruplex (19), indicating also the formation of other base folding patterns (see below).The folding topology is based on studies on two of these sequences ( and (16,17). The exchangeable proton spectral regions for and point to stable structures at 0°C. For these sequences the number of resonances in the imino region coincides with the number of guanines in the respective sequence, consistent with symmetry related multistranded architectures. The topology was primarily established through observation of analogous connectivities in the region connecting aromatic protons to H1′ in NOESY spectra (Supplementary Figures S3and S4). Also, typical dipolar connectivities connecting the imino and amino exchangeable protons to the aromatic were consistent with formation of pseudo-planar elements. For example, in the dipolar connectivities between the imino of G6, G6H1 and G3H8 (peak d) and between G3H1 and G6H8 (peak e) protons are compatible with G3:G6:G3:G6 tetrad formation. The hydrogen bond alignment of the pseudo-planar architecture is further demonstrated through dipolar connectivities between G6NH2 and G3H8 (peak l) and G3NH2 to G6H8 (peak j). In addition, we observe an NOE between the G3NH2 and A5H8 protons (peak k), consistent with recognition of the minor groove edge of G3 by the Watson–Crick edge of A5 resulting in the A5(G3:G6:G3:G6):A5 hexad alignment (Figure 1E). Accordingly, the amino protons of A5 resonate at 10.37 and 9.30 p.p.m. (Supplementary Figure S5A). These broadened amino protons are downfield shifted relative to their counterparts in Watson–Crick A:T pairs, which generally resonate between 7.5 and 8.0 p.p.m.Analysis of through bond and through space correlations in D2O buffer solution resulted in assignments for base and sugar protons of the quadruplex. In a plot of a NOESY collected with mixing time of 250 ms we can trace the sequential connectivities between the base and its own and 5′-flanking sugar H1′ protons along individual strands (Figure 2B). For all oligonucleotides, weak intraresidue H8/H6-H1′ cross-peaks were observed indicating anti glycosidic torsion angles (χ).
Figure 2
NOESY for 2, d(GCGGAGGAT). (A) Expanded JR-NOESY spectrum (200 ms) of 2, d(GCGGAGGAT). JR-NOESY spectrum (200 ms) in 100 mM NaCl, 2 mM sodium phosphate buffer at pH 6.7 in 1H2O at 0°C, exhibiting dipolar connectivities between imino and aromatic protons. Peaks a to m are assigned as follows: a, G1H1–C2H6; b, G1H1–C2NH2; c, G6H1–G1H8; d, G6H1–G3H8; e, G3H1–G6H8; f, G4H1–A8H2; g, G4H1–G7H8; h, G7H1–A8H2; i, G7H1–G4H8; j, G3NH2–G6H8; k, G3NH2–A5H8; l, G6NH2–G3H8; m, A5H8–G6H8; n, C2NH2–G1H8. (B) NOESY for 2, d(GCGGAGGAT). An expanded NOESY (250 ms mixing time) spectrum recorded in 2H2O at 20°C correlating the aromatic base proton and sugar H1′ for , d[(GCGGAGGAT)2]2. The lines trace the NOE connectivities between the base protons and their own and 5′-flanking sugar H1′ protons from G1 to T9 in the sequence. Both peaks (b), G6H8–G1H1′, and (d), G1H8–G6H1′, are crucial assignments leading to the identification of the architecture of the quadruplex formed by the sequence d(GCGGGGAT) in Na+ salt solution. The corresponding cross peaks indicating the folding of tetrameric interlocked quadruplex appear in both and . Crosspeak (a) shows the strong NOE between the aromatic H8 and the very lowfield shifted H3′ of adenine A5 indicative of C3′-endo sugar puckering, and cross-peak (c) is sequential aromatic-aromatic.
In Figure 3 we depict the architecture formed by as well as the tridimensional structures of , and . From the top view the overall topology of appears regular cylindrical. However, and deviate from regular toroid due to the accommodation of the residues in the double chain reversal (bottom view). Whilst in the one-residue looping base recognizes the sheared edge of a stem guanine, in the thymine aligns with a quadruplex groove. The more salient backbone of is due to accommodation of a two-residue loop.
Figure 3
Views of the architectures formed by 2, d(GCGGAGGAT); 1, d(GCGGTGGAT); and 3, d(GCGGTTGGAT). On top views from the top through the quadruplex stem axis depicting deformation of the DNA toroid. On the bottom corresponding views across the stem axis. The model for d(GCGGGGAT) was derived from substitution of T5 for A5 in the structure formed by : d(GCGGGGAT) in 0.1 M NaCl; PDB ID, 1NYD (16), followed by energy minimizations under Insight II (Accelrys Inc.). The Hoogsteen edge of A5 was brought to hydrogen bonding distance, keeping an anti glycosidic bond angle using the Biopolymer module of Insight II (Accelrys).
Association of the isolated strands at low temperature: effects of concentration
Isothermal renaturation experiments were used to study the formation of the quadruplexes; representative examples are provided in Figure 4A and B for sample . We first determined that the low temperature renaturation was minimal for the DNA sample at concentrations below 20 µM. In contrast, formation of the quadruplex at 3°C was nearly complete in 2 h at 80 µM (Figure 4B), demonstrating that concentration played a dramatic role in the kinetics of association. It was possible to fit the experimental profiles for the determination of the order n of the reaction. Data could be fitted with 3.3 ≤ n ≤ 3.9; we defined n = 4 for all further studies. These fits (dotted lines) are in nearly-perfect agreement with the experimental points. Moreover, the kon values determined from the curves at three different concentrations or at two different wavelengths (240 and 295 nm) were in excellent agreement [as seen in a dual wavelength parametric test (29); Supplementary Figure S2]. The association rate constant at 8°C was 1.3 × 109 M−3.s−1 for (Table 1), i.e. seven times faster than TG4T that we used as a reference (19).
Figure 4
Absorbance data on association. (A) Experimental data at two wavelengths (240 and 295 nm) and fitted functions (solid lines). oligonucleotide at 60 µM strand concentration in a 0.11 M Na+, pH 7.2 buffer at 3°C. The fitted kon values were 1.62 × 109 and 1.81 × 109 M−3.s−1 at 240 and 295 nm, respectively. (B) Effect of strand concentration (10–80 µM) for at 3°C in 0.11 M Na+ on the relative absorbance at 295 nm. Quadruplex formation leads to an increase in absorbance at this wavelength. (C) Effect of temperature on association for at 3°C (blue), 8°C (orange) and 12°C (red) in 0.11 M Na+ on the absorbance at 240 nm. Quadruplex formation leads to a decrease in absorbance at this wavelength. (D) Arrhenius plots for the association of (circles), (triangles), (squares), (closed squares) and TG4T (squares; dotted line) oligonucleotides. (E) Effect of pH on association. kon is plotted versus pH for , d(GCGGGGAT, closed circles) and dTG4T (red symbols). Solid and dotted lines, exponential fits (note the Y-axis log scale) for the data points. (F) Effect of ionic strength on association. kon is plotted versus [Na+] for , d(GCGGGGAT, closed circles) and dTG4T (red symbols). These experiments were performed in a 10 mM lithium cacodylate pH 7.2 buffer with various concentrations of NaCl.
Sequence and temperature effects on association
Next, we performed the same renaturation experiment at various temperatures. As shown in Figure 4C (for ) and Supplementary Figure S6A (for ), an increase in temperature has a deleterious impact on the kinetics of association: at a given strand concentration, folding was slower at 12°C than at 3°C. The association rates are sumed up in Figure 4D and Supplementary Figures S6B–D (Arrhenius plots). At low temperature (<10°C), sample folded 105 times faster than . At 21°C, this difference was less pronounced, because of a different temperature dependency of kon. The oligonucleotides could be subdivided into two distinct families:and (and their variants , , and ) have a high association rate, especially at low temperature. Their association constant is either similar or higher (for , and ) than the kon of TG4T (Table 1). As most sequences have comparable negative activation energies of association (Eon), they have relatively similar temperature dependencies for kon., , and have a low association rate, especially at low temperature. At 8°C, their association constant is roughly 1000-fold slower than the kon of TG4T (Table 1). The most striking difference with the first category is their opposite temperature dependencies: they have comparable positive activation energies of association (Eon). This puzzling behavior could be the result of the formation of competitive structure(s) (Figure 6C and Supplementary Data).From the data shown in Table 1, one may rank the various sequences in decreasing order of kon:> = TG4T > > . A central T→U substitution generally leads to a faster association; it is clear that folds significantly faster than , demonstrating that the central TT→UU substitution accelerates the association process. A similar phenomenum was found for as compared with , demonstrating that the central TC→UC substitution also accelerates the association process.
Dissociation of the preformed quadruplexes: sequence effects
Starting from preformed quadruplexes (several days at 0°C and high strand concentration 100–1000 µM), one can simply follow the denaturation of this structure by recording the absorbance at 240 or 295 nm (19,20) (Figure 5A). The curve of the hypochromic effect produced upon heating appears to be that of a typical cooperative melting transition. However, this melting transition does not represent an equilibrium denaturation process, for upon cooling, renaturation of the DNA quadruplex is very slow (Figure 5B). Furthermore, this apparent melting temperature does not depend on oligonucleotide concentration but strongly depends on the rate of heating (Figure 5C and Supplementary Figure S7; a 10–20°C difference in T1/2 may be found between the fastest and the slowest gradient), again indicating that this profile does not correspond to an equilibrium curve but solely reflects the dissociation of the quadruplex. In this temperature range, what we see in practice is a simple one-way reaction from a folded quadruplex to a dissociated state: differences in T1/2 reflect differences in thermal lability (19). We were therefore able to extract dissociation rate constant (koff) values in the 20–50°C temperature range. These values were plotted for and against 1/T and compared with the values found for TG4T (Figure 5D). A quasi linear relation between ln(koff) and 1/T was found, corresponding to activation energies of dissociation (Eoff) of 62 ± 3 and 65 ± 4 kcal/mol for and , respectively, as compared with 43 ± 1 kcal/mol for TG4T. At all temperatures above 20°C, and have higher dissociation rate constants than the canonical quadruplex: the quadruplex lifetimes at 37°C of and were 16 and 400 times shorter than the lifetime of the TG4T tetramer, respectively. All other oligonucleotides are significantly less stable than the TG4T quadruplex as shown by a lower T1/2 (Table 1). Within each family, one may find interesting results concerning thymine to uracyl substitution. When a central thymine is replaced by an uracyl (T→U for , TT→UU for and TC→UC for ) a significant increase in T1/2 is found (+5.9, 3.6 and 1.3°C, respectively). We found the following stability order TG4T > > > > . This ranking is therefore different from the one found on association.
Figure 5
Absorbance data on thermal stability. (A) Melting profile of oligonucleotide in 0.11 M Na+. Absorbance at 295 nm (crosses) or 240 nm (circles) is recorded every 4 min with a thermal gradient of ∼0.09°C/min in a pH 7.0, 10 mM sodium cacodylate buffer supplemented with 0.1 M NaCl. (B) Hysteresis in the denaturation/renaturation process of in 0.11 M Na+ at 20 µM strand concentration (temperature gradient 0.278°C/min). Arrows indicate directions of temperature changes. At this strand concentration, once melted, the tetramolecular quadruplex does not refold upon cooling, but a very slow renaturation is observed at 3°C after the cooling experiment. (C) T1/2 of the (circles), (triangles) (squares) and (diamonds) quadruplexes as a function of the average temperature gradient. Note that the ‘true’ equilibrium Tm, which would be obtained with an infinitely slow gradient cannot be approached by this method, only an upper limit may be determined. (D) Arrhenius plots for the dissociation of (black symbols), (red symbols) and TG4T (squares; dotted line) quadruplexes. Different symbols (squares, crosses and so on) correspond to independent melting profiles recorded at 240 or 295 nm using different temperature gradients. (E) Effect of pH on T1/2 for , d(GCGGGGAT, closed circles) and dTG4T (red symbols). Values determined for a heating temperature gradient of 0.48°C/min. (F) Effect of ionic strength on T1/2 for , d(GCGGGGAT, circles) and dTG4T (red symbols). Values determined for a heating temperature gradient of 0.48°C/min. These experiments were performed in a 10 mM lithium cacodylate, pH 7.2, buffer with various concentrations of NaCl.
Ionic strength and pH effects
All profiles shown before were obtained in a pH 7.2 buffer containing Na+ ions at 110 mM. We investigated the effects of changing buffer conditions on the quadruplex GCGGGGAT. kon and T1/2 were only weakly dependent on pH in the 6.0–7.8 range (Figure 4E for association, Figure 5E for thermal stability). Changing the pH from 7.8–6.0 led to a 1.2- and 2-fold increase in association for TG4T and , respectively. A similar pH change had no effect on the thermal stability of (T1/2 = 42.5 at pH 6.0 and 7.8). Various concentrations of sodium were also tested. Increasing ionic strength played a significant role in the thermal dissociation of the quadruplex (Figure 5F; the higher the salt concentration, the higher the T1/2). Varying the sodium concentration had an even more dramatic effect on the association process (Figure 4F). Raising the NaCl concentration from 60 to 210 mM leads to a 6°C increase in T1/2 and 300-fold increase in the association constant. For both parameters (kon and T1/2), one may note that the effect is more pronounced on GCGGGGAT than on TG4T: in the latter case, the same increase in NaCl concentration leads to a 35-fold faster association process.
Thermodynamics of the 1–4 quadruplexes
Qualitatively, from the data on association ( ≈ > > > ) and dissociation ( > > > > ), one may easily deduce that forms the thermodynamically most stable complex, immediately followed by . It is interesting to compare the thermodynamics of with the canonical TG4T tetramolecular parallel quadruplex. At 37°C, the quadruplex formed by unfolds 16 times faster than the TG4T tetramer. On the other hand, folds approximately seven times faster than TG4T. As a result, one can deduce that both quadruplexes have relatively similar ΔG° at 37°C, with a 2-fold difference in the association equilibrium constants Ka = kon/koff. However, one should note that this calculation is valid only at 37°C (these structures have different enthalpies), and that experimental points for association and dissociation of and TG4T are not determined in the same temperature range (for , 2–21°C for association, 25–50°C for dissociation). Fortunately, a reasonable linear fit may be obtained for most association and dissociation rate constants (Figures 4D and 5D): their slopes allow the determination of the association activation energies (Eon, Table 1 and Eoff) with some confidence.It is more difficult to rank the other sequences, as contrasting effects were found on association and dissociation. and are thermodynamically less stable than and TG4T at all temperatures, but the difference in Ka is more pronounced at low temperatures. For example, at 8°C, there is a 104–105 difference in Ka between and (which results both from fastest dissociation and much slower association of the former). At 20°C, there is a ≈300-fold difference only, as a result of opposite effects of temperature on kon for the two sequences.In order to better understand the thermodynamics of the quadruplexes, we performed DSC experiments on several different sequences. This technique allows us to evaluate the energetics of G-quadruplex and compare the impact of base substitution on the enthalpy of formation. Examples of heating/cooling experiments are presented in Figure 6. Similar to the UV-melting experiments, a large hysteresis is obtained. This phenomenon is not unexpected: at a typical scan rate of 1°C/min, reformation of the quadruplex upon cooling occurs at a lower temperature. The profile for (GCGGGGAT) is even more complex (Figure 6C); one may see two distinct transitions when heating a preformed sample (several days at 4°C). The area under the first melting curve allows a model-independent determination of ΔH° for these sequences. Assuming 97 and 91% quadruplex formation before the onset of the first DSC run for 1 and 3, respectively (based on the kon values found in Table 1), the enthalpy values found for and quadruplexes were −96 and −103 kcal/mol quadruplex, respectively. It is interesting to compare these numbers with the model-dependent ΔH° values deduced from the thermokinetic analysis (ΔH° = Eon − Eoff). The values found for and quadruplexes were −93 ± 6 and −94 ± 9 kcal/mol, respectively, in excellent agreement with the model-independent DSC values. The formation of and quadruplexes are therefore strongly enthalpy-driven, with an even more negative ΔH° than TG4T (72 kcal/mol under identical conditions). This result is hardly surprising as and quadruplexes not only involve the formation of 4 G-quartets (asTG4T) but also 2 (G:C:G:C) quartets. DSC measurements also allow us to rank the samples according to their enthalpy:
Figure 6
DSC analysis. DSC raw data of heating (solid line) and cooling (dotted line) (1°C/min) for (A) , d(GCGGGGAT) 357 µM, (B) , d(GCGGGGAT) 382 µM, (C) , d(GCGGGGAT) 317 µM. In (B and C), two heating profiles are presented (first heating, solid line; second heating, broken line). They illustrate partial, but not complete, reformation of the quadruplex after a first heating/cooling cycle [for (B)] and formation of a competitive structure (C).
DISCUSSION
We have presented evidence suggesting that the global folding pattern established for GCGGGGAT (1) and GCGGGGAT (3) may indeed be extended to a large family of sequences GCGGGG. We analyzed the contribution of various base substitutions within the double chain reversal () on the kinetics and thermodynamics of these quadruplexes.
Architecture
We have demonstrated that the sequences GCGGGGAT and GCGGGGT/A fold into a tetramolecular quadruplex that results in an interlocked bimolecular topology as established for (16) and (17). The quadruplexes formed result from anti-parallel strands at their four-stranded core, and parallel strands in two symmetry-related bistranded regions. The parallel strandedness observed for the bistranded regions of the stem quadruplex results from double chain reversals. Indeed, all guanines in the stem quadruplex structure for , , and show anti glycosidic bond angles. In contrast to T5 in , for each single strand of , the Hoogsteen edge of A5 recognizes the sheared edge of G3 thus forming a single-stranded triad, G3(A5–G6). The involvement of G3 and G6 in formation of a tetrad leads to an overall bistranded hexad alignment: A5(G3:G6:G3:G6)A5. Double chain reversals containing this particular structural motif are now well established in quadruplexes (30–33). We have also shown that this topology can accommodate more than one base in the double chain reversal. Indeed, substitution of a single base (X = T or A) for two bases (TT or TC) in the GCGGGGA segment does also result in four-stranded quadruplex structures.
Sequence and temperature effects on association, dissociation and energetics
We report a complete analysis of the kinetic properties of a relatively large family of sequences (13 different oligonucleotides) coupled with a structural study for some of its members. The kinetics were compared with the parallel-stranded tetrameric quadruplex formed by TG4T which was used as a reference. Calorimetry measurements allowed us to determine a model independent enthalpy of quadruplex formation (which was strongly negative) and evaluate the impact of each modification on the energetics of the structure. Overall for sequences , and their variants, modifications did not strongly affect the enthalpy of formation. In contrast, a T→A substitution siginificantly destabilizes the structure (ΔH° drops to −60 kcal/mol; Table 1). It is important to note that the stabilities derived from the analysis of thermal denaturation curves alone do not take into account the very large differences in association rate constants (up to 105-fold) found between the different oligonucleotides. Although many oligomers adopt relatively similar conformations, the kinetics of these complexes may vary greatly.The differences in the (non-equilibrium) melting temperatures (T1/2) are a direct measure of the contribution of the stabilized structural elements within the structural environment of the double chain reversal. Thus, in general the series is more thermally stable than , and the latter more stable than the series. The greater stability of over in a single residue loop may be explained from a combination of factors, such as backbone torsions of the double chain reversal and hydrophobic/hydrophilic interactions. The deviations from canonical backbone dihedral angle conformations were more numerous and severe in than in (17). Furthermore, optimum hydrophobic interactions of the methyl groups of T5 oriented towards the quadruplex stem (its solvent exposed Watson–Crick edge) contrast to hydrophilic Hoogsteen edge of A5 oriented towards the quadruplex stem coupled with a solvent exposed minor groove edge. Recognition of the Hoogsteen edge of a tetrad-forming G3 by A5 would contribute little to the stability of the local architecture, since hydrophobic interactions are usually of greater significance. Finally, the sequence has a lower apparent melting temperature (T1/2) than . In the latter there is contribution to stability by base stacking interactions of the A:A mismatch (hydrophobic interactions), which is absent in .In extending to a two residue loop the thymines in do not loop out. All thymines in this structure are well structured and make significant hydrogen bond interactions with the stem. This is clearly supported by the observation of the exchangeable H3 proton for all thymines (pH 6.8, 0°C), as well as the observation of numerous NOE contacts of differing contribution to distance. Indeed, T5 and T6 are part of newly discovered planar architectures: a T5:A9:A9:T5 tetrad and a T6:(G3:G7:G3:G7):T6 hexad alignments. The observation of the T:A:A:T tetrad is documented by various NOE contacts of differing contribution to distance. For example, the adenine A9 shows cross-peaks A9H62–G8H1, T10H3–A9H62, T10H3–A9H61, T10H3–A9H2 clearly indicating stacking interactions with G8 and T10. In the first T5 does not form intrastrand base-stacking interactions. Thus it does not contribute significantly to the stability of the stem. Nevertheless, the appearance of T5H3 lower field shifted below 11 p.p.m. clearly indicates that it falls in a deshielding region of an isotropic aromatic field. This is consistent with the T5H3 proton positioning itself in a fairly narrow region. Compared with the backbone dihedral angles of and , in there are no remarkable torsions. However, the methyl group of T6 is solvent exposed. This is likely to be the major contributor for its destabilization in comparison to and .Corroborating structural conclusions with the kinetics of association is difficult, as differences in the association rate constants not only result from intrinsic propensity for folding of A, T, TT or TC base(s) within the GpG steps, but also from their propensity to favor competing architectures. It is relatively difficult to analyze these competing structures in detail, as they are relatively unstable (Tm below 20°C). This coupled equilibrium reduces to a variable extent the available free strand concentration for quadruplex formation, and therefore affects the apparent association rate constant of the quadruplex. Working around room temperature or above somewhat abrogates this contribution. A comparison of the kon at 18–20°C indicates that the fastest-folding oligomers possess a central T base (as in ), whereas a A-loop slows the association and the two-base loops (TC, TT, UC) fall in between. The destabilizing effect of the purine (adenine) as compared with one pyrimidine (thymine) could actually result from at least two effects: (i) stabilization of a single-stranded or double-stranded structure that inhibits quadruplex formation, as for sequence (Figure 6C) and (ii) a structural distorsion dominated by steric effects. This is further exacerbated in the folding of two-base loops. We did not observe a biphasic transition in the case of the sequence with a central adenine (, data not shown) but DSC data suggests that this adenine per se has a strong detrimental effect on the enthalpy of the quadruplex (Table 1).
Biological relevance and implications for other sequences
Repetitive GGGG segments show propensity to fold into quadruplexes bearing loops linking anti-parallel strands or double chain reversals resulting in parallel stranded quadruplexes (11,14–17,31,33–37). These sequence motifs are found in several important regions of the genome, especially in promoters (38), enhancers (39), locus control regions (40) and AGG islands present in fragile-X syndrome CGG repeats (41–43). Higher order structures with FMR1 sequence variants could be involved in the blockage of DNA processing in fragile X (44). Formation of a quadruplex may therefore play a role in the observed expansion in fragile X loci. Note, however, that all the results presented here were obtained in a sodium-containing buffer, in contrast with intracellular conditions, where potassium ions predominates. We are currently performing NMR experiments in a potassium-containing buffer. Preliminary results suggest that, at high DNA strand concentrations (∼1 mM) , and adopt a different topology we are currently investigating.Our systematic approach gives insights into the propensity and stability of formation of double chain reversals for A, T, TT, and TC between two stacked tetrads. This study can be extended to three stacked tetrads if a suitable model system is developed and include a greater number and permutations of bases. The gained knowledge thus contributes to a better understanding of the propensity for transiently folding, dynamics and stability to be observed in double-chain reversals appearing in intramolecular quadruplexes folded in segments appearing in intronic regions, promoter regions, in, or near repressor and enhancer genetic sequences.
CONCLUSIONS
Moderately-rich guanine sequences (5 guanines for 8–10 base long oligomers) may form relatively stable tetramolecular quadruplexes, even if they involve ‘blocks’ of only two contiguous guanines. These quadruplexes are structurally diverse and involve a number of unusual pairing patterns such as hexads (Figure 1E–G), further extending the sequence repertoire of DNA quadruplexes. Some of these ‘non-canonical’ quadruplexes may actually form faster than the TG4T reference sequence. We have demonstrated within the same topology that DNA sequence dramatically influences propensity for adopting double chain reversal loops. We have also probed modes of stabilization of bases within the double chain reversal loop, thus providing a structural basis for differential dissociation.
Authors: A J Verkerk; M Pieretti; J S Sutcliffe; Y H Fu; D P Kuhl; A Pizzuti; O Reiner; S Richards; M F Victoria; F P Zhang Journal: Cell Date: 1991-05-31 Impact factor: 41.582
Authors: Sofia Kolesnikova; Martin Hubálek; Lucie Bednárová; Josef Cvacka; Edward A Curtis Journal: Nucleic Acids Res Date: 2017-09-06 Impact factor: 16.971
Authors: Shang-Te Danny Hsu; Peter Varnai; Anthony Bugaut; Anthony P Reszka; Stephen Neidle; Shankar Balasubramanian Journal: J Am Chem Soc Date: 2009-09-23 Impact factor: 15.419