We compared here 80 different sequences containing four tracts of three guanines with loops of variable length (between 1 and 15 bases for unmodified sequences, up to 30 for fluorescently labeled oligonucleotides). All sequences were capable of forming stable quadruplexes, with T(m) above physiological temperature in most cases. Unsurprisingly, the melting temperature was systematically lower in sodium than in potassium but the difference between both ionic conditions varied between 1 and >39°C (average difference: 18.3°C). Depending on the sequence context, and especially for G4 sequences involving two very short loops, the third one may be very long without compromising the stability of the quadruplex. A strong inverse correlation between total loop length and T(m) was found in K(+): each added base leads to a 2°C drop in T(m) or ∼0.3 kcal/mol loss in ΔG°. The trend was less clear in Na(+), with a longer than expected optimal loop length (up to 5 nt). This study will therefore extend the sequence repertoire of quadruplex-prone sequences, arguing for a modification of the widely used consensus (maximal loop size of 7 bases).
We compared here 80 different sequences containing four tracts of three guanines with loops of variable length (between 1 and 15 bases for unmodified sequences, up to 30 for fluorescently labeled oligonucleotides). All sequences were capable of forming stable quadruplexes, with T(m) above physiological temperature in most cases. Unsurprisingly, the melting temperature was systematically lower in sodium than in potassium but the difference between both ionic conditions varied between 1 and >39°C (average difference: 18.3°C). Depending on the sequence context, and especially for G4 sequences involving two very short loops, the third one may be very long without compromising the stability of the quadruplex. A strong inverse correlation between total loop length and T(m) was found in K(+): each added base leads to a 2°C drop in T(m) or ∼0.3 kcal/mol loss in ΔG°. The trend was less clear in Na(+), with a longer than expected optimal loop length (up to 5 nt). This study will therefore extend the sequence repertoire of quadruplex-prone sequences, arguing for a modification of the widely used consensus (maximal loop size of 7 bases).
G-quadruplexes are a family of nucleic acid secondary structures stabilized by G-quartets that form in the presence of cations (1–5). These structures may be involved in key biological processes and recent articles highlight this notion (6–13). Four (or more) tracks of two or more guanines are required to form an intramolecular structure. Those are connected by three—sometimes four, see ref. (14)—loops of variable size and sequence. Previous works studying intramolecular quadruplexes formed with a central loop of 3–7 bases (15) or more (16) support the important role played by the nature and length of loops in quadruplex stability (15,17–25). To better understand this contribution, we chose to compare model sequences containing four tracts of three guanines. In this article, we investigated sequence variants, involving thymidine-only or thymidine-rich loops. This leads to a general sequence of the type GGGLGGGLGGGLGGG where L1, L2 and L3 may either be T2A or a loop composed solely of 1–30 thymines. Over 80 different sequences were tested; they are presented in Tables 1 and 2 and include 12 fluorescently labeled oligonucleotides for FRET analysis. This study complements our previous report in which we compared sequences of identical loop length (3 nt) but with variable base content (23).
Table 1.
Sequence of the oligonucleotide used for UV melting
(Length) Namea
Sequence (5′→3′)
Tm Na+ (°C)
Tm K+ (°C)
L1
L2
L3
T-var-T
(16) 121
GGG
T
GGG
TT
GGG
T
GGG
(51)
84.0
(17) 131
GGG
T
GGG
TTT
GGG
T
GGG
45.5
81.8
(18) 141
GGG
T
GGG
TTTT
GGG
T
GGG
43.0
76.5
(19) 151
GGG
T
GGG
TTTTT
GGG
T
GGG
41.5
73.5
(20) 161
GGG
T
GGG
TTTTTT
GGG
T
GGG
40.3
70.5
(21) 171
GGG
T
GGG
TTTTTTT
GGG
T
GGG
37.0
68.5
(23) 191
GGG
T
GGG
TTTTTTTTT
GGG
T
GGG
32.0
67.0
(26) 1B1
GGG
T
GGG
TTTTTTTTTTTT
GGG
T
GGG
26.1
63.0
(29) 1E1
GGG
T
GGG
T15
GGG
T
GGG
24.8
62.5
TTT-var-TTT
(19) 313
GGG
TTT
GGG
T
GGG
TTT
GGG
(49.5)
(65.5)
(20) 323
GGG
TTT
GGG
TT
GGG
TTT
GGG
(46.1)
(62)
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(22) 343
GGG
TTT
GGG
TTTT
GGG
TTT
GGG
56.5
66.1
(23) 353
GGG
TTT
GGG
TTTTT
GGG
TTT
GGG
54.5
65.0
(23) 363
GGG
TTT
GGG
TTTTTT
GGG
TTT
GGG
54.5
61.8
(25) 373
GGG
TTT
GGG
TTTTTTT
GGG
TTT
GGG
nd
nd
(27) 393
GGG
TTT
GGG
TTTTTTTTT
GGG
TTT
GGG
45.5
53.9
(30) 3B3
GGG
TTT
GGG
TTTTTTTTTTTT
GGG
TTT
GGG
39.3
49.0
(33) 3E3
GGG
TTT
GGG
T15
GGG
TTT
GGG
36.5
45.0
TTA-var-TTA
(19) H313
GGG
TTA
GGG
T
GGG
TTA
GGG
(45.0)
(62.0)
(20) H323
GGG
TTA
GGG
TT
GGG
TTA
GGG
(54.3)
(61.3)
(21) H333
GGG
TTA
GGG
TTT
GGG
TTA
GGG
56.6
65.3
(22) H343
GGG
TTA
GGG
TTTT
GGG
TTA
GGG
62.5
67.8
(23) H353
GGG
TTA
GGG
TTTTT
GGG
TTA
GGG
61.8
65.5
(24) H363
GGG
TTA
GGG
TTTTTT
GGG
TTA
GGG
60.8
62.3
(25) H373
GGG
TTA
GGG
TTTTTTTT
GGG
TTA
GGG
57.5
59.0
(27) H393
GGG
TTA
GGG
TTTTTTTTTTT
GGG
TTA
GGG
52.8
54.4
(30) H3B3
GGG
TTA
GGG
TTTTTTTTTTTT
GGG
TTA
GGG
46.8
48.0
var-TTT-TTT
(19) 133
GGG
T
GGG
TTT
GGG
TTT
GGG
45.8
67.2
(20) 233
GGG
TT
GGG
TTT
GGG
TTT
GGG
50.7
65.5
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(22) 433
GGG
TTTT
GGG
TTT
GGG
TTT
GGG
44.4
62.0
(23) 533
GGG
TTTTT
GGG
TTT
GGG
TTT
GGG
40.7
58.5
(24) 633
GGG
TTTTTT
GGG
TTT
GGG
TTT
GGG
38.2
56.9
(25) 733
GGG
TTTTTTT
GGG
TTT
GGG
TTT
GGG
36.7
56.3
(27) 933
GGG
TTTTTTTTT
GGG
TTT
GGG
TTT
GGG
33.9
53.3
(30) B33
GGG
TTTTTTTTTTTT
GGG
TTT
GGG
TTT
GGG
31.0
51.3
(33) E33
GGG
T15
GGG
TTT
GGG
TTT
GGG
28.0
49.5
TTT-TTT-var
(19) 331
GGG
TTT
GGG
TTT
GGG
T
GGG
46.2
(68.3)
(20) 332
GGG
TTT
GGG
TTT
GGG
TT
GGG
51.8
(67.3)
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(22) 334
GGG
TTT
GGG
TTT
GGG
TTTT
GGG
44.5
62.3
(23) 335
GGG
TTT
GGG
TTT
GGG
TTTTT
GGG
39.5
59.9
(24) 336
GGG
TTT
GGG
TTT
GGG
TTTTTT
GGG
37.0
57.8
(25) 337
GGG
TTT
GGG
TTT
GGG
TTTTTTT
GGG
35.5
55.8
(27) 339
GGG
TTT
GGG
TTT
GGG
TTTTTTTTT
GGG
30.5
53.3
(30) 33B
GGG
TTT
GGG
TTT
GGG
TTTTTTTTTTTT
GGG
27.8
51.0
(33) 33E
GGG
TTT
GGG
TTT
GGG
T15
GGG
25.5
49.5
var-TTT-var
(17) 131
GGG
T
GGG
TTT
GGG
T
GGG
45.5
81.8
(19) 232
GGG
TT
GGG
TTT
GGG
TT
GGG
53.8
(68.8)
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(23) 434
GGG
TTTT
GGG
TTT
GGG
TTTT
GGG
40.8
59.4
(25) 535
GGG
TTTTT
GGG
TTT
GGG
TTTTT
GGG
34.3
53.7
(27) 636
GGG
TTTTTT
GGG
TTT
GGG
TTTTTT
GGG
31.5
51.0
(29) 737
GGG
TTTTTTT
GGG
TTT
GGG
TTTTTTT
GGG
26.0
46.9
(33) 939
GGG
T9
GGG
TTT
GGG
T9
GGG
18.5
41.5
(39) B3B
GGG
T12
GGG
TTT
GGG
T12
GGG
13.0
33.0b
(45) E3E
GGG
T15
GGG
TTT
GGG
T15
GGG
11.0
31.0b
var-var-TTT
(17) 113
GGG
T
GGG
T
GGG
TTT
GGG
46.2b
82.0
(19) 223
GGG
TT
GGG
TT
GGG
TTT
GGG
46.0
67.5
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(23) 443
GGG
TTTT
GGG
TTTT
GGG
TTT
GGG
51.5
64.5
(25) 553
GGG
TTTTT
GGG
TTTTT
GGG
TTT
GGG
47.3
60.5
(27) 663
GGG
TTTTTT
GGG
TTTTTT
GGG
TTT
GGG
42.7
55.0
(29) 773
GGG
TTTTTTT
GGG
TTTTTTT
GGG
TTT
GGG
37.5
50.0
(33) 993
GGG
T9
GGG
T9
GGG
TTT
GGG
29.0
42.0
TTT-var-var
(17) 311
GGG
TTT
GGG
T
GGG
T
GGG
(42.8)
(82.0)
(19) 322
GGG
TTT
GGG
TT
GGG
TT
GGG
46.0
(67.8)
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(23) 344
GGG
TTT
GGG
TTTT
GGG
TTTT
GGG
50.5
60.8
(25) 355
GGG
TTT
GGG
TTTTT
GGG
TTTTT
GGG
43.5
56.3
(27) 366
GGG
TTT
GGG
TTTTTT
GGG
TTTTTT
GGG
38.0
49.9
(29) 377
GGG
TTT
GGG
TTTTTTT
GGG
TTTTTTT
GGG
32.8
45.0
(33) 399
GGG
TTT
GGG
T9
GGG
T9
GGG
22.0
38.3
var-var-var
(18) 222
GGG
TT
GGG
TT
GGG
TT
GGG
45.5
(73.8)
(21) 333
GGG
TTT
GGG
TTT
GGG
TTT
GGG
51.0
65.3
(24) 444
GGG
TTTT
GGG
TTTT
GGG
TTTT
GGG
45.8
57.5
(27) 555
GGG
TTTTT
GGG
TTTTT
GGG
TTTTT
GGG
35.1
50.5
(30) 666
GGG
TTTTTT
GGG
TTTTTT
GGG
TTTTTT
GGG
27.6
42.3
(33) 777
GGG
T7
GGG
T7
GGG
T7
GGG
18.0
35.9b
(39) 999
GGG
T9
GGG
T9
GGG
T9
GGG
10.0
30.0b
aLength (in nucleotides) within parentheses is followed by the name of the sequence. See ‘Materials and Methods’ section for the definition of the convention used here. Tm values between parentheses correspond to situation in which significant formation of intermolecular complexes is either suspected or demonstrated. Tm within brackets: strong evidence for contributing intermolecular species. bslight hysteresis.
Table 2.
Sequence of the oligonucleotide used for FRET melting
Name
Sequence (5′→ 3′)
Tm Na+ (°C)
Tm K+ (°C)
L1
L2
L3
f131t
TGGG
T
GGG
TTT
GGG
T
GGGT
56.6
75.1
f161t
TGGG
T
GGG
TTTTTT
GGG
T
GGGT
50.5
68.0
f191t
TGGG
T
GGG
TTTTTTTTT
GGG
T
GGGT
50.1
64.8
f1E1t
TGGG
T
GGG
T15
GGG
T
GGGT
47.0
61.6
f1K1t
TGGG
T
GGG
T21
GGG
T
GGGT
45.1
59.9
f1S1t
TGGG
T
GGG
T30
GGG
T
GGGT
–a
–a
f333t
TGGG
TTT
GGG
TTT
GGG
TTT
GGGT
49.2
55.4
f363t
TGGG
TTT
GGG
TTTTTT
GGG
TTT
GGGT
46.9
49.7
f393t
TGGG
TTT
GGG
TTTTTTTTT
GGG
TTT
GGGT
48.4
48.1
f3E3t
TGGG
TTT
GGG
T15
GGG
TTT
GGGT
46.8
46.3
f3K3t
TGGG
TTT
GGG
T21
GGG
TTT
GGGT
47.3
46.0
f3S3t
TGGG
TTT
GGG
T30
GGG
TTT
GGGT
44.0
–a
Lower case letters (f and t) refer to 5′ FAM and 3′ TAMRA, respectively. Note that the presence of the fluorescent labels and different experimental conditions prevent a direct comparison with Tm values found by UV-absorbance experiments in Table 1.
aweird / low quality melting profile.
Sequence of the oligonucleotide used for UV meltingaLength (in nucleotides) within parentheses is followed by the name of the sequence. See ‘Materials and Methods’ section for the definition of the convention used here. Tm values between parentheses correspond to situation in which significant formation of intermolecular complexes is either suspected or demonstrated. Tm within brackets: strong evidence for contributing intermolecular species. bslight hysteresis.Sequence of the oligonucleotide used for FRET meltingLower case letters (f and t) refer to 5′ FAM and 3′ TAMRA, respectively. Note that the presence of the fluorescent labels and different experimental conditions prevent a direct comparison with Tm values found by UV-absorbance experiments in Table 1.aweird / low quality melting profile.
MATERIALS AND METHODS
Nomenclature, synthesis and purification of oligonucleotide sequences
Oligonucleotides were synthesized by Eurogentec (Seraing, Belgium). Concentrations of all oligodeoxynucleotides were estimated using extinction coefficients provided by the manufacturer and calculated with a nearest neighbor model (26).Sequences are given in the 5′- to 3′-direction and loops are numbered according to this convention:In the 5′-GGGLGGGLGGGLGGG-3′ sequence, L1 refers to the 5′-side loop, L3 to the 3′-side loop, while L2 corresponds to the central loop.The three-digit oligonucleotide name indicates loop lengths in L1, L2 and L3, respectively. For example corresponds to the GGGGGGGGGGGG sequence. Bases in the loops are in bold/underlined characters.When loop length was >9 nt, letters were used: = 12, = 15, = 21, = 30. Hence, corresponds to the GGGGGGGGGGGG sequence. For the family of sequences in which the L1 and L3 loops are not solely composed of thymines but of the TTA motif (Table 1), the prefix letter was used to distinguish them from the other sequences.For FRET melting experiments, lower case ‘f’ and ‘t’ letters refer to FAM in 5′ (fluorescent donor for FRET) and tetramethylrhodamine in 3′ (TAMRA, acceptor for FRET), respectively.The different sequence families are defined in Table 1. For example T-var-T correspond to sequences in which the first and last loops are composed of a single thymine while the central loop may vary (‘var’).
Absorbance and circular dichroism measurements
Melting experiments were conducted as described earlier (23,27). Denaturation was followed by recording the absorbance at 240 or 295 nm (28,29). Our reference conditions for this study were 10 mM Lithium cacodylate pH 7.2 supplemented with 100 mM NaCl or KCl, abbreviated hereafter K+ or Na+ conditions. The sequences were tested at 4 µM strand concentration. The intramolecular formation of the G-quadruplexes was evaluated by varying concentration in the 1–25 µM range or by using FRET melting experiments (see below) (30). Circular Dichroism (CD) spectra were recorded on a JASCO-810 spectropolarimeter using a 1-cm path length quartz cuvette as described earlier (27). TDS spectra were obtained by subtracting the absorbance spectra recorded below the observed transition from the absorbance at high temperature (31).
FRET melting
FRET melting experiments were conducted with double-labeled dyes (32) (Table 2) using a MX3000 realtime PCR machine (gain set at 1). Oligonucleotides were generally tested at 0.2 µM strand concentration in 10 mM lithium cacodylate buffer supplemented with 100 mM NaCl or 90 mM LiCl + 10 mM KCl, as for classical FRET melting experiments with ligands (33).
Gel electrophoresis
For some experiments, formation of G4-DNA was confirmed by non-denaturing PAGE (23,27). Prior to the incubation, the DNA samples were denatured at 90°C for 5 min and slowly cooled to room temperature (2 h). Oligonucleotides were then incubated at 30 µM strand concentration in 10 mM Tris–HCl pH 7.5 buffer with 100 mM NaCl or KCl. Ten percent sucrose was added just before loading. The gel was prepared at 12% acrylamide supplemented with 20 mM of the corresponding salt, run at 23°C and revealed by UV-Shadow. One should note that the migration of the dTn oligonucleotides does not necessarily correspond to single-strands (34): these oligonucleotides were simply chosen here to provide an internal migration standard.
RESULTS
Evidence for quadruplex formation
We use a set of well established methods in order to demonstrate that the sequences we are studying are actually G-quadruplex structures.Thermal Differential Spectra (TDS) exhibit the typical pattern of a G-quadruplex structure with two positive maxima at 240 and 275 nm and a negative minimum around 295 nm (28,31,35) (examples shown in Figures 1A–B). These spectra are qualitatively similar and may unambiguously be attributed to G4 structures. CD results are also in agreement with the formation of quadruplexes but different types are observed (36,37) (examples shown in Figures 1C–D). Within a family of sequences, the shortest representatives such as (in KCl and NaCl) and (in KCl only) tend to adopt a ‘Type I’ spectra, with a positive peak around 260 nm. Sequences of intermediate length exhibit a ‘Type II’ or ‘Hybrid’ spectra, clearly indicative of a quadruplex fold, but with a different topology. Finally, the longest sequences and have a relatively ambiguous CD signature which probably results from a significant contribution of two very long loops to the ellipticity. For these sequences, attribution of a quadruplex fold relies solely on the TDS spectra.
Figure 1.
TDS, CD and UV melting experiments. Oligonucleotides from the var-TTT-var series are shown for illustration. Results obtained with the other series are shown as supplementary information (Supplementary Figures S1–S8). The sequences were tested at 4 µM strand concentration. Left panels in 100 mM KCl; Right panels in 100 mM NaCl. Both buffers contained 10 mM lithium cacodylate at pH 7.2. (A–B) Examples of Thermal difference spectra. Thermal difference spectra result from the difference between the absorbance recorded at 88 ± 2°C and at 04 ± 2°C. They were recorded over the 220–335 nm wavelength range. (C–D) Examples of Circular dichroism spectra. CD spectra were recorded at 10−25°C (in K+) or 4–25°C in Na+ (to maximize quadruplex formation in the case of relatively unstable quadruplexes) on a JASCO-810 spectropolarimeter using 1 cm path length quartz cuvettes. Oligonucleotides were prepared as a 4 µM and annealed by heating to 90°C for 2 min, followed by slow cooling. (E–F) Examples of UV melting profiles. Absorbance at 295 nm is plotted as a function of temperature for a selection of sequences.
TDS, CD and UV melting experiments. Oligonucleotides from the var-TTT-var series are shown for illustration. Results obtained with the other series are shown as supplementary information (Supplementary Figures S1–S8). The sequences were tested at 4 µM strand concentration. Left panels in 100 mM KCl; Right panels in 100 mM NaCl. Both buffers contained 10 mM lithium cacodylate at pH 7.2. (A–B) Examples of Thermal difference spectra. Thermal difference spectra result from the difference between the absorbance recorded at 88 ± 2°C and at 04 ± 2°C. They were recorded over the 220–335 nm wavelength range. (C–D) Examples of Circular dichroism spectra. CD spectra were recorded at 10−25°C (in K+) or 4–25°C in Na+ (to maximize quadruplex formation in the case of relatively unstable quadruplexes) on a JASCO-810 spectropolarimeter using 1 cm path length quartz cuvettes. Oligonucleotides were prepared as a 4 µM and annealed by heating to 90°C for 2 min, followed by slow cooling. (E–F) Examples of UV melting profiles. Absorbance at 295 nm is plotted as a function of temperature for a selection of sequences.During absorbance UV-melting experiments, these sequences exhibit an inverted transition at 295 nm (Figure 1E–F). As for most quadruplex-forming sequences studied so far, stability was dependent on the nature of the monocation. As expected, stability of the quadruplex was lower in NaCl than in KCl (Table 1). This difference between sodium and potassium conditions is highly variable, with ΔTm between +1.2 and +39.2°C (average +18.3°C, see below). Finally, non-denaturing gel electrophoresis indicated that most oligomers migrated in agreement with the formation of a predominant intramolecular structure (see below). These results indicate that all sequences studied here form quadruplexes.
Intramolecular versus intermolecular structures
For most of the sequences presented in this study, a concentration-independent melting temperature determined by UV-absorbance (strand concentration range 1–30 µM) confirmed that the folding process was intramolecular (Figure 2). We found evidence for intermolecular association in few cases. Among the examples shown in this figure, one should notice that oligonucleotide exhibited a weak but significant dependency of the Tm on strand concentration (+5°C over a 10× or more concentration range). A few oligonucleotides were also analyzed by FRET melting (Table 2 and data not shown), with a different strand concentration range (from 50 to 1000 nM, signal saturation preventing us from analyzing data at higher strand concentrations). We observed a concentration independent Tm, arguing against the formation of a significant amount of intermolecular structures.
Figure 2.
Effect of concentration on Tm. Tm (in °C) is shown versus strand concentration (log scale) for a few examples. For most sequences, the effect was negligible or modest, in agreement with intramolecular folding. Notable exceptions are (in sodium) or (in potassium) (solid lines; filled circles and triangles, respectively).
Effect of concentration on Tm. Tm (in °C) is shown versus strand concentration (log scale) for a few examples. For most sequences, the effect was negligible or modest, in agreement with intramolecular folding. Notable exceptions are (in sodium) or (in potassium) (solid lines; filled circles and triangles, respectively).Non-denaturing gel electrophoresis allowed us to evidence interesting different behaviors in sodium and potassium. We chose to reveal the gel using UV-shadow, a method that does not require any labeling of any kind and that relies solely on the absorbance of the nucleic acid in the far UV region (254 nm). This absorbance is very weakly dependent on the DNA conformation [10% hypo/hyperchromicity at most (31)]. Hence, band intensity accurately reflects species abundancy, in contrast with all other staining methods we tested (silver or fluorescence staining). The drawback of UV-shadow is its poor sensitivity, forcing us to use relatively high oligonucleotide concentration (30 µM; i.e. 7.5-fold higher than most UV melting experiments). In the examples provided in Figure 3 one may observe that a single band is observed in most sequences, with a migration in agreement with the formation of intramolecular complexes, as judged by molecular size markers such as double-stranded sequences (dx9 and dx12) or oligothymidylate repeats (right lanes). Notable exceptions are the shortest sequences for which a slower migration is observed indicative of a completely different topology or a higher molecularity. For some samples, one may observe several bands likely corresponding to different intramolecular species: in these cases, even though, the Tm still appears as concentration-independent.
Figure 3.
Behavior of the G-quadruplex forming oligonucleotides on a non-denaturing gel. Sequences were compared by non-denaturing electrophoresis and revealed by UV-shadow with 30 µM of oligonucleotide. Samples were incubated in 10 mM Tris–HCl pH 7.5 buffer with 100 mM NaCl (A–B) or 100 mM KCl (C–D). The gel was prepared at 12% acrylamide supplemented with 20 mM of the corresponding salt and run at 23°C. Migration markers are oligothymidylate markers (dT15, dT21 or dT30) or double-stranded markers (dx9: 5′-d-GCGATACGG + 5′-d-CCGATACGC dx12: 5′-d-GCGTGACTTCGG + 5′-d-CCGAAGTCACGC). Oligonucleotide length (in nucleotide) is indicated above each band for comparison purposes.
Behavior of the G-quadruplex forming oligonucleotides on a non-denaturing gel. Sequences were compared by non-denaturing electrophoresis and revealed by UV-shadow with 30 µM of oligonucleotide. Samples were incubated in 10 mM Tris–HCl pH 7.5 buffer with 100 mM NaCl (A–B) or 100 mM KCl (C–D). The gel was prepared at 12% acrylamide supplemented with 20 mM of the corresponding salt and run at 23°C. Migration markers are oligothymidylate markers (dT15, dT21 or dT30) or double-stranded markers (dx9: 5′-d-GCGATACGG + 5′-d-CCGATACGC dx12: 5′-d-GCGTGACTTCGG + 5′-d-CCGAAGTCACGC). Oligonucleotide length (in nucleotide) is indicated above each band for comparison purposes.Alternatively, a single band with a slower migration could be obtained with in sodium (Figure 3A), or and in both ionic conditions (Figure 3B–D). This behavior could be indicative of an unusually slow intramolecular structure or, more likely, an intermolecular complex corresponding to dimers and tetramers of the oligonucleotide. Overall, as shown by the numbers in parentheses in Table 1, intermolecular complexes were often significantly present for sequences with two loops of 1–2 nt. These observations should be kept in mind when considering loop size effects on Tm! Finally some oligonucleotides gave several bands, such as (lane 15) in K+ and Na+, indicating that several conformations or molecularities could coexist.
Analysis of stability
Comparison of melting temperatures of the different oligonucleotides (Table 1) allowed us to reach the following conclusions:As observed earlier (16), and with the precautions mentioned above for very short sequences, Tm generally decreases when the length of a loop length is increased. This effect is striking both in potassium and in sodium (Figure 4) and valid for L1 L2 and L3.
Figure 4.
Sequence effects on Tm. Tm (deduced from UV-melting experiments) is shown as a function of loop size (in nucleotides) both in potassium (left panels) and sodium (right panels).
The conclusions reached for L1 generally apply to L3. Subtle differences may however be observed in Figure 4B and E (compare squares and diamonds). On the other hand, the profile found for the family in which L2 is variable (circles) is different from the two others, both in potassium and in sodium.Length dependent effects are somewhat different in sodium and potassium. This is reflected by the modest correlation (R = 0.69) found between Tm values in sodium and potassium (Figure 5A). Our study reveals that this Tm difference is highly variable (ΔTm between 1 and 39°C!). One can observe on Figure 5B that this difference is often higher for the series with the shortest loops, but this effect might be partially the result of the presence of ‘contaminating’ intermolecular complexes for those short sequences. For each series, the K+–Na+ difference tends to be relatively constant for loops of 7–15 nt, from 1–2°C (for TTA-var-TTA) to >30°C (for T-var-T).
Figure 5.
Sodium–Potassium difference. (A) Correlation between Tm values (deduced from UV-melting experiments) found in K+ and Na+. (B) ΔTm (K+ - Na+) is shown as a function of loop size (in nucleotides) for each family of sequences.
While quadruplexes easily accommodate a single long loop, incorporating two or three loops of 9 nt or more becomes quickly detrimental, especially in sodium (Figure 4C and F).We tested a single family in which the loops are not entirely composed of thymines (L1 = L3 = TTA). The comparison with (L1 = L3 = TTT) may be made in Figure 4A and D (diamonds and square, respectively). The profiles are relatively similar but, in sodium, the adenines play a positive role on stability for all sequences with a central loop composed of at least 2 nt (up to 6–7°C).Sequence effects on Tm. Tm (deduced from UV-melting experiments) is shown as a function of loop size (in nucleotides) both in potassium (left panels) and sodium (right panels).Sodium–Potassium difference. (A) Correlation between Tm values (deduced from UV-melting experiments) found in K+ and Na+. (B) ΔTm (K+ - Na+) is shown as a function of loop size (in nucleotides) for each family of sequences.
DISCUSSION
In order to improve our understanding of quadruplex stability, we have chosen to deconvoluate sequence from loop length effects. Our previous work was dedicated to a systematic evaluation of all possible bases in 1- or 3-nt-long loops (23,38). In the present study, we analyzed the effects of loop length on intramolecular quadruplex stability, keeping loop composition constant (thymines only). Different studies have addressed this issue, but only for shorter loops (generally 7 bases or less) and in a peculiar sequence context (15). The work presented here offers a general view of this effect thanks to the number of sequence tested, the different motifs considered and the choice of two different cations. The comparison of an important number of sequences in two ionic conditions allow us several conclusions:Most quadruplex structures studied here are stable at physiological temperature, especially when potassium is present.The difference found between Tms in potassium and sodium is highly variable and thus deserves further scrutiny. A recurrent observation is that this ΔTm is ‘maximal’ for sequences with two single-base loops (T-var-T) (38) (Figure 5B). This extreme difference is the result of at least two factors: (a) an unusually high Tm in potassium: the two short loops favor an all-parallel fold which is cation-dependent and (b) a unusually low Tm in sodium (compare Figures 4A and D, red circles). It is likely that the quadruplex topologies are different for these two conditions. In contrast, the difference is ‘minimal’ for the TTA-var-TTA family, as a result of an unusual high stability in sodium (Figure 4D, squares). As a consequence, it is difficult to predict the stability in sodium from the stability in potassium or vice versa as reflected by the poor correlation found between the results in these two ionic conditions (Figure 5A).The most stabilizing loop length also depends on the nature of the cation: while very short loops are almost always favored in potassium (but may lead to intermolecular complexes), the situation is much less clear in sodium: for some families, the highest Tm is obtained for a loop of 5 nt (Figures 4D–E). In general, increasing the length of one loop leads to a decrease in stability (Figure 4). A destabilizing effect is always observed for loops of 6 nt or more. For the five different families here in which a single loop is affected, increasing its size from 6- to 9–15 nt leads to a Tm decrease summarized in Table 3. This effect is significant but relatively mild (−2°C or less per extra thymine) and tends to level off for long sequences.
Table 3.
Effect of increasing loop length on Tm
In Sodiuma
In Potassiuma
From 6 to 9 nt
5.2 ± 1.9
6.5 ± 2.2
From 6 to12 nt
8.8 ± 3.1
11.0 ± 3.5
From 6 to 15 nt
11.6 ± 5.1
14.0 ± 3.2
adestabilization, in °C as compared to the Tm of the quadruplex with a TTTTTT loop. Results averaged for five different families of sequences.
This modest and regular effect prevents us from defining an upper limit for loop length compatible with quadruplex formation under physiological conditions. The commonly used upper limit chosen for loop length in many bioinformatics studies (7 nt) is arbitrary and obviously too stringent, as Tm >50°C may be found with one loop of 12 nt or more, at least in potassium. We therefore tried to refine this upper limit. For technical reasons, to study extremely long sequences, we used a FRET melting assay as intermolecular complexes tend to form with very long oligomers: the FRET assay may be performed at a 10- to 100-fold lower strand concentration therefore limiting the contribution of intermolecular complexes. As shown in Figure 6, a relatively stable quadruplex may still be formed with a central loop of 21 or even 30 nt. Furthermore, the Tm tends to become length independent, demonstrating that one cannot even propose an upper limit for loop size in vitro. This observation has relevance for naturally occurring sequences with loops of variable size and contents such as the ones found in promoters or other genomic regions (38,39) and is in agreement with the lariat type quadruplex observed by Sugiyama and colleagues (40). Note, however that FRET dyes may affect folding and indeed stabilize some topologies, making a direct comparison with unmodified sequences difficult.
Figure 6.
FRET melting experiments. Tm (deduced from FRET melting experiments) is shown as a function of loop size (in nucleotides) both in potassium (full lines) and sodium (dotted lines) for the T-var-T and TTT-var-TTT families.
One may also try to correlate stability with total loop length (adding the number of nucleotides in loops L1, L2 and L3). Results are summarized in Figure 7. Interestingly, a clear negative correlation may be found in potassium while the trend is less defined in sodium, as shown by the corresponding linear correlation coefficients (0.93 and 0.69, respectively). In potassium, each extra nucleotide leads to a 2.04°C decrease in Tm which may roughly be translated in a 0.3 kcal/mol loss in ΔG° (38). We did not observe a plateau around a total loop length of ≈7 nt as found by Bugaut and Balasubramanian (17): Tm tends to reach a plateau but only for much longer loops (Figures 6 and 7A). The correlation found in Figure 7A is strikingly better than the one observed for naturally occurring sequences (24). In the later case, loop composition plays a predominant role and masks the length effects.
Figure 7.
Tm as a function of total loop size. Tm (deduced from UV melting experiments) is plotted versus total (L1 + L2 + L3) loop size in K+ (A) and Na+ (B).
The effects found for shorter loops (5 nt or less) are more variable, especially in sodium. Depending on the sequence context, the most stabilizing loop length may vary between 1 and 5 nt.As expected short loops tend to allow for greater variation in loop type and consequently topology. Indeed, some of the extra species may correspond to different intramolecular topologies. Short loops also tend to favor ‘contaminating’ intermolecular complexes, which complicate a comparison with longer sequences. Such problem is often underappreciated but may be present even for well-known sequences (41). In these experiments, such higher order complexes may be revealed by an anomalous migration on non-denaturing gels. One way to circumvent—but not obliterate—this issue is to use a FRET melting assay and work at much lower concentrations.The results collected here were obtained for intramolecular quadruplexes potentially involving three quartets. It will be interesting to determine if these results apply to G4 structures with two or four quartets. A systematic study of the GGLGGLGGLGG motif is currently underway (Alberti et al., manuscript in preparation). Data on such quadruplexes is scarce (42,43), despite strong biological relevance (thrombin aptamer, some telomeric motifs, etc.)FRET melting experiments. Tm (deduced from FRET melting experiments) is shown as a function of loop size (in nucleotides) both in potassium (full lines) and sodium (dotted lines) for the T-var-T and TTT-var-TTT families.Tm as a function of total loop size. Tm (deduced from UV melting experiments) is plotted versus total (L1 + L2 + L3) loop size in K+ (A) and Na+ (B).Effect of increasing loop length on Tmadestabilization, in °C as compared to the Tm of the quadruplex with a TTTTTT loop. Results averaged for five different families of sequences.Our long term goal is to establish rules to predict the stability of intramolecular G-quadruplexes in vitro based on nucleotide sequences. An obvious application of this work will be to incorporate the numerical values found here in the algorithm developed by Huppert and colleagues (44). Other factors such as multiple quadruplexes (45,46) and molecular crowding (25,47,48) should also be taken into account. Altogether, our concerted efforts should improve the reliability of these predictions, especially for sequence spaces poorly covered before (very long loops). Nevertheless, one can argue that this data collection suffers from an inherent flaw: as quadruplexes adopt a variety of topologies (39,49–52) length and sequence effect are expected to be different for lateral, diagonal and chain reversal loops. Considering that several hundreds of melting profiles have been analyzed in this study (several replicates of 80 sequences done in sodium and potassium, some of them tested at different concentrations) one can easily understand that covering a wider sequence space is hardly feasible. Until recently, no rules were available to reliably predict what kind of quadruplex can be formed by a given sequence. However, recent works offer new possibilities: by inserting 8-bromo G (53,54) and ribo-G at key positions, one can control the topology of folding and hence loop type (55). Deconvolution of loop effects should be simpler in that case and lower the number of samples to be tested: efforts are now being made in that direction.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Institut National de la Santé et de la Recherche Médicale (INSERM), the Fondation pour la Recherche Médicale (FRM); University of Bordeaux 2; Agence Nationale de la Recherche (G4-Toolbox ANR-09-BLAN-0355); Région Aquitaine grants. Funding for open access charge: INSERM.Conflict of interest statement. None declared.
Authors: Giovanni Marsico; Vicki S Chambers; Aleksandr B Sahakyan; Patrick McCauley; Jonathan M Boutell; Marco Di Antonio; Shankar Balasubramanian Journal: Nucleic Acids Res Date: 2019-05-07 Impact factor: 16.971
Authors: Robert Hänsel-Hertsch; Dario Beraldi; Stefanie V Lensing; Giovanni Marsico; Katherine Zyner; Aled Parry; Marco Di Antonio; Jeremy Pike; Hiroshi Kimura; Masashi Narita; David Tannahill; Shankar Balasubramanian Journal: Nat Genet Date: 2016-09-12 Impact factor: 38.330