A polyguanine/polycytosine (polyG/polyC) tract in the proximal promoter of the vascular endothelial growth factor (VEGF) gene is essential for transcriptional activation. The guanine-rich (G-rich) and cytosine-rich (C-rich) strands on this tract are shown to form specific secondary structures, characterized as G-quadruplexes and i-motifs, respectively. Mutational analysis of the G-rich strand combined with dimethyl sulfate (DMS) footprinting, a polymerase stop assay, and circular dichroism (CD) spectroscopy revealed that the G-quadruplex containing a 1:4:1 double-chain reversal loop is the most thermodynamically stable conformation that this strand readily adopts. These studies provide strong evidence that the size of loop regions plays a critical role in determining the most favored folding pattern of a G-quadruplex. The secondary structure formed on the complementary C-rich strand was also determined by mutational analysis combined with Br(2) footprinting and CD spectroscopy. Our results reveal that at a pH of 5.9 this strand is able to form an intramolecular i-motif structure that involves six C-C(+) base pairs and a 2:3:2 loop configuration. Taken together, our results demonstrate that the G-quadruplex and i-motif structures are able to form on the G- and C-rich strands, respectively, of the polyG/polyC tract in the VEGF proximal promoter under conditions that favor the transition from B-DNA to non-B-DNA conformations.
A polyguanine/polycytosine (polyG/polyC) tract in the proximal promoter of the vascular endothelial growth factor (VEGF) gene is essential for transcriptional activation. The guanine-rich (G-rich) and cytosine-rich (C-rich) strands on this tract are shown to form specific secondary structures, characterized as G-quadruplexes and i-motifs, respectively. Mutational analysis of the G-rich strand combined with dimethyl sulfate (DMS) footprinting, a polymerase stop assay, and circular dichroism (CD) spectroscopy revealed that the G-quadruplex containing a 1:4:1 double-chain reversal loop is the most thermodynamically stable conformation that this strand readily adopts. These studies provide strong evidence that the size of loop regions plays a critical role in determining the most favored folding pattern of a G-quadruplex. The secondary structure formed on the complementary C-rich strand was also determined by mutational analysis combined with Br(2) footprinting and CD spectroscopy. Our results reveal that at a pH of 5.9 this strand is able to form an intramolecular i-motif structure that involves six C-C(+) base pairs and a 2:3:2 loop configuration. Taken together, our results demonstrate that the G-quadruplex and i-motif structures are able to form on the G- and C-rich strands, respectively, of the polyG/polyC tract in the VEGF proximal promoter under conditions that favor the transition from B-DNA to non-B-DNA conformations.
VEGF, a key regulator of angiogenesis, plays an important role in tumor survival, growth and metastasis (1,2) and is overexpressed in many types of humancancers, including glioma, renal cell carcinoma, and ovarian and pancreatic cancer (3–6). VEGF expression is regulated by many external stimuli, such as hypoxia (7–9), growth factors (10,11), hormones (12) and cytokines (13,14). Also, the loss or inactivation of tumor suppressor genes (15,16) and the activation of oncogenes (17) are reported to cause VEGF upregulation. Studies on the molecular mechanism of VEGF gene expression have defined major cis-acting elements and transcription factors involved in the regulation of this expression (18–20). These studies showed that a polyG/polyC tract in the VEGF proximal promoter region (–85 to –50 bp relative to the initiation of transcription; Figure 1) is essential for basal and inducible VEGF expression. This polyG/polyC tract contains three potential Sp1 binding sites (Figure 1). It was reported that the deletion of this polyG/polyC tract dramatically decreases the basal promoter activity by about 90% and abolishes VEGF expression induced by platelet-derived growth factor (PDGF) in NIH3T3 cells (18). In another report, the deletion of this polyG/polyC tract totally abrogated VEGF promoter activity in PANC-1cancer cells (19). Given the importance of this tract for VEGF promoter activity, we examined the DNA sequence of this region and found that the G-rich strand contains four consecutive runs of guanines separated by one or two cytosine bases, and the C-rich strand contains four consecutive runs of cytosines separated by one or two guanine bases. These sequences are consistent with the general motifs that form G-quadruplex and i-motif structures.
Figure 1.
Schematic diagram of the proximal promoter region of the human VEGF gene, with the location and base sequence of the polyG/polyC tract in this region shown. Three Sp1 binding sites are enclosed in boxes, and the four runs of guanines are underlined. The DNase I and S1 nuclease hypersensitive sites are indicated with filled circles and arrowheads, respectively.
Schematic diagram of the proximal promoter region of the humanVEGF gene, with the location and base sequence of the polyG/polyC tract in this region shown. Three Sp1 binding sites are enclosed in boxes, and the four runs of guanines are underlined. The DNase I and S1 nuclease hypersensitive sites are indicated with filled circles and arrowheads, respectively.DNA sequences containing consecutive runs of guanines are known to form intermolecular or intramolecular G-quadruplex structures. These are four-stranded structures consisting of stacked G-tetrads having a planar association of four guanines held together by Hoogsteen hydrogen bonding (21–25). DNA sequences containing consecutive runs of cytosines, at acidic pH levels, are capable of forming intermolecular or intramolecular four-stranded i-motif structures. The two parallel-stranded C–C+ (one cytosine must be hemiprotonated at the N3 position) base-paired duplexes intercalate into each other in an antiparallel orientation. G-quadruplex and i-motif structures have been extensively studied and have been shown to form in the telomeric ends of eukaryotic chromosomes (26–30) and in the G- and C-rich sequences located in the proximal promoter regions of c-myc (21,31,32), KRAS (30,33), Rb (34) and RET (35) genes. In the present study, we demonstrate the formation of these structures in the VEGF promoter.Within supercoiled plasmid DNA, the G-rich strand in the polyG/polyC tract of the VEGF proximal promoter is very dynamic and able to adopt non-B-DNA conformations, which are resistant to cleavage by both DNase I and S1 nuclease in the presence of K+ or the G-quadruplex-interactive agent telomestatin (36,37). The G-rich strand of the VEGF proximal promoter contains five runs of guanines (Figure 1). Our previous work has shown that the four 5′-end runs of guanines are involved in the formation of an intramolecular G-quadruplex, whereas inclusion of the fifth run of guanines results in the formation of intermolecular G-quadruplexes (37). Therefore, we focused this study on the presumably more biologically relevant G-quadruplex formed by the first four runs of guanines at the 5′-end. Results from a polymerase stop assay and CD studies of the G-rich strand in this region show the formation of an intramolecular parallel G-quadruplex structure (36). The most thermodynamically stable G-quadruplexes formed by the G-rich strand in the VEGF proximal promoter were studied using mutational analysis, CD melting, DMS footprinting and a polymerase stop assay. This present investigation has added significance since we have recently shown that the G-quadruplex-interactive agents that are able to stabilize these G-quadruplexes repress VEGF expression in cancer cells, which offers a new approach to antiangiogenesis therapy.Here, we demonstrate the formation of both G-quadruplex and i-motif structures in the VEGF proximal promoter and characterize their folding patterns and most likely structures using biochemical and biophysical techniques. Finally, other sequences in promoter regions that form these G-quadruplex and i-motif structures have been compared to provide additional insight into the conservation of these elements.
MATERIALS AND METHODS
Materials
The oligonucleotides used in this work were purchased from Biosearch Technologies (Novato, CA, USA), and their sequences are listed in Table 1.
Table 1.
Oligonucleotides used in this study
Bold letters indicate mutated bases.
Oligonucleotides used in this studyBold letters indicate mutated bases.
CD spectroscopy
CD spectra were recorded on a Jasco-810 spectrophotometer (Jasco, Easton, MD, USA) at room temperature (∼25°C), using a quartz cell of 1 mm optical path length, an instrument scanning speed of 100 nm/min, with a response time of 1 s and over a wavelength range of 200–350 nm. Each spectrum was recorded three times, smoothed and baseline-corrected for signal contributions from used buffers.For the G-quadruplex study, the oligonucleotides were diluted to a strand concentration of 10 μM in 50 mM Tris–HCl, pH 7.4, with an appropriate concentration of KCl. In the Tm experiment with 20 mM KCl, the molar ellipticities at 262 nm at different temperatures (4–95°C) were recorded and plotted against temperatures.For the i-motif study, the oligonucleotides were diluted to a strand concentration of 10 μM in 50 mM Tris–acetate at appropriate pHs. To determine the transition mid-point of the VEGF i-motif structure, the molar ellipticities at 288 nm against pHs were plotted and the mid-point was determined.
DMS footprinting
The oligonucleotides were 5′-end-labeled with 32P and purified by micro-spin column 6. The 32P-labeled oligonucleotides were treated with 2% DMS for 5 min in either the absence of KCl or the presence of 100 mM KCl. The DMS-treated oligonucleotides were loaded on a 16% nondenaturing polyacrylamide gel to separate the single-stranded DNA and intramolecular G-quadruplex from other intermolecular G-quadruplexes by their different electrophoretic mobilities. The DNA were recovered and subjected to piperidine cleavage (21).
Polymerase stop assay
The polymerase stop assay templates were designed by placing the VEGF proximal promoter G-rich sequence, or the various mutant G-rich sequences, in a polymerase stop assay cassette, as described previously (38). 5′-end-labeled primer p28 d(TAATACGACTCACTATAGCAATTGCGTG) and template DNA were annealed in an annealing buffer (50 μM Tris–HCl, pH 7.5, 10 μM NaCl) by heating at 95°C for 5 min and slowly cooling to room temperature. The primer-annealed template oligonucleotides were purified by electrophoresis using a 12% nondenaturing polyacrylamide gel. The purified primer template oligonucleotides were used in a primer extension assay with Taq DNA polymerase, as described previously (38).
Native polyacrylamide gel electrophoresis (PAGE)
The pHs of PAGE gel solution in 1 × Tris–Acetate–EDTA and running buffer (1 × Tris–acetate–EDTA) were both adjusted to pH 5.0. Ten thousand cpm of 32P-labeled IM1 or T24 in 50 mM Tris–acetate, pH 5.0, was heated at 95°C for 5 min, cooled to room temperature and incubated at room temperature for 2 h before loading on a 20% nondenaturing PAGE gel.
Br2 footprinting experiment
The Br2 footprinting experiment was carried out in accordance with published procedure (39) to probe the secondary structure formed by IM1. In brief, IM1 was 5′-end-labeled with 32P using T4 polynucleotide kinase and [γ-32P] ATP, and the labeled IM1 was gel-purified using 12% polyacrylamide gel electrophoresis under denaturing conditions (7 M urea). For the Br2 cleavage reaction, the purified 5′-end-labeled IM1 was treated for 20 min with molecular Br2 that was generated in situ by mixing an equal molar concentration (50 mM) of KBr with KHSO5 in the same tube. The reactions were then terminated by adding 50 μl of stop mix containing 0.6 M Na-acetate (pH 5.2) and 10 mg/ml calf thymus DNA, and unreacted Br2 was removed by ethanol precipitation. Following ethanol precipitation, the DNA pellet was dried and resuspended with 100 μl of freshly diluted 1 M piperidine, and the samples were heated at 90°C for 30 min to induce bromination-specific strand cleavage. Following piperidine treatments, the DNA samples were completely dried and resuspended with alkaline sequencing gel loading dye and applied to a 20% sequencing gel. The purine- and pyrimidine-specific reactions were carried out using formic acid or hydrazine to generate sequencing markers, following published procedure (39).
RESULTS
Formation of G-quadruplex structures in the G-rich strand of the VEGF proximal promoter
We have reported that the G-rich strand of the VEGF proximal promoter is able to form an unusual DNA secondary structure in the presence of KCl or G-quadruplex-interactive-agents, such as TMPyP4 and telomestatin (36). In the presence of 100 mM KCl, the CD spectrum of the wild-type G-rich sequence in the VEGF proximal promoter (WT, Table 1A) exhibits a characteristic positive peak at 264 nm and a negative peak at 240 nm, suggesting that it forms a parallel G-quadruplex structure (Figure 2A) (35,36). In the sequences studied, we added 5′ and 3′ thymidine tails and showed that these did not affect the folding patterns relative to the native sequences.
Figure 2.
CD and DMS footprinting of the wild-type VEGF G-rich strand (WT) and other mutant sequences. (A) CD spectrum of WT in the presence of 100 mM KCl. (B) DMS footprinting of WT, C9T, C9A, C10T and C10A in the absence of KCl (−) or in the presence of 100 mM KCl (+). (C) DMS footprinting of G11/12T. The AG and TC sequencing reactions for WT are shown on the left of (B).
CD and DMS footprinting of the wild-type VEGF G-rich strand (WT) and other mutant sequences. (A) CD spectrum of WT in the presence of 100 mM KCl. (B) DMS footprinting of WT, C9T, C9A, C10T and C10A in the absence of KCl (−) or in the presence of 100 mM KCl (+). (C) DMS footprinting of G11/12T. The AG and TC sequencing reactions for WT are shown on the left of (B).To resolve the G-quadruplex structure formed by this strand, we performed DMS footprinting to identify guanines involved in the G-tetrads, by their resistance to methylation at position N7 due to Hoogsteen hydrogen bonding (21). In this process, the G-quadruplex-forming oligonucleotides were subjected to DMS treatment in the absence or presence of KCl. The DMS-treated samples were resolved on a nondenaturing polyacrylamide gel, and the intramolecularly folded DNA samples were recovered from the gel and subjected to piperidine cleavage. In the absence of KCl, the DMS methylation pattern of the guanines along the WT sequence is consistent with an unstructured DNA form (Figure 2B, lane 3), whereas in the presence of 100 mM KCl, some guanines were well-protected against DMS methylation, while others showed enhanced methylation and subsequent cleavage (Figure 2B, lane 4). The DMS footprinting pattern of WT in the presence of 100 mM KCl demonstrated a signature for G-quadruplex formation where four tracts of three guanines (G2–G4, G6–G8, G13–G15 and G17–G19) form the four strands of a G-quadruplex with a 1:4:1 loop arrangement (two 1-base loops and one 4-base loop) consisting of C5, C9–G12 and C16, respectively (defined as G-quadruplex I; Figure 3B). Unexpectedly, G14, which is located in the run of five guanines (G11–G15), showed only modest methylation by DMS, which is consistent with our previous observation (37).
Figure 3.
(A) DMS footprinting of oligonucleotides G11T, G12T, G14T and G14A in the absence of KCl (−) or in the presence of 100 mM KCl (+). The sequence of G11T is shown to the right of the gel. For the other sequences, only the mutated region of G11–G15 is shown. The mutation sites in each sequence are indicated with an asterisk. (B) The two different folding patterns (1:4:1 and 1:2:3) for the VEGF G-quadruplexes inferred from this data.
(A) DMS footprinting of oligonucleotidesG11T, G12T, G14T and G14A in the absence of KCl (−) or in the presence of 100 mM KCl (+). The sequence of G11T is shown to the right of the gel. For the other sequences, only the mutated region of G11–G15 is shown. The mutation sites in each sequence are indicated with an asterisk. (B) The two different folding patterns (1:4:1 and 1:2:3) for the VEGF G-quadruplexes inferred from this data.In an attempt to understand the reactivity mechanism of G14 to DMS and to extend our previous study (37), we first determined whether the two cytosines in the central loop (C9 and C10) are involved in cleavage at G14 by forming possible C–G base pairs. Sequences containing C-to-T or C-to-A mutations (C9T, C9A, C10T and C10A in Table 1A) in the central loop were examined first. In the presence of 100 mM KCl, the CD spectra of these sequences are similar to that of oligonucleotide WT (data not shown), suggesting that they all form parallel G-quadruplexes. DMS footprinting results showed that methylation at G14 still persists (Figure 2B, lanes 6, 8, 10 and 12), with some decrease in oligomers C9T and C10A compared to the WT sequence. DMS footprinting of an oligomer with G-to-T mutations at positions 11 and 12 (G11/12T in Table 1A), which forms a parallel G-quadruplex based on CD (data not shown), revealed that methylation at G14 disappeared (Figure 2C). This experiment demonstrates that the reactivity to DMS at G14 in the WT sequence is dependent on the presence of the two guanines in the loop. Thus, this reactivity of G14 to DMS comes from either the alternative use of the five consecutive guanines in forming G-quadruplexes (see below) or other possible interplay among G14, G11 and G12. However, a clear explanation for the reactivity of G14 to DMS requires detailed information on the VEGF G-quadruplex by NMR or X-ray crystallography.
DMS footprinting, polymerase stop assay and CD studies show that there are two equilibrating G-quadruplex loop isomers, of which the 1:4:1 form is the most stable
In the G-rich strand of the VEGF proximal promoter, the third run of guanines contains five guanines (G11–G15), but only three consecutive guanines are required to form a G-quadruplex. In principle, different loop isomers using these five guanines are able to form by using three different contiguous guanines, i.e. G11–G13, G12–G14 or G13–G15. To evaluate this hypothesis, we designed several mutant sequences (G11T, G12T, G14T and G14A in Table 1A). The CD spectra of these four sequences in the presence of 100 mM KCl all demonstrated parallel G-quadruplex formation (data not shown). In the presence of KCl, DMS footprinting of G11T and G12T showed that four runs of three guanines (G2–G4, G6–G8, G13–G15 and G17–G19) are well protected against DMS methylation and subsequent cleavage, while the other guanines (G1, G12 and G20 in G11T, and G11 and G20 in G12T) showed enhanced DMS methylation and subsequent cleavage (Figure 3A, lanes 2 and 4). Overall, the G-quadruplex structures formed by these mutants at the 5′-end of the run of five guanines (G11T and G12T) use the same three G-tetrads, with a 1:4:1 arrangement of loops, as those formed by the WT oligomer (defined as G-quadruplex I; Figure 3B). For sequences G14T and G14A, we saw a different DMS footprinting pattern: the three contiguous guanines at the 5′-end of the third run of guanines (G11–G13) were well protected against DMS methylation (Figure 3A, lanes 6 and 8), suggesting that guaninesG11–G13 are involved in G-quadruplex formation. DMS footprinting of G14T and G14A in the presence of 100 mM KCl demonstrated a pattern of G-quadruplex formation where G2–G4, G6–G8, G11–G13 and G17–G19 form the four strands that connect the three tetrads, with a 1:2:3 arrangement of loops composed of C5; C9 and C10; and G14, G15 and C16, respectively (defined as G-quadruplex II; Figure 3B).The stabilities of G-quadruplexes I and II were compared using a polymerase stop assay (Figure 4A). Very significant stop products were seen for the sequences that form G-quadruplexes using the three guanines at the 3′-end (G13–G15) of the five guanine run with increasing concentrations of KCl (Figure 4A, Pol-WT, Pol-G11T, Pol-G12T, Pol-G11/12T), whereas only minor stop products were observed for the sequences that form G-quadruplexes using the three guanines at the 5′-end (G11–G13) (Figure 4A, Pol-G14T and Pol-G14A). We conclude that the G-quadruplex structure (G-quadruplex I) formed using the three 3′-end guanines (G13–G15), which forms a 1:4:1 loop isomer species, is more stable than the one (G-quadruplex II) formed using the three 5′-end guanines (G11–G13). This is further confirmed by CD melting experiments, which showed that the G-quadruplex structures with the 1:4:1 loop arrangement (G11T, G12T, G11/12T) have higher Tm (64–67°C versus 57–61°C) than the G-quadruplexes with the 1:2:3 loop arrangement (G14T and G14A) (Figure 4B).
Figure 4.
(A) Polymerase stop assay of Pol-WT, Pol-G11T, Pol-G12T, Pol-G11/12T, Pol-G14T and Pol-G14A with increasing concentrations of KCl. The primer site, stop product site and full-length product site are indicated with arrows. Sequencing reactions of guanine for each sequence are indicated at the top of each lane (G). (B) The polymerase stop assay cassette sequence is shown with a box indicating the different G-quadruplex sequences studied. The primer site and stop site product are shown with arrows. The G-quadruplex sequences are shown below the polymerase stop assay cassette sequence, with their Tm on the right.
(A) Polymerase stop assay of Pol-WT, Pol-G11T, Pol-G12T, Pol-G11/12T, Pol-G14T and Pol-G14A with increasing concentrations of KCl. The primer site, stop product site and full-length product site are indicated with arrows. Sequencing reactions of guanine for each sequence are indicated at the top of each lane (G). (B) The polymerase stop assay cassette sequence is shown with a box indicating the different G-quadruplex sequences studied. The primer site and stop site product are shown with arrows. The G-quadruplex sequences are shown below the polymerase stop assay cassette sequence, with their Tm on the right.
The C-rich strand of the VEGF proximal promoter forms an intramolecular i-motif structure
To determine whether an i-motif structure is formed by the C-rich strand of the VEGF proximal promoter, CD spectra of IM1 (Table 1B), a sequence identical to the C-rich strand of the VEGF proximal promoter, were collected at different pHs. As shown in Figure 5A, at acidic pH (≤5.9), there is a characteristic positive peak at 288 nm and a negative peak at 265 nm, with crossover at 276 nm, indicating the formation of an i-motif structure. At pH >7, the positive peak sharply decreases, showing that the i-motif stability decreases with increasing pH. This indicates that an unstructured DNA forms at neutral pH due to the deprotonation and the disruption of C–C+ base-pairing. The transition mid-point pH was determined to be pH 5.8 by plotting the molar ellipticity at 288 nm versus pHs (Figure 5B).
Figure 5.
(A) CD spectra of IM1 at room temperature in 50 mM Tris–acetate buffer at different pHs (8.0–4.4). (B) pH dependence of the molar ellipticity at 288 nm from (A). (C) IM1 and T24 on denaturing PAGE gel at pH 8.3 (left) and on a nondenaturing PAGE gel at pH 5.0 (right). (D) CD spectra of IM1–IM5 at pH 5.5.
(A) CD spectra of IM1 at room temperature in 50 mM Tris–acetate buffer at different pHs (8.0–4.4). (B) pH dependence of the molar ellipticity at 288 nm from (A). (C) IM1 and T24 on denaturing PAGE gel at pH 8.3 (left) and on a nondenaturing PAGE gel at pH 5.0 (right). (D) CD spectra of IM1–IM5 at pH 5.5.Native PAGE was used to differentiate between the formation of intramolecular and intermolecular i-motif structures (31,40,41). A 24-mer dT oligonucleotide, T24 (Table 1B), was used as a control in this experiment. T24 and IM1 were first subjected to electrophoresis on a denaturing PAGE gel at pH 8.3, where it was found that IM1 ran as a single band with a similar mobility to T24, which is similar in size to IM1 (Figure 5C). Subsequently, we ran the same oligomers on a native PAGE gel at pH 5.0, conditions at which IM1 forms an i-motif structure as determined by CD. The results in Figure 5C show that IM1 travels as one band faster than T24, suggesting that under acidic conditions, IM1 adopts an intramolecular structure.If an intramolecular i-motif structure is formed, all four runs of cytosines should be involved in i-motif formation. Therefore, mutations within each run of cytosines should disrupt this formation. To test this, a single C-to-T mutation, double C-to-T mutations or triple C-to-T mutations were introduced in the middle of runs of three cytosines, four cytosines or five cytosines, respectively, to disrupt the cytosine repeats and determine their involvement in i-motif formation. CD spectra of these different mutant sequences (IM2–IM5, Table 1B) and IM1 at pH 5.5 were determined. When compared to the CD spectra of IM1, the i-motif signature peaks (288 nm) of IM2–IM5 were all dramatically decreased (Figure 5D), suggesting that each run of cytosines contributes to base-pairing during i-motif formation, which further supports an intramolecular i-motif formation.
Determination of the cytosines involved in the formation of the intramolecular i-motif by CD analysis
CD analysis of wild-type and mutant sequences was previously used to characterize the i-motif structure formed by the C-rich strand on the 5′-end of the Rb gene (34). In the C-rich sequence of the VEGF promoter, only the third run contains three cytosines. Therefore, it was assumed that there would be only three cytosines in each run involved in the i-motif structure. To identify which three cytosines are involved in the hemiprotonated C–C+ pairing of the i-motif, selective C-to-T mutations were introduced at specific positions on the wild-type C-rich sequence (IM6–IM12, Table 1B). The CD spectra of these sequences were collected and compared with that of IM1. IM6 and IM7 were designed so that the C at either the 5′-end (IM6) or the 3′-end (IM7) of the first run of cytosines was mutated to T. The results show that the mutation at the 3′-end (IM7) produces a greater reduction in molar ellipticity at 288 nm in comparison to the mutation at the 5′-end (IM6) (Figure 6A), suggesting that the C at the 3′-end is more important for i-motif structure formation. Using the same procedure, it was shown that in the second run of cytosines, the central three cytosines are more important for i-motif formation (Figure 6B), and in the fourth run of cytosines, the C at the 3′-end is more critical for i-motif formation (Figure 6C). On the basis of these results, the cytosines of IM1 involved in i-motif formation are proposed to be C2–C4, C7–C9, C13–C15 and C18–C20 (Figure 6D).
Figure 6.
CD spectra of IM1, IM6 and IM7 (A), IM1 and IM8–IM10 (B) and IM1, IM11 and IM12 (C). The cytosines predicted to be involved in i-motif formation are underlined (D).
CD spectra of IM1, IM6 and IM7 (A), IM1 and IM8–IM10 (B) and IM1, IM11 and IM12 (C). The cytosines predicted to be involved in i-motif formation are underlined (D).
Determination of the predominant i-motif structure formed by the C-rich strand by Br2 footprinting
As we previously reported, Br2 footprinting can be used to unambiguously determine the identity of the cytosine residues involved in base pairings and intercalations to form i-motif structures (35). As shown in Figure 7 (left panel), lane 3, C2–C4, C7–C9, C13–C15 and C18–C20 are well protected, suggesting that these cytosines are involved in base pairing and intercalation, whereas cytosine residues C6, C10 and C17 showed enhanced reactivity toward Br2, suggesting that these three residues are located in loop regions. The cytosines involved in base pairing and intercalation determined by Br2 footprinting are the same as those determined by CD analysis. On the basis of the results of CD analysis and Br2 footprinting, a model folding pattern for the VEGF i-motif is proposed, which involves six hemiprotonated C–C+ base pairs formed from four consecutive antiparallel cytosine stretches with a 2:3:2 loop arrangement (Figure 7, right panel). Since there are two very narrow grooves and two very wide grooves in an i-motif (42), we speculate that the two 2-base loops run across narrow grooves and the 3-base loop runs across a wide groove. The predominant i-motif formed by the C-rich strand of the VEGF promoter adopts an antiparallel i-motif folding pattern similar to that found in the human telomeric C-strand, as determined by NMR (42).
Figure 7.
Autoradiogram and densitometric scanning of the autoradiogram from a Br2 footprinting experiment to determine cytosine residues involved in base pairing and intercalation in the intramolecular i-motif folded form. Lane 1 represents the pyrimidine-specific reaction used to generate sequencing marker. Lanes 2 and 3 represent reaction without and with Br2 (0.5 mM) in double-distilled H2O, respectively. On the right is shown the predicted folding pattern of the predominant i-motif structure formed by IM1 based on the results of the Br2 footprinting experiment shown on the left.
Autoradiogram and densitometric scanning of the autoradiogram from a Br2 footprinting experiment to determine cytosine residues involved in base pairing and intercalation in the intramolecular i-motif folded form. Lane 1 represents the pyrimidine-specific reaction used to generate sequencing marker. Lanes 2 and 3 represent reaction without and with Br2 (0.5 mM) in double-distilled H2O, respectively. On the right is shown the predicted folding pattern of the predominant i-motif structure formed by IM1 based on the results of the Br2 footprinting experiment shown on the left.
DISCUSSION
VEGF is one of the most important angiogenic factors and is overexpressed in many types of tumors (43). VEGF and its receptors have been attractive targets for anti-angiogenesis therapy in the last decade (44). In 2004, Bevacizumab, a humanized anti-VEGF antibody, was approved by the FDA as a first-line therapy for colorectal cancer treatment, and there now are small-molecule VEGF receptor inhibitors in different stages of clinical trials (45). Our previous study showed that the polyG/polyC region in the VEGF proximal promoter is very dynamic and able to unwind into single-stranded DNA and then fold into secondary DNA structures, such as G-quadruplexes (36). In this article, we have partially characterized the unique DNA secondary structures formed in the VEGF proximal promoter. The G-rich strand of the VEGF proximal promoter was shown to form a parallel G-quadruplex with three G-tetrads and three double-chain reversal loops containing 1, 4 and 1 bases. From the complementary C-rich strand, an i-motif is formed with six C–C+ base pairs and three loops containing 2, 3 and 2 bases.The four consecutive runs of guanines in the G-rich strand of the polyG/polyC tract in the VEGF proximal promoter have four, three, five and four guanines, respectively. If three consecutive guanines are randomly selected from each run of guanines to form a G-quadruplex, twelve different loop isomers could possibly be formed by the VEGF wild-type G-rich sequence. However, the results of our study revealed that a 1:4:1 loop isomer is the predominant configuration formed by this sequence. Results from the polymerase stop assay show that the 1:4:1 loop isomer is much more efficient than the 1:2:3 loop isomer in arresting DNA synthesis (Figure 4A), suggesting that the 1:4:1 loop isomer may be more biologically relevant. The CD melting experiments confirmed that the 1:4:1 loop isomer has greater stability (4–10°C) than the 1:2:3 loop isomer (Figure 4B). To enhance the production of the 1:2:3 loop isomer, a mutation is required at G14.The major folding difference between these two loop isomers is that the 1:4:1 loop isomer possesses a G3N1G3 motif on the 3′- and 5′-ends, in contrast to a single G3N1G3 motif in the 1:2:3 loop isomer. Since it is known that a double-chain reversal with two edges, each made up of three guanines and connected through a 1-base loop, is very stable and is found in a variety of promoter G-quadruplexes, it is easy to rationalize why the 1:4:1 loop isomer would predominate over the 1:2:3 loop isomer (23,24,35,36,46–48). While the pathway to the 1:4:1 loop isomer is presumably through the intermediacy of two distinct 3′ and 5′ 1-base loop faces, we speculate that the 1:2:3 loop isomer occurs when one 2-base loop forms directly adjacent to a 1-base loop, thus requiring a 3-base loop to complete the G-quadruplex.The DMShypersensitivity ofG14 is a unique feature of the VEGF G-quadruplex-forming element. Interpretation of the DMS footprinting pattern shows that G14 should reside in the central tetrad of the G-quadruplex. As such, it was completely unexpected that this guanine would be hypersensitive to DMS (Figure 2B). This hypersensitivity is in sharp contrast to the well-protected guanines at the adjacent positions (13 and 15) and implies that G14 must exist at some point in a non-Hoogsteen base-paired state, making this residue more accessible to attack by DMS. The DMS footprinting results of a series of oligomers containing mutant VEGF G-rich sequences suggest that G14hypersensitivity to DMS is dependent upon the presence of G11 and G12 (Figure 2C), but not cytosines C9 and C10 (Figure 2B). However, why G14 is hyperreactive to DMS is not clear at this point, and further biophysical studies will be required to define this anomaly.A small but growing number of promoter elements that have been shown to form G-quadruplexes, as well as a more limited number of i-motif-forming sequences, have been characterized by chemical footprinting and CD, and in a few cases NMR studies have been carried out (21,23,35,49–51). For those human promoter elements that have been shown to be critical for transcriptional activation, it is informative to compare the G-quadruplex-forming sequences and their folding patterns. The sequences for the 3-tetrad group are shown in Figure 8A. (Only the predominant G-quadruplex-forming sequence of each promoter is shown.) The similarities are striking. All the G-quadruplexes, with the exception of Bcl-2, have both a 3′- and a 5′-end G3N1G3 motif and all form parallel G-quadruplex structures (22,24,35). The exception, Bcl-2, has a 3-base loop at the 5′-side and forms a mixed parallel/antiparallel folding pattern (23,52). However, what is strikingly different in these structures is the central loop, which differs both in size (2–7 bases) and base composition.
Figure 8.
Comparison of truncated G-quadruplex-forming sequences (A and B) and i-motif-forming sequences (C) within selected gene promoters.
Comparison of truncated G-quadruplex-forming sequences (A and B) and i-motif-forming sequences (C) within selected gene promoters.Outside the class of G-quadruplexes containing three tetrads are a number of structures that have either two or four tetrads (Figure 8B). In the case of PDGF-A, there are four tetrads requiring two bases to form a double-chain reversal loop on the 3′- and 5′-sides (49), so this is a simple incremental variation on the three-tetrad class that also folds to form parallel G-quadruplexes containing three double-chain reversal loops. Myb and Rb are composed of only two tetrads, and while Myb still forms a parallel-stranded G-quadruplex, it requires a heptad on one face to maintain stability (53). In contrast, the Rb forms a basket G-quadruplex with a mixed parallel/antiparallel structure (34).In addition to that described here for VEGF, folding patterns for the i-motifs have been assigned for only RET and Rb, so the data set is more limited (Figure 8C). VEGF and RET are the same with regard to loop size (2:3:2) but differ in the number of i-motif C–C+ hemiprotonated base pairs, i.e. six for VEGF and five for RET. Preliminary modeling studies suggest that the 2:3:2 loop configuration is the minimum required for bridging the antiparallel runs of cytosines (unpublished data). Rb conforms to this rule, having loop sizes of 2:4:2, but in this case only four C–C+ base pairs, which may also be the minimum required for i-motif stability. So while the number of C–C+ base pairs varies from four to six, the 3′- and 5′-end lateral-loop bases are two for each of these i-motifs; but in all cases the constituent bases in the central lateral loop are different.The biological role of the i-motif on the C-rich strand in the VEGF promoter has yet to be determined. Previous studies from the Levens lab have shown that heterogeneous nuclear ribonucleoprotein K (hnRNP K) is able to bind to the C-rich strand, called the CT element, in the NHE III1 region of the c-Myc oncogene promoter to activate c-Myc transcription (54,55). Therefore, the formation of an i-motif structure on the C-rich strand of the VEGF promoter could prevent hnRNP K binding to the single-strand to activate transcription. Thus, i-motif formation on the C-rich strand could be a silencing element for DNA transcriptional regulation.Finally, we have also observed that G-quadruplex-interactive agents are able to stabilize the VEGF G-quadruplex in vitro and downregulate VEGF mRNA levels in cancer cells (37), suggesting that targeting the G-quadruplex formed in the VEGF promoter is a feasible strategy to modulate VEGF expression at the transcriptional level. Therefore, targeting DNA secondary structures in the VEGF promoter could be a novel approach to anti-angiogenesis drug discovery in cancer therapy.
Authors: Igor G Panyutin; Mykola I Onyshchenko; Ethan A Englund; Daniel H Appella; Ronald D Neumann Journal: Curr Pharm Des Date: 2012 Impact factor: 3.116
Authors: Mykola I Onyshchenko; Timur I Gaynutdinov; Ethan A Englund; Daniel H Appella; Ronald D Neumann; Igor G Panyutin Journal: Nucleic Acids Res Date: 2009-12 Impact factor: 16.971