| Literature DB >> 34450640 |
Subramaniyam Ravichandran1, Maria Razzaq1, Nazia Parveen1, Ambarnil Ghosh1, Kyeong Kyu Kim1.
Abstract
G-quadruplex (G4), a four-stranded DNA or RNA structure containing stacks of guanine tetrads, plays regulatory roles in many cellular functions. So far, conventional G4s containing loops of 1-7 nucleotides have been widely studied. Increasing experimental evidence suggests that unconventional G4s, such as G4s containing long loops (long-loop G4s), play a regulatory role in the genome by forming a stable structure. Other secondary structures such as hairpins in the loop might thus contribute to the stability of long-loop G4s. Therefore, investigation of the effect of the hairpin-loops on the structure and function of G4s is required. In this study, we performed a systematic biochemical investigation of model G4s containing long loops with various sizes and structures. We found that the long-loop G4s are less stable than conventional G4s, but their stability increased when the loop forms a hairpin (hairpin-G4). We also verified the biological significance of hairpin-G4s by showing that hairpin-G4s present in the genome also form stable G4s and regulate gene expression as confirmed by in cellulo reporter assays. This study contributes to expanding the scope and diversity of G4s, thus facilitating future studies on the role of G4s in the human genome.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34450640 PMCID: PMC8501965 DOI: 10.1093/nar/gkab739
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic representation of a conventional G4 structure (A) and a hairpin-G4 (B). The hairpin-G4 is characterized by the coexistence of a hairpin and G4 in the same structure, unlike the conventional G4.
List of oligonucleotides used in this study
| Sample name | Sequence | Length |
|---|---|---|
| SH-G4 | TGGGTGGGTGGGTTGTCGGCGACATGGGT | 29 |
| SI-1-G4 | TGGGTGGGTGGGTTGTCGGCGACTTGGGT | 29 |
| SI-2-G4 | TGGGTGGGTGGGTTCTCGGCGACTTGGGT | 29 |
| MH-G4 | TGGGTGGGTGGGTTGTCAGTATGGCATACTGACATGGGT | 39 |
| MI-1-G4 | TGGGTGGGTGGGTTGTCAGAATGGCATACTGACATGGGT | 39 |
| MI-2-G4 | TGGGTGGGTGGGTTATCAGTATGCCACACTGACATGGGT | 39 |
| LH-G4 | TGGGTGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGT | 49 |
| LI-1-G4 | TGGGTGGGTGGGTTGTGAGTATAGACTGGCAGACTATACTGACATGGGT | 49 |
| LI-2-G4 | TGGGTGGGTGGGTGGTCTGTGTAGACTGGCGGAATATACTGACATGGGT | 49 |
| SU-G4 | TGGGTGGGTGGGTCTTCTTACATATGGGT | 29 |
| MU-G4 | TGGGTGGGTGGGTCTTCTTATATATTCTTCTTACAGGGT | 39 |
| LU-G4 | TGGGTGGGTGGGTCTTCTTACTTATTCTTCTTACTTATTCTTCTTGGGT | 49 |
| SU-G4 LOOP ONLY | TCTTCTTACATAT | 13 |
| MU-G4 LOOP ONLY | TCTTCTTATATATTCTTCTTACA | 23 |
| LU-G4 LOOP ONLY | TCTTCTTACTTATTCTTCTTACTTATTCTTCTT | 33 |
| SH-G4 LOOP ONLY | TTGTCGGCGACAT | 13 |
| MH-G4 LOOP ONLY | TTGTCAGTATGGCATACTGACAT | 23 |
| LH-G4 LOOP ONLY | TTGTCAGTATAGTCTGGCAGACTATACTGACAT | 33 |
| Pu22myc | TGAGGGTGGGTAGGGTGGGTAA | 22 |
| CHST1-G4 | CGGGTGGGTGGGGGCGGGCTCCGGAGCCTGGCTGCGGAGTGGGT | 44 |
| CHST1-MutG4 | CGGGTGGGTGGGGGCAAAAAAAAAAAAAAAAAAAAAAAGTGGGT | 44 |
| MCM4-G4 | TGGGTGGGTACCGGCCCGAGCTGGGCCGCGGGTGGGT | 37 |
| MCM4-MutG4 | TGGGTGGGTAAAAAAAAAAAAAAAAAAGCGGGTGGGT | 37 |
| MCM4-G4NGT | TGGGTGGGTACCGCGCCGAGCTGGCGCGCGGGTGGGT | 37 |
| MSI1-G4 | AGGGCGTTCCCGCGGCCGGGCCCCCGCGCCGGGGTGGGTGGGG | 43 |
| MSI1-MutG4 | AGGGCGAAAAAAAAAAAAAAAAAAAAAAAAGGGGTGGGTGGGG | 43 |
| NRBP1-G4 | TGGGTGGGCGGGGCCCGGCCCTCGGGCGTTCGCTGGGGTGGGC | 43 |
| NRBP1-MutG4 | TGGGTGGGCGGGGAAAAAAAAAAAAAAAAAAAAAGGGGTGGGC | 43 |
| TMCC3-G4 | GGGGGTGGGTGGGGGGTCCAGGCGGCTGCGGGGCGCGGGA | 40 |
| TMCC3-MutG4 | GGGGGTGGGTGGGGGGTCCAAAAAAAAAAAAAAAACGGGA | 40 |
| L5′TH-G4 | TGGGTGGGTGGGTTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGT | 54 |
| L3′TH-G4 | TGGGTGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATTGGGT | 54 |
| L5′TTH-G4 | TGGGTGGGTGGGTTTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGT | 54 |
| L3′TTH-G4 | TGGGTGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATTTGGGT | 54 |
| LTTTH-G4 | TGGGTGGGTGGGTTTTTGTCAGTATAGTCTGGCAGACTATACTGACATTTTGGGT | 55 |
| LTH-G4 | TGGGTGGGTGGGTTTGTCAGTATAGTCTGGCAGACTATACTGACATTGGGT | 54 |
| LTTH-G4 | TGGGTGGGTGGGTTTTGTCAGTATAGTCTGGCAGACTATACTGACATTTGGGT | 54 |
| L5′TTTH-G4 | TGGGTGGGTGGGTTTTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGT | 54 |
| L3′TTTH-G4 | TGGGTGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATTTTGGGT | 54 |
| LU1-G4 | TGGGTCTTCTTACTTATTCTTCTTACTTATTCTTCTTGGGTGGGTGGGT | 54 |
| LU2-G4 | TGGGTGGGTCTTCTTACTTATTCTTCTTACTTATTCTTCTTGGGTGGGT | 54 |
| LU-G4 | TGGGTGGGTGGGTCTTCTTACTTATTCTTCTTACTTATTCTTCTTGGGT | 54 |
| LH1-G4 | TGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGTGGGTGGGT | 54 |
| LH2-G4 | TGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGTGGGT | 54 |
| LH-G4 | TGGGTGGGTGGGTTGTCAGTATAGTCTGGCAGACTATACTGACATGGGT | 54 |
Stem–loop details and loop free energies in model hairpin-G4s and unstructured loop mutants
| Sample name | Loop sequence | Loop length | Predicted* Δ | Predicted* | Stem length | Hairpin loop length |
|---|---|---|---|---|---|---|
| SH-G4 | TTGTCGGCGACAT | 13 | –0.21 | 39.6 | 2 | 5 |
| SI-1-G4 | TTGTCGGCGACTT | 13 | 0.09 | 35.9 | 2 | 5 |
| SI-2-G4 | TTCTCGGCGACTT | 13 | 1.38 | –3.4 | 1 | 5 |
| MH-G4 | TTGTCAGTATGGCATACTGACAT | 23 | –5.14 | 63.9 | 7 | 5 |
| MI-1-G4 | TTGTCAGAATGGCATACTGACAT | 23 | –2.7 | 54.8 | 5 | 9 |
| MI-2-G4 | TTATCAGTATGCCACACTGACAT | 23 | –1.21 | 47.7 | 5 | 7 |
| LH-G4 | TTGTCAGTATAGTCTGGCAGACTATACTGACAT | 33 | –9.94 | 69.7 | 12 | 5 |
| LI-1-G4 | TTGTGAGTATAGACTGGCAGACTATACTGACAT | 33 | –3.23 | 51.2 | 10 | 5 |
| LI-2-G4 | TGGTCTGTGTAGACTGGCGGAATATACTGACAT | 33 | –1.38 | 49.4 | 4 | 4 |
| SU-G4 | TCTTCTTACATAT | 13 | No folding | Not Determined | 0 | 0 |
| MU-G4 | TCTTCTTATATATTCTTCTTACA | 23 | 2.94 | –47.9 | 1 | 9 |
| LU-G4 | TCTTCTTACTTATTCTTCTTACTTATTCTTCTT | 33 | 2.63 | –21.5 | 2 | 11 |
*Predictions were carried out using UNAFold DNA-folding form server with parameters 100 mM Na+ and 1 mM Mg2+ at 37°C.
Figure 2.Hairpin-G4s can form stable G4s with higher stability than the mutants with unstructured loops. (A) CD spectra of short (S), medium (M) and long (L) loop-G4s with perfect hairpin loop (H-G4s), intermediate hairpin loop (I1 and I2-G4s) and unstructured loop (U-G4s) showed characteristic G4 peaks at 240 and 260nm. (B) CD thermal melting analysis showed that G4s with hairpin loops (H/I-G4s) are more stable than the G4 with unstructured loop (U-G4s) in varying Mg2+ concentrations. LI-2-G4 was extremely stable at 10mM MgCl2, and hence the melting temperature could not be determined. All measurements were taken in 10 mM HEPES buffer (pH 7.5) containing 100 mM KCl and 1 mM MgCl2. The ramp rate for CD melting experiments was set to 0.2°C/min. Error bars represent ± SD.
Comparison between unstructured loop and hairpin-forming loop ΔGh and hairpin-G4 and unstructured-loop mutant CD Tm
| Sample name | Predicted loop Δ | CD |
|---|---|---|
| SU-G4 | No folding | 69.48 ± 1.17 |
| SH-G4 | –0.21 | 71.39 ± 1.37 |
| SI-1-G4 | 0.09 | 69.42 ± 0.14 |
| SI-2-G4 | 1.38 | 68.67 ± 0.09 |
| MU-G4 | 2.94 | 65.34 ± 0.03 |
| MH-G4 | –5.14 | 69.71 ± 0.04 |
| MI-1-G4 | –2.7 | 66.11 ± 0.28 |
| MI-2-G4 | –1.21 | 63.27 ± 0.08 |
| LU-G4 | 2.63 | 62.97 ± 0.29 |
| LH-G4 | –9.94 | 68.80 ± 0.28 |
| LI-1-G4 | –3.23 | 63.55 ± 0.50 |
| LI-2-G4 | –1.38 | 66.26 ± 0.01 |
CD experiments were performed in buffer containing 100 mM KCl and 1 mM MgCl2 in 10 mM HEPES (pH 7.5). ΔGh was predicted using UNAFold DNA-folding form server with parameters 100 mM Na+ and 1 mM Mg2+ at 37°C. The ramp rate of 0.2°C/min was used for CD melting.
Figure 3.Position of hairpin-forming long loop and hairpin-flanking region do not affect the stability of hairpin-G4s. CD spectra (A) and Tm (B) of model hairpin-G4s containing hairpins at different loop positions. CD spectra (C) and Tm (D) of model hairpin-G4s with different hairpin-flanking regions. G4s with unstructured loops (LU-G4s) have been tested for comparison. All measurements were taken in 10 mM HEPES buffer (pH 7.5) containing 100 mM KCl and 1 mM MgCl2. The ramp rate for CD melting experiments was set to 0.2°C/min. Error bars represent ± SD (standard deviation) of two trials.
Figure 4.SYBR Green is an effective probe to identify the presence of hairpin in putative hairpin-G4s. (A) Schematic representation of the SYBR Green fluorescence assay. SYBR Green binds to the dsDNA stem region of the hairpin in G4s, which results in fluorescence enhancement. However, in G4s without hairpins, the fluorescence enhancement is not observed due to the absence of SYBR Green binding to the DNA. (B) SYBR Green fluorescence spectra for model hairpin-G4s (H/I-G4s) and unstructured-loop mutant (U-G4) and conventional G4 (pu22myc) shows highest fluorescence enhancement for hairpin-G4s, confirming hairpin formation.
Figure 5.Both model hairpin-G4s and genomic hairpin-G4 candidates form predominantly intramolecular G4s. Model hairpin-G4s (H/I-G4s) and unstructured-loop mutant (U-G4) (A) and hairpin G4s from the genome (B) and their unstructured-loop mutants were electrophoresed on a 15% Native PAGE gel stained with SYBR Gold. Samples were prepared in 10 mM HEPES (pH 7.5) and 100 mM KCl with 1 mM MgCl2 and electrophoresed in 1× TBE.
Figure 6.Hairpin-G4s show characteristics of hairpin and G4 formation and are more stable than their unstructured loop mutants. In 1H NMR spectra analysis, hairpin-G4s (SH-G4, MH-G4, and LH-G4) show peaks for hairpin formation between 12.5–15 ppm, whereas G4s with unstructured loops (SU-G4, MU-G4 and LU-G4) do not show peaks for hairpin formation. Both H-G4s and U-G4s display peaks for G4 formation between 10–12.5 ppm. Temperature-dependent NMR spectra of H-G4s and U-G4s shows that H-G4s are more stable than U-G4s because the peaks for G4 start to disappear at higher temperatures. All spectra were recorded in 10 mM HEPES pH 7.5, 100 mM KCl, 1 mM MgCl2 at 25°C.
Figure 7.The effect of hairpin formation in the candidate G4s present in the human gene promoters on the promoter activity. (A) Schema showing reporter construction for analyzing cis-regulatory effect of hairpin-G4s on luciferase reporter activity. (B) Cis-regulatory effect of the model hairpin-G4 and their mutants with the unstructured loop on the luciferase reporter activity (C) Cis-regulatory effect of the candidate-G4 containing hairpin and their mutants with the unstructured loop on the luciferase reporter activity. G4 sequences are inserted in the promoter region of the luciferase reporter. (D) The reporter was constructed in the same way in Figure 7A except inserting the G4 sequences into the 5′UTR region. (E) The model G4s (LU-G4 and LH-G4) have been tested as controls in both sense and antisense strand. (F) Reporter activity was also analyzed for candidate-G4 containing hairpin and their mutants in the 5′UTR. All data was normalized to the respective control G4 containing unstructured loop. In all experiments, luciferase expression was quantified 24 hours after transfection. Error bars represent standard deviations of three individual trials (N = 3). Significance was calculated using one-sample t-test (*P ≤ 0.05; **P ≤ 0.01; ***P ≤ 0.001).
Figure 8.Hairpin-G4s can act as transcriptional barriers. T7 polymerase stop assay using model G4s (A), and candidate G4s (wild type and mutant containing unstructured loop) (B). Large amounts of arrested (A) and slipped transcript (S), and less amount of full length transcript (FL) are observed in G4s with stable hairpin compared to the counterparts containing the unstructured loop. ‘M’ represents the size marker.