| Literature DB >> 17169996 |
Julian L Huppert1, Shankar Balasubramanian.
Abstract
Certain G-rich DNA sequences readily form four-stranded structures called G-quadruplexes. These sequence motifs are located in telomeres as a repeated unit, and elsewhere in the genome, where their function is currently unknown. It has been proposed that G-quadruplexes may be directly involved in gene regulation at the level of transcription. In support of this hypothesis, we show that the promoter regions (1 kb upstream of the transcription start site TSS) of genes are significantly enriched in quadruplex motifs relative to the rest of the genome, with >40% of human gene promoters containing one or more quadruplex motif. Furthermore, these promoter quadruplexes strongly associate with nuclease hypersensitive sites identified throughout the genome via biochemical measurement. Regions of the human genome that are both nuclease hypersensitive and within promoters show a remarkable (230-fold) enrichment of quadruplex elements, compared to the rest of the genome. These quadruplex motifs identified in promoter regions also show an interesting structural bias towards more stable forms. These observations support the proposal that promoter G-quadruplexes are directly involved in the regulation of gene expression.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17169996 PMCID: PMC1802602 DOI: 10.1093/nar/gkl1057
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1(a) Structure of a G-tetrad, showing hydrogen bonds and monovalent cation. (b) Schematic of an intramolecular G-quadruplex. G-tetrads are shown as blue squares, and monovalent cations as grey spheres. The structure shown is folded in an antiparallel conformation, with the strands of the G-tetrads alternately running up and down. (c) Model for transcription modulation via formation of a quadruplex in a promoter region.
PQS densities in promoters and near nuclease hypersensitive sites
| Location | PQS density (PQS/kbase) | Enrichment compared to genome average |
|---|---|---|
| Genomic DNA | 0.13 | 1× |
| 1 kb upstream of TSS | 0.77 | 6.1× |
| Near NHS cluster (500 bp) | 1.07 | 8.6× |
| Near cluster and in promoter | 28.6 | 230× |
Figure 2(a) Density of PQS with distance upstream from the TSS. The genome as a whole has a density of 0.13 PQS/kb, shown by the dashed line. (b) The percentage of promoter regions containing at least one PQS increases as the size of the promoter increases. This increase is extremely fast over the first 1000 bases. The dashed line shows what percentage would be predicted if the density of PQS were equal to that across the genome as a whole.
Figure 3(a) Percent of PQS identified in given ranges 5′ of a TSS (‘promoter regions’), which have at least one single-base loop. The genome wide average figure is 64% (dashed line). (b) Proportion of PQS upstream of a TSS and within 500 bp of an NHS cluster which have at least one single-base loop decreases rapidly with increasing promoter size. (c) Proportion of PQS with at least one single-base loop in bulk genomic DNA, within 1000 bp upstream of a TSS (promoter), within 500 bp of an NHS cluster, and in promoter/NHS cluster overlap regions. The whole-genome figure is shown as a dashed line.
Experimental results for PQS with frequent NHEs
| Name | No NHE | Sequence | |
|---|---|---|---|
| 20h | 20 | tgtGGGGTCGGGGGAGGGGGGAGGGata | 71 |
| 14h1 | 14 | tgtGGGGTAGGGGGAGGGGGGAGGGata | 71 |
| 14h2 | 14 | gccGGGGCGGCTCGGGACGGGGCCCGGGGAGCGTGGGTGGGacc | 75 |
| 14h3 | 14 | cccGGGACGGGGGCCGGCGGGCCACGGGccc | n/o⊥ |
| 12h | 12 | gctGGGCGAGGGGTGGGAGCAGACGGGctg | 57 |
aNumber of NHE cleavage positions reported within 500 bp of the PQS midpoint ⊥ n/o signifies that no quadruplex melting transition was observed. A melting transition at 70°C was observed at 260 nm for this sequence.
Experimental results for Promoter PQS with frequent NHEs
| Name | No NHE | Sequence | |
|---|---|---|---|
| GUK1 | 3 | agcGGGAGAAGACGGGCTGGGAGGGcgc | 52 |
| MCM2 | 3 | gaaGGGACACGGAGGGGCGGGCCAGAGGGtcc | 64 |
| NKFB2 | 3 | tcaGGGTGGGGGCCCCGAGGGCTGGGGccg | 69 |
| PSMA2 | 4 | gctGGGTTGGGGCGGGGGGAGCGGGacg | 69 |
| UTX | 3 | gccGGGCGGGGAGGGGGGGtca | >80°C |
Gene names: GUK1, Guanylate kinase 1; MCM2, DNA replication licensing factor; NKFB2, DNA-binding factor KBF2, Lyt10; PSMA2, Proteasome subunit alpha type 2; UTX, Ubiquitously transcribed X chromosome tetratricopeptide repeat protein.
aNumber of NHE cleavage positions reported within 50 bp of the PQS.
bThis sequence was further studied in 20 mM KCl and showed a complete melt with a Tm of 66°C.
Figure 4CD spectra of the sequences individually characterized in this study. Peaks at 260 nm indicate a parallel folded quadruplex, whereas peaks at 295 nm indicate an anti-parallel fold.