| Literature DB >> 21729868 |
Pankaj Kumar1, Vinod Kumar Yadav, Aradhita Baral, Parveen Kumar, Dhurjhoti Saha, Shantanu Chowdhury.
Abstract
Function of non-B DNA structures are poorly understood though several bioinformatics studies predict role of the G-quadruplex DNA structure in transcription. Earlier, using transcriptome profiling we found evidence of widespread G-quadruplex-mediated gene regulation. Herein, we asked whether potential G-quadruplex (PG4) motifs associate with transcription factors (TF). This was analyzed using 220 position weight matrices [designated as transcription factor binding sites (TFBS)], representing 187 unique TF, in >75,000 genes in human, chimpanzee, mouse and rat. Results show binding sites of nine TFs, including that of AP-2, SP1, MAZ and VDR, occurred significantly within 100 bases of the PG4 motif (P < 1.24E-10). PG4-TFBS combinations were conserved in 'orthologously' related promoters across all four organisms and were associated with >850 genes in each genome. Remarkably, seven of the nine TFs were zinc-finger binding proteins indicating a novel characteristic of PG4 motifs. To test these findings, transcriptome profiles from human cell lines treated with G-quadruplex-specific molecules were used; 66 genes were significantly differentially expressed across both cell-types, which also harbored conserved PG4 motifs along with one/more of the nine TFBS. In addition, genes regulated by PG4-TFBS combinations were found to be co-regulated in human tissues, further emphasizing the regulatory significance of the associations.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21729868 PMCID: PMC3185432 DOI: 10.1093/nar/gkr536
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Flowchart summarizing the approach adopted in this study. Schema of the strategy followed to test genome wide association of TF with PG4 motif(s).
Figure 2.PG4 motif positional conservation across orthologously related promoters. Scheme showing identification of conserved PG4 motif within ±200 bases in orthologously related promoters (±2 kb of TSS) of human, chimpanzee, mouse and rat and search for associated TFBS. H represents a human gene and C, M and R represent their orthologous in chimpanzee, mouse and rat, respectively.
Distribution of PG4 motifs near TSS
| ORFs studied | Total no. of PG4 motif in promoters | Promoters with at least one PG4 motif | Conserved PG4 motifs in 871 orthologously | |
|---|---|---|---|---|
| Human | 20 664 | 50 939 | 14 836 | 1563 |
| Chimpanzee | 20 601 | 41 811 | 14 184 | 1666 |
| Mouse | 19 656 | 33 738 | 13 738 | 1459 |
| Rat | 15 163 | 20 148 | 9470 | 1350 |
a±2 kb centered at TSS.
bHuman, chimpanzee, mouse and rat.
Figure 3.Strategy followed for genome wide comparative analysis to identify enriched presence of TFBS within promoters harboring conserved PG4 motifs in human, chimpanzee, mouse and rat.
Figure 4.Zinc finger TFBS are closely associated with PG4 motifs. Distribution of TFBS with respect to PG4 motif (PG4–TFBS inter-distance) in promoters of human, chimpanzee, mouse and rat that harbor conserved PG4 motifs. Pseudocolor represents percentage frequency of PG4–TFBS inter-distance values in bins of 100 bases relative to nearest PG4 motif. Asterisk represents additional zinc finger TF found in the study.
Significance of PG4–TFBS co-occurrence within ±100 bp in promoters with conserved PG4 motifs in human, chimpanzee, mouse and rat
| Human | Chimpanzee | Mouse | Rat | ||
|---|---|---|---|---|---|
| TF name | |||||
| 1 | SP1 | <E-300 | 2.43E-143 | 1.20E-76 | 1.36E-41 |
| 2 | WT1 | <E-300 | 5.31E-41 | 7.70E-11 | 1.24E-10 |
| 3 | KROX | <E-300 | 2.20E-31 | 1.35E-48 | 1.56E-20 |
| 4 | MAZ | <E-300 | 1.59E-105 | 3.56E-57 | 2.35E-18 |
| 5 | VDR | 1.24E-262 | 1.40E-269 | 1.46E-19 | 1.44E-11 |
| 6 | Kid3 | <E-300 | <E-300 | <E-300 | <E-300 |
| 7 | ZF5 | <E-300 | <E-300 | 1.48E-190 | 8.27E-139 |
| 8 | ETF | <E-300 | 6.33E-268 | 1.03E-123 | 4.06E-81 |
| 9 | AP-2 | <E-300 | <E-300 | 4.04E-69 | 2.84E-74 |
aIndicates value
Functional annotation of TFBS significantly co-occurring with conserved PG4 motifs (within 100 bases)
| TF Name | Classification of TF | Involvement in Biological processes/ Pathways | Key regulated genes by TF |
|---|---|---|---|
| SP1 | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain, Ubiquitous factors | Cell cycle; MAPK signaling; TGF-β signaling | |
| VDR | Zinc-coordinating DNA binding domains, Cys4 zinc finger of nuclear receptor type, Thyroid hormone receptor-like factors | Cell-cycle progression, proliferation and growth, Osteoblastic differentiation | |
| KROX | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain, cell-cycle regulators | Cell cycle, apoptosis | |
| WT1 | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain, cell-cycle regulators, GLI-like | Cell cycle, MAPK signaling, apoptosis | |
| MAZ | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain | Cell cycle, apoptosis, lymphocyte development, neural differentiation | |
| Kid3 | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain, Krueppel-like | Kidney and brain development | |
| ZF5 | Zinc-coordinating DNA binding domains, C2H2 zinc-finger domain, Krueppel-like | Cell cycle, cell proliferation, induction of programmed cell death | |
| AP-2 | Basic Domains, bHSH | Cell cycle TGF-β signaling, MAPK signaling | |
| ETF | Helix-turn-helix, TEA domain | Cell cycle |
aRelevant references showing involvement of particular TF in biological processes/pathways are given in Supplementary Table S2.
Figure 5.Genes harboring conserved PG4–TFBS associations are differentially expressed in presence of G-quadruplex binding ligand. Left panel: expression profile of genes with conserved PG4 motif that have significant differential expression in both cell lines on treatment with ligand. Pseudocolor representing their relative expression values in HeLaS3 and A549 cells. Right panel: association of PG4 motif with TFBS in promoters of differentially expressed genes shown in left panel; black box represents PG4 motif and associated pseudocolor shows number of TFBS within 100-base windows relative to TSS.
TFBS enriched on promoters harboring conserved PG4 motif that are differentially expressed in A549 and HeLaS3 cells after treatment with G-quadruplex binding ligand
| TFBS associated with conserved PG4 motif (within 100 bases) | Differentially expressed genes with PG4–TFBS in promoters | |
|---|---|---|
| AP2 | 34 | 2.05E-40 |
| ETF | 28 | 4.61E-53 |
| Kid3 | 58 | 1.64E-09 |
| KROX | 19 | 3.29E-13 |
| MAZ | 24 | 5.55E-13 |
| SP1 | 27 | 1.88E-25 |
| VDR | 18 | 1.02E-05 |
| WT1 | 20 | 5.31E-11 |
| ZF5 | 41 | 2.68E-44 |