| Literature DB >> 31597270 |
Yogini P Bhavsar-Jog1, Eric Van Dornshuld2, Tracy A Brooks3, Gregory S Tschumper4, Randy M Wadkins5.
Abstract
G-quadruplexes (G4s) and i-motifs (iMs) are tetraplex DNA structures. Sequences capable of forming G4/iMs are abundant near the transcription start sites (TSS) of several genes. G4/iMs affect gene expression in vitro. Depending on the gene, the presence of G4/iMs can enhance or suppress expression, making it challenging to discern the underlying mechanism by which they operate. Factors affecting G4/iM structures can provide additional insight into their mechanism of regulation. One such factor is epigenetic modification. The 5-hydroxymethylated cytosines (5hmCs) are epigenetic modifications that occur abundantly in human embryonic stem cells (hESC). The 5hmCs, like G4/iMs, are known to participate in gene regulation and are also enriched near the TSS. We investigated genomic co-localization to assess the possibility that these two elements may play an interdependent role in regulating genes in hESC. Our results indicate that amongst 15,760 G4/iM-forming locations, only 15% have 5hmCs associated with them. A detailed analysis of G4/iM-forming locations enriched in 5hmC indicates that most of these locations are in genes that are associated with cell differentiation, proliferation, apoptosis and embryogenesis. The library generated from our analysis is an important resource for investigators exploring the interdependence of these DNA features in regulating expression of selected genes in hESC.Entities:
Keywords: DNA nanomaterials; DNA secondary structures; cytosine-rich DNA
Mesh:
Substances:
Year: 2019 PMID: 31597270 PMCID: PMC6804057 DOI: 10.3390/molecules24193619
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1The overall distribution of the number of sequences that have putative iM-forming potential. (a) The number of sequences with putative iM potential and their position relative to the TSS of genes. (b) The number of putative iM-forming sequences that have 5hmCs co-localized within 100 bp of an iM relative to the TSS of genes.
Figure 2Positional dependence of iM and 5hmC localization. (a) The overall density of iM-forming sequences relative to the TSS. (b) The density of iM-forming sequences with 5hmCs co-localized within 100 bp of a putative iM.
Figure 3Relationship between iM potential (Equation (1) below) and 5hmC density. (a) The contour plot for the sequences upstream of the TSS shows that 5hmC enrichment is associated with sequences with low iM-forming potential. (b) The contour plot for the downstream sequences shows that the 5hmC enrichment is associated with sequences with high iM-forming potentials.
Correlations between number of 5hmC, iM-forming potential, and GC-content in 1 kb region upstream and downstream of TSS. The correlations were also computed for the upper and lower 95% confidence interval range. Since the correlation coefficients are very small, further analysis was done in order to assess whether or not these coefficients were significantly different from zero at a significance level of 0.05. The (*) indicates that the correlation significantly differs from being zero.
| 1 kb upstream of TSS | Correlation | Lower 95% | Upper 95% | Signif. Prob | |
|---|---|---|---|---|---|
| iM Potential | GC content | 0.29 | 0.25 | 0.34 | <0.0001 * |
| 5hmC content | GC content | 0.12 | 0.07 | 0.17 | <0.0001 * |
| 5hmC content | iM Potential | −0.07 | −0.12 | −0.02 | 0.01 * |
|
| |||||
| iM Potential | GC content | 0.30 | 0.25 | 0.34 | <0.0001 * |
| 5hmC content | GC content | 0.06 | 0.01 | 0.10 | <0.0293 * |
| 5hmC content | iM Potential | −0.02 | −0.07 | 0.03 | 0.4782 |
Figure 4PANTHER pie charts showing the molecular function distributions of G4/iM-forming genes (irrespective of whether or not 5hmC-modifications) (a) upstream of TSS, and (b) downstream of TSS; 5hmC-containing genes (irrespective of presence of G4s/iM-forming sequences) (c) upstream and (d) downstream of the TSS; iM-forming genes that have 3 or more 5hmCs associated with them (e) upstream, and (f) downstream of the TSS.
Functional classification of iM-forming genes with three or more 5hmCs within a G4/iM-forming sequence upstream of the TSS. The number of 5hmCs present is given in parenthesis.
| Differentiation Proliferation Apoptosis | Embryogenesis | Transcription Translation | Metabolism/Biosynthesis Cytoskeletal Organization Transport/Ion Binding Enzyme Activity |
|---|---|---|---|
Functional classification of iM-forming genes with three or more 5hmCs within a G4/iM-forming sequence downstream of the TSS. The number of 5hmCs present is given in parenthesis.
| Differentiation Proliferation Apoptosis | Embryogenesis | Transcription Translation | Metabolism/Biosynthesis Cytoskeletal Organization Transport/Ion Binding Enzyme Activity |
|---|---|---|---|
DAVID clustering of biological processes of iM-forming genes with three or more 5hmCs within a G4/iM-forming sequence upstream relative to TSS. The statistical significance of these processes being enriched versus random gene selection is indicated by the p value shown.
| Biological Processes Cluster 1 | Biological Processes Cluster 2 | |||
|---|---|---|---|---|
| Regulation of Protein Kinase Activity | Regulation of Kinase Activity | Regulation of Transferase Activity | Negative Regulation of Macromolecular Metabolic Processes | Negative Regulation of Transcription |
| ZGPAT | ZGPAT | ZGPAT | PRR15L | PRR15L |
| BIRC7 | BIRC7 | BIRC7 | CST3 | FOXH1 |
| CHRM1 | CHRM1 | CHRM1 | FOXH1 | SORBS3 |
| DUSP2 | DUSP2 | DUSP2 | SORBS3 | ZGPAT |
| ZGPAT | ||||
DAVID clustering of biological processes of iM-forming genes with three or more 5hmCs within a G4/iM-forming sequence downstream relative to TSS. The statistical significance of these processes being enriched versus random gene selection is indicated by the p value shown.
| Biological Processes Cluster 1 | Biological Processes Cluster 2 | ||
|---|---|---|---|
| Metal Ion Binding | Calcium Ion Binding | Identical Protein Binding | Protein Homodimerization Activity |
| CTCFL | S100A16 | S100A16 | S100A16 |
| S100A16 | PLCB2 | SP100 | SP100 |
| SP100 | SCUBE2 | EMILIN1 | ENG |
| NME4 | SLC25A23 | ENG | |
| PLCB2 | VIL1 | ||
| SCUBE2 | |||
| SLC25A3 | |||
| VAV1 | |||
| VIL1 | |||
| DPEP1 | |||