| Literature DB >> 22016335 |
Tomasz Zemojtel1, Szymon M Kielbasa, Peter F Arndt, Sarah Behrens, Guillaume Bourque, Martin Vingron.
Abstract
The formation of new transcription factor-binding sites (TFBSs) has a major impact on the evolution of gene regulatory networks. Clearly, single nucleotide mutations arising within genomic DNA can lead to the creation of TFBSs. Are molecular processes inducing single nucleotide mutations contributing equally to the creation of TFBSs? In the human genome, a spontaneous deamination of methylated cytosine in the context of CpG dinucleotides results in the creation of thymine (C → T), and this mutation has the highest rate among all base substitutions. CpG deamination has been ascribed a role in silencing of transposons and induction of variation in regional methylation. We have previously shown that CpG deamination created thousands of p53-binding sites within genomic sequences of Alu transposons. Interestingly, we have defined a ∼30 bp region in Alu sequence, which, depending on a pattern of CpG deamination, can be converted to functional p53-, PAX-6-, and Myc-binding sites. Here, we have studied single nucleotide mutational events leading to creation of TFBSs in promoters of human genes and in genomic regions bound by such key transcription factors as Oct4, NANOG, and c-Myc. We document that CpG deamination events can create TFBSs with much higher efficiency than other types of mutational events. Our findings add a new role to CpG methylation: We propose that deamination of methylated CpGs constitutes one of the evolutionary forces acting on mutational trajectories of TFBSs formation contributing to variability in gene regulation.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22016335 PMCID: PMC3228489 DOI: 10.1093/gbe/evr107
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FCpG deamination creates TFBSs with high efficiency in human promoters. Each point represents a CE value calculated for each of 217 TFs. The horizontal axis represents CE values recorded in HCPs, and the vertical axis represents CE values obtained in LCPs. Average CE values for CpG deamination–driven creation events (CEdeam) and nondeamination–driven creation events (CEnondeam) are highlighted in bold red and blue digits, respectively.
Numbers of Created In Vivo Binding Sites and Corresponding Creation Efficiencies
| Factor | d+ | d− | Nond+ | Nond− | CEdeam | CEnondeam | CEdeam/CEnondeam | % of d+ | |
| c-Myc_canonical | 5 | 187 | 13 | 4050 | 0.026042 | 0.003200 | 8.0 | 9.4E−4 | 27.8 |
| c-Myc_noncanonical | 26 | 976 | 21 | 11706 | 0.025948 | 0.001791 | 14.5 | 3.4E−17 | 55.3 |
| Ctcf | 90 | 3356 | 832 | 199548 | 0.026117 | 0.004152 | 6.3 | 5.2E−40 | 9.8 |
| Irf4 | 24 | 769 | 327 | 136133 | 0.030265 | 0.002396 | 12.6 | 1.9E−18 | 6.8 |
| Nanog | 70 | 972 | 246 | 91794 | 0.067178 | 0.002672 | 25.1 | 2.2E−111 | 22.2 |
| Nfya | 21 | 1207 | 50 | 27906 | 0.017101 | 0.001788 | 9.6 | 7.1E−13 | 29.6 |
| Nfyb | 22 | 1417 | 70 | 34235 | 0.015288 | 0.002041 | 7.5 | 9.4E−12 | 23.9 |
| Nrsf | 5 | 195 | 10 | 4207 | 0.025000 | 0.002371 | 10.5 | 3.5E−4 | 33.3 |
| OCT4 | 33 | 382 | 99 | 24123 | 0.079518 | 0.004039 | 19.7 | 2.4E−29 | 25.0 |
| Pax5 | 83 | 9321 | 244 | 146697 | 0.008826 | 0.001661 | 5.3 | 1.4E−29 | 25.4 |
| Pbx3 | 10 | 609 | 35 | 15217 | 0.016155 | 0.002294 | 7.0 | 6.1E−6 | 22.2 |
| Rxra | 26 | 1299 | 83 | 55298 | 0.019622 | 0.001498 | 13.1 | 4.0E−19 | 23.8 |
| Usf1 | 52 | 506 | 272 | 17701 | 0.093190 | 0.015134 | 6.2 | 2.1E−23 | 16.0 |
| YY1 | 8 | 1354 | 27 | 19203 | 0.005874 | 0.001404 | 4.2 | 1.3E−3 | 22.9 |
NOTE.—The following values are listed: the numbers of observed new site creations due to CpG deamination events (d+) and nondeamination events (nond+); the numbers of potential deamination and nondeamination-driven creation events which could lead to site creation but have not been observed (d− and nond−, respectively); creation efficiencies for deamination-driven creation events (CEdeam), nondeamination-driven creation events (CEnondeam), and percentages of deamination-driven creation events (% of d+). The P value was calculated with Fisher’s exact test.
FCpG deamination drives creation of in vivo TFBSs. (A) Upper panel: Histogram of c-Myc- and Oct4-binding sites created via single nucleotide mutation events. The presence of the CpA and TpG dinucleotides within individual binding sites is highlighted by a violet background. Lower panel: Histogram of single nucleotide mutation events leading to creation of TFBSs. CpG deamination–driven creation events and non-CpG deamination–driven creation events are depicted in red and blue, respectively. (B) PAX-6-, c-Myc-, and p53-binding sites are created via deamination of methylated CpGs in Alu transposons. R-AluS/J_cons: consensus sequence of the CpG-containing region in the right arm of AluS and AluJ subfamilies.