| Literature DB >> 32264909 |
Jian-Hong Sun1,2, Shi-Meng Ai3, Shu-Qun Liu4.
Abstract
BACKGROUND: CpGs, the major methylation sites in vertebrate genomes, exhibit a high mutation rate from the methylated form of CpG to TpG/CpA and, therefore, influence the evolution of genome composition. However, the quantitative effects of CpG to TpG/CpA mutations on the evolution of genome composition in terms of the dinucleotide frequencies/proportions remain poorly understood.Entities:
Keywords: Dinucleotide; Genome composition; Genome evolution; Methylation-induced mutation
Mesh:
Substances:
Year: 2020 PMID: 32264909 PMCID: PMC7140373 DOI: 10.1186/s12976-020-00122-x
Source DB: PubMed Journal: Theor Biol Med Model ISSN: 1742-4682 Impact factor: 2.432
Observed frequencies/proportions of the 16 dinucleotides and GC contents
| ApA/TpT | ApC/GpT | ApG/CpT | ApT | CpA/TpG | CpC/GpG | CpG | GpA/TpC | GpC | TpA | GC% | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Proportionobs vs. Proportionini | ↑ | ↓ | ↑ | ↑ | ↑ | ↓ | ↓ | ↓ | ↓ | ||
| 18.66% | 10.20% | 14.31% | 7.47% | 14.73% | 10.74% | 1.05% | 4.13% | 41.89% | |||
| 19.57% | 9.69% | 14.13% | 7.75% | 13.95% | 10.84% | 1.09% | 12.35% | 4.11% | 41.10% | ||
| 19.01% | 10.40% | 14.53% | 7.14% | 15.37% | 9.61% | 1.14% | 11.89% | 4.94% | 41.78% | ||
| 19.55% | 10.08% | 13.99% | 7.71% | 14.52% | 10.43% | 1.01% | 11.87% | 4.29% | 40.96% | ||
| 22.12% | 11.31% | 9.26% | 14.64% | 6.96% | 1.79% | 10.50% | 3.92% | 36.62% | |||
| 19.63% | 10.07% | 13.99% | 7.74% | 14.50% | 10.38% | 0.98% | 11.86% | 4.26% | 40.99% | ||
| 18.06% | 10.69% | 14.74% | 7.29% | 14.95% | 10.54% | 0.85% | 12.44% | 4.13% | 41.93% | ||
| Papio anubis (olive baboon) | 19.56% | 10.16% | 14.04% | 7.62% | 14.48% | 10.39% | 1.05% | 11.94% | 4.26% | 41.00% | |
| 18.65% | 10.18% | 14.35% | 7.44% | 14.70% | 10.74% | 1.08% | 4.17% | 41.95% | |||
| 19.21% | 10.07% | 13.90% | 7.46% | 14.43% | 11.09% | 1.24% | 11.90% | 4.42% | 41.91% |
Note: Only the autosomes of each genome were included in the statistical analyses; the symbols ‘↑’ and ‘↓’ represent an increase and decrease of the dinucleotide proportions observed (Proportionobs) in genomes relative to the assumed initial proportions (Proportionini) obtained based on GCini% = 50% (see Supplementary Table 3, Additional file 1), respectively; the values highlighted in bold exhibit changing trends incompatible with those predicted by MDM (see Table 2)
Expected/calculated proportions/frequencies of the 16 dinucleotides and GC contents obtained by MDM (GC % = 50%)
| ApA/TpT | ApC/GpT | ApG/CpT | ApT | CpA/TpG | CpC/GpG | CpG | GpA/TpC | GpC | TpA | GC% | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Proportionexp vs. Proportionini | ↑ | ↓ | ↑ | ↑ | ↑ | ↓ | ↓ | ↓ | ↓ | ↔ | ↓ |
| 13.82% | 12.12% | 13.84% | 7.71% | 17.70% | 11.16% | 1.05% | 11.18% | 5.17% | 6.25% | 44.80% | |
| 13.72% | 12.22% | 13.98% | 7.62% | 17.66% | 11.02% | 1.09% | 11.28% | 5.15% | 6.25% | 44.84% | |
| 13.60% | 11.87% | 13.66% | 7.99% | 17.62% | 11.34% | 1.14% | 11.40% | 5.14% | 6.25% | 44.89% | |
| 13.74% | 12.29% | 13.90% | 7.66% | 17.74% | 11.10% | 1.01% | 11.26% | 5.05% | 6.25% | 44.76% | |
| 13.62% | 12.34% | 13.44% | 7.54% | 16.96% | 11.56% | 1.79% | 11.38% | 5.12% | 6.25% | 45.54% | |
| 13.67% | 12.44% | 13.98% | 7.59% | 17.76% | 11.02% | 0.98% | 11.33% | 4.97% | 6.25% | 44.98% | |
| 13.84% | 12.06% | 13.90% | 7.79% | 17.90% | 11.10% | 0.85% | 11.16% | 5.14% | 6.25% | 44.60% | |
| 13.66% | 12.42% | 13.98% | 7.58% | 17.70% | 11.02% | 1.05% | 11.34% | 5.01% | 6.25% | 44.80% | |
| 13.78% | 12.17% | 13.84% | 7.69% | 17.68% | 11.16% | 1.08% | 11.22% | 5.14% | 6.25% | 44.83% | |
| 13.72% | 12.22% | 13.96% | 7.63% | 17.66% | 11.04% | 1.09% | 11.28% | 5.15% | 6.25% | 44.99% |
Note: The values presented were obtained by application of MDM to the assumed initial genome state with GCini% = 50%; the symbols‘↑’, ‘↓’ and ‘↔‘represent an increase, decrease, and no-change of the expected/calculated dinucleotide proportions (Proportionexp) relative to the assumed initial proportions (Proportionini; see Supplementary Table 3, Additional file 1), respectively
Fig. 1Relative differences between the expected and observed proportions of the 16 dinucleotides
Fig. 2Relative differences between the expected and observed GC contents
CpG to TpG/CpA mutation-caused changing trends in the numbers of dinucleotides
| ApA/TpT | ApC/GpT | ApG/CpT | ApT | CpA/TpG | CpC/GpG | CpG | GpA/TpC | GpC | TpA | |
|---|---|---|---|---|---|---|---|---|---|---|
| Changing trend | ↑ | undetermined | ↑ | ↑ | ↑ | ↓ | ↓ | ↓ | ↓ | ↔ |
Note: The changing trends are inferred directly from the matrix Q, with the symbols‘↑’, ‘↓’ and ‘↔’ representing an increase, decrease, and no-change, respectively, in the numbers of corresponding dinucleotides; ‘undetermined’ denotes that the changing trends in the numbers of ApC/GpT cannot be determined from the matrix Q without parameter estimation
Parameters estimated by statistics of the trinucleotides NpCpG and CpGpM
| 0.2804 | 0.2590 | 0.2075 | 0.2531 | 0.2525 | 0.2072 | 0.2588 | 0.2815 | |
| 0.2654 | 0.2857 | 0.2128 | 0.2361 | 0.2360 | 0.2127 | 0.2856 | 0.2657 | |
| 0.3397 | 0.2284 | 0.2161 | 0.2158 | 0.2157 | 0.2166 | 0.2281 | 0.3396 | |
| 0.2683 | 0.2679 | 0.2285 | 0.2353 | 0.2349 | 0.2286 | 0.2675 | 0.2690 | |
| 0.2887 | 0.2092 | 0.2523 | 0.2497 | 0.2498 | 0.2527 | 0.2096 | 0.2879 | |
| 0.2537 | 0.2818 | 0.2422 | 0.2223 | 0.2218 | 0.2420 | 0.2818 | 0.2544 | |
| 0.2856 | 0.2606 | 0.2053 | 0.2485 | 0.2483 | 0.2052 | 0.2605 | 0.2860 | |
| 0.2553 | 0.2837 | 0.2384 | 0.2226 | 0.2224 | 0.2386 | 0.2837 | 0.2554 | |
| 0.2778 | 0.2598 | 0.2150 | 0.2474 | 0.2466 | 0.2148 | 0.2595 | 0.2790 | |
| 0.2663 | 0.2838 | 0.2137 | 0.2361 | 0.2356 | 0.2135 | 0.2840 | 0.2668 |
Note: Only the autosomes of each genome were included in the statistical analyses