| Literature DB >> 21718472 |
Alexander Y Panchin1, Sergey I Mitrofanov, Andrei V Alexeevski, Sergey A Spirin, Yuri V Panchin.
Abstract
BACKGROUND: The substitution rates within different nucleotide contexts are subject to varying levels of bias. The most well known example of such bias is the excess of C to T (C > T) mutations in CpG (CG) dinucleotides. The molecular mechanisms underlying this bias are important factors in human genome evolution and cancer development. The discovery of other nucleotide contexts that have profound effects on substitution rates can improve our understanding of how mutations are acquired, and why mutation hotspots exist.Entities:
Mesh:
Year: 2011 PMID: 21718472 PMCID: PMC3152918 DOI: 10.1186/1471-2105-12-268
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Two-dimensional plot of mutation bias versus minimal contrast. Two-dimensional plot of mutation bias versus minimal contrast of all 1-4 bp mutation contexts. Several exclusive clusters are outlined. The small orange cluster contains {C > T|1,CG} and {G > A|2,CG} contexts. It can be discerned by minimal contrast from the large orange cluster that contains all 3-4 bp contexts with {C > T|1,CG} and {G > A|2,CG} subcontexts and only such contexts. The red cluster contains six 4 bp contexts distinguishable by both minimal contrast and mutation bias.
Figure 2Histogram of mutation biases and minimal contrasts. Histogram of mutation biases (Figures 2A and 2B) and minimal contrasts (Figure 2C and 2D). Figures 2A and 2C are complete histograms. Figures 2B and 2D are enlarged fragments of the outlined areas of figures 2A and 2C, respectively. 2B: red selection contains only those contexts that have {C > T|1, CG} and {G > A|2, CG} subcontexts and the contexts {C > T|1, CG} and {G > A|2, CG} themselves. 2D: the red selection contains only the {C > T|1, CG} and {G > A|2, CG} contexts. The brown selections in figures 2B and 2D contain only the following contexts: {T > C|2, ATTG} and complementary {A > G|3, CAAT}, {T > C|2, ATAG} and {A > G|3, CTAT}, {A > C|1, ACAA} and {T > G|4, TTGT}. 2C, 2D: red dots show a normal distribution (expectation 0.99, standard deviation 0.12).
Mutation contexts
| Mutation type | Subcontext | Context | Minimal contrast | Mutation bias | ||
|---|---|---|---|---|---|---|
| C > T | 5.1 | 4.7 | 5.1 | 4.7 | ||
| G > A | C | 5.1 | 4.7 | 5.1 | 4.7 | |
| T > C | A | 2.1 | 2.2 | 3.5 | 3.6 | |
| T > C | A | A | 1.8 | 1.8 | 3.3 | 3.5 |
| A > G | CA | CA | 2.1 | 2.2 | 3.5 | 3.6 |
| A > G | T | CT | 1.8 | 1.8 | 3.3 | 3.5 |
| A > C | 1.9 | 1.9 | 3.4 | 3.4 | ||
| T > G | TG | TTG | 1.9 | 1.9 | 3.5 | 3.4 |
This table presents contexts of mutations that have minimal contrast values higher than 1.7 and mutation bias values higher than 3. The mutated nucleotides are in bold and underlined. "NG" stands for "No Genes" (regions that do not belong to any known genes, CpG islands, or 1000 bp flanking regions of known genes) and "WG" stands for "Whole Genome". The subcontext is the one that is used for the minimal contrast calculation and is the same for the no gene and whole genome datasets for all mentioned contexts.