Sofya A Medvedeva, Alexander Y Panchin, Andrey V Alexeevski, Sergey A Spirin, Yuri V Panchin.
Abstract
In general, mutation frequencies are context-dependent: specific adjacent nucleotides may influence the probability of observing a specific type of mutation in a genome. Recently, several hypermutable motifs were identified in the human genome. Namely, there is an increased frequency of T>C mutations in the second position of the words ATTG and ATAG and an increased frequency of A>C mutations in the first position of the word ACAA. Previous studies have also shown that there is a remarkable difference between mutagenesis in humans and in Drosophila. While C>T mutations are overrepresented in the CG context in humans (and other vertebrates), this mutation regularity is not observed in Drosophila melanogaster. Such differences in the observed regularities of mutagenesis between representatives of different taxa might reflect differences in the mechanisms involved in mutagenesis. We performed a systematic comparison of mutation regularities within 2–4 bp contexts in Homo sapiens and Drosophila melanogaster and found that the aforementioned contexts are not hypermutable in fruit flies. It seems that most mutation contexts affect mutation rates in a similar manner in H. sapiens and D. melanogaster; however, several important exceptions are noted and discussed.
Year: 2013 PMID: 23984310 PMCID: PMC3747623 DOI: 10.1155/2013/173616
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Comparison of nucleotide composition of complete alignments and conserved regions of D. melanogaster.
| Nucleotide | Nucleotide fraction within all positions | Nucleotide fraction within conserved positions | Difference, % |
|---|---|---|---|
| a | 0.2979 | 0.2901 | 2.6 |
| t | 0.2978 | 0.2899 | 2.7 |
| c | 0.2022 | 0.2100 | −3.9 |
| g | 0.2021 | 0.2100 | −3.9 |
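The "Difference, %" column is not defined in the caption. As a minimal sketch, the values appear consistent with a difference taken relative to the fraction within all positions; this formula is an assumption inferred from the numbers, not something stated in the source, and the function name below is illustrative.

```python
# Nucleotide fractions copied from the table above.
all_positions = {"a": 0.2979, "t": 0.2978, "c": 0.2022, "g": 0.2021}
conserved_positions = {"a": 0.2901, "t": 0.2899, "c": 0.2100, "g": 0.2100}


def relative_difference_percent(all_frac, conserved_frac):
    """Assumed formula: (all - conserved) / all * 100."""
    return (all_frac - conserved_frac) / all_frac * 100.0


# Recompute the last column, rounded to one decimal as in the table.
differences = {
    nt: round(relative_difference_percent(all_positions[nt], conserved_positions[nt]), 1)
    for nt in all_positions
}

for nt, diff in differences.items():
    print(f"{nt}: {diff:+.1f}%")
```

Under this assumption, a positive value means the nucleotide is less frequent in conserved positions than in the complete alignments (a and t), and a negative value means it is more frequent there (c and g).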
Figure 1. Mutation bias and minimal contrasts of mutation contexts in D. melanogaster. Each dot represents a mutation context. Triangles represent the context {A>C ∣3, CCA} (as well as its complementary context) and contexts that have this context as a subcontext. Most dots are in pairs because complementary contexts have similar mutation bias and minimal contrast values.
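The pairing of complementary contexts mentioned in the caption can be illustrated with a short sketch. It assumes that a context such as {A>C ∣3, CCA} denotes an A>C mutation at the 1-based position 3 of the word CCA, and that the complementary context is the same event read on the opposite strand. The function names and the tuple representation are illustrative, not taken from the paper.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(word):
    """Return the word as read on the opposite strand."""
    return "".join(COMPLEMENT[base] for base in reversed(word))


def complementary_context(ref, alt, position, word):
    """Map a context (ref>alt at 1-based position in word) to the opposite strand."""
    if word[position - 1] != ref:
        raise ValueError("reference base does not match the word at this position")
    return (
        COMPLEMENT[ref],
        COMPLEMENT[alt],
        len(word) - position + 1,  # position counted from the other end
        reverse_complement(word),
    )


# {A>C |3, CCA} pairs with a T>G mutation at position 1 of TGG.
print(complementary_context("A", "C", 3, "CCA"))
# T>C at position 2 of ATTG pairs with A>G at position 3 of CAAT.
print(complementary_context("T", "C", 2, "ATTG"))
```

Because a mutation on one strand is necessarily observed as its complementary mutation on the other strand, the two contexts in each pair describe the same events, which is consistent with their similar values in the figure.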
Comparison of single nucleotide mutations in D. melanogaster and H. sapiens. Transitions are italic, while transversions are bold.
| Mutation | Fraction | Mutation | Fraction |
|---|---|---|---|
Figure 2. Mutation bias and minimal contrast for D. melanogaster and H. sapiens. Each dot represents a mutation context (blue in D. melanogaster, red in H. sapiens). Dots overlap and are usually in pairs because complementary contexts have similar mutation bias and minimal contrast values.
Figure 3. The difference between H. sapiens and D. melanogaster mutation bias ((a) and (b)) and minimal contrast ((c) and (d)) for 2–4 bp mutation contexts. Each dot represents a mutation context. The X axis represents the context's minimal contrast value, and the Y axis represents the context's mutation bias. The minimal contrast and mutation bias values are given for H. sapiens ((a) and (c)) and for D. melanogaster ((b) and (d)), and the color scheme indicates the difference between minimal contrasts. Thus, red dots in (a) and (c) represent contexts that are hypermutable in humans compared with Drosophila, while green dots represent contexts that are hypermutable in D. melanogaster compared with H. sapiens. This scheme is reversed for (b) and (d).
Over- and underrepresentation of genomic frequencies for several words in H. sapiens and D. melanogaster. Data are taken from the supplementary table of a previous study [20] (available at http://mouse.genebee.msu.ru/words/Supple3(contrast_k).xls). The numbers represent the value C = [(Obs(W) − Exp(W))/Exp(W)] · 100%, where Obs(W) is the observed word frequency and Exp(W) is the expected word frequency (based on the frequencies of all of its subwords).
| Word | Over- and underrepresentation in H. sapiens | Over- and underrepresentation in D. melanogaster |
|---|---|---|
| Words containing a mutation context with increased mutation bias in H. sapiens | | |
| CG | −76.37% | −5.93% |
| ATAG | −0.79% | 4.38% |
| ATTG | −7.07% | −2.35% |
| ACAA | 1.62% | 3.75% |
| Words derived from mutation contexts with increased mutation bias in H. sapiens | | |
| TG | 20.10% | 10.67% |
| ACAG | 1.51% | −4.94% |
| ACTG | −2.07% | −0.46% |
| CCAA | −6.17% | −1.61% |
| Words containing mutation contexts with increased mutation bias in D. melanogaster | | |
| CCAC | 0.19% | 1.52% |
| CACC | 1.18% | −4.24% |
| CCCA | 5.63% | 0.09% |
| GCCA | −2.77% | 3.63% |
| ACC | 2.28% | −2.39% |
| CCA | 14.82% | 9.90% |
| Words derived from mutation contexts with increased mutation bias in D. melanogaster | | |
| CCCC | −5.10% | 2.19% |
| GCCC | 1.66% | −1.41% |
| CCC | −12.66% | −7.78% |
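The value C defined in the table caption can be sketched in a few lines. The snippet below implements only the stated formula; how Exp(W) is derived from subword frequencies is not described in this record, so the expected frequency is taken as an input. The function names and the example frequencies are hypothetical.

```python
def contrast_percent(observed, expected):
    """C = (Obs(W) - Exp(W)) / Exp(W) * 100, as given in the table caption."""
    if expected <= 0:
        raise ValueError("expected frequency must be positive")
    return (observed - expected) / expected * 100.0


def observed_to_expected_ratio(contrast):
    """Invert the formula: Obs(W) / Exp(W) = 1 + C / 100."""
    return 1.0 + contrast / 100.0


# Hypothetical frequencies, for illustration only.
print(round(contrast_percent(observed=0.012, expected=0.010), 2))

# Reading a table entry: the CG value of -76.37% corresponds to this Obs/Exp ratio.
print(round(observed_to_expected_ratio(-76.37), 4))
```

Read this way, a contrast of −76.37% means the word occurs at roughly a quarter of its expected frequency, while a value near zero means the observed and expected frequencies nearly coincide.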