| Literature DB >> 35218359 |
Madeleine Oman1,2, Aqsa Alam3, Rob W Ness1,2,3.
Abstract
The rate of mutations varies >100-fold across the genome, altering the rate of evolution, and susceptibility to genetic diseases. The strongest predictor of mutation rate is the sequence itself, varying 75-fold between trinucleotides. The fact that DNA sequence drives its own mutation rate raises a simple but important prediction; highly mutable sequences will mutate more frequently and eliminate themselves in favor of sequences with lower mutability, leading to a lower equilibrium mutation rate. However, purifying selection constrains changes in mutable sequences, causing higher rates of mutation. We conduct a simulation using real human mutation data to test if 1) DNA evolves to a low equilibrium mutation rate and 2) purifying selection causes a higher equilibrium mutation rate in the genome's most important regions. We explore how this simple process affects sequence evolution in the genome, and discuss the implications for modeling evolution and susceptibility to DNA damage.Entities:
Keywords: base composition; mutability, sequence context; mutation rate; purifying selection
Mesh:
Substances:
Year: 2022 PMID: 35218359 PMCID: PMC8920511 DOI: 10.1093/gbe/evac032
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.Relationship of trinucleotide mutability with trinucleotide variation in early (10% mutational coverage) and late (200% mutational coverage) stage simulations. Panels (A) and (B) display the number of times a trinucleotide is chosen to mutate in the simulation (filled) and how many times a trinucleotide is produced from a mutation event (open) in early (A) and late (B) stage simulations. Panels (C) and (D) display average trinucleotide change proportional to initial frequencies in early (C) and late (D) stage simulations. Gray horizontal line denotes the zero mark, indicating no change from the initial state. Negative values indicate decrease in frequency from the initial state, and positive values denote increases in frequency. Data for all panels was generated using ∼100 Kbp of randomly generated sequence with no purifying selection (n = 10).
Fig. 2.Sliding window of mutability from a simulated chromosome. The chromosome consists of ∼100 Kbp of coding (blue) and noncoding (white) regions at a ∼1:1 ratio in an alternating pattern, simulated for 200k iterations. Mutability is calculated as log10 values from the frequencies of trinucleotides in nonoverlapping windows of 1 kb. Dark gray ribbon represents standard error between replicate simulations (n = 10). The dotted and dashed lines represent the average mutability for all coding and non-coding regions, respectively.