| Literature DB >> 29138394 |
Elizabeth Aston1, Alastair Channon2, Roman V Belavkin3, Danna R Gifford4, Rok Krašovec4, Christopher G Knight4.
Abstract
The critical mutation rate (CMR) determines the shift between survival-of-the-fittest and survival of individuals with greater mutational robustness ("flattest"). We identify an inverse relationship between CMR and sequence length in an in silico system with a two-peak fitness landscape; CMR decreases to no more than five orders of magnitude above estimates of eukaryotic per base mutation rate. We confirm the CMR reduces exponentially at low population sizes, irrespective of peak radius and distance, and increases with the number of genetic crossovers. We also identify an inverse relationship between CMR and the number of genes, confirming that, for a similar number of genes to that for the plant Arabidopsis thaliana (25,000), the CMR is close to its known wild-type mutation rate; mutation rates for additional organisms were also found to be within one order of magnitude of the CMR. This is the first time such a simulation model has been assigned input and produced output within range for a given biological organism. The decrease in CMR with population size previously observed is maintained; there is potential for the model to influence understanding of populations undergoing bottleneck, stress, and conservation strategy for populations near extinction.Entities:
Mesh:
Year: 2017 PMID: 29138394 PMCID: PMC5686101 DOI: 10.1038/s41598-017-14628-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Two-peak fitness landscape with one narrow peak of high fitness (peak 0), and one broader peak of lower fitness (peak 1). Each step on the x axis represents a single base mutation. Diagram adapted from Wilke[4].
Mutation rates for various eukaryotic species.
| Species | Genome size (Mbp) | Mutation rate | Base/genome | Unit | Source |
|---|---|---|---|---|---|
| Human | 3080 |
| Per base | Per generation |
|
| Human | 3080 |
| Per genome | Per generation |
|
| Human | 3080 |
| Per base | Per cell division |
|
| Human | 3080 |
| Per genome | Per cell division |
|
| Human (Y chromosome) | 58 |
| Per base | Per generation |
|
| Human, chimpanzee | 3080 | 3 | Per genome | Per generation |
|
|
| 120 |
| Per base | Per generation |
|
|
| 120 |
| Per genome | Per generation |
|
|
| 120 |
| Per genome | Per generation |
|
|
| 120 |
| Per base | Per cell division |
|
| Quail, chicken | 1050 |
| Per genome | Per generation |
|
| Sheep, cow | 2870 |
| Per genome | Per generation |
|
| Old World Monkey | 1.9 | Per genome | Per generation |
| |
| Mouse, rat | 2640 |
| Per genome | Per generation |
|
| Mouse | 2640 |
| Per base | Per cell division |
|
| Mouse | 2640 |
| Per base | Per generation |
|
|
| 12.1 |
| Per base | Per generation |
|
|
| 12.1 |
| Per base | Per cell division |
|
| Average mammalian |
| Per base | Per genome/year |
| |
| Mammalian upper bound |
| Per base | Per genome/year |
| |
|
| 100 |
| Per base | Per generation |
|
|
| 100 | 2.9 | Per genome | Per generation |
|
|
| 157 |
| Per base | Per generation |
|
|
| 157 |
| Per base | Per generation |
|
Mutation rate estimates were obtained by comparing pseudogenes (genes that do not code for proteins or are never expressed) in humans and chimpanzees[49], combining the results of theoretical and empirical studies[55], mutation accumulation and radiation experiments[51], direct sequencing of the human Y chromosome[61], computational analysis of genes from species of placental mammals[55], whole-genome shotgun sequencing of mutation accumulation lines of the fruit fly Drosophila melanogaster [48,51] and the nematode worm Caenorhabditis elegans [53], examination of sequence variation in the human genome[56], scanning the mitochondrial genome of D. melanogaster [47], study of the complete genome of five Arabidopsis thaliana lines[54], and complete genome sequencing of the yeast Sacccharomyces cerevisiae [52]. Other sources listed contain a range of estimates[57,58].
Figure 2CMR when the simulation model was run for one gene with sequence lengths of 30 up to 150,000 bp (shown in the legend) for population sizes 10 up to 1,000. Peak 0 had a radius of 2 and peak 1 a radius of 5. The Hamming distance between the peaks was 10. The exponential lines were obtained by curve-fitting using R with a least squares method. Error bars are not plotted, but are small, for instance (–, 95% confidence interval) and (–, 95% confidence interval) for uppermost (30 bp) and lowest (150,000 bp) points (population size 900) respectively.
Figure 3CMR plotted against population size for varying values of scale parameter S. Population size was varied from 10 up to 1,000 and each individual consisted of 1 gene of 1,000 bp in length. Peak 0 and 1 were given a radius of 2 and 5 respectively, while the distance between their peaks was set to 10. (a) CMR plotted for population sizes 10 up to 1,000. The radius of the peaks was initially set to 2 and 5 for peak 0 and 1 respectively, with the distance between the top of the peaks set to 10. These values were then scaled by parameter S (shown in the legend). The exponential lines were obtained by curve-fitting using R with a least squares method. Error bars are not plotted, but are small, for instance (–, 95% confidence interval) and (–, 95% confidence interval) for uppermost (S = 10) and lowest (S = 1) points (population size 900) respectively. (b) CMR plotted for varying values of S when population size is 1,000.
Genetic distances between alleles for various genes.
| Distance between alleles | Unit | Gene | Source |
|---|---|---|---|
| 2.02 | % difference | Adh1-1F and Adh1-1S alleles in maize |
|
| 1 | Amino acid | Rice blast resistance (R) gene Pi-ta |
|
| 13 | Single nucleotide polymorphisms (SNPs) | Pikh allele for rice varieties |
|
| 7 | |||
| 7 | |||
| 19 | |||
| 45 | |||
| 48 | |||
| 54 | |||
| 19 | |||
| 56 | |||
| 47 | |||
| 19 | Base pairs | Adh1 alleles of wild barley |
|
| 0, 1, 1 | Synonymous, non-synonymous, non-coding SNPs | HMGCR |
|
| 3, 2, 1 | HSD3B1 | ||
| 1, 0, 0 | HTR1EL | ||
| 2, 3, 9 | HTR2A | ||
| 0, 1, 0 | HTR2C | ||
| 2, 0, 0 | HTR5A | ||
| 1, 0, 0 | HTR6 | ||
| 0, 0, 0 | HTR7 | ||
| 0, 0, 8 | IGF1 | ||
| 0, 0, 1 | IGF2 | ||
| 4, 3, 0 | ITGA2B | ||
| 4, 3, 0 | ITGB3 | ||
| 0, 1, 2 | KLK2 | ||
| 3, 0, 0 | LCAT | ||
| 7, 3, 0 | LDLR | ||
| 4, 3, 4 | LIPC | ||
| 1, 1, 0 | LPL | ||
| 1, 0, 0 | MAOA | ||
| 1, 0, 0 | MAOB | ||
| 1, 2, 1 | MPL | ||
| 1, 1, 5 | NGFB | ||
| 1, 0, 0 | NT3 | ||
| 5, 2, 0 | NTRK1 | ||
| 2, 0, 4 | PACE | ||
| 1, 2, 1 | PAI1 | ||
| 5, 4, 5 | PAI2 | ||
| 1, 3, 1 | PC1 | ||
| 5, 5, 4 | PCI | ||
| 0, 0, 0 | POMC | ||
| 1, 1, 1 | PRL | ||
| 3, 0, 0 | PROC | ||
| 1, 0, 0 | PROS1 | ||
| 0, 2, 0 | PTAFR | ||
| 1, 0, 2 | PTH | ||
| 0, 0, 13 | PTHLH | ||
| 5, 8, 0 | SELP | ||
| 1, 3, 1 | SHBG |
The data from Cargill et al.[35] represents polymorphisms in alleles for a subset of human genes (cross-section displayed below). Where there is more than one distance listed per source, the Unit and Source columns are left blank. Where there is multiple data for one gene, the Gene column is also left blank.
Figure 4(a) CMR plotted for varying number of crossovers. The number of crossover events per reproduction is given in the legend. The exponential lines were obtained by curve-fitting using R with a least squares method. Error bars are not plotted, but are small, for instance (–, 95% confidence interval) and (–, 95% confidence interval) for uppermost (5 crossovers) and lowest (1 crossover) points (population size 900) respectively. (b) CMR plotted for varying number of chromosomes. The number of chromosomes each gene was split into is given in the legend. The exponential lines were obtained by curve-fitting using R with a least squares method. Error bars are not plotted, but are small, for instance (–, 95% confidence interval) and (–, 95% confidence interval) for uppermost (10 chromosomes) and lowest (1 chromosome) points (population size 900) respectively. Population size was varied from 10 up to 1,000 and each individual consisted of 1 gene of 1,000 bp in length. Peak 0 had a radius of 2 and peak 1 a radius of 5. The Hamming distance between the peaks was 10. The number of crossovers per reproduction was increased from 1 (as per previous experiments) to 5, as per the legend in (a). The number of chromosomes per gene (given in the legend) was increased from 1 (as per previous experiments) to 10, as per the legend in (b).
Figure 5CMR plotted alongside gene number for varying population sizes. Data are shown for population sizes 10 to 80 with results plotted on a log log scale. Peak 0 had a radius of 2 and peak 1 a radius of 5. The Hamming distance between the peaks was 10. Gene length was kept constant at 1,000, while gene number was doubled from 1 up to 8,192. The corresponding quadratic lines were obtained by curve-fitting using R. Error bars are not plotted, but are small, for instance (–, 95% confidence interval) and (–, 95% confidence interval) for uppermost (population size 80) and lowest (population size 10) points (gene number 8192) respectively. A line representing 1/L, where L is gene length, is plotted for reference. Population sizes shown represent the steep part of the curve in Fig. 2 before it levels out. Population size 10 was also run with 25,000 genes, the correct range for the plant A. thaliana. Gene length was set to 1,000 bp to match the other runs or 2,000 bp to bring it closer to A. thaliana’s gene length. For reference, the range of per base mutation rates from Table 1 is shown for A. thaliana, Caenorhabditis elegans (nematode worm), Drosophila melanogaster (fruit fly), and humans (with gene number estimates from[24,64,65] and[27] respectively).
Crossover rates in A. thaliana taken from Table 1 in Giraut et al.[26].
| Female | Male | Ratio Male/Female | |
|---|---|---|---|
| Number of COs analysed | 5003 | 8532 | |
| Size genetic map (cM) | 332 | 575 | |
| COs per cell | 6.65 | 11.15 | 1.67 |
| COs per chromosome 1 bivalent | 1.63 | 2.85 | 1.75 |
| COs per chromosome 1 bivalent corrected* | 1.63 | 3.18 | 1.95 |
| COs per chromosome 2 bivalent | 1.19 | 1.89 | 1.58 |
| COs per chromosome 3 bivalent | 1.29 | 2.14 | 1.66 |
| COs per chromosome 4 bivalent | 1.10 | 1.71 | 1.56 |
| COs per chromosome 5 bivalent | 1.44 | 2.58 | 1.79 |
*Values given for male bivalent with and without correction for segregation bias (see Giraut et al.[26]).