Mikhail I Katsnelson1, Yuri I Wolf2, Eugene V Koonin3
1. Institute for Molecules and Materials, Radboud University, 6525AJ Nijmegen, The Netherlands; M.Katsnelson@science.ru.nl
2. National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894
3. National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD 20894; koonin@ncbi.nlm.nih.gov
Abstract
Is evolution always gradual or can it make leaps? We examine a mathematical model of an evolutionary process on a fitness landscape and obtain analytic solutions for the probability of multimutation leaps, that is, several mutations occurring simultaneously, within a single generation in 1 genome, and being fixed all together in the evolving population. The results indicate that, for typical, empirically observed combinations of the parameters of the evolutionary process, namely, effective population size, mutation rate, and distribution of selection coefficients of mutations, the probability of a multimutation leap is low, and accordingly the contribution of such leaps is minor at best. However, we show that, taking sign epistasis into account, leaps could become an important factor of evolution in cases of substantially elevated mutation rates, such as stress-induced mutagenesis in microbes. We hypothesize that stress-induced mutagenesis is an evolvable adaptive strategy.
A venerable principle of natural philosophy, most consistently propounded by Leibnitz (1) and later embraced by prominent biologists, in particular Linnaeus (2), is “natura non facit saltus” (“nature does not make leaps”). This principle then became one of the key tenets of Darwin’s theory that was inherited by the modern synthesis of evolutionary biology. In evolutionary biology, the rejection of saltation takes the form of gradualism, that is, the notion that evolution proceeds gradually, via accumulation of “infinitesimally small” heritable changes (3, 4). However, some of the most consequential evolutionary changes, such as the emergence of major taxa, seem to occur abruptly rather than gradually, prompting hypotheses on the importance of saltational evolution, for example by Goldschmidt (“hopeful monsters”) and Simpson (“quantum evolution”). Subsequently, these ideas have received a more systematic, even if qualitative, treatment in the concepts of punctuated equilibrium (5, 6) and evolutionary transitions (7, 8).

Within the framework of modern evolutionary biology, gradualism corresponds to the weak-mutation limit, that is, an evolutionary regime in which mutations occur one by one, consecutively, such that the first mutation is assessed by selection and either fixed or purged from the population before the second mutation occurs (9). A radically different, saltational mode of evolution (10, 11) is conceivable under the strong-mutation limit (9), whereby multiple mutations occurring within a single generation and in the same genome potentially could be fixed all together. Under the fitness landscape concept (12, 13), gradual or more abrupt evolutionary processes can be depicted as distinct types of trajectories on fitness landscapes (Fig. 1). The typical evolutionary paths on such landscapes are thought to be uphill mutational walks that proceed one step at a time (12). In small populations, where genetic drift becomes an important evolutionary factor, the likelihood of downhill movements becomes nonnegligible (14). In principle, however, a different type of move on fitness landscapes could occur, namely, leaps (or “flights”) across valleys, when a population moves to a different area of the landscape, for example to the slope of a different, higher peak, via simultaneous fixation of multiple mutations (Fig. 1).
Fig. 1.
Walks and leaps on different types of fitness landscapes. Dots show genome states; blue (short straight) arrows indicate consecutive moves via fixation of single mutations; red (long curved) arrows indicate multimutation leaps. (A) Nearly neutral landscape. (B) Landscape dominated by slightly deleterious mutations. (C) Kimura’s model landscape (a fraction of mutations is neutral; the rest are lethal). (D) Landscape combining beneficial and deleterious mutations.
We sought to obtain analytically, within the population genetics framework, the conditions under which multimutational leaps might be feasible. The results suggest that, under most typical parameters of the evolutionary process, leaps cannot be fixed. However, taking sign epistasis into account, we show that saltational evolution could become relevant under conditions of elevated mutation rate under stress so that stress-induced mutagenesis could be considered an evolvable adaptation strategy.
Results
Multimutation Leaps in the Equilibrium Regime.
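Let us assume (binary) genomes of length L (in the context of this analysis, L should be construed as the number of evolutionarily relevant sites, such as codons in protein-coding genes, rather than the total number of sites), the probability of single mutation μ << 1 per site per round of replication (generation), and constant effective population size N >> 1. Then, the transition probability from sequence i to sequence j is (equation 3.11 in ref. 15)

$$q_{ij} = \mu^{h_{ij}} (1-\mu)^{L-h_{ij}},$$

where $h_{ij}$ is the Hamming distance (number of different sites between the 2 sequences). The number of sequences separated by the distance h is equal to the number of ways h sites can be selected from L, that is,

$$N_h = \frac{L!}{h!\,(L-h)!} \approx \frac{L^h}{h!},$$

where the last, approximate expression is valid under the assumption that L >> 1 and L >> h (h can be of the order of 1).

Assuming also µ << 1, we obtain a typical combinatorial probability of leaps over the distance h:

$$Q(h) = \frac{(L\mu)^h}{h!}\, e^{-L\mu},$$

which is a Poisson distribution with the expectation Lμ.

In steady state, the probability of fixation of the state i is proportional to $\exp(\nu x_i)$, where ν is of the order of the effective population size N,

$$x_i = \ln f_i,$$

and where $f_i$ is the fitness of the genotype i [$-x_i$ is analogous to energy in the Boltzmann distribution within the analogy between population genetics and statistical physics (16)]. For other demographic structures and assumptions on the mutation process, the expression for the fixation probability can differ quantitatively while retaining the same form. In particular, the case of a population that produces offspring by binary division (fission) is treated in refs. 17 and 18.

Then, the rate of the occurrence and fixation of the transition $i \to j$ is (15)

$$W_{ij} = q_{ij}\, \frac{\nu\,(x_i - x_j)}{\exp[\nu\,(x_i - x_j)] - 1}.$$

The distribution function of the fitness differential $\Delta = x_i - x_j$ has to be specified (hereafter, we refer to x as fitness, omitting logarithm for brevity). We analyze first the case without epistasis, that is, with additive fitness effects of individual mutations:

$$\Delta = \sum_{k=1}^{h} \Delta_k,$$

where $\Delta_k$ are independent random variables with the distribution functions $G_k(\Delta_k)$. Then, the distribution function of the fitness difference is

$$P_h(\Delta) = \int \prod_{k=1}^{h} d\Delta_k\, G_k(\Delta_k)\; \delta\Big(\Delta - \sum_{k=1}^{h} \Delta_k\Big) = \int_{-\infty}^{\infty} \frac{d\lambda}{2\pi}\, e^{i\lambda\Delta} \prod_{k=1}^{h} \int d\Delta_k\, G_k(\Delta_k)\, e^{-i\lambda\Delta_k},$$

which is obtained by using the standard Fourier transformation of the delta function.

Now, let us specify the distribution of the fitness effects of mutations $G_k(\Delta_k)$, assuming an exponential dependency of the probability of a mutation on its fitness effect, separately for beneficial and deleterious mutations:

$$G_k(\Delta) = A_k \left[ \theta(\Delta)\, e^{-\varepsilon_k \Delta} + r_k\, \theta(-\Delta)\, e^{\varepsilon_k \Delta} \right],$$

where θ is the Heaviside step function, $A_k$ is the normalization factor, $r_k$ is the ratio of the probabilities of beneficial and deleterious mutations, and $\varepsilon_k$ is the inverse of the characteristic fitness difference for a single mutation (discussed below). For simplicity, we assume here the same decay rates for the probability density of the fitness effects of beneficial and deleterious mutations. Empirical data on the distributions of fitness effects of mutations (19, 20) clearly indicate that $r_k \ll 1$. From the normalization condition,

$$A_k = \frac{\varepsilon_k}{1 + r_k}.$$

Note that the mean of the fitness difference (selection coefficient) when the distribution of the fitness effects is given by the expression for $G_k$ above is

$$\langle \Delta \rangle = \frac{1 - r_k}{(1 + r_k)\,\varepsilon_k} \approx \frac{1}{\varepsilon_k},$$

so that $|s| \approx 1/\varepsilon_k$. For simplicity, we start with an assumption that the values of $\varepsilon_k$ and $r_k$ are independent of k ($\varepsilon_k = \varepsilon$, $r_k = r$). Then, from the expression for $W_{ij}$, the fixation rate of an h-mutation leap is equal to

$$W(h) = Q(h) \int_{-\infty}^{\infty} d\Delta\; P_h(\Delta)\, \frac{\nu\Delta}{e^{\nu\Delta} - 1}.$$

Substituting the exponential model for $G_k$ into the Fourier representation of $P_h$, we obtain

$$P_h(\Delta) = \left( \frac{\varepsilon}{1+r} \right)^{h} \int_{-\infty}^{\infty} \frac{d\lambda}{2\pi}\, e^{i\lambda\Delta} \left[ \frac{1}{\varepsilon + i\lambda} + \frac{r}{\varepsilon - i\lambda} \right]^{h}.$$

Consider first the case r = 0 (all mutations are deleterious). Then, $P_h(\Delta) = 0$ for Δ < 0. For Δ > 0, that is, decrease of the fitness, we have

$$P_h(\Delta) = \frac{\varepsilon^{h} \Delta^{h-1}}{(h-1)!}\, e^{-\varepsilon\Delta}.$$

Then, the integral in the expression for W(h) is equal to

$$h\, y^{h}\, \zeta(h+1,\, 1+y),$$

where $y = \varepsilon/\nu = 1/(\nu|s|)$ and $\zeta(z, a) = \sum_{n=0}^{\infty} (n+a)^{-z}$ is the Hurwitz zeta function. Therefore, the rate of fixation for leaps of the length h is equal to

$$W(h) = Q(h)\, h\, y^{h}\, \zeta(h+1,\, 1+y) = e^{-L\mu}\, \frac{(L\mu\, y)^{h}}{(h-1)!}\, \zeta(h+1,\, 1+y).$$

In one extreme, if $y \gg 1$ ($\nu|s| \ll 1$, neutral landscape), $W(h) \approx Q(h)$, and mutations are fixed at the rate they occur. In the opposite extreme case of strong negative selection ($y \ll 1$), $\zeta(h+1, 1+y) \approx \zeta(h+1)$, where $\zeta(h+1)$ is the Riemann zeta function. For a rough estimate, $\zeta(h+1)$ can be replaced by 1, and then, $W(h) \approx e^{-L\mu} (L\mu\, y)^{h}/(h-1)!$. In this case, the maximum of W(h) is reached at $h \approx L\mu\, y$, which gives a nonnegligible fraction of multimutation leaps among the fixed mutations only for $L\mu \gtrsim \nu|s|$. However, in this case, the value of W(h) at this maximum is exponentially small because it contains the factor $e^{-L\mu(1-y)}$ with $L\mu \gg 1$. 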
Therefore, in the regime of strong selection against deleterious mutations and at high mutation rates, multiple mutations actually dominate the mutational landscape, but their fixation rate is extremely low. Qualitatively, this conclusion seems obvious, but we now obtain the quantitative criteria for what constitutes “strong selection.” We find that, even for ν|s| = 10, the rate of multimutation leaps can be nonnegligible (>10^−4 per generation; Fig. 2) at the optimal Lμ values, whereas for ν|s| = 100, any leaps with h > 1 are unfeasible (Fig. 2).
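The rate of leap fixation on a purely deleterious landscape can be explored numerically. The following is a minimal sketch, not the authors' code. It assumes that the fixation rate of an h-mutation leap has the form W(h) = Q(h) · h · y^h · ζ(h+1, 1+y), with Q(h) the Poisson probability of h simultaneous mutations, y = 1/(ν|s|), and ζ the Hurwitz zeta function; this form is our reading of the text and should be treated as an assumption. The function names and the truncation parameters (`n_terms`, `h_max`) are illustrative choices.

```python
import math


def poisson_leap_prob(h, genome_rate):
    """Q(h): probability that exactly h mutations arise in one genome per generation."""
    return genome_rate ** h * math.exp(-genome_rate) / math.factorial(h)


def hurwitz_zeta(z, a, n_terms=1000):
    """Hurwitz zeta: sum over n >= 0 of (n + a)**(-z), for z > 1 and a > 0.

    The first n_terms are summed directly; the remainder is estimated with
    the leading Euler-Maclaurin terms (integral, half-term, first derivative).
    """
    head = sum((n + a) ** (-z) for n in range(n_terms))
    b = n_terms + a
    tail = b ** (1 - z) / (z - 1) + 0.5 * b ** (-z) + z * b ** (-z - 1) / 12
    return head + tail


def leap_fixation_rate(h, genome_rate, nu_s):
    """Assumed form W(h) = Q(h) * h * y**h * zeta(h + 1, 1 + y), with y = 1/(nu*|s|)."""
    y = 1.0 / nu_s
    return poisson_leap_prob(h, genome_rate) * h * y ** h * hurwitz_zeta(h + 1, 1.0 + y)


def multimutation_rate(genome_rate, nu_s, h_max=30):
    """Total fixation rate of leaps with h >= 2 (sum truncated at h_max)."""
    return sum(leap_fixation_rate(h, genome_rate, nu_s) for h in range(2, h_max + 1))


if __name__ == "__main__":
    for nu_s in (10, 100):
        for genome_rate in (0.1, 1.0, 3.0, 10.0):
            single = leap_fixation_rate(1, genome_rate, nu_s)
            multi = multimutation_rate(genome_rate, nu_s)
            print(f"nu|s|={nu_s:>3}  L*mu={genome_rate:>4}  "
                  f"single={single:.2e}  multi={multi:.2e}")
```

Under this assumed form, Lμ = 1 and ν|s| = 100 give a total multimutation rate of roughly 4 × 10^−5 per generation and a single-mutation fixation rate of roughly 6 × 10^−3, in line with the order-of-magnitude estimates quoted in the Discussion.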
Fig. 2.
Rates of leaps on a landscape dominated by deleterious mutations. Rates of transitions are plotted against the per-genome mutation rate (Lμ) and the leap length for different strengths of selection (A: ν|s| = 10 and B: ν|s| = 100). Contour lines indicate orders of magnitude and start from the rate of 10^−5 leaps per generation.
Under a more realistic model, all values of (the inverse of the fitness effect of a mutation) are different. For and (no beneficial mutations), using Eq. , we get

For example, in Kimura’s neutral evolution model (21), is a binary random variable that takes a value of (, neutral mutation), with the probability , and a value of 0 (, lethal mutation), with the probability . Then, , and is replaced with in Eq. , a trivial replacement of the total genome length with the length of the part of the genome where mutations are allowed, . Accordingly, , and multimutation leaps become relevant for .

Let us now estimate the probability of leaps with beneficial mutations . Assuming (rare beneficial mutations), Eq. takes the form

and the fixation rate of a leap including beneficial mutations is

If (weak positive selection), , so that the role of beneficial mutations is negligible. If (strong positive selection),

Comparing Eq. with the result for (Eq. ), one can see that, in this case, beneficial mutations are predominant among the fixed mutations if

In this regime, multimutation leaps occur at nonnegligible rates under sufficiently high (but not excessive) mutation rates (Fig. 3).
Fig. 3.
Rates of leaps on a landscape combining beneficial and deleterious mutations. Rates of leaps are plotted against the per-genome mutation rate (Lμ) and the leap length for different strengths of selection (A and C: ν|s| = 10; B and D: ν|s| = 100) and for different frequencies of beneficial mutations (A and B: r = 10^−4; C and D: r = 10^−3). Contour lines indicate orders of magnitude and start from the rate of 10^−5 leaps per generation.
The model considered above assumes independent effects of different mutations (no epistasis, “ideal gas of mutations” model). Now, let us take into account epistasis. In the case of strong epistasis, effects of combinations of different mutations are increasingly strong, diverse, and, effectively, unpredictable, resulting in a rugged fitness landscape (22). In the limit of epistasis strength and unpredictability, epistasis creates numerous highly beneficial combinations that, once they occur, are highly likely to be fixed, and a far greater number of highly deleterious combinations that are immediately lethal. Due to the effective randomness of genetic interactions, we consider the resulting landscape as essentially random for , with the frequency of the beneficial combinations independent of . In this case, the effective number of fixed leaps with is simply

If all single mutations are deleterious , their rate of fixation (Eq. ) can be approximated by , whereas for all leaps of the length , effective number of fixed leaps is . Therefore, the condition for is

In the high-mutation regime , multiple mutations occur orders of magnitude more frequently than single mutations, overwhelming the difference of scale between and , and making multimutation leaps much more likely. In the low-mutation regime , the balance between single and multiple mutations tends to and the condition for the dominance of multimutation leaps becomes . 
Around the Eigen threshold (23), the condition corresponds to , that is, the frequency of beneficial multimutation combinations should be unrealistically high to sustain evolution by multimutation leaps.
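The balance between multiple and single mutations in the two regimes follows directly from the Poisson distribution of the number of mutations per genome. The sketch below is illustrative only: it assumes a Poisson distribution with expectation Lμ, as stated earlier in this section, and the function name is hypothetical. It computes the ratio of the probability of two or more mutations to the probability of exactly one mutation, which algebraically reduces to (e^x − 1 − x)/x for x = Lμ.

```python
import math


def multi_to_single_ratio(genome_rate):
    """Ratio P(h >= 2) / P(h = 1) for a Poisson number of mutations per genome.

    Algebraically (1 - e**-x - x*e**-x) / (x*e**-x) = (e**x - 1 - x) / x.
    expm1 keeps the numerator accurate for small x.
    """
    x = genome_rate
    return (math.expm1(x) - x) / x


if __name__ == "__main__":
    for genome_rate in (0.001, 0.01, 0.1, 1.0, 3.0, 10.0):
        ratio = multi_to_single_ratio(genome_rate)
        print(f"L*mu={genome_rate:>6}: multi/single = {ratio:.3g}")
```

For small Lμ the ratio approaches Lμ/2, so multiple mutations are rare relative to single ones, whereas for Lμ of about 10 they outnumber single mutations by more than three orders of magnitude.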
A Nonequilibrium Model of Stress-Induced Mutagenesis.
The analysis presented above suggests that the necessary condition for fixation of multimutational leaps is the high-mutation regime. At low mutation rates , multimutation events occur too rarely to be fixed in realistic settings even if the frequency of beneficial combinations among them is reasonably high. However, in the high-mutation regime , the above analysis is problematic for 2 reasons. First, the expression for the fixation rate (Eq. ) is technically valid only for the case when the new mutation is either fixed or lost before the emergence of the next one, which implies . Second, under any realistic model of the fitness landscape, most mutations should be deleterious. Thus, implies that most of the progeny carries 1 or more mutations, and therefore suffers from these deleterious effects. Under these conditions, the assumption of constant N is unrealistic, because the size of such a population will decrease under the mutational load, down to an eventual crash.

The complete analysis of the behavior of a variable-size population under the high-mutation regime and strong mutational effects is currently beyond the state of the art. Therefore, here we analyze a simplified model of the short-term behavior of a (microbial) population after the onset of stress-induced mutagenesis .

Consider a microbial population consisting of individuals. Under typical conditions, the population is in an equilibrium, so that approximately individuals survive the average generation span and produce progeny by division (here we consider simple asexual division as the progeny-generating process whereby each surviving individual produces 2 offspring; other demographic models can be accommodated without loss of generality). The typical mutation rate is low (, according to refs. 24 and 25), so the population can be considered homogeneous. 
Upon the onset of unfavorable conditions, the survival rate of the wild-type individuals drops to and the mutation rate in the stressed individuals increases such that .

If is not too small , the immediate wild-type survivors produce first-generation progeny. With the expected number of mutations per descendant being , the distribution of the number of mutations in the progeny is given by the Poisson distribution with the expected number of mutants with mutations of .

Let us consider a mutation landscape that is dominated by deleterious mutations with strong sign epistasis. All single mutations are deleterious, so the survival of their carriers over the generation time is . An overwhelming majority of multimutation combinations have even stronger negative effects, so for . Some small fraction of these combinations, however, is strongly beneficial in the new conditions, conferring to their carriers the survival rate of .

What would the function look like? Intuitively, should decay to 0 at large , or at least not grow, as it is overwhelmingly likely that a sufficiently large set of mutations would contain a subset that is unconditionally lethal. Here, for simplicity, we consider a general form of that equals 0 for and monotonically decays with from at an arbitrary rate.

If the deleterious effect of mutations is strong enough , then the only plausible source of beneficial mutants is the population of wild-type individuals (neither single mutants nor multiple mutants that do not carry the beneficial combinations survive to the next generation). The population of the wild-type individuals decays exponentially through both the diminished survival and through mutations, reaching at the -th generation after the onset of the unfavorable conditions. 
Ignoring stochastic fluctuations, the total number of wild-type individuals that survive until the population collapse can be estimated as

which is approximately equal to if .

Over the combined lifetimes of the surviving wild-type individuals, the expected number of beneficial mutants is

which depends on the genome-wide mutation rate and the shape of the function.

Let us first consider the 2 extreme cases of . In the limit of a completely flat function ( for all ), Eq. gives . This function asymptotically reaches the value of with . In the other extreme of a rapidly decaying , that is, for , Eq. gives . This function reaches its maximum at with .

It can be shown that the estimates for all other monotonically decaying functions reach their maxima at finite values of with

Indeed, let us consider first the simplest model . Then, Eq. takes the form

As a function of , the quantity

reaches the maximum at , where is the solution of the equation

with the value at the maximum

For (rapid decay of , and . In the opposite limit of slowly decaying , , and .

For a general slowly decaying function , one can find that

Importantly, even in this case, the optimal mutation rate increases only logarithmically with the decay rate; furthermore, the optimum value is notably robust to changes in (Fig. 4).
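How the abundance of beneficial combinations depends on the mutation rate can be illustrated with a short numerical sketch. It is not the authors' calculation: it assumes only that the number of mutations per descendant is Poisson distributed with expectation Lμ and that a fraction r(h) of h-mutation combinations (h ≥ 2) is beneficial, so that the abundance is proportional to the sum of r(h) times the Poisson weight of h. Prefactors such as the number of surviving wild-type individuals are omitted. The three shapes of r(h) (`flat`, `steep`, `geometric`) and the decay factor 0.5 are hypothetical choices made for illustration.

```python
import math


def relative_abundance(genome_rate, r_shape, h_max=100):
    """Sum over h >= 2 of r_shape(h) * Poisson(h; genome_rate).

    r_shape(h) is the fraction of beneficial h-mutation combinations relative
    to its value at h = 2; population-size and survival prefactors are omitted.
    """
    term = genome_rate * math.exp(-genome_rate)  # Poisson weight of h = 1
    total = 0.0
    for h in range(2, h_max + 1):
        term *= genome_rate / h  # Poisson weight of h from that of h - 1
        total += r_shape(h) * term
    return total


def flat(h):
    """Completely flat r(h): the same fraction for every h >= 2."""
    return 1.0


def steep(h):
    """Steepest decay: only pairs of mutations can be beneficial."""
    return 1.0 if h == 2 else 0.0


def geometric(h, a=0.5):
    """Hypothetical intermediate case: r(h) shrinks by a factor a per extra mutation."""
    return a ** (h - 2)


def optimal_rate(r_shape, max_rate=20.0, step=0.01):
    """Grid search for the per-genome mutation rate that maximizes the abundance."""
    grid = [step * i for i in range(1, int(round(max_rate / step)) + 1)]
    return max(grid, key=lambda x: relative_abundance(x, r_shape))


if __name__ == "__main__":
    for name, shape in (("steep", steep), ("geometric", geometric), ("flat", flat)):
        best = optimal_rate(shape)
        print(f"{name:>9}: optimum L*mu ~ {best:.2f}, "
              f"relative abundance {relative_abundance(best, shape):.3f}")
```

With the steepest decay the optimum lies at Lμ = 2, and with the geometric shape it shifts only slightly upward. For the flat shape the abundance keeps growing toward its asymptote, so the grid search simply returns the upper end of the scanned range.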
Fig. 4.
Abundance of beneficial multimutation combinations depending on the mutation rate. Abundance of beneficial multimutation combinations, , given by Eq. , relative to . (A) with (blue), (orange), (green), and (red). (B) de with (blue), (orange), (green), and (red).
The approximate condition for population survival, , can be derived from Eqs. and and is bounded from below byat the optimal value of .
Discussion
Here, we obtained analytic expressions for the probability of the fixation of multimutation leaps for deleterious and beneficial mutations depending on the parameters of the evolutionary process, namely, effective genome size (L), mutation rate (μ), effective population size (N), and distribution of selection coefficients of mutations (s). Leaps in random fitness landscapes in the context of punctuated equilibrium have been previously considered for infinite (26, 27) or finite (28) populations. However, unlike the present work, these studies have focused on the analysis of the dynamics of the leaps rather than on the equilibrium distribution of their lengths. We further address the plausibility of beneficial multimutation leaps under epistasis and outside of equilibrium, for example in a microbial population under stress.

The principal outcomes of the present analysis are the conditions under which multimutation leaps are fixed at a nonnegligible rate in different evolutionary regimes (Fig. 5). If the landscape is completely flat (strict neutrality), the leap length is distributed around Lμ, that is, simply, the expected number of mutations per genome per generation. If , leaps are effectively impossible, and evolution can proceed only step by step (12). A considerable body of data exists on the values of each of the relevant parameters that define the probability of leaps. Generally, in the long term, the total expected number of mutations per genome per generation has to be of the order of 1 or lower (Eigen threshold) because, if , the population ultimately spirals into error catastrophe (it should be emphasized that error catastrophe, i.e., the loss of high-fitness genotypes through accumulation of deleterious mutations, is distinct from extinction catastrophe, i.e., loss of the entire population caused by deleterious mutation) (15, 23, 29–32). 
The selection for lower mutation rates is thought to be limited by the drift barrier and, accordingly, the genomic mutation rate appears to be inversely proportional to the effective population size, that is, (24, 25). Thus, , which appears to be an important universal in evolution.
Fig. 5.
Summary of the modeling results: evolution by multimutation leaps depending on the evolutionary regime and fitness landscape. (A) Multimutation leaps in the equilibrium regime. (B) First-generation multimutation leaps under stress-induced mutagenesis. The hatched area denotes the domain of the parameter space in which saltational evolution may or may not be possible depending on the r(h) value.
To estimate the leap probability, we can use the leap fixation rate derived above and the characteristic values of the relevant parameters, for example those for human populations. As a crude approximation, Lμ = 1, ν = 10^4, |s| = 10^−2, which, in the absence of beneficial mutations, translates into the probability of a multimutation leap of about 4 × 10^−5. Thus, such a leap would, on average, require over 23,000 generations, which is not a relevant value for the evolution of mammals (given that ∼140 single mutations are expected to be fixed during that time as calculated using the same formula). However, short leaps including beneficial mutations can occur with reasonable rates, such as 5 × 10^−4 for h = 3 and the frequency of beneficial mutations r = 10^−4, so such leaps are only 8 times less frequent than single-mutation fixations. Conceivably, such leaps of beneficial mutations could be a minor but nonnegligible evolutionary factor. For organisms with Lμ < 1 and larger ν, the probability of leaps is substantially lower than the above estimates, so that under “normal” evolutionary regimes (at equilibrium) the contribution of leaps is negligible.

However, in some biologically relevant and common situations, such as stress-induced mutagenesis, which occurs in microbes in response to double-stranded DNA breaks, the effective mutation rate can locally and temporarily increase by orders of magnitude (33, 34) while the population is going through a severe bottleneck (Fig. 5). 
If the fraction of beneficial combinations of mutations satisfies the condition (31), even in the extreme case when the rest of the mutations are lethal, the population has a chance to survive when its mutation rate (Lμ) assumes a value close to the optimum. This optimum depends on the rate of the decay of the fraction of beneficial combinations of mutations with the number of mutations. Specifically, the optimal value of Lμ equals 2 for the steepest decay of r(h) and increases logarithmically slowly for more shallow functions. Under an extremely severe stress (N0 = 10^9, f = 10^−3), the survival threshold [r(h)] corresponds to the fraction of beneficial pairs of mutations of about 3 × 10^−6. This means that, in the case of a typical bacterial genome of 3 × 10^6 base pairs, for each (deleterious) mutation, there is, on average, 1 other mutation that yields a beneficial combination. This estimate pertains to the extreme case when all individual mutations are highly deleterious. Under more realistic conditions, when many mutations are effectively neutral, and a small fraction is beneficial, the threshold fraction of beneficial combinations will be considerably lower. These estimates indicate that multimutation leaps are likely to be an important factor of adaptive evolution under stress. An implication of these findings is that stress-induced mutagenesis could be a selectable adaptive mechanism, however controversial an issue the evolution of evolvability might be (35–39). It should be further noted that, in this situation, large populations will have a higher innovation potential than small populations because the former produce a greater diversity of multimutation combinations. In other terms, large populations have a greater chance to cross the entropy barrier to higher fitness genotypes (40). 
Thus, the stress-induced innovation regime is an alternative to innovation by drift that occurs, primarily, in small populations (during population bottlenecks) (14, 25). This conclusion complements the previous findings that large populations can readily cross fitness valleys through a series of consecutive mutations when the intermediate states are close to neutrality (41).

Remarkably, experiments on adaptive evolution of bacterial populations revealed repeated emergence of hypermutators (i.e., clones carrying mutations in repair genes that greatly increase the mutation rate) (42–44), resulting, in some cases, in simultaneous fixation of “cohorts” of beneficial mutations (45). Furthermore, subsequent analyses have shown that mutator genotypes exist only transiently but exert long-lasting effects on the population evolution (46). These findings seem to provide direct experimental validation of the multimutational leaps predicted by our model.

A different context in which multimutation leaps potentially might play a role is evolution of cancers. In most tumor types, the mutation rate is elevated dramatically, by orders of magnitude, compared to normal tissues (47, 48). The effective population size in tumors is difficult to estimate, and therefore there is not enough information to use the condition (31) to assess the plausibility of multimutation leaps. Nevertheless, given the extremely high values of Lμ, it cannot be ruled out that the frequency of leaps is nonnegligible. Most of the mutations in tumors are passengers that have no effect on cancer progression or exert a deleterious effect (49, 50). Traditionally, tumorigenesis is thought to depend on several driver mutations that occur consecutively (51, 52). This is indeed likely to be the case in many tumors because the age of onset strongly and positively correlates with the number of drivers (53, 54). 
However, for a substantial fraction of tumors, no drivers are readily identifiable, suggesting the possibility that, in these cases, tumor progression is driven by “epistatic drivers” (53), that is, combinations of mutations that might occur by leaps.

Another, completely different area where multimutation leaps could be important is the evolution of primordial replicators, in particular those in the hypothetical RNA world, that are thought to have had an extremely low replication fidelity, barely above the error catastrophe threshold (23, 55, 56). Furthermore, because the primordial replicators are likely to have been incompletely optimized, the fraction of beneficial mutational combinations could be relatively high. Under these conditions, multimutational leaps could have been an important route of evolutionary acceleration and thus might have contributed substantially to the most challenging evolutionary transition of all, that from precellular to cellular life forms.

An important caveat of the above conclusions on the biological relevance of multimutational leaps is that the present analysis disregards clonal interference, that is, competition between clades in an evolving population, that plays a substantial role in the evolution of large populations under the high-mutation regime as indicated by both theory (17, 57, 58) and experiment (45, 59). Clearly, clonal interference has the potential to dampen the effect of multiple mutations. Nevertheless, it appears likely that a clone with multiple mutations would be a strong competitor under strong selection pressure, for example in the case of stress-induced mutagenesis.

Taken together, all these biological considerations suggest that multimutation leaps with a beneficial effect, the probability of which we show to be nonnegligible under conditions of elevated mutagenesis, could be an important mechanism of evolution that so far has been largely overlooked. 
Given that elevated mutation rate caused by stress is pervasive in nature, saltational evolution, after all, might substantially contribute to the history of life, in direct defiance of “Natura non facit saltus.”