Jie Lin1, Michael Manhart2,3, Ariel Amir4. 1. John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138. 2. Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts 02138. 3. Institute of Integrative Biology, ETH Zurich, 8092 Zurich, Switzerland. 4. John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, Massachusetts 02138 arielamir@seas.harvard.edu.
Abstract
Selection of mutants in a microbial population depends on multiple cellular traits. In serial-dilution evolution experiments, three key traits are the lag time when transitioning from starvation to growth, the exponential growth rate, and the yield (number of cells per unit resource). Here, we investigate how these traits evolve in laboratory evolution experiments using a minimal model of population dynamics, where the only interaction between cells is competition for a single limiting resource. We find that the fixation probability of a beneficial mutation depends on a linear combination of its growth rate and lag time relative to its immediate ancestor, even under clonal interference. The relative selective pressure on growth rate and lag time is set by the dilution factor; a larger dilution factor favors the adaptation of growth rate over the adaptation of lag time. The model shows that yield, however, is under no direct selection. We also show how the adaptation speeds of growth and lag depend on experimental parameters and the underlying supply of mutations. Finally, we investigate the evolution of covariation between these traits across populations, which reveals that the population growth rate and lag time can evolve a nonzero correlation even if mutations have uncorrelated effects on the two traits. Altogether these results provide useful guidance to future experiments on microbial evolution.
Selection of mutants in a microbial population depends on multiple cellular traits. In serial-dilution evolution experiments, three key traits are the lag time when transitioning from starvation to growth, the exponential growth rate, and the yield (number of cells per unit resource). Here, we investigate how these traits evolve in laboratory evolution experiments using a minimal model of population dynamics, where the only interaction between cells is competition for a single limiting resource. We find that the fixation probability of a beneficial mutation depends on a linear combination of its growth rate and lag time relative to its immediate ancestor, even under clonal interference. The relative selective pressure on growth rate and lag time is set by the dilution factor; a larger dilution factor favors the adaptation of growth rate over the adaptation of lag time. The model shows that yield, however, is under no direct selection. We also show how the adaptation speeds of growth and lag depend on experimental parameters and the underlying supply of mutations. Finally, we investigate the evolution of covariation between these traits across populations, which reveals that the population growth rate and lag time can evolve a nonzero correlation even if mutations have uncorrelated effects on the two traits. Altogether these results provide useful guidance to future experiments on microbial evolution.
LABORATORY evolution experiments in microbes have provided insight into many aspects of evolution (Elena and Lenski 2003; Barrick and Lenski 2013; Desai 2013), such as the speed of adaptation (Wiser ), the nature of epistasis (Kryazhimskiy ), the distribution of selection coefficients from spontaneous mutations (Levy ), mutation rates (Wielgoss ), the spectrum of adaptive genomic variants (Barrick ), and the preponderance of clonal interference (Lang ). Despite this progress, links between the selection of mutations and their effects on specific cellular traits have remained poorly characterized. Growth traits, such as the lag time when transitioning from starvation to growth, the exponential growth rate, and the yield (resource efficiency), are ideal candidates for investigating this question. Their association with growth means they have relatively direct connections to selection and population dynamics. Furthermore, high-throughput techniques can measure these traits for hundreds of genotypes and environments (Levin-Reisman ; Warringer ; Zackrisson ; Ziv ). Numerous experiments have shown that single mutations can be pleiotropic, affecting multiple growth traits simultaneously (Fitzsimmons ; Adkar ). More recent experiments have even measured these traits at the single-cell level, revealing substantial nongenetic heterogeneity (Levin-Reisman ; Ziv , 2017). Several evolution experiments have found widespread evidence of adaptation in these traits (Vasi ; Novak ; Reding-Roman ; Li ). This data altogether indicates that covariation in these traits is pervasive in microbial populations.There have been a few previous attempts to develop quantitative models to describe evolution of these traits. For example, Vasi considered data after 2000 generations of evolution in Escherichia coli to estimate how much adaptation was attributable to different growth traits. Smith (2011) developed a mathematical model to study how different traits would allow strains to either fix, go extinct, or coexist. Wahl and Zhu (2015) studied the fixation probability of mutations affecting different growth traits separately (nonpleiotropic), especially to identify which traits were most likely to acquire fixed mutations and the importance of mutation occurrence time and dilution factor. However, simple quantitative results that can be used to interpret experimental data have remained lacking. More recent work (Manhart ; Manhart and Shakhnovich 2018) derived a quantitative relation between growth traits and selection, showing that selection consists of additive components on the lag and growth phases. However, this did not address the consequences of this selection for evolution, especially the adaptation of trait covariation.In this work, we investigate a minimal model of evolutionary dynamics in which cells interact only by competition for a single limiting resource. We find that the fixation probability of a mutation is accurately determined by a linear combination of its change in growth rate and change in lag time relative to its immediate ancestor, rather than depending on the precise combination of traits; the relative weight of these two components is determined by the dilution factor. Yield, on the other hand, is under no direct selection. This is true even in the presence of substantial clonal interference, where the mutant’s immediate ancestor may have a large fitness difference from the population mean. We provide quantitative predictions for the speed of adaptation of growth rate and lag time as well as their evolved covariation. Specifically, we find that even in the absence of an intrinsic correlation between growth and lag due to mutations, these traits can evolve a nonzero correlation due to selection and variation in number of fixed mutations.
Materials and Methods
Model of population dynamics
We consider a model of asexual microbial cells in a well-mixed batch culture, where the only interaction between different strains is competition for a single limiting resource (Manhart ; Manhart and Shakhnovich 2018). Each strain k is characterized by a lag time , growth rate , and yield (see Figure 1A for a two-strain example). Here the yield is the number of cells per unit resource (Vasi ), so that is the amount of resources consumed by time t by strain k, where is the number of cells of strain k at time t. We define R to be the initial amount of the limiting resource and assume different strains interact only by competing for the limiting resource; their growth traits are the same as when they grow independently. When the population has consumed all of the initial resource, the population reaches stationary phase with constant size. The saturation time at which this occurs is determined by which we can write in terms of the growth traits aswhere is the total population size and is the frequency of each strain k at the beginning of the growth cycle. In Equation 1, we assume the time is longer than each strain’s lag time . Note that some of our notation differs from related models in previous work, some of which used g for growth rate and λ for lag time (Manhart ), while others used λ for growth rate (Lin and Amir 2017). Although it is possible to extend the model to account for additional growth traits such as a death rate or lag and growth on secondary resources, here we focus on the minimal set of traits most often measured in microbial phenotyping experiments (Novak ; Fitzsimmons ; Levin-Reisman ; Warringer ; Jasmin and Zeyl 2012; Ziv ; Zackrisson ; Adkar ).
Figure 1
Model of selection on multiple microbial growth traits. (A) Simplified model of microbial population growth characterized by three traits: lag time L, growth rate r, and yield Y. The total initial population size is and the initial frequency of the mutant (strain 2) is x. After the whole population reaches stationary phase (time ), the population is diluted by a factor D into fresh media, and the cycle starts again. (B) Phase diagram of selection on mutants in the space of their growth rate and lag time relative to wild type. The slope of the diagonal line is
Model of selection on multiple microbial growth traits. (A) Simplified model of microbial population growth characterized by three traits: lag time L, growth rate r, and yield Y. The total initial population size is and the initial frequency of the mutant (strain 2) is x. After the whole population reaches stationary phase (time ), the population is diluted by a factor D into fresh media, and the cycle starts again. (B) Phase diagram of selection on mutants in the space of their growth rate and lag time relative to wild type. The slope of the diagonal line isWe define the selection coefficient between each pair of strains as the change in their log-ratio over the complete growth cycle (Chevin 2011; Good ):where is the population size of strain i at the beginning of the growth cycle and is the population size of strain i at the end. After the population reaches stationary phase, it is diluted by a factor of D into a fresh medium with amount R of the resource, and the cycle repeats (Figure 1A). We assume the population remains in the stationary phase for a sufficiently short time, such that we can ignore death and other dynamics during this phase (Finkel 2006; Avrani ).Over many cycles of growth, as would occur in a laboratory evolution experiment (Lenski ; Elena and Lenski 2003; Good ), the population dynamics of this system are characterized by the set of frequencies for all strains as well as the matrix of selection coefficients and the total population size at the beginning of each cycle. In Supplemental Material, Supplemental Methods (sections I, II, and III), we derive explicit equations for the deterministic dynamics of these quantities over multiple cycles of growth for an arbitrary number of strains. In the case of two strains, such as a mutant and a wild type, the selection coefficient is approximatelywhere is the growth rate of the mutant relative to the wild type and is the relative lag time. The approximation is valid as long as the growth rate difference between the mutant and the wild type is small (Supplemental Methods, section IV), which is true for most single mutations (Chevereau ; Levy ). This equation shows that the growth phase and the lag phase make distinct additive contributions to the total selection coefficient, with the dilution factor D controlling their relative magnitudes (Figure 1B). This is because a larger dilution factor will increase the amount of time the population grows exponentially, hence increasing selection on growth rate. Neutral coexistence between multiple strains is therefore possible if these two selection components balance although it requires an exact tuning of the growth traits with the dilution factor (Supplemental Methods, section III) (Manhart ; Manhart and Shakhnovich 2018). With a fixed dilution factor D, the population size at the beginning of each growth cycle changes according to (Supplemental Methods, section I)where is the effective yield of the whole population in the current growth cycle. In this manner the ratio sets the bottleneck size of the population, which for serial dilution is approximately the effective population size (Lenski ), and therefore determines the strength of genetic drift.
Model of evolutionary dynamics
We now consider the evolution of a population as new mutations arise that alter growth traits. We start with a wild-type population having lag time and growth rate which are roughly consistent with E. coli parameters, where time is measured in minutes (Lenski ; Vasi ); we set the wild-type yield to be without loss of generality. As in experiments, we vary the dilution factor D and the amount of resources R, which control the relative selection on growth vs. lag (set by D, Equation 3) and the effective population size (set by
Equation 4). We also set the initial population size of the first cycle toThe population grows according to the dynamics in Figure 1A. Each cell division can generate a new mutation with probability note that this rate is only for mutations altering growth traits, and therefore it is lower than the rate of mutations anywhere in the genome. We generate a random waiting time for each strain k until the next mutation with instantaneous rate When a mutation occurs, the growth traits for the mutant are drawn from a distribution where are the growth traits for the background strain on which the new mutation occurs and are the traits for the new mutant. Note that since mutations only arise during the exponential growth phase, beneficial or deleterious effects on lag time are not realized until the next growth cycle (Li ). After the growth cycle ceases (once the resource is exhausted according to Equation 1), we randomly choose cells, each with probability to form the population for the next growth cycle.We will assume mutational effects are not epistatic and scale with the trait values of the background strain, so that where
and (Supplemental Methods, section V). Since our primary goal is to scan the space of possible mutations, we focus on uniform distributions of mutational effects where
and In the Supplemental Methods, we extend our main results to the case of Gaussian distributions (section V) as well as an empirical distribution of mutational effects based on single-gene deletions in E. coli (section VI) (Campos ).
Data availability
Data and codes are available upon request. File S1 contains the Supplemental Methods. File S2 contains data of growth traits presented in Figure S3. Supplemental material available at figshare: https://doi.org/10.25386/genetics.12194754.
Results
Fixation of mutations
We first consider the fixation statistics of new mutations in our model. In Figure 2A we show the relative growth rates γ and the relative lag times ω of fixed mutations against their background strains, along with contours of constant selection coefficient s from Equation 3. As expected, fixed mutations either increase growth rate decrease lag time or both. In contrast, the yield of fixed mutations is the same as the ancestor on average (Figure 2B); indeed, the selection coefficient in Equation 3 does not depend on the yields. If a mutation arises with significantly higher or lower yield than the rest of the population, the bottleneck population size immediately adjusts to keep the overall fold-change of the population during the growth cycle fixed to the dilution factor D (Equation 4). Therefore mutations that significantly change yield have no effect on the overall population dynamics.
Figure 2
Selection coefficient determines fixation probability. (A) The relative growth rates γ and the relative lag times ω of fixed mutations against their background strain. Dashed lines mark contours of constant selection coefficient with interval while the solid line marks . (B) Same as A, but for relative growth rate γ and the relative yield δ. The red dots mark the relative yield of fixed mutations averaged over binned values of the relative growth rate γ. In A and B, and . (C, D, E, and F) Fixation probability of mutations against their selection coefficients for different amounts of resource R and dilution factors D as indicated in the titles. The red dashed line shows the fixation probability predicted in the SSWM regime (Equation 5), while the black line shows a numerical fit of the data points to Equation 6 with parameters and in D, and in E, and and in F. In all panels mutations randomly arise from a uniform distribution with
and
Selection coefficient determines fixation probability. (A) The relative growth rates γ and the relative lag times ω of fixed mutations against their background strain. Dashed lines mark contours of constant selection coefficient with interval while the solid line marks . (B) Same as A, but for relative growth rate γ and the relative yield δ. The red dots mark the relative yield of fixed mutations averaged over binned values of the relative growth rate γ. In A and B, and . (C, D, E, and F) Fixation probability of mutations against their selection coefficients for different amounts of resource R and dilution factors D as indicated in the titles. The red dashed line shows the fixation probability predicted in the SSWM regime (Equation 5), while the black line shows a numerical fit of the data points to Equation 6 with parameters and in D, and in E, and and in F. In all panels mutations randomly arise from a uniform distribution with
andFigure 2A also suggests that the density of fixed mutations in the growth-lag trait space depends solely on their selection coefficients, rather than the precise combination of traits, as long as other parameters such as the dilution factor D, the total amount of resource R, and the distribution of mutational effects are held fixed. Mathematically, this means that the fixation probability of a mutation with growth effect γ and lag effect ω can be expressed as To test this, we discretize the scatterplot of Figure 2A and compute the fixation probabilities of mutations as functions of γ and ω (Supplemental Methods, section VII). We then plot the resulting fixation probabilities of mutations as functions of their selection coefficients calculated by Equation 3 (Figure 2, C–F). We test the dependence of the fixation probability on the selection coefficient over a range of population dynamics regimes by varying the dilution factor D and the amount of resources R.For small populations, mutations generally arise and either fix or go extinct one at a time, a regime known as strong-selection weak-mutation (SSWM) (Gillespie 1984). In this case, we expect the fixation probability of a beneficial mutation with selection coefficient to be (Wahl and Gerrish 2001; Wahl and Zhu 2015; Guo )This is similar to the standard Wright–Fisher fixation probability of (Crow and Kimura 1970), but with a different prefactor due to averaging over the different times in the exponential growth phase at which the mutation can arise (Supplemental Methods, section VIII). Indeed, we see this predicted dependence matches the simulation results for the small population size of (Figure 2C).For larger populations, multiple beneficial mutations will be simultaneously present in the population and interfere with each other, an effect known as clonal interference (Gerrish and Lenski 1998; Desai and Fisher 2007; Schiffels ; Good ; Fisher 2013; Good and Desai 2014). Our simulations show that, as for the SSWM case, the fixation probability depends only on the selection coefficient (Equation 3) relative to the mutation’s immediate ancestor and not on the individual combination of mutant traits (Figure 2, D–F), with all other population parameters held constant. Previous work has determined the dependence of the fixation probability on the selection coefficient under clonal interference using various approximations (Gerrish and Lenski 1998; Schiffels ; Good ; Fisher 2013). Here, we focus on an empirical relation based on (Gerrish and Lenski 1998)where A and B are two constants that depend on other parameters of the population (D, R, and the distribution of mutational effects); we treat these as empirical parameters to fit to the simulation results, although Gerrish and Lenski (1998) predicted
i.e., the same constant as in the SSWM case (Equation 5). The factor in Equation 6 comes from the probability that no superior beneficial mutations appears before the current mutation fixes. Since the time to fixation scales as we expect the average number of superior mutations to be proportional to (for small s). This approximation holds only for selection coefficients that are not too small and therefore are expected to fix without additional beneficial mutations on the same background; Equation 6 breaks down for weaker beneficial mutations that typically fix by hitchhiking on stronger mutations (Schiffels ). Nevertheless, Equation 6 matches our simulation results well for a wide range of selection coefficients achieved in our simulations and larger population sizes (Figure 2, D–F). Furthermore, the constant A we fit to the simulation data are indeed close to the predicted value of except in the most extreme case of (Figure 2F).Altogether, Figure 2 shows that mutations with different effects on cell growth (for example, a mutant that increases the growth rate and a mutant that decreases the lag time) can nevertheless have approximately the same fixation probability as long as their overall effects on selection are the same according to Equation 3. To test the robustness of this result, we verify it for several additional distributions of mutational effects in the Supplemental Methods: a Gaussian distribution of mutational effects, including the presence of correlated mutational effects (Figure S1); a wider distribution of mutational effects with large selection coefficients (Figure S2); and an empirical distribution of mutational effects estimated from single-gene deletions in E. coli (Figure S3). In Figure S4A, we further test robustness by using the neutral phenotype (orthogonal to the selection coefficient) to quantify the range of γ and ω trait combinations that nevertheless have the same selection coefficient and fixation probability, and in Figure S4B we show that the selection coefficient on growth alone is insufficient to determine fixation probability.While the dependence of fixation probability on the selection coefficient is a classic result of population genetics (Hartl and Clark 1997), the existence of a simple relationship here is nontrivial since, strictly speaking, selection in this model is not only frequency-dependent (Manhart ) (i.e., selection between two strains depends on their frequencies) but also includes higher-order effects (Manhart and Shakhnovich 2018) (i.e., selection between strain 1 and strain 2 is affected by the presence of strain 3). Therefore, in principle, the fixation probability of a mutant may depend on the specific state of the population in which it is present, while the selection coefficient in Equation 3 only describes selection on the mutant in competition with its immediate ancestor. However, we see that, at least for the parameters considered in our simulations, these effects are negligible in determining the eventual fate of a mutation.
Adaptation of growth traits
As Figure 3A shows, many mutations arise and fix over the timescale of our simulations, which lead to predictable trends in the quantitative traits of the population. We first determine the relative fitness of the evolved population at each time point against the ancestral strain by simulating competition between an equal number of evolved and ancestral cells for one cycle, analogous to common experimental measurements (Lenski ; Elena and Lenski 2003). The resulting fitness trajectories are shown in Figure 3B. To see how different traits contribute to the fitness increase, we also calculate the average population traits at the beginning of each cycle; for instance, the average population growth rate at growth cycle n is As expected from Equation 3, the average growth rate increases (Figure 3C) and the average lag time decreases (Figure 3D) for all simulations. In contrast, the average yield evolves without apparent trend (Figure 3E), since Equation 3 indicates no direct selection on yield. We note that, while the cells do not evolve toward lower or higher resource efficiency on average, they do evolve to consume resources more quickly, since the rate of resource consumption ( for each cell of strain k) depends on both the yield as well as the growth rate. Therefore the saturation time of each growth cycle evolves to be shorter, consistent with recent work from Baake .
Figure 3
Dynamics of evolving populations. (A) Frequencies of new mutations as functions of the number n of growth cycles. Example trajectories of (B) the fitness of the evolved population relative to the ancestral population, (C) the evolved average growth rate, (D) the evolved average lag time, and (E) the evolved average yield. In all panels the dilution factor is the amount of resource at the beginning of each cycle is and mutations randomly arise from a uniform distribution with
and
Dynamics of evolving populations. (A) Frequencies of new mutations as functions of the number n of growth cycles. Example trajectories of (B) the fitness of the evolved population relative to the ancestral population, (C) the evolved average growth rate, (D) the evolved average lag time, and (E) the evolved average yield. In all panels the dilution factor is the amount of resource at the beginning of each cycle is and mutations randomly arise from a uniform distribution with
andFigure 3 suggests relatively constant speeds of adaptation for the relative fitness, the average growth rate, and the average lag time. For example, we can calculate the adaptation speed of the average growth rate as the averaged change in the average growth rate per cycle:where the bracket denotes an average over replicate populations and cycle number. In the Supplemental Methods (sections IX and X), we calculate the adaptation speeds of these traits in the SSWM regime to bewhere and are the standard deviations of the underlying distributions of γ and ω for single mutations
is the ancestral growth rate, and is the ancestral yield (we assume the yield does not change on average according to Figure 3E). Furthermore, the ratio of the growth adaptation rate and the lag adaptation rate is independent of the amount of resource and mutation rate in the SSWM regime:Equation 8 predicts that the adaptation speeds of the average growth rate, the average lag time, and the relative fitness should all increase with the amount of resources R and decrease with the dilution factor D (for large D); although this prediction assumes the SSWM regime (relatively small ), it nevertheless holds across a wide range of R- and D-values (Figure 4, A–C), except for where the speed of fitness increase is nonmonotonic with D (Figure 4C). The predicted adaptation speeds in Equation 8 also quantitatively match the simulated trajectories in the SSWM case (Figure 4, D–F); even outside of the SSWM regime, the relative rate in Equation 9 remains a good prediction at early times (Figure S5).
Figure 4
Speed of adaptation. The average per-cycle adaptation speed of (A) the average growth rate, (B) the average lag time, and (C) the fitness relative to the ancestral population as functions of the dilution factor D and total amount of resources R. The adaptation speeds are averaged over growth cycles and independent populations. (D) The average growth rate, (E) the average lag time, and (F) the fitness relative to the ancestral population as functions of the number n of growth cycles. The dilution factor is and the total resource is so the population is in the SSWM regime. The blue solid lines are simulation results, while the dashed lines show the mathematical predictions in Equation 8. All panels show averages over 500 independent simulated populations, with mutations randomly arising from a uniform distribution with
and
Speed of adaptation. The average per-cycle adaptation speed of (A) the average growth rate, (B) the average lag time, and (C) the fitness relative to the ancestral population as functions of the dilution factor D and total amount of resources R. The adaptation speeds are averaged over growth cycles and independent populations. (D) The average growth rate, (E) the average lag time, and (F) the fitness relative to the ancestral population as functions of the number n of growth cycles. The dilution factor is and the total resource is so the population is in the SSWM regime. The blue solid lines are simulation results, while the dashed lines show the mathematical predictions in Equation 8. All panels show averages over 500 independent simulated populations, with mutations randomly arising from a uniform distribution with
and
Evolved covariation between growth traits
We now turn to investigating how the covariation between traits evolves. We have generally assumed that individual mutations have uncorrelated effects on different traits. Campos recently systematically measured the growth curves of the single-gene deletions in E. coli. We compute the relative growth rate, lag time, and yield changes for the single-gene deletions compared with the wild type and find that the resulting empirical distribution of relative growth traits changes shows very small correlations between these traits (Figure S3, B and C), consistent with our assumptions. We note that these measurements, however, are subject to significant noise (Supplemental Methods, section VI), and therefore any conclusions ultimately require verification by further experiments.Even in the absence of mutational correlations, selection may induce a correlation between these traits in evolved populations. In Figure 5A, we schematically depict how the raw variation of traits from mutations is distorted by selection and fixation of multiple mutations. Specifically, for a single fixed mutation, selection induces a positive (i.e., antagonistic) correlation between the relative growth rate change and the relative lag time change. Figure 2A shows this for single fixed mutations, while Figure 5, B and C shows this positive correlation between the average growth rate and the average lag time across populations that have accumulated the same number of fixed mutations. For populations in the SSWM regime with the same number of fixed mutations, the Pearson correlation coefficient between the average growth rate and the average lag time across populations is approximately equal to the covariation of the relative growth rate change γ and the relative lag time change ω for a single fixed mutation:where is an average over the distribution of single fixed mutations (Supplemental Methods, section IX). We can explicitly calculate this quantity in the SSWM regime, which confirms that it is positive for uncorrelated mutational effects with uniform or Gaussian distributions (Supplemental Methods, section XI).
Figure 5
Evolved patterns of covariation among growth traits. (A) Schematic of how selection and fixation of multiple mutations shape the observed distribution of traits. The sign of the Pearson correlation coefficient between the average growth rate and lag time depends on whether we consider an ensemble of populations with the same number of fixed mutations or the same number of total mutation events. (B) Distribution of average growth rate and lag time for 1000 independent populations with the same number of fixed mutations. Each color corresponds to a different number of fixed mutations indicated in the legend. (C) Pearson correlation coefficient of growth rate and lag time for distributions in B, as a function of the number of fixed mutations. The dashed line is the prediction from Equation 10. (D) Same as B, except each color corresponds to a set of populations at a snapshot in time with the same number of total mutation events. Each color corresponds to a different number of total mutations events indicated in the legend. (E) Same as C, but for the set of populations shown in D. The dashed line is the prediction from Equation 11. In C and E, the error bars represent confidence intervals. In B–E, we simulate the SSWM regime by introducing random mutations one by one and determining their fixation from Equation 5 with
Evolved patterns of covariation among growth traits. (A) Schematic of how selection and fixation of multiple mutations shape the observed distribution of traits. The sign of the Pearson correlation coefficient between the average growth rate and lag time depends on whether we consider an ensemble of populations with the same number of fixed mutations or the same number of total mutation events. (B) Distribution of average growth rate and lag time for 1000 independent populations with the same number of fixed mutations. Each color corresponds to a different number of fixed mutations indicated in the legend. (C) Pearson correlation coefficient of growth rate and lag time for distributions in B, as a function of the number of fixed mutations. The dashed line is the prediction from Equation 10. (D) Same as B, except each color corresponds to a set of populations at a snapshot in time with the same number of total mutation events. Each color corresponds to a different number of total mutations events indicated in the legend. (E) Same as C, but for the set of populations shown in D. The dashed line is the prediction from Equation 11. In C and E, the error bars represent confidence intervals. In B–E, we simulate the SSWM regime by introducing random mutations one by one and determining their fixation from Equation 5 withHowever, in evolution experiments we typically observe populations at a particular snapshot in time, such that the populations may have a variable number of fixed mutations but the same number of total mutations that arose and either fixed or went extinct (since the number of total arising mutations is very large, we neglect its fluctuation across populations). Interestingly, the variation in number of fixed mutations at a snapshot in time causes the distribution of growth rates and lag times across populations to stretch into a negative correlation; this is an example of Simpson’s paradox from statistics (Simpson 1951). Figure 5A shows this effect schematically, while Figure 5, D and E show explicit results from simulations. An intuitive way to understand the evolved negative correlation is to approximate the effects of all fixed mutations as deterministic, so that each fixed mutation increases the average growth rate and decreases the average lag time by the same amount. Therefore, populations with a higher average growth rate must have a larger number of fixed mutations and thus also a shorter average lag time, leading to a negative correlation between the average growth rates and the average lag times. In the Supplemental Methods (section XI), we calculate this evolved Pearson correlation coefficient across populations in the SSWM regime to be approximatelyThat is, the correlation of traits across populations with multiple mutations is still a function of the distribution of single fixed mutations, but it is not equal to the correlation of single fixed mutations (Equation 10). In the Supplemental Methods (section XI), we explicitly calculate in the SSWM regime for uncorrelated uniform and Gaussian distributions of mutational effects, which shows that it is negative. Furthermore, we prove that it must always be negative for any symmetric and uncorrelated distribution (Supplemental Methods, section IX).The predicted correlations in Equations 10 and 11 quantitatively match the simulations well in the SSWM regime (Figure 5, C and E). While they are less accurate outside of the SSWM regime, they nevertheless still produce the correct sign of the evolved correlation within the parameter regimes of our simulations (Figure S6, A–C). However, the signs of the correlations can indeed change depending on the underlying distribution of mutational effects For example, in the Supplemental Methods, we explore the effects of varying the mean mutational effects (Figure S6D)—e.g., whether an average mutation has positive, negative, or zero effect on the growth rate—as well as the intrinsic mutational correlation between the relative growth rate change and the relative lag time change (Figure S6E).
Discussion
We have investigated a model of microbial evolution under serial dilution, which is both a common protocol for laboratory evolution experiments (Luckinbill 1978; Lenski ; Elena and Lenski 2003; Levy ; Kram ) as well as a rough model of evolution in natural environments with feast–famine cycles. While there has been extensive work to model population and evolutionary dynamics in these conditions (Gerrish and Lenski 1998; Wahl and Gerrish 2001; Desai 2013; Baake ; Guo ), these models have largely neglected the physiological links connecting mutations to selection. However, models that explicitly incorporate these features are necessary to interpret experimental evidence that mutations readily generate variation in multiple cellular traits, and that this variation is important to adaptation (Vasi ; Novak ; Reding-Roman ; Li ). Wahl and Zhu (2015) determined the relative fixation probabilities of mutations on different traits and the effects of mutation occurrence time and dilution factor, but the role of pleiotropy and evolutionary dynamics over many mutations were not considered.In this paper, we have studied a model where mutations can affect three quantitative growth traits—the lag time, the exponential growth rate, and the yield (Figure 1A)—since these three traits are widely measured for microbial populations. In particular, we have derived a simple expression (Equation 3) for the selection coefficient of a mutation in terms of its effects on growth and lag and a single environmental parameter, the dilution factor D. While previous work showed that this particular form of the selection coefficient determines the fixation probability of a single mutation in the SSWM regime (Manhart ), here we show that this holds even in the presence of clonal interference (Figure 2, C–F), which appears to be widespread in laboratory evolution experiments (Lang , 2013; Good ). Our result is therefore valuable for interpreting the abundant experimental data on mutant growth traits. We have also calculated the adaptation rates of growth traits per cycle in the SSWM regime, which turn out to increase with the amount of resource R and decrease with the dilution factor D. These results are confirmed by numerical simulations and remain good predictions even outside of the SSWM regime. Furthermore, some of these results are independent of the specific form of the selection coefficient (Equation 3), namely the fact that the fixation probability depends only on the selection coefficient (with other population parameters besides the mutant traits being held fixed) even in the clonal interference regime, and the expressions for the correlation coefficients of traits between populations (Equations 10 and 11).An important difference with the previous work on this model is that here we used a fixed dilution factor D, which requires that the bottleneck population size fluctuates as the population evolves. In contrast, previous work used a fixed and variable D (Manhart ; Manhart and Shakhnovich 2018). We observed two important differences between these regimes. First, in the case of fixed and variable D, the fold-change of the population during a single growth cycle, which is approximately (Manhart ), determines the relative selection between growth and lag, since it determines how long the population undergoes exponential growth. Therefore one can experimentally tune this relative selection by varying either the total amount of resources R or the fixed bottleneck size . However, when the dilution factor D is fixed, the population fold-change is always constrained to exactly equal D, and therefore D alone determines the relative selection on growth and lag (Equation 3). The second difference is that, with fixed and variable D, the selection coefficient depends explicitly on the effective yield and is therefore frequency-dependent (Supplemental Methods, section II), which enables the possibility of stable coexistence between two strains (Manhart ; Manhart and Shakhnovich 2018). However, for the fixed D case, the frequency dependence of is exactly canceled by (Equation 4). Therefore, there is only neutral coexistence in this case, requiring the growth and lag traits of the strains to follow an exact constraint set by D (Supplemental Methods, section III).A major result of our model is a prediction on the evolution of covariation between growth traits. In particular, we have shown that correlations between traits can emerge from selection and accumulation of multiple mutations even without an intrinsic correlation between traits from individual mutations (Figure 5 and Figure S6). We have also shown that selection alone produces no correlation between growth and yield, in the absence of correlated mutational effects (Figure 2B and Figure 3E). This is important for interpreting evolved patterns of traits in terms of selective or physiological tradeoffs. Specifically, it emphasizes that the evolved covariation between traits conflates both the underlying supply of variation from mutations as well as the action of selection and other aspects of population dynamics (e.g., genetic drift, spatial structure, recombination), and therefore it is difficult to make clear inferences about either aspect purely from the outcome of evolution alone. For example, simply observing a negative correlation between two traits from evolved populations is insufficient to infer whether that correlation is due to a physiological constraint on mutations (e.g., mutations cannot improve both traits simultaneously) or due to a selective constraint (e.g., selection favors specialization in one trait or another).These questions, of course, have been the foundation of quantitative trait genetics (Lynch and Walsh 1998). Historically, this field has emphasized polymorphic populations with abundant recombination as are applicable to plant and animal breeding. However, this regime is quite different from microbial populations, which, at least under laboratory conditions, are often asexual and dominated by linkage between competing mutations (Lang , 2013; Good ). We therefore need a quantitative description of both between-population as well as within-population covariation of traits of microbial populations in this regime. In the present study, we focus on between-population covariation in growth traits, but recent work by Gomez provides insight into the case of within-population covariation. They showed that a tradeoff across individuals within a population evolves between two quantitative traits under positive, additive selection; this suggests that while growth rate and lag time will be negatively correlated across populations (Figure 5, D and E), they should be positively correlated within populations.Microbial growth traits should indeed be an ideal setting for this approach due to abundant data, but conclusions on the nature of trait covariation have remained elusive. Physiological models have predicted a negative correlation between growth rate and lag time across genotypes (Baranyi and Roberts 1994; Himeoka and Kaneko 2017), while models of single-cell variation in lag times also suggest there should be a negative correlation at the whole-population level (Baranyi 1998). However, experimental evidence has been mixed, with some studies finding a negative correlation (Ziv , 2017), while others found no correlation (Levin-Reisman ; Warringer ; Adkar ). Studies of growth–yield correlations have long been motivated by r/K selection theory, which suggests there should be tradeoffs between growth rate and yield (Reznick ). For instance, metabolic models make this prediction (Pfeiffer ; MacLean 2007; Meyer ). However, experimental evidence has again been mixed, with some data showing a tradeoff (Jasmin and Zeyl 2012; Jasmin ; Bachmann ), while others show no correlation (Velicer and Lenski 1999; Novak ; Fitzsimmons ; Reding-Roman ) or even a positive correlation (Luckinbill 1978; Warringer ). Some of this ambiguity may have to do with dependence on the environmental conditions (Reding-Roman ) or the precise definition of yield. We define yield as the proportionality constant of population size to resource (Equation 1) and neglect any growth rate dependence on resource concentration. Under these conditions, we predict no direct selection on yield, which means that the only way to generate a correlation of yield with growth rate is if the two traits are constrained at the physiological level, so that mutational effects are correlated. In such cases yield could evolve but only as a spandrel (Gould and Lewontin 1979; Amir 2017). Ultimately, we believe more precise single-cell measurements of these traits, both across large unselected mutant libraries as well as evolved strains, are necessary to definitively test these issues (Campos ).
Authors: Herwig Bachmann; Martin Fischlechner; Iraes Rabbers; Nakul Barfa; Filipe Branco dos Santos; Douwe Molenaar; Bas Teusink Journal: Proc Natl Acad Sci U S A Date: 2013-08-12 Impact factor: 11.205
Authors: Sasha F Levy; Jamie R Blundell; Sandeep Venkataram; Dmitri A Petrov; Daniel S Fisher; Gavin Sherlock Journal: Nature Date: 2015-02-25 Impact factor: 49.962