| Literature DB >> 30517696 |
Judith Ilhan1, Anne Kupczok1, Christian Woehle1, Tanita Wein1, Nils F Hülter1, Philip Rosenstiel2, Giddy Landan1, Itzhak Mizrahi3,4, Tal Dagan1.
Abstract
The ubiquity of plasmids in all prokaryotic phyla and habitats and their ability to transfer between cells marks them as prominent constituents of prokaryotic genomes. Many plasmids are found in their host cell in multiple copies. This leads to an increased mutational supply of plasmid-encoded genes and genetically heterogeneous plasmid genomes. Nonetheless, the segregation of plasmid copies into daughter cells during cell division is considered to occur in the absence of selection on the plasmid alleles. We investigate the implications of random genetic drift of multicopy plasmids during cell division-termed here "segregational drift"-to plasmid evolution. Performing experimental evolution of low- and high-copy non-mobile plasmids in Escherichia coli, we find that the evolutionary rate of multicopy plasmids does not reflect the increased mutational supply expected according to their copy number. In addition, simulated evolution of multicopy plasmid alleles demonstrates that segregational drift leads to increased loss frequency and extended fixation time of plasmid mutations in comparison to haploid chromosomes. Furthermore, an examination of the experimentally evolved hosts reveals a significant impact of the plasmid type on the host chromosome evolution. Our study demonstrates that segregational drift of multicopy plasmids interferes with the retention and fixation of novel plasmid variants. Depending on the selection pressure on newly emerging variants, plasmid genomes may evolve slower than haploid chromosomes, regardless of their higher mutational supply. We suggest that plasmid copy number is an important determinant of plasmid evolvability due to the manifestation of segregational drift.Entities:
Keywords: bacterial evolution; mitochondria heteroplasmy; population genetics
Mesh:
Year: 2019 PMID: 30517696 PMCID: PMC6389322 DOI: 10.1093/molbev/msy225
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.Genetic maps of plasmid pLC and pHC. The plasmids are of comparable genome size and differ in the origin of replication. The mob gene originally found in pBBR1 was nonfunctionalized by truncation during plasmid construction (denoted as Ψmob).
Number of Intergenic and Synonymous Point Mutations Detected in the Evolved Populations.
| Chromosome | Plasmid | ||||||
|---|---|---|---|---|---|---|---|
| Plasmid Type | Host Genotype | Observed Number of Point Mutations | Substitution per Site per Generation | PCN | Expected Number of Point Mutations | Observed Number of Point Mutations | |
| pLC | Wild type | 37 | 17 | 2.5 ± 0.7 | 20.9 ± 4.9 | 16.3 × 10−2 ± 5.0 × 10−2 | 0 |
| 42 | 16 | 2.4 ± 1.8 | 4.8 ± 3.2 | 4.5 × 10−2 ± 4.8 × 10−2 | 0 | ||
| Hypermutator | 37 | 148 | 22.4 ± 5.2 | 7.1 ± 2.2 | 5.0 × 10−1 ± 1.7 × 10−1 | 1 | |
| 42 | 87 | 13.2 ± 3.3 | 4.1 ± 1.4 | 1.8 × 10−1 ± 1.1 × 10−1 | 1 | ||
| pHC | Wild type | 37 | 6 | 0.87 ± 0.6 | 605.0 ± 501.3 | 2.0 ± 2.0 | 0 |
| 42 | 6 | 0.87 ± 0.8 | 294.0 ± 105.8 | 7.9 × 10−1 ± 7.2 × 10−1 | 0 | ||
| Hypermutator | 37 | 98 | 14.3 ± 3.8 | 154.5 ± 63.4 | 8.0 ± 5.0 | 0 | |
| 42 | 147 | 21.4 ± 11.5 | 252.6 ± 106.3 | 19.1 ± 16.2 | 0 | ||
Note.—A detailed information used for the calculation of the mean and SD for each of the eight experimental factor groups is shown in supplementary table S1, Supplementary Material online. Additional low-frequency point mutations were detected on the pLC plasmid considering the full sequencing coverage (see variant details in supplementary table S2, Supplementary Material online). A detailed list of all observed variants on the chromosome is given in supplementary table S3, Supplementary Material online. The variant distribution across the replicons is depicted in supplementary fig. S1, Supplementary Material online. We note that PCN estimate from the relative sequencing coverage of small plasmids may differ from the estimated PCN using qPCR due to the DNA extraction method as well as the sequencing approach (Becker et al. 2016). The large-scale estimates of PCN from sequencing results are lower than those obtained with qPCR, hence our estimate of the plasmid mutational supply is conservative. The ancestral PCN estimated from the sequencing results are as following: pLC-wt: 2, pLC-hypermutator: 13, pHC-wt: 86, and pHC-hypermutator: 116. A comparison of the PCN distribution in the evolved populations reveals that the PCN is significantly different from that of the ancestor plasmid in several of the experimental evolution settings. However, the direction of change (i.e., PCN increase or decrease) is heterogeneous (see supplementary table S1, Supplementary Material online, for details).
The chromosomal substitution rate is calculated as RChr = Nobs × (Nsites × Ngen)−1, where Nobs is the number of observed point mutations, Nsites is the number of intergenic and synonymous sites on the chromosome having a coverage ≥10, and Ngen is the number of generations in our experiment.
PCN was estimated from the sequencing results of evolved populations as the relative coverage of plasmid and chromosome replicons (detailed list in supplementary table S1, Supplementary Material online).
The expected number of point mutations on the plasmid is calculated as Nexp = RChr × Nsites × PCN × Ngen, where Nsites is the number of intergenic and synonymous sites on the plasmid having a coverage ≥10.
. 2.Simulated evolution of plasmids and chromosomes under neutrality with mutation (500 loci, mutation rate 2 × 10−9 per locus per generation). (A) Distribution of the number of mutations after 1,000 generations in 100 replicates. The color of the boxes corresponds to the combination of replicon and population size as in the legend. (B) Joint cumulative distribution function (CDF) of the AF of all mutations present in the replicates after 1,000 generations.
. 3.Simulated evolutionary dynamics of mutant alleles on plasmids and chromosomes. AF dynamics over 2,000 generations in 100 replicate simulated populations. Columns are the five combinations of simulated replicon type and population size, and rows are the different combinations of selective coefficient (s) and initial AF (f). Generations are on the x-axis, proportion of AF of replicates are stacked along the y-axis, population AF is color coded. For f = 0.001 and f = 0.0001, the starting frequency of mutant alleles is randomly sampled with the respective frequencies. For f = 0.5, the starting frequency of the mutant allele is exactly 0.5 for the chromosome and exactly 0.5 inside each cell for the plasmids. Note that, the initial frequency of cells with a mutant allele in the population is higher in the plasmid simulations compared with chromosome simulations (see supplementary table S4, Supplementary Material online, for expected number of mutant cells).
. 4.Frequency trajectories of plasmid alleles per cell in the 1,010 plasmid simulations. (A) Data shown for with initial allele frequencies f = 0.001, and f = 0.5. The frequency of mutant cells is calculated as the proportion of cells with at least one mutant plasmid allele within the population. The frequency of fixed mutant cells is the proportion of cells in which all plasmids loci contain the mutant allele (i.e., the plasmid allele is fixed in the cell). Starting with a low initial AF (f = 0.001), the difference between mutant plasmid allele dynamics and mutant cells in the population is most pronounced with s = 0.01. The results of the simulation where s = 0.1 are presented for comparison; in this setting, the dynamics of mutant plasmid alleles and mutant cells are similar. In the simulation of plasmid allele dynamics post balancing selection (f = 0.5), the ancestral and mutant alleles are set to an equal ratio within all cells at the beginning of the simulation. (B) Ancestral allele loss is calculated as the median number of generation where the ancestral allele was lost across all simulated populations.
. 5.Parallel evolution of the chromosome. (A) A heatmap representing 1,025 the number of shared mutated genes among the 48 evolved populations. The number of genes mutated in population pair x, y is calculated as the number of genes in which genetic variants (single base substitutions or indels) are observed in both populations. Cells along the diagonal present the number of genes where a mutation was detected in the corresponding population and at least one additional population. Annotation bars at the top and left right present the experimental factors (color coded as shown in [C]). (B) An interaction plot of the main factors in the experiment as calculated from the complete set of allele frequencies of parallel variants (included in supplementary table S3, Supplementary Material online). Dots represent means and error bars represent the standard error of the mean. ANOVA demonstrated significant effects for plasmid replicon type and temperature along with significant interactions between the three factors used in the experiment (statistics in supplementary table S5, Supplementary Material online). (C) Color-coded matrix of variants whose AF is significantly different among the main factors or their combination (using ANOVA on aligned rank transformed allele frequencies and FDR; statistics in supplementary table S6, Supplementary Material online). Intragenic variants (11 synonymous and 22 nonsynonymous) are indicated by the gene symbol and, in case of a single base substitution, coded as [ancestral amino acid][amino acid position][alternative amino acid]. The two insertion mutations are indicated by a gene symbol and coded as [nucleotide position in gene][+ inserted nucleotide]. [*] indicates a nonsense codon. The two intergenic variants are indicated by [i] coded as [ancestral nucleotide][genomic position][alternative nucleotide].
. 6.An illustration of the consequences of segregational drift to plasmid allele dynamics. The illustrated PCN is four and its segregation into daughter cells is balanced (i.e., the daughter cells inherit an equal number of plasmids). In the depicted scenario, one allele emerges on the chromosome (red X) and another allele emerges on the plasmid (magenta) at t0. Both alleles are neutral (i.e., s = 0). The population size (N) and the frequency of hosts (fhosts) over three generations are presented. The chromosomal allele is inherited into all daughter cells, such that it is present in the total population. Random segregation of the plasmid genotypes leads to decrease in the number of hosts. At t3, only a small minority in the population harbors the new plasmid allele. In the absence of selection for the plasmid allele presence, the new plasmid allele is deemed to remain at a very low frequency within the population (or be lost). Note.—This is a simplistic example that does not include loss of host cells due to drift and it does not include consideration of recombination among plasmids.