| Literature DB >> 25589261 |
Ram P Maharjan1, Bin Liu2, Lu Feng2, Thomas Ferenci1, Lei Wang3.
Abstract
Changes in allele frequencies and the fixation of beneficial mutations are central to evolution. The precise relationship between mutational and phenotypic sweeps is poorly described however, especially when multiple alleles are involved. Here, we investigate these relationships in a bacterial population over 60 days in a glucose-limited chemostat in a large population. High coverage metagenomic analysis revealed a disconnection between smooth phenotypic sweeps and the complexity of genetic changes in the population. Phenotypic adaptation was due to convergent evolution and involved soft sweeps by 7-26 highly represented alleles of several genes in different combinations. Allele combinations spread from undetectably low baselines, indicating that minor subpopulations provide the basis of most innovations. A hard sweep was also observed, involving a single combination of rpoS, mglD, malE, sdhC, and malT mutations sweeping to greater than 95% of the population. Other mutant genes persisted but at lower abundance, including hfq, consistent with its demonstrated frequency-dependent fitness under glucose limitation. Other persistent, newly identified low-frequency mutations were in the aceF, galF, ribD and asm genes, in noncoding regulatory regions, three large indels and a tandem duplication; these were less affected by fluctuations involving more dominant mutations indicating separate evolutionary paths. Our results indicate a dynamic subpopulation structure with a minimum of 42 detectable mutations maintained over 60 days. We also conclude that the massive population-level mutation supply in combination with clonal interference leads to the soft sweeps observed, but not to the exclusion of an occasional hard sweep.Entities:
Keywords: E. coli genome; experimental evolution; metagenomics; mutational sweeps; periodic selection
Mesh:
Year: 2015 PMID: 25589261 PMCID: PMC4350175 DOI: 10.1093/gbe/evv004
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FDistribution of mutations detected by whole-genome sequencing of an evolving population. (A) The number of occurrences of mutations within 11 samples. The histogram was based on a total of 2,729 mutations (1,182 SNPs, 74 small indels, 4 large indels, and 1 duplication mutation) across 11 samples collected over 60 days of continuous culture. (B) The relationship between the number of alleles found in mutated genes and their frequency at peak in the population. The histogram was based on 2,439 mutations in 1,435 genes with one or more mutations.
Nonsynonymous Changes Present at Three or More Time-Points in Nonrepeat Regions of the Genome
| Gene Name | Function | Occurrence in How Many Samples | Number of Alleles | Frequency in Population at Peak |
|---|---|---|---|---|
| RNA polymerase, sigma S (sigma 38) factor | 11 | 7 | 0.99 | |
| Fused conserved protein: DNA-binding transcriptional activator/maltotriose-ATP-binding protein | 10 | 26 | 0.99 | |
| DNA-binding transcriptional repressor | 11 | 7 | 0.98 | |
| Succinate dehydrogenase, membrane subunit, binds cytochrome b556 | 5 | 1 | 0.95 | |
| Maltose transporter subunit | 6 | 5 | 0.88 | |
| Methyl-directed mismatch repair protein | 9 | 2 | 0.67 | |
| Triosephosphate isomerase | 6 | 1 | 0.62 | |
| Hypothetical protein | 5 | 2 | 0.58 | |
| Polynucleotide phosphorylase/polyadenylase | 3 | 3 | 0.39 | |
| Putative assembly protein | 7 | 9 | 0.26 | |
| Alpha-amylase | 4 | 2 | 0.16 | |
| HF-I, host factor for RNA phage Q beta replication | 5 | 1 | 0.15 | |
| Phosphotransfer intermediate protein in two-component regulatory system with RcsBC | 4 | 1 | 0.14 | |
| Predicted inner membrane protein | 5 | 1 | 0.09 | |
| Fused predicted PTS enzymes: IIB component/IIC component | 3 | 5 | 0.08 | |
| ssDNA exonuclease, 5′ → 3′-specific | 6 | 6 | 0.08 | |
| Copper/silver efflux system, membrane component | 3 | 4 | 0.07 | |
| rhsD element protein | 6 | 14 | 0.07 | |
| Cryptic beta- | 6 | 5 | 0.06 | |
| Ecotin, a serine protease inhibitor | 3 | 1 | 0.06 | |
| Putative amidase and lipoprotein | 3 | 2 | 0.05 | |
| Phospho- | 3 | 3 | 0.05 | |
| Fused penicillin-binding protein 1 a: murein transglycosylase/murein transpeptidase | 4 | 5 | 0.05 | |
| Fused DNA-binding response regulator in two-component regulatory system with GlnL: response regulator/sigma54 interaction protein | 3 | 1 | 0.05 | |
| Hypothetical protein | 5 | 7 | 0.05 | |
| Putative DNA-binding response regulator in two-component system | 6 | 2 | 0.05 | |
| Formate dehydrogenase-N, alpha subunit, nitrate-inducible | 3 | 4 | 0.04 | |
| Glutamate synthase, large subunit | 5 | 9 | 0.04 | |
| 4 | 5 | 0.04 | ||
| Homocysteine-N5-methyltetrahydrofolate transmethylase, B12-dependent | 4 | 8 | 0.04 | |
| DNA polymerase II | 4 | 3 | 0.04 | |
| Cardiolipin synthase 1 | 3 | 4 | 0.04 | |
| Fused predicted transporter subunits of ABC superfamily: ATP-binding components | 3 | 2 | 0.04 | |
| Phosphoribosylformyl-glycineamide synthetase | 5 | 8 | 0.04 | |
| Fused aspartokinase I and homoserine dehydrogenase I | 4 | 6 | 0.04 | |
| Fumarate reductase (anaerobic) catalytic and NAD/flavoprotein subunit | 3 | 1 | 0.04 | |
| ATP-dependent DNA helicase | 6 | 8 | 0.04 | |
| Thioredoxin reductase, FAD/NAD(P)-binding | 3 | 4 | 0.04 | |
| Maltodextrin glucosidase | 4 | 4 | 0.03 | |
| Multidrug efflux system, subunit C | 3 | 5 | 0.03 | |
| CP4-44 prophage; antigen 43 (Ag43) phase-variable biofilm formation autotransporter | 3 | 4 | 0.03 | |
| Fused trehalose(maltose)-specific PTS enzyme: IIB component/IIC component | 3 | 2 | 0.03 | |
| Enterobactin synthase multienzyme complex component, ATP-dependent | 3 | 5 | 0.03 | |
| Acyl-CoA synthetase (long-chain-fatty-acid–CoA ligase) | 3 | 4 | 0.03 | |
| Putative transporter | 3 | 4 | 0.03 | |
| Putative DNA-binding transcriptional regulator of phosphonate uptake and biodegradation | 3 | 3 | 0.03 | |
| Paraquat-inducible membrane protein A | 3 | 4 | 0.03 | |
| Potassium translocating ATPase, subunit B | 3 | 4 | 0.03 | |
| Putrescine importer | 3 | 2 | 0.03 | |
| DNA polymerase III/DNA elongation factor III, tau and gamma subunits | 4 | 4 | 0.03 | |
| Putative ATP-dependent helicase | 3 | 4 | 0.03 | |
| Succinylglutamic semialdehyde dehydrogenase | 3 | 3 | 0.03 | |
| Gamma-aminobutyrate transporter | 3 | 3 | 0.03 | |
| Putative inner membrane protein | 3 | 1 | 0.03 | |
| Carbon-phosphorus lyase complex subunit | 4 | 4 | 0.03 | |
| Putative phosphosugar-binding protein | 3 | 2 | 0.03 | |
| Acetyl-CoA synthetase | 5 | 4 | 0.03 | |
| Nitrate reductase 2 (NRZ), alpha subunit | 3 | 4 | 0.03 | |
| Calcium/sodium:proton antiporter | 3 | 1 | 0.03 | |
| DNA ligase, NAD(+)-dependent | 3 | 5 | 0.03 | |
| Bacteriophage N4 receptor, outer membrane subunit | 3 | 5 | 0.03 | |
| Formate dehydrogenase-O, large subunit | 3 | 3 | 0.03 | |
| Carbamoyl-phosphate synthase large subunit | 4 | 4 | 0.02 | |
| Putative transporter | 4 | 3 | 0.02 | |
| Putative transcriptional accessory protein | 3 | 4 | 0.02 | |
| Zinc metallopeptidase | 3 | 3 | 0.02 | |
| Hypothetical protein | 3 | 2 | 0.02 | |
| Myristoyl-acyl carrier protein (ACP)-dependent acyltransferase | 5 | 1 | 0.02 | |
| Putative | 3 | 3 | 0.02 | |
| DNA adenine methylase | 3 | 2 | 0.02 | |
| Putative oxidoreductase with FAD/NAD(P)-binding domain | 3 | 2 | 0.02 | |
| Glycerate kinase I | 3 | 4 | 0.02 | |
| Bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase | 3 | 2 | 0.02 | |
| Fused glutamine amidotransferase (component II) of anthranilate synthase/anthranilate phosphoribosyl transferase | 4 | 3 | 0.02 | |
| Fused predicted transporter subunits of ABC superfamily: ATP-binding components | 3 | 3 | 0.02 | |
| Fused histidinol-phosphatase/imidazoleglycerol-phosphate dehydratase | 3 | 2 | 0.02 | |
| Putative sugar transporter subunit: membrane component of ABC superfamily | 3 | 3 | 0.02 | |
| Taurine transporter subunit | 3 | 3 | 0.02 | |
| Sensory histidine kinase in two-component regulatory system with CreB or PhoB, regulator of the CreBC regulon | 4 | 2 | 0.02 | |
| Hypothetical protein | 3 | 3 | 0.02 | |
| Fused predicted DNA-binding transcriptional regulator/predicted amino transferase | 3 | 2 | 0.02 | |
| Mannose-1-phosphate guanyltransferase | 3 | 2 | 0.02 | |
| Ethanolamine ammonia-lyase, large subunit, heavy chain | 3 | 3 | 0.02 | |
| Gamma-glutamylphosphate reductase | 3 | 3 | 0.02 | |
| Rhodanase-like enzyme, sulfur transfer from thiosulfate | 3 | 3 | 0.02 | |
| Putative oxidoreductase, flavin:NADH component | 3 | 3 | 0.02 | |
| Fused signal transducer for aerotaxis sensory component/methyl accepting chemotaxis component | 3 | 2 | 0.02 | |
| Methyl-galactoside transporter subunit | 3 | 2 | 0.02 | |
| Putative glycosyl hydrolase | 3 | 2 | 0.02 | |
| Putative membrane fusion protein (MFP) component of efflux pump, membrane anchor | 3 | 2 | 0.02 | |
| Bifunctional histidinal dehydrogenase/histidinol dehydrogenase | 3 | 1 | 0.02 |
aThe entries include single nucleotide substitutions and 1-base insertions/deletions in coding sequences.
FCorrelation between phenotypic and genotypic sweeps. (A) The frequency of isolates from the population showing altered phenotype (iodine staining negative, •) and the sum of all the different rpoS mutations (○) in the population at given time points during 60 days of glucose-limited continuous culture as shown in supplementary table S1, Supplementary Material online. (B) Changes in expression of the mal regulon (malG–lacZ activity of culture, •) and the sum of the different malT mutations (○) in the population at given time points. The proportion of the population with iodine-negative phenotype and the malG–lacZ activity of the culture were estimated as described in the Materials and Methods section.
FDynamics of mutational changes in a chemostat population. (A) The total collection of the 2,709 mutations in 1,717 genes or regions identified in a glucose-limited chemostat population of E. coli K-12 strain BW2952. Each line represents a gene in which a mutation had occurred. (B) An extended view of the subset of genes from (A) in which mutant sequences were present in less than 20% of the population during the 60 days. (C) The total load of mutations in the population over 60 days. The counts of SNPs, single base-pair indels (S-indels), large indels (L-indels), and duplication mutations that were detected in at least 1% of the total population across 11 different samples collected over 60 days of continuous culture are shown. Frequencies of each mutation were estimated from the number of reads and total coverage at that particular genome position.
FDynamics of frequency changes of alleles within individual genes. The genomic position of each mutation in each gene is listed in the supplementary table S1, Supplementary Material online. Labels are shown for only the more dominant 5 of the 26 nonsynonymous malT alleles.
FSummary of events superimposed on the trajectories of malT allele frequencies. The mutations peaking at particular times are here superimposed on the malT allele sweeps shown in figure 4C (malT panel) to show the combinations of mutations participating in replacements over the 60 days. The association of different alleles of rpoS, mglD, malT, malE, and sdhC were based on the allelic frequency dynamics shown in figure 4 and verified by sequencing of individual isolates (see text for more detail).
FDynamics of frequency changes of each large indel and duplication mutation found in the chemostat population. The genomic position of each mutation in each gene is listed in the supplementary table S1, Supplementary Material online.