Literature DB >> 26496947

Transient overexpression of DNA adenine methylase enables efficient and mobile genome engineering with reduced off-target effects.

Rebecca M Lennen1, Annika I Nilsson Wallin1, Margit Pedersen1, Mads Bonde1, Hao Luo1, Markus J Herrgård1, Morten O A Sommer2.   

Abstract

Homologous recombination of single-stranded oligonucleotides is a highly efficient process for introducing precise mutations into the genome of E. coli and other organisms when mismatch repair (MMR) is disabled. This can result in the rapid accumulation of off-target mutations that can mask desired phenotypes, especially when selections need to be employed following the generation of combinatorial libraries. While the use of inducible mutator phenotypes or other MMR evasion tactics have proven useful, reported methods either require non-mobile genetic modifications or costly oligonucleotides that also result in reduced efficiencies of replacement. Therefore a new system was developed, Transient Mutator Multiplex Automated Genome Engineering (TM-MAGE), that solves problems encountered in other methods for oligonucleotide-mediated recombination. TM-MAGE enables nearly equivalent efficiencies of allelic replacement to the use of strains with fully disabled MMR and with an approximately 12- to 33-fold lower off-target mutation rate. Furthermore, growth temperatures are not restricted and a version of the plasmid can be readily removed by sucrose counterselection. TM-MAGE was used to combinatorially reconstruct mutations found in evolved salt-tolerant strains, enabling the identification of causative mutations and isolation of strains with up to 75% increases in growth rate and greatly reduced lag times in 0.6 M NaCl.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26496947      PMCID: PMC4770203          DOI: 10.1093/nar/gkv1090

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The ability to rapidly and precisely manipulate the genome of E.coli was first revolutionized by the development of mobile plasmids and prophages encoding inducible λ Red recombinase (1,2). The success of these systems was due to their mobility and curability as plasmids with temperature-sensitive origins, the enhanced rates of recombination afforded by expressing a viral recombinase, and the natural propensity of the γ subunit of λ Red to inhibit E. coli RecBCD exonuclease V. The latter feature allowed the transformation of linear double-stranded DNA (dsDNA) cassettes without degradation occurring too rapidly relative to recombination. It was additionally discovered that single-stranded DNA (ssDNA) oligonucleotides could be used to mediate homologous recombination without the drawbacks of degradative activity against dsDNA, with only the β subunit of λ Red recombinase being required for recombination (3). This discovery ultimately led to the development of more specific techniques for ssDNA oligonucleotide-mediated recombineering (4) and multiplex automated genome engineering (MAGE), where an electroporation mixture is cycled multiple times with multiplexed pools of oligonucleotides synthesized individually or on arrays to generate a combinatorial library of mutants (5,6). Due to its ability to rapidly generate diversity at precisely controlled locations, MAGE has opened up a diverse array of applications in the fields of metabolic engineering, synthetic biology and evolutionary biology. In metabolic engineering, these include generation of precise mutant libraries of interest, such as ribosome binding site libraries targeting one or more genes, promoter libraries and simultaneous manipulation of expression of genes in multiple pathways (5–7). In synthetic biology, applications have thus far included genome-wide genetic reprogramming, such as replacement of all UAG stop codons with UAA (8) and rare codon replacement in a panel of essential genes (9). In evolutionary biology, MAGE has been used to combinatorially reconstruct mutations identified in adaptive evolution experiments to assess causality (10) and condition-specific selection of antibiotic resistance (11). In spite of the wide range of opportunities presented by MAGE, applications have lagged due to some limitations of the MAGE systems published to date. Most notably, the majority of previously reported systems achieved high efficiencies of replacement of single nucleotides through the use of strains defective in mismatch repair (MMR) (by deletion of mutS, encoding one subunit of the MutHLS methyl-directed mismatch repair system) (4,5). Strains with disabled MMR systems achieve multiple orders-of-magnitude increases in efficiency for replacement of targeted single nucleotides, however, they exhibit a mutator phenotype with approximately 1000-fold increased spontaneous mutation rates relative to strains with intact MMR (12,13). These off-target mutations can potentially obfuscate results obtained from libraries that undergo large numbers of MAGE cycles or in MAGE applications where the generated combinatorial library is subsequently subjected to selections to isolate mutants with improved phenotypes. Resulting isolated mutants also exhibit genomic instability unless MMR is restored, thus deterring efforts to perform high-throughput screening or subsequent strain manipulations. Additional minor limitations include the use of relatively non-mobile prophage-based systems, often with temperature-inducible recombinases (4,5). These restrict the ability to easily remove the system following MAGE and limit growth both during and after MAGE to temperatures of 32°C or lower, respectively. The limitations in using MMR defective strains have previously been addressed through the use of non-MMR defective strains together with modifications that evade mismatch repair. This includes strategies such as generating mutations using particular mismatching base pairs (e.g. C:C or A:G) (4), forcing the mutation of more than five bases in a row (4), and the use of chemically modified bases that can functionally replace standard bases while evading mismatch repair (14). The former two strategies are undesirable as they significantly limit the scope of mutations possible to introduce by MAGE and preclude many applications such as reconstructing combinations of specific mutations observed in evolved strains and constructing fully diversified ribosome binding site libraries. The latter strategy introduces very significant cost increases for oligonucleotides and greatly reduced maximum measured allelic replacement efficiencies of up to 5-fold were observed for modified G (iso-deoxyguanosine) and A (2,6-diaminopurine) bases (14). An alternative strategy for overcoming the high off-target mutation rate induced by permanent disabling of MMR is the use of conditionally active mismatch repair proteins (15). A MAGE host strain was constructed by Nyerges et al. by introducing previously discovered mutations in MutS and MutL that result in temperature-sensitive mismatch repair activity (16) plus a prophage containing a temperature-inducible λ Red recombinase. This strain exhibited comparable allelic replacement efficiencies to a mutS− strain of up to 25%, however, the spontaneous mutation rate remained low only at temperatures below 32°C. While a promising strategy, the system would be difficult to employ in alternative host strains and would require either maintaining resulting isolated mutants at or below 32°C, or undergoing a time-consuming process to remove the MAGE system and revert the MMR proteins back to wild-type alleles. In this study, we report a new plasmid-based MAGE system, Transient Mutator Multiplex Automated Genome Engineering (TM-MAGE) (Figure 1A), that overcomes the limitations of previously reported methods that include high off-target mutation rates, lack of mobility and curability of the system, and the lack of ability to cultivate resulting mutant strains at 37°C. This is accomplished by transiently disabling MMR during MAGE cycling by inducible overexpression of E. coli Dam methylase, together with overexpression of the β subunit of λ Red in an artificial operon on a plasmid. The dam gene target was identified from an earlier screening study searching for native genes that increased the spontaneous mutation rate in E. coli when overexpressed (17). The efficiency and off-target mutation rates were measured for this system and compared to the EcNR2 strain (5), a plasmid-based system harboring only β in a mutS+ host, and the same system in a mutS− host. To further demonstrate the utility of this system, MAGE was used to introduce several mutations that had previously been identified in strains evolved for growth in high NaCl concentrations (18). Subsequent selection was used to enrich the population in these mutants, and the off-target mutation rate was shown to be substantially lower using TM-MAGE than performing MAGE with a plasmid-based system in a mutS− strain. A version of the plasmid containing the sacB gene (encoding levansucrase from Bacillus subtilis) enables one-step curing of the system by sucrose counterselection.
Figure 1.

(A) Proposed mechanism of Transient Mutator Multiplex Automated Genome Engineering (TM-MAGE). Chromosomal mutations are introduced via single-stranded oligonucleotide recombination on the lagging strand of replicating DNA, mediated by the β subunit of λ Red recombinase. Under ordinary conditions (left), the newly-synthesized strand is hemimethylated, which enables the MMR system, composed of MutS, MutL and MutH, to bind to mismatched lesions and restore the wild-type allele. In standard MAGE (middle), the MMR system is eliminated, allowing incorporation of the mismatched base and its complement base into the genome of one daughter cell after a second round of replication (red). However, permanent disabling of mismatch repair also allows the accumulation of off-target mutations (yellow) during subsequent manipulations. In TM-MAGE (right), transient overexpression of DNA adenine methylase (Dam) is hypothesized to generate significant methylation of the newly-synthesized strand, disabling detection of incorporated mismatches by the MMR system and reducing the rate of off-target mutations that accumulate in subsequent manipulations as Dam expression returns to its baseline level. (B) Schematic of TM-MAGE system plasmids pMA7 (top) and pMA7SacB (bottom). Plasmid pMA7 contains an artificial operon of the gene encoding the β subunit of λ Red recombinase and dam controlled by an arabinose-inducible (PBAD) promoter. Plasmid pMA7SacB additionally contains constitutively expressed sacB (levansucrase from B. subtilis) to allow for plasmid curing by sucrose counterselection.

MATERIALS AND METHODS

Chemicals, reagents and media

All chemicals and reagents were purchased from Sigma-Aldrich (St. Louis, MO) unless otherwise stated. Tryptone and yeast extract used for media preparation were purchased from Sigma-Aldrich. LB medium (Lennox) used for cultivations during MAGE cycling contained 10 g/l tryptone, 5 g/l yeast extract and 5 g/l sodium chloride (Sigma). An additional 0.5 mM MgSO4 was added to all LB media to correct an inherent magnesium limitation observed with these media components. Strains harboring plasmids were maintained with 100 μg/ml ampicillin except during growth for MAGE cycling and subsequent selections. MacConkey agar was purchased from Becton Dickinson (Franklin Lakes, NJ).

Oligonucleotides

All oligonucleotide sequences used in this study are shown in Supplementary Table S1. MAGE oligonucleotides used to introduce precise mutations that were previously identified for improved salt tolerance (18) (oligonucleotides 1 through 7) were designed using the MODEST oligonucleotide design tool (19), with the length set to 70 bp, 15 bp minimum end homology and other options kept as default. Other MAGE control oligonucleotides (to introduce premature stop codons in galK and xylA, oligonucleotides 8 and 9) were designed prior to the development of MODEST and feature mutated bases centered in the middle of the oligonucleotide. For demonstration of the maximum achievable MAGE efficiency for a single bp mismatch, a 90-bp oligonucleotide that results in the same xylA premature stop codon was designed with 4 phosphorothioated bases introduced at the 5′ and 3′ termini (oligonucleotide 10). All oligonucleotides were synthesized by Integrated DNA Technologies (Leuven, Belgium) with standard desalting and without additional purification.

Strains

Relevant strains used in this study are shown in Table 1. K-12 MG1655 was kindly donated by Adam Feist (University of California, San Diego), which is equivalent to ATCC 47076/CGSC 6300 (20). EcNR1 and EcNR2 are K-12 MG1655 and K-12 MG1655 mutS::cat harboring a λ prophage at the attB site which contains a temperature inducible λ Red recombinase (5). K-12 MG1655 mutS::cat was constructed by P1vir transduction of mutS::cat from strain EcNR2 into donor strain K-12 MG1655. The mutS replacement was confirmed by colony PCR using oligonucleotides 11 and 12 as primers (Supplementary Table S1).
Table 1.

Strains and plasmids used in this study

Strain/plasmidRelevant genotype/propertyaSource/ Reference
Strains
K-12 MG1655FλilvG- rfb-50 rph-1A. Feist, UCSD
EcNR1K-12 MG1655 λ-Red1 bla bio CmR5
EcNR2K-12 MG1655 λ-Red1 bla bio mutS CmR5
Plasmids
pMA1β subunit of λ Red under PBAD promoter, AmpR, ColE1 origin from pBAD24This study
pMA3dam under PBAD promoter, CmR, ColE1 origin from pBAD18-CmThis study
pMA7β subunit of λ Red and dam under PBAD promoter, AmpR, ColE1 origin from pBAD24This study
pMA9β subunit of λ Red and dam under PBAD promoter, AmpR, RSF1030 originThis study
pMA10β subunit of λ Red and dam under PBAD promoter, AmpR, CloDF13 originThis study
pMA7SacBβ subunit of λ Red and dam under PBAD promoter, sacB from B. subtilis under constitutive promoter, AmpR, ColE1 origin from pBAD24This study

aAbbreviations: Amp, ampicillin; Cm, chloramphenicol; R, resistance.

aAbbreviations: Amp, ampicillin; Cm, chloramphenicol; R, resistance.

Plasmid construction

Plasmid pMA1 (GenBank accession number KT285943) was constructed by PCR amplifying the gene encoding β from strain EcNR1 (5), while also introducing a Shine–Dalgarno sequence immediately upstream of the start codon, using primers 13 and 14 and inserting the PCR product between the NheI and HindIII sites in pBAD24 (21). Plasmid pMA3 was constructed by amplifying dam from K-12 MG1655, while also introducing a Shine–Dalgarno sequence immediately upstream of the start codon, using primers 15 and 16 and inserting between the NheI and HindIII sites in pBAD18-Cm (21). Plasmid pMA7 (GenBank accession number KT285941) was constructed by standard USER cloning (22) of three fragments, with the first fragment amplified using primers 17 and 18 from template pMA1, the second fragment amplified using primers 19 and 20 from template pMA1, and the third fragment amplified using primers 21 and 22 from template pMA3. Plasmid pMA7SacB (GenBank accession number KT285942) was constructed using a Gibson Assembly Kit (New England Biolabs). Primers 23 and 24 were used to PCR amplify the parental pMA7 plasmid. The sacB gene was amplified from E. coli DA16543 (donated by Dan Andersson, Uppsala University) using primers 25 and 26. Resulting DNA fragments were assembled according to the manufacturer's instructions. All plasmids generated in this study are listed in Table 1.

MAGE cycling

Starter cultures of K-12 MG1655 harboring plasmids pMA7, pMA1 or pMA7SacB, or strain K-12 MG1655 mutS::cat harboring plasmid pMA1 were grown overnight in LB + 100 μg/ml ampicillin at 37°C with 250 rpm shaking. Strain EcNR2 was grown in LB overnight at 30°C with 250 rpm shaking. After ∼16 h, 15 ml of LB medium (containing 100 μg/ml ampicilling where appropriate) in 50 ml Falcon tubes was inoculated to an initial OD600 of 0.05, and incubated at either 37°C (K-12 MG1655 strains) or 30°C (EcNR2) until the OD600 reached 0.6–0.7. For strains harboring plasmids pMA7, pMA1 or pMA7SacB, the recombinase and dam functions were induced by addition of L-arabinose to a final concentration of 0.2% w/v, and incubation with shaking was continued for 10 min. For strain EcNR2, the recombinase function was induced by transferring the culture to a 42°C water bath with 200 rpm linear shaking for 15 min. Cultures were then moved to an ice water bath and chilled for 15 min. Cells were centrifuged in Falcon tubes at 4000 x g for 5 min at 4°C, the supernatant was decanted and cells were subsequently resuspended in 30 ml, and thereafter 1 ml of autoclaved MilliQ water. Cells were transferred to 2.0 ml microcentrifuge tubes and centrifuged at 4000 x g for 5 min at 4°C, the supernatant was removed by pipetting, and cells were resuspended in 0.2 ml of autoclaved MilliQ water. Fifty microliters of cells were then transferred to 1 mm gap electroporation cuvettes, 0.5 μl of one oligonucleotide (for oligonucleotides 8, 9 or 10 resuspended to 100 nmol/ml in MilliQ water) or 3.5 μl of a mixture of 7 oligonucleotides (for oligonucleotides 1 through 7, pre-mixed in a tube with a total concentration of 100 nmol/ml) was added, and mixed with a Pasteur pipette. Cells were immediately electroporated using an ECM 399 electroporation system (Harvard Apparatus, Holliston, MA) at 1.8 kV and 36 μF (resulting in a 5 ms pulse length) and resuspended in 950 μl LB. The entire contents of the electroporation cuvette were either transferred into 50 ml Falcon tubes containing 14 ml of LB when more cycles were to be performed in a single day (for 6 cycle experiments, 3 cycles were performed per day), or into a 14 ml culture tube containing 1 ml of LB when the final cycle of the day was completed. Cultures were grown overnight at 37°C (or 30°C for strain EcNR2) with 250 rpm shaking to allow for chromosome segregation. For experiments performed during initial optimization of dam expression, optimization of the pMA7 system, and fluctuation tests for determination of mutation rates, an identical procedure was performed except that 15 ml of LB was inoculated 1:60 with starting culture, and centrifugation conditions were 6500 x g for 7 min at 4°C for the first two spins and 7000 x g for 30 s at 4°C for the final spin in a microcentrifuge. Induction of either arabinose-inducible or temperature-inducible β or λ Red was performed as previously described. For optimization of dam expression, cells were either non-induced, induced before chilling and centrifugation, induced immediately after electroporation, or induced both before chilling and centrifugation and after electroporation.

Plasmid curing

Following MAGE cycling and plating, individual colonies were struck as wedges on plates containing 10 g/l tryptone, 5 g/l yeast extract, 15 g/l agar, and 5% w/v sucrose (added from a 50% w/v stock after autoclaving the agar and media components) and grown at 37°C overnight. The next day, individual colonies were restruck on LB agar and LB agar containing 100 μg/ml ampicillin to verify plasmid removal by loss of ampicillin resistance.

Fluctuation tests

For the non-induced fluctuation tests, cultures were grown to saturation overnight in LB, or when pMA7 or pMA1 plasmids were present, LB + 100 μg/ml ampicillin. The following day, ten tubes containing 1.5 ml or 5 ml LB were inoculated with 5 μl of the saturated cultures. For non-induced state fluctuation tests, cultures were grown again to saturation overnight. For induced state fluctuation tests, cultures were grown to OD600 0.4, induced with 0.2% L-arabinose, and electrocompetent cells were prepared as described above. Following electroporation with 5 pmol of oligonucleotide 45, cells were grown to saturation in 5 ml of LB overnight. The following day, 10 μl (if a mutator strain background) or 200 μl (if a non-mutator strain background) was plated on LB agar containing 50 mg/ml rifampicin, and 100 μl (if a mutator strain background) or 1 ml (if a non-mutator strain background) was plated on LB agar containing 50 mg/ml nalidixic acid. Cells were also serially diluted and plated on LB agar to determine total viable cell counts. Mutation frequencies were determined as the ratio of the number of colonies per ml culture appearing on rifampicin or nalidixic acid plates to the total number of viable colonies per ml culture.

Selections of combinatorial populations generated by MAGE

One milliliter of MAGE library cultures that were generated after 6 cycles using the 7 oligonucleotide mixture (1 through 7) were washed once to remove LB medium by centrifuging at 4000 x g for 1 min, removing the supernatant, and resuspending in 1 ml of M9 medium. The resulting cell suspensions were diluted 1:100 (25 μl inoculum in 2.5 ml medium) into either M9 medium or M9 containing an additional 0.5 M NaCl in triplicates in 24 well plates. The M9 resuspended libraries were also serially diluted and plated on M9 agar or M9 agar + 0.6 M NaCl. After approximately 24 h, liquid cultures were passaged to an initial OD600 of 0.05 in fresh M9 medium, or in M9 medium containing 0.6 M NaCl, and this was repeated daily for 5 days (Supplementary Figure S3). Each day, approximately 1.5–2.0 × 109 cells were harvested from each culture by centrifugation at 16 000 x g for 2 min, and cell pellets were stored at −20°C. Following the fifth day, the cultures passaged in M9 + 0.6 M NaCl were serially diluted and spread on plates containing M9 agar + 0.6 M NaCl, to isolate individual colonies. Individual colonies were selected and regrown on LB agar for subsequent screening and analysis.

Determination of allele replacement efficiencies

Oligonucleotides 8, 9 and 10 were designed to introduce premature stop codon mutations in xylA (encoding xylose isomerase) and galK (encoding galactokinase), thereby rendering cells incapable of catabolizing D-xylose or D-galactose. To assess replacement efficiencies, cultures electroporated with these control oligonucleotides were serially diluted in phosphate buffered saline and plated on MacConkey agar plates containing 1% w/v D-xylose (for oligonucleotides 8 and 10) or 1% w/v D-galactose (for oligonucleotide 9). Plates were incubated overnight at 37°C and colonies that were either stained pink (wild-type) or white (mutant) were counted. Another measure of allele replacement efficiency, which also enabled determination of replacement efficiencies using the pooled NaCl oligonucleotides (1 through 7), was performed using amplicon sequencing (23). Briefly, cell pellets containing an estimated 5.0 × 108 to 1.5 × 109 cells (assuming OD600 1.0 = 1 × 109 cells/ml) were harvested by centrifugation at 16 000 x g for 2 min, either during exponential growth following an intermediate daily cycle, or following overnight outgrowth for the only or last daily cycle performed. The supernatant was removed and cell pellets were stored at −20°C until further use. Selected amplicons spanning a 150–170 bp region around targeted mutations were amplified by colony PCR using primers containing 5′ overhangs complementary to priming sequences for Nextera XT indexed adapters (Supplementary Table S1, primers 27 through 44). Each PCR reaction contained 5 μl of 2X Phusion HotStart II master mix (Thermo Scientific), 0.25 μl of forward primer (20 µM), 0.25 μl of reverse primer (20 µM), 3.5 μl water and 1 μl of resuspended cells (each frozen cell pellet was resuspended in 300 μl MilliQ water), and was run in a thermal cycler with the following program: (i) 98°C for 3 min, (ii) 98°C for 10 s, (iii) 56°C for 30 s, (iv) 72°C for 10 s, (v) Repeat steps 2–4 25 times, (vi) 72°C for 5 min and (vii) hold at 4°C. Different amplicons that could be sequenced under the same index were pooled together by combining 5 μl of each PCR product and adjusting the final volume to 50 μl with water. The pooled PCR reactions were cleaned up using Agencourt AmpPure XP beads (Beckman Coulter) per manufacturer directions, with the final elution in 52.5 μl of 10 mM Tris-HCl pH 8.5, of which 50 μl was transferred to a new PCR plate. Pooled, cleaned-up PCR products were amplified in a second PCR reaction with Nextera XT index 1 and 2 primers supplied with the Nextera XT 96 Index Kit (either 24 indices or 96 indices). Each PCR reaction contained 12.5 μl of 2X KAPA HiFi HotStart ReadyMix (KAPA Biosystems), 2.5 μl of a Nextera XT index 1 primer, 2.5 μl of a Nextera XT index 2 primer, 5 μl of water and 2.5 μl of pooled, cleaned-up PCR product. The resulting products were cleaned-up again with AmpPure XP beads and resuspended in 27.5 μl of 10 mM Tris-Cl pH 8.5, of which 25 μl was transferred to a new PCR plate. Concentrations of each pooled indexed product was determined using a Qubit dsDNA BR assay kit (Life Technologies, Carlsbad, CA), and PCR products were pooled to a final total concentration of 10 nM of DNA in 10 mM Tris-Cl pH 8.5 containing 0.1% v/v Tween 20. Manufacturer's instructions were followed for subsequent sequencing of libraries on a MiSeq sequencer (Illumina, San Diego, CA).

Analysis of amplicon sequencing data

A custom MATLAB (MathWorks, Inc., Natick, MA) script was used to count the frequencies of wild-type and mutant alleles by searching fastq files for a 25-bp sequence string flanking the mutation site (Supplementary Table S2), and counting the numbers of wild-type and mutant sequences. The resulting counts were used directly to calculate allelic replacement frequencies (ARFs) as (no. of mutant sequences) / (no. of wild-type sequences + no. of mutant sequences). Three biological replicates were analyzed for allele frequencies during selections of MAGE-generated populations, and P-values were determined by a two-sided t-test when averaged measurements were compared.

Growth screening

Biological triplicate colonies of selected isolates regrown on LB agar were inoculated into 300 μl M9 medium in 96-well deepwell plates and grown overnight at 37°C with 300 rpm shaking. For primary screening, clear-bottomed white 96-well half-deepwell plates (Enzyscreen B.V., Haarlem, Netherlands) containing 270 μl of either M9 medium or M9 medium + 0.667 M NaCl were inoculated with 30 μl of the overnight cultures diluted 1:10 in M9 medium, to bring to a final concentration of 0.6 M NaCl. Plates were clamped down with sandwich covers (Enzyscreen) and incubated at 37°C with 225 rpm shaking in a Growth Profiler (Enzyscreen), which optically scans the bottom of the plates. Scans were recorded every 15 min, and Growth Profiler software was used to integrate green pixels in selected 1 mm diameter circular regions in each well. The green pixel values were converted to approximate OD600 values using a previously established calibration curve. For secondary screening, 48-well FlowerPlates (m2plabs GmbH, Baesweiler, Germany) were inoculated to an initial OD600 of 0.03 (based on the pathlength for 200 μl of diluted cultures in a 96-well plate, as measured on a Synergy Mx plate reader (BioTek Instruments, Winooski, VT)) into M9 medium or M9 containing an additional 0.6 M final concentration of NaCl. The final volume in each well was 1.4 ml. Cultures were incubated in a BioLector microbioreactor system (m2p-labs) at 37°C with 1000 rpm shaking, with monitoring of the light backscatter intensity. Growth curves were baseline-adjusted by subtracting out the minimum light backscatter intensity in each well from all other values in that well.

Whole genome resequencing of MAGE isolates

Single colonies from each isolate regrown on LB plates were inoculated into LB medium and grown overnight at 37°C with 250 rpm shaking. Cell pellets were frozen at −20°C following centrifugation of 0.5 ml of culture at 16 000 x g for 2 min and removal of the supernatant. Genomic DNA was extracted from cell pellets using the PureLink Genomic DNA Mini Kit (Life Technologies) following the manufacturer's instructions, and was further concentrated and purified by ethanol precipitation as previously described (24). Genomic DNA concentrations used to calculate input DNA for library preparation (200 ng in 55 μl volume) were quantified using a Qubit broad-range dsDNA assay (Invitrogen). Libraries were prepared for sequencing using the TruSeq Nano HT Library Preparation Kit (Illumina). Sequence variants were determined from sequencing fastq output files using breseq (25) version 0.24rc6 and NCBI Reference Sequence NC_000913.3 as the reference sequence, with a base quality cutoff of 20. All called mutations were present in >80% of the spanning reads and in regions with coverage of more than 20-fold, with the exception of a nagA A203E mutation which did not meet these criteria but was detected in strain mutS_pMA1_NaCl_20.

RESULTS

Feasibility of dam overexpression for MAGE

A requirement for high replacement efficiencies for the majority of single base pair mutations by λ Red-mediated recombination of single-stranded oligonucleotides is inactivation of the MMR system. This is typically achieved by deletion of mutS or other genes in the MMR system, however, permanent inactivation can lead to a greatly elevated spontaneous mutation rate both during and after MAGE. Therefore, we reasoned that a gene that generates a mutator phenotype when overexpressed would allow MAGE to be performed with transient inactivation of MMR. The dam gene was previously identified as the top hit in a screen for genes that increase mutation rates when overexpressed (17). Therefore dam was selected for assessment of whether overexpression could enable high-efficiency allelic replacement through λ Red mediated recombination. A single cycle of MAGE was performed using oligonucleotide 9 (to introduce a premature stop codon in the galK gene) and strains EcNR1 (mutS) and EcNR2 (mutS−) harboring pMA3, a medium-copy plasmid containing the dam gene under control of an arabinose-inducible promoter. Four different dam induction procedures were tested: non-induced, induced immediately after electroporation, induced together with the λ Red genes before electrocompetent cell preparation, and induced both before electrocompetent cell preparation and after electroporation. Allelic replacement frequencies (ARFs) were assessed by counting white (positive) and red colonies appearing on MacConkey agar containing 1% galactose (Supplementary Figure S1). As expected, EcNR2 exhibited similar ARFs (6–10%) regardless of dam induction. In strain EcNR1, an elevated ARF in strain EcNR2 was achieved upon induction of dam together with λ Red, thereby demonstrating the feasibility of dam overexpression as an alternative to the use of permanent hypermutator strains(Figure 1A).

Development of a mobile MAGE system

In order to develop a mobile system that could be used for growth at 37°C, the β subunit of λ Red recombinase was cloned into pBAD24 (21) under control of an arabinose-inducible promoter to generate plasmid pMA1 (GenBank accession number KT285943). Next, dam was cloned in an artificial operon downstream of β in pMA1 to generate plasmid pMA7 (GenBank accession number KT285941) (Figure 1B). Finally, as it would be desirable to remove any plasmid-based MAGE system following its usage to generate mutations on the genome, the sacB gene was cloned into pMA7 to generate plasmid pMA7SacB (GenBank accession number KT285942), thus enabling plasmid curing by sucrose counterselection (Figure 1B). (A) Proposed mechanism of Transient Mutator Multiplex Automated Genome Engineering (TM-MAGE). Chromosomal mutations are introduced via single-stranded oligonucleotide recombination on the lagging strand of replicating DNA, mediated by the β subunit of λ Red recombinase. Under ordinary conditions (left), the newly-synthesized strand is hemimethylated, which enables the MMR system, composed of MutS, MutL and MutH, to bind to mismatched lesions and restore the wild-type allele. In standard MAGE (middle), the MMR system is eliminated, allowing incorporation of the mismatched base and its complement base into the genome of one daughter cell after a second round of replication (red). However, permanent disabling of mismatch repair also allows the accumulation of off-target mutations (yellow) during subsequent manipulations. In TM-MAGE (right), transient overexpression of DNA adenine methylase (Dam) is hypothesized to generate significant methylation of the newly-synthesized strand, disabling detection of incorporated mismatches by the MMR system and reducing the rate of off-target mutations that accumulate in subsequent manipulations as Dam expression returns to its baseline level. (B) Schematic of TM-MAGE system plasmids pMA7 (top) and pMA7SacB (bottom). Plasmid pMA7 contains an artificial operon of the gene encoding the β subunit of λ Red recombinase and dam controlled by an arabinose-inducible (PBAD) promoter. Plasmid pMA7SacB additionally contains constitutively expressed sacB (levansucrase from B. subtilis) to allow for plasmid curing by sucrose counterselection.

Optimization of the pMA7 system for MAGE

To determine the optimal induction time for the artificial operon containing dam and β, single cycles of MAGE were performed with control oligonucleotide 9 (which introduces a premature stop codon in galK, preventing catabolism of galactose) following induction of cultures with 0.2% w/v L-arabinose for time periods between 5 and 30 min. The frequency of replacement was assessed by plating cells on MacConkey agar containing 1% galactose (Figure 2). A maximum number of recombinants were obtained after 10 min induction time, therefore, this induction time was chosen for subsequent recombineering experiments. In addition to the use of the pBAD24 backbone, systems were also constructed by USER cloning to introduce origins of replication from pRSFDuet-1 and pCDFDuet-1 (Novagen) to generate plasmids pMA9 and pMA10, respectively. These systems were tested in K-12 MG1655 using control oligonucleotide 45 to introduce a premature stop codon in the malK gene, rendering cells that receive the mutation unable to catabolize maltose. Based on counting red and white colonies on MacConkey agar plates containing 1% maltose, no statistically significant differences in the efficiency of replacement were observed between these systems and pMA7. Therefore the pMA7 system was selected for further characterization.
Figure 2.

Allelic replacement efficiency (percent recombinants) with various lengths of Dam induction in K-12 MG1655/pMA7 prior to chilling cells for electrocompetent cell preparation. Ten minutes was found to result in optimal allelic replacement frequencies (ARFs) following a single cycle of MAGE with an oligonucleotide introducing a premature stop codon in galK (Y145*). ARFs were determined by counting red (wild-type allele) and white (mutant allele) colonies appearing following plating on MacConkey agar containing 1% galactose.

Allelic replacement efficiency (percent recombinants) with various lengths of Dam induction in K-12 MG1655/pMA7 prior to chilling cells for electrocompetent cell preparation. Ten minutes was found to result in optimal allelic replacement frequencies (ARFs) following a single cycle of MAGE with an oligonucleotide introducing a premature stop codon in galK (Y145*). ARFs were determined by counting red (wild-type allele) and white (mutant allele) colonies appearing following plating on MacConkey agar containing 1% galactose.

Assessment of system performance

The replacement efficiencies obtained with K-12 MG1655 harboring the pMA7 and pMA7SacB systems (TM-MAGE) were compared to strain EcNR2 using control oligonucleotide 10 (which introduces a premature stop codon in xylA, preventing catabolism of D-xylose). This oligonucleotide had a length of 90 bp and incorporated four phosphorothioated bases from the 5′ terminus, as this length and 5′ phosphorothioation was determined to yield optimal ARFs when introducing a nonsense mutation in lacZ (5). Electroporated cells were outgrown overnight, and ARFs were determined both by plating on MacConkey agar containing 1% w/v D-xylose and by amplicon sequencing. The use of both pMA7 and pMA7SacB resulted in approximately 20% ARF of xylA with the xylA Y13* allele, as measured both by plate counts and amplicon sequencing (Figure 3). The use of EcNR2 resulted in a similar 17% ARF as measured by plate counts, but interestingly only a 3.5% ARF as measured by amplicon sequencing when searching for an exact sequence match to a 25 bp region centered on the mutant or wild-type base. This percentage increased to only 5.8 ± 0.7% when first searching for sequences containing the forward priming sequence for xylA amplicon sequencing (43 in Supplementary Table S1) and then searching only for sequences containing the mutant or wild-type base and three preceding bases (TTCC or TTCA, respectively). Qualitative inspection of all sequences containing the forward priming sequence indicated an abundance of additional off-target mutations within the region complementary to the oligonucleotide. These mutations were likely responsible for alternative inactivations of xylA and could thus account for numerous false positives on plates. Far fewer off-target mutations were observed with the pMA7 and pMA7SacB systems, as evidenced by comparable ARFs determined by both plate counting and amplicon sequencing.
Figure 3.

Transient expression of Dam enables efficient recombineering. ARFs for xylA Y13* in strains EcNR2, K-12 MG1655/pMA7 and K-12 MG1655/pMA7SacB following one cycle of MAGE with a 90 bp phosphorothioated oligonucleotide. ARFs were determined both by counting red (wild-type allele) and white (mutant allele) colonies appearing following plating on MacConkey agar containing 1% xylose, and by amplicon sequencing within the xylA locus. Error bars indicated standard deviations about the mean of three replicates (two for K-12 MG1655/pMA7SacB).

Transient expression of Dam enables efficient recombineering. ARFs for xylA Y13* in strains EcNR2, K-12 MG1655/pMA7 and K-12 MG1655/pMA7SacB following one cycle of MAGE with a 90 bp phosphorothioated oligonucleotide. ARFs were determined both by counting red (wild-type allele) and white (mutant allele) colonies appearing following plating on MacConkey agar containing 1% xylose, and by amplicon sequencing within the xylA locus. Error bars indicated standard deviations about the mean of three replicates (two for K-12 MG1655/pMA7SacB). In a separate experiment, ARFs were compared over 6 MAGE cycles between K-12 MG1655/pMA7 (TM-MAGE), K-12 MG1655/pMA1 and K-12 MG1655 mutS::cat/pMA1 separately using oligonucleotides 8 and 10 (which introduce premature stop codons in galK and xylA, preventing catabolism of D-galactose and D-xylose, respectively) (Figure 4). These oligonucleotides have lengths of 70–72 bp and do not contain any phosphorothioated bases. Amplicon sequencing was performed using cells collected during exponential growth following each cycle or after overnight outgrowth (for the third and sixth cycles). Increasing ARFs with a saturating trend were observed using both control oligos, in general agreement with predictions of replacement frequency as a function of cycle number (5). The pMA7 system yielded similar but slightly reduced ARFs for most cycles compared to the hyper-mutator strain (mutS−) harboring the pMA1 system. The pMA1 system had a maximum ARF of only 0.3% after 6 cycles, as expected due to active mismatch repair. Counting of white and red colonies on MacConkey agar plates following the sixth cycle yielded nearly identical ARF values to those measured by amplicon sequencing and counting exact matches to a 25 bp region centered on the wild-type or mutant base.
Figure 4.

ARF from TM-MAGE with pMA7 are nearly equivalent to those obtained when mismatch repair is permanently disabled. ARFs for galK Y145* (top) and xylA Y13* (bottom) using 70-bp non-modified oligonucleotides were determined by amplicon sequencing using cell pellets obtained following each of 6 cycles of MAGE in strains K-12 MG1655/pMA1, K-12 MG1655/pMA7 and K-12 MG1655 mutS::cat/pMA1. ARFs were also determined from counting red (wild-type allele) and white (mutant allele) colonies appearing following plating after 6 cycles of MAGE on MacConkey agar containing galactose or xylose. Measurements were performed on single replicates.

ARF from TM-MAGE with pMA7 are nearly equivalent to those obtained when mismatch repair is permanently disabled. ARFs for galK Y145* (top) and xylA Y13* (bottom) using 70-bp non-modified oligonucleotides were determined by amplicon sequencing using cell pellets obtained following each of 6 cycles of MAGE in strains K-12 MG1655/pMA1, K-12 MG1655/pMA7 and K-12 MG1655 mutS::cat/pMA1. ARFs were also determined from counting red (wild-type allele) and white (mutant allele) colonies appearing following plating after 6 cycles of MAGE on MacConkey agar containing galactose or xylose. Measurements were performed on single replicates.

Curing of pMA7SacB

After 6 cycles of MAGE using either K-12 MG1655/pMA7SacB or pMA1, white colonies (xyl) growing on MacConkey agar plus 1% D-xylose were cured as described in methods. Colonies were obtained overnight on sucrose and removal of pMA7SacB was verified by lack of growth on LB plates containing ampicillin (Supplementary Figure S2). Additionally, retainment of the xylA Y13* allele following curing was qualitatively verified by plating on MacConkey agar plus 1% D-xylose. Similar results with overnight curing of the plasmid were obtained with subsequent usage of pMA7SacB in our laboratory for the introduction of other point mutations, with routine removal of the plasmid by a single streaking and overnight growth on sucrose-containing plates.

Determination of mutation frequencies resulting from dam overexpression

While dam overexpression generates comparable ARFs as when mutS− background hosts are employed, the primary goal of finding an alternative inducible system that disables mismatch repair is to reduce the background mutation rate compared to that found in mutS− hosts. Therefore, fluctuation tests were performed to measure the relative mutation rates in K-12 MG1655 harboring pMA1, pMA7 and K-12 MG1655 mutS::cat harboring pMA1, both with and without induction of β and dam expression by addition of L-arabinose. Strains without plasmids were also tested as controls without induction. In non-induced tests, K-12 MG1655 harboring no vector, pMA1 and pMA7 exhibited similar median resistant mutation frequencies of between 1.0 to 4.7 × 10−8 for rifampicin, and 1.3 to 3.5 × 10−9 for nalidixic acid (Figure 5A). K-12 MG1655 mutS::cat harboring no vector and pMA1 exhibited mutation frequencies that were approximately 2 orders of magnitude higher for rifampicin and between 2 to 3 orders of magnitude higher for nalidixic acid. Thus, it is apparent that pMA7 does not increase mutation rates without or prior to arabinose induction, indicating effective repression of dam overexpression in the absence of inducing agent. With arabinose induction, median frequencies of resistant mutations for both rifampicin and nalidixic acid remained similar (within about 1 order of magnitude) to those without induction for K-12 MG1655/pMA1 and K-12 MG1655 mutS::cat/pMA1, while those for K-12 MG1655/pMA7 increased by between 1 and 2 orders of magnitude, demonstrating an inducible mutator phenotype while maintaining a 12- to 33-fold reduced rate of mutation following induction as compared with a mutS− strain. This reduced mutation frequency is likely at least in part due to the transient nature of dam overexpression relative to the permanently disabled mismatch repair present in mutS strains.
Figure 5.

Spontaneous mutation rates are elevated only when Dam overexpression is induced, and are reduced compared to permanently disabled MMR. (A) Frequencies of rifampicin and nalidixic acid resistant mutants following plating of non-induced strains (K-12 MG1655 with and without pMA1, K-12 MG1655 mutS::cat with and without pMA1 and K-12 MG1655 with pMA7). (B) Frequencies of rifampicin and nalidixic acid resistant mutants arising following plating of induced strains that had undergone a procedure mimicking a single MAGE cycle. Individual replicate measurements are shown, with the median value denoted by a horizontal black line.

Spontaneous mutation rates are elevated only when Dam overexpression is induced, and are reduced compared to permanently disabled MMR. (A) Frequencies of rifampicin and nalidixic acid resistant mutants following plating of non-induced strains (K-12 MG1655 with and without pMA1, K-12 MG1655 mutS::cat with and without pMA1 and K-12 MG1655 with pMA7). (B) Frequencies of rifampicin and nalidixic acid resistant mutants arising following plating of induced strains that had undergone a procedure mimicking a single MAGE cycle. Individual replicate measurements are shown, with the median value denoted by a horizontal black line.

System performance with subsequent selections

One of the primary applications of MAGE is the generation of combinatorial libraries of a set of mutant alleles. In order to identify beneficial phenotypes from the resulting complex pool, performing subsequent high-throughput screening or selections is necessary. In the case when selections are performed, it was anticipated that low ARFs (e.g. the pMA1 system in a mutS+ background) will lead to insufficient diversity in the starting pool of mutants and will hamper efforts to isolate strains with highly improved phenotypes derived from combinatorial mutations. It was also anticipated that in strains with permanently disabled MMR (e.g. mutS−), accumulation of off-target mutations during the selection process would render phenotype to genotype relationships difficult to establish. TM-MAGE, which enables both high efficiency allelic replacement to achieve both a high diversity of combinatorial mutants and longer term genomic stability following MAGE cycling, was expected to lead to the isolation of strains with more improved phenotypes and with an improved ability to assign phenotype to genotype. This process was tested by constructing combinatorial mutant libraries in strains K-12 MG1655/pMA7 (TM-MAGE), K-12 MG1655/pMA1 (no mismatch repair) and K-12 MG1655 mutS::cat/pMA1 (permanently disabled mismatch repair). A pool of 7 oligonucleotides (1 through 7 in Supplementary Table S1) was transformed to introduce mutations that were previously identified in strains evolved for growth in high concentrations of NaCl (18). These oligonucleotides were designed to either directly introduce selected single nucleotide polymorphisms (SNPs) from the G6 and A3 strains (18), or to introduce premature stop codons at the nearest possible location to where insertion element sequences were located (as these mutations represent probable losses-of-function). The pooled oligonucleotides were transformed over 6 cycles of MAGE and ARFs were determined using amplicon sequencing of harvested cells after MAGE cycling. These frequencies are shown as the initial data points in Figure 6 for each mutant allele. Sixth cycle ARFs varied for different oligonucleotides and for the different systems, with K-12 MG1655 mutS::cat/pMA1 generally exhibiting the highest ARFs, TM-MAGE using K-12 MG1655/pMA7 exhibiting similar but slightly lower ARFs and K-12 MG1655/pMA1 exhibiting greatly reduced ARFs. The mrdA Q51L (T:A) and proV K12* (A:T) mutations, and to a lesser extent the nagA A203E (G:T) mutation, appeared to largely evade mismatch repair in K-12 MG1655/pMA1 by unknown mechanisms.
Figure 6.

Population frequencies of mutant alleles (indicated at right) as determined by amplicon sequencing following 6 cycles of MAGE and after 1, 3 and 5 days of daily passaging in M9 medium or M9 medium plus NaCl (0.5 M for the first passage and 0.6 M for subsequent passages). Libraries were generated in K-12 MG1655/pMA7 (black bars), K-12 MG1655/pMA1 (gray bars) and K-12 MG1655 mutS::cat/pMA1 (white bars). Error bars indicate standard deviations about the mean values of three biological replicate cultures. Only a single replicate was sequenced following 6 cycles of MAGE, therefore, no error bars are shown.

Population frequencies of mutant alleles (indicated at right) as determined by amplicon sequencing following 6 cycles of MAGE and after 1, 3 and 5 days of daily passaging in M9 medium or M9 medium plus NaCl (0.5 M for the first passage and 0.6 M for subsequent passages). Libraries were generated in K-12 MG1655/pMA7 (black bars), K-12 MG1655/pMA1 (gray bars) and K-12 MG1655 mutS::cat/pMA1 (white bars). Error bars indicate standard deviations about the mean values of three biological replicate cultures. Only a single replicate was sequenced following 6 cycles of MAGE, therefore, no error bars are shown. Following 6 MAGE cycles and overnight outgrowth, cells were plated on M9 agar with or without 0.6 M NaCl, and were also inoculated into liquid cultures of M9 medium with or without 0.5 M NaCl (this concentration was increased to 0.6 M NaCl with subsequent passages). Daily passaging was performed with the liquid cultures (Supplementary Figure S3), and after five passages and regrowth, cells were plated on M9 agar containing 0.6 M NaCl to select individual isolates. Additionally, after days 1, 3 and 5 of liquid culture passaging, cells were harvested for determination of the mutant allele frequency dynamics during selection using amplicon sequencing (Figure 6). Enrichment of the frequency of a mutation as a function of selection time in high NaCl concentrations is indicative of causality for salt tolerance, whether alone or in combination with other mutations. Only the mrdA Q51L, msrB C118F, nagA A203E and rpsA Q421K mutations were uniformly enriched during passaging in M9 + 0.6 M NaCl following MAGE in the mutS/pMA1 and pMA7 harboring strains. With the exception of the nagA A203E mutation in the population generated with pMA7, these trends were specific to selections in high salt and mutations were not observed to be enriched by passaging in M9 only. The proV K12* mutant was additionally enriched by passaging in high NaCl concentrations in mutS/pMA1 (P = 0.002) but not significantly for the pMA7-generated population. Following performing MAGE in K-12 MG1655/pMA1, no enrichment pattern could be discerned in high salt conditions, with the exception of nagA A203E. For many mutations, this was likely due to their extremely low starting frequencies as a result of active MMR during MAGE cycling. The mrdA Q51L mutation, which was present at a high starting frequency, exhibited a lack of enrichment in high salt. Because this mutation was enriched over 3–5 days using the other MAGE systems, a likely explanation is that this mutation is only beneficial in combination with one or more other mutations. In summary, enrichment of particular introduced mutations following selection in high salt concentrations could only be successfully observed when high ARFs could be achieved by MAGE (i.e. with either K-12 MG1655 mutS/pMA1 or K-12 MG1655/pMA7). This is most likely due to the presence of strains in the starting population that possess higher numbers of combinatorial mutations (e.g. 3–5) that together impart tolerance to high salt concentrations. From our amplicon sequencing data of the selected populations, four causal mutations for NaCl tolerance out of the original seven introduced mutations could be putatively identified using TM-MAGE: mrdA Q51L, msrB C118F, nagA A203E and rpsA Q421K. The proV K12* mutation was only observed to be enriched in the mutS−/pMA1 background strain, thus it was not possible to assign causality.

Screening and re-sequencing of selected isolates

For each of the three MAGE systems, a total of 20 isolates from M9 + NaCl plates (10 each from plates spread with cells immediately after MAGE cycling, and at the end of serial passaging) were screened for growth in liquid culture (M9 + 0.6 M NaCl) (Supplementary Figure S4). Primary screening results were used to select two or three isolates per system, which underwent secondary screening in a Biolector microbioreactor system (Figure 7). Additionally for each system, three isolates from M9 plates that were spread with cells immediately after MAGE cycling and one xyl− isolate were screened under the same conditions.
Figure 7.

Strains with the highest growth rates and greatly reduced lag times were reliably isolated from NaCl-selected libraries generated using TM-MAGE, as compared to other MAGE systems. (A–C) Averaged growth curves across three biological replicate cultures of control strains and individual isolates (A: K-12 MG1655/pMA7, B: K-12 MG1655/pMA1, C: K-12 MG1655 mutS::cat/pMA1) following 5 days of passaging on M9 + NaCl. (D) Calculated growth rates and (E) lag times of starting strains and isolated strains following M9 + NaCl selections. Error bars represent standard deviations about the mean of three biological replicate measurements.

Strains with the highest growth rates and greatly reduced lag times were reliably isolated from NaCl-selected libraries generated using TM-MAGE, as compared to other MAGE systems. (A–C) Averaged growth curves across three biological replicate cultures of control strains and individual isolates (A: K-12 MG1655/pMA7, B: K-12 MG1655/pMA1, C: K-12 MG1655 mutS::cat/pMA1) following 5 days of passaging on M9 + NaCl. (D) Calculated growth rates and (E) lag times of starting strains and isolated strains following M9 + NaCl selections. Error bars represent standard deviations about the mean of three biological replicate measurements. A total of 20 isolates were whole-genome resequenced with the goal of assessing typical numbers of off-target mutations arising during multiple cycles of MAGE and subsequent selections, as well as validating the causative mutations inferred by amplicon sequencing (Table 2; full lists of called sequence variants are provided in Supplementary Data File 1). Although the mutation rate was shown to be elevated over 10-fold in K-12 MG1655/pMA7 following a typical single cycle of MAGE and plating on highly selective conditions compared to K-12 MG1655/pMA1, which harbors fully active mismatch repair (Figure 5), the duration of this period of elevated mutation rate following induction is not known largely due to the unknown turnover rate of the overexpressed Dam protein. Additional cycling would be expected to extend the duration of the elevated mutation rate as a result of Dam overexpression, however, that increase would only be expected to be linear with a high Dam turnover rate. After 6 cycles of MAGE and plating on MacConkey agar, xyl− isolates generated using both the pMA7 and pMA1 system in the mutS background strain contained the target xylA Y13* mutation. The isolate generated by the pMA1 system contained two off-target mutations immediately adjacent to the target site (a 4-bp deletion beginning at the same genome coordinate as the Y13* mutation and a G→T mutation 5 bases away from that coordinate) that would also be expected to result in a loss-of-function of xylA. Aside from the off-target mutation in xylA using the pMA1 system, there was a single off-target mutation using pMA7, none using pMA1 and 16 using pMA1 in the mutS− strain (excluding mutations that were common across all sequenced mutS− strains that can be assumed to be present in the original background strain).
Table 2.

On-target and off-target mutations in re-sequenced isolates (20 total) following 6 cycles of MAGE to introduce the xylA Y13* mutation, in isolates following plating on M9 agar after MAGE cycling with a pool of 7 oligonucleotides designed to introduce mutations observed in evolved NaCl-tolerant strains, and in isolates cycled with the same oligonucleotide pool after 5 days of serially passaging cultures in M9 + NaCl and plating on M9 agar + NaCl

6th cycle xylA*M9M9 + NaCl
on-targetoff-targeton-targetoff-targetaon-targetoff-targeta
pMA7 rep. 1111131
pMA7 rep. 2--3444 + 1b
pMA7 rep. 3--0254
pMA1 rep. 102b10--
pMA1 rep. 2--1021
pMA1 rep. 3--0021
mutS/pMA1 rep. 1116414325c
mutS/pMA1 rep. 2--0739
mutS/pMA1 rep. 3--210315

aNot including pre-existing mutations occurring across all mutS- strains.

bOff-target mutations were within the oligonucleotide sequence.

cNot including a number of called missing coverage deletions (total of 43).

aNot including pre-existing mutations occurring across all mutS- strains. bOff-target mutations were within the oligonucleotide sequence. cNot including a number of called missing coverage deletions (total of 43). Following selection on M9 plates or after liquid passaging in M9 + NaCl (Table 2), K-12 MG1655/pMA7 isolates harbored between 1 and 5 off-target mutations (in isolate pMA7-NaCl-15 with 5 off-target mutations, one was within the sequence of oligonucleotide 7). By contrast, K-12 MG1655/pMA1 isolates harbored between 0 to 1 off-target mutations and strains using pMA1 in the mutS background harbored 7–25 mutations (7–14 from selection in M9, and 9–25 from selection in M9 + NaCl). Thus, while the mutation rate using the pMA7 system is generally elevated compared to pMA1 in a mutS+ background, there are fewer off-target mutations than in a mutS background, and off-target mutations were not particularly more abundant following selection in the condition with a higher selective pressure (liquid passages followed by plating on M9 agar + NaCl) versus the condition with the lower selective pressure (plating on M9 agar). This is suggestive that most off-target mutations arose during MAGE cycling when Dam was periodically induced, rather than during subsequent selections. Further investigating on-target mutations in the NaCl-selected mutants, strains isolated from the pMA7 population harbored between 3 and 5 on-target mutations, while those from the pMA1 population harbored 2 and those from the mutS/pMA1 population harbored 3 on-target mutations (Table 3). The strains isolated from the pMA1 population both contained the mrdA Q51L and nagA A203E mutations, which along with proV K12*, were the only mutations with elevated ARFs using this system, likely due to full or partial evasion of the mismatch repair system. These mutants exhibited only slightly improved growth in M9 + 0.6 M NaCl compared to the wild-type strain (Figure 7). By contrast, two out of three strains isolated from the pMA7 population contained the mrdA Q51L, nagA A203E, rpsA Q421K and msrB C118F mutations. The presence of these mutations is consistent with enrichments observed during passaging in M9 + NaCl by amplicon sequencing (Figure 6). Strains pMA7_NaCl_12 and pMA7_NaCl_15 exhibited a much greater improvement in growth in M9 + 0.6 M NaCl compared to the pMA1 isolates (Figure 7), with growth rates of 0.35 ± 0.03 and 0.30 ± 0.02 h−1, respectively, compared to the K-12 MG1655 growth rate of 0.20 ± 0.01 h−1. Lag times were also reduced in these isolates to 5.2 ± 0.5 and 3.6 ± 0.5 h, respectively, compared to 8.9 ± 1.0 h for K-12 MG1655. These strains in particular share the combination of all four of the above mentioned mutations. Strains isolated from the mutS− population harbored 3 mutations (all three contained mrdA Q51L and rpsA Q421K, with different third on-target mutations in each isolate) and also significantly improved growth phenotypes compared to K-12 MG1655 mutS::cat (0.22 to 0.23 h−1 compared with 0.15 h−1). However, the presence of a large number of off-target mutations rendered it difficult to assign causation of the phenotype to the sets of on-target mutations that were identified.
Table 3.

On-target mutations identified in isolates following plating on M9 agar after MAGE cycling, and in isolates after 5 days of serially passaging cultures in M9 + NaCl and plating on M9 agar + NaCl

MAGE systemIsolate nameM9 on-target mutationsIsolate nameM9 + NaCl on-target mutations
K-12 MG1655/ pMA7pMA7_M9_1 pMA7_M9_2 pMA7_M9_3mrdA Q51L nagA A203E, ydjK P17S, proV K12* nonepMA7_NaCl_11 pMA7_NaCl_12 pMA7_NaCl_15nagA A203E, rpsA Q421K, msrB C118F mrdA Q51L, nagA A203E, rpsA Q421K, msrB C118F mrdA Q51L, nagA A203E, rpsA Q421K, msrB C118F, ydjK P17S
K-12 MG1655/ pMA1pMA1_M9_1 pMA1_M9_2 pMA1_M9_3mrdA Q51L proV K12* nonepMA1_NaCl_11 pMA1_NaCl_15 pMA1_NaCl_19- mrdA Q51L, nagA A203E mrdA Q51L, nagA A203E
K-12 MG1655 mutS::cat/ pMA1mutS_pMA1_M9_1 mutS_pMA1_M9_2 mutS_pMA1_M9_3nagA A203E, ydjK P17S, msrB C118F, proV K12* none msrB C118F, proV K12*mutS_pMA1_NaCl_15 mutS_pMA1_NaCl_17 mutS_pMA1_NaCl_20mrdA Q51L, rpsA Q421K, ydjK P17S mrdA Q51L, nagA A203E, rpsA Q421K mrdA Q51L, rpsA Q421K, msrB C118F

DISCUSSION

The ability to use short (<100-bp), chemically synthesized, single-stranded oligonucleotides to precisely modify and edit genomes has opened up possibilities in a wide array of applications. To date, these have included the generation of combinatorial libraries targeting ribosome binding sites (RBS) of genes involved in biochemical production pathways (5), redesign of the entire genome via genome-wide stop codon replacement and replacement of rare codons (9), genome-wide promoter replacement (6), and combinatorial library generation followed by selection to identify causative combinations of mutations found in adaptively evolved strains (10). In all published applications besides the last example, which had utilized the pMA7 system in advance of this publication, strains deficient in mismatch repair (mutS−) have been employed and the impact of off-target mutations on the phenotype of the final selected or screened strains has largely not been reported. An average of 113 off-target mutations were identified in strains with UAG stop codons converted to UAA (26), and 355 off-target mutations were reported in a strain with all UAG stop codons converted to UAA (8). These off-target mutations are likely to affect strain phenotype, particularly when strains are subjected to selective pressures to identify isolates with improved phenotypes. As an example of this, we have generated MAGE libraries containing random RBS insertions upstream of 70 transcription factors in a mutS− host and have conducted short-term selections (3 passages) in stress conditions to identify stress-tolerant mutants. Re-sequenced isolates from these selections exhibited greatly improved tolerance, however, only 4 out of 11 strains contained targeted RBS modifications, and of those that did, it was not possible to reconstitute tolerance by re-introducing only the identified RBS modifications into the background strains (Supplementary Figure S5). In this study, a new plasmid-based system for performing MAGE, TM-MAGE, is reported that transiently inhibits methyl-directed mismatch repair through inducible overexpression of the Dam methylase while simultaneously expressing the β subunit of λ Red recombinase. Dam overexpression has long been known to result in an increased mutation rate due to increased methylation of both the template strand and newly synthesized strand during replication, due to hemimethylation being required for recognition of the newly synthesized strand by the MMR system (28). The system was shown to have a nearly equivalent frequency of allelic replacement as mutS strains (either EcNR2 or K-12 MG1655 mutS::cat). Furthermore, while inducing a mutator phenotype, a much lower occurrence of off-target mutations was observed compared with permanent inactivation of MMR, with an over 5-fold reduction in the average number of off-target mutations in re-sequenced isolates following 6 cycles of MAGE and 5 days of selections under osmotic stress conditions compared to mutS− mutants. This is likely a result of transcription of plasmid-based dam only being induced during MAGE cycling, with natural protein turnover restoring ordinary DNA hemimethylation in subsequent generations. Nyerges et al. have also addressed the issue of off-target mutations in MMR-deficient hosts during MAGE through the introduction of temperature-sensitive MutS and MutL protein alleles into a host strain, resulting in an over 6-fold reduction in off-target mutations following 20 cycles of MAGE (15). However, utilization of this system necessitates that the MAGE background strain first possess the two temperature-sensitive alleles in addition to a system for expressing λ Red recombinase (in this case a prophage-based recombinase was employed). Accordingly, mutants constructed using this system must be cultivated at temperatures less than 32°C to ensure active mismatch repair unless the non-temperature-sensitive alleles are restored. TM-MAGE addresses these limitations, as necessary functions are self-contained on a plasmid that can easily be transformed into any host strain, and can either be left in the host strain without cultivation temperature restrictions, or easily removed in a single day by sucrose counterselection when pMA7SacB is employed. These features enable robust generation of combinatorial libraries and precisely edited genomes with the ability to much more definitively assign phenotype to genotype due to the significantly reduced rate of accumulation of off-target mutations. In the previous publication in which TM-MAGE (pMA7) was utilized (10), a detailed analysis of system performance with respect to ARFs and off-target mutation rates had not been performed. A total of nine cycles of MAGE were performed over three days (in comparison to the 6 cycles in 2 days performed in this study, thus lengthening the total time in which Dam was overexpressed), followed by serial passaging under the selection condition (42°C) during continuous exponential growth for 3 days. In 43 resequenced isolates (10), a median of 4 off-target mutations and 3 on-target mutations were observed. After removing off-target mutations that occurred within the oligonucleotide sequences, mutations that occurred spontaneously in the pyrE/rph intergenic region or rph gene, and larger detected areas of missing coverage in a few strains, the median number of off-target mutations dropped to two. These mutations were not considered due to possible oligonucleotide synthesis errors, mutations in pyrE/rph correcting a known defect in K-12 MG1655 growth in minimal medium (27) and appearing to be especially selected for at 42°C, and the likely presence of sequencing artifacts in the strains with many called missing coverage deletions (and if present, would unlikely be a result of disabled MMR), respectively. These results are comparable with the off-target mutation rates observed in this study and offer further support of the greatly reduced background mutation rate achievable by TM-MAGE relative to strains with permanently disabled MMR. It should be noted that a potential limitation of TM-MAGE is the continued accumulation of off-target mutations during MAGE cycling. Similar numbers of off-target mutations were observed in re-sequenced isolates that had undergone 6 (this study) or 9 cycles of TM-MAGE (10). By constrast, a single cycle of MAGE was shown to generate an approximately one order-of-magnitude increase in the frequency of mutations providing resistance toward nalidixic acid or rifampicin (Figure 5B). This is suggestive of a non-linear accumulation of off-target mutations as a function of MAGE cycle number, and is likely a result of elevated Dam protein levels persisting during and after cycling in at least a portion of the passaged population in subsequent cycles. Despite that significant increases in off-target mutations were not observed after 9 cycles, it is recommended that TM-MAGE cycles be limited in number in applications where off-target mutations would be particularly problematic. Experiments should be appropriately scaled with regards to both the pooled oligonucleotide diversity and the ability to screen or select desired phenotypes from the resulting populations. In the case of performing selections following any MAGE protocol, consideration must be given to the ability to enrich a particular mutant in a timeframe shorter than that required for cells to undergo adaptive laboratory evolution. Despite these potential limitations, the off-target mutation rate using TM-MAGE is significantly lower than when performing MAGE in strains with permanently disabled MMR and is thus better suited for experiments requiring larger numbers of cycles. We demonstrated the power of using TM-MAGE to reconstruct causative mutations for tolerance to growth in high NaCl concentrations. These mutations were previously identified in the two best-performing isolates from adaptive laboratory evolutions of F− and F+ Hfr strains on high concentrations of NaCl (18). In that study, causative mutations were only partially explored through reconstruction of combinations of loss-of-function mutations, and through overexpression of the different mutated gene targets which only identified ydjK as improving osmotolerance. We here show that an improved combination of mutations with a relative fitness (growth rate of mutant / growth rate of wild-type − 1) in M9 + 0.6 M NaCl of 0.75 could be achieved using TM-MAGE, compared to reported fitnesses of 0.5 for the original evolved strains in M9 + 0.65 M NaCl and less than 0.5 for all overexpression and knockout mutants in M9 + 0.55 M NaCl (18). These TM-MAGE constructed strains harbored a combination of four mutations originating from both of the original evolved strains: mrdA Q51L, nagA A203E, rpsA Q421K and msrB C118F. Due to their extremely low initial frequency in the population, it was not possible to adequately select for mutants with this degree of improvement in growth phenotype from populations generated using pMA1 (which harbored an intact MMR system). And due to the high rate of off-target mutations likely occurring both before, during, and after MAGE in a strain with permanently disabled mismatch repair, it was also not possible to isolate mutants with significantly improved growth rates over wild-type K-12 MG1655, nor to assign causation from patterns in the growth behaviors of individual isolates. However, performing TM-MAGE with the pMA7 system, we were able to rapidly isolate improved osmotolerant mutants with combinations of mutations originally found in distinct evolved isolates. TM-MAGE should find broad applicability as a replacement for other MAGE systems. New features that have previously hindered the use of MAGE include the ability to efficiently generate mutations and libraries of mutations in alternative host strains, and the rapid removal of the system for use of MAGE in metabolically engineered production hosts or any other application where a clean strain background is desired. Most importantly, we have demonstrated the ability to couple TM-MAGE robustly with subsequent selections, which will enable the full exploration of the generated mutational space for desired selectable phenotypes. This is particularly useful for reconstructing and identifying causal mutations in strains isolated by adaptive laboratory evolution. Use of TM-MAGE will also lead to the generation of more stable genotypes for subsequent screening applications, when compared to systems with permanently disabled MMR.

ACCESSION NUMBERS

Sequences for plasmids pMA1, pMA7, and pMA7SacB have been deposited as Genbank accession numbers KT285943, KT285941 and KT285942, respectively.
  28 in total

Review 1.  Escherichia coli mutator genes.

Authors:  J P Horst; T H Wu; M G Marinus
Journal:  Trends Microbiol       Date:  1999-01       Impact factor: 17.079

2.  Mutants with temperature-sensitive defects in the Escherichia coli mismatch repair system: sensitivity to mispairs generated in vivo.

Authors:  Esther S Hong; Annie Yeung; Pauline Funchain; Malgorzata M Slupska; Jeffrey H Miller
Journal:  J Bacteriol       Date:  2005-02       Impact factor: 3.490

3.  One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products.

Authors:  K A Datsenko; B L Wanner
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-06       Impact factor: 11.205

4.  Tight regulation, modulation, and high-level expression by vectors containing the arabinose PBAD promoter.

Authors:  L M Guzman; D Belin; M J Carson; J Beckwith
Journal:  J Bacteriol       Date:  1995-07       Impact factor: 3.490

5.  Escherichia coli mutator mutants deficient in methylation-instructed DNA mismatch correction.

Authors:  B W Glickman; M Radman
Journal:  Proc Natl Acad Sci U S A       Date:  1980-02       Impact factor: 11.205

6.  Rational design of a synthetic Entner-Doudoroff pathway for improved and controllable NADPH regeneration.

Authors:  Chiam Yu Ng; Iman Farasat; Costas D Maranas; Howard M Salis
Journal:  Metab Eng       Date:  2015-03-10       Impact factor: 9.783

7.  Identification of mutator genes and mutational pathways in Escherichia coli using a multicopy cloning approach.

Authors:  Hanjing Yang; Erika Wolff; Mandy Kim; Amy Diep; Jeffrey H Miller
Journal:  Mol Microbiol       Date:  2004-07       Impact factor: 3.501

8.  Effects of high levels of DNA adenine methylation on methyl-directed mismatch repair in Escherichia coli.

Authors:  P J Pukkila; J Peterson; G Herman; P Modrich; M Meselson
Journal:  Genetics       Date:  1983-08       Impact factor: 4.562

9.  The Escherichia coli K-12 "wild types" W3110 and MG1655 have an rph frameshift mutation that leads to pyrimidine starvation due to low pyrE expression levels.

Authors:  K F Jensen
Journal:  J Bacteriol       Date:  1993-06       Impact factor: 3.490

10.  USER fusion: a rapid and efficient method for simultaneous fusion and cloning of multiple PCR products.

Authors:  Fernando Geu-Flores; Hussam H Nour-Eldin; Morten T Nielsen; Barbara A Halkier
Journal:  Nucleic Acids Res       Date:  2007-03-27       Impact factor: 16.971

View more
  20 in total

Review 1.  Experimental Design, Population Dynamics, and Diversity in Microbial Experimental Evolution.

Authors:  Bram Van den Bergh; Toon Swings; Maarten Fauvart; Jan Michiels
Journal:  Microbiol Mol Biol Rev       Date:  2018-07-25       Impact factor: 11.056

2.  CRISPR-TSKO: A Technique for Efficient Mutagenesis in Specific Cell Types, Tissues, or Organs in Arabidopsis.

Authors:  Ward Decaestecker; Rafael Andrade Buono; Marie L Pfeiffer; Nick Vangheluwe; Joris Jourquin; Mansour Karimi; Gert Van Isterdael; Tom Beeckman; Moritz K Nowack; Thomas B Jacobs
Journal:  Plant Cell       Date:  2019-09-27       Impact factor: 11.277

3.  CRISPR/Cas9-Assisted Seamless Genome Editing in Lactobacillus plantarum and Its Application in N-Acetylglucosamine Production.

Authors:  Ding Zhou; Zhennan Jiang; Qingxiao Pang; Yuan Zhu; Qian Wang; Qingsheng Qi
Journal:  Appl Environ Microbiol       Date:  2019-10-16       Impact factor: 4.792

Review 4.  Rewriting the Genetic Code.

Authors:  Takahito Mukai; Marc J Lajoie; Markus Englert; Dieter Söll
Journal:  Annu Rev Microbiol       Date:  2017-07-11       Impact factor: 15.500

5.  Recombineering and MAGE.

Authors:  Timothy M Wannier; Peter N Ciaccia; Andrew D Ellington; Gabriel T Filsinger; Farren J Isaacs; Kamyab Javanmardi; Michaela A Jones; Aditya M Kunjapur; Akos Nyerges; Csaba Pal; Max G Schubert; George M Church
Journal:  Nat Rev Methods Primers       Date:  2021-01-14

6.  A highly precise and portable genome engineering method allows comparison of mutational effects across bacterial species.

Authors:  Ákos Nyerges; Bálint Csörgő; István Nagy; Balázs Bálint; Péter Bihari; Viktória Lázár; Gábor Apjok; Kinga Umenhoffer; Balázs Bogos; György Pósfai; Csaba Pál
Journal:  Proc Natl Acad Sci U S A       Date:  2016-02-16       Impact factor: 11.205

7.  A standardized workflow for surveying recombinases expands bacterial genome-editing capabilities.

Authors:  Deirdre E Ricaurte; Esteban Martínez-García; Ákos Nyerges; Csaba Pál; Víctor de Lorenzo; Tomás Aparicio
Journal:  Microb Biotechnol       Date:  2017-11-02       Impact factor: 5.813

8.  Automated growth rate determination in high-throughput microbioreactor systems.

Authors:  Johannes Hemmerich; Wolfgang Wiechert; Marco Oldiges
Journal:  BMC Res Notes       Date:  2017-11-25

9.  CRMAGE: CRISPR Optimized MAGE Recombineering.

Authors:  Carlotta Ronda; Lasse Ebdrup Pedersen; Morten O A Sommer; Alex Toftgaard Nielsen
Journal:  Sci Rep       Date:  2016-01-22       Impact factor: 4.379

10.  CRISPR/Cas9-based genome editing for simultaneous interference with gene expression and protein stability.

Authors:  Virginia Martínez; Ida Lauritsen; Tonja Hobel; Songyuan Li; Alex Toftgaard Nielsen; Morten H H Nørholm
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.