| Literature DB >> 29409059 |
Christopher Cozens1, Vitor B Pinheiro1,2.
Abstract
Engineering proteins for designer functions and biotechnological applications almost invariably requires (or at least benefits from) multiple mutations to non-contiguous residues. Several methods for multiple site-directed mutagenesis exist, but there remains a need for fast and simple methods to efficiently introduce such mutations - particularly for generating large, high quality libraries for directed evolution. Here, we present Darwin Assembly, which can deliver high quality libraries of >108 transformants, targeting multiple (>10) distal sites with minimal wild-type contamination (<0.25% of total population) and which takes a single working day from purified plasmid to library transformation. We demonstrate its efficacy with whole gene codon reassignment of chloramphenicol acetyl transferase, mutating 19 codons in a single reaction in KOD DNA polymerase and generating high quality, multiple-site libraries in T7 RNA polymerase and Tgo DNA polymerase. Darwin Assembly uses commercially available enzymes, can be readily automated, and offers a cost-effective route to highly complex and customizable library generation.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29409059 PMCID: PMC5934624 DOI: 10.1093/nar/gky067
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Common mutagenesis methods for the introduction of multiple distal mutations
| Method | Principle | Mutated codons | Efficacya | Efficiencyb |
|---|---|---|---|---|
| epPCR ( | PCR introduces errors in region being amplified | Random | n.a. | n.a. |
| MMR ( | Multiple mutagenic primers are used in a single PCR | 9 | 60% | 100% |
| RECODE ( | Multiple mutagenic primers are used in a single PCR | 4 | 58% | 100% |
| Nicking mutagenesis (multi site mutation) ( | Enzymatic generation of ssDNA template (nicking endonuclease and exoIII) | 3 | 50% | Up to 100% |
| Multiple mutagenic primers are extended and ligated | ||||
| Remaining template removed enzymatically (nicking endonuclease and exoIII) | ||||
| QuikChange ( | Multiple mutagenic primers are extended and ligated directly on a dsDNA template | 3 | 84% | 55% |
| Template removed enzymatically (DpnI) | ||||
| PFunkel (multi site mutation) ( | Phagemid synthesis of dU-containing ssDNA template | 4 | 50–100% | 50–100% |
| Multiple mutagenic primers are extended and ligated | ||||
| Template removed enzymatically (UDG and exoIII) | ||||
| MegAnneal ( | Uses epPCR to create primer library | Random | n.a. | 23–80%c |
| Phagemid synthesis of dU-containing ssDNA template | ||||
| Mutagenized primers are extended and ligated | ||||
| OmniChange ( | Pairs of mutagenic primers are used in short PCRs | 5 | 100% | 100% |
| Compatible overhangs chemically generated (iodine/phosphorothiate) and PCR-amplified fragments ligated | ||||
| POEP ( | Pairs of mutagenic primers are used in short PCRs and PAGE purified | 8 | 100 (digest) | 100% |
| PCR products are used in an overlap extension PCR | ||||
| MF OE-PCR ( | Pairs of mutagenic primers are used in short PCRs and purified | 13 | NR | NR |
| PCR products are used in an overlap extension PCR | ||||
| Golden Gate ( | epPCR or pairs of mutagenic primers are used in short PCRs and purified | Random | n.a. | n.a. |
| Type IIs restriction of fragments and ligation | ||||
| TAMS ( | Linear PCR to generate ssDNA template | 4 | NR | 100% |
| Multiple mutagenic primers are extended and ligated | ||||
| Outnest used for PCR amplification of assembled fragment | ||||
| OD SPM ( | Multiple mutagenic primers are extended and ligated directly on a denatured dsDNA template | 11 | NR | Up to 100% |
| Outnest used for PCR amplification of assembled fragment | ||||
| Darwin Assembly (this study) | Enzymatic generation of ssDNA template (nicking endonuclease and exoIII) | 19 | 98–100% | 100% |
| Multiple mutagenic primers are extended and ligated | ||||
| Outnest used for PCR amplification of assembled fragment |
Error-prone PCR (41) is simple but mutations introduced cannot be targeted to particular codons and its effectiveness is dependent on amplification biases and random incorporation of errors (42). In these circumstances efficacy and efficiency are not applicable concepts (n.a.). Some methods are not effective at mutating the whole population (43,44) and/or incorporating all mutations (45,46). Methods that can introduce mutations effectively and efficiently tend to require complex and time-consuming steps, such as phagemid propagation to generate dU-containing DNA (47,48), chemical degradation steps (49), multiple PCR reactions followed by overlap extension PCR reactions to assemble the mutated gene (44,50,51) or modularization of the library into PCR-tractable libraries that can be later re-assembled via Golden Gate assembly (6). Other methods, such as TAMS (52) and OD SPM (53), are simple and effective. Both introduce diversity by using primers containing the target mutations for primer extension and ligation against single-stranded templates followed by PCR and cloning (52,53). However, neither of these methods remove the second strand of the original plasmid and both use T4 DNA ligase, which can ligate across gaps, mispairs and cannot tolerate high temperatures favourable for specific annealing (28)—with resulting compromises in efficiency of library assembly.
a Efficacy is defined as the fraction of the population containing mutations.
b Efficiency is defined as the fraction of mutated clones where all targeted sites are mutated. Some methods do not report (NR) their efficacy or efficiency.
Figure 1.Principles of Darwin Assembly. Plasmid DNA (black, with the gene of interest in orange) is nicked by a nicking endonuclease (at the purple dot) and the cut strand degraded by exonuclease III (1). Boundary and inner (mutagenic) oligonucleotides are annealed to the ssDNA plasmid (2). Key features of the oligonucleotides are highlighted: 5′-boundary oligonucleotide is 5′-biotinylated; non-complementary overhangs are shown in blue with Type IIs endonuclease recognition sites shown in white; mutations are shown as red X in the inner oligonucleotides; 3′-boundary oligonucleotide is protected at its 3′-end. After annealing, primers are extended and ligated in an isothermal assembly reaction (3). The assembled strand can be isolated by paramagnetic streptavidin-coated beads (4) and purified by alkali washing prior to PCR using outnested priming sites (5) and cloning (6) using the type IIS restriction sites (white dots). The purification step (4) is not always necessary but we found it improved PCR performance, especially for long assembly reactions (>1 kb).
Figure 2.Darwin Assembly using a θ oligonucleotide. Here, a single θ oligonucleotide is used in place of the two boundary oligonucleotides allowing enzymatic cleanup after the assembly reaction. Plasmid DNA (black, with the gene of interest in orange) is nicked by a nicking endonuclease (at the purple dot) and the nicked strand degraded by exonuclease III (1). Inner oligonucleotides and a single θ oligonucleotide are annealed to the ssDNA plasmid (2). The θ oligonucleotide encodes both assembly priming and termination sequences linked by a flexible linker such that successful assembly of the mutated strand results in a closed circle (3). The template plasmid can now be linearized (e.g. at the yellow dot, by adding a targeting oligonucleotide and appropriate restriction endonuclease) and both exonuclease I and exonuclease III added to degrade any non-circular DNA (4). The mutated gene can now be amplified from the closed circle by PCR (5) and cloned into a fresh vector (6) using the type IIS restriction sites (white dots).
Figure 3.Whole gene codon reassignment. Codons targeted for reassignment and oligonucleotides used in Darwin assembly (A). Amino acid numbers are original codon sequences are shown. Outnest PCR showing amplification of the mutated CAT gene following the Darwin Assembly reaction across a range of assembly times (0, 10 and 60 min) and assembly oligonucleotide concentrations (B). Concentrations shown represent the molar excess of boundary (e.g. 25) and inner oligonucleotides (e.g. 125) over plasmid concentration used. Expected assembly amplicon is 568 bp. MW: 100 bp ladder (NEB).
Deep sequencing of CAT gene alanine scanning library
| Residue | Wild-type codon | Frequency | Mutated codon | Frequency | Other codons |
|---|---|---|---|---|---|
| Glu 100 | GAA | 79 205 (99.87%) | n.a. | n.a. | 103 (0.13%) |
| Thr 101 | ACG | 68 947 (86.94%) | GCT | 9925 (12.51%) | 436 (0.51%) |
| Phe 102 | TTT | 59 947 (75.59%) | GCA | 18 663 (12.51%) | 648 (0.82%) |
| Ser 103 | TCA | 58 395 (73.63%) | GCA | 20 592 (25.96%) | 309 (0.39%) |
| Ser 104 | GCA | 66 541 (83.90%) | GCA | 12 433 (15.68%) | 306 (0.39%) |
| Leu 105 | CTC | 135 (0.17%) | CTG | 79 027 (99.65%) | 146 (0.18%) |
| Trp 106 | TGG | 79 047 (99.67%) | n.a. | n.a. | 260 (0.33%) |
Near complete mutation of Leu105 confirms the efficiency of the approach with alanine represented in all target sites with frequencies between 12.51% and 25.96% (expected frequency: 20%). Sites not targeted by mutagenesis (Glu100 and Trp106)—and where mutations are not applicable (n.a.)— show the underlying error rate of sequencing (0.13–0.33%) is comparable with the error obtained in the engineered positions (0.18–0.82%).
Figure 4.Darwin assembled T7 RNA polymerase library. Sequencing of three separate regions of the assembled gene were used to determine the assembled diversity in the targeted codons (A). Number of reads obtained for each segment are shown as well as the range of each reaction (shown in blue). Frequency of incorporation of the 32 possible codons (NNS) is shown as a box plot, with overrepresented outliers explicitly labelled. Wild-type contamination was determined where possible (N.D.: not determined). Incorporation trends were detected in all positions and best correlated to the number of mismatches they introduced. A typical correlation is shown (B) but other positions varied (0.03–0.37), always indicating that the higher the number of mutations being concomitantly introduced (i.e. larger Hamming distance), the lower incorporation of the variant in the population. Incorporation trends were also analysed to determine systematic biases of incorporation. Incorporation frequencies were ranked at each position and rank order analysed (C)—the top three highest ranked (greens) and lowest ranked (oranges) codons are highlighted for clarity. Incorporation biases were detected but it was not clear whether that was the result of the very imbalanced sequence composition of the wild-type residues targeted. Outnest PCR of the T7 RNA polymerase library (expected product of 2840 bp) confirms that as little as 10 minutes of isothermal assembly were sufficient for successful amplification of a library (D). MW: 1 kb ladder (NEB). NT: no template PCR control.
Figure 5.Darwin assembled TgoT DNA polymerase library. (A) Five separate sequencing reactions (range and reads shown in blue) were required to sample the diversity introduced across the eight target residues (shown in red along the TgoT gene). Mutations included focused degeneracies (e.g. YWC used against Y384) or ‘small intelligent’ (S-int) diversity (NDT, VMA, ATG and TGG oligonucleotides mixed in a 12:6:1:1 ratio). Resulting incorporation is shown in box plots with outliers explicitly labelled. Wild-type contamination was determined from positions where diversity excluded those sequences (N.A.: not applicable). As with the T7 RNA polymerase library, incorporation trends and biases were analysed to identify any biases in assembly. Ranked incorporation frequencies are shown for the residues targeted with ‘small intelligent’ diversity, and the top three highest (greens) and lowest (oranges) ranked codons (based on a straight sum of ranks) are highlighted (B). Outnest PCR of the TgoT DNA polymerase library (expected product of 2501 bp) showing that the final PCR can be carried out with either A- (MyTaq) or B-family (Q5) polymerases (C). MW: 1 kb ladder (NEB). NT: no template PCR control.