| Literature DB >> 29785500 |
Aitao Li1,2,3, Carlos G Acevedo-Rocha4, Manfred T Reetz5,6.
Abstract
Site-saturation mutagenesis (SSM) has been used in directed evolution of proteins for a long time. As a special form of saturation mutagenesis, it involves individual randomization at a given residue with formation of all 19 amino acids. To date, the most efficient embodiment of SSM is a one-step PCR-based approach using NNK codon degeneracy. However, in the case of difficult-to-randomize genes, SSM may not deliver all of the expected 19 mutants, which compels the user to invest further efforts by applying site-directed mutagenesis for the construction of the missing mutants. To solve this problem, we developed a two-step PCR-based technique in which a mutagenic primer and a non-mutagenic (silent) primer are used to generate a short DNA fragment, which is recovered and then employed as a megaprimer to amplify the whole plasmid. The present two-step and older one-step (partially overlapped primer approach) procedures were compared by utilizing cytochrome P450-BM3, which is a "difficult-to-randomize" gene. The results document the distinct superiority of the new method by checking the library quality on DNA level based on massive sequence data, but also at amino acid level. Various future applications in biotechnology can be expected, including the utilization when constructing mutability landscapes, which provide semi-rational information for identifying hot spots for protein engineering and directed evolution.Entities:
Keywords: Directed evolution; Library quality; Megaprimer; Mutability landscapes; P450 monooxygenase; Site-saturation mutagenesis
Mesh:
Substances:
Year: 2018 PMID: 29785500 PMCID: PMC6013526 DOI: 10.1007/s00253-018-9041-2
Source DB: PubMed Journal: Appl Microbiol Biotechnol ISSN: 0175-7598 Impact factor: 4.813
Fig. 1a Standard SSM library construction using the one-step PCR approach with partially overlapping primers. A pair of partially overlapping mutagenic primers is used to perform the PCR reaction for incorporating mutations using plasmid pRSFDuet-1 harboring P450-BM3 gene as template; PCR reaction is performed to amplify the whole plasmid, and digestion with Dpn I is conducted to eliminate the parent template, followed by transformation into E. coli BL21 (DE3) and library harvesting. b New SSM library construction using the two-step PCR approach with a mutagenic primer and a non-mutagenic (silent) primer. A pair of primers consisting of one forward mutagenic primer and one reverse primer (or one forward primer and one reverse mutagenic prime, depending on the mutational position) is used to perform the PCR reaction for the purpose of incorporating designed mutations using pRSFDuet-1 harboring P450-BM3 gene as template. The amplified short DNA fragments (500 bp in length) containing mutations are recovered and subjected to the second PCR reaction as megaprimers. To amplify the whole plasmid, digestion with Dpn I is conducted to eliminate the parent template, followed by transformation into E. coli BL21 (DE3) and library harvesting
Comparison of one-step and two-step PCR approaches for NNK-based SSM
| Libraries | Number of sequences obtaineda | Number of amino acids presentb | Number of codons presentc | Percent wild typed | Qpool value from pooled plasmidse | |
|---|---|---|---|---|---|---|
| One-step PCR method | Y51X | 91 | 18 | 25 | 48 | 0.30 |
| S72X | 92 | 15 | 24 | 34 | 0.40 | |
| L75X | 91 | 18 | 28 | 35 | 0.44 | |
| L437X | 93 | 14 | 22 | 43 | 0.59 | |
| T438X | 91 | 19 | 25 | 47 | 0.31 | |
| Two-step PCR method | Y51X | 91 | 19 | 30 | 4 | 0.43 |
| S72X | 92 | 19 | 28 | 5 | 0.47 | |
| L75X | 92 | 19 | 30 | 2 | 0.74 | |
| L437X | 93 | 20 | 30 | 5 | 0.82 | |
| T438X | 91 | 18 | 28 | 8 | 0.67 |
aNumber of colonies with complete sequence, which was obtained after sequencing 96 colonies for each library
bNumber of different amino acids found within the sequence data for a given library (NNK primers theoretically encompass all 20 amino acids with different distributions)
cNumber of different codes found within the sequence data for a given library (NNK primers theoretically encompass all 32)
dFraction of the sequenced clone that contains the starting codon at the targeted position (assumed to result from wild-type carryover)
eCalculated for the entire codon from sequence data obtained from the pooled plasmids isolated after the initial transformation according to the method reported (Sullivan et al. 2013), the weighted average across the three bases of a codon yields a Qpool value between 0 and 1, with 1 indicating perfect randomization
Fig. 2Genetic diversity (a) and residue diversity (b) for each NNK-based SSM library for both one-step PCR (blue bar) and two-step (red bar) PCR methods based on different oversampling factors. Genetic diversity is defined by the ratio of the number of codons obtained after sequencing to the number of theoretically possible codons expected (32) expressed in percentage; residue diversity is defined by the ratio of the number of residues obtained after sequencing to the number of theoretically possible residues expected (20) expressed in percentage
Fig. 3Residue distribution (expressed in percentage) at each NNK-based SSM library for both one-step PCR (a) and two-step PCR (b) methods based on oversampling factor of 3. The theoretical distribution (expected distribution value) for each residue in a given NNK-based SSM library is shown with black bar. In some cases which exceed the scope are shown with Arabic numerals
Fig. 4Codon distributions across all NNK-based SSM libraries for both one-step PCR (a) and two-step PCR (b) methods. The aggregate number of occurrences of each codon in all five of the SSM libraries is listed with a maximum of 60 in the center of the plot. This scale was chosen based on the number of occurrences of the most commonly observed codon (ACG) in the one-step PCR approach library. The dashed green line represents the expected aggregate occurrence of each codon based on perfectly random replacements at each position. Codons present from wild-type carryover have been eliminated from the data for libraries of Y51X and L437X, since the wild-type codes in both libraries cannot be generated with NNK/MNN
Fig. 5Residue distributions across all NNK-based SSM libraries for both one-step PCR technique (a) and two-step PCR approach (b). The aggregate number of occurrences of each amino acid in all five of the saturation libraries is listed with a maximum of 70 in the center of the plot. This scale was chosen based on the number of occurrences of the most commonly observed amino acid (Leu) in the libraries constructed by the one-step PCR approach. The expected aggregate occurrence of each amino acid based on perfectly random replacements at each position is also shown (green dashed lines)