Christian Freund1, Dirk Schwarzer2. 1. Freie Universität Berlin, Institute of Chemistry and Biochemistry, Thielallee 63, 14195, Berlin, Germany. 2. University of Tübingen, Interfaculty Institute of Biochemistry (IFIB), Auf der Morgenstelle 34, 72076, Tübingen, Germany.
Abstract
The transpeptidase sortase A of Staphylococcus aureus (Sa-SrtA) is a valuable tool in protein chemistry. The native enzyme anchors surface proteins containing a highly conserved LPxTG sorting motif to a terminal glycine residue of the peptidoglycan layer in Gram-positive bacteria. This reaction is exploited for sortase-mediated ligation (SML), allowing the site-specific linkage of synthetic peptides and recombinant proteins by a native peptide bond. However, the moderate catalytic efficiency and specificity of Sa-SrtA fueled the development of new biocatalysts for SML, including the screening of sortase A variants form microorganisms other than S. aureus and the directed protein evolution of the Sa-SrtA enzyme itself. Novel display platforms and screening formats were developed to isolate sortases with altered properties from mutant libraries. This yielded sortases with strongly enhanced catalytic activity and enzymes recognizing new sorting motifs as substrates. This minireview focuses on recent advances in the field of directed sortase evolution and applications of these tailor-made enzymes in biochemistry.
The transpeptidase sortase A of Staphylococcus aureus (Sa-SrtA) is a valuable tool in protein chemistry. The native enzyme anchors surface proteins containing a highly conserved LPxTG sorting motif to a terminal glycine residue of the peptidoglycan layer in Gram-positive bacteria. This reaction is exploited for sortase-mediated ligation (SML), allowing the site-specific linkage of synthetic peptides and recombinant proteins by a native peptide bond. However, the moderate catalytic efficiency and specificity of Sa-SrtA fueled the development of new biocatalysts for SML, including the screening of sortase A variants form microorganisms other than S. aureus and the directed protein evolution of the Sa-SrtA enzyme itself. Novel display platforms and screening formats were developed to isolate sortases with altered properties from mutant libraries. This yielded sortases with strongly enhanced catalytic activity and enzymes recognizing new sorting motifs as substrates. This minireview focuses on recent advances in the field of directed sortase evolution and applications of these tailor-made enzymes in biochemistry.
Bacterial transpeptidases have gained considerable attention as promising tools in modern biochemistry and biotechnology. Sortase A of Staphylococcus aureus is the most widely used class of transpeptidase employed for bioconjugation and seminsynthesis of proteins.
In bacteria, sortase A is inserted into the bacterial cell wall and cross‐bridges surface proteins to the peptidoglycan, facilitating bacterial evasion of the host's immune system.
The enzyme recognizes a sorting motif LPxTG, (with x=any amino acid) in the C‐terminal region of the surface protein, and cleaves this sequence at the threonine residues.
Upon cleavage, the peptide chain downstream of the threonine residue is released as leaving group, while the upstream fragment remains tethered to the active site cysteine as threonine‐thioester (Figure 1a).
In the following, the enzyme is deacylated by ligating the sortase‐bound thioester to the N‐terminal amino group of the pentaglycine moiety located at the peptidoglycan cross‐bridge. Two features render sortases attractive tools for protein chemistry: The membrane anchor can be removed from the enzyme without loss of transpeptidase activity, and resulting N‐terminal truncated enzymes are soluble.
Furthermore, the LPxTG motif and N‐terminal glycine moieties are sufficient as sortase A substrates, with two glycines minimally required for full activity of the wild‐type enzyme. Upon incorporation of the sorting motif and N‐terminal glycine into two separate peptides or proteins, sortase A is capable of ligating the two fragments.
This methodology, also referred to as sortase‐mediated ligation (SML) or “sortagging,” is widely applied in modern biochemistry (Figure 1a).
Figure 1
Sortases and sortase‐mediated ligation (SML). a) Sortase A of S. aureus (Sa‐SrtA) recognizes LPxTG sorting motifs as substrates and cleaves this sequence at the threonine residue with formation of a sortase‐bound thioester. The downstream peptide with N‐terminal glycine residue is released from the enzyme. The thioester is resolved by ligation to a peptide nucleophile with at least two glycine residues at the N terminus. b) Native sortases install surface proteins to the bacterial peptidoglycan cross‐bridge. In the case of S. aureus, the cross‐bridge contains a pentaglycine moiety as the nucleophile, while the peptidoglycan of S. pyogenes provides an N‐terminal alanine residue for the ligation reaction. The composition of the bacterial peptidoglycan impacts sortase substrate specificity. c) Sortase A of Corynebacterium diphtheriae catalyzes isopeptide formation at an internal lysine of SpaA with a C‐terminal sorting motif from a neighboring SpaA during pilin polylmerization. SP: surface protein.
Sortases and sortase‐mediated ligation (SML). a) Sortase A of S. aureus (Sa‐SrtA) recognizes LPxTG sorting motifs as substrates and cleaves this sequence at the threonine residue with formation of a sortase‐bound thioester. The downstream peptide with N‐terminal glycine residue is released from the enzyme. The thioester is resolved by ligation to a peptide nucleophile with at least two glycine residues at the N terminus. b) Native sortases install surface proteins to the bacterial peptidoglycan cross‐bridge. In the case of S. aureus, the cross‐bridge contains a pentaglycine moiety as the nucleophile, while the peptidoglycan of S. pyogenes provides an N‐terminal alanine residue for the ligation reaction. The composition of the bacterial peptidoglycan impacts sortase substrate specificity. c) Sortase A of Corynebacterium diphtheriae catalyzes isopeptide formation at an internal lysine of SpaA with a C‐terminal sorting motif from a neighboring SpaA during pilin polylmerization. SP: surface protein.Up to date sortase A of S. aureus lacking the N‐terminal membrane anchor (Sa‐SrtA) is the most widely used enzyme for SML. However, SML catalyzed by Sa‐SrtA also suffers from shortcomings hampering broader applications. The reversibility of SML takes a toll on ligation yields, but engineering of sortase substrates by installation of depsipeptide bonds into the sorting motif, inducing secondary structural elements or metal‐assisted SML provide efficient strategies for addressing this problem.
The main downsides of the catalyst – wild‐type Sa‐SrtA – are the high KM value for the sorting motif, poor catalytic activity, a narrow specificity range for the LPxTG sorting motif, and the requirement of calcium as a cofactor.
Researchers have recently started addressing these issues by investigating previously uncharacterized sortases from other microorganisms and by directed evolution of Sa‐SrtA. While these developments are still ongoing, successful engineering of Sa‐SrtA has been reported over the last ten years, representing the central focus of this minireview.
Natural Sortase Variants
Sa‐SrtA served as the model enzyme for uncovering the function and activity of sortases in bacteria and belongs to the class A sortases. In addition to the class A enzymes, sortases of classes B‐F have been identified, which differ in function and specificity from Sa‐SrtA.
For example, sortase B of S. aureus (Sa‐SrtB) recognizes an NPxTG motif and is involved in bacterial iron uptake.
However, several sortases of classes B‐F show limited activity in vitro and are not commonly used for SML. Consequently, research on new catalysts of SML has focused mainly on class A sortases based on the assumptions that enzymes from other bacteria might reveal altered substrate specificity. The pentaglycine unit at the peptidoglycan cross‐bridge deacylates Sa‐SrtA bound thioesters in S. aureus, but this peptide nucleophile varies with the compositions of peptidoglycan.[
,
,
] For example, the peptidoglycan cross‐bridge of Streptococcus pyogenes harbors a terminal alanine, serine, or glycine residue.
The corresponding S. pyogenes sortase A (Sp‐SrtA) was shown to ligate the enzyme‐bound thioesters to N‐Ala nucleophiles (Figure 1b). This distinct specificity of Sp‐SrtA could be exploited for a dual ligation scheme when combined with Sa‐SrtA.Recent investigations aimed to explore the potential of previously uncharacterized class A sortases from various bacteria on a larger scale.
Focusing on the enzymes of Staphylococcus and Streptococcus origin, several sortase A genes were cloned, expressed, and the enzymes characterized with libraries of putative substrate peptides. These investigations uncovered that several Streptococcus sortases display a much more promiscuous substrate specificity than the Sa‐SrtA. The experiments confirmed that Streptococcus sortases accept a broader range of substrates including LPxTG, LPxTA, and LPxTS motifs and N‐Gly, N‐Ala, and N‐Ser nucleophiles.
In addition, alternative transacylation sites have been detected for some of these enzymes.
Furthermore, the majority of investigated Streptococcus enzymes show a strong preference for an alternative LPxLG sorting motif, which is ligated more efficiently than the canonical LPxTG sorting motif in vitro.A further promising enzyme for protein chemistry is sortase A of C. diphtheriae (Cd‐SrtA). This enzyme is responsible for pilin polymerization in this bacterium. The deacylating amine nucleophile in Cd‐SrtA catalyzed ligations is not the N‐terminal residue of a peptidoglycan cross‐bridge, but an internal lysine residue in pilin subunit SpaA. Pilin polymerization occurs by Cd‐SrtA catalyzed crosslinking of a C‐terminal LPLTG sequence in CSpaA to the internal lysine moiety in a neighboring NSpaA protein (Figure 1c). In vitro transpeptidase activity of Cd‐SrtA could be established by introducing two mutations into an N‐terminal lid extension of the enzyme that otherwise blocks the active site.
Introduction of three further mutations resulted in the Cd‐SrtA‐M3 enzyme that requires the internal lysine of SpaA as the ligation site and peptides with C‐terminal LPLTG extension.
Protein bioconjugation with Cd‐SrtA‐M3 was then established by fusing SpaA to a protein of interest, followed by SML with synthetic LPLTG containing peptides. Importantly, Cd‐SrtA‐M3 and Sa‐SrtA ligation schemes can be combined for dual labeling strategies when the protein of interest is fused to SpaA and contains an additional N‐terminal glycine extension.These investigations demonstrate that natural sortases possess new and useful features for SML. Given the number of sortase A sequences deposited in gene repositories, it appears likely that further investigative endeavors into the pool of uncharacterized sortases will uncover new biocatalysts with advantageous properties for SML (Table 1).
Table 1
Summary of sortases and sortase mutants discussed in this review.
Preferred sorting motif
Preferred nucleophile
Activity compared to wt Sa‐SrtA
Ca2+ dependent
Comments
Ref.
Wild‐type sortases
Sa‐SrtA
LPxTG
N‐Gly‐Gly
yes
Sp‐SrtA
LPxLG
N‐Ala‐Ala
‐
no
LPxTG, LPxTA donors and N‐Gly‐Gly, N‐Ala‐Gly, N‐Ser‐Gly acceptors are further substrates of Sp‐SrtA
[14]
Cd‐SrtA
LPLTG
Internal Lys of SpaA
n.d.
no
[16]
n.d. not determined in comparison to wild‐type Sa‐SrtA, ++ strongly enhanced,+enhanced,−reduced, (−) slightly reduced.
Summary of sortases and sortase mutants discussed in this review.Preferred sorting motifPreferred nucleophileActivity compared to wt Sa‐SrtACa2+ dependentCommentsRef.Wild‐type sortasesSa‐SrtALPxTGN‐Gly‐GlyyesSp‐SrtALPxLGN‐Ala‐Ala‐noLPxTG, LPxTA donors and N‐Gly‐Gly, N‐Ala‐Gly, N‐Ser‐Gly acceptors are further substrates of Sp‐SrtACd‐SrtALPLTGInternal Lys of SpaAn.d.noSortase mutants with enhanced activitySrtA‐M4LPxTGN‐Gly‐Gly++yeskcat/KM (LPETG)=28.000 M−1 s−1 compared to 200 M−1 s−1 for wt. 9.3‐fold lower kcat/KM (GGG)SrtA‐M5LPxTGN‐Gly‐Gly++yeskcat/KM (LPETG)=23.000 M−1 s−1 compared to 200 M−1s−1 for wt. 5.8‐fold lower kcat/KM (GGG)SrtA‐M7LPxTGN‐Gly‐Gly++nokcat/KM (LPETG)=5.780 M−1s−1 in the absence of Ca2+SrtA‐M5/D124GLPxTGN‐Gly‐Gly++yesSML at antibody N termini; kcat/KM (LPETG)=9.305 M−1s−1 compared to 159 M−1s−1 for wt.SrtA‐M5/D124G/Y187L/E189RLPxTGN‐Gly‐Gly++yesSML at antibody C termini: kcat/KM (LPETG)=16.722 M−1s−1.SrtA‐M4/R159N/K162PLPxTGN‐Gly‐Gly++yesincreased thermostability of M4SrtA‐P94H/A104T/E105D/G167E/Q172HLPxTGN‐Gly‐Gly+yesSrtA‐M3LPxTGN‐Gly‐Gly+yesMore DMSO resistentSortase mutants with altered specificitySrtA‐F40APxTGN‐Gly‐Gly−yesSrtA‐A1‐22APxTGN‐Gly‐Gly−yesSrtA‐F1‐20FPxTGN‐Gly‐Gly−yeseSrtA(4S)LPxSGN‐Gly‐Gly++yesengineered on SrtA‐M5 templateeSrtA(2A)LAxTGN‐Gly‐Gly++yesengineered on SrtA‐M5 templatemSrt2ALAxTGN‐Gly‐Glyn.d.noengineered on eSrt(2A) templateFurther sortase mutantsSa‐SrtA‐E105K/E108ALPxTGN‐Gly‐Gly(−)noSa‐SrtA‐E105K/E108QLPxTGN‐Gly‐Gly(−)non.d. not determined in comparison to wild‐type Sa‐SrtA, ++ strongly enhanced,+enhanced,−reduced, (−) slightly reduced.
Engineering of Sortases
Sortase engineering is a highly promising strategy for establishing new biocatalysts optimized for the needs of SML approaches. Such tailor‐made sortases with engineered features have been generated by directed protein evolution.
Engineered Sa‐SrtA mutants with enhanced kinetics
Sa‐SrtA possesses low catalytic activity as result of a high K
M of 5 mM for the LPxTG substrate and a turnover rate of 1 s−1.
The high K
M in the millimolar range is a major obstacle for SML approaches and might be explained by the fact that native Sa‐SrtA, which includes a membrane anchor, operates at high local concentrations of both substrates, which are also bound to the bacterial cell wall. Consequently, the evolutionary pressure on high substrate affinity might have been low. The lack of high local substrate concentrations in SML has been addressed by the development of proximity‐based SML.
Sortase and LPxTG containing substrate proteins are tethered together by the SpyCatcher‐SpyTag peptide‐protein pair in this approach, resulting in accelerated ligation rates.However, protein evolution is a highly promising alternative approach for addressing this problem by enhancing the kinetics of the catalyst itself. In order to evolve more active Sa‐SrtA mutants, an elaborated yeast display system for selecting sortase mutants with enhanced kinetics was established. Sortase mutants are displayed on the surface of yeast cells as fusion proteins with mating factor Aga2p, while the genetic information is stored in the cell. The Aga2p‐mut‐SrtA fusion is disulfide‐linked to a second protein, Aga1p, which can be equipped with a peptide nucleophile by bioconjugation employing a glycine functionalized S6 tag. Conjugation of the S6 tag requires the coenzyme A (CoA) triglycine substrate and 4′‐phosphopantethein transferase Sfp as catalyst. Upon installation of a triglycine nucleophile at Aga1p, bound SrtA was shown to ligate biotin‐LPxTG peptides to the triglycine nucleophile in a pseudo‐intramolecular reaction, thereby covalently linking the biotin handle to the yeast surface (Figure 2a).
This yeast display system was used to screen a library of 7.8×107 sortase mutants generated by PCR amplification with mutagenic dNTPs. The screen was aimed primarily at improving the poor affinity for the sorting motif by adding the biotin‐LPxTG in decreasing concentrations. Upon attachment of a streptavidin‐conjugated fluorophore to the biotin handle, the yeast cells were subjected to FACS sorting. After several rounds of selection, SrtA‐M4 and SrtA‐M5, enzymes carrying 4 and 5 mutations, respectively, were isolated and shown to have a more than 100‐fold higher catalytic efficiency (k
cat/K
M LPETG) than wild‐type Sa‐SrtA (Table 1). The mutations of SrtA–M5 are located close to the binding groove of the substrate containing the sorting motif (P94R/D160N/D165A/K196T) and the β7–β8 loop (K190E) interacting with the nucleophile (Figure 2d).
The SrtA‐M5 mutant is intensively used in SML approaches and further protein evolution approaches on the SrtA‐M5 scaffold have been reported.
Figure 2
Selection schemes for sortase A activity enhancement. a) General scheme for evolving bond‐forming enzymes by yeast display as applied to sortase A. SrtA of S. aureus is displayed as a fusion with the surface protein Aga2p (not shown), which itself is covalently linked to Agap1 (also not shown) carrying the reactive S6 peptide. Sfp phosphopantetheinyl transferase from Bacillus subtilis is then used to covalently link two glycine residues to the S6 tag, which provide the nucleophile for the sortase reaction. Biotinylated sorting motif peptide is then ligated to the N‐terminal glycine residue by the spatially proximal sortase A protein, thereby coupling genotype with phenotype. Streptavidin‐phycoerythrin is then used to detect and select successfully ligated product by FACS. b) a FRET‐based platform for srtA activity screening. The LPETG sorting motif was fused to the C terminus of EGFP, while a triglycine moiety was attached to the N terminus of cpVenus. Bacterial clones expressing sortase variants leading to increased FRET signal intensity were selected. c) Protein complementation assay used for the directed evolution of sortase A. The two fragments of murine dihydrofolate reductase require covalent ligation by SrtA to confer activity and bacterial survival in the presence of the antibiotic trimethoprim. d) Activity enhancing mutations of SrtA from S. aureus depicted within the solution structure of the enzyme (PDB ID: 2KID). Mutations found in Chen et al. (2011)
are displayed in orange, those found in Chen et al. (2016)
in yellow, those from Suliman et al. (2017)
in pink and those from Wojcik et al. (2019)
in blue. Amino acids are labeled by type and number with respect to the S. aureus wt protein, while the corresponding mutations found in the respective screens are indicated behind the number. Residues 59–72 from the structure have been omitted for clarity. Cys184 indicates the active site, while the core LPA tripeptide from the peptide analog is shown in green.
Selection schemes for sortase A activity enhancement. a) General scheme for evolving bond‐forming enzymes by yeast display as applied to sortase A. SrtA of S. aureus is displayed as a fusion with the surface protein Aga2p (not shown), which itself is covalently linked to Agap1 (also not shown) carrying the reactive S6 peptide. Sfp phosphopantetheinyl transferase from Bacillus subtilis is then used to covalently link two glycine residues to the S6 tag, which provide the nucleophile for the sortase reaction. Biotinylated sorting motif peptide is then ligated to the N‐terminal glycine residue by the spatially proximal sortase A protein, thereby coupling genotype with phenotype. Streptavidin‐phycoerythrin is then used to detect and select successfully ligated product by FACS. b) a FRET‐based platform for srtA activity screening. The LPETG sorting motif was fused to the C terminus of EGFP, while a triglycine moiety was attached to the N terminus of cpVenus. Bacterial clones expressing sortase variants leading to increased FRET signal intensity were selected. c) Protein complementation assay used for the directed evolution of sortase A. The two fragments of murine dihydrofolate reductase require covalent ligation by SrtA to confer activity and bacterial survival in the presence of the antibiotic trimethoprim. d) Activity enhancing mutations of SrtA from S. aureus depicted within the solution structure of the enzyme (PDB ID: 2KID). Mutations found in Chen et al. (2011)
are displayed in orange, those found in Chen et al. (2016)
in yellow, those from Suliman et al. (2017)
in pink and those from Wojcik et al. (2019)
in blue. Amino acids are labeled by type and number with respect to the S. aureus wt protein, while the corresponding mutations found in the respective screens are indicated behind the number. Residues 59–72 from the structure have been omitted for clarity. Cys184 indicates the active site, while the core LPA tripeptide from the peptide analog is shown in green.A different screening approach was able to evolve further Sa‐SrtA mutants with enhanced kinetics.
At first, the wild‐type Sa‐SrtA gene was subjected to error prone PCR. In addition, saturation mutagenesis of the SrtA‐M5 template was used for generating a second sortase library. Both libraries were screened for high FRET signal intensities resulting from SML of EGFP and cpVenus equipped with LPETG motif and triglycine moiety, respectively (Figure 2b). These screens uncovered three further mutations that enhanced sortase kinetics when combined with the M5 mutations (M5/D124G and M5/Y187L/E189R) (Figure 2d). These mutants were particularly useful for SML of antibodies with SrtA‐M5/D124G as the most efficient mutant for ligating peptides to the N terminus of an antibody. SrtA‐M5/Y187E/E189R was highly effective in ligation reactions at the C termini of antibodies.A further directed sortase evolution approach was based on dihydrofolate reductase (DHFR) complementation screens. This screening platform capitalizes on the assembly of full‐length mammalian DHFR from two fragments extended with LPxTG and triglycine sequences by SML (Figure 2c). Upon inhibition of endogenous bacterial DHFR by Trimethoprim, the ligated full‐length murine DHFR becomes essential for bacterial survival.Screening a library of 2×107 sortase mutants generated by error‐prone PCR uncovered five recurrent mutations P94H/A104T/E105D/G167E/Q172H) localized in the β6–β7 and β7–β8 loops of Sa‐SrtA (Figure 2d).
P94 is the only common residue found between two of the screens, supporting the conclusion that the type and exact conditions of the screening and selection methods influence the results. More interestingly, almost all mutations in these different screens change the charge of the corresponding wild‐type amino acid, indicating that additional long‐ranging electrostatic interactions play a pivotal role in substrate recognition and catalysis. Thus, while several mutations localize in the vicinity of the substrates, others are found more remotely at the tip of the β7–β8 loop (Figure 2d).Finally, further screening platforms for directed sortase evolution are currently in development. A recently reported strategy makes use of immobilized reporters of SML reactions in microtiter plate formats.
SML is used to ligate a reporter protein to peptides, serving as handles for immobilization. The reporter protein catalyzes a colorimetric readout reaction when successfully immobilized in the plates. Screening sortase libraries obtained by saturation mutagenesis at selected sites confirmed the efficiency of the M5 mutations for Sa‐SrtA catalysis (Table 1).
Sortase mutants with altered substrate specificity
In addition to engineering Sa‐SrtA kinetics, this enzyme has been also subjected to directed evolution of substrate specificity.Sortases with altered substrate specificity are attractive for protein semisynthesis approaches where proteins are reconstituted form synthetic and recombinant fragments by ligation reactions. Sortases with altered specificity could reduce the number of mutations that remain as ‘ligation scarring’ with the ligation products. Furthermore, sortases with alternative sorting motifs and nucleophile specificities could enable orthogonal SML with more than two fragments.Initial attempts to change the Sa‐SrtA specificity were based on the observation that Sa‐SrtB recognizes an NPQTN sorting motif but possesses very limited in vitro activity. Structural data allowed residues of the β6–β7 loop to be assigned as the main contact site between sortase and the substrate containing the sorting motif. A loop‐swap chimera (SrtLS) was generated by grafting the β6–β7 loop of Sa‐SrtB onto the Sa‐SrtA protein scaffold.
Chimeric SrtLS recognized the NPQTN sequence as substrate, but was only able to catalyze cleavage of this motif and not the ligation reaction. A more recently developed Sp‐SrtA chimera was engineered, focusing on altered specificity for the sortase deacylating peptide nucleophile.
Nucleophile recognition is largely mediated by residues of the β7–β8 loop. By grafting this region from Sa‐SrtA onto Sp‐SrtA, the resulting hybrid enzyme is restricted to N‐Gly substrates. The hybrid enzyme was no longer able to ligate LPxTA motifs to N‐Ala nucleophiles like wild‐type Sp‐SrtA, confirming the important role of the β7–β8 loop in nucleophile substrate recognition.The first directed evolution of Sa‐SrtA substrate specificity was used in a phage display system for selecting mutants with altered sorting motif specificity. A library of 1×108 Sa‐SrtA mutants was generated by NNK mutagenesis, randomizing six residues of the β6–β7 loop predicted to interact with the leucine residue of the LPxTG motif (Figure 3a).
The library was displayed as C‐terminal fusion with the pIII surface protein on M13 phages. The N terminus of the displayed sortase mutants was extended by a pentaglycine tail, enabling ligations to the N termini. Ligation of biotinylated LPxTG peptides allowed enrichment of phages on streptavidin resins. After three rounds of phage display with a biotin‐FPxTG bait, isolated sortase mutants were characterized. In general, these mutants were promiscuous with respect to the leucine position in the sorting motif and ligated the expected FPxTG sequence. The activity of the selected mutants was generally lower than that of the wild‐type enzyme. One mutant, referred to as F40 sortase, appeared particularly interesting because this enzyme catalyzed ligation reactions of APxTG and DPxTG substrates. The first of these amino acid sequences is present in histone H3, enabling traceless semisynthesis of this protein by SML.
However, the general broadening of specificity in this phage display campaign was surprising and further investigated. The initial library design was based on the first reported Sa‐SrtA crystal structure.
However, the subsequently elucidated NMR structure of the sortase A construct with covalently attached LPA core motif analog revealed a very different conformation of the β6–β7 loop in the substrate bound state, indicating that the residues selected for randomization might not have been ideal for evolving the Sa‐SrtA specificity.
This NMR structure could retrospectively rationalize the preference of Ala and Asp instead of Leu at the first position of the sorting motif, despite the ligation reaction having been performed with an FPKTG peptide. A more bulky Met residue is found instead of Val at position 168 of the β6–β7 loop in the F40 sortase, likely to geometrically favor small side‐chains in the substrate. In addition, R197 of the β7–β8 loop likely provides a favorable charge interaction in case the N‐terminal residue of the sorting motif is an Asp. To more rationally change the properties of the β6–β7 loop, a re‐designed Sa‐SrtA library was generated, based on the NMR Sa‐SrtA structure using trinucleotide building blocks, and also including variations of the β6–β7 loop length.
Phage display screening of this re‐designed library with FPxTG and APxTG baits yielded sortase mutants SrtA‐F1‐20 and SrtA‐A1‐22 with expected preferences for these altered sorting motifs. The majority of isolated mutants contained β6–β7 loops that were longer than the native loop (11 instead of nine residues; Figure 3b).
Figure 3
Sortase mutants with altered substrate specificity. a) Selection scheme for changing the specificity of SrtA. Self‐ligation of an altered sorting motif to the N‐terminal glycine artificially fused to the randomized sortases displayed on phage allows active sortases to be labeled by biotin and subsequent enrichment on NeutrAvidin‐coated agarose beads. The design of the first‐generation SrtA library is shown below left and indicates the six NNK randomized positions in orange. The second‐generation library was randomized at positions 161–169 by using trinucleotides and excluding stop and cysteine codons. Furthermore, loop lengths were allowed to vary between 7 and 11 amino acids. The most potent variant indeed contained 11 amino acids, as indicated bottom right. b) Altering sortase A specificity and calcium dependence by design. The amino acids of SrtA that were randomized or changed by design are rendered as sticks. Libraries selected for the recognition of amino acids other than leucine at the first position of the sorting motif are shown in orange with the randomized region restricted to the β6–β7 loop, see (a) for the identified mutations. Amino acids are indicated by type and number and the core LPA motif of the ligand is displayed in green. The two mutations that were rationally designed to render SrtA Ca2+‐independent are shown in purple. The bound calcium is displayed as a gray sphere. c) Mutations identified by full randomization of SrtA and selected to ligate the sorting motif LAETG are shown in blue. As in (b) amino acids that were found to be mutated compared to wt sortase A from S. aureus are rendered as sticks (in blue). In the screen performed in this study,
randomization started from sortase A with enhanced kinetic properties (the so‐called M5 variant or eSrtA, Figure 2d), thus two of the mutations, N160K and T196S, affected residues at the M5 variant positions.
Sortase mutants with altered substrate specificity. a) Selection scheme for changing the specificity of SrtA. Self‐ligation of an altered sorting motif to the N‐terminal glycine artificially fused to the randomized sortases displayed on phage allows active sortases to be labeled by biotin and subsequent enrichment on NeutrAvidin‐coated agarose beads. The design of the first‐generation SrtA library is shown below left and indicates the six NNK randomized positions in orange. The second‐generation library was randomized at positions 161–169 by using trinucleotides and excluding stop and cysteine codons. Furthermore, loop lengths were allowed to vary between 7 and 11 amino acids. The most potent variant indeed contained 11 amino acids, as indicated bottom right. b) Altering sortase A specificity and calcium dependence by design. The amino acids of SrtA that were randomized or changed by design are rendered as sticks. Libraries selected for the recognition of amino acids other than leucine at the first position of the sorting motif are shown in orange with the randomized region restricted to the β6–β7 loop, see (a) for the identified mutations. Amino acids are indicated by type and number and the core LPA motif of the ligand is displayed in green. The two mutations that were rationally designed to render SrtA Ca2+‐independent are shown in purple. The bound calcium is displayed as a gray sphere. c) Mutations identified by full randomization of SrtA and selected to ligate the sorting motif LAETG are shown in blue. As in (b) amino acids that were found to be mutated compared to wt sortase A from S. aureus are rendered as sticks (in blue). In the screen performed in this study,
randomization started from sortase A with enhanced kinetic properties (the so‐called M5 variant or eSrtA, Figure 2d), thus two of the mutations, N160K and T196S, affected residues at the M5 variant positions.A set of orthogonal sortases recognizing LAxTG and LPxSG motifs evolved on the basis of the yeast display system discussed above (Figure 2a) was reported, enabling dual labeling of proteins with SML. The screening campaign of a sortase library at 2 % mutation level was optimized for high selectivity, and yielded sortase mutants with the desired altered specificity and no significant loss in activity.
Three key mutations responsible for substrate promiscuity were derived from this screen and used to generate sortase mutants accepting LAxTG and LPxSG as sorting motifs. These mutants were subjected to further directed evolutions, finally delivering evolved sortases eSrtA(2A) and eSrtA(4S) showing high specificity for LAxTG and LPxSG motifs. Both enzymes accumulated a significant number of mutations during the evolution process, some of which were located at contact sites between the enzyme and the substrate, while other mutations were found in remote regions (Figure 3c). Importantly, these mutants maintained a high increase in activity (Table 1).
Calcium‐independent sortase mutants and mutants with improved solvent or temperature stability
A further challenge of Sa‐SrtA is the requirement of calcium for effective catalysis. Calcium‐dependency constitutes a significant problem for applications in living cells where calcium levels are low. Calcium‐dependency is observed mainly for sortases of Staphylococci origin, while other sortases, including Sp‐SrtA, are calcium‐independent. However, since Sa‐SrtA is the most active sortase uncovered so far, a calcium‐independent enzyme was generated by rational design.
In Sa‐SrtA, the Ca2+ ion is important for stabilizing the closed conformation of the β6–β7 loop in a substrate‐bound state. The Ca2+ binding site is formed by three glutamic acid residues: Glu171 of the β6–β7 loop, and Glu 108 and Glu 105 located in the β3–β4 loop. In the calcium‐independent Sp‐SrtA, the Ca2+ interactions are substituted by a salt bridge between a lysine and an aspartic acid moiety. The calcium‐independent Sa‐SrtA was generated by installing a similar salt bridge into the β3–β4 loop. The resulting mutants, Sa‐SrtA(E105K/E108A) and Sa‐SrtA(E105K/E108Q), catalyzed SML reactions with slightly slower kinetics compared to the wild‐type enzyme in the absence of calcium. Mutations mediating calcium independence could be further combined with the rate‐accelerating mutations of the SrtM5 enzyme.
The resulting SrtA‐M7 combines the advantageous features of both engineering approaches and is now a widely used tool in protein chemistry.
The SrtA‐M7 enzyme was further shown to accept a broad range of small molecule amine nucleophiles containing biorthogonal groups. When co‐expressing SrtA‐M7 and LPxTG containing proteins in Escherichia coli this feature was exploited for the in vivo incorporation of alkynes, azides and tetrazines into the co‐expressed proteins.A further desirable feature of enzymes used for biotechnology is their resistance to organic solvents. SML in organic solvents or mixtures of water and organic solvents would allow ligation reactions with hydrophobic substrates. Wild‐type sortase activity decreases dramatically in the presence of high concentrations of solvents like DMSO. An engineering campaign screened for sortase mutants able to perform SML in the presence of 45 % DMSO. The screen was based on ligation of a LPxTG modified CueO laccase reporter to an N‐Gly modified anchoring protein, enabling immobilization of the ligation products in polypropylene microtiter plates. A sortase library generated by saturation mutagenesis was screened in the presence of 45 % DMSO for efficient SML. The screen delivered sortase mutants, including SrtA‐M3, which showed higher transpeptidase activity than wild type Sa‐SrtA in both the presence and absence of 45 % DMSO.To improve the thermostability of Sa‐SrtA, multiple sequence alignments of sortases revealed certain positions of sortases from extremophiles to be conserved or distinct from Sa‐SrtA, identifying M155V and V193R as mutations that increased activity at 37 °C or higher (Figure 2d; Table 1).Notably, sortases evolved for high catalytic activity commonly come at the cost of decreased thermostability. It was therefore attempted to improve the stability of the activity‐enhanced M4 mutant from the original yeast display screen(Table 1).[
,
] This mutant shows a more than 10 °C lower thermal melting temperature compared to wild‐type SaSrtA (48.6 vs 59.4 °C). Single site‐saturation mutagenesis of residues in the β6–β7 loop identified two mutations (R159N and K162P) that, when combined, led to a 6 °C higher stability of this new M6‐SrtA. The relative activity of the M6 mutant was increased 58‐fold compared to wild‐type SaSrtA after pre‐incubation of the enzymes at 55 °C for 1 hour. Further improvement of stability by 1.5 °C was achieved by head‐to‐tail cyclization of the engineered variant.
Applications of Engineered Sortases
Engineered sortases have already made a strong impact on protein chemistry. In particular, SrtA‐M5 and the calcium‐independent SrtA‐M7 with enhanced kinetics are often used as more efficient alternatives to wild‐type Sa‐SrtA. Biotechnological approaches, such as the generation of antibody conjugates with biophysical probes or drugs by SML, benefit strongly from the accelerated ligation reactions catalyzed by these enzymes.[
,
]Here we will focus on biochemical applications of engineered sortases, particularly the utility of these enzymes for investigating the function of post‐translational modifications (PTMs) of proteins. We will refer to other reviews for a more global view on sortase applications.PTMs can serve as regulators of protein function and activity by modulating protein‐protein interactions or inducing conformational changes within the protein structure.
Histone proteins package eukaryotic DNA into chromatin and are extensively decorated by PTMs in the N‐terminal tail regions.
These histone tails protrude from the basic structural units of chromatin, the nucleosomes, into the nucleus.
Modifications of histone tails constitute an epigenetic histone code by recruiting chromatin factors to the DNA template. Protein semisynthesis is an attractive tool for generating histones with defined modification patterns. In the first step, PTMs are introduced into the synthetic fragments and a subsequent ligation reaction introduces the modification into the full‐length protein.
SML is particularly useful for generating semisynthetic histone H3. Histone H3 contains the sequence APATG at the interface between the tail and the globular fold, spanning positions 29 to 33, which resembles the LPxTG sorting motif closely. Wild‐type sortase has been used for generating libraries of nucleosomes with defined methylation and phosphorylation patterns.
The H3 tails were installed on the level of assembled nucleosomes and allowed the generation of libraries with defined modification patterns. Library screens with chromatin reader protein HP1 uncovered a cross‐talk between the HP1 recruitment site at H3K9me2 and phosphorylation of the remote residue S28, thereby demonstrating the utility of SML for chromatin research.However, SML of H3 with wild‐type Sa‐SrtA comes at the price of an A29L point mutation introduced into H3, which could potentially impact the native interactions between chromatin factors and the histone tail. The F40 sortase mutant represents an alternative catalyst for H3 SML, despite being less active than wild‐type Sa‐SrtA. Ligation of the native H3 sequence APATG by F40‐SrtA allows traceless H3 semisynthesis and has already been applied in chromatin biochemistry.
The SAGA complex is a transcriptional coactivator and possesses lysine acetyltransferase activity.
It was shown that SAGA is recruited to nucleosomes containing trimethylated H3K4me3 by a tandem Tudor domain with SAGA protein Sgf29. H3K4me3 nucleosomes were generated by F40‐SrtA catalyzed SML of H3 from a synthetic H3K4me3 peptide and N‐terminal truncated recombinant H3.
In contrast to the library approach discussed above, this SML reaction was performed on the histone level, and the ligated H3 was subsequently incorporated into reconstituted nucleosomes. In a competitive setting of H3K4me3 nucleosomes in the presence of unmodified nucleosomes, SAGA preferentially hyperacetylated the H3K4me3 fraction, allowing a model to be established for the recruitment of the SAGA complex to chromosomal promoter regions.In addition to lysine acetyltransferases, their counterparts, lysine or histone deacetylases (HDACs) constitute a further important class of transcriptional regulators. The CoREST complex plays a central role in neuronal development and possesses deacetylase activity, mediated by HDAC1. In addition, the complex contains LSD1, a histone demethylase of H3K4me2.
HDAC1 and LSD1 are tethered by scaffolding protein CoREST, forming the complex core referred to as LHC. The catalytic properties and regulation of LHC were characterized using nucleosomes modified at H3 with methylation marks at K4 and further acetylated lysine residues at the K9, K14, and K18 positions.
Modified H3 was generated by SML with F40 sortase and subsequently incorporated into nucleosomes (Figure4 a). The SML reaction involved sorting motif substrates with a depsipeptide bond downstream of the threonine residue in the sorting motif. The released leaving group contains an N‐terminal glycolic acid moiety, which does not compete with the N‐Gly nucleophile of truncated H3, thus driving the SML reaction to completion.
This strategy was also applied for establishing nucleosomal libraries with wild type Sa‐SrtA as discussed above. The site‐specifically methylated and acetylated nucleosomes showed that H3K14ac suppresses demethylation of H3K4me2 by LSD1. Furthermore, HDAC1 removes the H3K14ac mark sluggishly compared to acetylation marks at H3K9ac and H3K18ac. These observations indicate a regulatory crosstalk between H3K4me2 and H3K14ac.
The SML strategy was further applied for generating H3‐acetylated nucleosomes for investigating further complexes containing class I HDACs including CoREST, NuRD, Sin3B, MiDAC, and SMART.
Deacetylation experiments showed variations in substrate specificities of these complexes and a glycine residue upstream of H3K14ac that induces reduced deacetylase activity of CoREST towards this acetylation mark.
Figure 4
Applications of engineered sortases. a) Engineered F40 sortase allows traceless semi‐synthesis of histone H3. Highly efficient SML of H3 was enabled by inserting a depsipeptide bond into the sorting motif. Designer nucleosomes site‐specifically methylated and acetylated at Lys4 and Lys14 allowed mechanistic insights into the recognition of the LHC core of the CoREST complex. b) Genetic encoding of a lysine residue with isopeptide azido‐Gly‐Gly modification allowed in vivo installation of ubiquitin by engineered mSrt2A. Reduction of the azido moiety delivered the amine nucleophile, which was ligated with ubiquitin containing a C‐terminal LALTG sequence, which differed only at two positions from the native sequence.
Applications of engineered sortases. a) Engineered F40 sortase allows traceless semi‐synthesis of histone H3. Highly efficient SML of H3 was enabled by inserting a depsipeptide bond into the sorting motif. Designer nucleosomes site‐specifically methylated and acetylated at Lys4 and Lys14 allowed mechanistic insights into the recognition of the LHC core of the CoREST complex. b) Genetic encoding of a lysine residue with isopeptide azido‐Gly‐Gly modification allowed in vivo installation of ubiquitin by engineered mSrt2A. Reduction of the azido moiety delivered the amine nucleophile, which was ligated with ubiquitin containing a C‐terminal LALTG sequence, which differed only at two positions from the native sequence.Ubiquitylation and SUMOylation are two further important PTMs of proteins.
These small proteins are linked to lysine side chains via isopeptide bonds. Several elaborate ligation strategies have been developed for site‐specific ubiquitylation and SUMOylation of proteins.
A recently reported strategy, referred to as “sortylation”, combines genetic‐code expansion and SML with engineered sortases enabling ubiquitylation and SUMOylation in living cells (Figure 4b).
This approach was made possible by genetic encoding of lysine carrying an azido‐Gly‐Gly isopeptide extension using amber suppression technology. Upon incorporation into recombinant proteins, the azide can be reduced to an amine to serve as a nucleophile in SML reactions. To this end, the LPxTG motif was engineered into the C terminus of ubiquitin or SUMO, allowing their ligation with SrtA‐M5. However, the inserted proline residue yielded an undesirable outcome and was substituted with an alanine residue, resulting in an LAxTG motif serving as substrate of engineered eSrtA(2A). The resulting ubiquitylated proteins differ at two positions from the native amino acid sequence, and the isopeptide bond was shown to be resistant to deubiquitinases. This methodology could be further established in living cells by coexpression of all reaction partners. However, in order to cope with the low intracellular calcium concentration, the eSrtA(2A) enzyme was subjected to a further engineering step, introducing the D105H and E108E mutations to render the resulting mStrt2A calcium‐independent. The mSrt2A enzyme allowed application of this methodology in live cells, thus bypassing the native ubiquitylation and SUMOylation systems.These recent examples of SML with engineered sortases provide a first glimpse at the potential of these tools for investigating protein modifications. With more engineered sortases providing new features, it appears likely that the impact of SML on modern biochemistry will continue to evolve.
Summary and Outlook
Sortase engineering has come of age, showing that wild‐type sortase can be improved with regard to its catalytic activity and that its specificity can be changed with regard to individual positions in the sorting motif. This allows for interesting applications in the field of intracellular signaling, probing the role of individual post‐translational modification events. Capitalizing on the already established set of engineered sortases allows researchers to create sortases with new properties by combining structurally independent elements of the enzyme (e. g., the incorporation of kinetically derived mutants into specificity changing mutants). Alternatively, future research may exploit the richness of natural sortases by methods such as DNA shuffling, with the promise to further explore sequence space and to direct evolution towards new substrate specificities. These attempts are ideally flanked by mechanistic investigations that capture the essential dynamics and structural transitions that accompany the two ligation steps. Based on its ease of expression and manipulation, sortase A might become a good example of how to stretch the limits of a naturally occurring enzyme for the purpose of biotechnology.
Authors: Maximilian Fottner; Andreas-David Brunner; Verena Bittl; Daniel Horn-Ghetko; Alexander Jussupow; Ville R I Kaila; Anja Bremm; Kathrin Lang Journal: Nat Chem Biol Date: 2019-02-15 Impact factor: 15.040
Authors: Lena Schmohl; Jan Bierlmeier; Nicolai von Kügelgen; Leonie Kurz; Pascal Reis; Fabian Barthels; Pia Mach; Mike Schutkowski; Christian Freund; Dirk Schwarzer Journal: Bioorg Med Chem Date: 2017-06-22 Impact factor: 3.641
Authors: Alex W Jacobitz; Michele D Kattke; Jeff Wereszczynski; Robert T Clubb Journal: Adv Protein Chem Struct Biol Date: 2017-06-05 Impact factor: 3.507
Authors: S Rea; F Eisenhaber; D O'Carroll; B D Strahl; Z W Sun; M Schmid; S Opravil; K Mechtler; C P Ponting; C D Allis; T Jenuwein Journal: Nature Date: 2000-08-10 Impact factor: 49.962
Authors: Melody Gao; D Alex Johnson; Isabel M Piper; Hanna M Kodama; Justin E Svendsen; Elise Tahti; Frederick Longshore-Neate; Brandon Vogel; John M Antos; Jeanine F Amacher Journal: Protein Sci Date: 2022-01-03 Impact factor: 6.725
Authors: Zhipeng A Wang; Samuel D Whedon; Mingxuan Wu; Siyu Wang; Edward A Brown; Ananya Anmangandla; Liam Regan; Kwangwoon Lee; Jianfeng Du; Jun Young Hong; Louise Fairall; Taylor Kay; Hening Lin; Yingming Zhao; John W R Schwabe; Philip A Cole Journal: J Am Chem Soc Date: 2022-02-17 Impact factor: 16.383