Literature DB >> 35850488

Site-Specific Protein Labeling and Generation of Defined Ubiquitin-Protein Conjugates Using an Asparaginyl Endopeptidase.

Maximilian Fottner1, Johannes Heimgärtner1, Maximilian Gantz2, Rahel Mühlhofer2, Timon Nast-Kolb3, Kathrin Lang1,2.   

Abstract

Asparaginyl endopeptidases (AEPs) have recently been widely utilized for peptide and protein modification. Labeling is however restricted to protein termini, severely limiting flexibility and scope in creating diverse conjugates as needed for therapeutic and diagnostic applications. Here, we use genetic code expansion to site-specifically modify target proteins with an isopeptide-linked glycylglycine moiety that serves as an acceptor nucleophile in AEP-mediated transpeptidation with various probes containing a tripeptidic recognition motif. Our approach allows simple and flexible labeling of recombinant proteins at any internal site and leaves a minimal, entirely peptidic footprint (NGG) in the conjugation product. We show site-specific labeling of diverse target proteins with various biophysical probes, including dual labeling at an internal site and the N-terminus. Furthermore, we harness AEP-mediated transpeptidation for generation of ubiquitin- and ubiquitin-like-modifier conjugates bearing a native isopeptide bond and only one point mutation in the linker region.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35850488      PMCID: PMC9335880          DOI: 10.1021/jacs.2c02191

Source DB:  PubMed          Journal:  J Am Chem Soc        ISSN: 0002-7863            Impact factor:   16.383


Introduction

Site-specific protein labeling with various probes at internal sites and generation of defined protein–protein conjugates hold great promise for studying biological functions of proteins and for the development of therapeutic and diagnostic bioconjugates.[1−3] Nevertheless, the modification of proteins at user-defined sites under mild conditions still represents a formidable challenge. Development of genetic code expansion approaches for site-specific cotranslational encoding of non-canonical amino acids (ncAAs) bearing bioorthogonal handles and their reaction with custom-made labels has enabled the generation of a diverse range of protein conjugates with an exquisite level of control over the labeling site and number of modifications.[4,5] The synthesis and successful incorporation of functionalized ncAAs and probes can however be challenging. Furthermore, most bioorthogonal labeling reactions lead to bulky, hydrophobic, and artificial linkages between the biophysical probe and protein.[6−9] Alternatively, chemoenzymatic labeling methods have proven to be powerful tools for attaching probes to specific amino acid side chains within recognition sequences under mild conditions.[10−12] In a chemoenzymatic labeling experiment, the recombinantly expressed target protein is equipped with a peptide recognition motif (4–15 amino acids). The respective enzyme (a transferase[13] or ligase[14,15]) binds to this recognition tag and catalyzes covalent attachment of a functionalized substrate to a specific amino acid within this motif, thereby labeling the protein of interest (POI). The recognition motifs are typically fused to the N- or C-terminus of the POI, allowing installation of modifications close to the target protein’s termini. In a few instances, it was also possible to introduce the recognition sequences into accessible loop regions of the POI, providing thereby greater flexibility with respect to the location and number of modifications.[11,16] Still, most of these approaches leave sizable footprints and often bulky and artificial linkages in the ligation product. Apart from ligases and transferases that covalently attach a functionalized probe to an amino acid within the recognition tag, also engineered proteases and transpeptidases [e.g., subtiligase,[17] sortase,[18,19] and asparaginyl endopeptidases (AEPs)[20]] have been used successfully for terminal protein labeling and for the generation of protein–protein conjugates linked via their N- or C-termini. Such enzymes typically cleave a peptide bond within a recognition motif fused to the POI, forming a labile enzyme-POI intermediate that undergoes specific transpeptidation with a peptide functionalized with a user-chosen probe. Recent work has established the engineered AEP OaAEP1 [C247A] (dubbed OaAEP1 in the following) that can be produced recombinantly in E. coli as an ideally suited enzyme for N- and C-terminal labeling of recombinant proteins due to its high catalytic efficiency and minimal recognition motif (Figure a).[21,22]OaAEP1 is a cysteine protease that cleaves C-terminally of an asparagine or aspartate residue within the tripeptidic NGL recognition motif, forming a thioester intermediate that is resolved by attack of a suitable nucleophile (e.g., α-amino group of an N-terminal amino acid) to yield a ligation product. OaAEP1 has quite stringent sequence requirements for its recognition sequence (P1P1′P2′), with NGL representing an ideal motif, but was shown to be promiscuous for the incoming nucleophile sequence (P1″P2″). Previous work has found that both GL and GV dipeptides installed at the N-terminus of peptidic probes and proteins serve as efficient acceptor nucleophiles. Interestingly, the—in the latter case—resultant NGV ligation product is poorly recognized by OaAEP1, shifting the reaction equilibrium toward the product and thereby allowing sequential dual labeling at the N- and C-terminus.[23] Furthermore, OaAEP1’s promiscuity for the incoming nucleophile was harnessed by ligating various primary amines—if presented at high enough molar excess (typically 500–1000 equiv)—to a C-terminal NGL recognition site.[24] AEPs thereby represent versatile enzymes for protein labeling, but labeling is so far restricted to protein N- and C-termini.
Figure 1

OaAEP1-mediated protein labeling. (a) Previous work has established OaAEP1-mediated labeling of protein C- and N-termini. C-terminal labeling is achieved through attachment of an NGL motif at the C-terminus of a POI and its OaAEP1-catalyzed transpeptidation with a GV acceptor nucleophile bearing a user-defined cargo (blue circle). For N-terminal labeling, the GV acceptor nucleophile is installed at the N-terminus of the POI and reacted with a peptide displaying a C-terminal NGL motif in the presence of OaAEP1. (b) OaAEP1-mediated protein labeling at internal sites and generation of defined ubiquitin (Ub)/Ub-like protein (Ubl)-protein conjugates. Genetic code expansion allows site-specific encoding of GGisoK (via incorporation of AzGGisoK and subsequent Staudinger reduction) which can act as an acceptor nucleophile in OaAEP1-driven transpeptidation with peptides or proteins bearing a C-terminal NGL motif. OaAEP1-mediated internal site labeling enables the attachment of small molecules, biophysical probes, and Ub/Ubls to proteins in a site-specific manner, leaving a minimal scar (NGG). OaAEP1 Ub/Ubl-POI conjugates display a native isopeptide bond and only one point mutation in the linker region.

OaAEP1-mediated protein labeling. (a) Previous work has established OaAEP1-mediated labeling of protein C- and N-termini. C-terminal labeling is achieved through attachment of an NGL motif at the C-terminus of a POI and its OaAEP1-catalyzed transpeptidation with a GV acceptor nucleophile bearing a user-defined cargo (blue circle). For N-terminal labeling, the GV acceptor nucleophile is installed at the N-terminus of the POI and reacted with a peptide displaying a C-terminal NGL motif in the presence of OaAEP1. (b) OaAEP1-mediated protein labeling at internal sites and generation of defined ubiquitin (Ub)/Ub-like protein (Ubl)-protein conjugates. Genetic code expansion allows site-specific encoding of GGisoK (via incorporation of AzGGisoK and subsequent Staudinger reduction) which can act as an acceptor nucleophile in OaAEP1-driven transpeptidation with peptides or proteins bearing a C-terminal NGL motif. OaAEP1-mediated internal site labeling enables the attachment of small molecules, biophysical probes, and Ub/Ubls to proteins in a site-specific manner, leaving a minimal scar (NGG). OaAEP1 Ub/Ubl-POI conjugates display a native isopeptide bond and only one point mutation in the linker region. On our quest to realize site-specific protein labeling at internal sites and to generate complex protein–protein conjugates that are not exclusively linked through their respective N- or C-termini, we here combine OaAEP1-mediated transpeptidation with genetic code expansion. We show that target proteins carrying a site-specifically introduced isopeptide-linked glycylglycine moiety (GGisoK) serve as acceptor nucleophiles in OaAEP1-mediated ligation with various NGL-bearing probes and proteins, allowing the generation of site-specific and user-defined protein conjugates displaying a minimal tripeptidic mark in the ligation product (Figure b). We modify diverse POIs at specific sites with different biophysical probes. Furthermore, we leverage OaAEP1-mediated transpeptidation for the generation of defined ubiquitin (Ub)- and Ub-like protein (Ubl)-POI conjugates. Posttranslational modification of target proteins with Ub/Ubl presents one of the most common and versatile regulators in eukaryotic biology.[25,26] During ubiquitylation, the C-terminal carboxylate of Ub is attached to the ε-amino group of a lysine within a substrate protein to form an isopeptide bond via a complex machinery employing E1/E2/E3 enzymes.[25,26] We show that OaAEP1 can be used to covalently attach a Ub/Ubl variant containing the NGL recognition motif in its C-terminus to a GGisoK-bearing POI. OaAEP1-mediated Ub/Ubl conjugates display a native isopeptide bond connecting Ub/Ubl to a specific lysine and bear one point mutation in the linker region. Importantly, we show that these conjugates are still cleaved by the model deubiquitylase USP2,[27] and we demonstrate the generality of our approach by preparing various site-specifically ubiquitylated substrate proteins (Ub, SUMO2, and histone H3).

Results and Discussion

At the outset of our investigations, we were intrigued by the reported promiscuity of OaAEP1 for nucleophile acceptors (P1″P2″) and we assessed the enzyme’s tolerance and specificity for nucleophiles that can be site-specifically incorporated into proteins in the form of ncAAs via genetic code expansion. We have recently shown that we can site-specifically modify any POI with a GG-dipeptide moiety attached to the ε-amino group of a lysine residue via genetic encoding of AzGGisoK and its phosphine-based reduction to GGisoK (Figure b).[28,29] To assess OaAEP1’s tolerance for GXisoK nucleophiles (with X representing different natural amino acids), we synthesized GLisoK, GVisoK, and GGisoK and tested their OaAEP1-mediated transpeptidation onto Ub bearing the recognition sequence NGL at its C-terminus (Ub-NGL, Figure a). GLisoK and GVisoK were efficiently attached onto Ub-NGL within 30 min using 10 mol equiv of acceptor nucleophile over Ub-NGL (100 μM) and 0.02 equiv of OaAEP1. Ligation using GGisoK also resulted in the correct ligation product but was considerably slower, leading to >85% Ub-NGGisoK formation after 18 h (Figures b and S1). Unsurprisingly though, this ligation product showed higher stability against enzymatic hydrolysis upon prolonged incubation with OaAEP1 in the absence of any other acceptor nucleophile, with >85% intact NGG-ligation product after 24 h, while NGL- and NGV-bearing ligation products showed ∼50% hydrolysis under the same conditions (Figures c and S2). Encouraged by these results, we set out to investigate if GGisoK-bearing proteins could also function as potential nucleophiles in OaAEP1-mediated transpeptidation. We site-specifically encoded AzGGisoK in response to an introduced amber codon using the previously reported, selective pyrrolysyl-tRNA synthetase-derived AzGGisoKRS/tRNACUA pair.[28] The azide moiety in AzGGisoK-modified proteins can easily be reduced to its amine analogue via Staudinger reduction using phosphines such as tris(2-carboxyethyl)phosphine or 2-(diphenylphosphino)-benzoic acid, generating GGisoK-bearing proteins. Using this approach, we expressed C-terminally H6-tagged Ub, carrying GGisoK at position K48 (Ub-K48GGisoK). In parallel, we synthesized a desthiobiotin-NGL probe (dtb-NGL, Figure d). Incubation of 50 μM Ub-K48GGisoK with 500 μM dtb-NGL in the presence of 0.05 mol equiv OaAEP1 overnight however only afforded roughly 50% of the labeled product, as assessed by liquid chromatography mass spectrometry (LC–MS), with the remaining 50% constituting unlabeled Ub-K48GGisoK (Figure e).
Figure 2

OaAEP1-mediated protein labeling at internal sites. (a) Schematic representation of OaAEP1-mediated transpeptidation between Ub-NGL and GXisoK peptides (X = V, L, or G). (b) LC–MS analysis shows >90% product formation within short times for GVisoK and GLisoK, while transpeptidation with GGisoK takes 18 h to reach approx. 85% yield (100 μM Ub-NGL, 1 mM GXisoK, 2.5 μM OaAEP1, pH 7.0, 25 °C, calculated masses: Ub-NGL = 8579 Da, Ub-NGVisoK = 8693 Da, Ub-NGLisoK = 8707 Da, and Ub-NGGisoK = 8651 Da). (c) Time-resolved LC–MS analysis of purified Ub-NGXisoK ligation products (20 μM) incubated with OaAEP1 (0.5 μM, pH 6.8, 30 °C) indicates superior stability of Ub-NGGisoK toward OaAEP1-mediated hydrolysis (calculated masses: Ub-N = 8408 Da). (d) Schematic representation of OaAEP1-mediated labeling of Ub-K48GGisoK with NGL(H) peptides bearing different biophysical probes [dtb-NGL(H), alkyne-NGLH, and SuRho-NGLH (SuRho = sulforhodamine), blue box]. (e) Optimization of OaAEP1-mediated transpeptidation between Ub-K48GGisoK and dtb-NGL(H) as followed by time-resolved LC–MS. Although >90% labeling of Ub-K48GGisoK is achieved within 30 min using 1 mM dtb-NGLH, 1 mM NiSO4, and 10 μM OaAEP1, Ub-wt is recalcitrant toward labeling under similar conditions (calculated masses: Ub-K48GGisoK-H6 = 9502 Da, Ub-K48(dtb-N)GGisoK-H6 = 9812, and Ub-wt = 8565 Da, o.n. = overnight). (f) LC–MS analysis of OaAEP1-mediated transpeptidation of Ub-K48GGisoK with alkyne-NGLH and SuRho-NGLH peptides. Transpeptidation with alkyne-NGLH is quantitative within 30 min and with SuRho-NGLH yields >90% product formation overnight. (50 μM Ub-K48GGisoK, 1 mM alkyne/SuRho-NGLH, 1 mM NiCl2 or NiSO4, 5 or 10 μM OaAEP1, pH 7.4, 30 °C; calculated masses: Ub-K48(alkyne-N)GGisoK-H6 = 9753 Da and Ub-K48(SuRho-N)GGisoK-H6 = 10302 Da). (g) SDS–PAGE (left) and LC–MS analysis (right) of OaAEP1-driven labeling of Ub-K48GGisoK with Fmoc-(G)6-NGLH over time. The blue arrow indicates the labeled product displaying a gel shift, while a black arrow indicates the starting material. [50 μM Ub-K48GGisoK, 1 mM Fmoc-(G)6-NGLH, 1 mM NiCl2, 2 μM OaAEP1, pH 6.8, 30 °C; calculated mass: Ub-K48(Fmoc-(G)6-N)GGisoK-H6 = 10181 Da]. For detailed conditions, see Figures S1–S8.

OaAEP1-mediated protein labeling at internal sites. (a) Schematic representation of OaAEP1-mediated transpeptidation between Ub-NGL and GXisoK peptides (X = V, L, or G). (b) LC–MS analysis shows >90% product formation within short times for GVisoK and GLisoK, while transpeptidation with GGisoK takes 18 h to reach approx. 85% yield (100 μM Ub-NGL, 1 mM GXisoK, 2.5 μM OaAEP1, pH 7.0, 25 °C, calculated masses: Ub-NGL = 8579 Da, Ub-NGVisoK = 8693 Da, Ub-NGLisoK = 8707 Da, and Ub-NGGisoK = 8651 Da). (c) Time-resolved LC–MS analysis of purified Ub-NGXisoK ligation products (20 μM) incubated with OaAEP1 (0.5 μM, pH 6.8, 30 °C) indicates superior stability of Ub-NGGisoK toward OaAEP1-mediated hydrolysis (calculated masses: Ub-N = 8408 Da). (d) Schematic representation of OaAEP1-mediated labeling of Ub-K48GGisoK with NGL(H) peptides bearing different biophysical probes [dtb-NGL(H), alkyne-NGLH, and SuRho-NGLH (SuRho = sulforhodamine), blue box]. (e) Optimization of OaAEP1-mediated transpeptidation between Ub-K48GGisoK and dtb-NGL(H) as followed by time-resolved LC–MS. Although >90% labeling of Ub-K48GGisoK is achieved within 30 min using 1 mM dtb-NGLH, 1 mM NiSO4, and 10 μM OaAEP1, Ub-wt is recalcitrant toward labeling under similar conditions (calculated masses: Ub-K48GGisoK-H6 = 9502 Da, Ub-K48(dtb-N)GGisoK-H6 = 9812, and Ub-wt = 8565 Da, o.n. = overnight). (f) LC–MS analysis of OaAEP1-mediated transpeptidation of Ub-K48GGisoK with alkyne-NGLH and SuRho-NGLH peptides. Transpeptidation with alkyne-NGLH is quantitative within 30 min and with SuRho-NGLH yields >90% product formation overnight. (50 μM Ub-K48GGisoK, 1 mM alkyne/SuRho-NGLH, 1 mM NiCl2 or NiSO4, 5 or 10 μM OaAEP1, pH 7.4, 30 °C; calculated masses: Ub-K48(alkyne-N)GGisoK-H6 = 9753 Da and Ub-K48(SuRho-N)GGisoK-H6 = 10302 Da). (g) SDS–PAGE (left) and LC–MS analysis (right) of OaAEP1-driven labeling of Ub-K48GGisoK with Fmoc-(G)6-NGLH over time. The blue arrow indicates the labeled product displaying a gel shift, while a black arrow indicates the starting material. [50 μM Ub-K48GGisoK, 1 mM Fmoc-(G)6-NGLH, 1 mM NiCl2, 2 μM OaAEP1, pH 6.8, 30 °C; calculated mass: Ub-K48(Fmoc-(G)6-N)GGisoK-H6 = 10181 Da]. For detailed conditions, see Figures S1–S8. We reckoned that the sluggish reaction progress may stem from the fact that—contrary to traditional labeling approaches—in our approach, the nucleophile-bearing POI (Ub-K48GGisoK) is used as the substoichiometric component with the recognition motif-containing NGL probe in 10-fold molar excess. As the GL leaving group is released from the recognition motif over the course of the reaction, this byproduct competes with the desired GGisoK nucleophile, stagnating in a product-limiting equilibrium. Inspired by previous work,[30,31] we extended the NGL recognition motif in dtb-NGL by a C-terminal histidine (dtb-NGLH). Thereby, the released GLH can be quenched by Ni2+ complexation, sequestering the competing nucleophile acceptor (Figure S3). Using dtb-NGLH in the presence of 500 μM NiSO4, >80% of labeled protein was observed within 1 h at 30 °C. Increasing the OaAEP1 concentration to 10 μM and/or using 20-fold excess of the NGLH peptide in combination with 1 mM Ni2+ led to ∼95% product formation in less than an hour with 50 μM GGisoK-bearing protein (Figures e and S3–S5). Importantly, for wild-type Ub (Ub-wt), lacking the GGisoK nucleophile, we could not observe any ligation product formation (Figures e, S3 and S4). To show that labeling progress can indeed be accurately followed by quantifying MS-peak intensities, we purified dtb-labeled Ub to homogeneity via Strep-Tag affinity chromatography and determined MS-peak intensities of various protein mixtures, confirming a linear correlation between protein concentration and MS-peak intensity (Figure S6). Having established near-quantitative labeling conditions, we next explored the chemical diversity tolerated on NGLH probes for varied protein labeling. Both alkyne- and fluorophore-NGLH probes allowed efficient labeling of GGisoK-bearing POIs. Furthermore, we synthesized an NGLH-bearing decapeptide that allowed us to follow protein modification progress via migration differences of unlabeled and labeled protein on SDS-PAGE (Figures f,g and S6–S8). As the NGG motif that is installed upon transpeptidation is essentially refractory toward OaAEP1-mediated hydrolysis, we next explored site-specific sequential protein dual labeling at internal sites and at the N-terminus using differently functionalized NGLH-probes. Therefore, we expressed a Ub-variant bearing GGisoK at position K48 and displaying a TEV recognition site at its N-terminus, followed by a GV motif (Figure a). In the first step, we labeled the internal GGisoK with dtb-NGLH using OaAEP1. Next, we exposed the N-terminal GV acceptor nucleophile by TEV protease cleavage to make it amenable to transpeptidation with OaAEP1 using an alkyne-NGLH probe. Near-quantitative labeling was ascertained by LC–MS for each step (Figures b and S9). Additionally, we showed that the alkyne moiety could be efficiently used in further Cu(I)-catalyzed alkyne–azide cycloaddition for functionalization with commercially available azide-bearing fluorophores (Figure S9).
Figure 3

Site-specific sequential dual labeling using OaAEP1. (a) Schematic representation of OaAEP1-mediated dual labeling of TEV-GV-Ub-K48GGisoK (I). In the first step, the GGisoK moiety is labeled with dtb-NGLH (II) (50 μM (I), 1 mM dtb-NGLH, 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 30 °C, 100 min). Subsequent TEV cleavage reveals the N-terminal GV moiety which is modified with alkyne-NGLH (III) (50 μM (II), 1 mM alkyne-NGLH, 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 30 °C, 150 min). (b) LC–MS analysis of different labeling steps as depicted in (a). (Calculated masses: TEV-GV-Ub-K48GGisoK-H6 = 10952 Da, TEV-GV-Ub-K48(dtb-N)GGisoK-H6 = 11262 Da, and alkyne-NGV-Ub-K48(dtb-N)GGisoK-H6 = 10621 Da). For detailed conditions, see Figure S9.

Site-specific sequential dual labeling using OaAEP1. (a) Schematic representation of OaAEP1-mediated dual labeling of TEV-GV-Ub-K48GGisoK (I). In the first step, the GGisoK moiety is labeled with dtb-NGLH (II) (50 μM (I), 1 mM dtb-NGLH, 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 30 °C, 100 min). Subsequent TEV cleavage reveals the N-terminal GV moiety which is modified with alkyne-NGLH (III) (50 μM (II), 1 mM alkyne-NGLH, 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 30 °C, 150 min). (b) LC–MS analysis of different labeling steps as depicted in (a). (Calculated masses: TEV-GV-Ub-K48GGisoK-H6 = 10952 Da, TEV-GV-Ub-K48(dtb-N)GGisoK-H6 = 11262 Da, and alkyne-NGV-Ub-K48(dtb-N)GGisoK-H6 = 10621 Da). For detailed conditions, see Figure S9. To demonstrate that GGisoK-directed transpeptidation is suited for the preparative generation of defined POI-small-molecule conjugates, we aimed at site-specifically labeling the anti-eGFP nanobody (eGFP-nb) and test its specific binding to eGFP expressed in mammalian cells.[32] Guided by the eGFP-nb:eGFP crystal structure (Figure a, PDB: 3OGO),[33] we selected various surface-exposed positions within eGFP-nb to be replaced by GGisoK. For this, we expressed and purified eGFP-nb variants bearing GGisoK at individual positions (Q12, R18, R75, K86, K116, and at position 120 within an artificially prolonged C-terminus) in yields from 1 to 8 mg/L culture (Figures b and S10). Having confirmed via in vitro pull-down assays that all GGisoK-bearing eGFP-nb variants retained their binding capability toward eGFP (Figure S10), we proceeded with OaAEP1-mediated labeling using a sulforhodamine-NGLH probe (SuRho-NGLH, Figures f, S8 and S11). While we observed specific labeling for all six GGisoK-bearing nanobody variants as judged by in-gel fluorescence, the labeling efficiencies varied greatly among the six variants, presumably due to disparate steric access of OaAEP1 to the GGisoK acceptor nucleophile. eGFP-nb-R75GGisoK showed >85% labeling within 3 h, as assessed by LC–MS, and was purified at preparative scale for further experiments (Figures c,d and S11).
Figure 4

OaAEP1-mediated functionalization of the eGFP nanobody (eGFP-nb). (a) Crystal structure of the eGFP-nb bound to eGFP (PDB: 3OGO). Individual positions chosen for the site-specific introduction of GGisoK are marked in red. (b) SDS–PAGE analysis of eGFP-nb expressions (−/+ I = without/with induction). (c) Coomassie staining and in-gel fluorescence of OaAEP1-mediated site-specific labeling of eGFP-nb-R75GGisoK with SuRho-NGLH shows specific labeling for GGisoK bearing eGFP-nb (30 μM eGFP-nb-R75GGisoK, 0.6 mM SuRho-NGLH, 5 μM OaAEP1, 1 mM NiSO4, pH 7.4, 30 °C). (d) LC–MS analysis of purified eGFP-nb-R75(SuRho-N)GGisoK shows >85% labeling yield (calculated masses: eGFP-nb-R75GGisoK-H6 = 14,009 Da and eGFP-nb-R75(SuRho-N)GGisoK-H6 = 14809 Da). (e) Fluorescence microscopy of fixed HEK293T cells overexpressing eGFP-LifeAct treated with eGFP-nb-R75(SuRho-N)GGisoK. (f) Live-cell microscopy of HEK293T cells overexpressing eGFP-PDGFR-TM treated with eGFP-nb-R75(SuRho-N)GGisoK. Scale bars correspond to 20 μm. For detailed conditions and controls, see Figures S10 and S13.

OaAEP1-mediated functionalization of the eGFP nanobody (eGFP-nb). (a) Crystal structure of the eGFP-nb bound to eGFP (PDB: 3OGO). Individual positions chosen for the site-specific introduction of GGisoK are marked in red. (b) SDS–PAGE analysis of eGFP-nb expressions (−/+ I = without/with induction). (c) Coomassie staining and in-gel fluorescence of OaAEP1-mediated site-specific labeling of eGFP-nb-R75GGisoK with SuRho-NGLH shows specific labeling for GGisoK bearing eGFP-nb (30 μM eGFP-nb-R75GGisoK, 0.6 mM SuRho-NGLH, 5 μM OaAEP1, 1 mM NiSO4, pH 7.4, 30 °C). (d) LC–MS analysis of purified eGFP-nb-R75(SuRho-N)GGisoK shows >85% labeling yield (calculated masses: eGFP-nb-R75GGisoK-H6 = 14,009 Da and eGFP-nb-R75(SuRho-N)GGisoK-H6 = 14809 Da). (e) Fluorescence microscopy of fixed HEK293T cells overexpressing eGFP-LifeAct treated with eGFP-nb-R75(SuRho-N)GGisoK. (f) Live-cell microscopy of HEK293T cells overexpressing eGFP-PDGFR-TM treated with eGFP-nb-R75(SuRho-N)GGisoK. Scale bars correspond to 20 μm. For detailed conditions and controls, see Figures S10 and S13. We benchmarked this site-specifically SuRho-labeled eGFP-nb for imaging of eGFP-expressing proteins both in fixed and live mammalian cells. For this, we transfected HEK293T cells with eGFP-LifeAct, a fusion between eGFP and a 17-amino-acid-long peptide that binds specifically to the actin cytoskeleton.[34] Cells were fixed and incubated with SuRho-labeled eGFP-nb. Efficient labeling of the cytoskeleton and colocalized green and red fluorescence was confirmed by fluorescence microscopy (Figure e). Importantly, eGFP-LifeAct-expressing cells treated with the SuRho-NGLH probe alone or with unlabeled eGFP-nb did not show any nonspecific labeling (Figure S12). To show membrane labeling on live cells, we transfected HEK293T cells with a pDisplay construct where eGFP is fused to the transmembrane domain of the platelet-derived growth factor receptor (eGFP-PDGFR-TM) to display eGFP on the cell surface.[35] Incubation of eGFP-PDGFR-TM-expressing cells with SuRho-labeled eGFP-nb led to specific labeling of cells expressing eGFP at their plasma membrane (Figure f), while cells treated with SuRho or eGFP-nb alone did not show any nonspecific labeling (Figure S13). Given that OaAEP1 transpeptidation using substrates with NGL as a recognition motif and GGisoK-bearing proteins as acceptor nucleophiles leads to ligation products with an NGG sequence, we hypothesized that we could use this approach to build Ub-POI conjugates bearing a native isopeptide bond and harboring only one point mutation in the Ub C-terminus. Ub is a small, globular, and highly conserved 76 amino acid protein.[25,26] Its C-terminus is unstructured, and the last six amino acids have the sequence71-LRLRGG. During ubiquitylation, the carboxylate of G76 is attached to specific lysine residues in a POI (or Ub itself) through a complex enzymatic machinery, forming an isopeptide bond. We have recently developed a chemoenzymatic approach to build ubiquitylated proteins using the transpeptidase sortase (sortylation).[28,29] For this, we mutated the Ub C-terminus to contain a sortase (Srt2A)[36] recognition motif [71-LALTGG, dubbed Ub(AT)]. Srt2A-mediated transpeptidation between Ub(AT) and a GGisoK-bearing POI leads to Ub-POI conjugates displaying a native isopeptide bond and two point mutations in the linker region (R72A and R74T). We were able to show that sortase-generated diUbs largely maintain structural and functional integrity by retaining their binding affinities to many Ub-binding domains, a requirement for decoding diverse cellular functions. Nevertheless, the two point mutations make sortase-generated diUbs resistant toward DUBs, indicating that their recognition by DUBs might be impaired. In order to build more native diUbs, we expressed Ub bearing an RLNGLH sequence at its C-terminus [72-RLNGLH, dubbed Ub(N), Figure a]. Incubation of Ub-K48GGisoK (50 μM) with a fivefold excess of Ub(N) in the presence of 0.04 equiv of OaAEP1 afforded K48-linked diUb within 15 min, bearing a native isopeptide bond and one point mutation R74N [K48-diUb(N), Figures b and S14]. K48-diUb(N) could be produced in mg yields, and its identity was confirmed by LC–MS. We next set out to test if the isopeptide bond in K48-diUb(N) would be cleaved by DUBs. We therefore incubated purified K48-diUb-wt, Srt2A-generated diUb [K48-diUb(AT)], and OaAEP1-generated diUb [K48-diUb(N)] with the catalytic domain of Ub carboxyl terminal hydrolase 2 (USP2CD). Although the Srt2A-generated K48-diUb(AT) is completely resistant to USP2CD cleavage, K48-diUb(N) is as efficiently cleaved to the corresponding monoUbs as observed for K48-diUb bearing the wt sequence in the linker region (Figures c and S14).
Figure 5

Site-specific monoubiquitylation and monoSUMOylation using OaAEP1. (a) Schematic representation of OaAEP1-mediated site-specific monoubiquitylation/SUMOylation. Transpeptidation between Ub displaying an NGLH motif at its C-terminus [Ub(N)] and a GGisoK-bearing target protein results in the formation of an isopeptide-linked Ub-POI conjugate bearing only a single point mutation in the flexible linker region (R74N). In a similar fashion, SUMO2(N), bearing a T91N mutation, can be attached to a substrate protein displaying a GGisoK moiety using OaAEP1. (b) SDS–PAGE analysis showing OaAEP1-mediated K48-diUb formation by incubation of Ub-K48GGisoK, Ub(N), and OaAEP1 [left, 50 μM Ub-K48GGisoK, 250 μM Ub(N), 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 37 °C]. LC–MS analysis of purified K48-diUb(N) [right, calculated mass: K48-diUb(N) = 17892 Da]. (c) SDS–PAGE analysis of deubiquitylation assays with USP2CD. Natively linked K48-diUb (K48-diUb-wt) and OaAEP1-generated K48-diUb(N) are hydrolyzed within 30 min, while Srt2A-generated K48-diUb(AT) is recalcitrant toward DUB hydrolysis (10 μM K48-diUbs and 200 nM USP2CD). *denotes cleavage of the C-terminal H6-tag of the acceptor Ub of K48-diUb(AT). (d) SDS–PAGE analysis showing OaAEP1-mediated ubiquitylation of SUMO2-K11GGisoK-H6 by incubation of SUMO2-K11GGisoK, Ub(N), and OaAEP1 (50 μM SUMO2-K11GGisoK, 250 μM Ub(N), 2 μM OaAEP1, pH 6.8, 37 °C). (e) SDS–PAGE analysis showing OaAEP1-mediated ubiquitylation of histone H3 by incubation of H3-KxxGGisoK, Ub(N), and OaAEP1 (20 μM H3-KxxGGisoK-H6, 150 μM Ub(N), 1 μM OaAEP1, pH 7.0, 25 °C). (f) SDS–PAGE analysis of OaAEP1-mediated SUMOylation of Ub-K48GGisoK (150 μM Ub-K48GGisoK, 150 μM SUMO2(N), 2 μM OaAEP1, pH 7.4, 25 °C) *denotes hydrolysis of the C-terminal OaAEP1 motif leading to cleavage of the H6-tag of SUMO2(N)-H6. For detailed conditions, see Figures S14 and S15.

Site-specific monoubiquitylation and monoSUMOylation using OaAEP1. (a) Schematic representation of OaAEP1-mediated site-specific monoubiquitylation/SUMOylation. Transpeptidation between Ub displaying an NGLH motif at its C-terminus [Ub(N)] and a GGisoK-bearing target protein results in the formation of an isopeptide-linked Ub-POI conjugate bearing only a single point mutation in the flexible linker region (R74N). In a similar fashion, SUMO2(N), bearing a T91N mutation, can be attached to a substrate protein displaying a GGisoK moiety using OaAEP1. (b) SDS–PAGE analysis showing OaAEP1-mediated K48-diUb formation by incubation of Ub-K48GGisoK, Ub(N), and OaAEP1 [left, 50 μM Ub-K48GGisoK, 250 μM Ub(N), 2 μM OaAEP1, 500 μM NiCl2, pH 6.8, 37 °C]. LC–MS analysis of purified K48-diUb(N) [right, calculated mass: K48-diUb(N) = 17892 Da]. (c) SDS–PAGE analysis of deubiquitylation assays with USP2CD. Natively linked K48-diUb (K48-diUb-wt) and OaAEP1-generated K48-diUb(N) are hydrolyzed within 30 min, while Srt2A-generated K48-diUb(AT) is recalcitrant toward DUB hydrolysis (10 μM K48-diUbs and 200 nM USP2CD). *denotes cleavage of the C-terminal H6-tag of the acceptor Ub of K48-diUb(AT). (d) SDS–PAGE analysis showing OaAEP1-mediated ubiquitylation of SUMO2-K11GGisoK-H6 by incubation of SUMO2-K11GGisoK, Ub(N), and OaAEP1 (50 μM SUMO2-K11GGisoK, 250 μM Ub(N), 2 μM OaAEP1, pH 6.8, 37 °C). (e) SDS–PAGE analysis showing OaAEP1-mediated ubiquitylation of histone H3 by incubation of H3-KxxGGisoK, Ub(N), and OaAEP1 (20 μM H3-KxxGGisoK-H6, 150 μM Ub(N), 1 μM OaAEP1, pH 7.0, 25 °C). (f) SDS–PAGE analysis of OaAEP1-mediated SUMOylation of Ub-K48GGisoK (150 μM Ub-K48GGisoK, 150 μM SUMO2(N), 2 μM OaAEP1, pH 7.4, 25 °C) *denotes hydrolysis of the C-terminal OaAEP1 motif leading to cleavage of the H6-tag of SUMO2(N)-H6. For detailed conditions, see Figures S14 and S15. To prove the generality of OaAEP1-mediated ubiquitylation, we showed that other natively ubiquitylated substrate proteins can also be accessed by OaAEP1 and Ub(N). We prepared SUMO2 displaying GGisoK at position K11, as K11 represents a well-known and the most abundant Ub linkage site on SUMO2.[37] Incubation of SUMO2-K11GGisoK with Ub(N) in the presence of OaAEP1 led to specific formation of the SUMO2-Ub conjugate (Figure d), while SUMO2 bearing tert butyloxycarbonyl-l-lysine (BocK) at position K11 did not generate any SUMO2-Ub conjugate (Figure S14). Among the most abundant monoubiquitylated proteins are histones H2A and H2B with ubiquitylation fulfilling critical roles in regulating transcription and other cellular processes.[38−40] Recent reports have shown that also histones H3 and H4 are ubiquitylated,[41] for example, various large-scale quantitative proteomics studies identified several ubiquitylation sites within H3,[42] but the functions of these modifications are less understood. To demonstrate that OaAEP1-mediated ubiquitylation is suited to generate defined Ub-H3 conjugates, we prepared H3-constructs bearing amber codons at nine lysine positions and incorporated AzGGK in response to these introduced amber codons (Figure S15). After reduction of the azide group in AzGGK, we purified H3-K23GGisoK, H3-K27GGisoK, and H3-K79GGisoK via affinity chromatography and cation exchange followed by refolding. All three H3-GGisoK variants were incubated with Ub(N) and OaAEP1, and specific formation of Ub-H3 conjugates was observed within 5 min. Ubiquitylation yields ranged between 31 und 35% (Figures e and S15). We envisioned that OaAEP1-mediated generation of Ub conjugates may also be extendable to other Ubls that share Ub’s common ß-grasp fold with a flexible six-residue C-terminal tail and the characteristic GG motif that is attached to a lysine residue in the target protein. We introduced the NGL recognition motif into the C-terminus of the small-Ub-like-modifier SUMO2[43] by introducing the point mutation T91N, generating SUMO2(N) with the C-terminal sequence 89-QQNGLH6 (Figure f). Gratifyingly, incubation of SUMO2(N) and Ub-K48GGisoK in the presence of OaAEP1 led to the formation of the expected SUMO2-Ub conjugate bearing a T91N mutation in its linker region, confirming that OaAEP1-mediated transpeptidation is transferable to other Ubls (Figures d and S14).

Conclusions

We have shown that the recently engineered and chemically improved AEP OaAEP1 allows site-specific internal labeling of GGisoK-bearing proteins with a variety of small molecules and biophysical probes. OaAEP1-mediated labeling can be conducted at physiological pH in mild and aqueous buffers and leads to near-quantitative and essentially irreversible formation of the ligation product in the presence of Ni2+ salts. OaAEP1-mediated labeling compares favorably to other chemoenzymatic labeling approaches in terms of size and nature of recognition tag and footprint left in the ligation product. It requires an NGL recognition motif in the labeling reagent and the site-specific incorporation of GGisoK into a target protein, leading to a flexible and minimal tripeptidic NGG motif in the linker between the label and POI in the conjugation product. This is in contrast to other enzymatic labeling approaches such as transferases (e.g., transglutaminase,[44] SUMO-conjugating enzyme Ubc9,[16] phosphopanthetheinyl transferase,[13] and phosphocholine transferase[45]) and ligases (e.g., biotin ligase[14] and lipoic acid ligase[15]) that typically leave a mark of at least 6–15 amino acids in their conjugation products and often result in bulky and artificial linkages. Importantly, OaAEP1 only requires stable peptidic probes that can be easily synthesized and/or are often commercially available and results in entirely peptidic linkers, which may be especially advantageous for generating antibody-drug-like conjugates with reduced immunogenicity for in vivo administration. As the resulting NGG motif in the ligation product is essentially resistant toward further OaAEP1 hydrolysis, our approach is also suitable for sequential dual labeling at both an internal site and the N-terminus with diverse probes. Additionally, only low concentrations of OaAEP1 (0.01–0.04 mol equivalents) are required for efficient transpeptidation, as opposed to, for example, sortase-mediated labeling[28] and Ubc9-mediated labeling,[16] which mostly rely on >0.2 mol equivalents of their corresponding enzymes for high conversion rates. OaAEP1-mediated transpeptidation in combination with site-specific incorporation of GGisoK via genetic code expansion therefore represents an operationally simple process for labeling of recombinant proteins at internal sites, leaving a minimal, entirely peptidic scar (NGG) in the ligation product. It is however worth noting that due to the small recognition motif NGL, unwanted cleavage within the target protein may be a side reaction that can occur during OaAEP1 transpeptidation, especially if the POI displays asparagine residues followed by small and hydrophobic amino acids in highly exposed loop regions. Importantly, OaAEP1-mediated labeling can also be harnessed for conjugation of folded proteins to generate novel architectures and protein–protein conjugates in a programmable manner. On our quest to generate and study distinct Ub/Ubl topologies, we were able to show that OaAEP1-mediated transpeptidation allows site-specific monoubiquitylation and monoSUMOylation of target proteins. We have shown the ubiquitylation of diverse target proteins including the site-specific covalent attachment of Ub to SUMO2 and histones using OaAEP1. OaAEP1-generated Ub-POI conjugates display a native isopeptide bond connecting Ub and the POI with only one point mutation in the Ub/Ubl C-terminus. Compared to our previously developed sortylation approach,[28,29]OaAEP1-mediated site-specific conjugation of Ub/Ubl to target proteins requires less enzyme and shows faster conversion to product. For sortase-mediated ubiquitylation, a leucine spacer amino acid is often introduced preceding the sortase recognition motif, as this increases accessibility of the sortase to the Ub C-terminus and therefore sortylation yields.[28] Interestingly, introduction of such a spacer amino acid was not needed for OaAEP1-mediated ubiquitylation and SUMOylation. Excitingly, this leads to Ub/Ubl-POI conjugates that contain only one single point mutation in the linker region between Ub/Ubl and the POI. In fact—in contrast to sortase-generated diUbs—OaAEP1-generated diUbs are cleaved by DUBs to a similar extent as diUbs bearing a wt-linker, confirming their functional and structural resemblance to endogenous Ub/Ubl-conjugates. In this sense, OaAEP1-mediated ubiquitylation complements the approach recently reported by Bode and co-workers that requires the introduction of up to three point mutations into the POI to make it a substrate for Ubc9-mediated ubiquitylation.[16,46] Ultimately, we envision that OaAEP1-mediated generation of Ub and Ubl conjugates may be combined with sortylation to further extend our recently developed approach Ubl-tools (Ubl-topologies via orthogonal sortylation)[29] as a modular and robust tool for accessing defined Ub/Ubl chains where we can place DUB-resistant and DUB-susceptible linkages at defined positions.
  44 in total

1.  Role of histone H2A ubiquitination in Polycomb silencing.

Authors:  Hengbin Wang; Liangjun Wang; Hediye Erdjument-Bromage; Miguel Vidal; Paul Tempst; Richard S Jones; Yi Zhang
Journal:  Nature       Date:  2004-09-22       Impact factor: 49.962

Review 2.  Sumoylation: a regulatory protein modification in health and disease.

Authors:  Annette Flotho; Frauke Melchior
Journal:  Annu Rev Biochem       Date:  2013       Impact factor: 23.643

3.  Yeast display evolution of a kinetically efficient 13-amino acid substrate for lipoic acid ligase.

Authors:  Sujiet Puthenveetil; Daniel S Liu; Katharine A White; Samuel Thompson; Alice Y Ting
Journal:  J Am Chem Soc       Date:  2009-11-18       Impact factor: 15.419

Review 4.  Recent developments in bioorthogonal chemistry and the orthogonality within.

Authors:  Mike L W J Smeenk; Jordi Agramunt; Kimberly M Bonger
Journal:  Curr Opin Chem Biol       Date:  2020-11-03       Impact factor: 8.822

5.  Covalent Protein Labeling by Enzymatic Phosphocholination.

Authors:  Katharina Heller; Philipp Ochtrop; Michael F Albers; Florian B Zauner; Aymelt Itzen; Christian Hedberg
Journal:  Angew Chem Int Ed Engl       Date:  2015-07-03       Impact factor: 15.336

6.  Site-specific labeling of cell surface proteins with biophysical probes using biotin ligase.

Authors:  Irwin Chen; Mark Howarth; Weiying Lin; Alice Y Ting
Journal:  Nat Methods       Date:  2005-01-21       Impact factor: 28.547

7.  Transglutaminase-catalyzed site-specific conjugation of small-molecule probes to proteins in vitro and on the surface of living cells.

Authors:  Chi-Wang Lin; Alice Y Ting
Journal:  J Am Chem Soc       Date:  2006-04-12       Impact factor: 15.419

Review 8.  The ubiquitin code.

Authors:  David Komander; Michael Rape
Journal:  Annu Rev Biochem       Date:  2012-04-10       Impact factor: 23.643

Review 9.  Mechanisms of Deubiquitinase Specificity and Regulation.

Authors:  Tycho E T Mevissen; David Komander
Journal:  Annu Rev Biochem       Date:  2017-05-12       Impact factor: 23.643

Review 10.  Site-specific antibody-drug conjugates: the nexus of bioorthogonal chemistry, protein engineering, and drug development.

Authors:  Paresh Agarwal; Carolyn R Bertozzi
Journal:  Bioconjug Chem       Date:  2015-01-30       Impact factor: 4.774

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.