Literature DB >> 35658467

Platform for Orthogonal N-Cysteine-Specific Protein Modification Enabled by Cyclopropenone Reagents.

Alena Istrate¹, Michael B Geeson¹, Claudio D Navo², Barbara B Sousa³, Marta C Marques³, Ross J Taylor¹, Toby Journeaux¹, Sebastian R Oehler⁴, Michael R Mortensen⁴, Michael J Deery⁵, Andrew D Bond¹, Francisco Corzana⁶, Gonzalo Jiménez-Osés^2,7, Gonçalo J L Bernardes^1,3.

Abstract

Protein conjugates are valuable tools for studying biological processes or producing therapeutics, such as antibody-drug conjugates. Despite the development of several protein conjugation strategies in recent years, the ability to modify one specific amino acid residue on a protein in the presence of other reactive side chains remains a challenge. We show that monosubstituted cyclopropenone (CPO) reagents react selectively with the 1,2-aminothiol groups of N-terminal cysteine residues to give a stable 1,4-thiazepan-5-one linkage under mild, biocompatible conditions. The CPO-based reagents, all accessible from a common activated ester CPO-pentafluorophenol (CPO-PFP), allow selective modification of N-terminal cysteine-containing peptides and proteins even in the presence of internal, solvent-exposed cysteine residues. This approach enabled the preparation of a dual protein conjugate of 2×cys-GFP, containing both internal and N-terminal cysteine residues, by first modifying the N-terminal residue with a CPO-based reagent followed by modification of the internal cysteine with a traditional cysteine-modifying reagent. CPO-based reagents enabled a copper-free click reaction between two proteins, producing a dimer of a de novo protein mimic of IL2 that binds to the β-IL2 receptor with low nanomolar affinity. Importantly, the reagents are compatible with the common reducing agent dithiothreitol (DTT), a useful property for working with proteins prone to dimerization. Finally, quantum mechanical calculations uncover the origin of selectivity for CPO-based reagents for N-terminal cysteine residues. The ability to distinguish and specifically target N-terminal cysteine residues on proteins facilitates the construction of elaborate multilabeled bioconjugates with minimal protein engineering.

Entities: Chemical

Mesh：

Substances：

Year: 2022 PMID： 35658467 PMCID： PMC9490850 DOI： 10.1021/jacs.2c02185

Source DB: PubMed Journal: J Am Chem Soc ISSN： 0002-7863 Impact factor: 16.383

Introduction

Protein conjugates are important tools for creating valuable therapeutics, such as antibody–drug conjugates (ADCs)[1,2] and PEGylated proteins,[3] building new functionalized materials,[4] and studying biological processes.[5,6] Among the various strategies used for protein conjugation, modification of naturally occurring amino acids remains the method of choice because it offers the advantage of straightforward accessibility without the need for sequence alterations by means of genetic methods.[7] Ideally, conjugation reactions should proceed with complete chemo- and site-selectivity to generate well-defined protein constructs, which is a crucial requirement for several applications such as ADCs.[1,2] Similarly, such transformations should occur rapidly in mild aqueous solutions at room temperature and physiological pH. Although many protein conjugation strategies have been developed over recent years, the ability to modify one specific amino acid on a protein in the presence of other side chains with similar reactivity remains a challenge.[7] Protection of particularly reactive amino acids such as cysteine[8] or extensive sequence engineering with the introduction of specific tags for enhanced reactivity[9] is often required in order to achieve selectivity for the intended residue. Lysine[10−12] and cysteine[9,13−15] are the most commonly targeted proteinogenic amino acids for bioconjugation because they are nucleophilic under physiological conditions. Native lysine residues are very convenient targets, but they are abundant on protein surfaces, so it is difficult to achieve a high degree of selectivity for a given residue.[10] Conversely, cysteine residues are less abundant in proteins (<2%) and commonly less solvent-exposed, which makes them an excellent target for site-selective conjugation.[16] However, when cysteine residues are relied on for protein modification, there are several factors that must be taken into account. Specifically, cysteine residues often form disulfide bonds that are critical for a protein’s structure, and modification of such residues can lead to a loss of protein function. Moreover, many surface-exposed endogenous cysteine residues are directly involved in the catalytic activity of the protein and thus cannot be exploited for modification. Therefore, methods that can distinguish one cysteine residue from another within one protein could enable the construction of functional and well-defined biomolecule conjugates without the need for extensive genetic engineering or the incorporation of unnatural amino acids. The most reliable strategy to differentiate one cysteine in the presence of other thiol groups is to target an N-terminal cysteine residue (NCys). Several methods for selective NCys modification have been developed, including reaction with thioesters via native chemical ligation (NCL) or condensation with aromatic aldehydes or 2-cyanobenzothiazole derivatives (Figure ).[17] NCL enables linking of protein or peptide fragments in a robust and chemoselective manner through transthioesterification and S-to-N acyl exchange (Figure ).[18,19] However, this method is rarely used to produce protein conjugates because of difficulties in preparation and lack of stability of suitable thioester reagents. The reaction of 1,2-aminothiols with aldehydes to form thiazolidine has also been explored as a strategy for NCys modification (Figure ).[20] Neri and co-workers have successfully applied this approach for site-specific coupling of cytotoxic aldehydes to tumor-targeting antibodies, which produced homogeneous conjugates that were then used for targeted delivery and slow release of the cytotoxic component.[21] However, this reaction requires long incubation times (>48 h), occurs under acidic conditions (pH 4–5), and is typically performed with a large excess of the aldehyde derivative. These limitations can be addressed by using formyl benzeno boronic acids (FBBA) that stabilize thiazolidine formation through N→B coordination (Figure ). Recently, FBBA reagents have been used to rapidly modify several model NCys-containing peptides at neutral pH.[22,23] This reaction is reversible, and the product is unstable in an acidic environment (pH < 6) or in the presence of nucleophiles (e.g., free cysteine or benzyl hydroxylamine). It is however possible to use a thiazolidino boronate intermediate which undergoes an intramolecular acyl transfer to give more stable N-acyl thiazolidines.[24]

Figure 1

Overview of recent methods for targeting N-terminal cysteine residues for bioconjugation and the site-specific dual protein conjugation described here. NCL, native chemical ligation; 2-CBT, 2-cyanobenzothiazol; FBBA, formyl benzeno boronic acid. Another N-terminal Cys-labeling reaction was inspired by the final step of the chemical synthesis of D-luciferin[25] and is based on the condensation of free cysteine with 2-cyanobenzothiazol (CBT) (Figure ). After Rao and co-workers first demonstrated the potential of this reaction for NCys modification,[26] the approach has been widely used in site-specific protein labeling and molecular imaging. This method has major advantages for bioconjugation because of its efficiency, biocompatibility, and the stability of the resulting luciferin linkage.[27] However, 2-cyanobenzothiazol derivatives also react quickly (although reversibly) with simple thiols. As a result, when using excess CBT to ensure complete conjugation, other reduced protein thiols must be protected. Therefore, alternative bioconjugation reagents for fast and selective labeling of N-terminal cysteine residues are still required to enable the construction of complex protein conjugates of well-defined structures. In this work, the cyclopropenone (CPO) functional group was developed for site-specific modification of N-terminal cysteine residues on peptides and proteins. The three-membered rings of cyclopropenones feature significant ring-strain, though the aromatic character of the ring[28] renders it remarkably stable. The strain and large dipole moment allow this functional group to participate in cycloaddition and ring-opening reactions,[29,30] and as α,β-unsaturated ketones, cyclopropenones also act as electrophiles in 1,2- and 1,4-nucleophilic addition reactions. Several natural products contain cyclopropenone functional groups, highlighting their biocompatibility and stability under physiological conditions.[31,32] Cyclopropenone-containing molecules have been featured in some previous biological applications, for example, in the design of selective protease inhibitors[33] or as components for bioorthogonal reactions on proteins.[34−36] The latter work, pioneered by Prescher and co-workers, focused on the reaction of CPO-containing molecules (installed on lysine residues with NHS-ester chemistry) with phosphines as a strategy for bioorthogonal reactions and real-time cell imaging.[37] Finally, reactions of N-terminal cysteine residues in peptides with hybrid aminosulfhydryl-stapling reagents lead to products containing 1,4-thiazepan-5-one rings; due to the NHS-ester functional group present in those reagents, internal stapling of cysteine and lysine residues was targeted as a conjugation strategy.[38] Here, we report the efficient and selective reaction of monosubstituted cyclopropenone-containing reagents with the 1,2-aminothiol groups of N-terminal cysteine residues (Figure ). Importantly, the selectivity exhibited by CPO-based reagents toward N-terminal cysteine residues enables sequential and site-specific dual-modification of proteins. Such selectivity is also demonstrated to be operational in complex mixtures of proteins that contain internal or N-terminal cysteine residues and in the presence or absence of DTT.

Results and Discussion

Monosubstituted Cyclopropenones Are Efficient Reagents for N-Terminal Cysteine Labeling

Our investigations began with the synthesis of a model cyclopropenone in order to assess the stability of this functional group in aqueous buffers. Along the lines of a literature procedure,[39] 2-phenylethylcyclopropenone (1) was prepared from commercially available 4-phenyl-1-butyne by (i) treatment with TMSCF3, a formal source of difluorocarbene, to afford the corresponding 3,3-difluorocyclopropene derivative[40] followed by (ii) hydrolysis on wet silica gel.[35,39] Cyclopropenone 1 showed excellent stability after treatment with phosphate buffers (50 mM, pH 7–8) at 37 °C for 7 days (Figure S7). Next, the reaction of cyclopropenone 1 with l-cysteine ethyl ester was tested in the presence of base (Na2CO3) at 4 °C. This low temperature was found to be necessary with small, highly accessible nucleophiles such as amino acids and peptides in order to avoid side reactions and monitor the kinetics of the reaction, although it was not necessary for proteins. The reaction resulted in four isomeric compounds 2a–d in 87% yield after 30 min in aqueous solution. Analysis by liquid chromatography (LC)–mass spectrometry (MS) and NMR spectroscopy showed that the products are two pairs of diastereomeric regioisomers of a 1,4-thiazepan-5-one derivative, a stable seven-membered ring (Figure ) that results from a ring-expansion of the cyclopropenone group. Isomer 2a was isolated and characterized by X-ray crystallography, revealing its stereochemistry and confirming its structure (Figure ). To study the reaction further and confirm its outcome on a simpler, nonchiral model, cysteamine (CA) was treated with cyclopropenone 1. As expected, the reaction yielded two pairs of enantiomeric regioisomers (Figure ) that were separated by column chromatography and characterized by NMR spectroscopy.

Figure 2

(a) Reaction of cyclopropenone 1 with l-cysteine ethyl ester hydrochloride and cysteamine. (b) Oak Ridge thermal ellipsoid plot (ORTEP) of compound 2a with thermal ellipsoids at the 50% probability level and hydrogen atoms omitted for clarity.

Reaction Kinetics and Selectivity of CPO-Based Reagents

Next, the reaction kinetics of l-cysteine ethyl ester with CPO-based reagents were examined, and the limits of chemoselectivity were established. To evaluate the reaction kinetics, the HPLC chromatogram peak area of starting cyclopropenone 1 was monitored as a function of time over the course of the reaction (Figure S6). The second-order rate constant for this reaction was determined to be 3.0 M–1·s–1 at 4 °C, comparable to the value reported for the CBT–cysteine reaction (9.19 M–1·s–1) and strain-promoted azide–alkyne cycloaddition reactions (10–2–1 M–1·s–1) performed at higher temperatures (37 °C).[41] In fact, the rate constant for the reaction between cyclopropenone 1 and cysteine extrapolated to 37 °C would be 67 M–1·s–1, which supersedes the aforementioned values. A screen was performed to establish whether CPO-based reagent 1 underwent a reaction with other biologically relevant nucleophiles, including lysine, serine, threonine, tyrosine, glutathione, and cysteine methyl ester with a tert-butoxycarbonyl protected amino group (Boc-Cys-OMe). As shown by LC–MS, compound 1 did not react with lysine, serine, threonine, or tyrosine but showed excellent selectivity for l-cysteine. In addition, these amino acids did not interfere with the reaction between 1 and cysteine when present in the reaction mixture (Figures S9–S12). With the N-terminus protected, Boc-Cys-OMe nevertheless underwent a reaction with compound 1 to form a complex mixture of products, although the reaction was significantly slower compared to l-cysteine ethyl ester with an unprotected amino group at its N-terminus (Figure S14). In fact, when a stoichiometric mixture of both N-protected Boc-Cys-OMe and N-unprotected l-cysteine ethyl ester was treated with CPO-based reagent 1, high selectivity was observed toward the 1,2-aminothiol group of l-cysteine ethyl ester, resulting in almost exclusive formation of 1,4-thiazepan-5-one products 2a–d (Figure S14). Importantly, glutathione, the most abundant low-molecular-weight thiol in cells, did not react with CPO-based reagent 1 and did not interfere with the reaction between 1 and l-cysteine ethyl ester (Figure S13). CPO-based reagents have a clear advantage over 2-cyanobenzothiazole (CBT) reagents, which react with glutathione and other thiol nucleophiles.[26]

CPO-Based Reagents with a Range of Functionality Are Accessible from CPO-PFP

Before the applicability of the reaction on peptides and proteins was tested, several cyclopropenone-based reagents were designed and synthesized with different functionalities (Scheme ). The synthetic protocol began with commercially available 5-hexynoic acid 4, which was converted to the corresponding pentafluorophenol (PFP) ester 5 with pentafluorophenyl trifluoroacetate (PTFTFA, Scheme ). Ester 5 was subjected to cyclopropenation, affording the corresponding CPO-containing activated ester CPO-PFP, which is an easy-to-handle, stable (>6 months) white solid that can be accessed on a large scale. CPO-PFP can be used to append a cyclopropenone unit to a compound of interest if it contains a primary amine via a simple amide-bond-forming procedure in a fast and efficient manner. The reaction conditions of the coupling were optimized using benzylamine, providing CPO-BN in high yield (91%). This procedure was used to prepare several CPO-containing derivatives of interest for peptide and protein bioconjugation reactions. Cyclopropenone-based reagents were prepared that contained a poly(ethylene glycol) unit (CPO-PEG), a fluorescent dye (CPO-EDANS), and several derivatives with functional groups of relevance to click chemistry (CPO-PEG-Alkyne, CPO-N, and CPO-DBCO). Overall, intermediate CPO-PFP provides ready access to several CPO-based reagents by treatment with amine-bearing molecules of interest at room temperature in high yields and short reaction times (Scheme ).

Scheme 1

Synthesis of Cyclopropenone-Based Reagents with Different Functional Groups

Conditions: (i) PTFTFA, DIPEA, DCM, 1 h, 25 °C. (ii) 1. TMSCF3, NaI, THF, 25 °C, 48 h. 2. SiO2, CHCl3.

Synthesis of Cyclopropenone-Based Reagents with Different Functional Groups

Conditions: (i) PTFTFA, DIPEA, DCM, 1 h, 25 °C. (ii) 1. TMSCF3, NaI, THF, 25 °C, 48 h. 2. SiO2, CHCl3.

CPO-Based Reagents Selectively Modify N-Terminal Cysteine Residues on Peptides

The high reactivity and selectivity of compound 1 toward 1,2-aminothiols prompted us to investigate the ability of cyclopropenone-based reagents to modify N-terminal cysteine residues in peptides. Investigations began with two unprotected 5-mer peptides with N-terminal cysteine residues: CAIAI (P1) and CAIKI (P2). Notably, P2 also contains a lysine residue and therefore provides a test for the selectivity of CPO-based reagents for cysteine versus lysine residues in peptides. Treatment of both peptides (2 mM) with CPO-BN (2 equiv) in NaPi buffer (20 mM, pH 7)/acetonitrile resulted in complete conversion into the expected products after 1 h at 4 °C (Figure and Figures S15 and S17). Similarly, modification of peptides P1 and P2 with CPO-PEG resulted in the formation of the expected PEGylated products, as confirmed by LC–MS analysis (Figures S16 and S18).

Figure 3

Chemoselective modification of 1,2-aminothiols on peptides. (a) Site-specific modification of peptides P1–P4 by CPO-BN. Modification occurs only on the N-terminal cysteine residue. (b) CPO-BN selectively modifies the N-terminal cysteine residue of vasopressin (P5) and leaves the internal cysteine residue unmodified and accessible for further functionalization. Next, more complex peptides that contained multiple nucleophilic side chains were treated with CPO-based reagents. Initial efforts focused on laminin-derived synthetic peptide P3, an inhibitor of tumor growth.[42] This peptide contains nine amino acid residues, including an N-terminal cysteine, tyrosine, serine, asparagine, and arginine. Application of the optimized N-terminal cysteine modification protocol resulted in full conversion of the starting peptide after 1 h, as confirmed by LC–MS (Figure and Figure S19). The GTP-binding protein fragment (P4) is conceivably a more challenging substrate with 16 residues and multiple nucleophilic side chains (one N-terminal cysteine, three lysine, two serine, one threonine, and one methionine residue). Nevertheless, treatment of peptide P4 with CPO-BN (2 equiv) resulted in full conversion to products P4–BNa and P4–BNb within 1 h at 4 °C (Figure and Figure S20, respectively). No signal for double addition of CPO-BN was observed by mass spectrometry, highlighting the chemoselectivity of the protocol for cysteine residues over alternatives. To test whether chemoselectivity between N-terminal and internal cysteine residues held at the peptide level, a peptide that contains both was treated with CPO-BN. Vasopressin (P5), a 9-mer cyclic peptide by virtue of a disulfide bond, was reduced with TCEP to generate a linear peptide (P5-H) with cysteine residues in positions 1 (N-terminal) and 6 (internal). In control experiments without the addition of the reducing agent, vasopressin reacted with neither CPO-BN nor N-methylmaleimide (NMM), another common reagent for cysteine conjugation. In the course of the reduction of P5 to P5-H, care was taken to ensure that excess TCEP was removed from the reaction mixture before subsequent labeling (e.g., by using immobilized TCEP or limiting the amount used to 1 equiv) due to the incompatibility of phosphines with cysteine-labeling reagents, including cyclopropenones.[34,35] Reduced peptide P5-H was treated with either NMM or CPO-BN (2 equiv) to compare the selectivity of the two approaches (Figure ). As expected, NMM did not distinguish between the two cysteine residues, resulting in modification at both positions (Figure S22). In contrast, treatment of the reduced vasopressin peptide P5-H with CPO-BN resulted in selective modification of the N-terminal cysteine residue (Figure and Figure S24). Further experiments confirmed that the internal cysteine residue was still available for modification after the N-terminal cysteine residue was labeled with CPO-BN; treatment of CPO-modified vasopressin P5-BN with NMM led to quantitative modification of the internal cysteine residue (Figure and Figure S25), and analysis by LC–MS/MS confirmed the site specificity of the modifications (Figure S26). CBT-based reagents (Figure ), arguably the state-of-the-art method for N-terminal cysteine modification,[26] were tested under the same conditions to provide a direct comparison between CPO- and CBT-based reagents for selectivity toward the N-terminal cysteine residue of vasopressin. In contrast to the case with CPO-BN, treatment of P5-H with 2-cyano-6-hydroxybenzothiazole (CBT) afforded a mixture of single- and double-modified peptides (Figure and Figure S23). This case study highlights the potential for cyclopropenone-based reagents to offer increased site-specificity for N-terminal cysteine residues compared to the CBT analogues.

CPO-Based Reagents React Exclusively with N-Terminal Cysteine Residues on Proteins

With excellent selectivity and reaction kinetics demonstrated at the peptide level, attention turned to proteins. Fortunately, recombinant proteins with N-terminal cysteine residues are widely used for NCL, and so various approaches have been developed for their direct production.[43−46] A recombinant enhanced green fluorescent protein containing an N-terminal cysteine residue (cys-GFP) was produced by engineering a variant with the tobacco etch virus (TEV) protease recognition sequence (ENLYFQ↓C; arrow indicates the cleavage site) introduced after the His6 purification tag at the N-terminus. In this way, the cleavage of the expressed protein by TEV protease simultaneously removed the His6 tag and generated cys-GFP. Incubation of cys-GFP with CPO-BN (100 equiv) in NaPi buffer (20 mM, pH 7.0) at 25 °C for 2 h gave the desired conjugate GFP-CPO-BN with high efficiency, as confirmed by LC–MS spectrometry (Figure and Figure S38). Because proteins were modified at lower concentrations (ca. 30 μM) than peptides, higher stoichiometric equivalents of CPO-based reagents were used to ensure short reaction times (2–4 h). Following modification, digestion of GFP-CPO-BN with trypsin and subsequent analysis by LC–MS/MS confirmed the site of modification (Figure S41). The protein conjugate GFP-CPO-BN also displayed excellent stability; after incubation with glutathione (5 mM, 24 h, 37 °C), no deconjugation was observed (Figure S40). Molecular dynamics simulations (0.5 μs) were performed on the four possible conjugates of GFP-CPO-BN (Figure S66). The root-mean-square deviation values of the peptide backbone in all complexes ranged from 1.21 to 2.67 Å, suggesting that the addition of CPO-BN does not cause significant structural modifications to the protein. This theoretical approach was validated with experiments for other proteins by using circular dichroism spectroscopy.

Figure 4

Top left: Chemoselective modification of 1,2-aminothiols on proteins. Bottom: Site-specific bioconjugation of cys-GFP with CPO-BN, CPO-PEG, CPO-EDANS, and CPO-biotin. Top right: Treatment of a mixture of cys-GFP, C2Am, AnxV, and DesAB-HET (5 μM each; N-terminal residues C, G, A, and M, respectively) with CPO-biotin or MI-biotin (50 equiv) in NaPi buffer (20 mM, pH 7.0) with or without DTT (500 equiv). After demonstrating the chemoselectivity and efficiency of CPO-BN for N-terminal cysteine bioconjugation, we expanded the scope of the reaction to other CPO-based reagents. Testing CPO-PEG and CPO-EDANS under the same reaction conditions as used for CPO-BN resulted in successful conversion of cys-GFP to the expected products (Figure and Figures S42 and S43). In the case of CPO-EDANS, the lower conversion (70%) ostensibly arises from the poor solubility of the reagent. In order to determine whether purification of CPO-based reagents was necessary for protein conjugation, we tested out a telescoped procedure. CPO-PFP was mixed with a commercially available amine-functionalized derivative of biotin, providing CPO-biotin as assayed by LC–MS (Figure S47). Then, without any additional purification steps, the resulting solution was directly added to cys-GFP (100 equiv of CPO-biotin to 1 equiv of cys-GFP), and the reaction was monitored by LC–MS. To our delight, conversion of cys-GFP to the expected conjugate was complete after incubation for 2 h (Figure and Figure S45). Thus, in this case there is no need to purify the CPO-based reagent before the protein-labeling step which improves the throughput of the protein-modification workflow. In the course of our studies employing cys-GFP, it was found that the addition of dithiothreitol (DTT) to the protein stock solution was necessary to maintain the reactivity of the cysteine residue, presumably due to high susceptibility of the N-terminal cysteine residue to undergo oxidation and lead to protein dimerization. Nevertheless, an excess of DTT (500 equiv) in the reaction mixture did not hinder the bioconjugation reaction of CPO-based reagents with the N-terminal cysteine residue. In contrast, the reaction of cys-GFP with the traditional maleimide-based reagent MI-biotin under otherwise identical conditions produced very low conversion (∼13%; Figure S46), presumably a result of the incompatibility of maleimides with excess DTT.[47] Once again, this highlights the selectivity of the CPO-based reagents toward 1,2-aminothiols; 100 equiv of the reagent was enough to afford the desired protein conjugate even in the presence of an excess of DTT (Figure ). Next, we explored whether CPO-based reagents could selectively label a N-terminal cysteine-containing protein in the presence of other internal cysteine-containing proteins. A mixture of four proteins was prepared: cys-GFP (three free cysteine residues, including one N-terminal), an engineered variant of the C2A domain of synaptotagmin-I (C2Am, one free cysteine residue), annexin V (AnxV, one free cysteine residue), and an engineered variant of a nanobody (DesAB-HET, one free cysteine residue), and it was incubated with CPO-biotin or the analogous maleimide-based reagent, MI-biotin. In the case of CPO-biotin, successful conjugation was observed with cys-GFP whereas proteins C2Am, AnxV, and DesAB-HET were left unchanged. Similar results were observed regardless of whether DTT (500 equiv) was present in the reaction mixture (Figure and Figures S50 and S52). In contrast, MI-biotin did not exhibit any selectivity and fully modified three out of four proteins (cys-GFP, C2Am, and DesAB-HET) when DTT was absent from the reaction mixture (Figure and Figure S51). AnxV was not modified because it usually requires a large excess of reagents, higher temperatures, or longer reaction times for the cysteine modification to proceed.[48] As expected, in the presence of DTT, only minor modification of proteins (0–15%) with MI-biotin was observed (Figure and Figure S53). Overall, these data confirm the ability of CPO-based reagents to orthogonally label N-terminal cysteine residues in complex mixtures of other proteins bearing reactive internal cysteine residues and in the presence of DTT.

Site-Specific Cysteine Labeling of a Single Protein

Finally, CPO-based reagents were applied to the preparation of a protein conjugate endowed with two modifications selectively applied at different cysteine residues (dual protein conjugate). Preparation of dual protein conjugates typically relies on adding both of the two new functionalities to a single residue[49,50] or relying on disulfide groups for protection;[51] few methods exploit the inherent reactivity of distinct cysteine residues to achieve site-specific labeling,[9,52] especially in the absence of a tag sequence.[53−55] To test CPO-based reagents for this purpose, a double mutant of GFP (2×cys-GFP) was produced that contained both an internal (S147C) and an N-terminal cysteine residue (Figure a). Treatment of 2×cys-GFP with CPO-PEG (100 equiv, 2 h) in the presence of DTT (500 equiv) led to a protein conjugate with a mass corresponding to 2×cys-GFP-1 (Figure b). After excess CPO-PEG and DTT were removed, the product was treated with the carbonylacrylic-based reagent[15]CAA-BN (20 equiv, 2 h), providing the dual protein conjugate 2×cys-GFP-2 (Figure a). Attempts to handle 2×cys-GFP in the absence of DTT resulted in extremely rapid dimerization (<5 min) of the protein as assayed by LC–MS, highlighting the importance of DTT-compatible CPO-based reagents for bioconjugation reactions. Following treatment with trypsin, analysis of both 2×cys-GFP-1 and 2×cys-GFP-2 by LC–MS/MS supported the site-specificity of the two modifications. Analysis by circular dichroism spectroscopy (Figure c) confirmed that the initial protein and subsequent conjugates retained the secondary structure with high proportions of antiparallel β-sheets.[56]

Figure 5

(a) Selective modification of 2×cys-GFP with (i) CPO-PEG and (ii) benzoylacrylic reagent. Only the major regioisomer is shown for clarity. (b) Deconvoluted mass spectra of GFP species. (c) Circular dichroism of GFP derivatives.

Cyclopropenone Ring-Opening after Double Nucleophilic Attack Makes the Reaction with 1,2-Aminothiols Irreversible

To understand the origin of the selectivity for CPO-based reagents for N-terminal cysteine residues over other thiols such as internal cysteine residues and DTT, the whole reaction mechanism was interrogated through quantum mechanical (QM) calculations. 2-Methylcyclopropenone (A) and cysteamine (CA) provided a suitable abbreviated model for this study (Figure ), permitting calculation of a comprehensive free energy surface linking each of the four possible regio- and stereoisomeric products to the starting materials (see the Supporting Information). Here, discussion is limited to the minimum energy pathway, although calculations were in good qualitative agreement with the observed ratio of products (Supporting Information).

Figure 6

Proposed mechanism and minimum energy pathway for the reaction between 2-methylcyclopropenone (A) and cysteamine thiolate (CA) calculated at the PCM (H2O)/M06-2X/6-31+G(d,p) level of theory. Acid–base equilibria were calculated using trihydrated bicarbonate and carbonate anions as an acid and a base, respectively (see the Supporting Information). Given the intrinsic inaccuracy of calculating such equilibria, relative energies of charged/neutral species should be considered with caution, though qualitative trends hold. The addition of CA to A to different positions of the double bond can result in two possible pathways, leading to a enantiomeric mixture of two regioisomers (see the Supporting Information); only the minimum energy profile is shown here for simplicity. In the first step of the mechanism, cysteamine thiolate undergoes nucleophilic conjugate addition to the slightly more favored, least-hindered β-carbon of A through transition state TS1 (ΔG‡ = 14.6 kcal·mol–1). The thio–enolate adduct ([B]–) is unstable with respect to the starting materials (ΔG[ = 10.5 kcal·mol–1), as commonly calculated for such intermediates in S-Michael-type reactions.[57] In contrast to the common trend calculated for noncyclopropenone electrophiles, protonation of the enolate does not lead to significant stabilization of the adduct (ΔG[ = 8.2 kcal·mol–1). Therefore, the addition of simple thiolates to cyclopropenones is endergonic and reversible. This fact is crucial for the selectivity of CPO-based regents for N-terminal cysteine residues over other thiols such as internal cysteine residue side chains, glutathione, and DTT. The second part of the reaction mechanism is only accessible to thiols with pendant nucleophiles, thus excluding internal cysteine residues as substrates. The intramolecular addition of the amino group to the cyclopropanone carbonyl proceeds via TS2 (ΔG‡ = 14.9 kcal·mol–1) which, upon tautomerization, provides tetrahedral intermediate [C] (ΔG[ = −8.4 kcal·mol–1). Despite the presence of a bicyclic 2-thia-5-azabicyclo[4.1.0]heptane structure in [C], the formation of this intermediate is exothermic due to the significant strain released by carbonyl sp2→sp3 rehybridization, in line with previous observations.[58] In an alternative mechanistic scenario, we were unable to locate transition states from [B]− corresponding to spontaneous ring-opening and formation of ketene–ylide intermediates of the type proposed in the bioorthogonal ligation of cyclopropenones assisted by triarylphosphines,[34] possibly a result of the substitution for a cationic phosphonium to a neutral thioether substituent. Anionic intermediate [C]−, formed upon deprotonation of C, undergoes a fast ring-expansion with an intrinsic barrier (TS3) of 5.9 kcal·mol–1 in a process that resembles mechanistic aspects of Favorskii rearrangements.[58,59] The formation of 1,4-thiazepan-5-one species D is strongly exothermic (ΔG[ = −41.9 kcal·mol–1), in good agreement with the observed stability of protein- and peptide-conjugates containing this functional group. These computational results are in good agreement with the experimentally determined kinetic parameters of the reaction of 1 with l-cysteine ethyl ester. As a model reaction for lysine side chains reacting with CPO-based reagents, calculations were performed on the addition of methylamine to 2-methylcyclopropenone (A). The activation barrier for the initial conjugate addition was calculated to have a free energy of 19.5 kcal·mol–1 (Figure S4b), significantly higher than the case for thiolates (ΔG‡ = 14.6 kcal·mol–1). These calculated results are in accordance with experiment, where the preference for CPO-based reagents to react with thiolates (and 1,2-aminothiols) over primary amines was demonstrated in competition experiments.

CPO-Based Modification and Dimerization of nIL2

Finally, the scope of proteins that could be modified with CPO-based reagents was extended to a protein of relevance to the IL2 receptor. Recent work demonstrated the potential for de novo protein design by preparing a mimic of IL2 (nIL2) that binds selectively to the β, γ subunits of the IL2 receptor but not to the α subunit.[60] The properties of nIL2 have resulted in its evaluation in a Phase I clinical trial for treating advanced solid tumors with it.[61] A variant of nIL2 that has an N-terminal cysteine residue (cys-nIL2) was prepared for the purpose of site-specific conjugation with CPO-based reagents. The introduction of an N-terminal cysteine residue was achieved using an enterokinase cleavage site (DDDDK↓C), which also resulted in the removal of the His6 purification tag. Because PEGylation of proteins can result in enhanced biological properties and chemical stability,[3] initial studies focused on the bioconjugation of cys-nIL2 with CPO-PEG (100 equiv); once again the presence of DTT was necessary in order to prevent the formation of a disulfide dimer of cys-nIL2 but did not interfere with the protein conjugation step. The resulting conjugate, nIL2-PEG, formed quantitatively after 3 h and was characterized by mass spectrometry and CD spectroscopy (Figure ). The latter showed that CPO-based PEGylation did not affect the secondary structure of cys-nIL2.

Figure 7

Preparation of a covalent dimer of cys-nIL2 using CPO-based chemistry. Conditions: (i, ii) CPO-N or CPO-DBCO (100 equiv, respectively), DTT (100 equiv), NaPi (20 mM), pH 7, 4 h; (iii) NaPi (20 mM), pH 7, 1 h. We wondered whether it might be possible to perform a copper-free click reaction between two proteins prepared using a protocol employing CPO-based reagents (Figure ). De novo designed cys-nIL2 represents an interesting case study in this regard, as dimerization would be expected to enhance binding to the IL2-β receptor compared to monomeric nIL2 as a result of avidity effects. In order to prepare a dimer, cys-nIL2 was treated with either CPO-N or CPO-DBCO, providing the desired click-labeled protein conjugates nIL2-N and nIL2-DBCO, respectively. These orthogonally labeled proteins were then combined (∼30 μM), and the reaction was monitored by LC–MS spectrometry. The desired dimer (nIL2) formed in 30 min and after purification by size exclusion chromatography was obtained in 52% isolated yield and was characterized by SDS–PAGE and LC–MS spectrometry. This approach utilizing N-terminal cysteine residues is complementary to other methods for protein–protein conjugation, which rely on longer peptide sequence tags at internal or C-terminal positions. In contrast to the use of a bis(maleimide)-PEG3 reagent, this approach using bioorthogonal click chemistry enabled by CPO-based protein conjugation permits the use of DTT and the opportunity for N-terminal site-selectivity in the presence of other cysteine residues. The binding of cys-nIL2, nIL2-PEG, and (nIL2) to the β-IL2 receptor was measured using biolayer interferometry. The unmodified protein cys-nIL2 exhibited excellent binding (Kd ≈ 13 nM) in line with the previous measurement of the original protein nIL2 (Kd ≈ 19 nM). PEGylated derivative nIL2-PEG displayed a slightly diminished binding (Kd ≈ 34 nM), ostensibly as a result of increased steric bulk engendered by the −(PEG)4OH modification. To our delight the dimer (nIL2), prepared by CPO-based modification and subsequent click-coupling, provided an order of magnitude increase in the binding (Kd ≈ 3 nm) compared to the corresponding monomer. This primarily arises due to a large decrease in koff, a result of the increased avidity offered by two equivalent binding regions on (nIL2).

Conclusion

We have developed an efficient method for selectively modifying N-terminal cysteine residues on peptides and proteins. The method relies on the reaction of monosubstituted cyclopropenones with 1,2-aminothiols, resulting in the formation of a stable 1,4-thiazepan-5-one linkage. Various functional groups (dyes, ligands, biotin, PEG groups, and handles for click chemistry) can be readily added to proteins via the universal precursor CPO-PFP, an isolable and bench-stable solid. The reaction of N-terminal cysteine residues and CPO-based reagents proceeds with high efficiency under mild conditions (aqueous buffer, pH 7, 4–25 °C). Remarkably, the reaction is selective for N-terminal cysteine residues in the presence of (i) internal cysteine residues on the same protein, (ii) internal cysteine residues on other proteins, (iii) biological thiols, and (iv) nucleophilic reagents such as DTT. The ability to target N-terminal cysteine residues, readily available on proteins as components of NCL reactions, in the presence of other solvent-exposed and reactive cysteine residues represents a straightforward method for constructing complex and chemically precise bioconjugates, such as 2×cys-GFP-2 and (nIL2), without the need for extensive protein engineering.

53 in total

1. Direct production of proteins with N-terminal cysteine for site-specific conjugation.

Authors: Ian E Gentle; David P De Souza; Manuel Baca
Journal: Bioconjug Chem Date: 2004 May-Jun Impact factor: 4.774

2. Recent advances in the construction of antibody-drug conjugates.

Authors: Vijay Chudasama; Antoine Maruani; Stephen Caddick
Journal: Nat Chem Date: 2016-01-04 Impact factor: 24.427

3. Single-Site Labeling of Native Proteins Enabled by a Chemoselective and Site-Selective Chemical Technology.

Authors: Srinivasa Rao Adusumalli; Dattatraya Gautam Rawale; Usha Singh; Prabhanshu Tripathi; Rajesh Paul; Neetu Kalra; Ram Kumar Mishra; Sanjeev Shukla; Vishal Rai
Journal: J Am Chem Soc Date: 2018-10-26 Impact factor: 15.419

Review 4. Chemoenzymatic Semisynthesis of Proteins.

Authors: Robert E Thompson; Tom W Muir
Journal: Chem Rev Date: 2019-11-27 Impact factor: 60.622

5. A biocompatible condensation reaction for the labeling of terminal cysteine residues on proteins.

Authors: Hongjun Ren; Fei Xiao; Ke Zhan; Young-Pil Kim; Hexin Xie; Zuyong Xia; Jianghong Rao
Journal: Angew Chem Int Ed Engl Date: 2009 Impact factor: 15.336

6. Thiazolidine formation as a general and site-specific conjugation method for synthetic peptides and proteins.

Authors: L Zhang; J P Tam
Journal: Anal Biochem Date: 1996-01-01 Impact factor: 3.365

7. Difluorocarbene Addition to Alkenes and Alkynes in Continuous Flow.

Authors: Pauline Rullière; Patrick Cyr; André B Charette
Journal: Org Lett Date: 2016-04-27 Impact factor: 6.005

Review 8. Contemporary Approaches for Site-Selective Dual Functionalization of Proteins.

Authors: Lujuan Xu; Seah Ling Kuan; Tanja Weil
Journal: Angew Chem Int Ed Engl Date: 2021-02-26 Impact factor: 15.336

9. A novel approach to the site-selective dual labelling of a protein via chemoselective cysteine modification.

Authors: Ramiz I Nathani; Paul Moody; Vijay Chudasama; Mark E B Smith; Richard J Fitzmaurice; Stephen Caddick
Journal: Chem Sci Date: 2013-06-18 Impact factor: 9.825

10. Orthogonal Cysteine Protection Enables Homogeneous Multi-Drug Antibody-Drug Conjugates.

Authors: Matthew R Levengood; Xinqun Zhang; Joshua H Hunter; Kim K Emmerton; Jamie B Miyamoto; Timothy S Lewis; Peter D Senter
Journal: Angew Chem Int Ed Engl Date: 2016-12-14 Impact factor: 15.336

2 in total

1. π-Clamp-Mediated Homo- and Heterodimerization of Single-Domain Antibodies via Site-Specific Homobifunctional Conjugation.

Authors: Ross J Taylor; Mauricio Aguilar Rangel; Michael B Geeson; Pietro Sormanni; Michele Vendruscolo; Gonçalo J L Bernardes
Journal: J Am Chem Soc Date: 2022-07-14 Impact factor: 16.383

Review 2. Chemical and Enzymatic Methods for Post-Translational Protein-Protein Conjugation.

Authors: Ross J Taylor; Michael B Geeson; Toby Journeaux; Gonçalo J L Bernardes
Journal: J Am Chem Soc Date: 2022-07-31 Impact factor: 16.383

2 in total