Literature DB >> 36083810

Redox-Controlled Chemical Protein Synthesis: Sundry Shades of Latency.

Vangelis Agouridas^1,2, Nathalie Ollivier¹, Jérôme Vicogne¹, Vincent Diemer¹, Oleg Melnyk¹.

Abstract

ConspectusThe last two decades have witnessed the rise in power of chemical protein synthesis to the point where it now constitutes an established corpus of synthetic methods efficiently complementing biological approaches. One factor explaining this spectacular evolution is the emergence of a new class of chemoselective reactions enabling the formation of native peptide bonds between two unprotected peptidic segments, also known as native ligation reactions. In recent years, their application has fueled the production of homogeneous batches of large and highly decorated protein targets with a control of their composition at the atomic level. In doing so, native ligation reactions have provided the means for successful applications in chemical biology, medicinal chemistry, materials science, and nanotechnology research.The native chemical ligation (NCL) reaction has had a major impact on the field by enabling the chemoselective formation of a native peptide bond between a C-terminal peptidyl thioester and an N-terminal cysteinyl peptide. Since its introduction in 1994, the NCL reaction has been made the object of significant improvements and its scope and limitations have been thoroughly investigated. Furthermore, the diversification of peptide segment assembly strategies has been essential to access proteins of increasing complexity and has had to overcome the challenge of controlling the reactivity of ligation partners.One hallmark of NCL is its dependency on thiol reactivity, including for its catalysis. While Nature constantly plays with the redox properties of biological thiols for the regulation of numerous biochemical pathways, such a control of reactivity is challenging to achieve in synthetic organic chemistry and, in particular, for those methods used for assembling peptide segments by chemical ligation. This Account covers the studies conducted by our group in this area. A leading theme of our research has been the conception of controllable acyl donors and cysteine surrogates that place the chemoselective formation of amide bonds by NCL-like reactions under the control of dichalcogenide-based redox systems. The dependency of the redox potential of dichalcogenide bonds on the nature of the chalcogenides involved (S, Se) has appeared as a powerful means for diversifying the systems, while allowing their sequential activation for protein synthesis. Such a control of reactivity mediated by the addition of harmless redox additives has greatly facilitated the modular and efficient preparation of multiple targets of biological relevance. Taken together, these endeavors provide a practical and robust set of methods to address synthetic challenges in chemical protein synthesis.

Entities: Chemical

Mesh：

Substances：

Year: 2022 PMID： 36083810 PMCID： PMC9494750 DOI： 10.1021/acs.accounts.2c00436

Source DB: PubMed Journal: Acc Chem Res ISSN： 0001-4842 Impact factor: 24.466

Raibaut, L.; Cargoët, M.; Ollivier, N.; Chang, Y. M.; Drobecq, H.; Boll, E.; Desmet, R.; Monbaliu, J.-C. M.; Melnyk, O. .[1] This study discusses in detail the concepts related to the latency of bis(2-sulfanylethyl)amido (SEA) and bis(2-selanylethyl)amido (SeEA) thioester surrogates, their acyl donor potencies, and how redox and kinetically controlled assembly processes can be combined for the total synthesis of the biotinylated NK1 protein, a 20 kDa protein derived from the hepatocyte growth factor (HGF). Ollivier, N.; Desmet, R.; Drobecq, H.; Blanpain, A.; Boll, E.; Leclercq, B.; Mougel, A.; Vicogne, J.; Melnyk, O. .[2] This article is a useful complement to this Account by discussing the application of SEA latent thioester chemistry to the synthesis of proteins through iterative ligation on a water-compatible solid support. Diemer, V.; Ollivier, N.; Leclercq, B.; Drobecq, H.; Vicogne, J.; Agouridas, V.; Melnyk, O. (3) This study shows that N-(2-selanylethyl)cysteine (SetCys) is a redox-controlled cysteine surrogate that can be combined with the SEA group for the one-pot synthesis of cyclic proteins. This article also provides a robust kinetic model for SetCys-mediated ligation. Snella, B.; Grain, B.; Vicogne, J.; Capet, F.; Wiltschi, B.; Melnyk, O.; Agouridas, V. .[4] This study describes the capacity of bis(2-sulfanylethyl) oxalamides (oxoSEA) to act as an efficient acyl donor in the nanomolar concentration range in purified or complex media such as cell lysates.

Introduction

“To equal Nature...I therefore foresee the day when physiological chemistry will not only make extensive use of the natural enzymes as agents, but when it will also prepare synthetic ferments for its purposes” said Emil Fischer in his Nobel Lecture in 1902.[5] Emil Fischer’s prediction came true in 1969 when Bruce Merrifield could produce a fully synthetic and functional ribonuclease A enzyme from component amino acids using the solid phase peptide synthesis (SPPS).[6] Since then, chemical protein synthesis has become an essential method for the study of protein structure and function, for the investigation of cellular mechanisms, or for the development of therapeutic molecules due to its capacity to access proteins with virtually any kind of modification or functional probe. Since the synthesis of ribonuclease A enzyme, the methods used for accessing proteins by chemical synthesis have considerably evolved. While pioneering works privileged the SPPS based on the stepwise coupling of protected amino acids in organic solvents,[7] modern chemical protein synthesis primarily relies on the chemoselective coupling of unprotected peptide segments in aqueous media among which the native chemical ligation (NCL[8]) or the ketoacid-hydroxylamine (KAHA[9]) or serine/threonine (STL[10]) ligations are more advanced. The assembly of proteins from shorter peptide segments is faced with the diversity of the peptide junctions that has to be made and with the variable solubility and stability of the peptide segments that are linked together. Tremendous efforts are devoted to solve these issues especially by extending the scope of the NCL reaction, the most widely used peptide ligation method for chemical protein synthesis up to now (Figure a).[11] NCL relies on the coupling of a peptide thioester with a cysteinyl peptide to produce a longer peptide with a novel peptide bond to cysteine. The performance of chemical protein synthesis using NCL can be appreciated by the number of functional proteins produced so far as well as the exceptional size of some synthetic proteins accessed through this chemistry.[12,13]

Figure 1

Native chemical ligation reaction. (a) Principle. (b) NCL-mediated peptide segment assembly using a Cys protecting group strategy.

Native chemical ligation reaction. (a) Principle. (b) NCL-mediated peptide segment assembly using a Cys protecting group strategy. Nevertheless, the future of chemical protein synthesis and its spread to the chemical biology community largely depends on the ease with which a protein can be assembled from individual peptide segments. While peptides can be routinely produced using SPPS and automated protocols, this is not yet the case for proteins, but the field is clearly advancing in this direction.[14] We believe that a key to success resides in the simplification of the peptide segment assembly strategies. This was the main goal of our research efforts during the past decade. Rather than make us go back, the search for simplicity required innovation and led us to the conception of novel redox-controlled ligation chemistries related to NCL.

Chemical Protein Synthesis: The Search for Pot-Economy

The assembly of a protein from a collection of shorter peptide segments is inevitably confronted with the need to sequentially mask and unmask the reactivity of chemical groups involved in the formation of the peptide bonds as the synthesis progresses. In the example provided in Figure b, a protein is assembled from three peptide segments by elongating the peptide chain in the C-to-N direction using NCL. This strategy implies the presence of two N-terminal Cys residues at step 1. Although NCL is highly chemoselective, it cannot distinguish between two Cys residues that are freely accessible. One classical and very popular approach to direct the peptide segment assembly in such a case is to equip the N-terminal Cys residue of the middle peptide segment with a temporary protecting group (PG), which must be removed before running the second ligation step. The mean size of the peptide segments used for protein chemical synthesis is in average less than 40 AAs, a value that reflects the performance of the SPPS.[13] Therefore, the access to proteins of larger size will require a higher number of peptide segments to be concatenated and, thus, will result in an increase of process complexity due to the accumulation of chemical steps and intermediate purifications, whatever the chemical strategy used to control the protein assembly. Consequently, simplification of protein chemical synthesis has been early viewed as a critical point to address for accessing proteins of exceptional size. In that, protein chemists are confronted with issues that have been experienced by organic chemists for a long time in their quest for the chemical synthesis of complex natural products. Today, the minimal use of protective groups and the integration of the dimensions of step-[15] and pot-economy[16] in process design is perceived as natural when it comes to envisage the synthesis of a protein, and such a tendency participates in making protein chemical synthesis more popular. Proteins are a special kind of “natural products” in that their total synthesis requires manipulation and especially purification of polar polypeptide intermediates. Numerous reports have documented moderate or weak recoveries of large polypeptides when classical methods of chromatography such as reversed-phase HPLC are used.[17] This is the reason that the performance of a series of chemical transformations in the same reactor has been the focus of so many research efforts. The development of one-pot peptide assembly methods started with the work of Bang and Kent on the synthesis of crambin using the thiazolidine protection for N-terminal cysteine in 2004 (Figure b).[18] Such a use of Cys PGs for the design of one-pot peptide assembly strategies remains a timely and dynamic area of research considering the efficient approaches of this kind published recently.[11,19] Inspired by the significant gain in synthetic efficiency provided by these one-pot approaches, we considered taking such a strategy a step further. Is it possible to make it more pot-economical without using Cys PGs to control the selectivity of the peptide assembly? This question took us in a conceptually different direction by placing the concept of thioester group latency at the heart of our work.

Latent Functional Groups: An Old Trick for New Challenges

The concept of latent functionality was formalized for the first time by Lednicer in 1972[20] and widely applied afterward.[21] According to the definition given by Lednicer, “this idea, which we have chosen to call latent functionality, carries some necessary function through one or more steps of a synthesis in a precursor form; at the proper stage the precursor is converted to the needed group”. The decision to use latent thioesters instead of protected cysteines during the peptide segment assembly implies the elongation process should be reconfigured in the N-to-C direction as depicted in Figure . At Step 1 of the process, only the N-terminal Cys and the active thioester group present in the reaction mixture can react, the second thioester being masked under its latent form Xoff during the ligation. The design of latent chemical systems, presenting two well-defined silent (Xoff) and active (Xon) states is necessary. Otherwise, the residual reactivity expressed during the latency period would inevitably affect the selectivity of the assembly, and the yield of the desired ligation product would depend on the differential reactivity of the present acyl donors. This is typically what happens during kinetically controlled ligations (KCLs[22]), which also enable protein synthesis in the N-to-C direction. The concepts developed in this Account differ markedly from KCL approaches in that the second thioester functionality needed for Step 2 is generated only after the first ligation step is completed.

Figure 2

Concept of latency illustrated with the one-pot assembly of three peptide segments.

Concept of latency illustrated with the one-pot assembly of three peptide segments. The need to make the peptide assembly more pot-economical led us to think how to place the latent functionality under the control of an easily-tunable physicochemical parameter of the aqueous solution where the protein assembly takes place. In response to the applied stimulus, the active thioester group would be generated in situ after a series of spontaneous chemical transformations. Now comes the question of which physicochemical parameter to choose for the design of latent thioester systems. The use of pH as a parameter to control peptide assembly is limited by the narrow range of pH tolerated by NCL. Indeed, Cys thiolates are the nucleophilic species in NCL and are significantly populated only above pH 5.5–6,[23] while thioesters are prone to hydrolysis in basic media. Consequently, reports using the pH to control protein assembly are rare.[24] The sensitivity of some chemical systems to general acid or base catalysis gives opportunities to place the latent thioester under the control of the buffer concentration. This mode of control was studied by Otaka and co-workers with the SEAlide latent thioester system, which is responsive to phosphate buffer concentration.[25] The use of temperature has great potential as it enables an external control of the process with no significant change of pH or other reaction parameters.[26] However, the design of temperature-dependent chemical systems is a difficult task as it requires elevated activation energies to provide a strong response within a narrow range of temperature. An acceptable temperature range for reactions conducted with peptide segments in water is usually 10–55 °C, unless special setups such as microfluidic conditions are employed.[27] In search for a physicochemical parameter that could strongly vary at neutral pH, we identified the very large spectrum of reducing powers attainable in water with simple and well-tolerated reagents such as thiols and phosphines. Indeed, just the consideration of the reducing power of thiols, appreciated through the value of the thiol–disulfide interchange equilibrium constant relative to mercaptoethanol disulfide, shows differences of several orders of magnitude by going from aromatic thiols to dithiols such as 1,4-dithiothreitol (DTT) (Figure a).[29] We therefore chose to design redox-sensitive systems for peptide ligation though NCL that would have to accommodate the presence of thiols such as the Cys thiol involved in the NCL step and the monothiol catalysts classically used to promote the ligation.[28] The high stability of certain cyclic disulfides toward aryl and alkyl thiol reductants, especially those formed from 1,4- and 1,5-dithiols, led us to propose cyclic disulfides as a means to place the latency under redox control (X, Y = S; Figure b). Knowing also the large difference in redox potential between disulfides and selenosulfides or diselenides, we perceived the opportunity to create a family of latent systems responding to incremental changes in reducing power by combining sulfur and selenium chemistries (X, Y = S, Se; Figure b).

Figure 3

Sulfur and selenium chemistries enable the design of redox-switches. (a) Thiol–disulfide interchange equilibrium constant relative to mercaptoethanol (Keq) for a selection of thiols and selenocysteamine. (b) Redox-switches based on cyclic dichalcogenide structures.

Carboxamides as Redox-Controlled Latent Thioesters

Then came the time to know how to implement these ideas for the creation of redox-sensitive thioesters. Studies conducted by Barnett and Jencks in the late 1960’s on the rearrangement of N-acetylmercaptoethylamine 1 into S-acetylmercaptoethylamine 2/3 in aqueous acid was particularly inspiring (Figure a).[30] In such an equilibrium, the thioester form 2/3 is significantly populated only in highly acidic media (pH < 2) due to the masking of the amine nucleophile upon protonation. Therefore, the conception of latent thioesters based on the natural capacity of N-(2-mercaptoethyl) amides of type 1 to equilibrate with thioesters of type 2/3 had to overcome a major obstacle in making the rearrangement happen at the working pH of NCL. The migration of the acyl group from nitrogen to sulfur can potentially be promoted by weakening the amide bond (Figure b). Previous works showed that this could be achieved by introducing electron-withdrawing groups (EWGs) alpha to the carbonyl,[31] by increasing the size of the substituents on the amide nitrogen,[32] or by enabling intramolecular hydrogen bonding to the amide nitrogen.[33] The latter strategy can be particularly powerful as shown by the impact of a weak intramolecular hydrogen bond to the prolyl nitrogen on the barrier of amide isomerization.[33] Another means to favor the rearrangement of amides of type 1 is to increase the concentration of the nucleophile that attacks the amide carbonyl (Figure b).

Figure 4

Rearrangement of N-(2-mercaptoethyl) amides into thioesters. (a) The case of N-acetylmercaptoethylamine 1. (b) Possible means for favoring the N,S-acyl shift of N-(2-mercaptoethyl) amides.

Rearrangement of N-(2-mercaptoethyl) amides into thioesters. (a) The case of N-acetylmercaptoethylamine 1. (b) Possible means for favoring the N,S-acyl shift of N-(2-mercaptoethyl) amides. It turns out that simply appending a second 2-mercaptoethyl limb to the amide nitrogen of 1 yields the bis(2-sulfanylethyl) amide (SEA) system 4, which can potentially benefit from several of the activation mechanisms discussed above (Figure ). Indeed, doing so increases the size of amide nitrogen substituents and the number of thiol groups around the amide carbonyl and creates a favorable arrangement for an intramolecular H-bonding to the amide carbonyl to occur. Experimentally, SEA amides of type 4 act as thioester surrogates at slightly acidic or neutral pH and react with N-terminal cysteinyl peptides to produce a peptide bond to cysteine.[34] This reactivity contrasts with that of monovalent N-(2-mercaptoethyl) amides of type 1 or N,N-(2-mercaptoethyl) alkyl amides lacking the second thiol group. Experimentations using SEA peptides showed that SEA-mediated reactions are usually under the kinetic control of the N,S-acyl shift process (4 → 5 in Figure , R = peptide).[1] Computational studies using H–CO–Gly–N(CH2CH2SH)2 (R = H–(C=O)–NHCH2– in Figure ) as a model for SEA peptides showed an intramolecular S–H–N interaction in the transition state TS of lowest energy. As the calculations suggest, the SEA amide system 4 enables one thiolate to act as the nucleophile while the other thiol transfers its proton to the amide nitrogen in the transition state. The exact nature of the N,S-acyl shift transition state remains to be established, probably by conducting more in depth computational studies that take into account explicit water molecules. Regarding the scope and limitations of the SEA-mediated ligation, we noticed that SEA-peptide epimerization or hydrolysis is usually insignificant during ligation, which proceeds optimally at pH 5.5 under an inert atmosphere.[1] The rate of SEA-mediated ligation and occurrence of side reactions is impacted by the nature of the C-terminal amino acid bearing the SEA functionality in a manner that follows the behavior of classical peptide alkyl thioesters.[11]

Figure 5

Conception of the bis(2-sulfanylethyl)amido (SEA) system.

Conception of the bis(2-sulfanylethyl)amido (SEA) system. A second important consequence of appending a second 2-mercaptoethyl limb to the amide nitrogen of 1 is to create a 1,5-dithiol structure in amide 4, a feature that enables one to place the SEA acyl donor under redox control (Figure ). The cyclic form 7, which is formally an N-acyl-1,2,5-dithiazepane,[35] is called SEAoff. The SEAoff amide is extremely stable toward various reagents such as acids, bases, and amine nucleophiles as can be expected for a tertiary carboxamide. Such a property facilitates the installation of a SEA amide group into peptides using standard solid phase or solution protocols.[4,36] Also, and as anticipated from the thiol–disulfide interchange equilibrium constants presented in Figure a, SEAoff amide 7 is stable in the presence of large excesses of 4-mercaptophenylacetic acid (MPAA), the gold-standard catalyst of NCL, and under conditions that enable the reduction of acyclic disulfides, especially those derived from cysteine. Hence, SEAoff amide 7 does not interfere with the NCL reaction using regular thioester components, provided strong reductants are excluded from the ligation mixture. In contrast, the reduction of the cyclic disulfide bond using DTT or tris(2-carboxyethyl)phosphine (TCEP) unlocks the acyl donor capabilities of the SEA system by triggering the N,S-acyl shift and the formation of thioester 5 from amide 4. This binary chemical behavior makes the assembly of three peptide segments in one-pot under redox control extremely simple to setup (Figure a). In the first step, a regular NCL reaction is performed in absence of a strong reductant under an inert atmosphere, conditions that enable one to keep the SEAoff group inactive while minimizing thiol oxidation by molecular oxygen. Note that, under such conditions, the partial formation of mixed disulfides between Cys thiols and MPAA is frequently seen without affecting the ligation outcome. Then, the addition of TCEP and the third segment triggers the second ligation step and the formation of the full-length polypeptide.

Figure 6

SEAoff latent thioester. (a) Sequential NCL and SEA-mediated ligations enable the redox-controlled assembly of three peptide segments in one-pot. (b) Fmoc-SPPS of SEAoff peptides.

SEAoff latent thioester. (a) Sequential NCL and SEA-mediated ligations enable the redox-controlled assembly of three peptide segments in one-pot. (b) Fmoc-SPPS of SEAoff peptides. The process described in Figure a provides a straightforward access to proteins made of 100–150 amino acids, if individual peptide segments are produced by SPPS. For comparison, the size of the protein domains observed in 3D-structure databases shows a narrow distribution with a maximum frequency of around 100–150 amino acids.[37] Therefore, such a three peptide segment assembly process covers many needs in chemical biology programs. It can be set up with ease considering the facility to access SEAoff peptides by 9-fluorenylmethyloxycarbonyl (Fmoc) SPPS using SEA solid supports (Figure b).[34] Very recently, Kajihara and co-workers established a means for accessing C-terminal SEAoff polypeptides from recombinant precursors produced in E. coli.[38] Therefore, the redox-controlled assembly of large proteins according to Figure is now potentially feasible using peptide segments of extended length produced in live cells. An alternative to access large protein targets consists of an extension of the principles of the solution process depicted in Figure to a water-compatible solid support for the stepwise concatenation of peptide segments.[2,39,40] Doing so pushes the concept of pot-economy one step further and provides a potential solution to automated chemical protein synthesis.[14] The performance of the one-pot redox-controlled three peptide segment assembly method can be appreciated with the total synthesis of SUMO proteins, a type of post-translational modification related to ubiquitin.[41−43] The synthesis of a SUMO-2-SUMO-3 dimer protein analog having a size of 21 kDa is described in Figure .[41] The assembly of the SUMO-2 domain was performed at Step 1 using a cysteine residue naturally present in its central position. A cysteine residue was introduced on the side chain of Lys[11] within the SUMO-3 domain to facilitate the formation of a branched product upon SEA-mediated ligation with the SUMO-2 domain obtained from the previous step.

Figure 7

Total synthesis of a SUMO-2-SUMO-3 dimer protein analog (adapted from PDB 2mp2). The color code corresponds to the ligated peptide segments; see Figure .

Total synthesis of a SUMO-2-SUMO-3 dimer protein analog (adapted from PDB 2mp2). The color code corresponds to the ligated peptide segments; see Figure . The second application we want to discuss is a concrete illustration of the power of chemical protein synthesis for studying protein function. It had to overcome the challenge of folding proteins stabilized by multiple disulfide bonds, a problem to which protein chemists are frequently confronted after having assembled the linear polypeptide precursor. In our case, we have achieved native folds in good yields by using redox glutathione buffers. Note that the understanding of protein folding and the development of methods enabling a full control of the disulfide bond pattern upon folding is a timely topic with recent remarkable achievements.[44] In the work summarized in Figure , we used SEA chemistry to produce milligram quantities of the biotinylated kringle 1 (K1) domain from the hepatocyte growth factor (HGF). This synthetic protein enabled one to clarify the role of the K1 domain in the binding and activation of the MET tyrosine kinase receptor. In particular, a biotin label was installed on the protein to investigate the role of multivalency in the agonistic activity of the K1 domain using streptavidin (S) as a presentation platform. Contrary to the monomeric K1 domain, semisynthetic K1/streptavidin mixtures of complexes containing (K1)2S as the major component displayed strong agonistic activities.[45] This observation motivated the design of a recombinant covalent K1K1 dimer molecule linking two K1 domains in tandem, which displays even stronger MET agonist activity in vitro and in vivo.[46]

Figure 8

From the chemical synthesis of the biotinylated HGF K1 domain to the design of a potent recombinant MET agonist made of two K1 domains in tandem. (a) Sequence of the biotinylated K1 domain. The color code corresponds to the ligated peptide segments; see Figure . (b) MET agonist activity of K1 constructs shows a strong dependence on multivalency (K1: PDB 1bht; K1K1dimer: PDB 7ocl). One hallmark of the SEAoff system is to keep its latent properties regardless of the nature of the R group linked to the SEA amide carbonyl (Figure a). In contrast, the reactivity of the SEAon system is dramatically affected by the electron withdrawing capability of the R group. These features enable us to significantly extend the scope of SEA chemistry. For example, the SEAoff group can be easily installed on aspartic or glutamic acid side chains and serve as a ligation site to access elaborated peptide scaffolds.[36] The deportation of the SEA group from the C-terminal position to the side chain of Asp/Glu is at the price of a reduction of the acyl donor capability due to the increased distance between the electron withdrawing alpha nitrogen and the SEA carbonyl in the Asp/Glu(SEA) systems. Oppositely, the introduction of a strong electron withdrawing group in R as in the oxalyl SEA (oxoSEA) system enabled us to achieve exceptional rates approaching 30 M–1 s–1, a property that permitted ligation to proceed in the nanomolar range in purified or complex media such a cell lysates.[4]

Figure 9

Rate of SEA-mediated ligation increases with the electron withdrawing capability of the R group linked to the SEA group carbonyl. (a) Relative acyl donor capabilities for different SEA systems. (b) X-ray structure of the oxalyl SEA group.[4] (c) The SEA 1,2,5-dithiazepane heterocycle adopts a chairlike conformation,[4] similar to that described for 5-phenyl-1,2,5-dithiazepane.[47] To sum up at this stage, the SEAoff amide group discussed above enriches the growing family of latent thioester systems, which include peptide hydrazides[48,49] and some N,S-acyl shift systems[50,51] such as protected cysteinyl prolyl esters[52] or photocaged SEAlide.[53] However, at this stage, the SEAoff group appears as a single shot rifle enabling no more than three peptide segments to be assembled in one-pot (see Figure ). As a logical continuation of our work, we wondered if it would be possible to assemble more than three peptide segments in one-pot under redox control? The answer to this question is yes, and the next section describes how we an achieved such a goal.

Sea: From Sulfur to Selenium

The extension of the concept depicted in Figure to the assembly of four peptide segments in one-pot put us on the quest of a redox-controllable thioester system that can stay latent under the reducing conditions allowing SEA-mediated ligation (Figure ). The nature of the R group attached to the SEAoff latent system is not expected to significantly change the redox properties of the cyclic disulfide or its accessibility because the chair conformation adopted by the 1,2,5-dithiazepane heterocycle projects the nitrogen substituent away from the disulfide bond (Figure b,c). Therefore, playing with the nature of the R group was not envisaged as a mean for diversifying the SEAoff family with respect to their sensitivity to reductants. An alternative would be to substitute the carbon atoms of the 1,2,5-dithiazepane scaffold, which are closer to the disulfide group, but at the expense of increased synthetic complexity.

Figure 10

Design of the bis(2-selanylethyl)amido (SeEA) latent thioester. (a) The cyclic diselenide SeEAoff is stable toward MPAA and DTT. (b) Key experiment showing the selective and sequential activation of SEAoff and SeEAoff systems. A simple solution to this problem was achieved by substituting the sulfur atoms in the SEAoff group by selenium ones to place the acyl donor under the redox control of a cyclic diselenide bond (SeEAoff, Figure a).[54] By doing so, we hoped that the SeEAoff group would resist reduction and thus activation by DTT, a strong thiol reductant that we knew to be an alternative to TCEP for unlocking the SEAoff group. This idea was supported by the work of Iwaoka et al., who showed that the cyclic diselenide analogous to DTT cyclic disulfide could not be reduced by an excess of DTT.[55] In practice, the SeEAoff group proved to be fully stable in the presence of MPAA and DTT, conditions that enable SEA-mediated ligation to proceed efficiently (formation of peptide 11, Figure b). No trace of ligation product between SeEAoff peptide 10 and Cys peptide 8 could be detected by HPLC upon addition of DTT to the peptide mixture, even after prolonged reaction times. In contrast, the addition of TCEP in the mixture led to the reduction of the Se–Se bond and triggered the formation of ligation product 12 in high yield. The high selectivity achieved upon the activation of SEAoff and SeEAoff latent thioesters enabled us to assemble four peptide segments in one-pot through a sequential NCL/SEA/SeEA ligation process.[54] In addition to this, we also established that the S → Se substitution in the SEA group offers another asset that is to significantly increase the rate of the ligation process on going from SEA to SeEA (Figure a).[1] The more than 10-fold difference in the rate of SeEA and SEA-mediated ligations enables one to assemble three peptide segments in one-pot through a KCL approach (Figure b). Note that SeEAoff peptides are produced from SEAoff peptides by exchange with the bis(2-selenoethyl)amine (Figure c). An SPPS method for accessing SeEA peptides by Fmoc-SPPS is highly desirable and remains to be developed.

Figure 11

Impact of S → Se substitution on the acyl donor capability of the SEA/SeEA groups. (a) The kinetic data were obtained using Cys peptide CILKEPVHGV-NH2 at pH 7.2, 100 mM MPAA, 100 mM TCEP, and 37 °C (pseudo-first order rate law, data taken from ref (1)). (b) KCL of three peptide segments using SEA and SeEA-mediated ligations. (c) Synthesis of SeEA peptides. Although the SeEA/SEA-based KCL process is of great interest on its own, its value is heightened by the possibility of combining it with the SeEA/SEA redox-controlled approach presented before (Figure ). This is possible because, in the redox-controlled assembly strategy utilizing SEAoff and SeEAoff functionalities, the SeEAoff group is unmasked in a late stage (Figure b), while it is activated at the beginning of the elongation in the KCL approach (Figure b). The connection of the above redox and KCL assembly processes enabled us to produce the 20 kDa biotinylated NK1 polypeptide NK1-B from the six peptide segments A–F (Figure ). The first two one-pot processes were redox-controlled and yielded SeEAoff segment ABCD, which was isolated and subsequently engaged in a KCL process with segments E and F to complete the synthesis of NK1-B.

Figure 12

Chemical synthesis of NK1-B protein. (a) Sequence of NK1-B protein. (b) Total synthesis of biotinylated NK1 protein by combining SeEA/SEA-based redox- and kinetically-controlled assembly processes.

The Search for a Redox-Controlled Cys Surrogate

So far, we discussed the gain in pot-economy provided by the use of redox-controlled thioester surrogates in the chemical synthesis of proteins. In this regard, the synthesis of the NK1-B protein shows that a 20 kDa protein can be assembled through three successive one-pot processes without resorting to Cys PGs. Though always desirable, avoidance of the use of Cys PGs during SEA/SeEA-assisted protein synthesis is not always possible. Without dwelling on the rare cases when the primary sequence of the protein starts with a Cys residue, the resort to a Cys PG strategy cannot be avoided when a free Cys is required after the elongation step for further protein modification using the NCL reaction. The problem is illustrated in Figure a with the synthesis of a backbone cyclized protein from two peptide segments.

Figure 13

Access to sophisticated protein scaffolds such as cyclic proteins requires the late chemical manipulation of Cys residues. (a) Classical approach to the redox-controlled assembly of a cyclic protein using a protection strategy for the N-terminal Cys. (b) One-pot assembly of a cyclic protein using redox-controlled latent thioesters and Cys surrogates. In recent years, significant progresses have been made to enable the deprotection of Cys in one-pot after NCL.[56,57] On the forefront are the noble metal-assisted chemistries developed by Brik’s group.[57] On our side, we searched for a redox-controlled Cys surrogate whose sensitivity to reductants could mirror that of SEAoff, as such a combination might considerably simplify the access to sophisticated protein scaffolds in one-pot (Figure b). Though a simple idea, it took us years to find a solution to this problem, and we did so with the generous help of serendipity. The difficulty in designing a redox-controlled Cys surrogate comes from the lability and dynamic behavior of acyclic disulfides or selenosulfides derived from Cys thiol under classical NCL conditions. We noticed that simply appending a 2-mercaptoethyl limb to the α-amino group of cysteine results in a 1,5-dithiol structure that can form a 7-membered cyclic disulfide of the type we are looking for, i.e., SutCys in Figure a. Unfortunately, it can be easily predicted that such a Cys derivative can hardly be converted back to cysteine owing to the well-known difficulty in breaking a bond between an aliphatic carbon and a nitrogen atom,[3,58] in line with previous studies on the N-(2-mercaptoethyl) auxiliary in the early days of NCL.[59] With these considerations, we concluded that the Cys residue offers no obvious attachment point that could enable the design of a self-immolative cyclic disulfide.

Figure 14

Conception of the SetCys redox-controlled cysteine surrogate. (a) Structure of SutCys and SetCys residues. (b) Proposed mechanism for the conversion of SetCys into Cys upon reduction.

Conception of the SetCys redox-controlled cysteine surrogate. (a) Structure of SutCys and SetCys residues. (b) Proposed mechanism for the conversion of SetCys into Cys upon reduction. The quest for a redox-sensitive Cys surrogate came again on stage when we discovered that the selenium analogue of SutCys called SetCys can spontaneously lose its selenoethyl arm upon reduction by TCEP or 3,3′,3″-phosphanetriyltris(benzenesulfonic acid) trisodium salt (TPPTS, Figure ).[3,60] We showed that the loss of the selenoethyl arm proceeds by an ionic mechanism whose rate is the fastest at pH 6–6.5, i.e., a pH compatible with NCL or SEA-mediated ligation. Our investigations strongly suggest that the cleavage of the C–N bond proceeds through an intramolecular substitution of the protonated SetCys alpha amino group by the selenoate as depicted in Figure b. Fmoc-protected SetCys amino acid can be produced in gram scale from Cys. Introduced by Fmoc-SPPS in peptide segments, it comes to enlarge the tiny family of redox-controlled Cys surrogates, another interesting example being the α-azido cysteine that is activated by TCEP through a classical Staudinger reaction.[61] As anticipated, SetCys can synergize with SEAoff latent thioester to access elaborated cyclic protein molecules using exactly the principle depicted in Figure b. In practice, the first step of such a process corresponds to a regular NCL reaction under weak reductive conditions (excess MPAA, Step 1, Figure ). Under such conditions, the SetCys and SEAoff groups remain silent until TCEP is added to the mixture to trigger the second cyclative SetCys/SEA-mediated NCL process. The method was used to produce cyclic variants of the K1 HGF domain varying by the length of the linker joining K1 N- and C-termini. The backbone cyclized cK1 polypeptide was folded successfully and assayed for its agonistic activity on the MET receptor.

Figure 15

One-pot synthesis of a cyclic variant of the K1 HGF domain using a redox-controlled NCL-SetCys/SEA assembly process.

Conclusion and Outlook

The simplification of the chemical synthesis of a large variety of protein molecules is an important goal to pursue to make the chemical synthesis of proteins more widespread and easier to implement in the research lab as well as in industry for large scale production. The facilitation of the assembly of the polypeptide chain is one part of this big challenge. In this Account, we described our advances in the development of redox-controllable latent thioesters and cysteine surrogates. The structures of these chemical systems have in common the presence of a seven-membered ring cyclic dichalcogenide, the reduction of which triggers the ligation process. The placement of the formation of peptide bonds under the dependence of the reduction of disulfide, selenosulfide, or diselenide bonds provided the means to combine these systems with great selectivity just by experimenting with common and harmless reductants. One hallmark of the systems we have designed is the clustering of several O, N, S, and Se atoms in and around the seven-membered ring structure. This enabled us to achieve the desired properties by rendering C–N bonds spontaneously cleavable in water under mild conditions, while such a bond is usually very stable in a normal context. There is certainly more research to do in this direction to explore novel controllable reactions that can be set up with minimal synthetic efforts, cost, and waste.

48 in total

Redox-Controlled Chemical Protein Synthesis: Sundry Shades of Latency.

Introduction

Chemical Protein Synthesis: The Search for Pot-Economy

Latent Functional Groups: An Old Trick for New Challenges

Carboxamides as Redox-Controlled Latent Thioesters

Sea: From Sulfur to Selenium

The Search for a Redox-Controlled Cys Surrogate

Conclusion and Outlook

1. A one-pot total synthesis of crambin.

2. Kinetically Controlled Chemoselective Cyclization Simplifies the Access to Cyclic and Branched Peptides.

3. The total synthesis of an enzyme with ribonuclease A activity.

Review 4. Strategies and open questions in solid-phase protein chemical synthesis.

Review 5. Chemical Protein Synthesis in Medicinal Chemistry.

6. Synthetic catalysis of amide isomerization.

7. Thiol Catalysis of Selenosulfide Bond Cleavage by a Triarylphosphine.

8. Synthesis of proteins by native chemical ligation.

9. Pedal to the Metal: The Homogeneous Catalysis of the Native Chemical Ligation Reaction.

10. Efficient Chemical Protein Synthesis using Fmoc-Masked N-Terminal Cysteine in Peptide Thioester Segments.