Patrick G Isenegger1, Benjamin G Davis1. 1. Chemistry Research Laboratory, Department of Chemistry , University of Oxford , Mansfield Road , Oxford OX1 3TA , United Kingdom.
Abstract
The manipulation and modulation of biomolecules has the potential to herald new modes of Biology and Medicine through chemical "editing". Key to the success of such processes will be the selectivities, reactivities and efficiencies that may be brought to bear in bond-formation and bond-cleavage in a benign manner. In this Perspective, we use select examples, primarily from our own research, to examine the current opportunities, limitations and the particular potential of metal-mediated processes as exemplars of possible alternative catalytic modes and manifolds to those already found in nature.
The manipulation and modulation of biomolecules has the potential to herald new modes of Biology and Medicine through chemical "editing". Key to the success of such processes will be the selectivities, reactivities and efficiencies that may be brought to bear in bond-formation and bond-cleavage in a benign manner. In this Perspective, we use select examples, primarily from our own research, to examine the current opportunities, limitations and the particular potential of metal-mediated processes as exemplars of possible alternative catalytic modes and manifolds to those already found in nature.
Principles of catalysis rightly dominate
the logic of chemical
and biological transformations. Beyond the immediate advantages of
efficiencies in time and yield, ever-present in chemical thinking,
there are also immediate strategic advantages that nature exploits
extensively. Among these are critical features of control, feedback
and diversification achieved by adding (and sometimes dynamically
removing) available structural features to proteins after or during
expression. Such elaboration of structure can powerfully modulate
function. These are dominated in nature by post-translational modifications
(PTMs, Figure ) achieved
largely through enzymatic side-chain alterations ranging from the
small (methylation, acetylation, phosphorylation) to the complex (glycosylation,
peptidylation). Cascades of catalysis allow molecular amplification
of such on-protein marks or signals in biology to drive the most fundamental
physiologically relevant processes: signaling (e.g., via activatory,
cytoplasmic phosphorylation cascades at Ser/Thr/Tyr), folding and
energy storage (e.g., via oscillatory glycosylations-deglycosylations
at Tyr/Asn), protein degradation (e.g., via deactivatory, intracellular
ubiquitylation-proteolysis at N-termini-/Lys/backbones)
and even wound responses (e.g., via self-regulatory, intravascular
proteolytic (thrombolytic/genic) cascades at backbone zymogenic sites).
All transmit and amplify molecular information signals at efficiencies
and turnover numbers (and often frequencies) that are currently inaccessible
to even the best synthetic or chemical methods and do so in selectivity
modes that cannot yet be considered. Yet the catalytic principles
involved are immediately accessible and, if correctly applied both
benignly and selectively, “new-to-nature” catalysis
could interface with and exploit such natural catalytic modes in a
striking manner: this highlights a powerful strategic opportunity
for the control of biology and physiology via abiotic, catalytic post-translational
chemistry. Many such catalyst modes may be considered; we see particular
strategic merit in those that are metal-mediated and this review will
focus primarily on these, while, at the same time, considering them
only to be illustrative.
Figure 1
Protein modification strategies to make or mimic
natural post-translational
modifications (PTMs). Tag examples chosen here focus on use of metal-mediated
catalysis (see also Figure ).
Protein modification strategies to make or mimic
natural post-translational
modifications (PTMs). Tag examples chosen here focus on use of metal-mediated
catalysis (see also Figure ).
Figure 2
Protein modifications using metal-mediated catalysis. Tag orientation
determines cycle and hence reactivity and selectivity.
Key principles emerge. To study
or modulate a specific biological
event mediated by a modified protein then needed functionalization
should likely occur precisely at a given position of interest with
excellent site-selectivity. This site-selectivity arises, in essence,
from both chemo- and regio-selectivity. This may prove challenging
not only given the density of potentially competing functional groups
in proteins but also the high copy-number of certain reactive amino-acids
(issues of chemo- and regio-selectivity, respectively). Some generalities
exist, however, that can aid strategy. For example, many such competing
groups are hard Lewis bases. Therefore, use of more rare (low copy
number) soft Lewis bases, such as naturally found in cysteine (Cys),
can provide a (partly) chemically distinct “handle”
(e.g., with “soft” electrophiles). Alternatively, more
distinct (“orthogonal”) unnatural amino acids, some
with highly conditional reactivity, can be used. When these functional
groups are site-selectively positioned, as a “tag” (in
a first step), they can then be chemoselectively addressed (“modified”
in a second step) in a two-step “tag-and-modify”[1] approach that results in site-selective installation
of chosen alterations typically as non-native mimics (Figure ). We find the origins of such
an approach in methods first delineated by Wilchek et al.[2] and developed by seminal contributions of Koshland,
Bender, Lowe, Jones and Hilvert among others.[3−6] Notably, the variety of bond-forming
processes employed in such approaches still remains limited and few
allow native or closely mimicking modifications;[7] most synthetic bioconjugations are still far from “traceless”
or “native” (Figure ).Although not essential for “tag”
positioning, ready
integration with protein expression can exploit forms of “codon
reassignment”. Thiscan allow “geneticcontrol”
of the insertion of unnatural amino acids through the reassignment[8,9] of sense codons (e.g., via use of Met-auxotrophs to reassign the
Met codon) or of nonsense codons (e.g., via use of compatible synthetase-tRNA
pairs in the suppression of stop, typically amber, codons). In this
way, simple alteration of codons in a gene sequence can code for “tag”
position. If combined with mutation, in the case of sense codons,
to “free-up” those codons (e.g., by converting all Met
codons to those for near-isosteric Ile) then even greater positional
freedom for the “tag” may be achieved. This is one way
in which direct interfacing of “tags” with living systems
can be considered [as “stumps” smuggled into biology
for further chemical “grafting”] and, of course, those
systems may be used to readily prepare proteins for in vitro use. This said, some thought must be given to the intolerance of
“ribosomal filtering” for many structures, which despite
continued advances, limits the diversity of “tags” and
prevents the direct incorporation of many of the more complex PTM
side-chains (e.g., glyco-amino acids). For example, most of the engineered
systems used to suppress stop codons still typically generate closely
resembling “tags” to the their parent systems (lysine-
or tyrosine-like) in their chemical and structural properties.[9]The choice of both the transformation used
to “modify”
and any associated catalyst should make reference to biology in choosing
not only appropriate conditions but function: what rates or processes
provide (likely complementary) value in interrogating or modulating
biological activity? Biologically ambient conditions are apparent:
at or below 37 °C, (part) aqueous solvent (with only trace organic)
at moderate (buffered) pH 6–8, often dense and/or at high ionic
strength. Furthermore, although not essential, ready correlation of
structural change with functional outcome benefits from nearcomplete
conversions. This may be aided by the resulting product showing stability
in aqueous media and biology (e.g., toward hydrolysis or catabolism).
In this regard, metal-catalyzed (e.g., carbon–carbon or carbon–heteroatom)
bond-forming reactions throw up strong candidates, in part due to
their complementarity to (and hence nonprocessability by) natural
bond-breaking.Some strong catalyticcandidates that are highly
efficient in their
processing of small molecules appear to fit-the-bill, including some
that even display good moisture-compatibility or even act in aqueous
solutions. However, limitations arise not only in catalyst solubility
but reagent/substrate solubility and the potential for protein-as-ligand
engagement. The latter leads potentially to sequestration, poisoning,
degradation (induced proteolysis) or simply confounded reaction analysis
(e.g., suppressed MS ionization). It is of interest that this may
not necessarily affect reactivity or selectivity in a negative manner.[10]Perhaps foremost among these challenges
are those of protein concentration.
Indeed, compared to typical small molecule transformations (M–mM),
concentrations of protein are μM–nM: higher levels drive
protein (co)aggregation and even within most protein crystals these
are estimated to be only mM.[11] A perhaps
overgeneralized analysis of enzymaticcatalysis highlights that nature
solves these issues by using substrate recruitment to Michaelis complexes
to drive effective molarities in a manner that contrasts with the
likely nonsaturative modes of many small molecule catalysts. Evolutionary
pressure is therefore often brought to bear upon KM driven by available/intracellularconcentrations.[12,13] By contrast, iterative design “pressures” upon small
molecule catalysts focus on turnover (numbers and frequencies).Effective rate may need to be key in biological contexts (e.g.,
to trap transient events or to minimize competing degradation) and,
as a result, in synthetic protein alterations relatively high concentrations
of catalysts are often used. In addition, unlike many other convergent
modes of synthesis, one reaction partner, i.e. the protein substrate,
is normally (but see radiolabeling below) strategically far more important.
Therefore, the need to drive conversion of the protein to completion
leads also regularly to excesses of modifying reagent as well as catalyst.
While arguably inefficient, this proves feasible due to ready separation
of product from small-molecule reagents and catalysts using a variety
of what might be termed “size-exclusion” methods (often
gel permeation- or dialysis-based); in this sense, the protein itself
is acting also as a “purification hook” and some strategic
similarity with solid-phase methods exists. It may also be argued
therefore that a primary focus of the field, to date, has been to
engender and exploit selectivity (via enhanced relative rate) rather
than reactivity per se. Given this use at often superstoichiometric
levels, it also raises fundamental questions about the true strategic
role of the catalytically competent intermediates that are being exploited
(catalysts or reagents?). This too then gives rise to necessary considerations
of the required order of recruitment of substrates to catalyst centers
(perhaps driven by excess or relay). That, in turn, then requires
consideration of which (e.g., metalated) intermediates should then
be protein-bound (or not, Figure ) and hence a consideration
of the “tag” type required in the protein, i.e. “tag
orientation” (Figure ).Protein modifications using metal-mediated catalysis. Tag orientation
determines cycle and hence reactivity and selectivity.In this Perspective, we aim to use select metal-mediated
manifolds
as examples that not only illustrate these limitations and considerations
but highlight their functional potential through examples that focus
on mimicry of biologically relevant PTMs (Figure ), as well as some mimicry of the modes of
natural catalysis (cycles and cascades of nested selectivity) that
introduce and remove such PTMs.
Copper-Mediated Azide–Alkyne
Addition
The early promise of generating functional effectiveness
as well
as the clear appetite for such conjugation methods is perhaps no better
illustrated than by the now near-classical, transition-metal-catalyzed
variants of Huisgen–Dimroth–Michael 1,3-dipolar-cycloaddtions
between azides and alkynes (CuAAC). These form triazoles with increased
(1,3) regioselectivity[17,18] as well as sufficient rate acceleration
to suggest compatibility under biologically benign conditions (water
with a range of pH and oxygen). When coupled with ready “tags”
such as Met-analogues homopropargylglycine (Hpg)[19] azidohomoalanine (Aha)[20] or
via amber codon suppression with Tyr analogues such as p-propargyloxyphenylalanine[21] these allowed
early lessons in on-protein efficiency and biological function to
be learnt. When compared at identical protein sites azido-“tags”
proved almost an order of magnitude more efficient (under pseudo-first
order conditions) than similar alkynyl (e.g., Ahacf Hpg).[14] These observations are consistent
with recent mechanistic observations[22] implicating
alkyne as a key proto-demetallating species in both mono- and bis-copper(I)cycles and with the greater effectiveness of generating an excess
of cuprated-alkyne to react with low concentration protein azides:
this highlights the need for strategicconsiderations, i.e. an azido
“tag” > alkynyl “tag” orientation (Figure ). Initial suggestions
of nonspecifictoxicity from observations in cell-surface labeling
studies[23] have more recently been attributed
to forms of oxidative damage at Cys, Met and even His via oxygen-derived
ROS.[24] These highlight the utility of high-purity
Cu(I) reagent, ligands that stabilize Cu(I)/(II) and outcompete nonspecific
protein sites as well as the use of sacrificial ligands as oxidative
“buffers”.The compatibility of this optimized
chemistry in complex protein
assembly and the utility of 1,3-triazole as a tolerated link motif
for functional mimicry was illustrated through sequential protein
modifications combining triazolechemistry with kinetically controlled
disulfide formation (Figure a).[14] These allowed site-selective
multisite attachment as well as differential modification attachment
with essentially complete conversion to test mimicry of dual PTM patterns,
including dual differential glycosylation[15] or sulfo-Tyr/glycosylation patterns.[14] In this way, creation of a mimic of the white-blood cell-surface
protein P-selectin-glycoprotein-ligand-1 (PSGL-1) was accomplished
bearing sTyr-like and sialyl-Lewis-x-like PTM moieties; these engendered
comparable binding properties to the mimic as the wild-type to cognate
receptor CD62P and so created a protein that could function in vivo to visualize inflammation (Figure b).[14] Thischemistry
also proves sufficiently robust to create virus-decoy particles at
extremes of valency, creating protein constructs bearing even up to
1,620 glycans.[25] These latter glycodendriprotein
particles displayed striking, sugar-mediated binding function, thereby
preventing mammaliancell infection by Ebola pseudotyped virus through
competitive blockade of the DC-SIGN receptor with IC50s
in the nanomolar to picomolar range. Rates of reactions for such on-protein
CuAACare also high enough to allow ready use of common radioreagent
[18F]-FDG as a prosthetic radiolabel in proteins, despite
its short halflife.[26]
Figure 3
CuAAC combined with other
modifications allows either (a) dual
display of different PTM mimics[14,15] or (c) nested selectivity
in sequential catalysis.[16] (b) Such mimics
function even in vivo allowing a PSGL1-mimic to visualize
CD62P as a marker of inflammation. [Adapted from reference (14) with permission from SpringerNature.]
CuAACcombined with other
modifications allows either (a) dual
display of different PTM mimics[14,15] or (c) nested selectivity
in sequential catalysis.[16] (b) Such mimics
function even in vivo allowing a PSGL1-mimic to visualize
CD62P as a marker of inflammation. [Adapted from reference (14) with permission from SpringerNature.]Apparent considerations of linker-dependent
compatibility with
biological function were highlighted in later model systems in a demonstrated
but select tolerance with subsequent enzymatic processing (here glycosylation, Figure c). This allowed
overall chemo-enzymatic modification in a manner that, in part, mimics
the sequential glycosylation processes of post-translational O-glycosylation in the creation of synthetic glycoproteins.
Thus, combination of a mutually compatible syntheticCu(I)-driven
GlcNAc attachment cycle with the use of an unnatural oxazoline substrate 1 in a transglycosylation cycle processed by the hydrolytic
endoglycosidase Endo-A allowed overall attachment of a Man-GlcNAc-GlcNActrisaccharide site-selectively into proteins (Figure c). Importantly, the Aha-derived N-triazole-linked GlcNAc-protein proved to be far the most
efficient substrate for Endo-A, allowing even selection of one product
triazole variant over another from mixtures. Albeit in a short sequence,
this biomimetically highlighted the role of a precursor (here Cu(I)-driven)
cycle in determining a pathway guided (or filtered) by a subsequent
(here Endo-A-driven) cycle. Such catalytic sequences may also be inverted
(e.g., Endo-S then CuAAC).[27] Such layers
of selectivity derived from sequential catalyticcycles we postulate
will prove critically powerful in application to more complex environments
(see also below PdCC on GYG for another selective metal-then-enzyme
catalysis sequence).
Ru-mediated Olefin Cross-Metathesis
Ready C–heteroatom bond-formation to generate heterocycles
may prove attractive in proteins, particularly in high-stability motifs
(e.g., triazoles via C—N) that display apparent functional
compatibility, as those above (Figure ). However, there will undoubtedly be need for more
subtle attachment methods that lack such relatively bulky heterocyclic
“scars”. This dictates more discreet motifs, of which
high stability C=C bonds could prove useful and versatile candidates.
Ru-mediated olefincross-metathesis (OCM) therefore provided an excellent
possibility in its known air and moisture stability, strong chemoselectivity
for a nonproteinogenic functional group and useful tolerance for other
functional groups. Indeed, such use of olefins as reactive tags in
proteins has been suggested for more than two decades.[28]Successful application of OCM in proteins,
hinged on what were
initially counterintuitive observations. In our hands[29] and those of others, simple olefins provided, e.g. by tags
such as homoallylglycine, fail. This failure and the subsequent solution
illustrates additional useful principles in catalysis related to tag
orientation (Figure ). While semitolerant of moisture, necessary Ru alkylidenes eventually
decompose in protic solvents. Reaction with protein at a low concentration
therefore, in effect, becomes a reaction controlled by relative decomposition
rates vs those of protein substrate recruitment. Reactive relaying
methods have been powerful tools in enabling OCM but the potential
of ligand relaying by some types of motifs had perhaps been overlooked.
Among these, chalcogenic (e.g., S(II) thioether) motifs had been,
in fact, identified as poisoning to certain Ru-mediated CM processes.[30] However, screening of olenificthioethers in
amino acids side-chains[29] revealed that
chalcogenicRu-coordination ability could prove wholly beneficial
in aqueous OCM, if positioned correctly: allylicchalcogens proved
highly effective.[30] As a result allylchalcogen
protein “tags” for protein OCM have emerged that can
be installed chemically, such as S- (Sac) and Se-
(Seac) allylcysteine variants, using a variety of methods that include
either nucleophilicconjugate addition Cβ—S[29] or Cβ—Se[31] bond formation from unnatural amino acid precursor
dehydroalanine (Dha) or via 2,3-sigmatropicCβ—S
bond formation from Cys-derived selenenylsulfide precursors (Figure a).[32] More recently, we have shown that the homologated allylsulfide
“tag” residue S-allylhomocysteine (Ahc)
can also be installed genetically as a Met surrogate (Figure a).[33]
Figure 4
(a)
RuOCM is enabled by various modes of chemical (for Sac, Seac)[29,31,32] or genetic[33] (for Ahc) allylchalcogen tag installation as privileged
motifs. (b) RuOCM may be embedded in mimics of write–read–erase
cycles.[31]
(a)
RuOCM is enabled by various modes of chemical (for Sac, Seac)[29,31,32] or genetic[33] (for Ahc) allylchalcogen tag installation as privileged
motifs. (b) RuOCM may be embedded in mimics of write–read–erase
cycles.[31]The recruiting ability of the chalogen heteroatoms in these
side-chains
toward readily available Hoveyda–Grubbs second generation catalyst
(HGII) proved to be a critical determinant in choice and exploration
of appropriate catalyticcycle and “tag” orientation
(Figure ); this enables
efficient OCMs in a variety of proteins[29,31,33,34] at rates dictated by
on-protein rate constants of the order of 10–1 M–1 s–1[31] (and therefore comparable in rate with faster CuAAc reactions).
Effective Ru-mediated ring-closing metathesis in proteins with allylicoxygen as the heteroatom in olefinic tags has also been observed.[35] It should be noted that not all proteins are
compatible with t-BuOHcosolvent that is typically
required for sufficient HGII dissolution. Nonetheless, RuOCM has allowed
attachment of a variety of different olefin-derived moieties including
PEG-ylation, fluorophores and, perhaps more relevantly, allylamines
that provided access to modified Lys-mimetics (see below, Figure b). Computational
analyses[31] suggest a reaction profile in
which the Ru-carbene intermediate after “Se-relay” is
enhanced in stability over that from “S-relay” and is
also accessed through a lowered transition state. The enhancing effect
of linker-extended side-chains to allow greater accessibility[34] also appears consistent with the key role of
such chalcogen-mediated recruitment.Notably, addition of MgCl2 as a source of competitive
hard Lewis acid has proven crucial to prevent nonproductive chelation
of Ru by nonspecific amino acid residues in proteins.[29] This too is illustrative of broader principles: absolute
avoidance of likely protein–metalcoordination is not at all
necessary; productive reactivity may still be gained simply through
effective competition to generate a sufficiently available catalyst
pool. Again, we suggest that such dynamic, competitive principles
are reflective of those exploited in complex biological media.Such protein–RuOCM has various potential benefits of reversibility
(as yet unrealized in proteins) of C=C bond formation and of
putative biomimicry. In one such demonstration of the latter, installation
of Seac (from Dha) into histone H3 at the relevant K9 site enabled
the use of allylacetamide as an olefinic reagent (that is typically
sluggish in CM) to install (“write” in the language
of epigenetics) a mimic of the side-chain K9Ac (Figure b). The resulting protein mimic of histone
H3-K9Ac was successfully “read” by anti-LysAc antibodies.
It could then be removed (or “erased”) through oxidative
Cope-type selenoxide elimination to reaccess the Dha precursor (and
so go “full-circle”). As for Cu-mediated methods, this
biomimetic in vitro ‘write–read–erase’
cycle, in part mediated by abioticRu-catalysis, again suggests that
relevant exploration of biology will prove possible through insertion
of chemical catalysis into sequences that mimic nature.
Pd-mediated Cross-Coupling
This use of an abioticmetal as an effective and selective inducer
of function raises the potential of other synthetically powerful metals
as similar triggers in Biology. None has been more powerful in small
molecule chemistry than Pd.[36] Pioneering
early use of Pd in proteins[37] (see also
below and reviews[38−40]) had revealed low conversions and limitations in
proteins caused, in part, variously by the nature of the Pd source,
protein loss, need for cosolvents and attempted use of phosphine-based
ligands that are typical in small molecule cross-coupling (CC). As
for CuAAC, considerations of tag-to-reactant orientation are mechanistically
pertinent (Figure ). Installation of the reductively susceptible partner, e.g. arylhalide,
as the “tag” in, e.g. p-iodophenylalanine
(pIPhe), carries advantages of not only good incorporation
efficiency (pIPhe is one of the best incorporated amber suppressors[41]) but in principle allows putative flexibility
toward different CC manifolds (see also below). It also, however,
necessitates productive interception of palladated-protein intermediates
over competitive side processes such as proto-depalladation, a potential
disadvantage when considering tag orientation (Figure ). Conversely, incorporation of p-boronophenylalanine (pBPhe) as “tag”,
while bringing with it the potential to use, e.g. Ar-[Pd]-Hal, intermediates
in excess in Suzuki–Miyaura CC to side-step such side-reaction,
may then become limited by other steps, e.g. poor transmetalation
by (perhaps internally chelated) pBPhe. Indeed, for
Suzuki–Miyaura CC, the latter appears to be limiting giving
only moderate observed conversions at high reaction temperatures (70
°C) likely incompatible with retaining function in most proteins.[42]These prior limits for both “orientations”
of tags
suggested to us the necessity for catalyst variation with an emphasis
on protein utility. Thus, use of disulfide-compatible phosphine-free
ligands, in the creation of fully water-soluble palladium-pyrimidine
(pre)catalysts enabled on-protein Suzuki–Miyaura CC using chemically
installed arylhalides as “tags” with essentially full
conversion for a wide variety of boronic acid partners under mild
reaction conditions (Figure a).[43]
Figure 5
(a) Benign PdCC allows
attachment of various groups using pyrimidine-
or guanidine-based ligands including (b) use at low concentrations
of 18F-prosthetic reagent and (c) self-liganding PEG-ylation
reagents.
(a) Benign PdCC allows
attachment of various groups using pyrimidine-
or guanidine-based ligands including (b) use at low concentrations
of 18F-prosthetic reagent and (c) self-liganding PEG-ylation
reagents.Fuller strategic generality was
then realized by also creating
“tags” through amber suppression with pIPhe.[44] This allowed not only coupling via “genetic
installation” in vitro but also application
to increasingly complex protein structures and contexts, including
those in cellular and living contexts. Effective CC on living (E. coli) cell surfaces enabled not only the labeling
of cellular subpopulations but also highlighted sufficient nontoxicity
as well as the ability to “switch” the functional states
of cell-surface proteins, such as the ion channel OmpC.[45] This illustrated the utility of the strategy
of “tag-and-modify” in which Pd acts as an abiotic,
nontoxic trigger of that switching in living systems. Extension of
the method[46] also allowed the content of
sugars displayed on living cell-surfaces to be so heavily “overwritten”
that the global interaction of cells with corresponding sugar-binding
lectin proteins could be determined simply by Pd-mediated chemistry;
this allowed a form of glycocalyx engineering on living cells.As well as the triggering and modulation of living biology, the
Suzuki–Miyaura CC has also proven to be a useful testing ground
for biotechnologically relevant challenges. As noted above, strategic
focus on the “value” of protein product allows excess
of reagent/catalyst in many in vitro settings (and
this may override or complement considerations based on catalyticcycle design, Figure ). One key exception is found in radiolabeling, where low concentrations
of reagent and short half-lives enforce further constraints. Suzuki–Miyaura
CC-mediated 18F-radiolabeling of proteins using the prosthetic
reagent [18F]4-fluorophenylboronic acid necessitated the
development of new catalytic systems. Screens of alternative pyrimidine-
and guanidine-based ligands at different substrate concentrations
identified dimethylguanidine as suitable for lower boronic acidconcentrations,
allowing generation of 18F-labeled protein in 5% RCY, despite
the short (t1/2 ∼ 110 min) half-life
of 18F (Figure b).[47] PEG-ylation is a widely adopted
method for the modulation of protein pharmacokinetics. Notably, exploration
of Suzuki–Miyaura conditions not only allowed precise PEG positioning
in proteins but also revealed unexpected interactions with glycol
moieties. These in turn led to the development of a self-liganding
system using a methyl-“PEG2K”-derived boronic acid and
Pd without need for external ligand (Figure c),[48] a result
that was subsequently exploited for intracellularPd-delivery.[49]The activity and compatibility of these
systems has more recently
allowed applications to address increasingly more complex questions
in biology. The formation of glycogen, which is a primary energy storage
molecule of many organisms, provides an intriguing challenge (Figure ). Glycogen particles
are initiated from a seed-core protein glycogenin GYG that autoglucosylates
one of its own residues, an anchor Tyr at site 195, to start the polysaccharide.
This self-modification means that, intriguingly, GYG is not strictly
a catalyst and its glucosyltransferase enzymatic state (reactivity
and selectivity) is potentially altered at each individual, unisolable
step. Probing this unusual mode of self-modulation has therefore proven
strikingly difficult to address through traditional biochemical means.
Instead, through the construction of an “OH → I”
(Tyr → pIPhe195) mutation (via amber suppression),
an “off” state of GYGcould be generated that allowed
Pd-mediated shunting into active mimics that were representative of
these intermediate states (Figure ).[50] ThisPd-mediated switching
“on” of GYG was accomplished despite GYG’s own
dependence on transition metal Mn, toward which Pd at extreme concentrations
is inhibitory. In this way chemical (here PdCC) cycles were again
(see above) inserted sequentially with subsequent biocatalytic (here
GYG-autoglucosylation) steps (a triggered cascade), albeit nested
inside that catalyst. It revealed three distinct subphases in “glycogenesis”,
prime–extend–refine, the second of which appears to
be another example of a retentive SNi-like glycosylation,
displaying striking rates and even plasticity toward the sugars that
it utilizes. This highlights that novel syntheticchemical approaches
(e.g., bond-making in biology) may have the power to probe the mechanisms
of biology more deeply than previously possible.
Figure 6
Use of PdCC-mediated
“shunting” allows access to
mimics of intermediate states found in the initiation of glycogen.[50]
Use of PdCC-mediated
“shunting” allows access to
mimics of intermediate states found in the initiation of glycogen.[50]These demonstrated advantages of such activation modes seemingly
compatible with even such “delicate” systems coupled
also with a wide variety of commercially available, potential coupling
partners (boronic acids, silanes, halogenated aromatic systems, alkynes
etc.) suggests a likely proliferation of modes of abiotic-metal-mediated
reactivity adapted for Biology aided by the use of appropriate strategies
and/or catalyst systems discussed above. Indeed, in the past decade
Heck[37,51] and Tsuij–Trost[52] reactions; and Sonogashira CCs,[49,53−55] (using pyrimidine- or PEG-based ligands) have emerged.
In addition Pd-mediated manipulations of peptides have elegantly facilitated
the ligation assembly of peptides and proteins.[56,57]While “tag” selection (and orientation, Figure ) is a key strategic
tool in the recruitment and exploitation of such catalytic systems
in such biomolecule methods, its sometime dependence on less-familiar
methods (e.g., unnatural amino acid incorporation) has seen a continued
interest in application to native residues. Francis’s early
prescient work had highlighted the potential of heteroatomic nucleophiles
in intercepting palladated intermediates,[52] by exploiting Tyr-OH. Thisconcept has more recently also been extended
to the exploitation of Cys-SH as a usefully selective native heteroatom.
Thus, intermolecular[58] and pseudointramolecular[59]S-arylation have been elegantly
demonstrated using preformed aryl palladium ligand oxidative addition
complexes (Figure a).
Figure 7
(a) S-Arylation of native Cys is enabled either
by use of prepalladated oxidative-addition complexes[58,59] or metal-binding-site directed methods.[10] (b) Active site directed S-arylation allows effective
covalent inhibition in enzymes.[10]
(a) S-Arylation of native Cys is enabled either
by use of prepalladated oxidative-addition complexes[58,59] or metal-binding-site directed methods.[10] (b) Active site directed S-arylation allows effective
covalent inhibition in enzymes.[10]This exploitation of native residues
coupled with the known utilization
by some proteins of transition metals raises an exciting possibility
as to whether such CCcould be directed through a self-protein-liganded
manifold by exploiting native metal-binding sites. This “guiding”
of a suitably reactive metal-complex was recently exploited in regioselective
protein functionalization (Figure a,b). In metal-dependent mannosyl-glycerate synthase
(MGS) a variety of arylation events were guided to a Cys adjacent
to the enzyme’s DxD metal-binding motif allowing not only effective
site-selective S-arylation modification with added
aryl iodide reagents but also reagent-dependent covalent inhibition
of the enzyme (Figure b).[10] Given the recent resurgence of covalent
inhibition in enzymes that bearconserved active-site Cys, this raises
the clear potential of this method even in eventual “catalytic
therapies”.
Summary, Lessons and Outlook
The
interfacing of catalysis with bioconjugations brings with it
unique considerations and opportunities. Common to both chemo- and
bio-catalysis are issues of recruitment of protein to catalyst center
that may be limited by or that may, indeed, take advantage of (e.g.,
through relaying in a tag-determined order to recruit as a form of
‘Michaelis-complex mimicking’) different cycle steps.
Both can be aided by not only mechanistic analysis but also by protein-specificconsiderations, such as tag-orientation (Figure ) or use of preformed/excess of catalyst+reagent.
In such cases the corresponding alkylidene (RuOCM) or oxidative addition
complex (PdCC) may be being used at levels that make them more like
highly selective reagents rather than catalysts. Nonetheless, they
are also saturated catalyst states used effectively, akin to how biology
accomplishes transformations. They also perhaps point the way toward
future engineering of saturation (again akin to biological optimization
of KM) in chemical systems to improve
their effectiveness while maintaining turnover. Use of preformed saturated
catalyst states[58] might also rescue (by
better sequestering) biologically troubling (e.g., phosphine) ligands
in such systems.For example, our discovery of the effectiveness
of chalcogen-relayed
RuOCM in proteins we find to also be illustrative of a different strategic
thinking that may prove useful in catalytic protein chemistry inasmuch
as we were not the first to attempt on-protein OCM. Yet our somewhat
antithetic use of privileged tags (Sac, Seac, Ahc), which may be viewed
as an atypical, perhaps limiting, mode of “substrate engineering”
in many methodological studies, was instead highly effective in proteins
in its enabling of reactivity, selectivity and function by driving
effective protein recruitment.Other considerations of solubility
and nonspecificmetal–protein
interaction[60] may require competition (e.g.,
from other metals as by Mg(II) in RuOCM[29]) or scavengers (e.g., as for 3-mercaptopropionic acid in PdCC[44]) and altered substrates (e.g., lipid-boronic
acid rather than boronate ester, where the acid was a needed “solubility
handle” that was discarded during PdCC[43]). In thiscontext it should be noted that at high concentrations
some exogenous thiol-based scavengers may prove toxic or act as reductants,
potentially disrupting, e.g. intramoleculardisulfides in proteins.
Nonetheless, concentrations for scavenging are typically well tolerated.
Moreover, residual or competitive metals do not necessarily lead to
loss of structure and/or function[50,58,61] (e.g., as high as 6% Pd[58] in some functional protein samples).One may view all of these
as biomimetic strategies, required in
the “systems chemistry” that is Biology. While we have
focused this review largely on metal-mediated process, we have also
noted examples of its combined use with biocatalysis, often also being
applied in unnatural modes. The principles of ordering, orientation,
selectivity and mimicry set out here in many parts apply also to solely
biocatalytic strategies.[62] Indeed, one
might consider the exploitation of Ubq-mediated proteolysis in so-called
Protacs[63] or the repurposing of proteases
to target their degradation to chosen proteins[64] as examples of man-made, “new-to-nature”
modes of hydrolytic protein modification.Increasing translation
into more complex living systems will hinge
on proper exploitation of the selectivity of such manifolds and it
is perhaps this, rather than the inherent efficiency of catalysis,
that is a primary benefit. The corresponding interfacing or separation
of these manifolds with/from those that exist in nature becomes an
intriguing challenge in chemical control. The future, therefore, will
lie in other catalytic modes via perhaps other (even endogenous) metals
and tags, as well as nonmetal small molecule catalysts[65] or those that drive electron transfer/redox
(metal or nonmetal).[66,67] If these can be tamed for benign
use in vivo and combined with multiple strategies
of selectivity (even beyond chemo-/regio-/stereo-) then such catalytic
protein (and other biomolecule) editing could prove widely powerful
in biology and physiology.
Authors: K M Sakamoto; K B Kim; A Kumagai; F Mercurio; C M Crews; R J Deshaies Journal: Proc Natl Acad Sci U S A Date: 2001-07-03 Impact factor: 11.205
Authors: Benjamin G Davis; Rafael F Sala; David R W Hodgson; Astrid Ullman; Kanjai Khumtaveeporn; David A Estell; Karl Sanford; Richard R Bott; J Bryan Jones Journal: Chembiochem Date: 2003-06-06 Impact factor: 3.164
Authors: Samuel L Scinto; Didier A Bilodeau; Robert Hincapie; Wankyu Lee; Sean S Nguyen; Minghao Xu; Christopher W Am Ende; M G Finn; Kathrin Lang; Qing Lin; John Paul Pezacki; Jennifer A Prescher; Marc S Robillard; Joseph M Fox Journal: Nat Rev Methods Primers Date: 2021-04-15
Authors: Marco S Messina; Kathryn M M Messina; Arvind Bhattacharya; Hayden R Montgomery; Heather D Maynard Journal: Prog Polym Sci Date: 2019-11-18 Impact factor: 29.190
Authors: Brian Josephson; Charlie Fehl; Patrick G Isenegger; Simon Nadal; Tom H Wright; Adeline W J Poh; Ben J Bower; Andrew M Giltrap; Lifu Chen; Christopher Batchelor-McAuley; Grace Roper; Oluwatobi Arisa; Jeroen B I Sap; Akane Kawamura; Andrew J Baldwin; Shabaz Mohammed; Richard G Compton; Veronique Gouverneur; Benjamin G Davis Journal: Nature Date: 2020-09-23 Impact factor: 49.962
Authors: Muhammad Jbara; Jacob Rodriguez; Heemal H Dhanjee; Andrei Loas; Stephen L Buchwald; Bradley L Pentelute Journal: Angew Chem Int Ed Engl Date: 2021-04-16 Impact factor: 15.336
Authors: Mary A Waddington; Xin Zheng; Julia M Stauber; Elamar Hakim Moully; Hayden R Montgomery; Liban M A Saleh; Petr Král; Alexander M Spokoyny Journal: J Am Chem Soc Date: 2021-06-01 Impact factor: 16.383