The incorporation of fluorine can not only significantly facilitate the study of proteins but also potentially modulate their function. Though some biosynthetic methods allow global residue-replacement, post-translational fluorine incorporation would constitute a fast and efficient alternative. Here, we reveal a mild method for direct protein radical trifluoromethylation at native residues as a strategy for symmetric-multifluorine incorporation on mg scales with high recoveries. High selectivity toward tryptophan residues enhanced the utility of this direct trifluoromethylation technique allowing ready study of fluorinated protein constructs using 19F-NMR.
The incorporation of fluorine can not only significantly facilitate the study of proteins but also potentially modulate their function. Though some biosynthetic methods allow global residue-replacement, post-translational fluorine incorporation would constitute a fast and efficient alternative. Here, we reveal a mild method for direct protein radical trifluoromethylation at native residues as a strategy for symmetric-multifluorine incorporation on mg scales with high recoveries. High selectivity toward tryptophan residues enhanced the utility of this direct trifluoromethylation technique allowing ready study of fluorinated protein constructs using 19F-NMR.
Although absent from nearly
all-natural products, small organofluorine compounds have found widespread
applications from materials,[1] to clinical
diagnostics (19F-MRI/18F-PET)[2,3] and
medicinal chemistry (where 30%[4] of new
small molecule drugs contain F to alter physicochemical properties[5] and/or modulate affinity[6]). Incorporation of fluorine into biomolecules is more rare but potential
applications in structural re-engineering[7] and as reporter tags for “zero-background” 19F-NMR[8,9] raise striking possibilities. For the latter,
the benefits include: high sensitivity, extreme responsiveness to
local environment, broad chemical shift range and low reactivity once
incorporated.[10] This has helped to elucidate
structural, functional and dynamic[11,12] interactions
in receptor function landscapes[13] and enzymic
catalysis.[14] Indeed, fluorine’s
features can render it superior over other nuclei in, for example,
in-cell NMR spectroscopy[15] or protein-observed 19F-NMR (PrOF) protein–ligand binding.[16−19]Apart from (semi)synthetic ligations,[20] two main strategies are used for generating fluorinated proteins:
incorporation of fluorinated amino acids using biosynthetic methods
or chemical modification.[21] The former
is typically only residue-specific[22] but
can be achieved in a site-selective manner through sense or nonsense
codon reassignment (e.g., for positioning of CF3-l-phenylalanine[23]). However, for operational
simplicity and to avoid identified issues with the former,[24] the latter modification method has often been
preferred as more direct. It has relied largely on reaction of nucleophilic
side chains with prosthetic groups bearing fluorine (typically cysteine
and lysine) often attached via linkers or prosthetics[25,26] that can perturb structure and create distance imprecision.[24] A lack of other methods for installing fluorine
into biomolecules has been highlighted as a restriction;[24,27] due to low relative abundance and roles in interiors, aromatic amino
acids are attractive targets. Therefore, direct (hetero)aromatic trifluoromethylation
could be a powerful tool allowing (a) “zero/short linker”
fluorine incorporation via (b) a higher sensitivity multiequivalent-F
system.[28,29] Here, we describe a novel method that exploits
the chemoselectivity of radicals in protein chemistry to achieve this
aim (Figure ).
Figure 1
Direct radical
trifluoromethylation of protein substrates. ·CF3 radical
targeting (hetero)aromatic residues can be generated
from sodium trifluoromethanesulfinate (NaTFMS, Langlois’ reagent)
under aqueous, oxidative conditions.
Direct radical
trifluoromethylation of protein substrates. ·CF3 radical
targeting (hetero)aromatic residues can be generated
from sodium trifluoromethanesulfinate (NaTFMS, Langlois’ reagent)
under aqueous, oxidative conditions.Direct C–CF3-bond trifluoromethylation
presents
a striking synthetic challenge in biomolecules. Most strategies for
C–H functionalization have been applied only to small molecules
using methods or conditions essentially incompatible with proteins
(elevated temperatures,[30] organic solvents[31] or strong acids[32]). Among possible strategies, we identified radical-based approaches.[33] We have demonstrated the benign nature of radicals
for C–C-bond formation through use of a designed unnatural
amino acid.[34] This illustrated radical
efficiency under ambient aqueous conditions and operational simplicity.
We therefore considered whether a suitable ·CF3 precursor
might be generated that could allow natural amino acid modification.Various putative ·CF3 sources[35] were tested using model amino acid substrates. Langlois’
reagent (NaTFMS) proved most promising via multiple redox initiation
modes (e.g., transition-metal-free, photocatalytic) under mild conditions.[36−38] Baran et al. identified that NaTFMS may be used to modify small
molecule N-heterocycle caffeine when spiked into
cell lysate, which has “important implications in the area
of bioconjugation”.[39] Advantageously, 19F-NMR allowed us ready reactivity screening of all abundant
natural amino acids (SI Figure S1) and
revealed selective reactivity of NaTFMS/TBHP[39] with only aromatic amino acids and free cysteine ((hetero)arene-C–H
or Cys-S–H trifluoromethylation) and none toward any other
natural amino acids. 19F-NMR suggested near homogeneous
product formation from tryptophan (Trp) and cysteine (Cys); low levels
of products from His, Tyr and Phe, appeared as complex mixtures of
apparent regioisomers.Next, limited (<40%) conversion competition
assays (Figure ) using
mixtures
of equimolar Trp, Phe, Tyr, Cys, His, Met with NaTFMS/TBHP were used
to assess residue-specific selectivity. These revealed preferential
formation of the Trp-C2 isomer at a krel > ∼7.5-fold. This could be further increased to krel > ∼30-fold by lowering the pH
to 6 (SI Table T2), potentially due to
reduced reactivity
of protonated His residues. The main side reaction was oxidative dimerization
of Cys without trifluoromethylation. These observations suggested
not only the cysteine disulfide is inert but in the absence of free
Cys, strongly chemo- and regio- Trp-selective trifluoromethylation
might be possible in proteins. This was confirmed by a similar competition
reaction in the absence of Cys (SI Figure
S3), along with His modification as minor (<1:24) side-reaction.
Optimization of the reaction toward Trp revealed the key role of pH
(SI Table T3). In weakly or nonbuffered
aqueous conditions, rapid acidification was observed, consistent with
a mechanism that leads to hydrogensulfate formation (Figure ).[40] Useful conversions to CF3-Trp (>50%) were obtained
at
a moderate pH ∼ 6 (Figure S4), optimal
for most biomolecules.
Figure 2
Limited-conversion competition assay of natural amino
acids (0.03
mmol) revealed chemo- and regio-selectivity toward Trp with minor
products (Cys and lower levels of Phe, His, Tyr isomers, determined
by 19F-NMR, LC-MS). See Figure S3 for parallel reaction in absence of Cys.
Limited-conversion competition assay of natural amino
acids (0.03
mmol) revealed chemo- and regio-selectivity toward Trp with minor
products (Cys and lower levels of Phe, His, Tyr isomers, determined
by 19F-NMR, LC-MS). See Figure S3 for parallel reaction in absence of Cys.Together, these results prompted investigation in more complex
peptidic and protein substrates. First, reaction of N-acetyl-l-tryptophanamide confirmed reactivity and identical
regio-selectivity to that seen for Trp. Second, as a simple model
peptide, we chose melittin, a 26-residue peptide with a single Trp
residue at Trp19. Use of excess NaTFMS/TBHP with methionine as an
oxidative buffer allowed short reaction times (Figure ). The exclusive site of modification at
Trp19 was confirmed by LC-MS/MS analysis (SI Figure S3); Trp19 oxidation and dual Trp19 CF3-ylation
were observed under prolonged conditions. 4-Hydroxy-TEMPO not only
allowed ready termination of the reaction at varied time points but
also confirmed the radical-mediated nature.
Figure 3
Reaction of melittin
(MWcalc= 2847) with NaTFMS/TBHP.
(a) Relative conversions from LC-MS with the optimal reaction time
highlighted in blue. (b) Deconvoluted LC-MS spectra of reaction mixtures
after 5 min ± Met. Undesired oxidation (+32) is curtailed by
sacrificial reductant giving melittin-TrpCF3 (+68, MWcalc = 2915, MWfound = 2915).
Reaction of melittin
(MWcalc= 2847) with NaTFMS/TBHP.
(a) Relative conversions from LC-MS with the optimal reaction time
highlighted in blue. (b) Deconvoluted LC-MS spectra of reaction mixtures
after 5 min ± Met. Undesired oxidation (+32) is curtailed by
sacrificial reductant giving melittin-TrpCF3 (+68, MWcalc = 2915, MWfound = 2915).As a natural protein substrate, we chose model hemoprotein–horse
heart myoglobin (Mb). Mb contains two buried Trp (Trp7, Trp14) in
the presence of several other potentially reactive, more accessible
residues (SI Figure S6). Optimization of
conditions (pH, temperature, reactants loading, SI Table T4, SI Figure S7) allowed modulation of reactivity
by CF3 copy-number. In this way, at pH 6 (100 mM ammonium
acetate), 0 °C, using 200 equiv NaTFMS after 10 min we were able
to create myoglobin bearing primarily one or two CF3 groups
(Figure a). Higher
equivalents of Langlois’ reagent allowed further shortening
of reaction times, without apparent adverse effects on protein (SI Table T4, entries 5,6). In contrast to literature
observations,[39] we observed no significant
influence of Zn2+ as an additive. Control of pH also minimized
concomitant oxidation (SI Figure S8). Tryptic
digest-LCMS/MS revealed that Trp7 and Trp14 were the dominant sites
of modification (Figure b and SI Figure S9) despite their buried
nature, highlighting the low steric hindrance of ·CF3, even in these congested environments. This accessibility to buried
sites complements existing nucleophilic, prosthetic-mediated methods,
which are more applicable to exposed sites, requiring partial denaturation
to access buried sites.[41−43] Similarly, accessibility is needed
for standard labeling of Cys with fluorinated tags. Importantly, CD
spectroscopy revealed that trifluoromethylation had negligible influence
on the structure of Mb-TrpCF3 protein (SI Figure S10).
Figure 4
Direct trifluoromethylation of proteins (a)
myoglobin (Mb, MWcalc = 16951.5) deconvoluted LC-MS spectrum
showing singly-
(MWcalc = 17019, MWfound = 17020) and doubly-
modified (MWcalc = 17087, MWfound = 17087),
(b) tryptic-LC-MS/MS confirmed site-selectivity at Trp7 and Trp14.
Direct trifluoromethylation of proteins (a)
myoglobin (Mb, MWcalc = 16951.5) deconvoluted LC-MS spectrum
showing singly-
(MWcalc = 17019, MWfound = 17020) and doubly-
modified (MWcalc = 17087, MWfound = 17087),
(b) tryptic-LC-MS/MS confirmed site-selectivity at Trp7 and Trp14.Next, we tested the prospective
utility of our developed direct
trifluoromethylation as a tool for enabling protein 19F-NMR.
Pleasingly, mg-scale reactions allowed straightforward “labeling”
of Mb with CF3 with ∼65–80% recovery. As
a proof-of-principle, we used 19F-NMR to directly probe
sites of modification (Figure a). The native protein spectra of modified Mb revealed two
major 19F-peaks. To elucidate their origin, protein was
digested with trypsin (to free residues from their microenvironment
and remove nonequivalence); the resonances collapsed to δF −58.8 ppm, in good agreement to that for 2-CF3-Trp in model Trp-CF3bis-amide
(δF −58.76). This was further reinforced by
denaturative 19F-NMR structural analyses:[44] titration with urea induced a gradual change in chemical
shift (Figure b) that
could be explained by a known progressive loss of secondary structure
in Mb under such conditions.[45] Treatment
with 10 M urea resulted in full coalescence to δ-58.74 ppm (Figure a, full spectrum Figure S11).[46,47]
Figure 5
19F-NMR analyses enabled by direct trifluoromethylation.
(a) 19F-NMR of CF3Trp-Mb (700 μM in 100
mM NH4OAc, pH 8, blue), denatured by 10 M urea (300 μM,
red) and peptide mixture from tryptic digest (from 2 mM protein, green).
Dashes indicate chemical shift for small molecule model. Inset at
higher field with individual Trp resolved. (b) Urea titration by 19F-NMR (protein 200–400 μM). (c) Chemical shift
differences from ligands in PrOF, protein concentration (100–300
μM).
19F-NMR analyses enabled by direct trifluoromethylation.
(a) 19F-NMR of CF3Trp-Mb (700 μM in 100
mM NH4OAc, pH 8, blue), denatured by 10 M urea (300 μM,
red) and peptide mixture from tryptic digest (from 2 mM protein, green).
Dashes indicate chemical shift for small molecule model. Inset at
higher field with individual Trp resolved. (b) Urea titration by 19F-NMR (protein 200–400 μM). (c) Chemical shift
differences from ligands in PrOF, protein concentration (100–300
μM).We extended essentially the same
protocol to other model proteins
with different folds and Trp copy number: pantothenate synthetase
(PanC with single Trp306) and more demanding enzyme lysozyme (Ly),
which possesses six potentially reactive Trp (Trp28, Trp62, Trp63,
Trp108, Trp111 and Trp123) and eight Cys engaged within each other
as disulfides. Both tryptic-LC-MS/MS and 19F-NMR (SI Figures S12–17) not only confirmed
Trp residues as primary modification sites in both proteins but also
revealed no reaction of Cys residues, consistent with our prior observations.
Thermal denaturation also gave rise to resonance coalesence, as for
Mb (SI Figure S19). We were also able to
determine that trifluoromethylation had negligible influence upon
both the structure of Ly (via CD, SI Figure
S20) but also upon its enzymatic function (SI Figure 22, SI Table T5), which was essentially unaltered. We were
also able to use this fully active CF3-ylated Ly in quantitative 19F-NMR PrOF[16−19] (SI Figure 21) to directly determine
the binding of known alkaloid inhibitor ligand berberine (Kd 20 μM, consistent with values determined
by complementary methods[48]). The potential
of our direct trifluoromethylation method was also demonstrated in
more complex structural experiments for different protein–ligand
states studied by PrOF.[16−19] This allowed observation of displacement of water
bound at heminiron (which we confirmed by UV–vis[49] was retained after CF3-ylation, Figure S23) by known ligands cyanide, fluoride,
imidazole (Im)[50] (Figure c, full spectra Figure
S24).[16] We also tested effects of
common osmo/cryo- protectants (sarcosine – Sar, trehalose –
Tre)[51] and putative ligand phenol.[52] CN–, Im caused clear upfield
shifts, consistent with being known specific ligands.[47] Both Trp residues are located within the same helix; both
signals reported changes, yet these were more pronounced for more-tightly
packed Trp14 (Figure S26). Interestingly,
known ligand F– did not trigger a change in chemical
shift suggesting that this small ligand has a negligible impact on
overall protein conformation. Conversely, Tre and Sar caused globally
observed changes, consistent with nonspecific interactions or changes
in solute properties, in agreement with the mode of actions of these
compounds.[53] Phenol, which inhibits the
dehaloperoxidase activity of myoglobins,[52] showed only small changes implying only nonspecific binding, likely
not at hemin (Figure S25).In conclusion,
we have demonstrated direct trifluoromethylation
of natural residues in proteins. Tuned radical chemistry allows fast
(<10 min) and chemo-selective modification of Trp with a useful
but not absolute selectivity over other putative residues in peptidic
systems and proteins. This method appears to provide a selective and
direct protocol for trifluoromethylation of proteins. Methods exploiting
photochemical generation[54,55] or the use of electrophilic
alkylation chemistry[56] suggest lower and/or
differing selectivities. Interestingly, in the context of the known
ambiguity of ·CF3 toward heteroarenes,[57] this selectivity seems to exploit electrophilic
polarity; our preliminary results using known precursors to nucleophilic
·CF2Me under essentially identical conditions failed.
Trp vs His selectivity is increased at pH 6 (Table
T2), also consistent with lower reactivity of electrophilic
·CF3 toward protonated His. It allows direct access
to 19F-NMR experiments without the requirement of additional
mutation of protein substrates and use of linkers, as is currently
the case.[58] In particular, we showed through
fractional CF3-“labeling” 19F-NMR
spectra can be obtained directly with no significant line broadening.[59] Although Trp residues are rare, their frequent
role and in proximity to protein–ligand/protein interactions
and presence at hydrophobic interfaces makes them useful probes in
PrOF assays.[60−63] We speculate too that the reactivity observed here with typically
inaccessible sites reflects both the lack of bulk (no ‘linker’/
“zero size”) and prior observations that biphasic/interfacial
regions enhance small molecule radical reactions.[40] It therefore complements other Trp-modification, such as
the use of Rh-carbenoids that work best with surface residues and
currently allow more generality in modifying group than the method
we present here.[42] Other applications can
also be envisaged: tuning of intrinsic Trp fluorescence[64] or physicochemical effects via fluorination
(e.g., “Teflon” proteins[65]). We observed even low levels of fluorine incorporation altered
fluorescence spectra (SI Figure S27) and
changed chromatographic behaviors (SI Figure
S28).
Authors: Samantha J Tower; Wesley J Hetcher; Tyler E Myers; Nicholas J Kuehl; Michael T Taylor Journal: J Am Chem Soc Date: 2020-05-05 Impact factor: 15.419
Authors: Ming Cheng; Awuri Asuru; Janna Kiselar; George Mathai; Mark R Chance; Michael L Gross Journal: J Am Soc Mass Spectrom Date: 2020-04-21 Impact factor: 3.109
Authors: Brian Josephson; Charlie Fehl; Patrick G Isenegger; Simon Nadal; Tom H Wright; Adeline W J Poh; Ben J Bower; Andrew M Giltrap; Lifu Chen; Christopher Batchelor-McAuley; Grace Roper; Oluwatobi Arisa; Jeroen B I Sap; Akane Kawamura; Andrew J Baldwin; Shabaz Mohammed; Richard G Compton; Veronique Gouverneur; Benjamin G Davis Journal: Nature Date: 2020-09-23 Impact factor: 49.962