Rapid, site-specific labeling of proteins with diverse probes remains an outstanding challenge for chemical biologists. Enzyme-mediated labeling approaches may be rapid but use protein or peptide fusions that introduce perturbations into the protein under study and may limit the sites that can be labeled, while many "bioorthogonal" reactions for which a component can be genetically encoded are too slow to effect quantitative site-specific labeling of proteins on a time scale that is useful for studying many biological processes. We report a fluorogenic reaction between bicyclo[6.1.0]non-4-yn-9-ylmethanol (BCN) and tetrazines that is 3-7 orders of magnitude faster than many bioorthogonal reactions. Unlike the reactions of strained alkenes, including trans-cyclooctenes and norbornenes, with tetrazines, the BCN-tetrazine reaction gives a single product of defined stereochemistry. We have discovered aminoacyl-tRNA synthetase/tRNA pairs for the efficient site-specific incorporation of a BCN-containing amino acid, 1, and a trans-cyclooctene-containing amino acid 2 (which also reacts extremely rapidly with tetrazines) into proteins expressed in Escherichia coli and mammalian cells. We demonstrate the rapid fluorogenic labeling of proteins containing 1 and 2 in vitro, in E. coli , and in live mammalian cells. These approaches may be extended to site-specific protein labeling in animals, and we anticipate that they will have a broad impact on labeling and imaging studies.
Rapid, site-specific labeling of proteins with diverse probes remains an outstanding challenge for chemical biologists. Enzyme-mediated labeling approaches may be rapid but use protein or peptide fusions that introduce perturbations into the protein under study and may limit the sites that can be labeled, while many "bioorthogonal" reactions for which a component can be genetically encoded are too slow to effect quantitative site-specific labeling of proteins on a time scale that is useful for studying many biological processes. We report a fluorogenic reaction between bicyclo[6.1.0]non-4-yn-9-ylmethanol (BCN) and tetrazines that is 3-7 orders of magnitude faster than many bioorthogonal reactions. Unlike the reactions of strained alkenes, including trans-cyclooctenes and norbornenes, with tetrazines, the BCN-tetrazine reaction gives a single product of defined stereochemistry. We have discovered aminoacyl-tRNA synthetase/tRNA pairs for the efficient site-specific incorporation of a BCN-containing amino acid, 1, and a trans-cyclooctene-containing amino acid 2 (which also reacts extremely rapidly with tetrazines) into proteins expressed in Escherichia coli and mammalian cells. We demonstrate the rapid fluorogenic labeling of proteins containing 1 and 2 in vitro, in E. coli , and in live mammalian cells. These approaches may be extended to site-specific protein labeling in animals, and we anticipate that they will have a broad impact on labeling and imaging studies.
Inverse electron-demand Diels–Alder
reactions between strained alkenes (including norbornenes and trans-cyclooctenes) and tetrazines have emerged as an important
class of rapid bioorthogonal reactions.[1−4] The rates reported for some of these reactions
are incredibly high.[3,4] Very recently, three approaches
for specific labeling of proteins using these reactions have been
reported. A lipoic acid ligase variant that accepts a trans-cyclooctene substrate was used to label proteins bearing a 13 amino
acid lipoic acid ligase tag in a two-step procedure.[5] A tetrazine was introduced at a specific site in a protein
expressed in via genetic
code expansion and then derivatized with a strained trans-cyclooctene–diacetylfluorescein.[6] We demonstrated the incorporation of a strained alkene (a norbornene-containing
amino acid) via genetic code expansion and site-specific fluorogenic
labeling with tetrazine fluorophores in vitro, in , and in live mammalian cells.[7] Related work was recently reported by others,[8,9] and while the present work was under review, the low-efficiency
incorporation of one of the amino acids reported here (2) and the detection of some fluorescent labeling in fixed cells was
described.[9]Recent work in organic
solvents suggests that the reaction between
tetrazines and BCN, which was first described in strain-promoted reactions
with azides,[10] may proceed very rapidly.[11] However, this reaction, unlike the much slower
reaction of simple cyclooctynes with azides, nitrones,[12−16] and tetrazines,[9,17] has not been explored in aqueous
media or as a chemoselective route to label macromolecules.Here we present a rapid and fluorogenic reaction between tetrazines
and BCN and demonstrate the genetic encoding of BCN- and trans-cyclooctene-containing amino acids 1 and 2, respectively, in and mammalian
cells. We show the specific, rapid, and fluorogenic labeling of proteins
in and live mammalian cells
with tetrazine probes, and explicitly demonstrate the advantages of
this approach (Scheme 1).
Scheme 1
Genetic Encoding
and Fluorogenic Labeling of Unnatural Amino Acids 1 and 2
The rate constants for the reactions of various
dienophiles [BCN, trans-cyclooctene-4-ol (TCO), and
bicyclo[6.1.0]non-4-ene-9-ylmethanol
(sTCO)] with tetrazines have been determined.[3−5,9,11] However, in many cases,
different tetrazines, solvent systems, or measurement methods were
used, making it challenging to compare quantitatively the reactivities
of the dienophiles with tetrazines of interest. Our initial experiments
confirmed that the reactions of the dienophiles 1–3 with tetrazine 6 (Figure 1) were too fast to study by manual mixing under pseudo-first-order
conditions. We therefore turned to stopped-flow techniques to determine
the pseudo-first-order rate constants for these reactions. By following
the exponential decay of the tetrazine absorbance at 320 nm upon reaction
with a 10–100-fold excess of BCN in 55:45 methanol/water, we
determined the rate constants for the reactions of BCN with 6 and 7 as 437 ± 13 and 1245 ± 45 M–1 s–1, respectively [Figure 2a,b and Figure S2a,b in the Supporting Information (SI)]. LC–MS and NMR analysis
confirmed the products (see the SI and Figure S1). Under the same conditions, the rate
constants for the reactions of TCO with 6 and 7 were 5235 ± 258 and 17248 ± 3132 M–1 s–1, respectively (Figure
S3). The reaction between BCN and 6 is ∼1000
times faster than the reaction between 5-norbornene-2-ol and 6,[7] while the TCO reaction is 10–15
times faster than that with BCN. The sTCO reaction was too fast for
accurate measurements by stopped-flow techniques, and we estimate
that it is at least 50 times faster than the TCO reaction. Similar
rate accelerations were observed for the reactions of BCN with 7, 8, and 9 (SI Table 1 and Figure S2).
Figure 1
Structures of unnatural amino acids 1–5 and tetrazine derivatives 6–17 used in this study. For the structures of
TAMRA-X, Bodipy TMR-X,
Bodipy-FL, and CFDA, see Figure S4.
Figure 2
Characterization of the reaction of BCN with 7. (a)
Stopped-flow kinetics of the reaction. The inset shows the conjugation
of 7 to 5-norbornene-2-ol (Nor); the different time scales
should be noted. Conditions: c = 0.05 mM and cBCN = cNor = 5 mM in 55:45 MeOH/H2O at 25 °C.
(b) Determination of the second-order rate constant k for the reaction of 7 and BCN. (c) Fluorogenic reaction
of 11 with BCN.
Structures of unnatural amino acids 1–5 and tetrazine derivatives 6–17 used in this study. For the structures of
TAMRA-X, Bodipy TMR-X,
Bodipy-FL, and CFDA, see Figure S4.Characterization of the reaction of BCN with 7. (a)
Stopped-flow kinetics of the reaction. The inset shows the conjugation
of 7 to 5-norbornene-2-ol (Nor); the different time scales
should be noted. Conditions: c = 0.05 mM and cBCN = cNor = 5 mM in 55:45 MeOH/H2O at 25 °C.
(b) Determination of the second-order rate constant k for the reaction of 7 and BCN. (c) Fluorogenic reaction
of 11 with BCN.Several tetrazine–fluorophore conjugates,
including 11, 13, 14, and 16 (Figure 1 and Figure S4) are substantially
quenched with respect to the free fluorophores.[7,18] We
found that the reaction of BCN with 11, 13, 14, and 16 leads to a 5–10-fold
increase in fluorescence, suggesting that the formation of the pyridazine
product efficiently relieves the fluorophore quenching (Figure 2c and Figure S5). These
fluorogenic reactions with BCN, like those between strained alkenes
and tetrazines,[7,18] are advantageous for imaging
experiments since they maximize the labeling signal while minimizing
the fluorescence arising from the free tetrazine fluorophore.Next, we aimed to design, synthesize, and genetically encode amino
acids bearing BCN, TCO, and sTCO for site-specific protein labeling
with a diverse range of probes both in vitro and in cells. The pyrrolysyl-tRNA
synthetase (PylRS)/tRNACUA pairs from species, including (Mb) and (Mm), and their evolved derivatives have been used to direct
the site-specific incorporation of a growing list of structurally
diverse unnatural amino acids in response to the amber codon[19−26] in a range of hosts, allowing synthetases evolved in to be used for genetic code expansion in
a growing list of cells and organisms, including , , yeast, human cells, and .[7,27−31] We designed the unnatural amino acids 1, 2, and 3 (Figure 1) with the goal
of incorporating them into proteins using the PylRS/tRNACUA pair or an evolved derivative. The amino acids were synthesized
as described in the SI.We screened
the MbPylRS/tRNACUA pair
along with a panel of MbPylRS mutants for their ability
to direct the incorporation of 1–3 in response to an amber codon introduced at position 150 in a C-terminally
His6-tagged superfolder green fluorescent protein (sfGFP).
Cells containing a mutant of MbPylRS with the three
amino acid substitutions Y271M, L274G, and C313A[32] in the enzyme active site [which we named BCN-tRNA synthetase
(BCNRS)] and a plasmid that encodes MbtRNACUA and sfGFP-His6 with an amber codon at position 150 (psfGFP150TAGPylT-His6) led to the amino acid-dependent
synthesis of full-length sfGFP-His6 (Figure 3a and Figure S6). We found an additional
synthetase mutant bearing the mutations Y271A, L274 M, and C313A,[32] which we named TCO-tRNA synthetase (TCORS).
The TCORS/tRNACUA pair led to the amino acid-dependent
synthesis of sfGFP from psfGFP150TAGPylT-His6 in the presence of 2. Finally, we found that
both the BCNRS/tRNACUA pair and the TCORS/tRNACUA pair lead to the amino acid-dependent synthesis of sfGFP from psfGFP150TAGPylT-His6 in the presence of 3. For each amino acid, sfGFP was isolated in good yield after
His-tag and gel-filtration purification (6–12 mg/L of culture;
Figure 3b). This is comparable to the yields
obtained for other well-incorporated unnatural amino acids, including 5. Electrospray ionization mass spectrometry (ESI-MS) data
for sfGFP produced from psfGFP150TAGPylT-His6 in the presence of each unnatural amino acid were consistent
with their site-specific incorporation (Figure 3c–e).
Figure 3
Efficient genetically encoded incorporation of unnatural
amino
acids in . (a) Amino acid-dependent
overexpression of sfGFP-His6 bearing an amber codon at
position 150. The expressed protein was detected in lysates using
an anti-His6 antibody and Coomassie staining. (b) Coomassie-stained
gel showing purified proteins. (c–e) ESI-MS data for amino
acid incorporation. For sfGFP-1-His6: found,
28017.54 Da; calcd, 28017.62 Da. For sfGFP-2-His6: found, 27993.36 Da; calcd, 27992.82 Da. For sfGFP-His6 produced with 3 as described: found, 28019.34
Da; calcd, 28019.63 Da. The minor peaks in the mass spectra correspond
to loss of the N-terminal methionine.
Efficient genetically encoded incorporation of unnatural
amino
acids in . (a) Amino acid-dependent
overexpression of sfGFP-His6 bearing an amber codon at
position 150. The expressed protein was detected in lysates using
an anti-His6 antibody and Coomassie staining. (b) Coomassie-stained
gel showing purified proteins. (c–e) ESI-MS data for amino
acid incorporation. For sfGFP-1-His6: found,
28017.54 Da; calcd, 28017.62 Da. For sfGFP-2-His6: found, 27993.36 Da; calcd, 27992.82 Da. For sfGFP-His6 produced with 3 as described: found, 28019.34
Da; calcd, 28019.63 Da. The minor peaks in the mass spectra correspond
to loss of the N-terminal methionine.To demonstrate that the tetrazine–dye probes
react efficiently
and specifically with recombinant proteins bearing site-specifically
incorporated 1, we labeled purified sfGFP-1-His6 with 10 equiv of tetrazine–fluorophore conjugate 11 for 1 h at room temperature. SDS-PAGE and ESI-MS analyses
confirmed the quantitative labeling of sfGFP-1 (Figure 4a,b). Control experiments demonstrated that sfGFP-4 was labeled under the same conditions and that no nonspecific
labeling occurred with sfGFP-5. ESI-MS showed that sfGFP-1 could be efficiently and specifically derivatized with 6–9 (Figure S7) and with 12–14 and 16 (Figure S8). We also demonstrated that
purified sfGFP-2 could be quantitatively labeled with 11 (Figure 4a,c). Interestingly, we
observed only very weak labeling of sfGFP-His6 purified
from cells expressing TCORS/tRNACUA and psfGFP150TAGPylT-His6 and grown in the presence of 3 (Figure 4a,d) and substoichiometric labeling of this protein
prior to purification (Figure S9). Since
the sfGFP expressed in the presence of 3 has a mass corresponding
to the incorporation of 3, these observations are consistent
with the in vivo conversion of a fraction of the trans-alkene in 3 to its unreactive cis isomer.
This isomerization is known to occur in the presence of thiols.[4]
Figure 4
Rapid and specific labeling of recombinant proteins with
tetrazine–fluorophore
conjugates. (a) Specific labeling of sfGFP bearing 1, 2, or 4 with 11 (10 equiv) demonstrated
by SDS-PAGE and in-gel fluorescence. (b) Quantitative labeling of
sfGFP-1 with 11 demonstrated by ESI-MS.
Before bioconjugation (blue): found, 28018.1 Da; calcd, 28017.6 Da.
After bioconjugation (red): found, 28824.2 Da; calcd, 28823.2 Da.
(c) Quantitative labeling of sfGFP-2 with 11 demonstrated by ESI-MS. Before bioconjugation (blue): found, 27993.2
Da; calcd, 27992.8 Da. After bioconjugation (red): found, 28799.4
Da; calcd, 28799.1 Da. (d) No labeling with 11 of sfGFP-His6 expressed in the presence of 3 could be detected
by MS. (e) Very rapid labeling of proteins containing 1 or 2.
Rapid and specific labeling of recombinant proteins with
tetrazine–fluorophore
conjugates. (a) Specific labeling of sfGFP bearing 1, 2, or 4 with 11 (10 equiv) demonstrated
by SDS-PAGE and in-gel fluorescence. (b) Quantitative labeling of
sfGFP-1 with 11 demonstrated by ESI-MS.
Before bioconjugation (blue): found, 28018.1 Da; calcd, 28017.6 Da.
After bioconjugation (red): found, 28824.2 Da; calcd, 28823.2 Da.
(c) Quantitative labeling of sfGFP-2 with 11 demonstrated by ESI-MS. Before bioconjugation (blue): found, 27993.2
Da; calcd, 27992.8 Da. After bioconjugation (red): found, 28799.4
Da; calcd, 28799.1 Da. (d) No labeling with 11 of sfGFP-His6 expressed in the presence of 3 could be detected
by MS. (e) Very rapid labeling of proteins containing 1 or 2.To demonstrate that the reaction between BCN and
tetrazines is
highly selective within a cellular context, we performed the reaction
on expressing sfGFP-1 (Figure S10). In-gel fluorescence imaging
of this proteome labeling demonstrated that while many proteins in
the lysates were present at a comparable abundance to sfGFP-1, there was very little background labeling. This suggests
that the reaction is specific with respect to the proteome.To investigate whether the rates
of the BCN– and TCO–tetrazine
cycloadditions observed for small molecules translate into exceptionally
rapid protein labeling, we compared the labeling of purified sfGFP
bearing 1, 2, or 4 with 10
equiv of 11. In-gel fluorescence imaging of the labeling
reaction as a function of time (Figure 4e)
indicated that the reaction of sfGFP-4 reached completion
in ∼1 h. In contrast, the labeling of sfGFP-1 and
sfGFP-2 was complete within the few seconds it took to
measure the first time point, demonstrating that the rate acceleration
of the BCN– and TCO–tetrazine reactions translates into
much more rapid protein labeling.To demonstrate the incorporation
of amino acids 1 and 2 into mammalian cells,
we transplanted the mutations allowing
the incorporation of 1 or 2 into a mammalian-optimized MbPylRS. Western blots showed that both 1 and 2 can be genetically encoded with high efficiency into proteins
in mammalian cells using the BCNRS/tRNACUA and TCORS/tRNACUA pairs, respectively (Figure 5a).
Figure 5
Site-specific
incorporation of 1 and 2 into proteins in
mammalian cells and their rapid and specific labeling
with tetrazine fluorophores. (a) Western blots demonstrate that the
expression of full-length mCherry(TAG)eGFP-HA is dependent on the
presence of 1 or 2 and tRNACUA. BCNRS and TCORS were FLAG-tagged. (b) Specific and ultrarapid labeling
of a cell-surface protein in live mammalian cells. Left: EGFR-GFP
bearing 1, 2, or 5 at position
128 is visible as green fluorescence at the membrane of transfected
cells. Middle: treatment of cells with 11 (400 nM) selectively
labels EGFR containing 1 or 2. Right: merged
green and red fluorescence images with differential interference contrast
(DIC). Cells were imaged 2 min after the addition of 11. (c) Specific and rapid labeling of a nuclear protein in live mammalian
cells. Left: jun-1-mCherry and jun-5-mCherry
are visible as red fluorescence in the nuclei of transfected cells.
Middle: Selective labeling of jun-1-mCherry with 17 (200 nM). Right: merged red and green fluorescence with
DIC. No labeling was observed for cells bearing jun-5-mCherry.
Site-specific
incorporation of 1 and 2 into proteins in
mammalian cells and their rapid and specific labeling
with tetrazine fluorophores. (a) Western blots demonstrate that the
expression of full-length mCherry(TAG)eGFP-HA is dependent on the
presence of 1 or 2 and tRNACUA. BCNRS and TCORS were FLAG-tagged. (b) Specific and ultrarapid labeling
of a cell-surface protein in live mammalian cells. Left: EGFR-GFP
bearing 1, 2, or 5 at position
128 is visible as green fluorescence at the membrane of transfected
cells. Middle: treatment of cells with 11 (400 nM) selectively
labels EGFR containing 1 or 2. Right: merged
green and red fluorescence images with differential interference contrast
(DIC). Cells were imaged 2 min after the addition of 11. (c) Specific and rapid labeling of a nuclear protein in live mammalian
cells. Left: jun-1-mCherry and jun-5-mCherry
are visible as red fluorescence in the nuclei of transfected cells.
Middle: Selective labeling of jun-1-mCherry with 17 (200 nM). Right: merged red and green fluorescence with
DIC. No labeling was observed for cells bearing jun-5-mCherry.To investigate whether the rapid BCN–tetrazine
ligation
would provide advantages for site-specific labeling of proteins on
mammalian cells, we expressed an epidermal growth factor receptor
(EGFR)–GFP fusion bearing an amber codon at position 128 (EGFR(128TAG)GFP) in HEK-293 cells containing the BCNRS/tRNACUA pair cultured in the presence of 1 (0.5 mM).
Full-length EGFR-1-GFP was produced in the presence of 1, resulting in bright green fluorescence at the cell membrane.
To label 1 with tetrazine–fluorophore conjugates,
we incubated cells with 11 (400 nM), changed the medium,
and imaged the red fluorescence arising from TAMRA labeling. The TAMRA
fluorescence colocalized nicely with the cell-surface EGFR-GFP fluorescence.
Clear labeling of cells bearing EGFR-1-GFP was observed
within 2 min, the first time point we could measure; additional time
points demonstrated that the labeling was saturated within 2 min (Figure 5b and Figures S11–S14); similar results were obtained with 12. Incorporation
of 2 into the EGFR–GFP fusion led to similarly
rapid and efficient labeling with 11 (Figure 5b and Figures S15 and S16). In contrast, it took 2 h before we observed any specific labeling
of cells bearing EGFR-4-GFP under identical conditions
(Figure S14).[7] In control experiments we observed neither labeling of cells bearing
EGFR-5-GFP nor nonspecific labeling of cells that did
not express EGFR-GFP. We observed weak but measurable labeling of
EGFR-GFP expressed in HEK-293 cells from EGFR(128TAG)GFP in the presence of the BCNRS/tRNACUA pair and 3 (Figure S17). These observations are
consistent with the isomerization of a fraction of 3 in
mammalian cells and with our observations in .To demonstrate the rapid labeling of an intracellular protein
in
mammalian cells, we expressed a transcription factor, jun, with a
C-terminal mCherry fusion from a gene bearing an amber codon in the
linker between JunB (jun) and mCherry. In the presence of amino acid 1 and the BCNRS/tRNACUA pair, the jun-1-mCherry protein was produced in HEK cells and, as expected, localized
in the nuclei of cells (Figure 5c and Figure S18). Labeling with cell-permeable conjugate 17 (200 nM) resulted in green fluorescence that colocalized
nicely with the mCherry signal at the first time point analyzed (after
15 min of labeling and 90 min of washing). No specific labeling was
observed in nontransfected cells in the same sample or in control
cells expressing jun-5-mCherry, further confirming the
specificity of intracellular labeling.In conclusion, we have
reported the exceptionally rapid fluorogenic
reaction of BCN with a range of tetrazines under aqueous conditions
at room temperature. The reaction between a strained alkene (e.g.,
a norbornene or trans-cyclooctene) and a tetrazine
may lead to a mixture of diastereomers and regioisomers as well as
isomers from dihydropyridazine isomerization.[3,4] In
contrast, the BCN–tetrazine reaction leads to the formation
of a single product. This will be an advantage in applications where
homogeneity in the orientation of probe attachment may be important,
including single-molecule spectroscopy, super-resolution microscopy,
and FRET approaches.We have demonstrated the efficient site-specific
incorporation
of 1 and 2 into proteins in and mammalian cells and their efficient,
specific, fluorogenic, and rapid labeling with tetrazine fluorophore
conjugates. While we have demonstrated the advantages of this approach
in vitro, in , and in live mammalian
cells, the ability to incorporate unnatural amino acids in using the PylRS/tRNACUA pair[29] suggests that it may be possible to extend the
labeling approach described here to site-specific protein labeling
in animals.
Authors: Jason L Seitchik; Jennifer C Peeler; Michael T Taylor; Melissa L Blackman; Timothy W Rhoads; Richard B Cooley; Christian Refakis; Joseph M Fox; Ryan A Mehl Journal: J Am Chem Soc Date: 2012-02-01 Impact factor: 15.419
Authors: Duy P Nguyen; Hrvoje Lusic; Heinz Neumann; Prashant B Kapadnis; Alexander Deiters; Jason W Chin Journal: J Am Chem Soc Date: 2009-07-01 Impact factor: 15.419
Authors: Arnaud Gautier; Duy P Nguyen; Hrvoje Lusic; Wenlin An; Alexander Deiters; Jason W Chin Journal: J Am Chem Soc Date: 2010-03-31 Impact factor: 15.419
Authors: Satpal Virdee; Prashant B Kapadnis; Thomas Elliott; Kathrin Lang; Julia Madrzak; Duy P Nguyen; Lutz Riechmann; Jason W Chin Journal: J Am Chem Soc Date: 2011-06-28 Impact factor: 15.419
Authors: Riddha Das; Ryan F Landis; Gulen Yesilbag Tonga; Roberto Cao-Milán; David C Luther; Vincent M Rotello Journal: ACS Nano Date: 2018-12-10 Impact factor: 15.881
Authors: Ivana Nikić; Jun Hee Kang; Gemma Estrada Girona; Iker Valle Aramburu; Edward A Lemke Journal: Nat Protoc Date: 2015-04-23 Impact factor: 13.491
Authors: Russell J Ernst; Toke P Krogager; Elizabeth S Maywood; Roberto Zanchi; Václav Beránek; Thomas S Elliott; Nicholas P Barry; Michael H Hastings; Jason W Chin Journal: Nat Chem Biol Date: 2016-08-29 Impact factor: 15.040