Yulin Tian1, Marco Paolo Jacinto1, Yu Zeng2, Zhipeng Yu1, Jun Qu3, Wenshe R Liu2, Qing Lin1. 1. Department of Chemistry, State University of New York at Buffalo , Buffalo, New York 14260, United States. 2. Department of Chemistry, Texas A&M University , College Station, Texas 77845, United States. 3. Department of Pharmaceutical Sciences, State University of New York at Buffalo , Buffalo, New York 14260, United States.
Abstract
The genetically encoded photo-cross-linkers promise to offer a temporally controlled tool to map transient and dynamic protein-protein interaction complexes in living cells. Here we report the synthesis of a panel of 2-aryl-5-carboxytetrazole-lysine analogs (ACTKs) and their site-specific incorporation into proteins via amber codon suppression in Escherichia coli and mammalian cells. Among five ACTKs investigated, N-methylpyrroletetrazole-lysine (mPyTK) was found to give robust and site-selective photo-cross-linking reactivity in E. coli when placed at an appropriate site at the protein interaction interface. A comparison study indicated that mPyTK exhibits higher photo-cross-linking efficiency than a diazirine-based photo-cross-linker, AbK, when placed at the same location of the interaction interface in vitro. When mPyTK was introduced into the adapter protein Grb2, it enabled the photocapture of EGFR in a stimulus-dependent manner. The design of mPyTK along with the identification of its cognate aminoacyl-tRNA synthetase makes it possible to map transient protein-protein interactions and their interfaces in living cells.
The genetically encoded photo-cross-linkers promise to offer a temporally controlled tool to map transient and dynamic protein-protein interaction complexes in living cells. Here we report the synthesis of a panel of 2-aryl-5-carboxytetrazole-lysine analogs (ACTKs) and their site-specific incorporation into proteins via amber codon suppression in Escherichia coli and mammalian cells. Among five ACTKs investigated, N-methylpyrroletetrazole-lysine (mPyTK) was found to give robust and site-selective photo-cross-linking reactivity in E. coli when placed at an appropriate site at the protein interaction interface. A comparison study indicated that mPyTK exhibits higher photo-cross-linking efficiency than a diazirine-based photo-cross-linker, AbK, when placed at the same location of the interaction interface in vitro. When mPyTK was introduced into the adapter protein Grb2, it enabled the photocapture of EGFR in a stimulus-dependent manner. The design of mPyTK along with the identification of its cognate aminoacyl-tRNA synthetase makes it possible to map transient protein-protein interactions and their interfaces in living cells.
To map dynamic protein–protein
interactions in living cells, a powerful chemical strategy involves
the use of genetically encoded photo-cross-linkers that permanently
link transient protein–protein interaction complexes with a
burst of light. Based on the structures, the reported genetically
encoded photo-cross-linkers contain one of the three moieties: phenyl
azide such as pAzF,[1] benzophenone
such as pBpa,[2] and diazirine
such as AbK[3,4] and their derivatives (Chart ).[5−8] Although these genetically encoded photo-cross-linkers
have proven to be valuable in the study of protein structure and function,
they invariably cross-link with their interacting protein partners
with no selectivity for any particular residue as the photogenerated
reactive intermediate, i.e., the nitrene from phenyl azide, the diradical
from benzophenone and the carbene from diazirine, inserts into a proximal
C–H bond with appropriate distance and angle,[9] making it difficult to predict a prior suitable positions
for installation of the photo-cross-linker. In addition, the tandem
mass spectrometry-based mapping of the interaction interface is complicated
as any residue from the interacting protein partner can potentially
participate in the photo-cross-linking.
Chart 1
Genetically Encoded
Photo-Cross-Linkers
Recently, we reported a new photoaffinity label based on
2-aryl-5-carboxytetrazole
(ACT) with a size similar to benzophenone, which cross-links its target
proteins via addition with a proximal nucleophile near the active
site.[10] Because ACT exhibits ligand-dependent
selective photo-cross-linking, we envisioned that ACT may also serve
as a new class of genetically encoded photo-cross-linkers for mapping
transient protein–protein interaction interfaces. It is noted
that a biocompatible proximity-driven nucleophilic substitution reaction
between a genetically encoded Nε-fluoroacetyllysine and cysteine was reported recently for mapping
the protein–protein interaction interface.[11] However, the occurrence of native cysteine at the protein–protein
interaction interface is rather rare.[12] Herein, we report the synthesis of a panel of ACT-lysine analogs
(ACTK, Chart ), and
the identification of a new ACTK-specific pyrrolysyl-tRNA synthetase
for site-specific incorporation of ACTK into proteins in Escherichia coli and in mammalian cells. One of the
ACTK analogs, mPyTK, exhibited robust and site-selective photo-cross-linking
of a GST dimer in bacteria. In a comparison study, mPyTK showed significantly
higher cross-linking efficiency than AbK when both are incorporated
at the same location of GST. Moreover, the mPyTK-encoded adapter protein,
Grb2, showed a stimulus and position-dependent capture of its transient
interaction partner, epidermal growth factor receptor (EGFR), in mammalian
cells.Because pyrrolysyl-tRNA synthetase (PylRS) and its variants
have
shown tremendous versatility in charging various lysine derivatives
into proteins site-selectively in bacteria, yeast and mammalian cells,[13] we decided to append ACT motif onto the ε-amino
group via simple acylation reaction. For the synthesis of ACTK analogs 1–4 in Chart , the key intermediate, ethyl 2-aryl-2H-tetrazole-5-carboxylate, was obtained through CuII-catalyzed cross-coupling of ethyl 2H-tetrazole-5-carboxylate
with the phenylaryliodonium salt[14] (Schemes
S1–S4 in Supporting Information).
For PhTK (5), the Kakehi tetrazole synthesis was followed
to give the ethyl 2-phenyl-2H-tetrazole-5-carboxylate
intermediate[15] (Scheme S5 in Supporting Information). Subsequent hydrolysis
and coupling with Fmoc-lysine·HCl followed by removal of the
protecting group afforded the ACT-lysine analogs 1–5 (Chart ).
To identify PylRS mutants that efficiently charge mPyTK (1), an MmPylRS library, in which four residues surrounding
the N-methylpyrrole-lysine side chain based on the
crystal structure of Methanosarcina mazei PylRS in complex with Pyl-AMP[16] (Y306,
L309, C348 and Y384; Figure a) were randomized, was subjected to successive rounds of
the positive and negative selections.[17] An MmPylRS mutant displaying the highest amber
suppression efficiency in E. coli was
identified that carries the Y306V/L309A/C348F/Y384F mutations and
is hereafter referred to as mPyTKRS. A plasmid pEvol-mPyTKRS encoding
mPyTKRS and tRNAPylCUA was then constructed
and showed site-specific incorporation of mPyTK into superfolder green
fluorescent protein (sfGFP) carrying an amber mutation at the Q204
position and a C-terminal His tag (Figure b). Interestingly, we found
that mPyTKRS also exhibits polyspecificity and allows site-specific
incorporation of other ACTK into sfGFP-Q204TAG with the level of expression
following the order of mPyTK > FTK > PhTK > PyTK > TTK
(Figure b). A 100
mL scale expression
of the sfGFP-Q204mPyTK mutant in BL21DE3 cells in the presence of
1 mM mPyTK gave a protein yield of 0.8 mg L–1; subsequent
mass spectrometry analysis verified the incorporation of mPyTK (Figure c and Figure S1 in
the Supporting Information (SI)). The lower
expression yield is likely due to reduced enzymatic activity of mPyTKRS
as a higher yield, 17.8 mg L–1, was obtained when
BocK was charged into the sfGFP-Q204 position using the wild-type
PylRS under identical conditions (Figure S2 in the SI).
Figure 1
Site-specific incorporation of ACT-based photo-cross-linkers
into
sfGFP via amber suppression. (a) A close-up view of the binding of
mPyTK (carbon skeleton shown in cyan tube model) or pyrrolysine-AMP
(carbon skeleton shown in yellow tube model) in the active site of MmPylRS (PDB code: 2ZIM). The residues in the vicinity of mPyTK selected for
randomization are marked in green tube model. (b) Anti-His6 Western blot of sfGFP-Q204ACTK mutants expressed in BL21DE3 cells
in the absence (−) and presence of 1 mM ACTK. (c) Deconvoluted
mass spectrum of sfGFP-Q204mPyTK: calcd, 27885.1 Da [M – Met
+ H+]; found, 27 884.2 ± 2.0 Da. The smaller
mass peak of 27767.1 Da corresponds to sfGFP-Q204W, the product of
near-cognate suppression by Trp.
Site-specific incorporation of ACT-based photo-cross-linkers
into
sfGFP via amber suppression. (a) A close-up view of the binding of
mPyTK (carbon skeleton shown in cyan tube model) or pyrrolysine-AMP
(carbon skeleton shown in yellow tube model) in the active site of MmPylRS (PDB code: 2ZIM). The residues in the vicinity of mPyTK selected for
randomization are marked in green tube model. (b) Anti-His6 Western blot of sfGFP-Q204ACTK mutants expressed in BL21DE3 cells
in the absence (−) and presence of 1 mM ACTK. (c) Deconvoluted
mass spectrum of sfGFP-Q204mPyTK: calcd, 27885.1 Da [M – Met
+ H+]; found, 27 884.2 ± 2.0 Da. The smaller
mass peak of 27767.1 Da corresponds to sfGFP-Q204W, the product of
near-cognate suppression by Trp.Next, we examined the efficiency of ACTK in photo-cross-linking Schistosoma japonicum glutathione-S-transferase (SjGST) homodimer (Figure a), a system used widely for
evaluating the genetically encoded photo-cross-linkers.[1,2] Inspection of the GST dimer structure (PDB code: 1Y6E) revealed that the
four interfacial residues, E52, F53, L66 and R74, are located at a
distance of ∼4–8 Å from a potential nucleophilic
residue from the opposite monomer (Figure S3 in SI). Thus, the GST mutants carrying mPyTK at these positions
were expressed in BL21DE3 cells and purified by Ni-NTA affinity chromatography.
To our surprise, only the GST-E52TAG mutant gave detectable expression
(with a yield of 2.8 mg L–1) based on SDS-PAGE,
which was confirmed by LC-MS data (Figure S4 in SI). To probe whether mPyTK enables photo-cross-linking in
vitro, the purified GST-E52mPyTK protein was irradiated with a hand-held
302 nm UV lamp for 0, 1, 5 and 15 min on ice, and covalent dimer formation
was monitored by SDS-PAGE. We observed time-dependent GST dimer formation
for the E52mPyTK mutant with ∼53% yield at 5 min, but not for
wild-type (Figure b), indicating that the ACT moiety is responsible for dimer cross-linking.
For comparison, we expressed the GST-E52AbK mutant using the wild-type
PylRS with a yield of 2.5 mg L–1 and examined its
photo-cross-linking reactivity. To our surprise, AbK exhibited very
weak reactivity as the dimer band was detected only by Western blot[4] (Figure S5 in SI)
but not Coomassie blue (Figure b). Interestingly, a higher cross-linking yield (∼79%)
was obtained when E. coli cells expressing
GST-E52mPyTK were directly photoirradiated, which could be attributed
to higher intracellular concentration of the GST mutant (Figure c and Figure S6 in SI). Because mPyTKRS can charge other ACTKs into
proteins site-selectively, we expressed SjGST mutants
carrying PyTK, FTK, TTK and PhTK, respectively, at position-52 and
compared their photo-cross-linking efficiency in E.
coli cells. Based on Western blot analysis, only mPyTK,
FTK and TTK showed the cross-linked dimer with the efficiency order
of mPyTK > TTK > FTK (Figure d), presumably due to the highest electron density
at N-methyl-pyrrole ring, which helps to stabilize
the photogenerated
carboxy-nitrile imine and increase its lifetime in biological media.
Figure 2
Photo-cross-linking
reactivity of the ACTK-encoded SjGST mutants. (a)
Scheme for photo-cross-linking of GST-ACTK to form
covalent GST dimer. The cross-linking sites are marked as red lines
between the two monomers. (b) Coomassie blue stained SDS-PAGE gels
after the wild-type (WT), E52mPyTK and E52AbK GST mutants were photoirradiated
for 0, 1, 5 or 15 min. 302 nm UV light was used for WT and E52mPyTK
mutant whereas 365 nm UV light was used for E52AbK mutant. Asterisk
indicates an impurity derived from the Ni-NTA affinity purification.
(c) Time-dependent photo-cross-linking of GST-E52mPyTK in E. coli. The GST monomer and photo-cross-linked dimer
in cell lysates were detected by Western blot using an anti-His6 antibody. (d) Comparing photo-cross-linking efficiency of
five ACTKs by Western blot using anti-His6 antibody. E. coli cells were photoirradiated with a 302 nm
UV lamp for 5 min before cell lysis.
Photo-cross-linking
reactivity of the ACTK-encoded SjGST mutants. (a)
Scheme for photo-cross-linking of GST-ACTK to form
covalent GST dimer. The cross-linking sites are marked as red lines
between the two monomers. (b) Coomassie blue stained SDS-PAGE gels
after the wild-type (WT), E52mPyTK and E52AbK GST mutants were photoirradiated
for 0, 1, 5 or 15 min. 302 nm UV light was used for WT and E52mPyTK
mutant whereas 365 nm UV light was used for E52AbK mutant. Asterisk
indicates an impurity derived from the Ni-NTA affinity purification.
(c) Time-dependent photo-cross-linking of GST-E52mPyTK in E. coli. The GST monomer and photo-cross-linked dimer
in cell lysates were detected by Western blot using an anti-His6 antibody. (d) Comparing photo-cross-linking efficiency of
five ACTKs by Western blot using anti-His6 antibody. E. coli cells were photoirradiated with a 302 nm
UV lamp for 5 min before cell lysis.Because ACT photoreacts with proximal nucleophilic residues
on
proteins, we sought to determine which nucleophilic residues on the
opposite GST monomer might react with the photogenerated carboxy-nitrile
imine intermediate. To this end, we built a model of the GST-E52mPyTK
and surveyed the chemical environment surrounding mPyTK. Four nucleophilic
residues (E92, M133, C139 and K141) were identified that are located
2.8–13.0 Å from the electrophilic nitrile iminecarbon
(Figure a). To determine
which one of these four residues participates the cross-linking reaction,
we mutated these residues to alanine and examined the photo-cross-linking
activity of the resulting mutants. We found the Glu92 → Ala
mutation completely abolished the covalent dimer formation whereas
other mutations had no effect (Figure b). This result is consistent with the proximity-driven
reactivity as E92 is closest to mPyTK with a calculated distance between
the carboxylateoxygen and the nitrile iminecarbon of 2.8 Å
(Figure a). Similar
results were obtained when the alanine scan was conducted with the
GST-E52-FTK mutant (Figure S7 in SI). We
propose a photo-cross-linking mechanism in which the E92 carboxylate
undergoes nucleophilic addition to the photogenerated carboxy-nitrile
imine followed by 1,4-acyl shift (Figure c). The rearranged cross-linked structure
was supported by tandem mass spectrometry data in which the two fragment
ions derived from the two discrete cleavage pathways were positively
identified (Figure S8 in SI). For comparison,
we performed photo-cross-linking studies with the same set of alanine
mutants of the GST-E52AbK (E92A, M133A, C139A and K141A). We did not
observe prominent attenuation in GST dimer formation; unexpectedly,
two alanine mutants (M133A and K141A) showed greater extent of dimer
formation than the GST-E52AbK alone (Figure S7 in SI), presumably due to a remodeling of the interaction interface
that alters the distance and/or angle of a suitable proximal C–H
bond.[9]
Figure 3
Identifying the mPyTK photo-cross-linking
site in GST. (a) A close-up
view of the nucleophilic residues from the opposite GST monomer (colored
in gray) surrounding mPyTK in GST monomer (colored in yellow). The
side chains of proximal resides (E92, M133, C139 and K141) are rendered
in tube model. (b) Coomassie blue stained SDS-PAGE gel showing UV-dependent
cross-linking of the GST-mPyTK alanine mutants. Asterisk indicates
an impurity derived from Ni-NTA affinity purification. The proteins
were photoirradiated with a hand-held 302 nm UV lamp on ice for 15
min before SDS-PAGE. (c) Proposed mechanism for mPyTK-mediated photo-cross-linking
of GST dimer. The two cleavage pathways are marked with blue and red
dash lines on the cross-linked structure (see Figure S8 in SI for details).
Identifying the mPyTK photo-cross-linking
site in GST. (a) A close-up
view of the nucleophilic residues from the opposite GST monomer (colored
in gray) surrounding mPyTK in GST monomer (colored in yellow). The
side chains of proximal resides (E92, M133, C139 and K141) are rendered
in tube model. (b) Coomassie blue stained SDS-PAGE gel showing UV-dependent
cross-linking of the GST-mPyTKalanine mutants. Asterisk indicates
an impurity derived from Ni-NTA affinity purification. The proteins
were photoirradiated with a hand-held 302 nm UV lamp on ice for 15
min before SDS-PAGE. (c) Proposed mechanism for mPyTK-mediated photo-cross-linking
of GST dimer. The two cleavage pathways are marked with blue and red
dash lines on the cross-linked structure (see Figure S8 in SI for details).To examine whether the genetically encoded mPyTK can capture
protein–protein
interaction complexes in mammalian cells, we first confirmed that
when mPyTKRS was expressed in HEK293T cells, it allowed site-selective
incorporation of mPyTK into mCherry-TAG-EGFP based on confocal fluorescence
microscopic analysis (Figure S9 in SI).
We then introduced mPyTK into Grb2, an adaptor protein that links
phosphorylated EGFR to the Ras signaling pathway through guanine nucleotide
exchange factor Sos[18] and was used previously
for evaluating the genetically encoded photo-cross-linkers in mammalian
cells.[19] Inspection of crystal structure
of the Grb2 SH2 domain in complex with a phosphotyrosine-containing
heptapeptide ligand[20] revealed that 9 residues
surrounding the ligand, A91, D104, V105, Q106, F108, K109, L111, W121
and N143, could be mutated to mPyTK for potential photo-cross-linking
with the SH2 ligands such as the cytoplasmic domain of EGFR (Figure a). Thus, HEK293T
cells were cotransfected with pCMV-mPyTKRS-tRNAPylCUA encoding mPyTK-specific PylRS and tRNAPylCUA, pCMV6-Grb2-myc-DDK encoding either wild-type or amber
mutant with TAG codon substituted at any of the 9 positions with a
C-terminal myc-DDK tag, and pcDNA3-EGFR-EGFP encoding full-length
EGFR and a C-terminal EGFP tag, and protein expressions were carried
out in DMEM medium supplemented with 10% FBS and 1 mM mPyTK. The cells
were starved for 12 h before EGF stimulation and subsequent photoirradiation
on ice. The cells were then lysed and the lysates were treated with
protein tyrosine phosphatase 1B to hydrolyze the phosphotyrosine to
obviate noncovalent Grb2-interacting protein complexes. The cross-linked
Grb2-interacting proteins were immunoprecipitated with anti-Flag antibody
and analyzed by sequential Western blots using antimyc and anti-EGFR
antibodies. The cross-linked Grb2–EGFR complex was detected
for 8 out 9 Grb2-mPyTK mutants; the D104mPyTK mutant gave the highest
photo-cross-linking yield followed by the V105mPyTK, Q106mPyTK and
N143mPyTK mutants (Figure b and Figure S10 in SI). The clustering
of the four mutants (D104, V105, Q106 and N143) in the same region
of Grb2 SH2 domain suggests that the mPyTK photo-cross-linker in these
mutants may react with the same nucleophilic residue on EGFR across
the interaction interface. Moreover, the photo-cross-linking of EGFR
is EGF stimulation and photoirradiation-dependent and is mediated
by mPyTK as the wild-type Grb2 did not exhibit covalent capture of
EGFR (Figure S11A in SI). The highest photo-cross-linking
yield for Grb2-D104mPyTK was observed when cells were stimulated with
EGF for 15 min (Figure S11B in SI), indicating
that the Grb2–EGFR interaction is transient and dynamic, resembling
some other known EGF-dependent protein–protein interactions.[21]
Figure 4
Photo-cross-linking of EGFR by the mPyTK-encoded Grb2
mutants in
mammalian cells. (a) A close-up view of Grb2-SH2 domain in complex
with phosphotyrosine-containing heptapeptide (PDB code: 1TZE), highlighting the
proximal residues surrounding pY (rendered in cyan tube model) that
were selected for the mPyTK mutagenesis. (b) Comparison of the efficiency
of the mPyTK-containing Grb2-SH2 domain mutants for photo-cross-linking
with EGFR in HEK293T cells. Cells were exposed to 302 nm UV light
for 5 min before lysis. The Grb2-cross-linked proteins in cell lysates
were immunoprecipitated with the anti-FLAG antibody-immobilized agarose
beads. The samples were analyzed SDS-PAGE/Western blot and probed
successively with antimyc antibody (top panel) and anti-EGFR antibody
(bottom panel).
Photo-cross-linking of EGFR by the mPyTK-encoded Grb2
mutants in
mammalian cells. (a) A close-up view of Grb2-SH2 domain in complex
with phosphotyrosine-containing heptapeptide (PDB code: 1TZE), highlighting the
proximal residues surrounding pY (rendered in cyan tube model) that
were selected for the mPyTK mutagenesis. (b) Comparison of the efficiency
of the mPyTK-containing Grb2-SH2 domain mutants for photo-cross-linking
with EGFR in HEK293T cells. Cells were exposed to 302 nm UV light
for 5 min before lysis. The Grb2-cross-linked proteins in cell lysates
were immunoprecipitated with the anti-FLAG antibody-immobilized agarose
beads. The samples were analyzed SDS-PAGE/Western blot and probed
successively with antimyc antibody (top panel) and anti-EGFR antibody
(bottom panel).In summary, we have synthesized
a panel of 2-aryl-5-carboxy-tetrazole-based
photo-cross-linkers and evolved a polyspecific pyrrolysyl-tRNA synthetase
variant that charges these ACT-lysine analogs site-selectively into
proteins in E. coli. One of the most
reactive, genetically encoded photo-cross-linkers, mPyTK, allowed
site-selective photo-cross-linking of a GST dimer in vitro and in E. coli cells. In comparison studies, mPyTK exhibited
a significantly higher efficiency than AbK when placed at the same
location of the GST dimer interface. Moreover, mPyTK enabled covalent
capture of the transient Grb2-interacting protein partner in mammalian
cells in a stimulus-dependent manner. In view of their higher photo-cross-linking
yields and unique cross-linking mechanism, these genetically encoded
ACT-lysines should offer a powerful chemical tool to map transient
protein–protein interactions underlining signal transduction
pathways. Because tandem MS analysis revealed the characteristic fragments
after photo-cross-linking, these ACT-lysines may also find applications
in structural analysis of protein complexes.
Authors: J Rahuel; B Gay; D Erdmann; A Strauss; C Garcia-Echeverría; P Furet; G Caravatti; H Fretz; J Schoepfer; M G Grütter Journal: Nat Struct Biol Date: 1996-07
Authors: N Li; A Batzer; R Daly; V Yajnik; E Skolnik; P Chardin; D Bar-Sagi; B Margolis; J Schlessinger Journal: Nature Date: 1993-05-06 Impact factor: 49.962
Authors: Yong Zheng; Cunjie Zhang; David R Croucher; Mohamed A Soliman; Nicole St-Denis; Adrian Pasculescu; Lorne Taylor; Stephen A Tate; W Rod Hardy; Karen Colwill; Anna Yue Dai; Rick Bagshaw; James W Dennis; Anne-Claude Gingras; Roger J Daly; Tony Pawson Journal: Nature Date: 2013-07-11 Impact factor: 49.962
Authors: Alexander R Nödling; Luke A Spear; Thomas L Williams; Louis Y P Luk; Yu-Hsuan Tsai Journal: Essays Biochem Date: 2019-07-03 Impact factor: 8.000