Dong Liu1, Wen-Hao Wu1, Ya-Jie Liu1, Xia-Ling Wu1, Yang Cao1, Bo Song2, Xiaopeng Li2, Wen-Bin Zhang1. 1. Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, P. R. China. 2. Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States.
Abstract
Recombinant proteins are traditionally limited to linear configuration. Herein, we report in vivo protein topology engineering using highly efficient, mechanically interlocking SpyX modules named AXB and BXA. SpyX modules are protein domains composed of p53dim (X), SpyTag (A), and SpyCatcher (B). The p53dim guides the intertwining of the two nascent protein chains followed by autocatalytic isopeptide bond formation between SpyTag and SpyCatcher to fulfill the interlocking, leading to a variety of backbone topologies. Direct expression of AXB or BXA produces protein catenanes with distinct ring sizes. Recombinant proteins containing SpyX modules are obtained either as mechanically interlocked obligate dimers if the protein of interest is fused to the N- or C-terminus of SpyX modules, or as star proteins if the protein is fused to both N- and C-termini. As examples, cellular syntheses of dimers of (GB1)2 (where GB1 stands for immunoglobulin-binding domain B1 of streptococcal protein G) and of four-arm elastin-like star proteins were demonstrated. Comparison of the catenation efficiencies in different constructs reveals that BXA is generally much more effective than AXB, which is rationalized by the arrangement of three domains in space. Mechanical interlocking induces considerable stability enhancement. Both AXB and BXA have a melting point ∼20 °C higher than the linear controls and the BXA catenane has a melting point ~2 °C higher than the cyclic control BX'A. Notably, four-arm elastin-like star proteins demonstrate remarkable tolerance against trypsin digestion. The SpyX modules provide a convenient and versatile approach to construct unconventional protein topologies via the "assembly-reaction" synergy, which opens a new horizon in protein science for stability enhancement and function reinforcement via topology engineering.
Recombinant proteins are traditionally limited to linear configuration. Herein, we report in vivo protein topology engineering using highly efficient, mechanically interlocking SpyX modules named AXB and BXA. SpyX modules are protein domains composed of p53dim (X), SpyTag (A), and SpyCatcher (B). The p53dim guides the intertwining of the two nascent protein chains followed by autocatalytic isopeptide bond formation between SpyTag and SpyCatcher to fulfill the interlocking, leading to a variety of backbone topologies. Direct expression of AXB or BXA produces protein catenanes with distinct ring sizes. Recombinant proteins containing SpyX modules are obtained either as mechanically interlocked obligate dimers if the protein of interest is fused to the N- or C-terminus of SpyX modules, or as star proteins if the protein is fused to both N- and C-termini. As examples, cellular syntheses of dimers of (GB1)2 (where GB1 stands for immunoglobulin-binding domain B1 of streptococcal protein G) and of four-arm elastin-like star proteins were demonstrated. Comparison of the catenation efficiencies in different constructs reveals that BXA is generally much more effective than AXB, which is rationalized by the arrangement of three domains in space. Mechanical interlocking induces considerable stability enhancement. Both AXB and BXA have a melting point ∼20 °C higher than the linear controls and the BXAcatenane has a melting point ~2 °C higher than the cyclic control BX'A. Notably, four-arm elastin-like star proteins demonstrate remarkable tolerance against trypsin digestion. The SpyX modules provide a convenient and versatile approach to construct unconventional protein topologies via the "assembly-reaction" synergy, which opens a new horizon in protein science for stability enhancement and function reinforcement via topology engineering.
In synthetic polymers,
chain topology and molecular architecture
have been important molecular parameters to tune their physical properties
and functions.[1−3] Controlling macromolecular topology is thus one of
the most commonly practiced molecular engineering approaches.[4,5] To a certain extent, it compensates for the lack of sequence control
in traditional polymers. An extraordinary example is the star polymer,
which emerged back in the early days of polymer science.[3] Multiple polymer chains emanating from a central
focal point constitute the basic structure of star polymers.[6] This unique topology imparts intriguing properties
to star polymers, including low viscosity, multiple surface functional
groups, low crystallinity, etc.[7,8] Nowadays, synthetic
star polymers can be prepared with delicate control over the arm number,
arm composition, molecular weight, polydispersity, etc.[7] Sophisticated architectures with heterogeneous
compositions are not uncommon.[9,10] By contrast, recombinant
proteins are usually synthesized by the cellular machinery in an extremely
precise fashion.[11] The stereochemistry,
sequence, chain length, and even 3D folded structure are all determined
on the genetic level and faithfully reproduced in the biological system.
The topology is, however, usually restrained to linear configuration.
To date, there are only a few examples of nonlinear proteins, including
cyclic proteins, tadpoles, stars, H-shape proteins, and catenanes,[12,13] yet its full potential remains to be explored. It is thus highly
desired to further expand the topological dimension for protein engineering.The significance of controlling the composition and topology of
the protein backbone has been increasingly recognized.[12] A purely synthetic approach to “backbone
engineering” has been developed via introducing compositions
other than amide bonds (e.g., esters and triazole linkages) into the
polypeptide backbone to tune their assembly and properties.[14,15] Knotted topology has been found to bring considerable stabilizing
effects with completely native polypeptide backbones.[16,17] Topological variations beyond linear configuration have included
cyclics, tadpoles, stars, and other branched proteins.[12] Among them, cyclic topology is the most commonly
encountered nonlinear protein topology in nature and has also been
extensively studied to date.[18−20] Various technologies have been
developed to prepare cyclic proteins, both in vivo(21) and in vitro,[22] including intein technology,[23] SpyTag-SpyCatcher chemistry,[24,25] sortase/butlase-mediated
ligation,[26−30] etc. Typical benefits from backbone cyclization include the dramatically
improved stability against both thermal denaturation and proteolytic
digestion. By contrast, there is only scarce reporting on star proteins
and other branched proteins in the literature.[12] A related example is the synthetic star polypeptides that
exhibit robust and potent antimicrobial activities.[31−33] This is in
line with their highly exposed multivalent surface functionalities.
Taking advantage of the highly reactive SpyTag-SpyCatcher chemistry,[34] isomeric three-arm and four-arm star proteins
have been prepared with high efficiency in vitro.[12] However, direct expression of star proteins in vivo remains a challenge.Supramolecular architecture
is another important, yet less explored
topic in protein science.[35−37] Nature has elegantly demonstrated
the power of supramolecular structure engineering in examples such
as the viral capsids where the stability is greatly enhanced by mutually
interlocked protein catenanes.[38] Dawson
et al. reported the solid state synthesis of a peptide catenane by
native chemical ligation and carefully studied the thermodynamics
of protein catenation.[39−41] Recently, protein catenation has also been achieved in vivo based on an elastin-like model protein, facilitated
by p53dim-guided chain intertwining and SpyTag-SpyCatcher ligation.[13] Being genetically encoded, this method can be
adapted to other proteins of interest as well. Protein catenation
brings in significant improvement in thermal stability and proteolytic
resistance.[13,40] Nevertheless, the catenation
efficiency was only moderate (∼60%) in previous reports. We
thus seek ways to further improve the catenation efficiency and create
even more complex protein topologies.In this contribution,
we report the design and application of genetically
encoded SpyX modules for the efficient cellular synthesis of diverse
mechanically interlocked protein architectures including obligate
dimers and star proteins (Figure ). The topology was demonstrated by partial digestion
experiments, and the catenation efficiency was evaluated for different
constructs. In general, the SpyX modules have high catenation efficiencies
even in the presence of fusion proteins at either end of the N- and
C-termini or both. The consequences of topology engineering have also
been examined. It was found that proteins with complex topologies
exhibit significantly improved stability against thermal denaturation
and trypsin digestion.
Figure 1
(A) Illustration of gene constructs for SpyX modules,
namely, AXB
and BXA, and fusion proteins containing the SpyX module at different
locations; the reactive carboxylic acid and amine groups in SpyX modules
are also shown; the yellow segment is the TEV cleavage site, a short
linker sequence, to facilitate subsequent proof of topology. (B) Cellular
expression of AXB and BXA produces protein catenanes with distinct
ring sizes; SpyX modules lead to the formation of obligate dimers
in the constructs of SpyX-(GB1)2 and (GB1)2-SpyX
and to star proteins in the constructs of EAXBE and EBXAE (PDB ID: 4MLS for SpyTag-SpyCatcher
complex, 5JXV for GB1, 1SAK for p53dim).
(A) Illustration of gene constructs for SpyX modules,
namely, AXB
and BXA, and fusion proteins containing the SpyX module at different
locations; the reactive carboxylic acid and amine groups in SpyX modules
are also shown; the yellow segment is the TEV cleavage site, a short
linker sequence, to facilitate subsequent proof of topology. (B) Cellular
expression of AXB and BXA produces protein catenanes with distinct
ring sizes; SpyX modules lead to the formation of obligate dimers
in the constructs of SpyX-(GB1)2 and (GB1)2-SpyX
and to star proteins in the constructs of EAXBE and EBXAE (PDB ID: 4MLS for SpyTag-SpyCatcher
complex, 5JXV for GB1, 1SAK for p53dim).
Results and Discussion
Molecular
Design of SpyX Modules
The design of SpyX
modules is based on p53dim (X, which forms an entwined dimer upon
folding[40,42]) and a genetically encoded SpyTag (A) and
SpyCatcher (B) reactive pair (which spontaneously form an isopeptide
bond upon reconstitution).[34] Depending
on their relative locations, there are two configurations, namely,
AXB and BXA (Figure ). To facilitate catenation, we consistently added short linkers
with sequence of GSGS before X and of SGGSG after X in both constructs
(Figures S1 and S2). Upon expression in vivo, the X domain shall form entwined dimer, bringing
A and B in vicinity to react and form the covalent catenanes. Hence,
AXB is expected to have concatenated rings with relatively smaller
sizes, and BXA would have almost the entire SpyCatcher domain concatenated.
Since the A-B complex is tightly folded,[43] the overall sizes of AXB and BXA should be similar. They all possess
two N-termini and two C-termini, allowing the preparation of diverse
mechanically interlocked protein topologies. When fusion proteins
are tethered at one single terminus, obligate dimers will be obtained.
When fusion proteins are tethered on both termini, four-arm star proteins
will be produced (Figure ). To probe the topology, a proteolytic site recognized and
cleaved by tobacco etch virus (TEV) protease was included in both
designs near the X domain. Two mutants were also designed as controls.
The K344P mutant of the p53dim domain (X′, where the lysine
at position 344 is mutated to proline) could not form the entwined
dimer and exists as a monomer.[13,40] The D/A mutant of SpyTag
(A′) abolishes the chemical reactivity by changing the aspartic
acid to nonreactive alanine (Figures S1 and S2).[12] Therefore, AX′B and BX′A
are expected to give cyclic monomer controls whereas A′XB and
BXA′ serve as linear monomer controls. We anticipated that
by placing SpyTag, SpyCatcher, and p53dim in close neighborhood, the
catenation efficiency can be dramatically improved. The resulting
SpyX domains have relatively small overall sizes (∼24 kDa)
and thus may serve as mechanically interlocking protein modules.
Protein Catenanes from SpyX Modules
The genes were
designed, optimized, synthesized, and cloned to pQE80L vector. The
plasmids were used to transform Escherichia coli. BL21 strain and the expressions were all performed in 2XYT medium
at 30 °C with IPTG induction. Purifications were performed using
affinity chromatography following the standard protocol. The products
were analyzed by size-exclusion chromatography (SEC) and SDS–PAGE.
The characterization results of AXB and BXA as well as their mutants
are summarized in Figure . It is clear that both AXB and BXA were mainly obtained as
dimers in the crude products (Figure A). For AXB, there are some chain-extended oligomers,
whereas, for BXA, it is almost exclusively dimer. After purification
with SEC, the dimers of AXB and BXA show up as monomodal peaks at
retention volumes much lower than the cyclic monomer mutants (AX′B
and BX′A) (Figures B and 2C). Interestingly, the linear
monomer mutants (A′XB and BXA′) elute at almost the
same time as the catenanes. It indicates that, in solution, the linear
mutants exist as dimers due to the entwining of X and the complexation
of A′ and B regardless of covalent bond formation. It is also
the preorganization of A and B by X that promotes the highly efficient
catenation. The MALDI-TOF mass spectra also corroborated the dimer
formation (Figures D and S3). For AXB, it gives mainly a
molecular peak of [M + H]+ at m/z of 47163 that is consistent with the expected value of
47097 for AXB dimer within the range of error. The A′XB and
AX′B both show mainly one molecular peak assigned as [M + H]+ at m/z of 23523 and 23524,
respectively. For BXA, the molecular peak of [M + H]+ at m/z of 49265 was only observed at relatively
low abundance as compared to the major peak assigned as [M + 2H]2+ at m/z of 24646. We speculated
that BXA may have strong binding affinity to proton and, thus, mainly
forms doubly charged ions upon MALDI ionization.
Figure 2
(A) SEC overlay of the
crude expression products of AXB and BXA.
(B) SEC overlay of the purified products of AXB, AX′B, and
A′XB. (C) SEC overlay of the purified products of BXA, BX′A,
and BXA′. (D) MALDI-TOF mass spectrum of AXB. (E) SDS–PAGE
analysis of AXB, BXA, AX′B, BX′A, A′XB, and BXA′.
(F) SDS–PAGE analysis of partial digestion of AXB, AX′B,
and A′XB to prove the catenane topology (N, undigested control;
P, partially digested; F, fully digested).
(A) SEC overlay of the
crude expression products of AXB and BXA.
(B) SEC overlay of the purified products of AXB, AX′B, and
A′XB. (C) SEC overlay of the purified products of BXA, BX′A,
and BXA′. (D) MALDI-TOF mass spectrum of AXB. (E) SDS–PAGE
analysis of AXB, BXA, AX′B, BX′A, A′XB, and BXA′.
(F) SDS–PAGE analysis of partial digestion of AXB, AX′B,
and A′XB to prove the catenane topology (N, undigested control;
P, partially digested; F, fully digested).The dimer formation is also evident in SDS–PAGE for
both
AXB and BXA (Figure E). Theoretically, assuming that the linkers between A and X and
between X and B are sufficiently long and flexible, there will be
four possible products, namely, [2]catenane, cyclic dimer, knotted
cyclic dimer, and noncovalently associated rings (Figure S4). Among them, only the last one breaks into cyclic
monomers in SDS–PAGE. We used gel densitometry to quantify
the relative abundance of possible products. For AXB, the dimer is
a clean band, accounting for ∼92% of the mixture. There is
much more cyclic monomer (∼6%) than linear monomer (∼1%).
For BXA, the dimer band is smeared and there seem to be two distinct
bands which represent ∼61% and ∼31% of the total mixture,
respectively. The smearing could be due to a highly stable folded
structure that resists denaturation even in the presence of SDS.[13,43] While products of different topology may also contribute to multiple
smearing bands, we will show in the following section by limited TEV
digestion experiments that it is less likely. Compared to AXB, the
residual BXA monomer exists almost exclusively as cyclic monomers
(∼8%). It indicates that the monomeric rings are the major
side product in both cases, which could result either from premature
ring closure or from the formation of noncovalently associated rings
after chain entwining. The catenation efficiency is thus evaluated
as the product of dimer contents determined individually from SEC
profile and SDS–PAGE. It is (64% × 92%) for AXB and (96%
× 92%) for BXA, respectively.We further performed electrospray
ionization mass spectrometry
(ESI-MS) characterization combined with traveling wave ion mobility
(TWIM) separation, which is useful to differentiate species with identical m/z but different charges and sizes.[44] This powerful technique has been used to study
supramolecular aggregates,[45] peptide self-assembly,[46,47] protein conformation,[48,49] etc. In this work,
the proteins were first ionized and the ionic species were subjected
to ion mobility separation by their mass-to-charge (m/z) ratios and sizes/charges. The full ESI mass
spectra of AXB and BXA with charge states denoted are shown in Figure . It should be noted
that the signals of trace monomers may superimpose within that of
dimers. After ion mobility separation, the dimers were indeed found
to be the dominant species for both AXB and BXA (Figures C and 3D). The monomers were only detected in trace abundance. The signals
of monomers and dimers extracted from TWIM-MS spectra are shown in Figure S5. The results are consistent with the
findings in SDS–PAGE, again confirming high catenation efficiencies.
Figure 3
ESI-MS
characterizations of (A) AXB and (B) BXA and two-dimensional
TWIM-MS plot for (C) AXB and (D) BXA. For both samples, two species,
namely, monomer and dimer, were observed with dimer being the overwhelming
majority. The brightness of the color represents the abundance of
signals at different charge states.
ESI-MS
characterizations of (A) AXB and (B) BXA and two-dimensional
TWIM-MS plot for (C) AXB and (D) BXA. For both samples, two species,
namely, monomer and dimer, were observed with dimer being the overwhelming
majority. The brightness of the color represents the abundance of
signals at different charge states.To prove that the dimers are catenanes rather than cyclic
dimers,
we performed partial digestion experiments. Since TEV cleavage site
is placed inside the concatenated ring, treating proteins with TEV
protease would first lead to both cyclic monomer and linear monomer
upon limited proteolysis and further to linear monomer upon complete
digestion. Meanwhile, digestion of cyclic monomer controls and linear
monomer controls were also performed for comparison. The appearance
of cyclic monomer upon partial digestion is a clear evidence of the
interlocked catenane topology in both cases (Figures F and S6). By
contrast, the cyclic and linear controls were converted to relinearized
monomers and fragments, respectively, under identical conditions.
The same process was also followed by UPLC–ESI-MS (Figures S7 and S8). Taking AXB for example, the
purified sample shows up as a main peak at 47089, which matches the
expected value of 47097 for catenane. Consistent with SDS–PAGE
analysis, there are two very minor peaks with molecular weights of
23544 and 23562, which can be assigned as the cyclic and linear monomers,
respectively. Upon partial digestion, the amounts of cyclic and linear
monomers increased rapidly, especially the linear one. Eventually,
only the peak assigned to the linear monomer can be observed (Figure S7). Throughout the process, we could
not observe any peaks corresponding to the linear dimer with molecular
weight of 47114, which is the partially digested product of both simple
and knotted cyclic dimers. The UPLC–ESI-MS analysis of the
partially digested products of BXA also gives similar results (Figure S8). It suggests that the cyclic dimers
exist, if any, in extremely low abundance. Other indirect evidence
comes from the comparison between the AXB dimer and AX′B dimer
in SDS–PAGE (Figure E, lanes 2 and 4). The dimer of AX′B is most likely
a cyclic dimer, but it appears in a distinct position as compared
to the dimer of AXB. So the existence of cyclic dimers in AXB is less
likely. Therefore, both AXB and BXA are indeed mechanically interlocked
protein catenanes.
From Obligate Dimers to Star Proteins
Having established
that AXB and BXA are both very efficient in forming protein catenanes,
we further investigated how the introduction of various proteins would
affect the catenation efficiency. The protein can be added either
to the N-terminus or to the C-terminus of SpyX modules, or both. As
an example, immunoglobulin-binding domain B1 of streptococcal protein
G, a small folded protein known as GB1, has been used as the model
protein.[50] Two consecutive domains have
been added, leading to four constructs: (GB1)2-AXB, AXB-(GB1)2, (GB1)2-BXA, BXA-(GB1)2 (Figure S9). The proteins were expressed under
the same conditions as AXB and BXA. The SEC overlay of the crude expression
products is shown in Figure A. It can be seen that both AXB and BXA effectively lead to
the formation of dimers and the dimer content in the crude product
is already very high (up to 97% in BXA-(GB1)2). Since GB1
protein is known to exist as monomer, the products obtained here can
be regarded as obligate dimers of (GB1)2. After SEC purification,
the products show up as symmetric monomodal peaks, suggesting uniform
composition. The SDS–PAGE analysis reveals that there are also
trace amounts of monomers in the products, a scenario similar to that
in AXB or BXA. The catenation efficiencies are summarized in Table , which are generally
good. The MALDI-TOF mass spectra further prove the dimer formation
(Figure S10). For example, AXB-(GB1)2 shows a major molecular peak [M]+ of 74356, which
matches well with the calculated value of 74341 within the range of
error. The results suggest that the incorporation of folded protein
domains, either to the N- or to the C-terminus, does not affect the
mechanical interlocking of SpyX modules, leading to the formation
of obligate dimers. The results have more implications for engineering
protein activities that depend critically on the dimerization states,
such as 3CL protease,[51] or for creating
multifunctional proteins that can be used to make protein-based hydrogels.[52]
Figure 4
Molecular characterizations of mechanically
interlocked obligate
dimers and star polymers: (A) SEC overlay of the crude expression
products and (B) purified products of (GB1)2-AXB, AXB-(GB1)2, (GB1)2-BXA, BXA-(GB1)2. (C) SDS–PAGE
analysis of purified products of (GB1)2-AXB, AXB-(GB1)2, (GB1)2-BXA, BXA-(GB1)2. (D) SEC overlay
of the crude expression products of EAXBE and EBXAE. (E) SEC overlay
and (F) SDS–PAGE analysis of EAXBE, EA′XBE, EA′X′BE,
EBXAE, EBX′AE, EBX′A′E.
Table 1
Summary of Protein
Catenation Efficiencies
samples
SECa (%)
SDSb (%)
totalc (%)
AXB
64
92
59
BXA
96
92
88
AXB-(GB1)2
82
80
66
BXA-(GB1)2
97
86
83
(GB1)2-AXB
78
93
73
(GB1)2-BXA
92
83
76
EAXBE
84
85
71
EBXAE
86
89
77
Calculated from SEC profile based
on the integration area of the dimer peak.
Calculated from the SDS–PAGE
analysis based on gel densitometry analysis.
The overall catenation efficiency.
Calculated from SEC profile based
on the integration area of the dimer peak.Calculated from the SDS–PAGE
analysis based on gel densitometry analysis.The overall catenation efficiency.Molecular characterizations of mechanically
interlocked obligate
dimers and star polymers: (A) SEC overlay of the crude expression
products and (B) purified products of (GB1)2-AXB, AXB-(GB1)2, (GB1)2-BXA, BXA-(GB1)2. (C) SDS–PAGE
analysis of purified products of (GB1)2-AXB, AXB-(GB1)2, (GB1)2-BXA, BXA-(GB1)2. (D) SEC overlay
of the crude expression products of EAXBE and EBXAE. (E) SEC overlay
and (F) SDS–PAGE analysis of EAXBE, EA′XBE, EA′X′BE,
EBXAE, EBX′AE, EBX′A′E.We further challenge the capability of SpyX modules to mechanically
interlock fusion proteins by installing elastin-like protein domains
onto both termini of the SpyX domain. Elastin-like proteins are intrinsically
disordered proteins (IDPs) with random-coil-like conformations in
solution.[53−55] Unlike folded proteins such as GB1, IDPs usually
have no stable ordered secondary structure in solution.[56,57] They generally have much larger hydrodynamic volume than folded
protein counterparts. Hence, they are expected to exert considerable
steric hindrance on catenation owing to the excluded volume effects.
They are ideal models for evaluating the influence of fusion proteins
on catenation efficiency. An ELP with molecular weight of ∼8
kDa was attached to each of the terminus. Since the center core is
tightly folded, it is reasonable to consider the products as 4-arm
star polymers. Two star protein constructs were designed and denoted
as EAXBE and EBXAE, respectively (Figures , S11, and S12). Their cyclic and linear monomer controls were also designed using
the KP mutant of p53dim (X′) as well as the DA mutant of SpyTag
(A′). Unlike A′XB and BXA′, the linear controls
in this case (EA′X′BE and EBX′A′E) are
incapable of forming intertwined dimers in solution.The crude
products of EAXBE and EBXAE were first analyzed by SEC.
Although their molecular weights are identical, the samples elute
at distinct retention volumes, suggesting distinct hydrodynamic volumes
in solution. After SEC purification, the monomodal peaks of these
star proteins were compared to the corresponding cyclic and linear
controls (Figure E).
The results clearly demonstrate that both EAXBE and EBXAE are dimers
appearing at much lower retention volume relative to the cyclic and
linear mutants. Interestingly, the elution profiles of EAX′BE
and EA′X′BE overlap with each other, whereas the retention
volume of EBX′AE is obviously larger than that of EBX′A′E.
The SDS–PAGE analysis of the purified products reveals that
both EAXBE and EBXAE are dimers with apparent molecular weights at
∼100 and ∼120 kDa, respectively, whereas the cyclic
controls appear at ∼40 kDa and the linear control appears at
∼50 kDa. Although both proteins are 4-arm stars with identical
molecular weights, there are still minor differences between their
topologies. The difference is consistent with the SEC profiles, suggesting
that BXA has a more expanded, much larger structure than AXB. This
could be rationalized based on the arm distribution on the SpyX core,
which will be further elaborated later. In MALDI-TOF mass spectra,
only one peak could be observed for both EAXBE and EBXAE, which are
assigned as the molecular peak with two charges [M + 2H]2+ (Figure S13). This is probably because
the products may have strong binding affinity to proton and, thus,
mainly form doubly charged ions upon MALDI ionization. It may also
have something to do with the mechanically interlocked structure where
each of the two noncovalently interlocked segments bears one charge.
A similar phenomenon has been observed previously.[13] This evidence
demonstrates that star proteins can be effectively expressed in vivo using SpyX as an interlocking module to create the
core.
Discussions on Catenation Efficiency and the Effects of Topology
Engineering
We have shown that SpyX modules (AXB and BXA)
are highly efficient in forming mechanically interlocked architectures
regardless of the tethering position of the fused proteins. But the
catenation efficiency varies as a direct consequence of the 3D arrangement
of A, X, and B. In general, BXA is much more effective than AXB in
guiding the mechanical interlocking of protein chains. The dimer content
in the crude products can be almost quantitative (up to 97% for BXA-(GB1)2). We analyzed the crystal structures of X (PDB: 1SAK) and A–B
complex (PDB: 4MLS) and proposed possible models for AXB and BXA as shown in Figure . The chain topology
of p53dim and SpyTag/SpyCatcher complex mandates their configuration
in three-dimensional (3D) space. For p53dim, the intertwined structure
makes the N-terminus and C-terminus be at same side of the folded
structure. For A–B complex, since SpyTag and the first strand
of SpyCatcher are arranged in parallel β-sheet configuration,
the N-terminus of SpyTag and the N- and C-termini of SpyCatcher domain
are crowded on the one end of the β-barrel. This is in perfect
alignment with the configuration of BXA (Figure ), leading to a more compact structure and
a better positioning for efficient reaction. The catenation efficiency
is thus much higher. However, the configuration of AXB puts A and
B in antiparallel arrangement. It would require either one to fold
back in the opposite direction for reconstitution and reaction (Figure ). This slows down
the reaction and decreases the catenation efficiency. The trend is
consistent for all fusion proteins. For AXB alone, the slower reaction
between A and B leads to the accumulation of telechelic proteins,
and thus, the intermolecular reactions are promoted to give more chain-extended
products (Figure A).
The overall catenation efficiency was merely ∼59%. For fusion
proteins containing AXB, the tethered proteins exert significant steric
hindrance due to excluded volume effect to discourage the intermolecular
over intramolecular reactions, leading to higher catenation efficiency.
The effect is most evident if SpyTag is located between the protein
of interest and p53dim domain. Therefore, (GB1)2-AXB has
a higher catenation efficiency (∼73%) than AXB-(GB1)2 (∼66%). A similar trend has also been observed in the BXA
module, as shown by the higher catenation efficiency in BXA-(GB1)2 (∼83%) than (GB1)2-BXA (76%). This is somewhat
counterintuitive, yet not surprising since changing the location of
SpyTag to the middle of the chain has been previously shown to increase
the yield of tadpole proteins.[12] The 3D
configuration of AXB and BXA is the origin of the difference in retention
volumes of various fusion proteins. For AXB, both N- and C-termini
point to the same side of the β-barrel and the tethered proteins
are thus quite crowded, leading to a compact structure and a smaller
hydrodynamic volume. For BXA, the N-terminus and C-terminus are pointing
to the opposite ends of the β-barrel of the A–B complex,
and the tethered proteins are thus more expanded in 3D space, leading
to apparently higher hydrodynamic volume in SEC (Figure A). This difference is also
consistently observed in each pair of SpyX-containing fusion proteins.
Figure 5
Proposed
models for the configuration of AXB (A) and BXA (B) modules
in 3D space. The models are generated based on the PDB structure of
X (PDB: 1SAK) and A–B complex (PDB: 4MLS).
Proposed
models for the configuration of AXB (A) and BXA (B) modules
in 3D space. The models are generated based on the PDB structure of
X (PDB: 1SAK) and A–B complex (PDB: 4MLS).While the creation of different protein topologies itself
presents
an enormous challenge, a more fundamental question is “how
does the topology influence the properties and functions of proteins?”
To address this, we first examined the protein catenanes of AXB and
BXA in comparison to the linear and cyclic controls. As revealed by
DSC thermogram (Figure S14), both catenanes
and cyclic monomers exhibit a dramatic increase in melting point (Tm) (∼86 °C) as compared to the linear
monomers (∼62 °C). Thermal stability improvement from
cyclic protein to catenane is relatively minor, probably because the
protein is already very stable. The AXB and AX′B have almost
the same Tm, which is reasonable since
the cyclized portion is relatively small and the influence is minor.
By contrast, BXA and BX′A are both more stable than the corresponding
AXB and AX′B. In addition, BXA exhibits a Tm about 2 °C higher than BX′A. We also attempted
to follow the thermal denaturation by circular dichroism (CD) spectrometry.
The CD spectra at 30 °C are shown in Figure S15. The spectra of AXB and BXA are very similar to the parent
protein (the CnaB2).[58] The presence of
α-helices in p53dim adds to the signal at around 220 nm. By
contrast, the signal around 220 nm is much weaker in the mutants AX′B
and BX′A, indicating the absence of intertwined dimers in AX′B
and BX′A. The temperature-dependent CD spectra (Figure S16) qualitatively agree with the DSC
results.One prominent feature of star polymers is their spatially
defined,
compact 3D structure with multiple arms in space. We speculated that
such a three-dimensionally crowded arrangement would impart the star
proteins tremendous resistance toward proteolysis due to the steric
hindrance that prevents the protease to bind and cut. The proteolytic
digestion experiments were performed on the star proteins as well
as their cyclic and linear controls using trypsin under identical
conditions with equal amounts of proteins. All of the possible trypsin
cleavage sites in these proteins are shown in Figures S17 and S18. Most of the cleavage sites are located
at the AXB or BXA domain, which is around the middle of the chain.
Thus, trypsin digestion would mostly break the catenanes into monomers,
relinearize cyclic monomers, and convert linear monomers into fragments.
The digestion profiles are shown in Figures , S19, and S20. Since the bands of cyclic monomers are smeared in SDS–PAGE
and the positions of cyclic and relinearized monomers are very close
to each other, we could not accurately evaluate the progress of trypsin
digestion. Hence, the digestion of cyclic monomers is not included
for comparison. Meanwhile, it can be seen that star proteins are much
more stable than the linear controls. While only ∼20% of EAXBE
is degraded over 5 h, EBXAE is completely stable and shows no signs
of degradation at all. By contrast, the linear monomers are quickly
digested into fragments. In both cases, only ∼10% of the linear
monomers survived after 5 h of proteolysis. Considering that their
differences are mostly just on topology, the stability enhancement
by adopting the star topology is indeed remarkable.
Figure 6
Protein digestion profiles
of (A) EAXBE and EA′X′BE
and (B) EBXAE and EBX′A′E subjected to incubation with
trypsin.
Protein digestion profiles
of (A) EAXBE and EA′X′BE
and (B) EBXAE and EBX′A′E subjected to incubation with
trypsin.
Conclusions
In
summary, we have developed highly efficient, genetically encoded,
mechanically interlocking SpyX modules (AXB and BXA) for engineering
protein topology in vivo. The expression of fusion
proteins containing SpyX modules leads to the formation of a variety
of mechanically interlocked protein topologies, including protein
catenanes, obligate dimers, and star proteins. The clustering of p53dim
with SpyTag/SpyCatcher ensures efficient protein chain intertwining
and covalent fixation. The catenation efficiency was found to depend
on the configuration of three components as well as the fused protein
of interest. The configuration also influences the stability of the
final product. Generally, BXA has better catenation efficiency than
AXB and the products containing BXA are also more stable. There are
many advantages in using SpyX modules for protein engineering. First
of all, it leads to the formation of obligate dimers which would never
disassociate even upon high dilution and does not require the tethered
protein domain to possess specific protein–protein interactions.
Second, it allows the formation of star proteins simply by direct
expression, which is useful as multifunctional proteins. Finally,
topology may enhance protein stability. For example, star proteins
are shown to possess significantly enhanced stability against trypsin
digestion. The capability of SpyX modules for in vivo protein topology engineering is an extraordinary example demonstrating
the power of the “assembly-reaction” synergy. It expands
the scope of protein topology beyond linear configuration and serves
as a versatile platform to further engineer the functional properties
of proteins.
Authors: Giang K T Nguyen; Antony Kam; Shining Loo; Anna E Jansson; Lucy X Pan; James P Tam Journal: J Am Chem Soc Date: 2015-12-07 Impact factor: 15.419
Authors: Fei Sun; Wen-Bin Zhang; Alborz Mahdavi; Frances H Arnold; David A Tirrell Journal: Proc Natl Acad Sci U S A Date: 2014-07-21 Impact factor: 11.205
Authors: Summer L Bernstein; Nicholas F Dupuis; Noel D Lazo; Thomas Wyttenbach; Margaret M Condron; Gal Bitan; David B Teplow; Joan-Emma Shea; Brandon T Ruotolo; Carol V Robinson; Michael T Bowers Journal: Nat Chem Date: 2009-07 Impact factor: 24.427