Emma E Cawood1,2,3, Nicolas Guthertz1,3, Jessica S Ebo1,3, Theodoros K Karamanos1,3,4, Sheena E Radford1,3, Andrew J Wilson1,2. 1. Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, United Kingdom. 2. School of Chemistry, University of Leeds, Leeds LS2 9JT, United Kingdom. 3. School of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9JT, United Kingdom. 4. Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States.
Abstract
Protein-protein interactions (PPIs) are involved in many of life's essential biological functions yet are also an underlying cause of several human diseases, including amyloidosis. The modulation of PPIs presents opportunities to gain mechanistic insights into amyloid assembly, particularly through the use of methods which can trap specific intermediates for detailed study. Such information can also provide a starting point for drug discovery. Here, we demonstrate that covalently tethered small molecule fragments can be used to stabilize specific oligomers during amyloid fibril formation, facilitating the structural characterization of these assembly intermediates. We exemplify the power of covalent tethering using the naturally occurring truncated variant (ΔN6) of the human protein β2-microglobulin (β2m), which assembles into amyloid fibrils associated with dialysis-related amyloidosis. Using this approach, we have trapped tetramers formed by ΔN6 under conditions which would normally lead to fibril formation and found that the degree of tetramer stabilization depends on the site of the covalent tether and the nature of the protein-fragment interaction. The covalent protein-ligand linkage enabled structural characterization of these trapped, off-pathway oligomers using X-ray crystallography and NMR, providing insight into why tetramer stabilization inhibits amyloid assembly. Our findings highlight the power of "post-translational chemical modification" as a tool to study biological molecular mechanisms.
Protein-protein interactions (PPIs) are involved in many of life's essential biological functions yet are also an underlying cause of several human diseases, including amyloidosis. The modulation of PPIs presents opportunities to gain mechanistic insights into amyloid assembly, particularly through the use of methods which can trap specific intermediates for detailed study. Such information can also provide a starting point for drug discovery. Here, we demonstrate that covalently tethered small molecule fragments can be used to stabilize specific oligomers during amyloid fibril formation, facilitating the structural characterization of these assembly intermediates. We exemplify the power of covalent tethering using the naturally occurring truncated variant (ΔN6) of the human protein β2-microglobulin (β2m), which assembles into amyloid fibrils associated with dialysis-related amyloidosis. Using this approach, we have trapped tetramers formed by ΔN6 under conditions which would normally lead to fibril formation and found that the degree of tetramer stabilization depends on the site of the covalent tether and the nature of the protein-fragment interaction. The covalent protein-ligand linkage enabled structural characterization of these trapped, off-pathway oligomers using X-ray crystallography and NMR, providing insight into why tetramer stabilization inhibits amyloid assembly. Our findings highlight the power of "post-translational chemical modification" as a tool to study biological molecular mechanisms.
The regulated self-assembly
of proteins into ordered complexes
drives many biological processes, ranging from viral capsid formation
and actin polymerization, to DNA maintenance and repair.[1,2] However, self-assembly can also occur aberrantly as a result of
changes in the concentration,[3] environment,[4−6] primary sequence,[7,8] or post-translational processing[9−11] of a protein. Aberrant assembly events are associated with a range
of disorders, and they can involve polymerization of natively folded
protein molecules,[2] as in sickle cell anemia,[12] or aggregation which is accompanied by significant
conformational changes, as exemplified by amyloid diseases.[13,14] Understanding the molecular basis and consequences of such protein–protein
interaction (PPI) pathways and identifying methods for their modulation[15−20] therefore have implications for the treatment of disease,[21] as well as in the development of new biomaterials,
where protein self-assembly can be exploited to yield structures with
defined architectures or novel biomechanical properties.[22,23] However, manipulating and defining the mechanisms of self-assembly
is challenging, due to the transient nature and heterogeneity (in
mass and structure) of oligomeric intermediates.[24,25] The use of methods to trap specific oligomeric complexes can help
overcome these challenges and offers the opportunity to structurally
characterize otherwise transient intermediates, identify targets for
drug discovery, and develop new scaffolds for protein-based nanostructures.Here, we describe the use of disulfide tethering[26] to rapidly explore chemical space and identify “post-translational
chemical modifications”[27,28] that stabilize specific
PPIs associated with amyloid assembly. These site-specific, covalent
modifications may act in one of two ways: altering the surface properties
of the protein and/or covalently reinforcing a noncovalent protein–ligand
interaction. Using a naturally occurring, amyloidogenic variant of
β2-microglobulin (β2m)—the
ΔN6 variant—as a model system, we show that covalently
tethered fragments represent highly effective tools for tuning oligomer
populations and stabilizing particular species in self-assembly pathways.
By combining kinetic analysis with structural information obtained
from NMR and X-ray crystallography, the covalent modifications identified
here have led to structural and functional insights into the role
of tetramers in ΔN6 amyloid formation.Native monomeric
β2m has a seven-stranded immunoglobulin
fold and forms the noncovalently bound light chain of the major histocompatibility
class I complex.[29] Aberrant self-assembly
of β2m molecules into amyloid fibrils[30] is a hallmark of dialysis-related amyloidosis
(DRA).[31−33] The amyloidogenic variant of β2m
which is the focus of this paper—ΔN6—is formed
from the wild-type protein by proteolysis of its N-terminal six amino
acids and makes up ∼20–30% of β2m molecules
found in fibrils extracted from DRA patients.[34,35] ΔN6 is capable of rapid assembly into amyloid fibrils in vitro at near-neutral pH, through the association of
the dynamically structured monomers[36] into
dimers and hexamers that retain a nativelike fold[37] (Figure A). Subsequent conformational rearrangement of these oligomers into
cross-β structures leads to amyloid fibril formation and elongation.[38]
Figure 1
(A) Native monomeric ΔN6 (PDB 2XKU(36)) contains
seven antiparallel β-strands (labeled A–G), with a solvent-excluded
disulfide bond between strands B and F (yellow). The monomeric protein
is capable of associating into transient, nativelike dimers and hexamers
en route to amyloid fibrils (structures not drawn to scale).[37] The conversion of these nativelike oligomers
into fibrils requires structural rearrangement of the existing β-strands[38] and further self-assembly. ΔN6 tetramers
have been observed, but their structure and role in fibril formation
depend on the solution conditions.[37,86] (B) The two
most ligandable sites (pink) of ΔN6 identified by computational
solvent mapping are located adjacent to the BC (target site 1) and
DE (target site 2) loops. Three residues (S33, S52, and L65; purple)
at these two sites were substituted with cysteine in order to target
each pocket using the disulfide tethering strategy (see Figure A). The orientation of ΔN6
in the “Target site 1” panel is the same as for the
monomer shown in (A).
(A) Native monomeric ΔN6 (PDB 2XKU(36)) contains
seven antiparallel β-strands (labeled A–G), with a solvent-excluded
disulfide bond between strands B and F (yellow). The monomeric protein
is capable of associating into transient, nativelike dimers and hexamers
en route to amyloid fibrils (structures not drawn to scale).[37] The conversion of these nativelike oligomers
into fibrils requires structural rearrangement of the existing β-strands[38] and further self-assembly. ΔN6 tetramers
have been observed, but their structure and role in fibril formation
depend on the solution conditions.[37,86] (B) The two
most ligandable sites (pink) of ΔN6 identified by computational
solvent mapping are located adjacent to the BC (target site 1) and
DE (target site 2) loops. Three residues (S33, S52, and L65; purple)
at these two sites were substituted with cysteine in order to target
each pocket using the disulfide tethering strategy (see Figure A). The orientation of ΔN6
in the “Target site 1” panel is the same as for the
monomer shown in (A).
Figure 2
(A) Schematic
representation of the disulfide tethering method
used to compare noncovalent affinities of different fragments for
specific target sites on ΔN6. When using cocktails of fragments,
the compound which forms the most favorable noncovalent interactions
with the target protein while tethered by a disulfide bond will produce
the most stable (i.e., highest populated) covalent protein–fragment
adduct at equilibrium—this is reflected by a higher RZ score.
(B) Data from a disulfide tethering screen against three ΔN6
cysteine variants (S33C, S52C, and L65C) normalized as RZ scores.
For fragments which were present in more than one screening cocktail,
data are shown as the mean ± one standard deviation. Each cysteine
variant was present at 5 μM and was incubated for 24 h with
25 μM of each disulfide-linked fragment (in cocktails of five
fragments) and 500 μM βME, in 25 mM sodium phosphate,
pH 6.2, 2% v/v DMSO. Black circles are shown for fragments which were
synthesized but not included in the screening library due to poor
purity.
Recent structural models
of ΔN6 dimers and hexamers have
shown that the DE and BC loops of the protein are involved in both
oligomerization interfaces (Figure A).[37] The same regions have
also been identified at the interface of amyloid-competent ΔN6−β2m heterodimers,[39] β2m−β2m homodimers,[40] and inhibitory heterodimers formed between ΔN6 and a nonamyloidogenic
β2m ortholog (murine β2m).[39] These examples implicate the DE and BC loops,
and thereby also the associated β-strands, as mediators of oligomerization,
and thus targeting these regions (e.g., by small molecules) was anticipated
to yield tools for controlling and studying ΔN6 self-assembly.
The dynamic nature of ΔN6 monomers[36] and oligomers,[37] combined with the lack
of an obvious ligand binding pocket, has thus far hindered the development
of small molecule modulators of amyloid formation (although protein-based
modulators have been identified[41]). In
light of this challenge, we focused our efforts on the development
of covalent ligands to manipulate ΔN6 self-assembly, based on
the success of such compounds in targeting other challenging PPIs.[42−44]
Results and Discussion
Identifying Covalent Ligands for ΔN6
by Disulfide Tethering
Computational solvent mapping (using
the FTMap server[45]) was performed against
an NMR-derived conformational
ensemble of ΔN6[36] in order to predict
which regions of the protein are likely to be hot spots for small
molecule binding. The resulting data (Figure S1) highlighted two pockets adjacent to the DE and BC loops as promising
targets for ligand development (site 1, between the C and D strands;
and site 2, beneath the DE loop; Figure B). To identify possible covalent ligands
for these sites, a library of small molecules was screened using the
“disulfide tethering” method[26] (Figure A). First developed by Erlanson and co-workers as a
site-directed screening strategy,[46] this
approach uses disulfide bonds to covalently trap and assess the interaction
affinity of small molecules (typically fragments) which have bound
noncovalently to the protein of interest. Libraries of disulfide-functionalized
molecules are screened for their ability to form disulfide bonds with
cysteine variants of the target protein, as small molecules which
exhibit favorable noncovalent interactions near the free, solvent-exposed
cysteine residue will undergo thiol–disulfide exchange more
effectively due to their increased local concentration. The relative
population of different covalent protein−ligand complexes at
equilibrium can therefore be used as a proxy for the noncovalent affinity
of a given fragment for a particular region of the target protein.(A) Schematic
representation of the disulfide tethering method
used to compare noncovalent affinities of different fragments for
specific target sites on ΔN6. When using cocktails of fragments,
the compound which forms the most favorable noncovalent interactions
with the target protein while tethered by a disulfide bond will produce
the most stable (i.e., highest populated) covalent protein–fragment
adduct at equilibrium—this is reflected by a higher RZ score.
(B) Data from a disulfide tethering screen against three ΔN6
cysteine variants (S33C, S52C, and L65C) normalized as RZ scores.
For fragments which were present in more than one screening cocktail,
data are shown as the mean ± one standard deviation. Each cysteine
variant was present at 5 μM and was incubated for 24 h with
25 μM of each disulfide-linked fragment (in cocktails of five
fragments) and 500 μM βME, in 25 mM sodium phosphate,
pH 6.2, 2% v/v DMSO. Black circles are shown for fragments which were
synthesized but not included in the screening library due to poor
purity.Three single cysteine variants
of ΔN6 (S33C, S52C, and L65C; Figure B) were expressed
and purified (Figure S2) in order to monitor
small molecule binding at sites 1 and 2 by using the disulfide tethering
approach. A library of 84 symmetrical disulfides (designed with the
aid of molecular docking, as described in the Supporting Information) was prepared through the use of solid-phase
synthesis, and 76 of these fragments were sufficiently pure for screening
purposes (Figure S3) and were screened
against each ΔN6 cysteine variant in cocktails of five, in the
presence of excess reducing agent (β-mercaptoethanol, βME; Figure S4). The relative populations of the different
protein–fragment adducts were assessed at 24 h by electrospray
ionization mass spectrometry and normalized as robust Z (RZ) scores[47] (Figure B; Figures S5–S7), where
higher RZ scores were anticipated to reflect more favorable protein–fragment
interactions (Figure A). Comparison of the distribution of protein–fragment adducts
observed for the three ΔN6 cysteine variants to that of an unrelated,
largely helical control protein, MCL-1, showed that there was poor
correlation between the datasets for S52C– and L65C−ΔN6
with that for MCL-1 (median r = 0.37; Figure S6) and therefore that the identity of
the protein affects which protein–fragment adducts dominate
at equilibrium. The higher correlation between the datasets for S33C−ΔN6
and MCL-1 reflects the similar chemical nature of their binding pockets
(Figure S6). These observations suggest
that the preference of ΔN6 for particular fragments (as shown
in Figure B) is a
result of specific noncovalent interactions. RZ scores were therefore
used to report on the relative noncovalent affinities of tethered
fragments for a particular region on the surface of ΔN6.
Covalent
Functionalization and Ligand Binding Drive ΔN6
Tetramerization
To interrogate how covalent modification
of the different regions of ΔN6 affects aggregation, sedimentation
velocity analytical ultracentrifugation (SV-AUC) was used to assess
the oligomeric state of a series of individual protein–fragment
adducts (Figure A; Figures S8–S10). Fragments with a range
of RZ scores were selected for testing with each cysteine variant,
so as to distinguish between changes in oligomeric state which are
due to covalent modification of the protein (i.e., changes observed
for all samples of a given cysteine variant) versus changes which
arise from specific noncovalent protein–ligand interactions
(i.e., those observed only for covalently tethered fragments with
high RZ scores).
Figure 3
(A) Examples of fragments used to form the S52C–
and L65C–fragment
adducts studied by SV-AUC. (B–E) SV-AUC data collected for
L65C–fragment adducts (B, C) or S52C–fragment adducts
(D, E). Experiments were performed with 150 μM protein in 25
mM sodium phosphate at pH 6.2, 25 °C. Peaks in the c(s) distributions were assigned to monomer (m),
dimer (d), tetramer (t), or hexamer (h), based on previous studies[37] and predicted sedimentation coefficients for
these oligomers (calculated using the Svedberg equation[48]). Tetramer peak areas for a range of L65C–fragment
adducts were found to correlate with the RZ score of the attached
fragment (C). The relationship between fragment RZ score and tetramer
peak area for S52C–fragment adducts was less clear, and other
properties of the fragments may play a role in modulating tetramer
populations (E) (see Figure S14).
(A) Examples of fragments used to form the S52C–
and L65C–fragment
adducts studied by SV-AUC. (B–E) SV-AUC data collected for
L65C–fragment adducts (B, C) or S52C–fragment adducts
(D, E). Experiments were performed with 150 μM protein in 25
mM sodium phosphate at pH 6.2, 25 °C. Peaks in the c(s) distributions were assigned to monomer (m),
dimer (d), tetramer (t), or hexamer (h), based on previous studies[37] and predicted sedimentation coefficients for
these oligomers (calculated using the Svedberg equation[48]). Tetramer peak areas for a range of L65C–fragment
adducts were found to correlate with the RZ score of the attached
fragment (C). The relationship between fragment RZ score and tetramer
peak area for S52C–fragment adducts was less clear, and other
properties of the fragments may play a role in modulating tetramer
populations (E) (see Figure S14).In the absence of covalent modification, ΔN6
is approximately
50% monomeric under the conditions employed (150 μM protein,
pH 6.2, 25 °C), with dimers, tetramers, and hexamers representing
the majority of faster sedimenting species (Figure S2), consistent with previous reports.[37] Tethering of high RZ score fragments to all three ΔN6 cysteine
variants was found to increase the population of tetramers (representative
data in Figure B,D;
see also Figures S11–S13). Tethering
with low RZ score fragments, however, produced a variety of results,
which depended both on the cysteine variant and the ligand employed.For L65C–fragment adducts, the area of the tetramer peak
in the continuous sedimentation coefficient (c(s)) distributions showed a positive correlation with the
RZ score of the tethered fragment (Figure C; Figures S13, S14): a covalently attached fragment with a low RZ score (βME)
produced an oligomer distribution which was similar (albeit not identical)
to that of ΔN6 alone (tetramer peak areas of 5 and 18%, respectively),
while tethered fragments with high RZ scores produced significantly
larger tetramer peaks (e.g., a 45% tetramer peak area was observed
for the adduct between L65C and disulfide 54, named L65C–S54; Figure B,C). The only predicted ligandable pocket that is near residue 65
of ΔN6 is target site 2 (Figure B), suggesting that the formation of tetramers by L65C–fragment
adducts is driven by noncovalent binding to this pocket.The
nature of the relationship between tetramer population and
fragment RZ score for the S52C– and S33C–fragment adducts
was less clear. All S52C–fragment adducts produced tetramer
peak areas of ≥43% (with most between 86 and 95%), regardless
of RZ score (Figure D,E; Figures S12, S14), implying that
a different property of the fragments was driving the changes in oligomeric
state at this site. Although a limited range of fragment sizes were
used in screening, the observed tetramer populations for the different
S52C–fragment adducts are consistent with fragment size (and
thus the surface topography of protein–fragment adducts) being
a contributing factor (Figure S14). The
S33C–fragment adducts analyzed were generally polydisperse,
with only two samples (of seven analyzed) producing c(s) distributions which were readily interpretable
(Figure S11): S33C−βME and
S33C–S79, the latter of which showed an increased
tetramer peak area (43%) over the former (16%). These data are insufficient
to ascertain which characteristic (characteristics) of the covalent
fragments influences (influence) the tetramer population of S33C–fragment
adducts, but they show that, as for L65C and S52C, the addition of
specific covalent fragments to S33C can be used to modulate oligomer
distributions.
Tetramer Stabilization Inhibits Amyloid Assembly
The
ability of covalently tethered fragments to generate defined oligomer
populations offered the opportunity to explore the effect of tetramer
stabilization on ΔN6 amyloid assembly. Protein–fragment
adducts with a range of tetramer populations were added to preformed
ΔN6 amyloid fibrils, and the ability of these samples to elongate
the fibril seeds was analyzed using the fluorescent, fibril-binding
dye, thioflavin T (ThT).[49,50] By using this strategy,
fibril elongation by each sample will be templated by seeding to a
common fibril product, irrespective of the covalent ligand, enabling
direct comparison of the initial rates of fibril growth for different
protein–fragment adducts. It should be noted that, under the
conditions employed, spontaneous (i.e., unseeded) fibril formation
does not occur on the time scale of the experiment.For all
three ΔN6 cysteine variants, the observed initial elongation
rates were lower for samples with higher tetramer populations (Figure ; Figure S15A,B). The most dramatic change in fibril elongation
was seen for S52C–S54 (86% tetramer peak area
in the c(s) distribution), where
the rate of elongation was reduced more than 30-fold relative to ΔN6
(Figure E,F). Global
linear regression analysis across all samples showed that there is
a negative correlation (r = −0.78) between
the observed initial fibril elongation rate and tetramer population,
and extrapolation of the linear regression line to 100% tetramer corresponds
precisely to an elongation rate of zero (Figure S15B). Together, these observations indicate that the tetramers
formed by the S33C–, S52C–, and L65C–fragment
adducts lie off-pathway to amyloid fibril formation. Prediction of
elongation rates using kinetic schemes in which tetramers lie on-
or off-pathway to fibrils also supports this conclusion (Figure S16). The fibril elongation data thus
highlight tetramer stabilization as a strategy to inhibit ΔN6
amyloid formation. In addition, they support a model in which covalent
functionalization and covalent reinforcement of ligand binding around
the DE and BC loops can be used to slow the progression of seeded
amyloid formation by modulating the stability and population distribution
of oligomers.
Figure 4
Change in ThT fluorescence over time for various protein–fragment
adducts (150 μM) in the presence of ΔN6 fibril seeds (15
μM monomer equivalents) shows that the ability of samples to
elongate fibrils depends on the tetramer population. ThT fluorescence
curves in (A), (C), and (E) are shown as the median curve, with the
highest and lowest values shaded in gray (n = 3).
The relationship between the initial elongation rates calculated from
these curves and the tetramer peak areas from the corresponding c(s) distributions are shown in (B), (D),
and (F). Error bars (standard deviation) are shown for all data points
(mean values) in (B), (D), and (F)—those error bars that are
not visible are smaller than the displayed data point. Additional
ThT fluorescence curves associated with (B) which were not shown in
(A) (i.e., the white circles in (B)) are shown in Figure S15A. All experiments were performed under quiescent
conditions in 25 mM sodium phosphate at pH 6.2, 25 °C. Elongation
rates for unliganded ΔN6 are shown by black crosses in (B),
(D), and (F).
Change in ThT fluorescence over time for various protein–fragment
adducts (150 μM) in the presence of ΔN6 fibril seeds (15
μM monomer equivalents) shows that the ability of samples to
elongate fibrils depends on the tetramer population. ThT fluorescence
curves in (A), (C), and (E) are shown as the median curve, with the
highest and lowest values shaded in gray (n = 3).
The relationship between the initial elongation rates calculated from
these curves and the tetramer peak areas from the corresponding c(s) distributions are shown in (B), (D),
and (F). Error bars (standard deviation) are shown for all data points
(mean values) in (B), (D), and (F)—those error bars that are
not visible are smaller than the displayed data point. Additional
ThT fluorescence curves associated with (B) which were not shown in
(A) (i.e., the white circles in (B)) are shown in Figure S15A. All experiments were performed under quiescent
conditions in 25 mM sodium phosphate at pH 6.2, 25 °C. Elongation
rates for unliganded ΔN6 are shown by black crosses in (B),
(D), and (F).
Structural Characterization
of Stabilized Tetramers
To understand how the tetramers generated
by the covalent modification
of ΔN6 are structurally related to the previously characterized
ΔN6 dimers and hexamers,[37] as well
as how and why covalent modification around the DE and BC loops leads
to tetramerization, X-ray crystallography and solution-state NMR structural
studies were performed.S52C–S54 was found
to crystallize as a ring-shaped tetramer with a solvent-accessible
central cavity, formed from two asymmetric units each containing two
ΔN6 molecules in an antiparallel orientation (Figure A–C; Table S1). As seen for the on-pathway ΔN6 dimer and
hexamer,[37] the protein subunits in the
crystallized tetramer are highly nativelike, but notably with perturbations
to the DE and BC loops as well as a shift of the D strand from a β-bulge
to a straight β-strand (Figure D; Figure S17A). These structural
changes appear to be linked to each other and to tetramerization:
straightening of the D strand allows for protein–protein interactions
to occur via β-sheet augmentation, forming one of the two interaction
interfaces in the tetramer (Figure B), and additionally results in a rearrangement of
phenylalanine residues at the top of the D, E, and B strands (Phe56,
Phe62, and Phe30, respectively) which is accommodated by rearrangement
of the BC loop (Figure S17B). This movement
of residues around the BC and DE loops allows several key protein–protein
contacts to be made within the second interaction interface, which
occurs through a face-to-face, antiparallel interaction of the ABED
β-sheets (Figure C; Figure S17C).
Figure 5
(A–C) Crystal
structure of the S52C–S54 tetramer (diffracted
to 2.4 Å; PDB 7AFV), formed from two asymmetric units: subunits
1a/1b (gray/pale blue) and subunits 2a/2b (pale green/dark green).
Protein subunits interact via two interfaces: the D strand interface
(B) and the ABED sheet interface (C). Four copies of the covalent
−S54 fragment (shown as spheres in A and C) are
present in this complex and lie within the ABED sheet interface, in
a central solvent-accessible cavity (C). (D) Per-residue, pairwise
RMSD values for the S52C–S54 tetramer crystal
structure (subunit 1a/2a) compared with the monomeric ΔN6 NMR
ensemble (30 structures; PDB 2XKU(36)), reported for all non-hydrogen
atoms as the mean RMSD (±standard deviation). Residues which
are ≤6 Å from another protein subunit or a −S54 fragment within the S52C–S54 tetramer
structure are additionally highlighted by green or purple bars, respectively.
The locations of the β-strands in the S52C–S54 tetramer crystal structure are shown above the plot. (E) Each −S54 fragment can bind in one of two binding sites around the
site 2 pocket: by π stacking between two Tyr26 residues within
the ABED sheet interface (top) or by π stacking against Tyr67
(bottom). The 2Fo–Fc electron density map (contoured at 1.1σ)
is shown in cyan for the displayed amino acid side chains and organic
molecules. (F) Combined 1H–15N chemical
shift differences between S52C–S54 and L65C−βME
HMQC NMR spectra—these samples had tetramer peak areas of 86
and 5%, respectively, in their c(s) distributions. Residues which were not visible (or could not be
confidently assigned based on comparison with previous ΔN6 assignments[36]) for either sample are shown as black circles.
Residues which were visible for L65C−βME but were broadened
beyond detection for S52C–S54 are shown by purple
bars. Resonances which were visible in both spectra are colored according
to the magnitude of the chemical shift perturbation (CSP) relative
to the standard deviation of the dataset (σ): CSP ≥ 2σ,
red; σ ≤ CSP < 2σ, yellow; CSP < σ,
gray. Four main regions (labeled 1–4) show either significant
changes in the position of 1H–15N resonances
or complete loss of these resonances in samples with higher tetramer
populations (see also Figure S21).
(A–C) Crystal
structure of the S52C–S54 tetramer (diffracted
to 2.4 Å; PDB 7AFV), formed from two asymmetric units: subunits
1a/1b (gray/pale blue) and subunits 2a/2b (pale green/dark green).
Protein subunits interact via two interfaces: the D strand interface
(B) and the ABED sheet interface (C). Four copies of the covalent
−S54 fragment (shown as spheres in A and C) are
present in this complex and lie within the ABED sheet interface, in
a central solvent-accessible cavity (C). (D) Per-residue, pairwise
RMSD values for the S52C–S54 tetramer crystal
structure (subunit 1a/2a) compared with the monomeric ΔN6 NMR
ensemble (30 structures; PDB 2XKU(36)), reported for all non-hydrogen
atoms as the mean RMSD (±standard deviation). Residues which
are ≤6 Å from another protein subunit or a −S54 fragment within the S52C–S54 tetramer
structure are additionally highlighted by green or purple bars, respectively.
The locations of the β-strands in the S52C–S54 tetramer crystal structure are shown above the plot. (E) Each −S54 fragment can bind in one of two binding sites around the
site 2 pocket: by π stacking between two Tyr26 residues within
the ABED sheet interface (top) or by π stacking against Tyr67
(bottom). The 2Fo–Fc electron density map (contoured at 1.1σ)
is shown in cyan for the displayed amino acid side chains and organic
molecules. (F) Combined 1H–15N chemical
shift differences between S52C–S54 and L65C−βME
HMQC NMR spectra—these samples had tetramer peak areas of 86
and 5%, respectively, in their c(s) distributions. Residues which were not visible (or could not be
confidently assigned based on comparison with previous ΔN6 assignments[36]) for either sample are shown as black circles.
Residues which were visible for L65C−βME but were broadened
beyond detection for S52C–S54 are shown by purple
bars. Resonances which were visible in both spectra are colored according
to the magnitude of the chemical shift perturbation (CSP) relative
to the standard deviation of the dataset (σ): CSP ≥ 2σ,
red; σ ≤ CSP < 2σ, yellow; CSP < σ,
gray. Four main regions (labeled 1–4) show either significant
changes in the position of 1H–15N resonances
or complete loss of these resonances in samples with higher tetramer
populations (see also Figure S21).Clear electron density indicated the presence of
four covalently
bound −S54 fragments within the central cavity
of the tetramer (Figure A; bottom structure) which make intrasubunit and/or intersubunit
contacts with the site 2 pocket of surrounding protein subunits (Figure E; Figure S17C). The protein–ligand interactions with
the strongest electron density, and which are therefore the interactions
which presumably contribute most to the stabilization of the tetramer,
involve π stacking between two Tyr26 residues within the ABED
sheet interface and the bicyclic ring system of a fragment 54 molecule (gray ligand in Figure E)—this arrangement is seen for two of the four
fragments in a tetramer. These protein–ligand interactions
may additionally stabilize the straight conformation of the D strand:
residue 52 lies too far from Tyr26 in the monomeric ΔN6 structure
for fragment 54 to interact with Tyr26, and it thus appears
that this protein–ligand interaction can only form when the
D strand is straight (Figure S17D). Therefore,
while no direct protein–ligand contacts occur with either the
DE or BC loops, contacts with the associated β-strands appear
to have driven conformational changes in these regions, and together
the observed structural changes have altered the ΔN6 self-assembly
landscape in favor of the tetramer. The electron density associated
with the remaining two fragments in the tetramer is not as well-defined
as for those intercalated between Tyr26 residues, but it suggests
a binding mode whereby the fragments can form intrasubunit π–π
interactions with Tyr67 residues (Figure E, pale blue ligand).1H–15N heteronuclear multiple quantum
coherence (HMQC) NMR spectra and 15N-relaxation measurements
acquired for various S52C– and L65C–fragment adducts
were consistent with the tetramer observed within the S52C–S54 crystal lattice, implying that this crystal structure reflects
the nature of these off-pathway ΔN6 tetramers in solution (Figures S18–S21). Comparison of the chemical
shifts and intensities of 1H–15N backbone
amide resonances between samples with higher tetramer populations
(e.g., S52C–S54 and L65C–S54) and those with lower tetramer populations (e.g., ΔN6, L65C−βME)
showed differences across four regions of primary sequence (labeled
1–4 in Figure F and Figure S21). Regions 1–3
contain residues that either make up the tetramer interaction interfaces
and/or have shifted significantly (≥4 Å root-mean-square
deviation, RMSD) in the crystal structure relative to monomeric ΔN6
(Figure D). While
region 4 does not undergo any structural changes itself in the crystal
structure, it is adjacent to the BC loop, and hence residues in region
4 will experience a change in chemical environment upon tetramerization
(Figure D).In addition to the NMR data supporting the relevance of the S52C–S54 crystal structure to oligomerization in solution, the NMR
data also indicate that the tetramers formed by the S52C– and
L65C–fragment adducts are structurally similar. The ability
of S52C– and L65C–fragment adducts to adopt similar
tetrameric species can be rationalized in the context of the crystal
structure. D strand straightening results in residues 52 and 65 now
lying adjacent to one another (Figure S17A, inset), so we anticipate that fragments tethered to the L65C variant
of ΔN6 can access the same binding pockets which −S54 is observed to occupy in the S52C–S54 crystal structure.
Structural Similarities of Off-Pathway ΔN6
Tetramers with
Full-Length β2m Oligomers
While similar
regions of ΔN6 are involved in the formation of the off-pathway
tetramer and the previously identified on-pathway dimers/hexamers
(which were characterized under conditions similar to those employed
here),[37] the PPIs themselves are significantly
different and mutually exclusive (Figure S22A). This observation rationalizes the off-pathway behavior of the
tetramer, as it would need to completely dissociate to form the on-pathway
hexamer. However, although the tetramer is different from previously
identified ΔN6 oligomers,[37,41,51] its interfaces resemble those observed in oligomers of the full-length
β2m protein.[52−55] The most striking similarities are those with a crystallographic
tetramer formed by the P32A variant (PDB 2F8O;[52]Figure S22B,C). P32A β2m is
largely monomeric in solution[52] (in the
absence of divalent copper[56]), but at the
high concentrations within the crystal lattice the protein molecules
can interact via the same D strand and ABED interfaces shown in Figure (Figure S22B). Although the solution relevance of the crystallographic
P32A tetramer was not shown, the NMR data obtained for S52C–S54 confirm that this arrangement of protomers is possible
in solution and stable in the presence of suitable ligands.While the crystallographic P32A β2m tetramer possesses
the same straight D strand as the S52C–S54 ΔN6
tetramer, other β2m oligomers for which high-resolution
structural information is available lack straight D strands and instead
involve DE and BC loop-mediated PPIs.[53−55] Since these oligomers
which lack straight D strands have been shown (or proposed) to be
on-pathway species, we hypothesize that formation of an eight-stranded
ABED–DEBA β-sheet is an off-pathway PPI which can be
driven by ligand binding across the ABED sheet (as seen for ΔN6
S52C–S54) or modulation of the BC loop (as for
β2m P32A).
Conclusions
Covalent
small molecules are becoming increasingly attractive tools
to modulate specific protein–protein interactions,[43,44,57−59] to interrogate
the role of different proteins in biological processes,[44,57,60−62] and to facilitate
structural studies of challenging targets.[42,63] The results presented here expand this covalent chemical tool approach,
highlighting its power to facilitate analysis of the structure and
the role of specific oligomers in self-assembly pathways. We have
demonstrated that covalently reinforced protein–ligand interactions[64,65] can be used to manipulate heterogeneous and dynamic PPIs, exemplified
by those formed in the initial stages of amyloid formation, which
are notoriously difficult to target selectively when purely noncovalent
approaches are used.[66] Specifically, covalent
ligands identified by disulfide tethering allowed a transient off-pathway
tetramer (as shown by kinetic data and models) formed by the amyloidogenic
ΔN6 variant of β2m to be trapped and characterized
in atomic detail for the first time. These results highlight the power
of the chemical modification of proteins as a general strategy to
manipulate complex self-assembling systems and to drive the formation
of defined oligomers for detailed study.
Experimental
Section
Safety
No unexpected, new, and/or significant hazards
or risks were associated with this research.
Disulfide Tethering Screening
Screening cocktails were
prepared by combining five disulfide-linked fragments (1.25 mM each;
all present as symmetrical disulfides) in the presence of βME
(25 mM) in DMSO. These initial cocktail mixtures were incubated at
ambient temperature (∼18 °C) for 12 h to allow the formation
of mixed disulfides with βME (which generally had improved aqueous
solubility over the symmetrical disulfides).Screening cocktails
were diluted further into each ΔN6 cysteine variant (5 μM
protein, with 25 μM each disulfide, 500 μM βME,
2% v/v DMSO) in 25 mM sodium phosphate at either pH 6.2 (main text
data) or pH 8.0 (Supporting Information), and the resulting screening mixtures were incubated at ambient
temperature. The distribution of covalent protein–fragment
adducts over time was analyzed by a Bruker maXis Impact QTOF mass
spectrometer, with an electrospray ionization source. Samples (1 μL
injections) were desalted prior to mass spectrometry by an in-line
Dionex UltiMate 3000 liquid chromatography system (Thermo Scientific),
equipped with an Aeris Widepore C4 column (Phenomenex), running a
gradient between water and acetonitrile, both supplemented with 0.1%
formic acid.MCL-1 (residues 172–327; purified as described
previously[67]) was screened using the same
procedure, but
in 25 mM sodium phosphate, pH 7.4.Mass spectra were deconvoluted
with the use of a maximum entropy
algorithm (DataAnalysis, Bruker). For each protein–fragment
adduct in each cocktail, the intensity of the deconvoluted peak at
24 h was measured relative to the protein−βME adduct
peak. These relative intensities were then converted to RZ scores.[47] The RZ score for a given fragment can be described
by eq :where x is the relative
intensity of the protein–fragment adduct
peak for fragment i and xall is a list of relative peak intensities for all protein–fragment
adducts in that dataset (i.e., for a given cysteine variant at a given
pH). In all cases, peak intensities were measured relative to the
βME adduct. The medium absolute deviation (MAD)[68] is defined asRZ scores for the disulfide
analogue of βME could not be
determined by this method, due to the presence of βME as a reducing
agent in all screening mixtures. Instead, its RZ score was estimated
through competition experiments, as shown in Figure S7.
Experiments were carried out at 48 000 rpm, 25 °C,
with
an An50-Ti rotor in a Beckman Coulter Optima XL-I ultracentrifuge.
Samples and reference buffer (400 μL) were loaded into 12 mm
aluminum centerpieces, with sapphire windows. Samples were thermally
equilibrated for 2–3 h at 0 rpm and were subsequently analyzed
using the interference detection system at a protein concentration
of 150 μM in 25 mM sodium phosphate, pH 6.2. All samples had
been exhaustively dialyzed in this buffer prior to loading, and the
dialysate was used as the reference buffer.Data were analyzed
in SEDFIT (version 12.44)[69] with a continuous c(s) distribution model and with a maximum
entropy regularization confidence interval of 0.95. Values for the
partial specific volume (ν̅) of ΔN6, buffer viscosity
(η), and buffer density (ρ) were all calculated with SEDNTERP[70] (ν̅ = 0.72832 mL/g; η = 0.00898
P and ρ = 0.99954 g/mL for 25 mM sodium phosphate, pH 6.2).
The final fitting was performed with the Levenberg–Marquardt
algorithm, although initial fitting was performed with both the Simplex
and Levenberg–Marquardt algorithms. For c(s) distributions with significant peak broadening or overlap,
further processing was subsequently performed to permit baseline resolution
of these peaks: multiple Gaussian functions were fit to the initial c(s) distribution (using the curve_fit
function from SciPy.optimize), and then the positions of these best
fit Gaussians were used in SEDFIT as Bayesian prior expectations[71] (modeled again as Gaussians, located at the
best fit positions identified by curve_fit, with widths set to 0.2
and amplitudes set to 0.1). Refitting the data with the Levenberg–Marquardt
algorithm led to improved or equally good root-mean-square deviations
(compared to the original c(s) distribution)
for all datasets to which this processing method had been applied,
confirming the validity of this approach.Peaks in the c(s) distributions
were assigned to specific oligomers based on comparison to published
ΔN6 SV-AUC data[37] and predicted sedimentation
coefficients for these oligomers. Predicted sedimentation coefficients
were calculated with the Svedberg equation[48] at a range of possible frictional ratios, so as to estimate the
range of possible sedimentation coefficients which could describe
a particle of a given molecular weight.
Thioflavin T Aggregation
Assays
Protein–fragment
adducts (150 μM) were incubated in 25 mM sodium phosphate, pH
6.2, with 10 μM ThT, in the presence or absence of 15 μM
(monomer equivalent) fibril seeds prepared from ΔN6, in 96-well
untreated half-area plates (Corning; 100 μL sample per well).
In the same plate, 15 μM (monomer equivalent) fibril seeds were
incubated alone in 25 mM sodium phosphate, pH 6.2. Details concerning
the preparation of fibril seeds can be found in the Supporting Information.Fibril elongation was allowed
to occur quiescently, with brief agitation (10 s at 600 rpm) only
occurring prior to each reading (every 12 min). Experiments were performed
for 24 h at 25 °C in a FLUOstar OPTIMA plate reader (BMG Labtech),
exciting ThT at 444 nm and measuring its fluorescence emission intensity
at 480 nm.Data were processed by subtracting two datasets from
each seeded
dataset to remove changes in ThT fluorescence which had occurred independently
of fibril elongation: (a) data obtained from the same protein–fragment
adduct in the absence of fibril seeds; (b) data obtained for fibril
seeds alone in buffer. The absence of fibrillar aggregates in the
unseeded samples after 24 h was confirmed by negative-stain electron
microscopy (see Supporting Information).
Initial elongation rates from the processed seeded datasets were measured
by fitting a straight line to the first 3 h of data and determining
the gradient.
Protein Crystallography and Data Processing
A stock
solution of S52C–S54 was prepared by incubation
of S52C (1.5 mM) and the symmetrical disulfide of fragment 54 (Di-S54; 1.5 mM) in 25 mM sodium phosphate, pH 8.0,
725 μM βME, 20% v/v DMSO, for 40 h at room temperature,
before exhaustive dialysis against 25 mM sodium phosphate, pH 6.2,
at 4 °C, using dialysis tubes with a 3.5 kDa molecular weight
cutoff (Generon). The concentration of the dialyzed sample was determined
through a bicinchoninic acid (BCA) assay.[72] Crystals were grown by mixing 0.2 μL of the protein–fragment
complex (1.1 mM) and 0.1 μL of the crystallization solution
in hanging drop plates at 293 K. The crystallization solution (39
mM bicine, 61 mM Tris, pH 8.5, 7.8% w/v PEG 3350, 7.8% w/v PEG 1000,
7.8% v/v 2-methyl-2,4-pentanediol, and 6.7 mM each of 1,6-hexanediol,
1-butanol, 1,2-propanediol, 2-propanol, 1,4-butanediol, and 1,3-propanediol)
was prepared from Morpheus Buffer System 3 at pH 8.5, Morpheus Precipitant
Mix 4, and Morpheus Alcohols Mix (all from Molecular Dimensions).
After 2 weeks, crystals were fished and flash-frozen in liquid nitrogen.
The diffraction data were collected at 0.9795 Å, with 0.025 s
exposition and 0.15° oscillation, for a total of 2400 images
on beamline I24 at Diamond Light Source (U.K.). The data were processed
using the xia2[73] bundle, with DIALS[74] for integration and using Pointless/Aimless[75,76] for scaling and merging. The data were processed using CC1/2 and
completeness as cutoff criteria.[77] The
structure of S52C–S54 was solved by molecular
replacement, using full-length β2m (PDB 5CS7(78)) with the first six amino acids removed as the search model
in PHASER.[79] COOT[80] and REFMAC5[81] were used for refinement.
Parameterization of Cys−S54 (as a non-natural
amino acid) was carried out in XPLOR-NIH,[82] followed by refinement of its structure using the fixupCovalentGeometry
function while satisfying the electron density. The quality of the
final structure was assessed with MolProbity.[83] Data collection and refinement statistics are shown in Table S1. Figures were prepared using PyMOL (version
2.4, Schrödinger).
Protein NMR
All spectra were acquired
by using a 750
MHz Bruker Avance III HD spectrometer equipped with a TCI cryoprobe,
at a protein concentration of 150 μM in 25 mM sodium phosphate,
pH 6.2, at 25 °C. 1H–15N SOFAST
HMQC spectra were processed with the use of NMRPipe,[84] and calculation of peak intensities was performed in CcpNmr
Analysis (version 2.4).[85]1H–15N peaks were assigned to specific backbone resonances for
each protein–fragment adduct (L65C−βME, L65C–S54, S52C−βME, and S52C–S54) by comparison to existing ΔN6 assignments,[36] only considering peaks with intensities at least 3-fold
greater than the spectral noise. Backbone chemical shift perturbations
(CSPs) between samples were calculated with eq :where ΔδHN and ΔδN represent the difference
in peak position in the direct and
indirect dimensions, respectively.L65C−βME and
L65C–S54T2 experiments
were performed by acquiring a series of sensitivity-enhanced 1H–15N heteronuclear single quantum coherence
(HSQC) spectra in an interleaved fashion at a range of delay times
(0.017–0.136 ms for L65C−βME, 0.017–0.119
ms for L65C–S54). Peak intensities (I) were extracted from each spectrum with the series.tab NMRPipe module,
and per-residue 15N relaxation rates (R2) were calculated by fitting the peak intensity at different
delay times (t) to a two-parameter exponential function
(eq ).For S52C−βME and S52C–S54, sensitivity
limitations did not permit the determination of 15N R2 rates on a per-residue basis. Instead, the
first increment of the standard T2 experiment
at each delay time (0.017–0.102 ms for S52C−βME,
0.017–0.085 ms S52C–S54) was used to determine
the R2 rate of the whole amide region
(7.6–9.2 ppm). Mean 15N R2 rates for the whole amide region were calculated in the same way
for ΔN6 (0.017–0.085 ms) and the two L65C adducts (L65C−βME
and L65C–S54, using the same datasets acquired
for determining per-residue 15N R2 rates).
Authors: E Rennella; T Cutuil; P Schanda; I Ayala; F Gabel; V Forge; A Corazza; G Esposito; B Brutscher Journal: J Mol Biol Date: 2013-05-03 Impact factor: 5.469
Authors: Michael C Owen; David Gnutt; Mimi Gao; Sebastian K T S Wärmländer; Jüri Jarvet; Astrid Gräslund; Roland Winter; Simon Ebbinghaus; Birgit Strodel Journal: Chem Soc Rev Date: 2019-07-15 Impact factor: 54.564
Authors: Jack D Sadowsky; Mark A Burlingame; Dennis W Wolan; Christopher L McClendon; Matthew P Jacobson; James A Wells Journal: Proc Natl Acad Sci U S A Date: 2011-03-23 Impact factor: 11.205
Authors: Keriann M Backus; Bruno E Correia; Kenneth M Lum; Stefano Forli; Benjamin D Horning; Gonzalo E González-Páez; Sandip Chatterjee; Bryan R Lanning; John R Teijaro; Arthur J Olson; Dennis W Wolan; Benjamin F Cravatt Journal: Nature Date: 2016-06-15 Impact factor: 49.962
Authors: Sabine M Ulamec; Roberto Maya-Martinez; Emily J Byrd; Katherine M Dewison; Yong Xu; Leon F Willis; Frank Sobott; George R Heath; Patricija van Oosten Hawle; Vladimir L Buchman; Sheena E Radford; David J Brockwell Journal: Nat Commun Date: 2022-08-25 Impact factor: 17.694