Literature DB >> 33253560

Modulation of Amyloidogenic Protein Self-Assembly Using Tethered Small Molecules.

Emma E Cawood^1,2,3, Nicolas Guthertz^1,3, Jessica S Ebo^1,3, Theodoros K Karamanos^1,3,4, Sheena E Radford^1,3, Andrew J Wilson^1,2.

Abstract

Protein-protein interactions (PPIs) are involved in many of life's essential biological functions yet are also an underlying cause of several human diseases, including amyloidosis. The modulation of PPIs presents opportunities to gain mechanistic insights into amyloid assembly, particularly through the use of methods which can trap specific intermediates for detailed study. Such information can also provide a starting point for drug discovery. Here, we demonstrate that covalently tethered small molecule fragments can be used to stabilize specific oligomers during amyloid fibril formation, facilitating the structural characterization of these assembly intermediates. We exemplify the power of covalent tethering using the naturally occurring truncated variant (ΔN6) of the human protein β2-microglobulin (β2m), which assembles into amyloid fibrils associated with dialysis-related amyloidosis. Using this approach, we have trapped tetramers formed by ΔN6 under conditions which would normally lead to fibril formation and found that the degree of tetramer stabilization depends on the site of the covalent tether and the nature of the protein-fragment interaction. The covalent protein-ligand linkage enabled structural characterization of these trapped, off-pathway oligomers using X-ray crystallography and NMR, providing insight into why tetramer stabilization inhibits amyloid assembly. Our findings highlight the power of "post-translational chemical modification" as a tool to study biological molecular mechanisms.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2020 PMID： 33253560 PMCID： PMC7729939 DOI： 10.1021/jacs.0c10629

Source DB: PubMed Journal: J Am Chem Soc ISSN： 0002-7863 Impact factor: 15.419

Introduction

The regulated self-assembly of proteins into ordered complexes drives many biological processes, ranging from viral capsid formation and actin polymerization, to DNA maintenance and repair.[1,2] However, self-assembly can also occur aberrantly as a result of changes in the concentration,[3] environment,[4−6] primary sequence,[7,8] or post-translational processing[9−11] of a protein. Aberrant assembly events are associated with a range of disorders, and they can involve polymerization of natively folded protein molecules,[2] as in sickle cell anemia,[12] or aggregation which is accompanied by significant conformational changes, as exemplified by amyloid diseases.[13,14] Understanding the molecular basis and consequences of such protein–protein interaction (PPI) pathways and identifying methods for their modulation[15−20] therefore have implications for the treatment of disease,[21] as well as in the development of new biomaterials, where protein self-assembly can be exploited to yield structures with defined architectures or novel biomechanical properties.[22,23] However, manipulating and defining the mechanisms of self-assembly is challenging, due to the transient nature and heterogeneity (in mass and structure) of oligomeric intermediates.[24,25] The use of methods to trap specific oligomeric complexes can help overcome these challenges and offers the opportunity to structurally characterize otherwise transient intermediates, identify targets for drug discovery, and develop new scaffolds for protein-based nanostructures. Here, we describe the use of disulfide tethering[26] to rapidly explore chemical space and identify “post-translational chemical modifications”[27,28] that stabilize specific PPIs associated with amyloid assembly. These site-specific, covalent modifications may act in one of two ways: altering the surface properties of the protein and/or covalently reinforcing a noncovalent protein–ligand interaction. Using a naturally occurring, amyloidogenic variant of β2-microglobulin (β2m)—the ΔN6 variant—as a model system, we show that covalently tethered fragments represent highly effective tools for tuning oligomer populations and stabilizing particular species in self-assembly pathways. By combining kinetic analysis with structural information obtained from NMR and X-ray crystallography, the covalent modifications identified here have led to structural and functional insights into the role of tetramers in ΔN6 amyloid formation. Native monomeric β2m has a seven-stranded immunoglobulin fold and forms the noncovalently bound light chain of the major histocompatibility class I complex.[29] Aberrant self-assembly of β2m molecules into amyloid fibrils[30] is a hallmark of dialysis-related amyloidosis (DRA).[31−33] The amyloidogenic variant of β2m which is the focus of this paper—ΔN6—is formed from the wild-type protein by proteolysis of its N-terminal six amino acids and makes up ∼20–30% of β2m molecules found in fibrils extracted from DRA patients.[34,35] ΔN6 is capable of rapid assembly into amyloid fibrils in vitro at near-neutral pH, through the association of the dynamically structured monomers[36] into dimers and hexamers that retain a nativelike fold[37] (Figure A). Subsequent conformational rearrangement of these oligomers into cross-β structures leads to amyloid fibril formation and elongation.[38]

Figure 1

(A) Native monomeric ΔN6 (PDB 2XKU(36)) contains seven antiparallel β-strands (labeled A–G), with a solvent-excluded disulfide bond between strands B and F (yellow). The monomeric protein is capable of associating into transient, nativelike dimers and hexamers en route to amyloid fibrils (structures not drawn to scale).[37] The conversion of these nativelike oligomers into fibrils requires structural rearrangement of the existing β-strands[38] and further self-assembly. ΔN6 tetramers have been observed, but their structure and role in fibril formation depend on the solution conditions.[37,86] (B) The two most ligandable sites (pink) of ΔN6 identified by computational solvent mapping are located adjacent to the BC (target site 1) and DE (target site 2) loops. Three residues (S33, S52, and L65; purple) at these two sites were substituted with cysteine in order to target each pocket using the disulfide tethering strategy (see Figure A). The orientation of ΔN6 in the “Target site 1” panel is the same as for the monomer shown in (A).

Figure 2

(A) Schematic representation of the disulfide tethering method used to compare noncovalent affinities of different fragments for specific target sites on ΔN6. When using cocktails of fragments, the compound which forms the most favorable noncovalent interactions with the target protein while tethered by a disulfide bond will produce the most stable (i.e., highest populated) covalent protein–fragment adduct at equilibrium—this is reflected by a higher RZ score. (B) Data from a disulfide tethering screen against three ΔN6 cysteine variants (S33C, S52C, and L65C) normalized as RZ scores. For fragments which were present in more than one screening cocktail, data are shown as the mean ± one standard deviation. Each cysteine variant was present at 5 μM and was incubated for 24 h with 25 μM of each disulfide-linked fragment (in cocktails of five fragments) and 500 μM βME, in 25 mM sodium phosphate, pH 6.2, 2% v/v DMSO. Black circles are shown for fragments which were synthesized but not included in the screening library due to poor purity.

Recent structural models of ΔN6 dimers and hexamers have shown that the DE and BC loops of the protein are involved in both oligomerization interfaces (Figure A).[37] The same regions have also been identified at the interface of amyloid-competent ΔN6−β2m heterodimers,[39] β2m−β2m homodimers,[40] and inhibitory heterodimers formed between ΔN6 and a nonamyloidogenic β2m ortholog (murine β2m).[39] These examples implicate the DE and BC loops, and thereby also the associated β-strands, as mediators of oligomerization, and thus targeting these regions (e.g., by small molecules) was anticipated to yield tools for controlling and studying ΔN6 self-assembly. The dynamic nature of ΔN6 monomers[36] and oligomers,[37] combined with the lack of an obvious ligand binding pocket, has thus far hindered the development of small molecule modulators of amyloid formation (although protein-based modulators have been identified[41]). In light of this challenge, we focused our efforts on the development of covalent ligands to manipulate ΔN6 self-assembly, based on the success of such compounds in targeting other challenging PPIs.[42−44]

Results and Discussion

Identifying Covalent Ligands for ΔN6 by Disulfide Tethering

Computational solvent mapping (using the FTMap server[45]) was performed against an NMR-derived conformational ensemble of ΔN6[36] in order to predict which regions of the protein are likely to be hot spots for small molecule binding. The resulting data (Figure S1) highlighted two pockets adjacent to the DE and BC loops as promising targets for ligand development (site 1, between the C and D strands; and site 2, beneath the DE loop; Figure B). To identify possible covalent ligands for these sites, a library of small molecules was screened using the “disulfide tethering” method[26] (Figure A). First developed by Erlanson and co-workers as a site-directed screening strategy,[46] this approach uses disulfide bonds to covalently trap and assess the interaction affinity of small molecules (typically fragments) which have bound noncovalently to the protein of interest. Libraries of disulfide-functionalized molecules are screened for their ability to form disulfide bonds with cysteine variants of the target protein, as small molecules which exhibit favorable noncovalent interactions near the free, solvent-exposed cysteine residue will undergo thiol–disulfide exchange more effectively due to their increased local concentration. The relative population of different covalent protein−ligand complexes at equilibrium can therefore be used as a proxy for the noncovalent affinity of a given fragment for a particular region of the target protein. (A) Schematic representation of the disulfide tethering method used to compare noncovalent affinities of different fragments for specific target sites on ΔN6. When using cocktails of fragments, the compound which forms the most favorable noncovalent interactions with the target protein while tethered by a disulfide bond will produce the most stable (i.e., highest populated) covalent protein–fragment adduct at equilibrium—this is reflected by a higher RZ score. (B) Data from a disulfide tethering screen against three ΔN6 cysteine variants (S33C, S52C, and L65C) normalized as RZ scores. For fragments which were present in more than one screening cocktail, data are shown as the mean ± one standard deviation. Each cysteine variant was present at 5 μM and was incubated for 24 h with 25 μM of each disulfide-linked fragment (in cocktails of five fragments) and 500 μM βME, in 25 mM sodium phosphate, pH 6.2, 2% v/v DMSO. Black circles are shown for fragments which were synthesized but not included in the screening library due to poor purity. Three single cysteine variants of ΔN6 (S33C, S52C, and L65C; Figure B) were expressed and purified (Figure S2) in order to monitor small molecule binding at sites 1 and 2 by using the disulfide tethering approach. A library of 84 symmetrical disulfides (designed with the aid of molecular docking, as described in the Supporting Information) was prepared through the use of solid-phase synthesis, and 76 of these fragments were sufficiently pure for screening purposes (Figure S3) and were screened against each ΔN6 cysteine variant in cocktails of five, in the presence of excess reducing agent (β-mercaptoethanol, βME; Figure S4). The relative populations of the different protein–fragment adducts were assessed at 24 h by electrospray ionization mass spectrometry and normalized as robust Z (RZ) scores[47] (Figure B; Figures S5–S7), where higher RZ scores were anticipated to reflect more favorable protein–fragment interactions (Figure A). Comparison of the distribution of protein–fragment adducts observed for the three ΔN6 cysteine variants to that of an unrelated, largely helical control protein, MCL-1, showed that there was poor correlation between the datasets for S52C– and L65C−ΔN6 with that for MCL-1 (median r = 0.37; Figure S6) and therefore that the identity of the protein affects which protein–fragment adducts dominate at equilibrium. The higher correlation between the datasets for S33C−ΔN6 and MCL-1 reflects the similar chemical nature of their binding pockets (Figure S6). These observations suggest that the preference of ΔN6 for particular fragments (as shown in Figure B) is a result of specific noncovalent interactions. RZ scores were therefore used to report on the relative noncovalent affinities of tethered fragments for a particular region on the surface of ΔN6.

Covalent Functionalization and Ligand Binding Drive ΔN6 Tetramerization

To interrogate how covalent modification of the different regions of ΔN6 affects aggregation, sedimentation velocity analytical ultracentrifugation (SV-AUC) was used to assess the oligomeric state of a series of individual protein–fragment adducts (Figure A; Figures S8–S10). Fragments with a range of RZ scores were selected for testing with each cysteine variant, so as to distinguish between changes in oligomeric state which are due to covalent modification of the protein (i.e., changes observed for all samples of a given cysteine variant) versus changes which arise from specific noncovalent protein–ligand interactions (i.e., those observed only for covalently tethered fragments with high RZ scores).

Figure 3

(A) Examples of fragments used to form the S52C– and L65C–fragment adducts studied by SV-AUC. (B–E) SV-AUC data collected for L65C–fragment adducts (B, C) or S52C–fragment adducts (D, E). Experiments were performed with 150 μM protein in 25 mM sodium phosphate at pH 6.2, 25 °C. Peaks in the c(s) distributions were assigned to monomer (m), dimer (d), tetramer (t), or hexamer (h), based on previous studies[37] and predicted sedimentation coefficients for these oligomers (calculated using the Svedberg equation[48]). Tetramer peak areas for a range of L65C–fragment adducts were found to correlate with the RZ score of the attached fragment (C). The relationship between fragment RZ score and tetramer peak area for S52C–fragment adducts was less clear, and other properties of the fragments may play a role in modulating tetramer populations (E) (see Figure S14). In the absence of covalent modification, ΔN6 is approximately 50% monomeric under the conditions employed (150 μM protein, pH 6.2, 25 °C), with dimers, tetramers, and hexamers representing the majority of faster sedimenting species (Figure S2), consistent with previous reports.[37] Tethering of high RZ score fragments to all three ΔN6 cysteine variants was found to increase the population of tetramers (representative data in Figure B,D; see also Figures S11–S13). Tethering with low RZ score fragments, however, produced a variety of results, which depended both on the cysteine variant and the ligand employed. For L65C–fragment adducts, the area of the tetramer peak in the continuous sedimentation coefficient (c(s)) distributions showed a positive correlation with the RZ score of the tethered fragment (Figure C; Figures S13, S14): a covalently attached fragment with a low RZ score (βME) produced an oligomer distribution which was similar (albeit not identical) to that of ΔN6 alone (tetramer peak areas of 5 and 18%, respectively), while tethered fragments with high RZ scores produced significantly larger tetramer peaks (e.g., a 45% tetramer peak area was observed for the adduct between L65C and disulfide 54, named L65C–S54; Figure B,C). The only predicted ligandable pocket that is near residue 65 of ΔN6 is target site 2 (Figure B), suggesting that the formation of tetramers by L65C–fragment adducts is driven by noncovalent binding to this pocket. The nature of the relationship between tetramer population and fragment RZ score for the S52C– and S33C–fragment adducts was less clear. All S52C–fragment adducts produced tetramer peak areas of ≥43% (with most between 86 and 95%), regardless of RZ score (Figure D,E; Figures S12, S14), implying that a different property of the fragments was driving the changes in oligomeric state at this site. Although a limited range of fragment sizes were used in screening, the observed tetramer populations for the different S52C–fragment adducts are consistent with fragment size (and thus the surface topography of protein–fragment adducts) being a contributing factor (Figure S14). The S33C–fragment adducts analyzed were generally polydisperse, with only two samples (of seven analyzed) producing c(s) distributions which were readily interpretable (Figure S11): S33C−βME and S33C–S79, the latter of which showed an increased tetramer peak area (43%) over the former (16%). These data are insufficient to ascertain which characteristic (characteristics) of the covalent fragments influences (influence) the tetramer population of S33C–fragment adducts, but they show that, as for L65C and S52C, the addition of specific covalent fragments to S33C can be used to modulate oligomer distributions.

Tetramer Stabilization Inhibits Amyloid Assembly

The ability of covalently tethered fragments to generate defined oligomer populations offered the opportunity to explore the effect of tetramer stabilization on ΔN6 amyloid assembly. Protein–fragment adducts with a range of tetramer populations were added to preformed ΔN6 amyloid fibrils, and the ability of these samples to elongate the fibril seeds was analyzed using the fluorescent, fibril-binding dye, thioflavin T (ThT).[49,50] By using this strategy, fibril elongation by each sample will be templated by seeding to a common fibril product, irrespective of the covalent ligand, enabling direct comparison of the initial rates of fibril growth for different protein–fragment adducts. It should be noted that, under the conditions employed, spontaneous (i.e., unseeded) fibril formation does not occur on the time scale of the experiment. For all three ΔN6 cysteine variants, the observed initial elongation rates were lower for samples with higher tetramer populations (Figure ; Figure S15A,B). The most dramatic change in fibril elongation was seen for S52C–S54 (86% tetramer peak area in the c(s) distribution), where the rate of elongation was reduced more than 30-fold relative to ΔN6 (Figure E,F). Global linear regression analysis across all samples showed that there is a negative correlation (r = −0.78) between the observed initial fibril elongation rate and tetramer population, and extrapolation of the linear regression line to 100% tetramer corresponds precisely to an elongation rate of zero (Figure S15B). Together, these observations indicate that the tetramers formed by the S33C–, S52C–, and L65C–fragment adducts lie off-pathway to amyloid fibril formation. Prediction of elongation rates using kinetic schemes in which tetramers lie on- or off-pathway to fibrils also supports this conclusion (Figure S16). The fibril elongation data thus highlight tetramer stabilization as a strategy to inhibit ΔN6 amyloid formation. In addition, they support a model in which covalent functionalization and covalent reinforcement of ligand binding around the DE and BC loops can be used to slow the progression of seeded amyloid formation by modulating the stability and population distribution of oligomers.

Figure 4

Change in ThT fluorescence over time for various protein–fragment adducts (150 μM) in the presence of ΔN6 fibril seeds (15 μM monomer equivalents) shows that the ability of samples to elongate fibrils depends on the tetramer population. ThT fluorescence curves in (A), (C), and (E) are shown as the median curve, with the highest and lowest values shaded in gray (n = 3). The relationship between the initial elongation rates calculated from these curves and the tetramer peak areas from the corresponding c(s) distributions are shown in (B), (D), and (F). Error bars (standard deviation) are shown for all data points (mean values) in (B), (D), and (F)—those error bars that are not visible are smaller than the displayed data point. Additional ThT fluorescence curves associated with (B) which were not shown in (A) (i.e., the white circles in (B)) are shown in Figure S15A. All experiments were performed under quiescent conditions in 25 mM sodium phosphate at pH 6.2, 25 °C. Elongation rates for unliganded ΔN6 are shown by black crosses in (B), (D), and (F).

Structural Characterization of Stabilized Tetramers

To understand how the tetramers generated by the covalent modification of ΔN6 are structurally related to the previously characterized ΔN6 dimers and hexamers,[37] as well as how and why covalent modification around the DE and BC loops leads to tetramerization, X-ray crystallography and solution-state NMR structural studies were performed. S52C–S54 was found to crystallize as a ring-shaped tetramer with a solvent-accessible central cavity, formed from two asymmetric units each containing two ΔN6 molecules in an antiparallel orientation (Figure A–C; Table S1). As seen for the on-pathway ΔN6 dimer and hexamer,[37] the protein subunits in the crystallized tetramer are highly nativelike, but notably with perturbations to the DE and BC loops as well as a shift of the D strand from a β-bulge to a straight β-strand (Figure D; Figure S17A). These structural changes appear to be linked to each other and to tetramerization: straightening of the D strand allows for protein–protein interactions to occur via β-sheet augmentation, forming one of the two interaction interfaces in the tetramer (Figure B), and additionally results in a rearrangement of phenylalanine residues at the top of the D, E, and B strands (Phe56, Phe62, and Phe30, respectively) which is accommodated by rearrangement of the BC loop (Figure S17B). This movement of residues around the BC and DE loops allows several key protein–protein contacts to be made within the second interaction interface, which occurs through a face-to-face, antiparallel interaction of the ABED β-sheets (Figure C; Figure S17C).

Figure 5

(A–C) Crystal structure of the S52C–S54 tetramer (diffracted to 2.4 Å; PDB 7AFV), formed from two asymmetric units: subunits 1a/1b (gray/pale blue) and subunits 2a/2b (pale green/dark green). Protein subunits interact via two interfaces: the D strand interface (B) and the ABED sheet interface (C). Four copies of the covalent −S54 fragment (shown as spheres in A and C) are present in this complex and lie within the ABED sheet interface, in a central solvent-accessible cavity (C). (D) Per-residue, pairwise RMSD values for the S52C–S54 tetramer crystal structure (subunit 1a/2a) compared with the monomeric ΔN6 NMR ensemble (30 structures; PDB 2XKU(36)), reported for all non-hydrogen atoms as the mean RMSD (±standard deviation). Residues which are ≤6 Å from another protein subunit or a −S54 fragment within the S52C–S54 tetramer structure are additionally highlighted by green or purple bars, respectively. The locations of the β-strands in the S52C–S54 tetramer crystal structure are shown above the plot. (E) Each −S54 fragment can bind in one of two binding sites around the site 2 pocket: by π stacking between two Tyr26 residues within the ABED sheet interface (top) or by π stacking against Tyr67 (bottom). The 2Fo–Fc electron density map (contoured at 1.1σ) is shown in cyan for the displayed amino acid side chains and organic molecules. (F) Combined 1H–15N chemical shift differences between S52C–S54 and L65C−βME HMQC NMR spectra—these samples had tetramer peak areas of 86 and 5%, respectively, in their c(s) distributions. Residues which were not visible (or could not be confidently assigned based on comparison with previous ΔN6 assignments[36]) for either sample are shown as black circles. Residues which were visible for L65C−βME but were broadened beyond detection for S52C–S54 are shown by purple bars. Resonances which were visible in both spectra are colored according to the magnitude of the chemical shift perturbation (CSP) relative to the standard deviation of the dataset (σ): CSP ≥ 2σ, red; σ ≤ CSP < 2σ, yellow; CSP < σ, gray. Four main regions (labeled 1–4) show either significant changes in the position of 1H–15N resonances or complete loss of these resonances in samples with higher tetramer populations (see also Figure S21). Clear electron density indicated the presence of four covalently bound −S54 fragments within the central cavity of the tetramer (Figure A; bottom structure) which make intrasubunit and/or intersubunit contacts with the site 2 pocket of surrounding protein subunits (Figure E; Figure S17C). The protein–ligand interactions with the strongest electron density, and which are therefore the interactions which presumably contribute most to the stabilization of the tetramer, involve π stacking between two Tyr26 residues within the ABED sheet interface and the bicyclic ring system of a fragment 54 molecule (gray ligand in Figure E)—this arrangement is seen for two of the four fragments in a tetramer. These protein–ligand interactions may additionally stabilize the straight conformation of the D strand: residue 52 lies too far from Tyr26 in the monomeric ΔN6 structure for fragment 54 to interact with Tyr26, and it thus appears that this protein–ligand interaction can only form when the D strand is straight (Figure S17D). Therefore, while no direct protein–ligand contacts occur with either the DE or BC loops, contacts with the associated β-strands appear to have driven conformational changes in these regions, and together the observed structural changes have altered the ΔN6 self-assembly landscape in favor of the tetramer. The electron density associated with the remaining two fragments in the tetramer is not as well-defined as for those intercalated between Tyr26 residues, but it suggests a binding mode whereby the fragments can form intrasubunit π–π interactions with Tyr67 residues (Figure E, pale blue ligand). 1H–15N heteronuclear multiple quantum coherence (HMQC) NMR spectra and 15N-relaxation measurements acquired for various S52C– and L65C–fragment adducts were consistent with the tetramer observed within the S52C–S54 crystal lattice, implying that this crystal structure reflects the nature of these off-pathway ΔN6 tetramers in solution (Figures S18–S21). Comparison of the chemical shifts and intensities of 1H–15N backbone amide resonances between samples with higher tetramer populations (e.g., S52C–S54 and L65C–S54) and those with lower tetramer populations (e.g., ΔN6, L65C−βME) showed differences across four regions of primary sequence (labeled 1–4 in Figure F and Figure S21). Regions 1–3 contain residues that either make up the tetramer interaction interfaces and/or have shifted significantly (≥4 Å root-mean-square deviation, RMSD) in the crystal structure relative to monomeric ΔN6 (Figure D). While region 4 does not undergo any structural changes itself in the crystal structure, it is adjacent to the BC loop, and hence residues in region 4 will experience a change in chemical environment upon tetramerization (Figure D). In addition to the NMR data supporting the relevance of the S52C–S54 crystal structure to oligomerization in solution, the NMR data also indicate that the tetramers formed by the S52C– and L65C–fragment adducts are structurally similar. The ability of S52C– and L65C–fragment adducts to adopt similar tetrameric species can be rationalized in the context of the crystal structure. D strand straightening results in residues 52 and 65 now lying adjacent to one another (Figure S17A, inset), so we anticipate that fragments tethered to the L65C variant of ΔN6 can access the same binding pockets which −S54 is observed to occupy in the S52C–S54 crystal structure.

Structural Similarities of Off-Pathway ΔN6 Tetramers with Full-Length β2m Oligomers

While similar regions of ΔN6 are involved in the formation of the off-pathway tetramer and the previously identified on-pathway dimers/hexamers (which were characterized under conditions similar to those employed here),[37] the PPIs themselves are significantly different and mutually exclusive (Figure S22A). This observation rationalizes the off-pathway behavior of the tetramer, as it would need to completely dissociate to form the on-pathway hexamer. However, although the tetramer is different from previously identified ΔN6 oligomers,[37,41,51] its interfaces resemble those observed in oligomers of the full-length β2m protein.[52−55] The most striking similarities are those with a crystallographic tetramer formed by the P32A variant (PDB 2F8O;[52]Figure S22B,C). P32A β2m is largely monomeric in solution[52] (in the absence of divalent copper[56]), but at the high concentrations within the crystal lattice the protein molecules can interact via the same D strand and ABED interfaces shown in Figure (Figure S22B). Although the solution relevance of the crystallographic P32A tetramer was not shown, the NMR data obtained for S52C–S54 confirm that this arrangement of protomers is possible in solution and stable in the presence of suitable ligands. While the crystallographic P32A β2m tetramer possesses the same straight D strand as the S52C–S54 ΔN6 tetramer, other β2m oligomers for which high-resolution structural information is available lack straight D strands and instead involve DE and BC loop-mediated PPIs.[53−55] Since these oligomers which lack straight D strands have been shown (or proposed) to be on-pathway species, we hypothesize that formation of an eight-stranded ABED–DEBA β-sheet is an off-pathway PPI which can be driven by ligand binding across the ABED sheet (as seen for ΔN6 S52C–S54) or modulation of the BC loop (as for β2m P32A).

Conclusions

Covalent small molecules are becoming increasingly attractive tools to modulate specific protein–protein interactions,[43,44,57−59] to interrogate the role of different proteins in biological processes,[44,57,60−62] and to facilitate structural studies of challenging targets.[42,63] The results presented here expand this covalent chemical tool approach, highlighting its power to facilitate analysis of the structure and the role of specific oligomers in self-assembly pathways. We have demonstrated that covalently reinforced protein–ligand interactions[64,65] can be used to manipulate heterogeneous and dynamic PPIs, exemplified by those formed in the initial stages of amyloid formation, which are notoriously difficult to target selectively when purely noncovalent approaches are used.[66] Specifically, covalent ligands identified by disulfide tethering allowed a transient off-pathway tetramer (as shown by kinetic data and models) formed by the amyloidogenic ΔN6 variant of β2m to be trapped and characterized in atomic detail for the first time. These results highlight the power of the chemical modification of proteins as a general strategy to manipulate complex self-assembling systems and to drive the formation of defined oligomers for detailed study.

Experimental Section

Safety

No unexpected, new, and/or significant hazards or risks were associated with this research.

Disulfide Tethering Screening

Screening cocktails were prepared by combining five disulfide-linked fragments (1.25 mM each; all present as symmetrical disulfides) in the presence of βME (25 mM) in DMSO. These initial cocktail mixtures were incubated at ambient temperature (∼18 °C) for 12 h to allow the formation of mixed disulfides with βME (which generally had improved aqueous solubility over the symmetrical disulfides). Screening cocktails were diluted further into each ΔN6 cysteine variant (5 μM protein, with 25 μM each disulfide, 500 μM βME, 2% v/v DMSO) in 25 mM sodium phosphate at either pH 6.2 (main text data) or pH 8.0 (Supporting Information), and the resulting screening mixtures were incubated at ambient temperature. The distribution of covalent protein–fragment adducts over time was analyzed by a Bruker maXis Impact QTOF mass spectrometer, with an electrospray ionization source. Samples (1 μL injections) were desalted prior to mass spectrometry by an in-line Dionex UltiMate 3000 liquid chromatography system (Thermo Scientific), equipped with an Aeris Widepore C4 column (Phenomenex), running a gradient between water and acetonitrile, both supplemented with 0.1% formic acid. MCL-1 (residues 172–327; purified as described previously[67]) was screened using the same procedure, but in 25 mM sodium phosphate, pH 7.4. Mass spectra were deconvoluted with the use of a maximum entropy algorithm (DataAnalysis, Bruker). For each protein–fragment adduct in each cocktail, the intensity of the deconvoluted peak at 24 h was measured relative to the protein−βME adduct peak. These relative intensities were then converted to RZ scores.[47] The RZ score for a given fragment can be described by eq :where x is the relative intensity of the protein–fragment adduct peak for fragment i and xall is a list of relative peak intensities for all protein–fragment adducts in that dataset (i.e., for a given cysteine variant at a given pH). In all cases, peak intensities were measured relative to the βME adduct. The medium absolute deviation (MAD)[68] is defined as RZ scores for the disulfide analogue of βME could not be determined by this method, due to the presence of βME as a reducing agent in all screening mixtures. Instead, its RZ score was estimated through competition experiments, as shown in Figure S7.

Sedimentation Velocity Analytical Ultracentrifugation

Experiments were carried out at 48 000 rpm, 25 °C, with an An50-Ti rotor in a Beckman Coulter Optima XL-I ultracentrifuge. Samples and reference buffer (400 μL) were loaded into 12 mm aluminum centerpieces, with sapphire windows. Samples were thermally equilibrated for 2–3 h at 0 rpm and were subsequently analyzed using the interference detection system at a protein concentration of 150 μM in 25 mM sodium phosphate, pH 6.2. All samples had been exhaustively dialyzed in this buffer prior to loading, and the dialysate was used as the reference buffer. Data were analyzed in SEDFIT (version 12.44)[69] with a continuous c(s) distribution model and with a maximum entropy regularization confidence interval of 0.95. Values for the partial specific volume (ν̅) of ΔN6, buffer viscosity (η), and buffer density (ρ) were all calculated with SEDNTERP[70] (ν̅ = 0.72832 mL/g; η = 0.00898 P and ρ = 0.99954 g/mL for 25 mM sodium phosphate, pH 6.2). The final fitting was performed with the Levenberg–Marquardt algorithm, although initial fitting was performed with both the Simplex and Levenberg–Marquardt algorithms. For c(s) distributions with significant peak broadening or overlap, further processing was subsequently performed to permit baseline resolution of these peaks: multiple Gaussian functions were fit to the initial c(s) distribution (using the curve_fit function from SciPy.optimize), and then the positions of these best fit Gaussians were used in SEDFIT as Bayesian prior expectations[71] (modeled again as Gaussians, located at the best fit positions identified by curve_fit, with widths set to 0.2 and amplitudes set to 0.1). Refitting the data with the Levenberg–Marquardt algorithm led to improved or equally good root-mean-square deviations (compared to the original c(s) distribution) for all datasets to which this processing method had been applied, confirming the validity of this approach. Peaks in the c(s) distributions were assigned to specific oligomers based on comparison to published ΔN6 SV-AUC data[37] and predicted sedimentation coefficients for these oligomers. Predicted sedimentation coefficients were calculated with the Svedberg equation[48] at a range of possible frictional ratios, so as to estimate the range of possible sedimentation coefficients which could describe a particle of a given molecular weight.

Thioflavin T Aggregation Assays

Protein–fragment adducts (150 μM) were incubated in 25 mM sodium phosphate, pH 6.2, with 10 μM ThT, in the presence or absence of 15 μM (monomer equivalent) fibril seeds prepared from ΔN6, in 96-well untreated half-area plates (Corning; 100 μL sample per well). In the same plate, 15 μM (monomer equivalent) fibril seeds were incubated alone in 25 mM sodium phosphate, pH 6.2. Details concerning the preparation of fibril seeds can be found in the Supporting Information. Fibril elongation was allowed to occur quiescently, with brief agitation (10 s at 600 rpm) only occurring prior to each reading (every 12 min). Experiments were performed for 24 h at 25 °C in a FLUOstar OPTIMA plate reader (BMG Labtech), exciting ThT at 444 nm and measuring its fluorescence emission intensity at 480 nm. Data were processed by subtracting two datasets from each seeded dataset to remove changes in ThT fluorescence which had occurred independently of fibril elongation: (a) data obtained from the same protein–fragment adduct in the absence of fibril seeds; (b) data obtained for fibril seeds alone in buffer. The absence of fibrillar aggregates in the unseeded samples after 24 h was confirmed by negative-stain electron microscopy (see Supporting Information). Initial elongation rates from the processed seeded datasets were measured by fitting a straight line to the first 3 h of data and determining the gradient.

Protein Crystallography and Data Processing

A stock solution of S52C–S54 was prepared by incubation of S52C (1.5 mM) and the symmetrical disulfide of fragment 54 (Di-S54; 1.5 mM) in 25 mM sodium phosphate, pH 8.0, 725 μM βME, 20% v/v DMSO, for 40 h at room temperature, before exhaustive dialysis against 25 mM sodium phosphate, pH 6.2, at 4 °C, using dialysis tubes with a 3.5 kDa molecular weight cutoff (Generon). The concentration of the dialyzed sample was determined through a bicinchoninic acid (BCA) assay.[72] Crystals were grown by mixing 0.2 μL of the protein–fragment complex (1.1 mM) and 0.1 μL of the crystallization solution in hanging drop plates at 293 K. The crystallization solution (39 mM bicine, 61 mM Tris, pH 8.5, 7.8% w/v PEG 3350, 7.8% w/v PEG 1000, 7.8% v/v 2-methyl-2,4-pentanediol, and 6.7 mM each of 1,6-hexanediol, 1-butanol, 1,2-propanediol, 2-propanol, 1,4-butanediol, and 1,3-propanediol) was prepared from Morpheus Buffer System 3 at pH 8.5, Morpheus Precipitant Mix 4, and Morpheus Alcohols Mix (all from Molecular Dimensions). After 2 weeks, crystals were fished and flash-frozen in liquid nitrogen. The diffraction data were collected at 0.9795 Å, with 0.025 s exposition and 0.15° oscillation, for a total of 2400 images on beamline I24 at Diamond Light Source (U.K.). The data were processed using the xia2[73] bundle, with DIALS[74] for integration and using Pointless/Aimless[75,76] for scaling and merging. The data were processed using CC1/2 and completeness as cutoff criteria.[77] The structure of S52C–S54 was solved by molecular replacement, using full-length β2m (PDB 5CS7(78)) with the first six amino acids removed as the search model in PHASER.[79] COOT[80] and REFMAC5[81] were used for refinement. Parameterization of Cys−S54 (as a non-natural amino acid) was carried out in XPLOR-NIH,[82] followed by refinement of its structure using the fixupCovalentGeometry function while satisfying the electron density. The quality of the final structure was assessed with MolProbity.[83] Data collection and refinement statistics are shown in Table S1. Figures were prepared using PyMOL (version 2.4, Schrödinger).

Protein NMR

All spectra were acquired by using a 750 MHz Bruker Avance III HD spectrometer equipped with a TCI cryoprobe, at a protein concentration of 150 μM in 25 mM sodium phosphate, pH 6.2, at 25 °C. 1H–15N SOFAST HMQC spectra were processed with the use of NMRPipe,[84] and calculation of peak intensities was performed in CcpNmr Analysis (version 2.4).[85]1H–15N peaks were assigned to specific backbone resonances for each protein–fragment adduct (L65C−βME, L65C–S54, S52C−βME, and S52C–S54) by comparison to existing ΔN6 assignments,[36] only considering peaks with intensities at least 3-fold greater than the spectral noise. Backbone chemical shift perturbations (CSPs) between samples were calculated with eq :where ΔδHN and ΔδN represent the difference in peak position in the direct and indirect dimensions, respectively. L65C−βME and L65C–S54T2 experiments were performed by acquiring a series of sensitivity-enhanced 1H–15N heteronuclear single quantum coherence (HSQC) spectra in an interleaved fashion at a range of delay times (0.017–0.136 ms for L65C−βME, 0.017–0.119 ms for L65C–S54). Peak intensities (I) were extracted from each spectrum with the series.tab NMRPipe module, and per-residue 15N relaxation rates (R2) were calculated by fitting the peak intensity at different delay times (t) to a two-parameter exponential function (eq ). For S52C−βME and S52C–S54, sensitivity limitations did not permit the determination of 15N R2 rates on a per-residue basis. Instead, the first increment of the standard T2 experiment at each delay time (0.017–0.102 ms for S52C−βME, 0.017–0.085 ms S52C–S54) was used to determine the R2 rate of the whole amide region (7.6–9.2 ppm). Mean 15N R2 rates for the whole amide region were calculated in the same way for ΔN6 (0.017–0.085 ms) and the two L65C adducts (L65C−βME and L65C–S54, using the same datasets acquired for determining per-residue 15N R2 rates).

81 in total

1. Oligomeric states along the folding pathways of β2-microglobulin: kinetics, thermodynamics, and structure.

Authors: E Rennella; T Cutuil; P Schanda; I Ayala; F Gabel; V Forge; A Corazza; G Esposito; B Brutscher
Journal: J Mol Biol Date: 2013-05-03 Impact factor: 5.469

2. Recognizing and analyzing variability in amyloid formation kinetics: Simulation and statistical methods.

Authors: Damien Hall; Ran Zhao; Masatomo So; Masayuki Adachi; Germán Rivas; John A Carver; Yuji Goto
Journal: Anal Biochem Date: 2016-07-16 Impact factor: 3.365

Review 3. Effects of in vivo conditions on amyloid aggregation.

Authors: Michael C Owen; David Gnutt; Mimi Gao; Sebastian K T S Wärmländer; Jüri Jarvet; Astrid Gräslund; Roland Winter; Simon Ebbinghaus; Birgit Strodel
Journal: Chem Soc Rev Date: 2019-07-15 Impact factor: 54.564

4. Turning a protein kinase on or off from a single allosteric site via disulfide trapping.

Authors: Jack D Sadowsky; Mark A Burlingame; Dennis W Wolan; Christopher L McClendon; Matthew P Jacobson; James A Wells
Journal: Proc Natl Acad Sci U S A Date: 2011-03-23 Impact factor: 11.205

5. K-Ras(G12C) inhibitors allosterically control GTP affinity and effector interactions.

Authors: Jonathan M Ostrem; Ulf Peters; Martin L Sos; James A Wells; Kevan M Shokat
Journal: Nature Date: 2013-11-20 Impact factor: 49.962

6. Proteome-wide covalent ligand discovery in native biological systems.

Authors: Keriann M Backus; Bruno E Correia; Kenneth M Lum; Stefano Forli; Benjamin D Horning; Gonzalo E González-Páez; Sandip Chatterjee; Bryan R Lanning; John R Teijaro; Arthur J Olson; Dennis W Wolan; Benjamin F Cravatt
Journal: Nature Date: 2016-06-15 Impact factor: 49.962

Review 7. Dialysis-related amyloidosis: challenges and solutions.

Authors: R Scarpioni; M Ricardi; V Albertazzi; S De Amicis; F Rastelli; L Zerbini
Journal: Int J Nephrol Renovasc Dis Date: 2016-12-07

8. Multistep Conformation Selection in Amyloid Assembly.

Authors: Ming-Chien Hsieh; Chen Liang; Anil K Mehta; David G Lynn; Martha A Grover
Journal: J Am Chem Soc Date: 2017-11-15 Impact factor: 15.419

9. Increased Aggregation Is More Frequently Associated to Human Disease-Associated Mutations Than to Neutral Polymorphisms.

Authors: Greet De Baets; Loic Van Doorn; Frederic Rousseau; Joost Schymkowitz
Journal: PLoS Comput Biol Date: 2015-09-04 Impact factor: 4.475

10. Critical aggregation concentration for the formation of early Amyloid-β (1-42) oligomers.

Authors: Mercedes Novo; Sonia Freire; Wajih Al-Soufi
Journal: Sci Rep Date: 2018-01-29 Impact factor: 4.379

4 in total

1. Towards identification of protein-protein interaction stabilizers via inhibitory peptide-fragment hybrids using templated fragment ligation.

Authors: Sonja Srdanović; Zsofia Hegedüs; Stuart L Warriner; Andrew J Wilson
Journal: RSC Chem Biol Date: 2022-04-01

2. Visualizing and trapping transient oligomers in amyloid assembly pathways.

Authors: Emma E Cawood; Theodoros K Karamanos; Andrew J Wilson; Sheena E Radford
Journal: Biophys Chem Date: 2020-11-10 Impact factor: 2.352

Review 3. Generating Ensembles of Dynamic Misfolding Proteins.

Authors: Theodoros K Karamanos; Arnout P Kalverda; Sheena E Radford
Journal: Front Neurosci Date: 2022-03-31 Impact factor: 4.677

4. Single residue modulators of amyloid formation in the N-terminal P1-region of α-synuclein.

Authors: Sabine M Ulamec; Roberto Maya-Martinez; Emily J Byrd; Katherine M Dewison; Yong Xu; Leon F Willis; Frank Sobott; George R Heath; Patricija van Oosten Hawle; Vladimir L Buchman; Sheena E Radford; David J Brockwell
Journal: Nat Commun Date: 2022-08-25 Impact factor: 17.694

4 in total