Literature DB >> 32866007

Comprehensive Structural and Thermodynamic Analysis of Prefibrillar WT α-Synuclein and Its G51D, E46K, and A53T Mutants by a Combination of Small-Angle X-ray Scattering and Variational Bayesian Weighting.

Paolo Moretti¹, Paolo Mariani¹, Maria Grazia Ortore¹, Nicoletta Plotegher², Luigi Bubacco², Mariano Beltramini², Francesco Spinozzi¹.

Abstract

The in solution synchrotron small-angle X-ray scattering SAXS technique has been used to investigate an intrinsically disordered protein (IDP) related to Parkinson's disease, the α-synuclein (α-syn), in prefibrillar diluted conditions. SAXS experiments have been performed as a function of temperature and concentration on the wild type (WT) and on the three pathogenic mutants G51D, E46K, and A53T. To identify the conformers that populate WT α-syn and the pathogenic mutants in prefibrillar conditions, scattering data have been analyzed by a new variational bayesian weighting method (VBWSAS) based on an ensemble of conformers, which includes unfolded monomers, trimers, and tetramers, both in helical-rich and strand-rich forms. The developed VBWSAS method uses a thermodynamic scheme to account for temperature and concentration effects and considers long-range protein-protein interactions in the framework of the random phase approximation. The global analysis of the whole set of data indicates that WT α-syn is mostly present as unfolded monomers and trimers (helical-rich trimers at low T and strand-rich trimers at high T), but not tetramers, as previously derived by several studies. On the contrary, different conformer combinations characterize mutants. In the α-syn G51D mutant, the most abundant aggregates at all the temperatures are strand-rich tetramers. Strand-rich tetramers are also the predominant forms in the A53T mutant, but their weight decreases with temperature. Only monomeric conformers, with a preference for the ones with the smallest sizes, are present in the E46K mutant. The derived conformational behavior then suggests a different availability of species prone to aggregate, depending on mutation, temperature, and concentration and accounting for the different neurotoxicity of α-syn variants. Indeed, this approach may be of pivotal importance to describe conformational and aggregational properties of other IDPs.

Entities: Chemical Disease Gene Mutation Species

Year: 2020 PMID： 32866007 PMCID： PMC8154249 DOI： 10.1021/acs.jcim.0c00807

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Introduction

Intrinsically disordered proteins (IDPs) are a challenge for the biophysical community.[1] In the past decade, they have in fact attracted attention from both a theoretical and an experimental point of view.[2−4] In contrast to the structure–function paradigm that has dominated for many years in protein science, it has become clear that the function played by IDPs in many biological processes is due not only to the lack of a unique tertiary structure but also to mainly the high degree of conformational heterogeneity.[5] The emerging structural picture represents IDPs as an ensemble of conformers that transform by a dynamical formation and destruction of secondary structure elements.[6] This conformational “flexibility” enables IDPs to play a pivotal role in protein–protein recognition, signal transduction, and transcriptional regulation processes.[7] In addition to these physiological functions, the intrinsic “plasticity” of IDPs has been associated with a number of pathological processes, among which are neurodegenerative diseases and cancer.[8] IDPs are indeed relatively free to explore a wide conformational landscape and, under certain environmental conditions, they can adopt conformations that trigger aggregation pathways.[9−13] In the case of cross-β interactions, IDPs progress toward the formation of fibrillar structures, also known as amyloid fibers, which are among the hallmark of several neurodegenerative diseases. Examples are Parkinson’s disease (PD), associated with the fibrillation of α-synuclein (α-syn),[14,15] Alzheimer’s disease, associated with the β-amyloid (Aβ) peptide, and Hungtington’s disease, in which huntingtin modifications are involved. One of the main challenging issues in studying IDPs is to describe in a quantitative way the conformational ensemble[3] in order to identify which are the main structural features that trigger, under diverse chemical–physical conditions, the nucleation step of the fibrillation processes.[3,10] From a theoretical point of view, molecular dynamics (MD) and molecular mechanics (MM) approaches have been largely exploited to define ensembles of conformers in equilibrium conditions.[16,17] Recent MD achievements indicate that results strongly depend on the chosen force field and on the model adopted to describe water molecules.[18,19] Conversely, calculated ensembles of conformers are very often used to interpret sets of experimental data that depend on the distribution of conformational states of the protein. This type of analysis is focused on defining the population weights of each conformer in order to assess its contribution to the averaged data of the observables.[17,20] Experimental techniques typically analyzed with ensembles of conformers are nuclear magnetic resonance (NMR),[21,22] small-angle X-ray scattering (SAXS),[7,23] and Förster resonance energy transfer (FRET) spectroscopy.[1,6] In all cases, the number of available experimental observables is by far lower than the number of conformers (hence of the degrees of freedom) of the chosen IDP ensembles, and, as a consequence, there is no unique solution which allows one to reproduce the experimental data.[24] As thoroughly discussed in a review of Ravera et al.,[25] two opposite approaches have been described to overcome this redundancy problem. The first exploits, in different forms, the maximum entropy principle,[26] aiming to obtain the least biased probability distribution of each conformer.[24,27] The second approach is inspired by the “Occam razor’s” rule, i.e., the maximum parsimony principle, which is aimed to determine the minimum number of conformers that are sufficient to recover the experimental data.[20,28−30] In general, the first approach is considered more suitable to describe the behavior of IDPs, which, being a “natural” ensemble of a great number of conformers, can hardly be imagined as a set of few conformers.[20] Moreover, the combination of a conformation ensemble with a set of experimental data by means of maximum entropy approaches can be considered a proper way to not only analyze the data but also to validate the theoretical ensembles. The maximum entropy principle is adopted following two possible strategies. In the first one, it acts directly into molecular simulations by means of restraints between experimental and calculated observables. In the second strategy, the maximum entropy intervenes “a posteriori” as a reweighting method able to determine the weights of the conformers of the ensemble generated by MD or MM simulations in order to optimize the consistency with experimental data.[17,20,31] The Bayesian formalism,[29,32,33] which belongs to the maximum entropy scheme, combines prior information on a IDP system with experimental data and, most importantly, takes into account the experimental errors in these data. Hence, the Bayesian inference has been considered particularly suited to investigate IDPs or large intrinsically disordered protein regions (IDPRs).[4,5,34,35] In particular, a computationally efficient algorithm, called variational Bayesian weighting (VBW), has been adopted to derive the population weights of each conformer together with its standard deviations from NMR data of IDPs, such as α-syn[36] and Aβ.[37] The efficiency of VBW is in the use of the simple Dirichlet distribution[38] to describe both the prior probability function of conformers as well as the posterior probability that takes into account the information provided by experimental data. In this article, we first present a novel VBW reweighting method to extensively study the conformational properties of IDPs by taking full advantage of small-angle scattering (SAS) data and their measured variances. The method, which we have called VBWSAS, takes into consideration ensembles of conformers in different multimeric states and applies, for each class of multimers, the VBW strategy within an overall thermodynamic scheme. On the basis of a batch of SAS curves recorded under different chemical–physical conditions, the VBWSAS method is capable of deriving not only the monomer population weights of each multimeric conformer but also their variation as a function of temperature and protein concentration. Also, the secondary structure of IDPs is derived in terms of propensities[39] of each residue to be in defined regions of the Ramachandran map. We then apply the VBWSAS approach to analyze SAXS data of α-syn, a 140 residue protein that constitutes almost 1% of the total proteins in soluble cytosolic brain fractions.[40] Several different functions have been ascribed to α-syn, including synaptic vesicles trafficking and neurotransmitter release. Coherently, the protein is known to interact with several different binding partners and with negatively charged lipid membranes. A large body of evidence led to the concept that misfolded forms of α-syn are associated with the pathogenesis of Parkinson’s disease (PD).[41] Under pathological conditions, α-syn forms a heterogeneous ensemble of oligomeric species, some of which are converted to β-sheet-rich fibrillar forms of the protein. These α-syn aggregates have been shown to be toxic for neurons through different molecular mechanisms (reviewed in Plotegher et al.[42]). As depicted in Figure , the amino acid sequence of α-syn can be divided into three domains: the N-terminal domain, residues 1–60, which acquire an α-helical structure when the protein interacts with negatively charged lipid membranes or vesicles; the highly amyloidogenic and hydrophobic NAC (non-Aβ-component) domain, residues 61–95; and the C-terminal domain, residues 96–140, enriched in acidic residues and prolines. Both the relatively low hydrophobicity and the high net charge are the cause of the intrinsically disordered nature of α-syn.

Figure 1

Schematic depiction of α-syn. N-terminus (1–60), NAC domain (61–95), and C-terminus (96–140) are colored light blue, orange, and light red, respectively. Starting and ending residues for N-terminus, NAC region, and C-terminus are labeled. Here, we present high-quality synchrotron SAXS data, measured as a function of temperature and concentration, of wild type (WT) α-syn samples as well as three point mutants G51D, E46K, and A53T, associated with the familial form of PD. Temperature and concentration are two parameters already known to impact α-syn aggregation in vitro, as well as the pathological point mutations, which were shown to impact on the protein aggregation propensity.[43,44] The VBWSAS analysis of these SAXS data has been performed by adopting the ensemble of conformers derived by Gurry et al.,[45] which comprehends unfolded monomers, trimers, and tetramers, these latter in both helical-rich and strand-rich forms. The results and their analyses allow a description of α-syn conformational and multimeric disorder and its changes as a function of pathological point mutation, concentration, and temperature.

Material and Methods

The VBWSAS Method

The method here developed considers an ensemble of N conformers of a IDP under investigation, supposed to be constituted by a polypeptide chain of Naa residues (amino acids). We assume that this ensemble contains all the conformational states that monomers of the IDP can adopt in any condition experimentally observed. As a consequence, the IDP molecules will be distributed in N conformers according to a set of monomer population weights w, with the normalization condition ∑w = 1. We define w as the set of all the monomer population weights. Moreover, we assume that several monomers, in a given conformation, can form defined multimers, so that the ensemble can be subdivided in M classes of conformers, which are different for their aggregation number, indicated by m. Accordingly, we assume that in the mth class of conformers there are N conformers, so that ∑N = N. We introduce the set W that contains the multimer population weights within the m-class of conformers, with the normalization condition ∑W = 1 Hence, if the protein monomer is in the mth class of conformers, W represents the multimer population weight in which it is folded according to the jth conformer of that class of conformers. The monomer population weight of IDPs in the mth class of conformers, i.e., monomers forming multimers with aggregation number m, independently on their conformations, is indicated by the symbol ω, with the normalization condition ∑ω = 1. With these definitions, the monomer population weight w of the ith conformer among all the N conformers of the ensemble can be written aswhere m is the class of conformers to which the i-conformer belongs and j indicates which of the conformers of that class of conformers the i-conformer corresponds. From a thermodynamic point of view, in an ideal solution, the chemical potential of a monomeric chain of the IDP in the i-conformer forming a multimer with an m aggregation state is defined aswhere R is the universal gas constant, T the absolute temperature, and C the molar concentration of the i-multimer, corresponding to C = (c/M1)(w/m), c being the nominal w/v protein concentration and M1 the IDP monomer molecular weight. At equilibrium, the chemical potentials of all monomers are equal. Hence, by referring to the first conformer (i = 1), at equilibrium we havewhere ΔG° = μ° – μ°1 is the standard Gibbs free energy change corresponding to the transformation of a solution 1 M of monomers in the 1-conformer and having aggregation number m1 into monomers in the i-conformer with aggregation number m. The last equation allows one to derive the thermodynamic average, corresponding to the equilibrium conditions, of the monomer population weight in the i-conformer, named ⟨w⟩ , as a function of the one of the first conformerBy combining with the normalization conditions ∑⟨w⟩ = 1, a polynomial equation of degree γ = max{m} of the unique variable is obtainablewhere δ is the Kronecker delta function. According to the Abel–Ruffini theorem, analytic solutions are available only up to γ = 4, i.e., up to the formation of tetramers, which are the multimers with the maximum aggregation number in the ensemble adopted in this work.[45−48] Classical thermodynamics allows one also to describe the standard Gibbs free energy change as a function of T in terms of the variations of the standard enthalpy and the standard entropy, both at the reference temperature T0 = 298.15 K (ΔS⊖ and ΔH⊖, respectively), and the variation of the heat capacity at constant pressure (ΔC, supposed not to vary with temperature), all referred to the first conformer, according to . To note, by using eq , the ratio , seen in eq , can be written in terms of three dimensionless variations of enthalpy, , entropy , and constant pressure heat capacity

Variational Bayesian Weighting on Different Classes of Conformers

By generalizing the variational Bayesian weighting (VBW) method,[5,8,36,45] we introduce the posterior probability density function (PDF) f(W1, ..., W) to find out a M-dimensional set of multimer population weights, namely, W1, ..., W, aswhere fin(W1, ..., W) is the prior probability density function, fex(W1, ..., W) is the likelihood probability density function for the experimental observations, and Z is the normalization factor. In this work, experimental observations are SAXS or SANS curves. For the sake of simplicity and tractability of the problem, we make the strong and crucial assumption that the posterior PDF is factorized in a product of M posterior PDFs corresponding to each class of conformers, namely . We also assume that the same assumption holds for the prior PDF . Moreover, according to Fisher et al.,[37] we make the much stronger assumption that each class of conformers’ PDF can be expressed by a Dirichlet function[38]with Γ(x) being the gamma function. To note, the Dirichlet function is fully defined by the set of real positive parameters α ≡ (α,..., α), whose sum is defined as . On the basis of the known properties of the Dirichlet distribution, the average and the covariance of the set of multimer population weights areAccording to Fisher et al.,[37] an unbiased prior PDF can be defined through a Dirichlet function with all parameters fixed to α = 1/2,It is worth noticing that, on the basis of eqs and 12, all the multimer population weights corresponding to the prior PDF have a unique average value ⟨W⟩ = N–1, with variance 2N–2(N–1)/(N+2). On these grounds, it can be demonstrated that the sequence of the sets of Dirichlet parameters, α1, ..., α, may be found by minimizing the functional

SAS and Variational Bayesian Weighting

The variational Bayesian Weighting formalism can in principle be applied to any experimental observation obtained over a system of IDPs. In the case of a small-angle X-ray or neutron scattering curve (here labeled with a subscript c), the external probability is represented by the following equationwhere and σc(q) represent the experimental macroscopic differential X-ray or neutron scattering cross section (SCS) and its standard deviation, respectively, measured in the kth of Nc, values of the scattering vector modulus q. In the most general case, let us assume that our investigated IDP, with conformations as well as aggregations described by a selected ensemble, may be at moderate or high concentration, so that in the experimental SAS curve the effect of long-range protein–protein interactions can be observed. Considering a unique average protein–protein structure factor S(q) that takes into account effective interactions among any conformer or multimers, according to the SAS formalism, the SCS values, which have to be close to the corresponding experimental values provided by SAS, can be expressed as a function of W and ωwhere the average form factor iswith being the total nominal number density of monomers (N is Avogadro’s number). In eq , corresponds to the average, over the polar angles α and β of the scattering vector q (orientational average), of the squared form factor of the jth conformer of the m-class of conformers, a function that can be calculated on the basis of atomic coordinates (e.g., from a PDB file[49]) for both X-rays or neutron scattering by means of methods such as SASMOL.[50] According to scattering theory, S(q) is the effective or measured structure factorwhere the so-called coupling function βell(q) is the ratio |⟨Feff(q)⟩|2/⟨|Feff(q)|2⟩ between the square of the effective orientational average form factor and the orientational average of the effective squared form factor. As discussed by Pedersen et al.,[51] this function, which typically deviates from 1 for anisometric shapes, can be approximated in an acceptable way by assuming that the effective particle has a simple geometrical shape. In our case, we have considered the shape of a biaxial ellipsoid. The protein–protein structure factor, S(q), is calculated as the perturbation of the hard sphere structure factor S0(q) obtained with the well-known Percus–Yevick (PY) approximation in the framework of the random phase approximation (RPA).[52] The perturbation is due to the presence of two Yukawian terms, the first representing the screened Coulumbian repulsion potential and the other an attractive potential.[52] The relevant parameters of this approximation are “effective values” of the so-called “effective particle”: the number density, n, the radius, R, the net charge, Z, the inverse Debye screening length, κ (which depends on the ionic strength I of the protein solution), the attractive potential at contact, J, and the range of the attractive interaction, d. Considering both the average aggregation number, , and the average of its reciprocal, , these parameters can be approximated as follows: n = n◦⟨m–1⟩, Z = Z1⟨m⟩, J = J1⟨m⟩2/3, and R = R1⟨m⟩1/3, where Z1, J1, and R1 are the monomer net electric charge, the depth of the attractive potential of the monomer, and the average radius of the monomer. Notice that we have supposed that Z is simply proportional to ⟨m⟩. On the other hand, J is supposed to scale as the surface of the protein, here simply defined as the one of the spheres defined by the radius R. This latter clearly scales as the cubic root of the volume, which is directly proportional to ⟨m⟩. According to this view, the volume of the biaxial ellipsoid, which is used to determine βell(q), is (4/3)πR13⟨m⟩; hence, the only parameter that should be optimized is the ellipsoid anisometry ν, i.e., the ratio between the semiaxis a and b, b representing the two equal semiaxes. In eq , B is a flat background which takes into account incoherent scattering effects, particularly relevant in SANS experiments. By applying the advantageous properties of the Dirichlet distribution, eq transforms towhere χc2 is the canonical reduced chi-square, calculated on the basis of the theoretical SCSs (eq ) corresponding to the thermodynamic averages of all classes of conformers, ⟨ω⟩, and multimer population weights, ⟨W⟩ψ(x) is the digamma function, ψ(x) = Γ′(x)/Γ(x), and the following working pair factors have been introduced It is worth noticing that, on one hand, the factorization of the posterior PDF in a product of M posterior PDFs corresponds to the definition of M Bayesian problems, each one with its own set of parameters α. On the other hand, the overall minimization of L depends also on the class of conformers weights ⟨ω⟩, which are not treated in the Bayesian framework. Let us now assume that several SAS curves have been measured on the same IDP molecule at different temperatures T and total weight concentrations c; in this case, a unique analysis of all the data can be realized by combining the thermodynamic model with the VBW approach, with the evident advantage of significantly reducing the number of parameters that should be optimized, hence by increasing the statistical robustness of the achieved results. Indeed, according to the thermodynamic model, through eq , we are able to calculate all the values of the monomer population weights ⟨w⟩, and then, we can derive both the average values of class of conformers parameters, and, as a consequence, the values of multimer population weights , where i is the conformer among the ensemble of N conformers corresponding to the jth conformer of the mth class of conformers. On the other hand, the Dirichlet parameters can be expressed as a function of ⟨W⟩ and α, according to α = α⟨W⟩. In these conditions, we can minimize an overall functional defined on the basis of all the Nc SAS curves experimentally availableAdjustable parameters shared by all curves are ΔH⊖/(RT0), ΔS⊖/R, and ΔC/R, which allow the determination of ⟨w⟩ at any T and c, together with the parameters defining the effective structure factor. Curve-specific adjustable parameters are α. We have named this new formalism VBWSAS. As shown in the next paragraph, with this approach, we have been able to obtain good quality fits of SAS experimental data.

Propensities

The basic result of the analysis of a set of SAS data of a IDP with the VBWSAS method is the determination, as a function of temperature and protein concentration, of the average monomer population weights ⟨w⟩ of the chosen ensemble of conformers. This information allows one to derive other structural features that depend on ⟨w⟩. According to Ozenne et al.,[27] one of the most relevant of this information is the folding propensity of each amino acid, defined, in general, as the probability to find the amino acid a in an element of the protein secondary structure, such as α-helices or β-sheets. In this framework, it is of relevance to define a criterion to divide the space of the angles ϕ and ψ of the Ramachandran map[53] in regions (r) that are well representative of the most significant elements of the secondary structure. For example, according to Ozenne et al.,[27] the Ramachandran map can be divided into four regions defined as α-left, α-right, β-proline, and β-sheet, a choice which seems unrepresentative to us. Here, we propose to use a different subdivision, based on the distribution ρ(ϕ, ψ) of populated regions in the Ramachandran plot reported by Lovell et al.,[54] who have analyzed the conformation of 500 high-resolution protein structures through the application of different types of structural analysis. It follows that by contouring the ρ(ϕ, ψ) distribution (normalized to a maximum value of 1) at the levels 0.0005 and 0.02, energetically allowed and energetically favored regions could be identified. Moreover, following Ozenne et al.,[27] for −180° ≤ ϕ ≤ 0°, the allowed region is subdivided in the α-right allowed region for −120° ≤ ψ ≤ – 50° and in the β allowed region for −180° ≤ ψ ≤ – 120° and 50° < ψ < 180°. As a result, we identify eight regions, which include the three canonical regions of β, α-right, and α-left, each one divided into energetically favorable and energetically allowed, the glycine zone and the unstructured region. A color-coded visualization of the eight regions in the Ramachandran plot is shown in Figure . They are hereafter labeled as βfav, βall, αRfav, αRall, αLfav, αLall, gly, and uns.

Figure 2

Regions of the Ramachandran map according to the VBWSAS method.

Regions of the Ramachandran map according to the VBWSAS method. We consider now the atomic structure of the i-multimer of the ensemble, constituted, for example, by m chains. For each g-chain and for each a-residue (from 2 to Naa – 1) of the primary sequence, the ϕ, ψ angles can be calculated, and hence, the index r of the region of the Ramachandran map to which that residue belongs can be assigned. Clearly, the same residue in the different conformers of the ensemble could match different regions. Hence, considering the average monomer population weights ⟨w⟩ screened by SAS experiments, the propensity of the a-residue to populate the r-region of the Ramachandran map is defined by the following equationwhere, as usual, δ is the Kronecker’s delta function. The variance of the propensity is

Sample Preparation

Wild type α-syn and the E46K, G51D, and A53T familial mutants were expressed and purified following a previously described protocol.[12] Briefly, all the α-syn variants were cloned into the pET-28a plasmid and were expressed into the BL21(DE3) E. coli strain. Bacterial cultures were grown at 37 °C in Luria–Bertani broth and induced with 0.1 mM isopropyl-b-thiogalactopyranoside (IPTG). After 5 h, cells were collected by centrifugation, and recombinant proteins were recovered from the periplasm by osmotic shock. The periplasmic homogenate was boiled for 10 min, and the soluble fraction underwent a two-step (35% and 55%) ammonium sulfate precipitation. The pellet was resuspended in 20 mM Tris-HCl at pH 8.0 and dialyzed. The protein solution was loaded into a 6 mL Resource Q column (Amersham Biosciences) and eluted with a 0–500 mM NaCl gradient. After dialysis against water, all the α-syn variants were lyophilized and stored as powder at −20 °C. For SAXS measurements, proteins were solubilized in water, and ultrafiltration spin columns, with a cutoff of 100 kDa (Amicon), were used to remove larger aggregates, possibly formed during lyophilization and resuspension. Protein purity and integrity were checked after purification and/or storage by SDS-PAGE, and concentration was calculated measuring the absorbance of protein solutions using a spectrophotometer (Perkin Elmer) and considering the molar extinction coefficient of the α-syn at 280 nm equal to 5960 M–1cm–1.

SAXS Experiments

Experimental SAXS data were recorded at the BioSAXS beamline BM29 at The European Synchrotron, ESRF in Grenoble (France). The α-synuclein WT and the point mutants G51D, E46K, and A53T were measured at different w/v concentrations c comprised between 1 and 10 g/L at temperatures of 25°, 37°, and 45 °C. An automated sample changer was used, and the sample environment was a quartz glass capillary with a diameter of 1.8 mm. The sample-to-detector distance was 2.867 m, and the photon energy was set to 12.5 keV. Accordingly, the modulus of the scattering vector, q = 4π sin θ/λ (2θ being the scattering angle and λ = 0.992 Å the X-ray wavelength) was comprised in the range of 0.022–0.41 Å–1. Two-dimensional SAXS raw data were recorded by a Pilatus 1 M detector, corrected for detector efficiency, radially averaged to get isotropic signals, and calibrated in absolute units (cm–1) by using water. The protein in solution, the buffer, and the empty cell were measured 20 times with an acquisition time of 1 s. The experimental SCS, , of protein samples were obtained by subtracting the signal (averaged over the 20 measurements) from the one of buffer corrected for the protein volume fraction. The experimental standard deviations on SCS, σc(q), were calculated according to the error propagation theory on the basis of the average values and the standard deviations obtained from the 20 independent measurements of sample, buffer, and empty cell.

Results and Discussion

All measured SAXS curves are reported in Figure in the form of log–log plots (panel a) along the whole q range, in the form of Kratky plots (panel b, up to q = 0.3 Å–1), and as liner-linear plots (panel c) to emphasize the region at low q. For the sake of comparison among the various experimental conditions, curves have been divided by the w/v protein concentration c. Qualitative similarities among the curves of WT α-syn (gray curves) as well as among the curves of each mutant (G51D, salmon curves; E46K, gold curves; A53T, turquoise curves) can be appreciated. We also observe that, among the curves referring to the same α-syn mutant, the main differences at low q (panel c) are due to the presence of a broad interference peak, which changes in position and height mainly, as expected, as a function of c (notice that solid curves refer to the highest values of c). The Kratky plots (panel b) allow one to better appreciate the differences at high q not only among the curves of different α-syn types but also among the ones of the same type. Most importantly, for all protein types, concentrations, and temperatures, Kratky plots show a peak, indicating the presence of folded protein domains, as well as a plateau at high q, a signature of unfolded chains,[55] suggesting that α-syn molecules are either in partially unfolded states or a mixture of folded and unfolded states. Figure , panel c, indicates that in the intermediate q range around 0.05 Å–1, a region of SAXS data that would not be modified by the effect of the structure factor, the normalized curves reach different values, depending on α-syn type, c, and T. This feature suggests the possible presence of oligomeric forms.

Figure 3

Synchrotron SAXS curves of WT α-syn (gray curves) and the mutants G51D (salmon curves), E46K (gold curves), and A53T (turquoise curves) reported as a function of q. Each panel refers to a different temperature, as indicate above. Data, expressed as macroscopic differential scattering cross sections in absolute scale (cm–1) divided by the protein w/v concentration c, are reported as log–log plots (main panels a), Kratky plots (subpanels b), and linear–linear plots (subpanels c). The darkness of the colors increases with the temperature. Dotted, dashed, and solid lines refer to concentration ranges c ≤ 4 g/L, 4 < c < 8 g/L, and c ≥ 8 g/L, respectively. Error bars have been omitted for clarity. This preliminary and qualitative information has led us to develop the VBWSAS method fully described in The VBWSAS Method section. Indeed, since SAXS data reveal the possible presence of multimeric conformers, it is necessary to adopt an ensemble of protein conformers that includes multimers. On the other hand, the presence of an interference peak at low q implies the adoption of a data analysis method that deals not only with form factors but also with structure factors. On these grounds, we have analyzed with the VBWSAS method the four series of SAXS curves, each series corresponding to one of the four α-syn types. In order to deal with the possible presence of multimers, we have adopted the ensemble of α-syn conformers published by Gurry et al.[45] This ensemble contains N = 189 conformers, recorded as PDB files. To note, these conformers have been selected by the authors, through NMR data, from a larger library of 533 conformers built from a pool of 60,000 structures that, in order to get heterogeneous conformers, was subsequently reduced by a minimum pairwise root-mean-square deviation cutoff of 9 Å. Within the N = 189 conformers, there are M = 4 classes of conformers, corresponding to N1 = 98 monomers (51.9% of the total, referred to as 1A-subclass), N3 = 15 trimers (7.9%), and N4 = 76 tetramers (40.2%). Notice that there are no dimers (N2 = 0). By following the secondary structure assignment proposed by Gurry et al.,[45] based on the DSSP method,[56] the trimers are subdivided in N3 = 4 (2.1%) helical-rich conformers (3B-subclass) and N3 = 11 (5.8%) strand-rich conformers (3C-subclass). Likewise, among the tetramers, there are N4 = 19 (10.1%) helical-rich conformers (4D-subclass) and N4 = 57 (30.2%) strand-rich conformers (4E-subclass). We assume that all these conformers are suitable to define the conformational and multimeric probability distribution of any of the four α-syn types, at any concentration and temperature investigated by SAXS. All form factors calculated with SASMOL are shown in Figure . Curves in the form of Kratky plots (panel c) clearly show that monomers (red color curves) are unfolded chains, whereas the presence of a peak at q ≈ 0.1 Å–1 for trimers (green and cyan color curves) as well as tetramers (blue and magenta color curves) confirms that they are folded conformers. We also observed that the behaviors at high q (panels a and b) are flatter for folded multimers than for unfolded monomers. These simulations, when compared with the experimental curves shown in Figure , suggest that the investigated α-syn types in solution may be seen as mixtures of unfolded monomers and folded multimers, confirming the appropriateness of the Gurry et al.[45] ensemble adopted by our VBWSAS approach.

Figure 4

SAXS form factors of the α-syn PDB conformers found by Gurry et al.[45] calculated with SASMOL. Results are reported as linear-log plots (panel a), log–log plots along the q-range of our experiments (panel b), and as Kratky plots (panel c). Red color refers to monomers (1A-subclass). Trimers are shown in green (helical-rich 3B-subclass) and cyan (strand-rich 3C-subclass). Tetramers are shown in blue (helical-rich 4D-subclass) and magenta (strand-rich 4E-subclass). The darkness of the color has been assigned on the basis of the calculated radius of gyration of the conformer, according to the palette on the left. The relative mass density of the hydration water has been fixed to 1.05. See Ortore et al.[50] for details. VBWSAS has been developed in the Bayesian framework; however, it contains a considerable number of parameters. Hence, in order to obtain robust results, it is worth fixing the value, and whenever possible the T or c dependencies, of all the parameters that represent either experimental conditions or consolidated chemical–physical properties of our system. Accordingly, considering the thermal expansion of water (see Variation with T of Concentration and Solvent SLD, Supporting Information), we have estimated the variation with T of the protein w/v concentration and of the scattering length density (SLD) of bulk water. In the SASMOL method, the contribution of hydration water to the form factor is taken into account by assigning to the water molecules in the first hydration shell a relative mass density d different from the one of bulk water. It is known and widely accepted in the SAS community that for folded protein d is in the order of 1.05–1.15,[57] whereas there is not clear evidence of its value for unfolded proteins. We have to consider that, since the volume of the first hydration shell for unfolded proteins is quite large in respect to the dry protein volume, the effect of d can greatly vary the form factor. However, unfolded proteins expose toward the solvent both hydrophobic and hydrophilic groups. Hence, it seemed wise to limit the validity range of d to 0.95–1.05 and to optimize a unique average value, applied to all the conformers of the ensemble, optimized in the narrow range. For the screened Coulumbian repulsion potential, we have approximated the value of the relative dielectric constant of the solutions with the one of pure water, whose dependency on temperature is known.[58] The monomer net charge Z1 of WT α-syn and of the three mutants G51T, E46K, and A53T has been calculated, according to the primary sequence, as a function of T, considering the acidic dissociation constant (pKa at the reference temperature T0;[59] see Table S1 of the Supporting Information) of the side chain of the 20 amino acids as well as the ones of N- and C-terminal groups. Results are reported in Table S2 of the Supporting Information. Conversely, since there are not consolidated theories to estimate the parameters of the Yukawian attractive potential (the energy at the contact J1 and the decay length d), we have left them free to vary not only with the α-syn type but also with T and c. Similarly, we consider free parameters also the average radius of the monomer R1 (that enters in both the hard sphere and in the two Yukawian terms of the potential) and the ellipsoid anisometry ν defining the coupling function βell(q). However, in order to avoid excessive and unlikely oscillations for all these free parameters, a regularization algorithm has been adopted.[60−62] Therefore, we simultaneously analyze with the VBWSAS method all the SAXS curves measured for each α-syn type by minimizing the following merit functionwhere L is defined according to eq , and V is the regularization factorTo note, V increases with the difference of the kth single curve fitting parameter, X (k = 1, 2, 3, 4 refers to J1 and d, R1 and ν, respectively), of two close chemical–physical conditions (c and T), corresponding to the c-curve and the c′-curve. The constant α in eq is wisely chosen in order to get a factor αV lower than ≈10% of whole final merit function . The minimization has been performed by combining the simulated annealing with the simplex methods,[63] and in order to estimate the standard deviations of all fitting parameters, it has been repeated several times by randomly sampling each point of the SAXS experimental curves from a Gaussian with mean value and variance σc2(q). Best fitting curves obtained by applying VBWSAS for each of the four series of SAXS curves are reported in Figure . To note, we have also performed VBWSAS analyses by using subsets of the ensemble of Gurry et al.[45] For all α-syn species, we have found that the best curve fits, in particular at low q, are obtained by using all the 189 conformes of the Gurry et al.[45] ensemble, confirming the appropriateness of its structural heterogeniety. Detailed graphs reporting the distinct contributions of form and structure factors are shown in Figures S1–S4 of the Supporting Information. We notice that all the experimental features of SAXS curves at both high and low q, including the interference peak mainly evident at the largest concentration, are nicely reproduced by VBWSAS. It is also worth considering that we have fully exploited the absolute calibration of the data and the very precise buffer subtraction procedure described in the SAXS Experiments section.

Figure 5

Experimental SAXS curves of WT α-syn and the mutants G51T, E46K, and A53T superimposed with the best fits obtained with VBWSAS (solid black and white lines). Curves are color coded on the basis of the α-syn type and of the temperature, according to the caption of Figure . The nominal protein concentration is reported beside each curve in g/L units. For the sake of a better visualization, curves in the same column, referring to the α-syn type shown on the top of the column, have been staked by multiplying for a factor 10, m being the index of the row from the bottom. Experimental standard deviations are reported as error bars at every 5 points, for clarity. We look now at the results, starting from the thermodynamic fitting parameters of each i-conformer reported, in the form of histograms, in the panels of Figure and calculated as differences with respect to the mean value, ΔΔΦ = ΔΦ – ⟨ΔΦ⟩, ΔΦ representing ΔH⊖, T0ΔS⊖, ΔC, and ΔG⊖ = ΔH⊖ – T0ΔS⊖. It should be noticed that the histogram bars have been colored on the basis of the subclass and the radius of gyration of the i-conformer, following the same color settings of Figure . To simplify the interpretation of these results, we have sorted the N = 189 conformers in ascending order of ΔΔH⊖, as reported in the top panels of Figure . Moreover, in the other three panels below each enthalpy panel, related to the same α-syn type, we report the data as a function of the same sorted sequence of conformers used in the enthalpy panel. In this way, we can better estimate the relationships, if any, between the thermodynamic parameters of each i-conformer. The high similarity between the first two panels of the same column of Figure clearly shows an entropy/enthalpy compensation effect.[64] Indeed the variations ΔΔH⊖ and T0ΔΔS⊖ are comprised between −100 and 100 kJ/mol, whereas their difference, corresponding to ΔΔG⊖ (bottom panels of Figure ), is lower by nearly 1 order of magnitude, varying from −10 to 10 kJ/mol. These low free energy differences, close to the thermal energy at room temperature, confirm that the chosen ensemble of conformers is suitable to describe a disordered conformational landscape. Interestingly, ΔΔG⊖ of monomers and, in less extent, trimers (1A, 3B, and 3C subclasses shown with red, green, and cyan bars, respectively) are mostly positive, whereas for the tetramers subclasses (4D and 4E, blue and magenta bars, respectively) they are negative.

Figure 6

Thermodynamic fitting parameters obtained by the analysis of SAXS data with VBWSAS of WT α-syn and the mutants G51T, E46K, and A53T. All data are reported as differences with respect to their mean (ΔΔΦ = ΔΦ – ⟨ΔΦ⟩). The 189 conformers are sorted on the basis of the standard enthalpy changes reported, for each α-syn type, in the top panels. The same sorted series of conformers is adopted in the other panels regarding the same α-syn type, which report the variation of standard entropy, heat capacity, and standard Gibbs free energy (this latter calculated by ΔΔH⊖ – T0ΔΔS⊖). The color hue of the histogram bars is assigned according to the α-syn subclass (1A red, 3B green, 3C cyan, 4D blue, 4E magenta), and the darkness of the color increases with R, as described in the caption of Figure . Standard deviations are shown as black error bars. In the VBWSAS method, the fitted thermodynamic parameters are used to calculate the average monomer population weights ⟨w⟩ (eq ), which clearly represent the most relevant information regarding the conformational landscape. In order to provide a comprehensive description of the achieved results, we have calculated them, for each α-syn type, at three unique values of w/v concentration (2, 5, and 10 g/L) and three unique values of temperature (25°, 37° and 45 °C). Results are reported in Figure in the form of histograms, with bars colored according to the same code used in Figure and with the i-conformers sorted in ascending order of ⟨w⟩.

Figure 7

Histograms of the average monomer population weights ⟨w⟩ associated with the N = 189 α-syn PDB conformers exploited by VBWSAS for the best fit analysis of the four batches of SAXS curves for WT α-syn and the three mutants G51T, E46K, and A53T shown in Figure . Values are calculated from eq on the basis of the fitted values of all the thermodynamic parameters (Figure ). Results are sorted from the lowest (left) to the highest (right) ⟨w⟩. The color hue of the histogram bars is assigned according to the α-syn subclass (1A red, 3B green, 3C cyan, 4D blue, 4E magenta) and the darkness of the color increases with R, as described in the caption of Figure . Standard deviations are shown as black error bars. We consider first the panels relative to WT α-syn. The proximity of the bars with the same hue of color is immediately evident, as well as the similarity of their heights. This result deserves a more thorough consideration, bearing in mind that it has been obtained by analyzing our experimental SAXS data, at different c and T, and adopting an ensemble of conformers already filtered through the NMR experiments reported by Gurry et al.[45] We observe that WT α-syn molecules are mostly present as trimers (3B and 3C subclasses, green and cyan bars, respectively), closely followed by all the conformers in the monomeric state (1A, red bars) and with the remaining tetramers (4C and 4D, blue and magenta bars, respectively) in the last positions. Moreover, we see that the bars of trimers and monomers reach a value quite close to N–1 (dashed line in Figure , corresponding to the totally flat (unbiased) monomer population weight distribution), and the tetramers’ bars arrive at values slightly higher than N–2. Basically, despite these differences, the VBWSAS analysis of WT α-syn confirm that all the N = 189 conformers of Gurry et al.[45] significantly contribute to describing the conformational distribution. We also observed that, with increasing T, the heights of the bars get closer in value. The T effect, as well as the less pronounced concentration effect, are better visualized in Figure (left panel), which reports the T-trends of the subclasses of conformers ω (which is the sum of ⟨w⟩ for i belonging to the same subclass) at the three selected values of c. We see that, for WT α-syn (Figure , top left panel), the trends of the monomers subclass weights (ω1A, red curves) show a maximum at ≈42 °C, which depends on c going from ≈0.7 at 2 g/L to ≈0.3 at 10 g/L. We also see that by increasing T the ω3B weights (helical-rich trimers, green curves) decrease, whereas ω3C (strand-rich trimers, cyan curves) increases, and this effect has a direct correlation with protein concentration. Interestingly, at high concentration (5–10 g/L), a α/β transition is observed: up to ≈27 °C the most populated subclass is 3B (helical-rich trimers) and subsequently the 3C (strand-rich trimers) subclass, which reaches a maximum concentration at ≈35 °C. In Figure (top right panels), the most populated conformers at 10 g/L and for T = 25 °C and T = 37 °C are represented; we notice the proximity of the sequences of residues, belonging to the three different chains, associated with the formation of fibrils, as reported by Guerrero-Ferreira et al.,[65] suggesting that these trimeric conformers might be representative of those that trigger the nucleation in the fibrillation processes. A more detailed visualization of the most populated conformers, at 10 g/L and for T = 25 °C and T = 37 °C, is shown in Figure S5 of the Supporting Information. The obtained results, regarding the temperature effects, are in agreement with the increase of nucleation and growth of fibrils, as reported by Morris and Finke.[66] In terms of protein concentration effects, the results reported here confirm the aggregation propensity reported by many in vitro studies (reviewed in Plotegher et al.[42]). Finally, the effects reported are relevant for the pathogenesis of synucleinopathies, considering for instance that duplication and triplication of the gene encoding for α-syn and the associated increase α-syn concentration cause inherited forms of early PD onset.[67]

Figure 8

Left panels show temperature trends of the subclasses of conformers ω with m = 1A (monomer, red), m = 3B (helical-rich trimers, green), m = 3C (strand-rich trimers, cyan), m = 4D (helical-rich tetramers, blue), and m = 4E (strand-rich tetramers, magenta) calculated from the thermodynamic parameters found by the VBWSAS analysis of SAXS data for WT α-syn and the three mutants G51T, E46K, and A53T. Thin, regular, and thick lines refers to c = 2 g/L, c = 5 g/L, and c = 10 g/L, respectively. Right panels show the most populated conformers at c = 10 g/L, of all α-syn types apart from E46K. On the bottom left and bottom right of each conformer, the subclass and the obtained ⟨w⟩ (written up to the last significant digit) are reported, respectively. The eight sequences that, according to Guerrero-Ferreira et al.,[65] are forming the parallel β-strands in the fibrils (i.e., residues 42–46 (β1), 48–49 (β2), 52–57 (β3), 59–66 (β4), 69–72 (β5), 77–82 (β6), 89–92 (β7), and 94–102 (β8)) and that belong to the different monomers in the trimers and in the tetramers are shown with different colors. We turn now to describe the results obtained for the G51D α-syn mutant type. The average monomer population weights ⟨w⟩ are shown in Figure , second column of panels. The changes with respect to WT α-syn are evident: the trimers (green and cyan bars) go in the last positions, with values of ⟨w⟩ in the order of N–7, whereas tetramers (blue and magenta bars) and monomers (red bars) show values of ⟨w⟩ of the same order of magnitude. Moreover, by observing the top and the bottom panels (corresponding to the more distant conditions in terms of c and T), we see that at low concentration and temperature the monomers have the highest weights, whereas tetramers overcome monomers at the highest values of c and T. We also observed a more mixed situations in the intermediate panels, corresponding to the other combinations of c and T. The same behavior can also be detected in the correlation map of ⟨w⟩, calculated for the same c and T conditions, between WT and the G51D α-syn mutant type and reported in Figure S6 of the Supporting Information. This result is also confirmed in Figure , bottom left panel. The values of ω1A (red curves) are almost independent of T and change from ≈0.8 at 2 g/L to ≈0.4 at 10 g/L, and in parallel, the subclasses of conformers of the strand-rich tetramers, ω4E (cyan curves), arrives to ≈0.6 at 10 g/L, being ≈0.1 at 2 g/L. Figure (middle right panels) show the most populated conformers, at 10 g/L, for T = 25 °C and T = 37 °C, evidencing a possible role of compact helical-rich tetramers in the fibrils nucleation at high temperature. A completely different landscape has been defined by VBWSAS for the E46K mutant. The E46K panels of Figure provide a simple message; only the monomeric conformers (red bars) are significantly present in solution, independently on c or T. We see, in fact, that the tetramers ⟨w⟩ values (blue and magenta bars) are very low, around N–5, and those of the trimers even lower, ≈ N–7. This result is fully confirmed in the top right panel of Figure , where we just observed ω1A = 1. Looking carefully at the shades of the red bars in the E46K panels of Figure , it can be notice that, particularly at low c and high T, the brightest bars are in the first positions, suggesting the prevalence of monomers with the lowest gyration radii. A similar observation is confirmed in the E46K correlation maps of ⟨w⟩ shown in Figure S6 of the Supporting Information. Finally, we analyze results for the A53T α-syn mutant type. Corresponding panels of Figure show the most marked variability of the results in respect to c and T. We see, in fact, that at c = 2 g/L monomers (red bars) prevail, followed by tetramers (blue and magenta bars) and, to a lesser extent, by trimers (green and cyan bars), and this trend is reinforced with increasing T. The situation is less straightforward at c = 5 g/L, where the populations of monomers and tetramers are close. At c = 10 g/L, in particular at the highest temperatures, the prevalence of tetramers with respect to monomers is evident. It can be noticed that the values of ⟨w⟩ for trimers markedly increase with T, for any value of c. Once more, the bottom left A53T panel of Figure confirms this monomer–tetramer competition. Regarding the most populated conformers at 10 g/L for T = 25 °C and T = 37 °C (Figure right A53T panels), it emerges that at high T the predominant tetrameric helical-rich conformers are less compact than those of the G51D mutant under the same c and T conditions, with the sequences of residues responsible of the fibril formation (show in blue, red, green, and yellow colors) quite far apart. The A53T correlation maps of Figure S6 of the Supporting Information confirm this monomer–tetramer competition and add the information that strand-rich tetramers (magenta curves or symbols) dominate with respect to helical-rich tetramers (blue curves or symbols). These results are in agreement with the information available in the vast literature on the aggregation kinetics of the different mutants compared to WT.[44] Specifically, the G51D and A53T mutants, which, according to our results, show an increased proportion of β-strand multimeric species, have been reported to be more prone to aggregation than WT α-syn. In addition, it has been revealed that the E46K mutant shows a longer lag phase, suggesting that the nucleation centers that trigger the aggregation are scanty when compared to the other mutants; this observation is fully confirmed by the VBWSAS results on E46K, which indicate that only monomeric conformers have a significant population. Aggregation kinetics often present some variability and reproducibility issues that also depend on the method used to measure the process. Results obtained by using VBWSAS to weigh the ensemble of monomers and multimers for the three G51D, A53T, and E46K mutants under different concentration and temperature conditions may help in rationalizing experimental results previously found and in carefully planning new experiments. All previous evaluations about the properties of ⟨w⟩ for the investigated α-syn type reflect the behavior of form factors (eq ). However, SAXS curves also contain information about the structure factors, which are enclosed in the VBWSAS formalism (eq ). Fitting parameters related to the structure factors obtained by the analysis of all the experimental curves shown in Figure are detailed reported, as a function of T and for selected ranges of c, in Figure S8 of the Supporting Information. The high error bars obtained for most of these parameters suggest that the information content regarding protein–protein interaction, extracted from the whole q range of our experimental curves, should be considered quite low. Hence the physical interpretation of these parameters should be taken with a word of caution. For example, the mean radius of the monomer R1, calculated over all the c and T values of each α-syn type, results 24 ± 1 Å for WT, a value that decreases to 14 ± 1 Å for G51D and becomes much lower for E46K (8.1 ± 0.6 Å) and A53T (9 ± 1 Å). We recall that those values are subsequently multiplied by the average aggregation number ⟨m⟩ to get the radius of the hard-sphere term in the pair potential and the value ⟨m⟩, being a function of ⟨w⟩, change with c and T, as shown in Figure S9 of the Supporting Information. Considering the attractive potential at contact, written as J = J1 ⟨m⟩2/3, fitted values of J1 for WT α-syn and for the two mutants G51D and A53T are almost constant with c and T. Their mean values are 390 ± 40, 91 ± 7, and 110 ± 20 kJ/mol, respectively. The case of E46K is different; J1 changes from 500 ± 10 kJ/mol at low concentrations to 240 ± 80 kJ/mol at intermediate concentrations, suggesting that the E46K monomers experience a more complex network of interactions quite sensitive to c variations. An overall evaluation about the structure factors is provided in Figure , where both functions S(q) (solid lines) and S(q) (dashed lines) are reported for the four α-syn types at the intermediate c and for different T. We notice that for WT, G51D, and A53T types these functions slightly oscillate around 1, indicating a competition between attractive and repulsive forces, whereas the E46K type shows a different regime, with structure factors markedly higher than 1 at low q, indicating a prevalence of attractive forces. Moreover, it should be underlined that for both G51D and A53T α-syn types, S(q) is quite damped with respect to S(q), an effect due to the features of the coupling function βell(q) (eq ) that mainly depend on ⟨m⟩, always greater than 2 for G51D and A53T α-syn types, and the ellipsoid anisometry ν, which is ≈5 for both of them (Figure S8 of the Supporting Information). On the contrary, ν is ≈2 for WT and E46K α-syn types, and for the latter, the conformers are substantially monomers (Figure S9 of the Supporting Information). Hence, the effect of βell(q) is less marked, and no differences between S(q) and S(q) are observed.

Figure 9

Structure factors, S(q) (solid lines), and measured structure factors, S(q) (dashed line), (eq ) obtained by SAXS data analysis with VBWSAS for WT α-syn (c = 5.5 g/L) and three mutants G51T (c = 6.1 g/L), E46K (c = 6.0 g/L), and A53T (c = 5.4 g/L) at the three temperatures as shown in the legends. Detailed parameters are listed in Table S3 of the Supporting Information. A further indication of the different interaction regime for the E46K α-syn type is shown in Figure , which reports the trends of the pair interaction potential u(r) (solid lines) and its attractive (dashed lines) and repulsive (dotted lines) contributions, corresponding to the cases shown in Figure . These results suggest that the monomers of the E46K α-syn type may experience an overall isotropic attraction effect, probably due to the fact that their net number of electric charges, |Z1|, is ≈7, less than at least 2 units with respect to the other three α-syn types (Table S2, Supporting Information).

Figure 10

Pair interaction potentials, u(r) (solid lines), together with repulsive and attractive terms, u(r) (dashed lines) and u(r) (dotted lines), respectively, obtained by SAXS data analysis with VBWSAS for WT α-syn (c = 5.5 g/L) and three mutants G51T (c = 6.1 g/L), E46K (c = 6.0 g/L), and A53T (c = 5.4 g/L) at the three temperatures as shown in the legends. Interestingly, the peculiarity of the E46K mutant in terms of net charge impacts also its long term interactions in solution[43] and on the known structural properties of E46K fibrils.[68] Indeed, Ranjan and Kumar[43] showed, using solution NMR, that the substitution of the glutamic acid E46 with a positively charged lysine is the only mutation associated with pathology that present long-range contact rearrangements at the C-terminal of the protein. Coherently, E46K amyloid fibrils show the largest chemical shift perturbations as measured with solid state NMR. Therefore, the E46K mutation determines a substantial change in the fibril structure compared to WT α-syn and other pathological mutants studied by Tuttle et al.[68] The other important piece of information that can be drawn from the VBWSAS analysis of SAXS data presented here relates to the determination of the propensity of each a-residue of α-syn, in WT or mutant type, to populate the r-region of the Ramachandran map shown in Figure , as described in the Propensities section. In fact, this type of information allows one to understand how mutations in a single amino acid impact the propensity to form β-sheets and therefore alter the intermolecular and intramolecular interactions that govern the aggregation properties of the protein. According to eq , propensities are indeed functions of the average monomer population weights ⟨w⟩ and depend on the indexes r that show, for each g-chain of each i-conformer of the ensemble, the region of the Ramachandran map to which an a-residue belongs. To calculate these indexes, we have determined the ϕ, ψ angles of the residues with the pdbtorsions tool, from the BiopTools package,[69] applied to all PDB files of our ensemble. Propensities p(r) of the 140 residues of α-syn, in the WT and in the three mutant types, derived by the VBWSAS analysis of our SAXS data, have been calculated for three representative concentrations and three temperatures by using the average monomer population weight values reported in Figure . In order to highlight the effect of point mutations, we have considered the differences Δp(r) between the propensities of the mutant α-syn and the ones of the WT. Moreover, to better identify the role of the point mutation in promoting a significant change of the secondary structure of sufficiently long sequences of residues, we have established a simple criterion, as follows. We consider all the possible sequences of at least eight subsequent residues starting from the a1-residue and ending in the a2-residue, with a2 – a1 ≥ 7. Then, we check whether the value of Δp(r) of all residues a with a1 ≤ a ≤ a2 have the same positive or negative sign. If the check is validated, the mean change of propensity and the corresponding standard deviations are assigned to all the residues of the sequence a1 – a2, and these values are reported in the histogram. Otherwise, the histogram bars in the range a1 ≤ a ≤ a2 are fixed to 0. Hence, the lack of bars for a sequence is simply the result of Δp(r) with the opposite sign within that sequence. A length of eight residues segment was chosen because it approximately represents the average persistence length of a polypeptide.[70] Resulting histograms of the application of this criterion are reported in Figure for the two most significant regions, βfav (red histograms) and αRall (green histograms).

Figure 11

Mean propensity changes between mutant and WT α-syn reported as a function of the amino acid position a for the energetically favorable β-sheet region (βfav, red histograms) and the energetically favorable right α-helix region (αRfav, green histograms). See Figure for the definition of the Ramachandran plot regions. The NAC portion of the sequence is highlighted in light orange. The position of the point mutation is indicated by the vertical line. Concentration, c, is expressed in g/L. Looking at the G51D panels of Figure , a clear presence of a high red bar can be observed, corresponding to an approximate 0.2 increase of the βfav propensity. At c = 2 g/L, this bar is from residue 51 (where the mutation is, evidenced by a thin vertical line) to residue 64, just before the NAC sequence, shown in a light orange background. By increasing c at 25 °C, ⟨Δp(βfav)⟩ increases, and in the middle of the NAC sequence, a bar with negative ⟨Δp(βfav)⟩ appears, suggesting that in this part of the NAC sequence WT α-syn is more prone to form strands than the G51D type. Concomitantly, in the same sequences, the propensity of αRall decreases by approximately 0.2, suggesting a helical-to-sheet mechanism promoted by the G51D mutation. By increasing T, the height of the more intense positive bar decreases, and four more negative bars appear, two on the left and two on the right of the positive bar. At 45 °C, the positive bar is wider, ranging from residue 43 to residue 64. In summary, at the highest T, the change in the propensity going from the WT to G51D type, regarding the βfav region, is due to the increase of propensity in the sequence comprised between the residues 43 and 64 in the G51D type and a decrease of the propensity in sequences on the left of residue 43 and on the right of residue 64 and can be attributed to a more marked presence of strand-rich tetramers and a less marked presence of strand-rich trimers in the G51D α-syn type in respect to WT α-syn. The case of the E46K α-syn type is different and should be discussed by bearing in mind that mostly monomeric conformers are present in solution at any c and T, as determined by the analysis of ⟨w⟩ previously described. The propensity panels regarding E46K in Figure show the presence of a high red bar, in the region of βfav, close to the mutation position 46, from residue 51 to residue 65. For c = 2 g/L, this bar is more marked at 37 °C and disappears at 45 °C. At c = 5 g/L and, more clearly, at c = 10 g/L beside this bar, there is a small bar in the NAC sequence with a negative change of propensity. All these red bars compensate with the green bars, indicating, also for this mutant, a helical-to-strand mechanism. In summary, the E46K α-syn type proteins are painted as interacting monomers, and the monomers with highest ⟨w⟩ are the ones that show a higher βfav propensity in the sequence of residues 51–65. Finally, we look at the A53T panels of Figure , a α-syn mutant type that, according to the analysis of ⟨w⟩ previously discussed, is mainly constituted by monomeric (1A-subclass) and tetrameric strand-rich (4E-subclass) conformers. We see a positive red bar, indicating a positive change of propensity in the βfav region of the Ramachandran maps in a sequence close to the residue 53, where the mutation has occurred, ranging from residue 51 to residue 67. At 25 °C, this bar increases with c, and on the right, inside the NAC sequence, a negative bar is growing. At 37 °C and for c = 2 g/L and c = 5 g/L, the positive bar is wider, extending from residue 47 to residue 65, and other negative bars on the left and on the right of this sequence appear. At 37 °C and c = 10 g/L, the positive bar returns narrower, from residue 51 to residue 64. Passing at 45 °C, for any c, the positive bar remains from residue 51 to residue 64. Hence, if we consider that the wider the sequence with a positive change of βfav propensity is, the higher is the tendency of the mutant to trigger the cross-β nucleation process,[71] we can conclude that for the A53T α-syn mutant type, the most effective conditions occur at low concentration and 37 °C. This subtle effect could be due to the intricate interplay among the subclasses of conformers weights ω (Figure ) that show a maximum at around 37 °C for the A53T α-syn mutant type. For the sake of completeness, in Figures S10–S18 of the Supporting Information, the mean change of propensity for all eight regions of the Ramachandran map are reported.

Conclusions

The possible presence of folded α-syn tetramers in prefibrillar conditions, together with unfolded monomers, is an issue widely discussed.[8,46−48] Some experiments have shown that the detection of such tetramers depends not only on the chemical–physical conditions but also on the origin of α-syn, which can be produced using bacteria or isolated from mammalian cells as well as from red blood cells. But unfortunately, there is not a clear reproducibility of these results, and consensus on their interpretation has not been reached yet.[72] Regarding the use of SAXS techniques, coupled with proper ensembles of conformers, to investigate α-syn in conditions prodromic to formation of fibrils, most of the published results have only considered the radius of gyration calculated in a small range of q, typically by Guinier’s approximation.[8,10,73,74] In other cases, SAXS curves have been analyzed in the full range of q, but without considering the absolute calibration of data and so by finding an optimum scaling factor κ or flat background B.[31,39] We did not find studies that considered long-range interactions, which cause, possibly also at low concentration, a broad interference peak that could affect the Guinier’s region. To the best of our knowledge, this is the first time that an approach is proposed to explore the question of the possible species of α-syn oligomers present at the prefibrillar state. We have fully exploited the performances of one of the most advanced synchrotron SAXS instruments (BM29, ESRF, Grenoble), which allows a precise absolute scale calibration and a perfect buffer subtraction, avoiding the need to use nuisance parameters, such as κ and B, in fitting data. Accordingly, the results that we have obtained with the VBWSAS method take into account the modifications provided by c and T on the absolute scale in the entire q range, and the model we have applied includes the variations of the form factor, based on a thermodynamic scheme, and the ones of the structure factor, described by the well-established PY-RPA approximation. Indeed, our SAXS curves do suggest, before any interpretation, that some oligomeric forms of α-syn could be present in our samples. Despite the fact that the most updated protocols proposed methods to remove the oligomers,[72] they are the result of an equilibrium process so that oligomers are bound to be naturally present together with monomers. Results that we have obtained for WT α-syn partially contradict the ones derived by Gurry et al.[45]: the most prominent forms of α-syn are trimers, not tetramers. Among trimers, at low T, the most abundant are the helical-rich ones, whereas at high T they are the strand-rich ones. Considering the α-syn G51D type, which is one of the most aggressive mutants leading to the earliest onsets of PD disease,[75] our results indicate that strand-rich tetramers are the most abundant aggregated form of α-syn at all temperatures, whereas any trimeric form has a negligible population. This important result suggests the possibility that strand-rich tetramers can be the multimeric species that trigger the nucleation of fibrils or that these soluble multimers (or the larger ones that can form in the early stages of amyloid fibril formation) may be toxic species for the neurons. Results obtained for the α-syn A53T type, a mutant considered of clinical significance and widely studied,[14] also confirm the predominance of strand-rich tetramers that tends to diminish with T, suggesting that A53T may be less aggressive than G51D in the early onset of fibrillation thus providing further details on nucleation events that occur at the beginning of the aggregation process. The propensity analysis of both G51D and A53T, in comparison with WT α-syn, confirm these interpretations, indicating an increase of the propensity of the β-favored region in the portion of the sequence between the mutation point and the NAC portion. Completely different are the VBWSAS results for the E46K mutant, which is known to provoke small changes in the conformation by enhancing the contacts between N- and C-termini of α-syn.[14] Our VBWSAS analysis does confirm that this mutant is mostly present as monomeric conformers, with a preference for the ones with the smallest values of R. Moreover, such monomers show a long-range unspecific tendency to attract themselves. VBWSAS results also show that only at high T there is an increase of the propensity of the β-favored region. The higher compactness of the E46K monomers is also confirmed by the maps of Cα–Cα distances reported in Figure S6 of the Supporting Information, showing an average negative difference between pairs of Cα atoms between the E46K type and WT in the off diagonal regions. These results are in good agreement with the differences in the aggregation propensities and fibrils structure for the E46K mutant. Overall, the VBWSAS method applied to α-syn shows evidence that the different aggregation and toxicity behavior of the pathogenic mutants is likely to originate from the different disordered conformers that these protein species naturally populate in prefibrillar conditions. Our results may suggest that the structure of these conformers should be well characterized in order to understand how they contribute to the α-syn aggregation process in relationship to PD etiopathogenesis and features. Here, α-syn and its mutants have been used as a complex paradigm for IDPs, but the proposed VBWSAS approach can be easily extended to other IDPs whose behavior biochemistry is crucial for understanding the early molecular events that lead to IDP-related neurodegenerative diseases. In summary, we have shown that to disentangle conformational information out of a suitable IDP ensemble by means of SAXS data it is necessary to adopt a method with solid foundations from both statistical and thermodynamic points of view. VBWSAS can serve this aim. The VBWSAS software is available upon request.

71 in total

1. Predictive atomic resolution descriptions of intrinsically disordered hTau40 and α-synuclein in solution from NMR and small angle scattering.

Authors: Martin Schwalbe; Valéry Ozenne; Stefan Bibow; Mariusz Jaremko; Lukasz Jaremko; Michal Gajda; Malene Ringkjøbing Jensen; Jacek Biernat; Stefan Becker; Eckhard Mandelkow; Markus Zweckstetter; Martin Blackledge
Journal: Structure Date: 2013-12-19 Impact factor: 5.006

2. Comparative studies of disordered proteins with similar sequences: application to Aβ40 and Aβ42.

Authors: Charles K Fisher; Orly Ullman; Collin M Stultz
Journal: Biophys J Date: 2013-04-02 Impact factor: 4.033

3. Comprehensive structural and dynamical view of an unfolded protein from the combination of single-molecule FRET, NMR, and SAXS.

Authors: Mikayel Aznauryan; Leonildo Delgado; Andrea Soranno; Daniel Nettels; Jie-Rong Huang; Alexander M Labhardt; Stephan Grzesiek; Benjamin Schuler
Journal: Proc Natl Acad Sci U S A Date: 2016-08-26 Impact factor: 11.205

4. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors: W Kabsch; C Sander
Journal: Biopolymers Date: 1983-12 Impact factor: 2.505

5. Efficient construction of disordered protein ensembles in a Bayesian framework with optimal selection of conformations.

Authors: Charles K Fisher; Orly Ullman; Collin M Stultz
Journal: Pac Symp Biocomput Date: 2012

6. Brain alpha-synuclein accumulation in multiple system atrophy, Parkinson's disease and progressive supranuclear palsy: a comparative investigation.

Authors: Junchao Tong; Henry Wong; Mark Guttman; Lee C Ang; Lysia S Forno; Mitsunobu Shimadzu; Ali H Rajput; Manfred D Muenter; Stephen J Kish; Oleh Hornykiewicz; Yoshiaki Furukawa
Journal: Brain Date: 2009-11-10 Impact factor: 13.501

7. Modeling intrinsically disordered proteins with bayesian statistics.

Authors: Charles K Fisher; Austin Huang; Collin M Stultz
Journal: J Am Chem Soc Date: 2010-10-27 Impact factor: 15.419

8. α-Synuclein occurs physiologically as a helically folded tetramer that resists aggregation.

Authors: Tim Bartels; Joanna G Choi; Dennis J Selkoe
Journal: Nature Date: 2011-08-14 Impact factor: 49.962

9. Solid-state NMR structure of a pathogenic fibril of full-length human α-synuclein.

Authors: Marcus D Tuttle; Gemma Comellas; Andrew J Nieuwkoop; Dustin J Covell; Deborah A Berthold; Kathryn D Kloepper; Joseph M Courtney; Jae K Kim; Alexander M Barclay; Amy Kendall; William Wan; Gerald Stubbs; Charles D Schwieters; Virginia M Y Lee; Julia M George; Chad M Rienstra
Journal: Nat Struct Mol Biol Date: 2016-03-28 Impact factor: 15.369

10. Bayesian refinement of protein structures and ensembles against SAXS data using molecular dynamics.

Authors: Roman Shevchuk; Jochen S Hub
Journal: PLoS Comput Biol Date: 2017-10-18 Impact factor: 4.475

2 in total

Review 1. Implementing Complementary Approaches to Shape the Mechanism of α-Synuclein Oligomerization as a Model of Amyloid Aggregation.

Authors: Marco Giampà; María J Amundarain; Maria Georgina Herrera; Nicolò Tonali; Veronica I Dodero
Journal: Molecules Date: 2021-12-24 Impact factor: 4.411

2. SAXS Reveals the Stabilization Effects of Modified Sugars on Model Proteins.

Authors: Astra Piccinini; Eva C Lourenço; Osvaldo S Ascenso; Maria Rita Ventura; Heinz Amenitsch; Paolo Moretti; Paolo Mariani; Maria Grazia Ortore; Francesco Spinozzi
Journal: Life (Basel) Date: 2022-01-15

2 in total