Literature DB >> 35694516

Wide-Line NMR Melting Diagrams, Their Thermodynamic Interpretation, and Secondary Structure Predictions for A30P and E46K α-Synuclein.

Mónika Bokor¹, Eszter Házy², Ágnes Tantos².

Abstract

Parkinson's disease is thought to be caused by aggregation of the intrinsically disordered protein, α-synuclein. Two amyloidogenic variants, A30P, and E46K familial mutants were investigated by wide-line 1H NMR spectrometry as a completion of our earlier work on wild-type and A53T α-synuclein (Bokor M. et al. WT and A53T α-synuclein systems: melting diagram and its new interpretation. Int. J. Mol. Sci.2020, 21, 3997.). A monolayer of mobile water molecules hydrates A30P α-synuclein at the lowest potential barriers (temperatures), while E46K α-synuclein has here third as much mobile hydration, insufficient for functionality. According to wide-line 1H NMR results and secondary structure predictions, E46K α-synuclein is more compact than the A30P variant and they are more compact than the wild type (WT) and A53T variants. Linear hydration vs potential barrier sections of A30P α-synuclein shows one and E46K shows two slopes. The different slopes of the latter between potential barriers E a,1 and E a,2 reflect a change in water-protein interactions. The 31-32% homogeneous potential barrier distribution of the protein-water bonds refers to a non-negligible amount of secondary structures in all four α-synuclein variants. The secondary structures detected by wide-line 1H NMR are solvent-exposed α-helices, which are predicted by secondary structure models. β-sheets are only minor components of the protein structures as three- and eight-state predicted secondary structures are dominated by α-helices and coils.

Entities: Chemical

Year: 2022 PMID： 35694516 PMCID： PMC9178613 DOI： 10.1021/acsomega.2c00477

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

The intrinsically disordered proteins (IDPs) are unfolded under physiological conditions. While this feature is important for many physiological functions,[1] disordered sequences are often prone to aggregation and fibril formation with detrimental consequences.[2] Several neurodegenerative diseases are caused by the abnormal oligomerization and polymerization of proteins with disordered regions, including Alzheimer’s disease and Parkinson’s disease (PD). The exact molecular event that induces pathological aggregation of these proteins is still waiting to be elucidated,[3] but there are certain verified mutations that render disordered proteins more prone to aggregation than their wild-type counterparts. α-Synuclein (α-S) is an IDP under normal physiological conditions (it is the general consensus),[4] adopting random coil conformation.[5,6] Notwithstanding, there exist variations of wild-type (WT) α-S structural propensity (globular or extended). The suppositions about secondary and tertiary structures of α-S are highly controversial, including conflicting views in the literature.[7] The rapid interconversion between conformers impacting α-S and the different experimental methodologies used can be the causes of the controversies. α-S is coded by the SNCA gene, with several point mutations in the gene known to cause familial forms of PD, including A30P, A53T, and E46K mutations.[8] A30P mutation retards[9] the formation of both oligomers and fibrils and only this mutation affects the overall α-S structure.[7] The E46K mutant increases membrane affinity[10] and accelerates α-S aggregation and fibril formation.[11,12] The A53T mutant considerably accelerates α-S aggregation, and its fibril formation is faster than that of the wild-type (WT) α-S.[13−15] Different research groups came to different conclusions about α-S monomers having a globular-like structure or being extended random coils.[7] Earlier, the WT and A53T α-S variants were investigated in monomer, oligomer, and amyloid forms.[16] The monomers proved to be IDPs and more compact than random coils; about 32(3)% of their solvent-accessible surface is determined by the secondary structure. They are already functional at the lowest potential barriers with mobile hydration water: a monolayer of mobile hydration water is surrounding them. They realize all possible hydrogen bonds with the solvent water. Wide-line 1H NMR is an accepted method that can provide information on the location and structural environment of hydrogen atoms in proteins as it enables the direct observation of translational and rotational movements of molecules in the condensed phase. NMR characteristics of aqueous solutions rapidly frozen and then slowly thawed through equilibrium thermal states provide direct information on the immobile and partially or fully mobile parts of the molecules, yielding invaluable insight into the overall structure of the proteins. Here, our previous work on WT and A53T α-S was completed with new results on two other familial mutants, A30P and E46K α-S. We measured the melting diagrams (MDs, relative ratios of mobile water as a function of normalized functional temperature; see the Supporting Information of ref (17)) by wide-line NMR, i.e., the amount of mobile hydration water as a function of temperature, to get information on the steps and gradience in the development of full hydration of α-S. Secondary structure (SS) predictions on the same proteins were calculated and compared to the experimental results by wide-line NMR. These measurements were supplemented with secondary structure predictions.

Materials and Methods

Expression and purification of recombinant human A30P and E46K mutant variants of α-S in a pRK-172-based expression system were performed as described.[18] Briefly, expression of the proteins was performed in Escherichia coli Bl21(DE3) in a pT7-7-based expression system, after IPTG induction. Bacterial cell pellets were harvested by centrifugation and resuspended in 10 mM Tris–HCl, pH 8.0, 1 mM EDTA, and 1 mM cOmplete protease inhibitor cocktail. After cell lysis, streptomycin sulfate-precipitated DNA was removed by centrifugation and an ammonium sulfate precipitation step was performed to selectively precipitate the α-S protein. After centrifugation at 13,500g for 30 min at 4 °C, the pellet was dissolved in 10 mM Tris–HCl, pH 7.4, and 1 mM cOmplete, and filtered using a 0.2 μm mesh. The resulting solution was loaded onto a Resource Q anion exchange column on an Äkta Explorer chromatography system (GE Healthcare). The purity and integrity of the purified proteins were confirmed by SDS-PAGE (for representative gel pictures, see Supporting Figure 1). Peak fractions were collected and dialyzed against double-distilled water before lyophilization. In sample preparation, the mass of lyophilized protein (without any further refinement) was measured, and an appropriate amount of double-distilled water was added to obtain the nominal concentration of 50 mg/mL. Oligomers formed during the process were removed by filtering the solution through a 100 kDa membrane. All measurements were carried out on three identical samples prepared independently. The wide-line NMR approach we applied is detailed in ref (19). The beginning of the movement (rotation) of water molecules bound to the surface of the protein, the process that is considered melting, is followed by observing motional narrowing in wide-line 1H NMR spectroscopy. The motional narrowing is a useful criterion for mobility since dynamics of hydration water happens on a picosecond time scale and NMR is slower by an order of magnitude. Fundamental temperature is Tf = R·T and the normalized fundamental temperature scale is Tfn = R·T/(R·273.15) = T/273.15. The events of the beginning of molecular motion can be characterized on an energy scale by the application of Tfn. The number of water molecules in the first hydrate shell of the protein is nho and the total number of water molecules in the entire heterogeneous hydration region is nhe. Applying these definitions, the total number of hydrating water molecules at 0 °C or Tfn = 1 is (nho + nhe). The number of mobile water molecules per amino acid residue is indicated with naa. It can be calculated from the measured fraction of mobile protons in water n as naa = (n/2)·(Mr(protein)/Mr(H2O))/a, where a is the number of amino acid residues in the protein and Mr(i) is the relative formula mass of compound i. The amount of mobile hydration can be given also by the common measure of hydration (not the same as the term mobile hydration) as the mass of the solvent water divided by the mass of the solute protein, h = mwater/mprotein. In our measurements, it is given by the measured (by wide-line 1H NMR) fraction of mobile water n multiplied by the mass of water and divided by the mass of protein, hNMR = n·mwater/mprotein. Hydration and naa can be interrelated as *a. The melting curve can be formally described as a series expansion[20,21]where the summation is carried out up to the quadratic term. The cubic term, which was applied in ref (16), was unnecessary. Tfn0 is the lowest temperature where mobile water molecules are detected and Ea,0 (Tfn0) is the lowest potential barrier with mobile water molecules at the solvent-accessible surface. The parameter Tfn1 gives the temperature where the thermal trend of the MD switches between constant and linearly increasing. Likewise, the trend changes from linear to quadratic at Tfn2. Protein preparation and wide-line NMR measurements were described in former publications (ref (16) and the Supporting Information in ref (17)). The applied three-state SS prediction methods are Brewery, Jpred4, Porter 5.0, PSIPRED, PSRSM, RaptorX, SCRATCH, and SPIDER3. Further, eight-state predictions were made with the following methods only: Brewery, Porter 5.0, RaptorX, SCRATCH, and SPIDER3. Secondary structures made of a minimum of four amino acids consecutively are considered only. Brewery[22] (http://distilldeep.ucd.ie/brewery/) is the state-of-the-art predictor of protein structural annotations (secondary structure in three and eight classes). Brewery is based on ensembles of cascaded BRNNs (bidirectional recurrent neural networks) and Convolutional Neural Networks. JPred is a protein secondary structure prediction server and has been in operation since approximately 1998. JPred incorporates the Jnet algorithm to make predictions that are more accurate. JPred4[23] (http://www.compbio.dundee.ac.uk/jpred4/index.html) is its the current version. Porter 5.0 and Porter8 5.0[24,25] (http://distilldeep.ucd.ie/porter/) are servers for protein secondary structure prediction in three and eight classes based on ensembles of cascaded BRNNs and Convolutional Neural Networks. PSIPRED[26] (http://bioinf.cs.ucl.ac.uk/psipred/) is a simple and accurate secondary structure prediction method, incorporating two feed-forward neural networks, which perform an analysis on output obtained from PSI-BLAST (Position-Specific Iterated—BLAST). PSRSM[27] (http://qilubio.qlu.edu.cn:82/protein_PSRSM/default.aspx) uses methods based on data partitioning and the semirandom subspace method. RaptorX Property[28] (http://raptorx.uchicago.edu/) is a web server that predicts the structural properties of a protein sequence without using any templates. This server employs a powerful in-house deep-learning model, DeepCNF (Deep Convolutional Neural Fields), to predict the SS. SCRATCH[29] (http://scratch.proteomics.ics.uci.edu/) uses ensembles of bidirectional recurrent neural network architectures, PSI-BLAST-derived profiles, and a large nonredundant training set to derive two new predictors: (a) the second version of the SSpro program for secondary structure classification into three categories and (b) the first version of the SSpro8 program for secondary structure classification into the eight classes produced by the DSSP program. SPIDER3[30] (https://sparks-lab.org/server/spider3/) captures nonlocal interactions by long short-term memory bidirectional recurrent neural networks for improving the prediction of the protein secondary structure. IUPred2A[31−35] was used, which is a combined web interface that allows one to identify disordered protein regions using IUPred2 and disordered binding regions using ANCHOR2. Both IUPred2 and ANCHOR2 indicate a disordered region and a disordered binding site, respectively, with scores above 0.5.

Results

Direct Observations of Structural States through the Melting Diagrams

In an attempt to decipher the molecular background of the differences in the behavior of the two different α-S mutant variants, we conducted wide-line NMR measurements under identical conditions. In wide-line NMR, IDPs and globular proteins are easily distinguishable based on their MDs. While globular proteins show a plateau of a constant level of mobile hydration water throughout a relatively wide temperature range, disordered proteins are characterized by a constant growth in mobile water with increasing temperature.[19,20,36,37] This measurement enables us to directly detect the mobility of the hydration layer surrounding the proteins, providing important insight into their structural flexibility. The most informative values are: Tfn0, t0, or Ea,0. Tfn0 is the lowest functional temperature, where mobile hydration water appears; t0 is the same in degree Celsius units, and Ea,0 is the corresponding potential barrier value calculated by multiplying with the special heat of ice at 0 °C. The MDs of A30P and E46K α-S (Figure ) are characteristic of IDPs. They show the first constant amount of mobile hydration water at relatively high temperature/functional normalized temperature or potential barrier[19] (Tfn0, t0, or Ea,0, Table ) compared to globular proteins (see the Supporting Information of ref (17)), and they have an intensely elevating section from Ea,1 on, in contrast to the constant number of mobile water molecules per amino acid residue, naa (see Materials and Methods) values of globular proteins in the same temperature range.

Figure 1

Table 1

Parameter Values for the Polynomial Relation (eq Describes the Mobile Water Fraction, n)a

α-S variant	A30P	E46K
A = n(T_fn0) = n(T_fn1)b	0.0156(2)	0.0058(1)
naa(E_a,0) = naa(E_a,1)	0.90(1)	0.335(6)
h(E_a,0) = h(E_a,1)	0.313(4)	0.117(2)
B	0.24(1)	0.07(1), 0.29(2)c
C	5(1)·10¹	36(5)
T_fn0b	0.824(3)	0.828(3)
E_a,0/kJ mol^–1	4.95(2)	4.97(2)
t₀/°C	–48.2(8)	–47.1(8)
T_fn1, T_fn1^eff	0.879(2)	0.875(4), 0.856(2)
E_a,1, E_a,1^eff/kJ mol^–1	5.28(1)	5.26(2), 5.15(1)
t₁, t₁^eff/°C	–33.0(7)	–34(1), −39.4(6)
T_fn2	0.959(4)	0.959(2)
E_a,2/kJ mol^–1	5.76(2)	5.76(1)
t₂/°C	–11(1)	–11.2(5)
n(T_fn2)	0.0345(2)	0.030(2)
naa(E_a,2)	1.98(1)	1.7(1)
h(E_a,2)	0.689(4)	0.60(3)
n(T_fn = 1)	0.13(2)	0.102(2)
naa(E_a = 6.01 kJ mol^–1)	7(1)	5.9(9)
h(E_a = 6.01 kJ mol^–1)	2.6(4)	2.0(3)

Error in the last digit is given in parentheses. n(i) values are given for a 50 mg/mL protein concentration.

Tfn0eff in the case of E46K.

Below and above Tfn = 0.903, respectively.

Melting diagram of A30P (red) and E46K (blue) α-synuclein. Trend changing points (Ea,i) are indicated. Lines are fitted eq , and the parameters are given in Table . Inserted graph: hydration of the globular proteins, BSA (green squares), β-casein (violet down triangles), and lysozyme (gold left triangles) are plotted on the insert. Error in the last digit is given in parentheses. n(i) values are given for a 50 mg/mL protein concentration. Tfn0eff in the case of E46K. Below and above Tfn = 0.903, respectively. If we want to understand the differences in the behavior of the two mutant proteins, the best way is to create a differential melting diagram (DMD) by plotting the differential values calculated from the MDs against functional temperature (Δn/ΔTfn). The trend changes are more striking on the DMDs. There is a short constant naa (Figure ) or zero Δn/ΔTfn (Figure ) section between Ea,0/Ea,0eff and Ea,1 (Table ) for A30P/E46K. On a microscopic scale, it reflects the presence of naa = 0.90(1) mobile hydration with a 4.95(2) kJ mol–1 potential barrier regarding the motion of water molecules for A30P. This amount of mobile water corresponds to hydration (g water/g solute) h = 0.313(4), which is sufficient for A30P α-S to be active[38,39] and equals to approximately monolayer hydration[38] with 126(1) H2O/protein. Additional hydration water molecules become mobile at Ea,1, and the amount of mobile hydration increases linearly (constant Δn/ΔTfn) to reach 277(1) H2O/protein at Ea,2 (Table ). The increase becomes quadratic (linear Δn/ΔTfn) at Ea,2.

Figure 2

Differential form of the melting diagram of A30P (red) and E46K (blue) α-synuclein. The differentials (Δn/ΔTfn) were calculated from the fitted curves shown in Figure .

Differential form of the melting diagram of A30P (red) and E46K (blue) α-synuclein. The differentials (Δn/ΔTfn) were calculated from the fitted curves shown in Figure . E46K α-S differs from A30P α-S mostly in the low potential barrier region (Figures and 2) and behaves unexpectedly at low potential barrier values. The first mobile hydration water molecules for E46K α-S were detected at a low potential barrier value (Ea = 4.7(1) kJ mol–1), that is, at a low temperature value compared to the other α-S variants (Figure and ref (16)). This potential barrier is as low as in the case of globular proteins (inserted graph in Figure ).[19,20,36] The amount of mobile hydration water molecules increases initially, right after the appearance of the first amount of mobile water and before the constant section between Ea,0eff = 4.97 kJ mol–1 and Ea,1 = 5.26(2) kJ mol–1. The gradual growth of the mobile hydration below Ea,0eff indicates the lack of a step-like change in the potential barriers at Ea,0eff and shows their broad distribution there. The solvent-accessible surface of E46K α-S is very heterogeneous regarding protein–water interactions in this potential barrier section. The plateau (constant naavsTfn) of E46K has a mobile hydration value of naa = 0.335(6) or h = 0.117(2), which means 46.9(9) H2O/protein, and ends at Ea,1 = 5.26(2) kJ mol–1 (Table ). There is a step, a jump in the magnitude of naa, in the MD of E46K at Ea,1 (Figure ), which corresponds to a spike in the differential form of the MD (DMD, Figure ). This is followed by a linearly growing section of MD, but with two different regions with different slopes (B parameters, Table ). This corresponds to constant sections with different magnitudes in the DMD (Figure ). The slope of the MD changes at Ea = 5.42 kJ mol–1. The slope change refers to a change in the interaction between water and E46K α-S at the slope change, i.e. at Ea = 5.42(6) kJ mol–1. Different types of water–protein interactions are active below and above the slope change. The hydration at the change is naa = 0.8(1) or h = 0.27(5) or 2.2(4)·10–2 H2O/protein. This value is approximately equal to the hydration in the first hydration layer of a protein[38,39] and the difference between it and h(Ea,2) is h = 0.33(8) or 2.7(6)·10–2 H2O/protein. That is, new types of water molecules in their interactions with proteins become mobile at the change additional to the first hydration layer, which becomes built up until the change. The two mutant α-S variants have nearly parallel MDs at Ea,1 ≤ Ea ≤ 6.01 kJ mol–1 with the A30P α-S having higher mobile hydration (Figure ). The hydration of A30P α-S is greater by naa = 0.35(2) on average than the hydration of E46K α-S between Ea,1eff and Ea = 6.01 kJ mol–1. The hho and hhe values, i.e., the homogeneously and the heterogeneously bound mobile hydration water amounts for A30P and E46K α-S (Table ), are markedly lower than in WT and A53T variants (hho = 0.44(8) and hhe = 2.8(2), on the average).[16] The A30P and E46K α-S mutants reach their highest hydration level at the melting point of bulk water with an average of h = 2.2(3), which corresponds to 9(1)·102 H2O/protein. The WT and A53T α-S variants have higher hydration at this point (h = 3.31(7) or 1.32(2)·103 H2O/protein on the average).[16]

Table 2

Dynamic Parameters from the Polynomial Relation Describing the Melting Diagramsa

α-S variant	A30P	E46K	A53T	WT
n_ho	0.0156(2)	0.0058(1)	0.22(4)	0.22(4)
naa_ho	0.90(1)	0.335(6)	2.5(5)	2.5(5)
h_ho	0.313(4)	0.117(2)	0.44(8)	0.44(8)
n_he	0.11(2)	0.10(2)	0.18(6)	0.142(9)
naa_he	7(1)	5.5(9)	23(6)	16(1)
h_he	2.3(4)	1.9(3)	4(1)	2.8(2)
HeR = (1 – T_fn1)/(1 – T_fn0)	0.68(2)	0.72(3)	0.65(4)	0.70(4)
HeR_n = n_he/(n_ho + n_he)	0.9(1)	0.9(2)	0.89(3)	0.87(3)
HeM = (B + 2C)/(1 – T_fn1)	8(1)·10²	6(1)·10²	1.30(6)·10³	9.8(5)·10²

For detailed definitions of the parameters, see ref (16). nho = A is the mobile water fraction bound homogeneously and nhe = n(Tfn = 1) – nho is the mobile water fraction bounded heterogeneously. The error in the last digit is given in parenthesis. Protein molecules can be characterized and categorized by the homogeneity/heterogeneity of the energy distribution of water binding. This ratio is measured and the defining relation is HeR = (1 – Tfn1)/(1 – Tfn0), in which (1 – Tfn1) and (1 – Tfn0) give the measured distances from the melting point of ice. The observed heterogeneity ratio for A30P and E46K is HeR = 0.69(2) (Table ), which corresponds to 31(2)% homogeneous potential barrier distribution. This distribution width is equal within experimental error with that of WT and A53T α-S, being 33(4)%.[16] These homogeneity ratios agree with α-Ss being more compact than it is expected for a random coil state,[40,41]i.e., these proteins have a non-negligible extent of secondary structures. The ratio of the amount of heterogeneously bound water to the total number of bound water (heterogeneously plus homogeneously bound) is HeR = nhe/(nhe + nho) = 0.9(1) for both A30P and E46K variants, approximately as high as HeR = 0.88(1) for WT and A53T variants. The HeR values indicate high heterogeneity of the bonds formed by the α-Ss. The measure of heterogeneity, HeM = (B + 2C)/(1 – Tfn1), characterizes the degree of heterogeneity of the protein–water interactions close to 0 °C or Ea = 6.01 kJ mol–1. The A30P and the E46K variants have the same HeM values within experimental error, and these values are significantly smaller than those of the WT (9.8(5)·102) and the A53T (1.03(6)·103) variants.[16] The increase in the hydrations of mutants presented in this work is less intense than in the cases of WT and A53T α-Ss, HeM(A53T) > HeM(WT) > HeM(A30P) ≈ HeM(E46K).

In Silico Analysis of Secondary Structures

To relate our data from the wide-line NMR results to the disorder and secondary structure content of the two protein variants, we used different in silico structure prediction methods. Disorder content was predicted with IUPred2,[31,32] which results in a probability value for each residue of being part of a disordered region, with values above 0.5 indicating disorder tendency. The closer the value is to 1, the higher the probability is of a structurally disordered state at the given position. The average disorder probability scores for the whole length are 0.54(2) for A30P, 0.52(2) for E46K α-S (Figure ), and 0.53(2) for WT. Between residues 110 to 140, the scores are 0.823(7) and 0.805(8) for A30P and E46K, respectively, while it is 0.819(8) for WT. The first 13 amino acid residues can be considered ordered with a disorder score of 0.29(1) for A30P and 0.27(1) for E46K. This value is 0.27(1) for WT, which equals the latter. The IUPred2A curves of A310P and E46K mutants (Figure ) coincide with that of WT α-S,[16] the only exception is A30P between positions 19 and 41, where it shows a higher disorder tendency than the WT and E46K. The ANCHOR2 curve of WT α-S coincides with that of the E46K mutant through the whole protein length and there are only small differences within the first 30 amino acid residues of the A30P mutant (Figure ). The first 82 residues do not form a disordered binding site (average score of 0.428(2) WT α-S: 4.424(1)) but the last 30 residues at the C terminus show a strong indication for protein binding (score of 0.914(9) WT α-S: 0.926(8)). Residues 100–110 form a transitional region between the two.

Figure 3

Prediction of protein disorder and disordered binding sites for A30P and E46K α-synuclein by the IUPred2 (A30P cyan, E46K blue solid lines) and ANCHOR2 (A30P dark green, E46K green solid lines) programs. The results for WT α-synuclein are also given for comparison (IUPred2 orange, ANCHOR2 red dashed lines). A score above 0.5 predicts protein disorder or disordered biding site. Since IUPred provides information on the disorder tendency of a protein but not on the structural propensities of the ordered regions, we also applied in silico algorithms to analyze the secondary structure content of the studied proteins. In the SS predictions, the determinant motifs are coils, helices, and β-sheets according to three- and eight-state prediction methods. Protein secondary structures are traditionally characterized as three general states: helix (H), strand (E), and coil (C). From these general three states, the DSSP[42] program proposed a finer characterization of the secondary structures by extending the three states into eight states: helix (G), α-helix (H), π-helix (I), β-strand (E), bridge (B), turn (T), bend (S), and others (L). These eight secondary structure states are often mapped into the following three states. H: α-helix, which corresponds to the right- or left-handed cylindrical/helical conformations that include G, H, and I states. E: β-strand, which corresponds to pleated sheet structures that encompass E and B states. C: coil, which covers the remaining S, T, and L states. The state-of-the-art methods are currently reaching almost 88% for a three-class prediction and 76.5% for an eight-class prediction.[43] For the α-S variants (Figure ), the three- and eight-state methods indicate the coil and the helix to be the most dominant.

Figure 4

Predicted three- (A) and eight-state (B) secondary structures for A30P (black circles), E46K (red diamonds), and WT (gold triangles) α-synuclein. The average structure of the eight modeling programs for three-state methods (Brewery, Jpred4, Porter 5.0, PSIPRED, PSRSM, RaptorX, SCRATCH, and SPIDER3) and five modeling programs for eight-state methods (Brewery, Porter 5.0, RaptorX, SCRATCH, and SPIDER3) are shown. Three-state predictions use the average of the results of eight different SS prediction methods (Brewery, JPred4, Porter 5.0, PSIPRED, PSRSM, RaptorX, SCRATCH, SPIDER3). The three-state SS prediction of A30P, E46K, and WT α-Ss shows that α-helices and coils are the main SS elements. α-helices form in 47–48% of the protein chain and coils give 39%, while β-sheets are only present in 9%. The β-sheet content is minimal in WT α-S (4 amino acid residues long) but it is three times greater in the A30P (13 residues) and in the E46K (12 residues) mutants. The eight-state SS prediction consists of the average of five modeling methods (Brewery, Porter 5.0, RaptorX, SCRATCH, SPIDER3). According to this prediction, A30P and E46K mutants contain two longer α-helices (46 and 51% of the whole length, respectively), while the WT version has more pieces of shorter α-helix type sections (43% altogether). The WT α-S in summary shows a longer β-sheet SS (13%) than the two mutants (E46K 7%) and A30P has the shortest (5%) such section.

Discussion

Structure of the α-Synuclein Mutants

Approximately, one monolayer of mobile water molecules hydrates A30P α-S at the lowest potential barriers (h = 0.313(4)). E46K has h = 0.117(2) mobile hydration here, which is almost a third of the former and is not enough for the functionality of E46K α-S since a protein needs at least h = 0.2 to be functional.[38,44] The lower initial and then overall hydration can be a result of E46K α-S being more compact than A30P α-S. It appears that every third or fourth hydration site of A30P α-S would be occupied in E46K α-S. The linear hydration section between Ea,1 and Ea,2 for A30P α-S can be described with one slope, while the same section for E46K α-S has two distinct sections with different slopes. The type of the protein–water interaction changes where the slope changes for E46K at Ea = 5.43(1). naa or h increases only slightly before the change; it is almost constant here, and after the change, naa or h increases even more rapidly than A30P α-S. The hydration increases from Ea,1(eff) to Ea,2 are ΔhA30P = 0.48(3) and ΔhE46K = 0.33(8). This amount of increase suggests a newer layer of mobile hydration to build up.[38] The rate of expansion of the hydration layers is the greatest at potential barriers greater than Ea,2. The difference in the hydration of A30P and E46K α-Ss in favor of A30P is indicative again of a more compact structure of E46K α-S. The α-S mutants A30P and E46K have markedly lower hydration (h(Ea = 6.01 kJ mol–1), Table ) than the mutant A53T (hA53T = 4.0(2)) and the WT (hWT = 3.3(2)) α-S at the melting point of bulk water (Ea = 6.01 kJ mol–1). It can be deduced from these data that E46K α-S is the most compact structurally with the smallest solvent-accessible surface and A53T α-S has the most open structure. A recent cryoelectron microscopy study revealed that the E46K mutant α-S forms structurally distinct, more compact amyloid fibrils than the wild type.[45] The authors attributed this feature to a misfolding pathway of the mutant, where the salt bridge between E46 and K80 is disrupted by the electrostatic repulsion in the mutant fibril. This altered structural tendency might be reflected already in the monomer form, as indicated by our measurements. The homogeneously and the heterogeneously bound mobile hydration water amounts for A30P and E46K α-S (hho and hhe, Table ) also show that the WT and A53T α-Ss are more open structurally than the present mutants. The heterogeneity ratio, HeR, from the dynamic MD parameters shows non-negligible secondary structures in A30P and E46K α-S by the 31(2) % homogeneous potential barrier distribution of the protein–water bonds.

Comparison of the Measured and Predicted Structures

α-S has been mainly considered to contain α-helices with a small number of isolated β-sheets[46−49] or to be an α + β protein,[50,51] but the prediction of an all-β structure with some peripheral small α-helices for α-S is also a valid possibility.[52] Wide-line NMR measurements provide valuable information on the overall solvent accessibility and structural states of the proteins studied but no detailed structural information. Combining our measured data with structure predictions enables us to understand and interpret the structural and physiological consequences of the studied mutations. IUPred2A[31,32] is a combined web interface that allows one to identify disordered protein regions using IUPred2 and disordered binding regions using ANCHOR2. The algorithm identifies disordered protein regions and it is found that A30P α-S has longer such regions than E46K α-S. Based on the IUPred results, the disorder tendencies of WT, A30P, and E46K α-Ss are very close to each other, but A30P α-S has a more open structure than E46K α-S. ANCHOR2 predicts regions that undergo a disorder-to-order transition upon binding to another protein. Based on ANCHOR2 results, A30P and E46K, just as WT α-S, have a disordered binding site spanning 30 residues at their C terminus and a transitional but binding region of 10 residues before it. These results agree with the finding that A30P and E46K α-Ss are also IDPs, as seen by wide-line 1H NMR. More precisely, 70(3) % of their solvent-accessible surface is heterogeneous/disordered (Table ). Even though the majority of the proteins appears to be in a solvent-accessible, disordered state, they also contain a significant amount of regions with secondary structures. The three- and eight-state-predicted SSs of the here studied α-S mutants are dominated by α-helices and coils and β-sheets are only minor components of the structures. From this, we can deduce that the secondary structures detected by wide-line NMR are solvent-exposed α-helices in these proteins. The predicted SSs for the three α-S variants (WT, A30P, E46K) are very similar to each other, with the biggest difference being in the extent of β-sheets in the three-state predictions. This method predicts almost no β-sheets in the WT α-S but a markedly increased β-sheet content for the two mutants (Figure A). On the contrary, the eight-state predictions forecast the largest β-sheet content for WT α-S, which is in accordance with our earlier secondary structure calculations.[17] At the sites of the mutations, the predictions show no special features, while the disorder prediction clearly showed an increased disorder tendency of the A30P variant around the mutation site. A comparison of these predictions with the results of the wide-line NMR reveals that the measured structural states are clearly different from the predicted ones. The mutations induce measurable changes in the secondary structure content of the protein, resulting in alterations of the overall structures in the case of the mutants. This observation also highlights the limits of structure prediction algorithms in detecting structural changes caused by single amino acid changes.

47 in total

1. Comparative investigations of biopolymer hydration by physicochemical and modeling techniques.

Authors: H Durchschlag; P Zipper
Journal: Biophys Chem Date: 2001-11-28 Impact factor: 2.352

2. The alpha-synuclein mutation E46K promotes aggregation in cultured cells.

Authors: Neeraj Pandey; Robert E Schmidt; James E Galvin
Journal: Exp Neurol Date: 2005-12-01 Impact factor: 5.330

3. Analyzing Protein Disorder with IUPred2A.

Authors: Gábor Erdős; Zsuzsanna Dosztányi
Journal: Curr Protoc Bioinformatics Date: 2020-06

4. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors: W Kabsch; C Sander
Journal: Biopolymers Date: 1983-12 Impact factor: 2.505

5. Molecular Simulations Reveal Terminal Group Mediated Stabilization of Helical Conformers in Both Amyloid-β42 and α-Synuclein.

Authors: Shayon Bhattacharya; Liang Xu; Damien Thompson
Journal: ACS Chem Neurosci Date: 2019-04-05 Impact factor: 4.418

6. Water-protein interactions from high-resolution protein crystallography.

Authors: Masayoshi Nakasako
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2004-08-29 Impact factor: 6.237

7. High-speed atomic force microscopy reveals structural dynamics of α-synuclein monomers and dimers.

Authors: Yuliang Zhang; Mohtadin Hashemi; Zhengjian Lv; Benfeard Williams; Konstantin I Popov; Nikolay V Dokholyan; Yuri L Lyubchenko
Journal: J Chem Phys Date: 2018-03-28 Impact factor: 3.488

8. JPred4: a protein secondary structure prediction server.

Authors: Alexey Drozdetskiy; Christian Cole; James Procter; Geoffrey J Barton
Journal: Nucleic Acids Res Date: 2015-04-16 Impact factor: 16.971

9. Conformational ensemble of native α-synuclein in solution as determined by short-distance crosslinking constraint-guided discrete molecular dynamics simulations.

Authors: Nicholas I Brodie; Konstantin I Popov; Evgeniy V Petrotchenko; Nikolay V Dokholyan; Christoph H Borchers
Journal: PLoS Comput Biol Date: 2019-03-27 Impact factor: 4.475

10. Prediction of protein binding regions in disordered proteins.

Authors: Bálint Mészáros; István Simon; Zsuzsanna Dosztányi
Journal: PLoS Comput Biol Date: 2009-05-01 Impact factor: 4.475