Eric Fagerberg1, Samuel Lenton2, Tommy Nylander2, Tilo Seydel3, Marie Skepö1,4. 1. Theoretical Chemistry, Lund University, POB 124, SE-221 00 Lund, Sweden. 2. Physical Chemistry, Lund University, POB 124, SE-221 00 Lund, Sweden. 3. Institut Max von Laue - Paul Langevin, 71 avenue des Martyrs, CS 20156, F-38042 Grenoble, France. 4. LINXS - Lund Institute of Advanced Neutron and X-ray Science, Scheelevägen 19, SE-223 70 Lund, Sweden.
Abstract
Intrinsically disordered proteins (IDPs) are proteins that, in comparison with globular/structured proteins, lack a distinct tertiary structure. Here, we use the model IDP, Histatin 5, for studying its dynamical properties under self-crowding conditions with quasi-elastic neutron scattering in combination with full atomistic molecular dynamics (MD) simulations. The aim is to determine the effects of crowding on the center-of-mass diffusion as well as the internal diffusive behavior. The diffusion was found to decrease significantly, which we hypothesize can be attributed to some degree of aggregation at higher protein concentrations, (≥100 mg/mL), as indicated by recent small-angle X-ray scattering studies. Temperature effects are also considered and found to, largely, follow Stokes-Einstein behavior. Simple geometric considerations fail to accurately predict the rates of diffusion, while simulations show semiquantitative agreement with experiments, dependent on assumptions of the ratio between translational and rotational diffusion. A scaling law that previously was found to successfully describe the behavior of globular proteins was found to be inadequate for the IDP, Histatin 5. Analysis of the MD simulations show that the width of the distribution with respect to diffusion is not a simplistic mirroring of the distribution of radius of gyration, hence, displaying the particular features of IDPs that need to be accounted for.
Intrinsically disordered proteins (IDPs) are proteins that, in comparison with globular/structured proteins, lack a distinct tertiary structure. Here, we use the model IDP, Histatin 5, for studying its dynamical properties under self-crowding conditions with quasi-elastic neutron scattering in combination with full atomistic molecular dynamics (MD) simulations. The aim is to determine the effects of crowding on the center-of-mass diffusion as well as the internal diffusive behavior. The diffusion was found to decrease significantly, which we hypothesize can be attributed to some degree of aggregation at higher protein concentrations, (≥100 mg/mL), as indicated by recent small-angle X-ray scattering studies. Temperature effects are also considered and found to, largely, follow Stokes-Einstein behavior. Simple geometric considerations fail to accurately predict the rates of diffusion, while simulations show semiquantitative agreement with experiments, dependent on assumptions of the ratio between translational and rotational diffusion. A scaling law that previously was found to successfully describe the behavior of globular proteins was found to be inadequate for the IDP, Histatin 5. Analysis of the MD simulations show that the width of the distribution with respect to diffusion is not a simplistic mirroring of the distribution of radius of gyration, hence, displaying the particular features of IDPs that need to be accounted for.
In
contrast to globular proteins, intrinsically disordered proteins
(IDPs) lack a well-defined three-dimensional structure, instead they
adopt an ensemble of conformers in solution.[1] Consequently, IDPs can rapidly sample a large volume of conformational
space.[2] This innate flexibility, combined
with the ability of IDPs to bind with high specificity, allows a single
IDP to regulate a range of biological functions.[3]The precise nature of the conformational ensembles
adopted by IDPs
depend on a variety of conditions including, for example, temperature,
ionic strength, and presence of binding partners.[4,5] One
condition, often neglected by experimental studies, is the effect
of crowding on the dynamical properties of IDPs, and how these effects
relate to the protein function. Determining the dynamical properties
under crowded conditions is pertinent due to the high intracellular
concentration of macromolecules, which can reach up to 400 mg/mL.[6,7] At these concentrations, it is expected that protein–protein
interactions, as excluded volume effects and electrostatic interactions,
impact not only the conformational ensemble,[8−10] but also restrict
the ability of IDPs to diffuse throughout the crowded intracellular
milieu.[11]Both these factors have
important implications for how the functions
of IDPs are regulated intracellularly. For example, a reduction in
diffusion caused by macromolecular crowding could provide spatial
means of controlling IDP interactions, while reducing the flexibility
of the conformational ensemble may restrict the rate at which IDPs
can interact with other macromolecules.[12] In order to study the diffusive properties of IDPs under crowded
conditions, appropriate time and length scales must be considered.
The diffusive dynamics of IDPs take place on hierarchical time and
length-scales covering a range from picoseconds to hours and from
ångströms to micrometers.[13]Due to both the high concentrations of proteins required to
accurately
represent intracellular conditions, and the relatively small spatial
and temporal scales on which the diffusive motions of IDPs take place,
studying these properties experimentally is not straightforward. Measurements
of the translational diffusion of an IDP and a globular protein in
crowded environments have been completed by NMR on relatively long
time-scales. Wang et al. determined that under crowded conditions,
the IDP had faster translational diffusion compared to the globular
protein, while under dilute conditions the opposite was observed.[14]Similarly a single-molecule experiment,
completed in cells, has
shown that the diffusion of an IDP is faster than a globular protein
(while both have significantly reduced diffusion) upon crowding and
that this effect is largely based on the size of the crowding molecule.[15] Despite the insights provided by such experimental
studies, the nanosecond time scale of translational diffusion is difficult
to access, leaving this time window of IDP diffusion under crowded
conditions largely unexplored.In recent years, significant
advancements have been made in the
instrumentation of high-resolution neutron spectroscopy, increasing
the possibilities for studying biological samples using this technique.
Parallel developments in the analysis methods used to deconvolute
the resulting experimental data means that it is now possible to simultaneously
measure the center-of-mass (COM) and their superimposed internal diffusive
motions of proteins in solution, on the nanosecond time scale, and
the ångström length scale. In contrast to, for example,
fluorescence spectroscopy, these methods have the advantages of being
label-free, having access to shorter time scales and allowing for
the gathering of simultaneous spatial and dynamical information.[16] Neutron spectroscopy also complements NMR that
accesses angular correlations and generally longer times scales.[17] Moreover, neutron techniques provide access
to opaque samples such as highly turbid protein solutions that are
difficult to measure by, e.g., light scattering. Numerous studies
investigating the diffusion of globular proteins in solution using
these novel neutron methods have been published during the last few
years.[18−22] Similar studies of IDPs under crowded conditions are, however, comparatively
rare.[23]On the nanosecond observation
time scale, and nanometer observation
length scale of high-resolution quasi-elastic neutron spectroscopy
(QENS), the observable COM diffusion coefficient D = D(D, D) consists of contributions
from both rotational D and translational D diffusion, and corresponds to the so-called short-time diffusion
in terms of the physics of colloidal hard sphere suspensions.[19,24] When native proteins rich in hydrogen atoms are suspended in a deuterated
solvent, the measured spectra from the proteins possess the information
on the self-, or, synonymously, tracer diffusion of these proteins
due to the prevailing incoherent scattering from the proteins. On
the diffusive short-time scale, the proteins diffuse on average by
only a small fraction of their hydrodynamic radius. Therefore, their
diffusion is governed by hydrodynamic interactions, while direct interactions,
i.e., collisions, can be neglected. An advantage of probing self-diffusion
consists in an unambiguous access to the hydrodynamic size of the
diffusing assembly, since no effect from the static structure needs
to be taken into account.[22,25] Moreover, the prevailing
incoherent scattering allows in principle to determine the elastic
incoherent structure factors (EISF) of the proteins in solution, which
provides information on the geometry of the confinement of the internal
diffusive motions.[25,26] For dilute protein solutions,
the EISF can be measured only since recently, employing spectrometers
with the highest neutron beam brightness and signal-to-noise ratio,
while previous neutron spectroscopy studies have already explored
IDPs and globular proteins in hydrated powder states.[27−29] A challenge for IDPs, as opposed to, e.g., globular proteins with
a stable structure, is due to their fluctuating shape which may result
in a fluctuating hydrodynamic radius as well as possibly a fluctuating
EISF. Moreover, the possibly more open average shape of IDPs may render
any picture of compact colloidal objects insufficient to describe
IDPs.Here, we employ high-resolution neutron backscattering
spectroscopy
accessing high momentum transfers typically within 0.2 Å–1 < q < 2.0
Å–1 to probe the self-diffusion of the extensively
studied, relatively short, IDP, Histatin 5 (Hst5).[30−34] We first discuss the experimental results from high-resolution
QENS on aqueous solutions of Hst5 in terms of the established models
for well-folded proteins with a compact shape.[16,35] Hst5 has been well-investigated in terms of structure with SAXS,[36,37] NMR,[38,39] and circular dichroism,[40,41] including investigations on the effect of temperature, crowding,
and to limited extent salt, often combined with simulation to further
interpret results or benchmark simulation models.[42−46]Thereafter we compare the experimental results
with full atomistic
molecular dynamics simulations from which both the COM diffusive dynamics
and the EISF are obtained. Based on this comparison, we discuss the
significant impact of the fluctuating shape of the IDPs on the nanosecond
observation time scale of our experiment, and comparisons are made
with globular proteins.
Methods
Sample
Preparation
Hst5 was purchased
from Genemed Synthesis, Inc. (San Antonio, USA) and TAG Copenhagen
A/S (Copenhagen, Denmark). In case of nondialyzed samples, the protein
was used directly as obtained. The protein concentration was determined
with a Thermo Scientific Nanodrop OneC UV–vis spectrophotometer
using an extinction coefficient of ϵ = 2560 M–1 cm–1, and molecular weight of 3.036 kDa. The buffer
used contained D2O with 20 mM Tris, with either 150 mM
NaCl or nominally 10 mM NaCl (no salt was explicitly added; only a
small amount of sodium hydroxide required to set the pH to 7 contributes
to the salt content in this case). In case of the dialyzed samples,
6–8 cm long pieces of 16 mm flat width, 500–1000 MWCO
membranes (SpectrumLabs, Piraeus, Greece), were used to dialyze the
protein. The protein powder was taken from the can and dissolved in
Milli-Q water, and exhaustively dialyzed against Milli-Q water, at
least 200 times its volume with four water changes. Between the water
changes, the dialysis was left to proceed for 4–12 h at room
temperature. After dialysis, the protein was freeze-dried and stored
at −20 °C. Thereafter, the sample preparation procedure
was the same as for nondialyzed samples. All samples measured are
found in Table S1.
QENS
The QENS experiments[47−49] were carried out on the backscattering
spectrometer IN16B at the
Institut Laue-Langevin, Grenoble, France.[50,51] A scattering vector range of 0.2 < q < 1.8
Å–1 was covered, using Si111 monochromator
and analyzers, corresponding to an elastic neutron wavelength of 6.271
Å. QENS spectra were recorded by mechanically Doppler-shifting
the incident energy through a movement of the monochromator. Examples
of these spectra are found in Figures S1 and S2 in the Supporting Information.
Analysis
The QENS data were reduced
using Mantid(52) and subsequently
fitted using python scripts derived from https://github.com/seydel/QENS_utilities.The scattering function S(q, ω) recorded on IN16B depends on the momentum transfer ℏq and energy transfer ℏω, and was modeled by eq ,[16]where denotes the
spectrometer resolution function, A0(q) the EISF, and a Lorentzian
function. β(q) and represent scalars.
The Lorentzian widths
γ, Γ, and account for the contributions from the
protein COM diffusion, superimposed internal protein diffusion, and
solvent water diffusion, respectively.Importantly, the fits
can be performed with and without imposing
a dependence of the model (eq ) on the momentum transfer q, respectively.
For the COM diffusion, Fickian-type diffusion has been confirmed for
numerous well-folded proteins,[16]with the observable apparent COM diffusion
coefficient D. The internal diffusive motions of
the proteins have previously been described by the simplistic jump-diffusion
model,[16,35] which has proven sufficient on the rather
narrow energy range of −30 μeV ≤ ℏω ≤ + 30 μeV covered by our present experiment,with the internal diffusion coefficient Dint and the residence time between diffusive
jumps τ. On the observation time scale of our experiments, it
is assumed that eq approximately
accounts for protein backbone fluctuations, while side-chain motions
are too fast to be captured.[16] In this
work, two fitting approaches are compared, both based on eq : The first approach is to fit the
spectra obtained for each value of q separately (“per-q”) and to subsequently fit eq to the obtained γ(q). The second approach consists of imposing both eq for COM and eq for internal diffusivity in a fit of the
spectra for all q simultaneously, denoted “jump-diffusion”
fit.According to Cragnell et al.,[36] the
specific volume, ν, of Hst5 is
0.702 3 mL/g, calculated with Sednterp, which uses a method
by Cohn and Edsall.[53] The volume fractions
can then be determined,with V being the
volume of the solvent. The equation can be rearranged
to be expressed with concentrations.The fit was evaluated
with the goodness-of-fit, L1 loss function,
and L2 loss function. For goodness-of-fit,where #y is the number of
experimental data points, #p is the number of parameters, yexp is the experimental data, yfit is the fitted data and σ is the experimental error. For the
L1 loss function,and the L2 loss function.If singular observations are poorly fitted altough the overall
fits are reasonable, this would be captured by comparing the L1 and
L2 loss functions. Viscosities of pure water and of deuterium oxide
were calculated according to the relation by Cho et al.[54] To account for how changes in solvent properties
affect protein behavior, the Stokes–Einstein equation is used
to normalize the data for temperature and viscosity, yielding an effective
hydrodynamic radius (R),as the diffusion achieved
from QENS is an
apparent diffusion, rather than a translational diffusion D.
Paalman–Pings Correction,
Jump-Diffusion vs per-q Model
Paalman–Pings[55] corrections were applied, and the effect evaluated.
A longer
discussion on this is found in the Supporting Information, but it was mainly found that Paalman–Pings
corrections have a positive, but small impact on the results. Therefore,
our results include such a correction. A discussion on which model
to be used is found in the Supporting Information, which concluded the “jump-diffusion” model to best
find COM diffusion, while fitting each q individually (“per-q” model) was best for achieving comparable EISF.
Separation of Rotational and Translational Diffusion
In
our QENS experiment, the observable apparent diffusion coefficient D = D(D, D) consists
of contributions from both the rotational D and translational D diffusion. In practice, the separation of D and D is carried out with assumptions as below,[16,19]andwhere j represents the spherical Bessel function of the
first kind
and order l, and ρ(r) denotes the radial hydrogen distribution function
of the protein.
Relation between Diffusion, Molecular Mass,
and Fractal Dimension
A relation between diffusion and molecular
mass as well as fractal
dimension (d) has been
proposed by Augé et al.,[56]which can be linearized
to beThe constant C is
dependent on the ”molecular family” considered, and
needs to be parametrized. For this purpose, the α-synuclein
diffusion measured by QENS (Fujiwara et al.[23]) was used together with the SAXS-data of α-synuclein (Ahmed
et al.[57]). The fractal dimension of α-synuclein
was computed from the SAXS data via linear fitting of log(I) vs log(q) at high q, using the limits of q = 2.0 Å–1 and q = 2.8 Å–1, as described in Johansen et al.[58] The constant C was found to be 10 353.
For the prediction of diffusion for Hst5, the SAXS data of Cragnell
et al.[36] were used.
Simulation Details
Trajectories from
simulations of single-chain Hst5 in water using the A99SBN-ILDN (“A-ILDN”)
force field with TIP4P-D water model, the CHARMM36m (“C36m”)
force field with TIP3P water model modified for CHARMM, and the CHARMM36IDPSFF
(“C36IDPS”) force field with TIP3P water model modified
for CHARMM, were obtained from Henriques et al.[43] and Jephthah et al.,[59] and processed
as described below. The simulations in this study were performed using
the GROMACS software[60−64] version 2019.2, with Amber ff99SB-disp (“A-Disp”)
together with the TIP4P-derived water model a99SB-disp force field
and the accompanying parameters for ions.[65] These force fields were chosen since they have all previously been
shown to accurately describe the structural properties of Hst5.[43,59,66] The choice of A-Disp for the
crowded simulations was motivated by it being a ”balanced”
force field, intended to work for both globular and disordered proteins,
which may be of importance in crowding studies as a possible outcome
of crowding of IDPs is the induced ordering of the protein.[67] The leapfrog integrator was used with a time
step of 2 fs to compute the equations of motion. The LINCS algorithm
was used to constrain bonds involving hydrogen atoms.[68] A 12 Å cutoff was used for short-range electrostatic
and Lennard-Jones interactions. For long-ranged electrostatic interactions,
Particle-Mesh Ewald was used with a fourth order interpolation and
1.6 Å grid spacing.[69] A Verlet neighbor
list, updated every 100 fs was used, with a cutoff of 10 Å. Long-range
dispersion corrections were applied to energy and pressure. Separate
temperature baths with a velocity-rescale thermostat were coupled
to the protein and solvent including ions at a temperature of 300
K and a relaxation constant of 0.1 ps.[70] A Parrinello–Rahman barostat was applied, with the pressure
fixed to 1 bar and setting relaxation time to 2 ps, whereas the isothermal
compressibility was 4.5 × 10–5 bar–1. Periodic boundary conditions were applied in all directions. For
the crowded simulations, a box with an initial side length of 9.4
nm was used (a box geometry was also used in the case of single-chain
simulation) to insert the adequate amount of protein chains, then
the box size was increased to 10 nm, to attain the correct protein
concentration (two chains in case of 10 mg/mL protein concentration,
ten chains in case of 50 mg/mL protein), and thereafter solvated with
standard GROMACS tools. The increase in box size was to ensure a minimum
distance between the box and the proteins. Sodium and chloride ions
were inserted to both neutralize the charge of the proteins and to
achieve a salt concentration of 150 mM. The final system sizes were
37 628 solvent molecules, 111 sodium ions, and 116 chloride
ions for single-chain simulation, 30 389 solvent molecules,
90 sodium ions, and 100 chloride ions for a protein concentration
of 10 mg/mL and 29 282 solvent molecules, 90 sodium ions, and
140 chloride ions for a protein concentration of 50 mg/mL. All chains
had a starting conformation as a linear molecule, built in PyMol version
1.8 (Schrödinger, LLC). Initial energy minimization was performed
with the steepest descent algorithm, followed by a stability equilibration
run for 2 ns in the canonical ensemble (NVT: constant
number of particles, volume and temperature). Thereafter a pressure
stabilization was performed in the isobaric–isothermal ensemble
(NPT: constant number of particles, pressure and
temperature) for 2 ns. Production simulations were performed for 1200,
4000, and 4700 ns for single-chain simulations, 10 mg/mL protein concentration
simulation and 50 mg/mL protein concentration simulation, respectively,
using five replicates thus a total of 6, 20, and 23.5 μs. Any
other settings were left as default, as determined by the GROMACS
software. For computing the viscosity of the pure solvent A-Disp,
7198 molecules of A-Disp water was added initially in a cubic box
with a side length of 6 nm, resulting in an initial molecular density
of 33 molecules/nm3. A simulation was performed as above,
with the following deviations: Three replicates were used, the NPT
pressure equilibration was performed for 10 ns, and the production
run was 30 ns long. The correctness of this simulation was confirmed
by calculating the diffusion of the A-Disp water via mean square displacement
(see below), which was found to be 1.9 × 10–5 cm2/s - the same as was originally found by Robustelli
et al.[65] All simulation trajectories have
been used in full, without removing any initial part of the trajectories.
This may introduce a small bias from the choice of starting structure.
The influence of such bias has been deemed negligible from the convergence
assessment, which can be found in the Supporting Information. For the production of EISF from trajectories,
the program MDANSE[71] was used (for further
details on the algorithm used in MDANSE, see the Supporting Information).
Cluster Analysis of Molecular Dynamics Simulations
of Proteins
at a Concentration of 50 mg/mL
The GROMACS tool mindist was
used to compute the minimum c–α carbon
distance between all the chains in the simulation box for each replicate.
Two cut-offs for defining whether two chains are in a cluster were
used, 6 and 7 Å, as these are in the range used in similar applications.[72,73]
Diffusion Parameters from Molecular Dynamics Simulations
Transnational diffusion was determined by calculating the mean square
displacement, MSD (by GROMACS standard tools) using the Einstein relation
MSD = 6Dt, where D is translational diffusion
and t time. A straight line was fitted to the region of time = 0 and
time = 4.6 ns, corresponding to the coherence time of IN16B (with
linearity of the region being ensured by computing R2 for
each fit). This was completed for all chains in each replicate, averaging
across all replicates and chains. The calculated translational diffusion
for all individual chains can be found in the Supporting Information Tables S4–S8 and S10. The periodic boundary
conditions were treated to ensure that no molecules were broken or
diffusing across the simulation box, ensuring a continuous trajectory.To account for the finite box used in the simulation, the correction
for translational diffusion by Yeh and Hummer[74] was used. Previous simulations of Hst5 by Henriques et al. used
a rhombic dodecahedron simulation box rather than a cubic box, so
in this case the constant ξ for FCC-lattice computed by Hasimoto[75] was used, with the side-length being the length
of the side of a unit cell with the rhombic dodecahedron inscribed.
As well for the simulation by Henriques et al., viscosity of the pure
solvent was taken from von Bülow et al.[72] as it used the TIP4P-D water model. The corresponding viscosity
for the CHARMM modified TIP3P water model was taken from Ong and Liow.[76] The diffusion constants obtained were also further
corrected for the discrepancy of using H2O in the simulation
while using D2O in the experiment, by multiplying the ratio
of viscosity between H2O and D2O. The correction
also requires the viscosity of the system simulated, which was computed
via an Einstein relation,as described by Hess.[77] For single-chain simulations, the HYDROPRO software[78] was also used for calculating translational
diffusion,
as an alternative. In this case, no corrections for the finite box-size
is needed, as the structure is used directly. As well, when using
HYDROPRO, the viscosity of D2O was used as input directly
rather than correcting the H2O and D2O viscosity
difference postprocess. For the computation of autocorrelation of
the translational diffusion from HYDROPRO, the following definition
of autocorrelation was used,where Dt is the translational diffusion at snapshot index i, the mean translational
diffusion, and N the total number of snapshots used.
Results and Discussion
Trends
in Apparent COM Diffusion Coefficient
The apparent diffusion
coefficient (D) of Hst5
was determined from the experimental QENS data using the jump-diffusion
model with Paalman–Pings corrections applied (see discussion
in the Supporting Information). The results
are shown in Figure as a function of both protein concentration and temperature. A consistent
downward trend in apparent diffusion with increasing protein concentration
is observed, indicating that crowding induces a reduction in the COM
diffusion of Hst5. The temperature effect is, in relative terms, consistent
across the different protein concentrations, increasing almost 3-fold
from lowest to highest temperature. The data point for 50 mg/mL and
310 K is missing in the data set since we, for the IDP of the size
used in this study, reached the limitations of technique at IN16B;
i.e., the combined high speed-diffusion and comparably low protein
concentrations did not provide feasible data. Considering the concentration
dependence, the results indicate that the dynamics changes more drastically
at lower protein concentrations. This is, however, not a consistent
trend, as indicated by the small increase in slope between protein
concentration 150–200 mg/mL, with the exception of the 298
K data, where the slope remains constant in the interval 100–200
mg/mL.
Figure 1
Apparent COM diffusion coefficient D versus the
protein concentration c (lower x-axis) and protein volume fraction ϕ
(upper x-axis). The symbols denote the experimental
results obtained from fitting the IN16B QENS spectra in terms of Fickian
diffusion (eq ) for D, using jump-diffusion model for internal diffusion. Data
shown is for a salt concentration of 150 mM NaCl, with the Paalman–Pings
correction applied.
Apparent COM diffusion coefficient D versus the
protein concentration c (lower x-axis) and protein volume fraction ϕ
(upper x-axis). The symbols denote the experimental
results obtained from fitting the IN16B QENS spectra in terms of Fickian
diffusion (eq ) for D, using jump-diffusion model for internal diffusion. Data
shown is for a salt concentration of 150 mM NaCl, with the Paalman–Pings
correction applied.QENS experiments give
the apparent diffusion D, which convolutes translational D and rotational diffusion D. For globular proteins, Roosen-Runge
et al.[19] have demonstrated a procedure
to deconvolute these two
contributions. This was achieved by modeling their globular protein
as an ellipsoid, using Perrin factors to achieve the dilute limit
of rotational- and translational diffusion, as well as using a relation
of how rotational diffusion changes with increasing crowding. The
latter is valid for charged spheres (cf. eq ); hence, it is not certain that this methodology
is valid for IDPs, both considering the deconvolution itself, and
the rotational diffusion relation. Additionally, Fagerberg et al.[37] have shown that aggregation may take place at
protein concentrations larger than 50 mg/mL. Therefore, this procedure
is not applied.As one may expect from the equipartition theorem,
a higher temperature
results in a faster diffusion. The temperature dependence on solvent
properties may also be a factor in regard to the temperature dependence
of the observed apparent diffusion. To evaluate this further, eq is applied, however, this
equation assumes translational diffusion, yielding the hydrodynamics
radius (R). Instead,
an “effective” radius (R) is considered here. The important observation
made from applying eq is whether the temperature dependence of the observed diffusion
is a consequence of changes in solvent properties, which would yield
a similar R across
temperatures, or if additional explanations are necessary. Results
are shown in Figure .
Figure 2
“Effective” radius of hydration Re obtained using the Stokes–Einstein
relation (eq ) from
the experimental diffusion coefficients to assess if the temperature
dependence of the results is a consequence of changes in solvent properties
solely. The diffusion coefficients used are those displayed in Figure (150 mM NaCl concentration).
The radius obtained represents an “effective” radius
or “pseudo-hydrodynamic” radius since the observable
apparent diffusion D is an implicit function of both
the rotational and the translational diffusion. Error bars may be
smaller than the symbol size.
“Effective” radius of hydration Re obtained using the Stokes–Einstein
relation (eq ) from
the experimental diffusion coefficients to assess if the temperature
dependence of the results is a consequence of changes in solvent properties
solely. The diffusion coefficients used are those displayed in Figure (150 mM NaCl concentration).
The radius obtained represents an “effective” radius
or “pseudo-hydrodynamic” radius since the observable
apparent diffusion D is an implicit function of both
the rotational and the translational diffusion. Error bars may be
smaller than the symbol size.At low protein concentration, the R differs only slightly between the different temperatures,
12.9 and 11.8 Å, at 280 and 298 K, respectively (a relative difference
of 10%), indicating that the temperature dependence is Stokesian.
At higher protein concentrations, the picture is again blurred by
the possibility of aggregation, but the determined R displays a difference of 15% at 100
mg/mL, 5% at 150 mg/mL, and 20% at 200 mg/mL.Another step further
would be to use R to
determine an estimate for the radius of gyration
(R). Employing the relation
between R and R as suggested by Nygaard et
al.,[79] and assuming R = R, it is found that R is 8.1 and 6.0 Å at 280 and 298 K, respectively, at 50
mg/mL protein concentration. This is far from the value of 12.4 Å
measured by Fagerberg et al.[37] at the same
protein concentration, indicating that all approximations assumed
are not valid.
Effect of Salt Concentration
At a
protein concentration of 200 mg/mL, the effect of different salt concentrations
were considered. Two different procedures were used regarding sample
preparation, with or without dialysis, which produces different salt
concentrations in the solution (a discussion and estimation of these
can be found in Supporting Information).
Furthermore, in order to verify the reproducibility, and considering
the limited access to experimental beamtime, a single sample was measured
twice with several months in-between the measurements. The apparent
COM diffusion coefficient obtained by these measurements are shown
in Figure .
Figure 3
Apparent COM
diffusion coefficient D obtained
from the IN16B QENS data, subsequent to Paalman–Pings corrections,
by fitting eq and therein
imposing eqs and 3 for samples with a protein concentration of 200
mg/mL, while varying the amount of salt and temperature T.
Apparent COM
diffusion coefficient D obtained
from the IN16B QENS data, subsequent to Paalman–Pings corrections,
by fitting eq and therein
imposing eqs and 3 for samples with a protein concentration of 200
mg/mL, while varying the amount of salt and temperature T.There is a clear downshift in
diffusion as the salt concentration
increases. The relative temperature effect seems unaffected by the
level of salt, with an increase in diffusion by approximately a factor
of 3 in the temperature range between 280 to 310 K.An increase
in salt concentration alters the solvent properties.
Of importance is the viscosity changes, which can be determined by
the equations of Goldsack and Franchetto,[80,81] which are valid in the salt concentration range considered. Calculations
can be found in Supporting Information.
Using these viscosities, R was determined via the Stokes–Einstein equation (eq ), which can be used to
assess the effect on solvent properties by increases in salt. A considerable
difference for the different levels of salt is still found (see Figure S9), suggesting that a large amount of
salt has additional effects than changing solvent viscosity. From
literature, slowdown of diffusion with increasing salt concentration
has also been observed in QENS measurements of BSA and YCl3.[82] In that case, the results were explained
by salt-induced clustering, which might also be an explanation here.
However, it should be emphasized that the salt used in the aforementioned
study was trivalent and the protein studied was globular, which is
significantly different compared with the current case. Another possible
explanation for the decrease in diffusion with increasing salt concentration
can be found by considering the coarse-grained simulations at different
salt concentrations previously performed by Fagerberg et al.[37] In these simulations, a larger R was found at low salt conditions, speculatively
due the stronger charge-repulsion found at low salt concentrations
as Hst5 has a fairly high net charge. Considering Hst5 to be an ellipsoid,
a more extended structure would mean the polar semiaxis would be longer,
and the equatorial semiaxis would be smaller. Applying the equation
for diffusion of an ellipsoid moving randomly, reported by Berg,[83] this would mean an increase in diffusion. An
increase in diffusion at low salt concentration would be equivalent
with a decrease in diffusion at high salt concentration, in line with
the results shown here. The difference in salt for the samples was
achieved not by adding salt, but by abstaining from dialyzing the
samples intended to have high salt concentration. This, however, also
reveals the importance of sample preparation. The lack of proper sample
preparation may introduce excessive amounts of salt, yielding lower
than expected diffusion rates.
Comparison
with Simple Geometries
To appreciate the deviation of Hst5
from the diffusive behavior of
more simple geometries, we here determine the corresponding effective
diffusion for a sphere and an ellipsoid (Table ). For both geometries, the translational
diffusion is obtained through the Stokes–Einstein equation,
and the rotational diffusion through Einstein–Smoluchowski
relation, with the difference being the friction factor. Using the P(r) from SAXS as found by Cragnell et
al.,[36] and the R found by the same SAXS-data (with R estimated from the relation by Nygaard
et al.), the apparent (or effective) diffusion is found to be 1.96
Å2/ns—a magnitude off (!).
Table 1
Computed Diffusion Using Simple Geometrical
Modelsa
model
Dt
Dr
D
sphere
1.35
0.005
1.96
ellipsoid
1.56
0.1
2.66
Translational diffusion D and apparent diffusion D in units of Å2/ns and rotational diffusion D in units of 1/ns. For the
spherical case, the relation of Nygaard et al. was
used to compute R from R.
Translational diffusion D and apparent diffusion D in units of Å2/ns and rotational diffusion D in units of 1/ns. For the
spherical case, the relation of Nygaard et al. was
used to compute R from R.A better approximation of the shape can be found by
fitting an
ellipsoid to the SAXS-data. Such fitting yields polar semiaxis a = 32.9 Å, and equatorial semiaxes b = 5 Å, respectively. Perrin-factors are then used to attain
translational- and rotational diffusion, following Roosen-Runge et
al.[19] Again, using P(r) as found by SAXS-measurement,
the apparent diffusion becomes 2.66 Å2/ns - still
significantly smaller than the experimentally determined value.On a related note, it is possible to compute R via Perrin-factors, offering an alternative
to the relation by Nygaard et al. Inspecting the numbers determined,
14.61 Å (Nygaard-relation) and 12.65 Å (Perrin-factors),
it is observed that if the Perrin-factors are used, a R smaller than R is found. Furthermore, using the R obtained through Perrin-factors,
the apparent diffusion becomes smaller. This would make a case against
using Perrin-factors to calculate R in this case, which is not unexpected, since the relation
by Nygaard et al. was specifically developed for IDPs and thus should
indeed perform better for Hst5.
Scaling Laws for Colloidal
Hard Sphere Suspensions
Several scaling laws for diffusion
constants with increasing crowding
have been published, each making different assumptions on particle
properties.[24,84,85] Here, as both translational- and rotational diffusion relationships
were derived, and since Hst5 is a charged peptide, we choose the scaling
law that assumes charged spheres by Banchio and Nägele,[24]with a = 2.5 and a =
1.3 and the effective hydrodynamic volume fraction ϕ. This scaling law has successfully described the
change in diffusion for BSA protein.[19] We
stress that the applicability of such scaling models based on spherical
colloids may be severely limited to describe strongly nonspherical
proteins.[86] Using the scaling law in combination
with the implicit function D = D(D, D) by Roosen-Runge et al., eq , to calculate the observable D, would in this case be valid as the scaling law of Banchio
and Nägele assumes charged spheres, thus circumventing the
uncertainty of whether the deconvolution procedure is valid for IDPs.
A comparison can thereafter be achieved by dividing with the value
attained at 50 mg/mL, in both the colloid model case and the experimental
case (see Figure ).
It is noted that ϕ in eq differs from the experimental
as-prepared protein volume fraction ϕ (eq ) (“dry volume fraction”) because
the COM diffusion is governed by R that includes a hydration shell.[19] In our QENS experiment, the difference in scattering cross section
between hydrogen and deuterium makes it possible to probe the (nondeuterated)
protein exclusively, i.e., no hydration shell is considered in the
assumptions of the relative amplitudes β and of the contributions to the scattering
signal (eq ), and any
effect from a change in the solvent dynamics itself near the protein
surface is neglected.[87] We here assume ϕh = (R/R)3ϕ to approximately accommodate this difference. R was approximated by the relation by
Nygaard et al., as before, however, we also consider that this relation
is one of many suggested, and that this has a fairly low R/R ratio (1.06) for Hst5. Nygaard et al. also points out that
for an idealized sphere, R/R = 1.28, but
an even larger ratio is found by using R = R + hydration shell, and by assuming a hydration shell of 5.5 Å,
using data from Perticaroli et al.,[87] who
found the perturbed number of water molecules near a globular protein
corresponds to a hydration layer smaller than 5.5 Å. This gives
a ratio of 1.40. Results from using both ”dry volume fraction”
and ”hydrodynamic volume fraction”, comparing the scaling
law with our measurements, are shown in Figure .
Figure 4
Comparison of diffusion constants obtained by
the colloidal scaling
law of Banchio and Nägele (eq ), utilizing the deconvolution procedure by Roosen-Runge
et al. (eq ), and
the experimental data. Experimental data is shown for 150 mM NaCl,
at 298 K, with Paalman–Pings corrections used. The scaling
law is combined with different assumptions on R, as it assumes a volume fraction based on
effective R.
Comparison of diffusion constants obtained by
the colloidal scaling
law of Banchio and Nägele (eq ), utilizing the deconvolution procedure by Roosen-Runge
et al. (eq ), and
the experimental data. Experimental data is shown for 150 mM NaCl,
at 298 K, with Paalman–Pings corrections used. The scaling
law is combined with different assumptions on R, as it assumes a volume fraction based on
effective R.Using the relation by Nygaard et al., and accounting for
the difference
in hydrodynamic/dry volume fraction, only returns a small difference.
An increased R seemingly
also gives better agreement with the experimental result, but not
even the worst-case scenario considered here can fully account for
the low diffusion rate. In this context, previous research has indicated
that hydration of IDPs may be different compared to that of globular
proteins.[88]
Prediction
Using Fractal Dimension as a
Variable
Instead of approximating Hst5 as a sphere or an
ellipsoid, an analytical expression for diffusion using fractal dimension
and molecular mass as input variable has been suggested by Augé
et al.,[56] as can be viewed in eq . After parametrization
of this relation using QENS data of α-synuclein, a prediction
of diffusion for Hst5 of 41 Å2/ns is obtained. This
is more than double the experimentally determined value of about 17
Å2/ns at the lowest measured protein concentration.
To consider the crowding effect, it is observed from QENS that increasing
protein concentration from 50 to 100 mg/mL decreases the diffusion
by 40%. Such a crowding response would still not be enough to validate
the model. Of note in this context is the sensitivity of the prediction—a
difference of 0.05 in fractal dimension can yield a difference in
diffusion of almost 10 Å2/ns, which is of relevance
since the region from which the fractal dimension is calculated is
somewhat noisy and prone to the accuracy of buffer subtraction.
Molecular Dynamics Simulations
Previously,
atomistic simulations of single-chain Hst5 have been performed by
Henriques et al.,[43] using the Amber99SBN-ILDN
force field (shortened “A-ILDN” here), and Jephthah
et al.[59] using the CHARMM36m (“C36m”
for short) and CHARMM36IDPSFF (“C36IDPS” for short)
force fields. Structural data obtained from all of these force fields
have previously shown to be in agreement with experimental SAXS data
of Hst5, thus showing their suitability. We also perform a simulation
with the Amber99SB-disp (“A-Disp” for short), which
also has been used to predict structural dimensions of Hst5. The determined
translational diffusion can be found in Table .
Table 2
Computed Translational
Diffusion Coefficients
(D) Obtained Using Different
Force Fields and Different Methodsa
force field
Dt
Dt incl. crowding
HYDROPRO prediction
of Dt
A-ILDN
22.4 ± 0.3
21.9 ± 0.3
13.2 ± 0.8
A-Disp
16.0 ± 0.2
15.7 ± 0.1
13.4 ± 0.8
C36m
55.6 ± 1.3
54.3 ± 1.2
13.2 ± 0.9
C36IDPS
59.8 ± 0.9
58.4 ± 0.9
13.9 ± 0.9
The “crowding”
included is by using eq . The ± sign indicates standard deviation. All units in Å2/ns.
The “crowding”
included is by using eq . The ± sign indicates standard deviation. All units in Å2/ns.Comparing the
different numbers obtained, there is a clear difference
between using a CHARMM-based force field with TIP3P based water model,
or an Amber-based force field with a TIP4P-D based water model, with
the former predicting much faster translational diffusion. The translational
diffusion produced from these simulations is not the same as the effective
diffusion found in QENS, but can still be used for comparison with
our experimental results, at least as a lower bound. The corresponding
diffusion coefficient found by QENS is 16.8 ± 0.66 Å2/ns, at a concentration of 50 mg/mL. Therefore, it is seen
that most force fields overestimate the translational diffusion, with
the exception of A-Disp, which, taking standard deviation into account,
is barely in agreement with experiment.No concentration dependence
on the structural properties of Hst5
has been observed, between low protein (≈ 6 mg/mL) and 50 mg/mL
protein concentration experimentally,[37] but the same may not be true for dynamical properties. Therefore,
the difference in diffusion may be attributed to the differing protein
concentration in the simulation, which here uses infinite dilution,
and the compared experiment, which here is at a protein concentration
of 50 mg/mL. eq can
provide an estimate, given that it assumes charged hard spheres, to
evaluate this concern. Using the diffusion constants obtained from
the infinite-dilution simulation and these scaling laws, a recalculated
translational diffusion is obtained, see Table . It is mainly observed that the change is
small, only qualitatively showing that the A-Disp force field is slightly
underestimating the diffusion, rather than being within experimental
error.As an alternative procedure, translational diffusion
can be computed
via the HYDROPRO program.[78] Results from
this procedure are found in Table . As the translational diffusion is somewhat smaller
than the effective diffusion found by QENS, this estimate is surprisingly
close to experiment, given that HYDROPRO was parametrized with crystal
structures (while Hst5 is an IDP). However, adding the fact that experiment
was performed at a higher protein concentration, thus subject to a
crowding effect, may suggest the calculated number is an underestimation.
Using the HYDROPRO approach, the translational diffusion was calculated
for each snapshot, yielding a distribution of diffusion, see Figure .
Figure 5
Reduced distribution
of translational diffusion computed for each
snapshot using HYDROPRO or R, combined with the relation of Nygaard et al. to obtain R, and the Stokes–Einstein
relation. The ”reduction” is done by dividing with the
mean translational diffusion for each case.
Reduced distribution
of translational diffusion computed for each
snapshot using HYDROPRO or R, combined with the relation of Nygaard et al. to obtain R, and the Stokes–Einstein
relation. The ”reduction” is done by dividing with the
mean translational diffusion for each case.To show that the “HYDROPRO-approach” is not a simplistic
mirroring of the size distribution of particles (defined in terms
of R), R was determined for each individual
snapshot, from which the translational diffusion was calculated via
Stokes–Einstein equation. As we know that this procedure yields
lower average translational diffusion, both distributions were divided
by the average translational diffusion in Figure . As expected HYDROPRO produces a broader
distribution, indicating that the particle shape heterogeneity, and
not just the particle size heterogeneity, is a contributing factor.
As well as considering distributions of translational diffusion, the
HYDROPRO approach also allows for the computation of the autocorrelation
of the translational diffusion (eq ), achieved for each individual chain for all force
fields in Figure .
Figure 6
Computed
autocorrelation of the translational diffusion (eq ) for all the replicates
using the A-ILDN force field (A), the A-Disp force field (B), the
C36m force field (C), and the C36IDP force field (D), using the HYDROPRO
software. The resolution here is one data point every 100 ps.
Computed
autocorrelation of the translational diffusion (eq ) for all the replicates
using the A-ILDN force field (A), the A-Disp force field (B), the
C36m force field (C), and the C36IDP force field (D), using the HYDROPRO
software. The resolution here is one data point every 100 ps.The autocorrelation time is much longer than the
time observed
in the QENS experiment, regardless of the force field used, which
based on an energy resolution of 0.9 μeV full width at half
maximum (fwhm) would not be longer than a few nanoseconds. This implies
that the QENS experiment yields ensemble averages over the system,
and not an average over time.
Elastic Incoherent Structure
Factors
The Elastic Incoherent Structure Factor (EISF) was
calculated from
the simulation using the MDANSE software[71] (details of how this computation is performed can be found in Supporting Information, section 6.), the results
are shown in Figure .
Figure 7
Comparison of the elastic incoherent structure factor (EISF) A0(q) between experiment (eq , protein concentration c = 50 mg/mL, 150 mM salt,
temperature T = 298 K, not imposing the q-dependence in the model) and simulation. (A) Comparing the single-chain
simulations with experiment. (B) Comparing the crowded simulations
with experiment.
Comparison of the elastic incoherent structure factor (EISF) A0(q) between experiment (eq , protein concentration c = 50 mg/mL, 150 mM salt,
temperature T = 298 K, not imposing the q-dependence in the model) and simulation. (A) Comparing the single-chain
simulations with experiment. (B) Comparing the crowded simulations
with experiment.Remarkably, the C36IDPS
force field yields a perhaps somewhat closer
agreement with experiment, despite its previously shown poor results
in terms of diffusion. It is also seen to be fairly noisy, which may
explain the superficial better agreement with experiment as a coincidence,
when scaling the curve as to have A0(0)
= 1. We speculate that the rather featureless EISF, obtained by both
experiment and simulation, corroborates the effective average over
the fluctuating shape of the protein. The single drop with increasing q in this picture reflects an effective diffusive mean-free
path within the protein. The corresponding experimental EISF at higher
protein concentrations is found in the Supporting Information, where there is indication of the apparent mean
free path decreases with increasing crowding, though the fairly large
error bars partly obscures this view. A difference in EISF, can in
this case, be caused by the different conditions of the experiment
and the simulation. In terms of q, the simulation
has its limits set by the simulation box size and the precision of
the coordinates. For the simulation of Henriques et al., which used
a rhombic dodecahedron as box geometry, this amounts to q-values of about 0.05–600 Å–1. Similar
values apply for the crowded simulations, which used a cubic box geometry.
The QENS measurements on the other hand are restricted to q-values of 0.19–1.8 Å–1.
In terms of energy resolution, the experimental resolution of 0.9
μeV fwhm result in that motions slower than a few nanoseconds
are perceived as immobile in the experiment. Correspondingly, the
finite length of the simulated trajectory will render slow motions
beyond such cutoff invisible in the simulations. On the other end,
very fast motions may not be captured due to the finite sampling in
the simulations.
Effect of Crowding
Starting with
structural features from the simulations, R was found to be 12.2 Å in the case of 10 mg/mL
protein concentration and 12.9 Å for the 50 mg/mL protein concentration
simulation, which should be compared with the single-chain simulation,
which produced a R of
13.1 Å (all numbers for A-Disp force field, having a standard
deviation of 2.2 Å). The commonly accepted experimental value
at low protein concentration (approximated as infinite dilution) is
13.8 Å,[36] though other experimentalists
have found values as low as 12.4 Å.[37] This is however not the first time Hst5 has been simulated with
the A-Disp force field: Jephthah et al.[59] found R of 12.9 Å.
Shrestha et al.[66] on the other hand, found
a R slightly below 12
Å with standard simulation methods and slightly above 12 Å
with enhanced sampling, though showing good agreement with SAXS data
despite the somewhat small values of R. Hence, considering the variation between different
experiments and simulations, it would seem that crowding in these
simulation does not induce a change in R. As can be gathered from the last snapshots of the
simulations (Figures S18), irreversible
aggregation does not seem to have occurred for the protein concentration
considered. Both of these observations are in line with previous crowding
experiments.[37] Considering secondary structure
in terms of the phi/psi dihedral angles, we first note that the Ramachandran
plot for the single-chain simulation using the A-Disp force field
(Figures S23, left) is similar to the one
produced by Jephthah et al.[59] Second, by
comparing the Ramachandran plots for the different protein concentrations
(Figure S23), it is observed that they
are practically indistinguishable. This would further indicate that,
structurally, there is no change upon increasing concentration in
the concentration span investigated.
Clustering Analysis
An analysis was performed to investigate
the possible formation of transient clusters. As can be seen from Figures S19–S21, the proteins are very
active in forming and breaking clusters of varying size. The exact
numbers for these depends on how a cluster is defined, but using a
metric found in similar studies with globular proteins[72,73] and testing two different cut-offs, it is found that on average,
there are two clusters present, with a total of six protein chains
participating, and the largest cluster being three to four protein
chains (Table S9). This analysis may depend
on the size of the box used (which determines the number of protein
chains in the system, keeping the concentration constant).
Diffusion
Regarding the diffusion, there is a clear
decrease in D with increasing
crowding. Using the A-Disp force field, infinite dilution showed D of 16.0 (standard deviation
0.2) Å2/ns (as discussed above), while crowded simulations
predicted D of 14.6
(standard deviation 0.1) and 10.7 (standard deviation 0.9) Å2/ns for 10 and 50 mg/mL protein concentration, respectively.
Even if, for any individual chain, fluctuations may be large, this
still points to a crowding effect greater than the crowding scaling
law of choice indicated. The data is shown in Figure , together with the D part of the scaling law (eq ) and the experimental data. It
is observed that the scaling law, used with the Nygaard relation here,
also for the simulation data underestimates the effect of crowding.
In this case, the difference is not due to irreversible aggregation,
which might have been postulated in the experimental case.
Figure 8
D as obtained from
simulation of Hst5 at different crowding conditions with the A-Disp
force field, the scaling law using only the translational diffusion
part – no rotational part, and the experimental data. The reader
is reminded that the experimental data is an apparent diffusion (D = D(D, D)).
D as obtained from
simulation of Hst5 at different crowding conditions with the A-Disp
force field, the scaling law using only the translational diffusion
part – no rotational part, and the experimental data. The reader
is reminded that the experimental data is an apparent diffusion (D = D(D, D)).However, the presence of transient clusters may
contribute to the
decrease in diffusion. Comparison of the diffusion on a per-replicate
basis with the different metrics of transient clustering (Figure S22), there is an indication that the
diffusion is dependent on the total number of proteins participating
in transient clusters. This is also shown by making a linear regression
of the data and computing R2, see Table S11. It should be stressed that this is
only an indication, given the few data points available here.Comparing with the experimental value of 16.8 ± 0.66 Å2/ns (D) at 50 mg/mL with the predicted value
10.7 Å2/ns (D) from the simulation of 50 mg/mL protein concentration, it
might be hard to conclude whether the model is in line with experiment
or not—as a lower bound, any value below the experimentally
found value could be argued reasonable. To achieve a more fair comparison,
we studied the relative ratios between D and D as found by Fujiwara et al.,[23] where the ratio was found to be 1.27 (while
the same ratio using the ellipsoid approximation as shown above is
1.7). With this ratio, an approximate D of 13.6 Å2/ns is found, indicating simulation to yield slower dynamics
than experimental evidence. The ratio may differ between different
systems, but as a first approximation (and a better approximation
than the ellipsoid approximation, as Fujiwara et al. studied an IDP),
the model used seem to underestimate the dynamics of Hst5. Although
the system is crowded with protein to a volume fraction of 0.03, the
water model may indeed play a role in this underestimation, given
its deviation from experiment.Wang et al.[89] has previously measured
diffusion of a globular and a disordered protein using NMR together
with different crowders, finding a significant change in diffusion
for both kinds of systems, though at more crowded conditions than
studied here (300 mg/mL). However, pointing to other studies, Wang
et al. notes that this occurs without a change in structure, which
would be in line with the results from our simulations, but also partly
in line with a study by König et al.[15] When investigating the IDP prothymosin α, in the crowded conditions
of eukaryotic cells, no change in chain dimensions was found but a
slowdown of translational diffusion (factor of 1.5), though also finding
chain compaction upon further crowding (induced by hyperosmotic stress).In absolute numbers, our results have similar magnitude as those
of Fujiwara et al., who showed α-synuclein to have D in the range of 9–16 Å2/ns for different
temperatures at a protein concentration of 10 mg/mL. Two key differences
should be mentioned in this context: The measurements of Fujiwara
et al. were performed at a protein concentration of about 10 mg/mL,
and α-synuclein is a much larger protein (14.5 kDa, while Hst5
is 3.0 kDa). Other QENS measurements performed with globular proteins
show a magnitudal difference in D, as shown in Table . The difference in
size should of course be considered here, as the molar mass range
in the above studies is 57–150 kDa. However, BSA, which has
slightly less than half the size of immunoglobulin, has similar diffusivity
at dilute conditions, clearly indicating that larger molecular mass
does not directly result in slower diffusion, since the shape also
matters. This would in turn indicate that the magnitude faster diffusion
found in the IDPs surveyed is not a simple consequence of their smaller
molecular mass.
Table 3
Other Studies of Proteins Using QENS,
Considering the Size of the Protein and the Obtained Apparent Diffusion
protein
size [kDa]
app. diff. [Å2/ns]
note
Hst5
3.0
2.5–17
this work
α-synuclein[23]
14.5
9–16
immunoglobulin[20]
150
1–4
temperature
of 293 K
BSA[19]
66.4
0–4
temperature of 280 K
GroEl[18]
57
0.5–1.8
temperature of 297 K
Conclusions
The apparent diffusion of the IDP Hst5
has been obtained by QENS
under self-crowded conditions. A decrease in diffusion with increasing
crowding is found, exceeding the decrease calculated from an established
scaling law, which has previously been shown to describe the diffusion
of globular proteins. We hypothesize that this strong decrease is
a consequence of a minor degree of aggregation at higher crowding
levels. The temperature dependence of the results is largely explained
by the Stokes–Einstein equation. Increasing the salt concentration
decreases the diffusion, a relation not described by changes in solvent
properties (i.e., salt-dependent viscosity). Usage of simple geometries
grossly underestimates diffusion constants, and analyzing structures
from MD simulations with HYDROPRO indicate that the distribution of
the diffusion not only depends on size. Crowded MD simulations also
show a clear decrease of the diffusion constant with increasing crowding,
in semiquantitative agreement with experiment. There is some uncertainty
in this comparison, as the experimental observable is not directly
comparable with the translational diffusion obtained from simulation,
though a few suggestions on this relationship can be found in the
literature. The EISF found in the experiment has too large error bars
to be reasonably compared with the ones obtained from the simulation
results. Under crowded conditions, the diffusion of the IDP Hst5 is
more greatly influenced than the structural properties.
Authors: David Van Der Spoel; Erik Lindahl; Berk Hess; Gerrit Groenhof; Alan E Mark; Herman J C Berendsen Journal: J Comput Chem Date: 2005-12 Impact factor: 3.376
Authors: Marco Grimaldo; Hender Lopez; Christian Beck; Felix Roosen-Runge; Martine Moulin; Juliette M Devos; Valerie Laux; Michael Härtlein; Stefano Da Vela; Ralf Schweins; Alessandro Mariani; Fajun Zhang; Jean-Louis Barrat; Martin Oettel; V Trevor Forsyth; Tilo Seydel; Frank Schreiber Journal: J Phys Chem Lett Date: 2019-03-28 Impact factor: 6.475
Authors: Christian Beck; Marco Grimaldo; Hender Lopez; Stefano Da Vela; Benedikt Sohmen; Fajun Zhang; Martin Oettel; Jean-Louis Barrat; Felix Roosen-Runge; Frank Schreiber; Tilo Seydel Journal: J Phys Chem B Date: 2022-09-16 Impact factor: 3.466