Sandhyaa Subramanian1, Hemashree Golla1, Kalivarathan Divakar2, Adithi Kannan1, David de Sancho3,4, Athi N Naganathan1. 1. Department of Biotechnology, Bhupat & Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, India. 2. Department of Biotechnology, National Institute of Technology Warangal, Warangal 506004, India. 3. Polimero eta Material Aurreratuak: Fisika, Kimika eta Teknologia, Kimika Fakultatea, Euskal Herriko Unibertsitatea UPV/EHU, Donostia-San Sebastián 20080, Spain. 4. Donostia International Physics Center (DIPC), PK 1072, Donostia-San Sebastián 20080, Spain.
Abstract
The rate at which a protein molecule folds is determined by opposing energetic and entropic contributions to the free energy that shape the folding landscape. Delineating the extent to which they impact the diffusional barrier-crossing events, including the magnitude of internal friction and barrier height, has largely been a challenging task. In this work, we extract the underlying thermodynamic and dynamic contributions to the folding rate of an unusually slow-folding helical DNA-binding domain, PurR, which shares the characteristics of ultrafast downhill-folding proteins but nonetheless appears to exhibit an apparent two-state equilibrium. We combine equilibrium spectroscopy, temperature-viscosity-dependent kinetics, statistical mechanical modeling, and coarse-grained simulations to show that the conformational behavior of PurR is highly heterogeneous characterized by a large spread in melting temperatures, marginal thermodynamic barriers, and populated partially structured states. PurR appears to be at the threshold of disorder arising from frustrated electrostatics and weak packing that in turn slows down folding due to a shallow, bumpy landscape and not due to large thermodynamic barriers or strong internal friction. Our work highlights how a strong temperature dependence on the pre-exponential could signal a shallow landscape and not necessarily a slow-folding diffusion coefficient, thus determining the folding timescales of even millisecond folding proteins and hints at possible structural origins for the shallow landscape.
The rate at which a protein molecule folds is determined by opposing energetic and entropic contributions to the free energy that shape the folding landscape. Delineating the extent to which they impact the diffusional barrier-crossing events, including the magnitude of internal friction and barrier height, has largely been a challenging task. In this work, we extract the underlying thermodynamic and dynamic contributions to the folding rate of an unusually slow-folding helical DNA-binding domain, PurR, which shares the characteristics of ultrafast downhill-folding proteins but nonetheless appears to exhibit an apparent two-state equilibrium. We combine equilibrium spectroscopy, temperature-viscosity-dependent kinetics, statistical mechanical modeling, and coarse-grained simulations to show that the conformational behavior of PurR is highly heterogeneous characterized by a large spread in melting temperatures, marginal thermodynamic barriers, and populated partially structured states. PurR appears to be at the threshold of disorder arising from frustrated electrostatics and weak packing that in turn slows down folding due to a shallow, bumpy landscape and not due to large thermodynamic barriers or strong internal friction. Our work highlights how a strong temperature dependence on the pre-exponential could signal a shallow landscape and not necessarily a slow-folding diffusion coefficient, thus determining the folding timescales of even millisecond folding proteins and hints at possible structural origins for the shallow landscape.
The marginal stability of most proteins
has its origins in the
strength and nature of interactions including hydrophobic packing
in the protein interior and surface electrostatics that can be both
favorable and unfavorable. The requirements for marginal stability
are manifold and can range from regulatory needs of the organism to
efficient folding and function.[1−9] These conflicting requirements contribute to the evolution of protein
sequences and determine not just the identity of the amino acid at
a specific location in the sequence but also subtly influence the
immediate environment around it, contributing to the coevolution of
sites far in the protein sequence.[10−14] Such features and constraints are driven to an extreme
in DNA-binding domains (DBDs) whose entire nucleic acid-binding face
exhibits a net positive electrostatic potential to bind the polyanionic
partner. In fact, many of the DBDs display complex thermodynamics
in the apo form, which has been attributed to the population of partially
structured states in solution.[15−26] The conformational complexity need not be restricted to DBDs but
likely applies to any macromolecule that binds an oppositely charged
counterpart. “Frustration,” even if minimized by evolutionary
forces to guarantee efficient folding, is a universal feature of proteins
and enzymes and is predicted to influence or in some cases even drive
folding-function behaviors.[27,28]An outstanding
question is the extent to which such frustration
(from geometrical constraints, packing interactions, or electrostatics)
influences the folding landscape of DBDs and thereby the folding speed.
Assuming a simple one-dimensional free-energy profile, the rate of
diffusional barrier crossing (k) according to Kramers
rate theory[29] can be written aswhere ωU2 and
ω*2 are the curvatures of the unfolded well and barrier
top (transition state), respectively, while D is
the folding diffusion coefficient and ΔG* is
the free-energy barrier to folding at temperature T. Thus, the rate of folding is determined not just by the thermodynamic
barrier height (ΔG*) but also the curvatures
of the reactant and barrier top wells (ω2) and the
folding diffusion coefficient (D), with latter terms
being part of the pre-exponential factor, k0. The large residual frustration in DBDs can have an impact on the
magnitude of thermodynamic free-energy barriers or on the pre-exponential
factor to protein folding or both, thus governing the folding speed.
The effect on the diffusion constant in the pre-exponential has variously
been termed as internal friction or landscape roughness or ruggedness.[27,30−33] It arises from microscopic barriers to peptide bond rotations[34,35] and non-native interactions[36−38] that need to be broken to form
native contacts, with the latter likely dominating the dynamics of
single-domain proteins.[39] Such a feature
can manifest as roughness both along and orthogonal to the folding
coordinate, thus slowing down the folding rate.[40]Interestingly, many fast-folding proteins exhibit
a dramatic slowing
down of folding relaxation rates at low temperatures (∼5000–30,000
s–1 between 280–290 K) despite matching with
theoretical speed limit expectations of ∼106 s–1 at 333 K.[41−45] Given that they exhibit downhill folding profiles under these conditions,
the rates directly report on the pre-exponential factor to protein
folding and are suggestive of a large temperature dependence on the
pre-exponential term.[27] The large temperature
dependence is conventionally interpreted as large internal friction
that in turn slows down the folding diffusion coefficient (D in eq ).
However, it is also possible that a shallow landscape or a weak curvature
in the unfolded well or barrier top (ωU2 and ω*2 in eq ) could equally slow down the folding, with the degree of
curvature changing with temperature. In this regard, millisecond folders
are generally thought to fold slow because of large thermodynamic
barriers (>3 RT). While this expectation holds
true
for many systems,[46] it is possible that
some proteins exhibit enhanced frustration due to functional or regulatory
constraints that slow down the pre-exponential factor, similar to
the extent observed in downhill-folding proteins. Recent simulations
highlight that the dynamics of α-helical systems are more sensitive
to frustration than those of β-sheet proteins,[47] and this is borne out in the studies of R17 and α3D.[48,49] Interestingly, even β-sheet
proteins exhibit an order of magnitude difference in the pre-exponential
factor within members of the same family.[50]These observations raise questions on the extent to which
the magnitude
of the pre-exponential factor affects the folding speed of slow-folding
α-helical proteins and whether one can extract the thermodynamic
and dynamic contributions to the rate equation by studying them. We
extend this question further and ask if the slow folding is due to
large free-energy barriers separating the various substrates, strong
internal friction, or the manifestation of broad unfolded wells and
barrier tops (indicative of shallow landscapes with multiple minima).
We answer these questions by probing the folding conformational landscape
of the DBD of PurR (purine repressor), a 56-residue helical domain
belonging to the LacR family of transcription regulators. PurR exhibits
large electrostatic frustration throughout its solvent-exposed surface
including the DNA-binding face (Figure A,B). This structural feature is surprisingly similar
to its homologue CytR that is disordered in solution and folds in
the presence of DNA[51] and unlike other
related proteins of the same family (Figure S1). Such strong frustration is expected to not only destabilize the
protein but also contribute to pockets of local structure that can
impede access to the folded state during folding of the protein. Here,
we combine experiments, statistical mechanical modeling, and coarse-grained
simulations to show that PurR exhibits little apparent internal friction,
thermodynamically uncoupled folding of structural elements, and downhill-like
folding profiles with multiple partially structured states. Despite
this, PurR folds slowly, hinting that the slow folding has its likely
origins in a shallow landscape.
Figure 1
(A) Cartoon representation of PurR (PDB
id 1pru) highlighting
the
DNA-binding helix (green) and the intrinsic fluorescence probes W37
(blue) and Y45 (red). The helical sequence boundaries are 5–11,
15–23, and 31–43 for helices 1, 2, and 3, respectively.
(B) Electrostatic potential maps of PurR DNA-binding face (left) and
the opposite face (right). The large positive potential on the DNA-binding
face enables PurR to bind DNA.
(A) Cartoon representation of PurR (PDB
id 1pru) highlighting
the
DNA-binding helix (green) and the intrinsic fluorescence probes W37
(blue) and Y45 (red). The helical sequence boundaries are 5–11,
15–23, and 31–43 for helices 1, 2, and 3, respectively.
(B) Electrostatic potential maps of PurR DNA-binding face (left) and
the opposite face (right). The large positive potential on the DNA-binding
face enables PurR to bind DNA.
Methods
The DBDs of the transcriptional repressor PurR (corresponding to
residues 1–57 of the protein with the sequence: MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVN) and its truncated variant (lacking the disordered
C-terminal tail highlighted in bold) were overexpressed and purified
as previously described.[26] Far-UV and near-UV
circular dichroism (CD) and fluorescence experiments on PurR and its
variants were performed as described before.[52] All experiments were carried out in pH 7.0, 20 mM sodium phosphate
buffer [effective ionic strength (IS) of 43 mM], unless otherwise
mentioned. Chemical denaturation experiments monitored by far-UV CD
and fluorescence were performed at 285, 298, and 310 K at protein
concentrations of ∼25 and ∼10 μM, respectively.
The samples were incubated for 2 h in increasing concentrations of
urea (0–8 M, in intervals of 0.5 M) in 20 mM sodium phosphate
buffer, pH 7.0 before measurements.
Stopped-Flow Kinetics
The folding and unfolding traces
of PurR and its mutants were recorded at 285, 298, and 310 K by fluorescence
using a Chirascan SF3 Stopped Flow instrument (deadtime of ∼1–2
ms; Applied Photophysics Ltd.) coupled to a thermostated water bath
as described earlier.[52] For temperature-dependent
folding kinetics, denatured PurR in 6 M urea was refolded at 0.55
M urea (final protein concentration ∼10 μM) at temperatures
285–305 K in steps of 2.5 K. The folding traces were recorded
by exciting the protein with a 280 nm light-emitting diode (LED).
A thousand data points were collected for every scan, and six scans
were recorded at an interval of every 1 min at every temperature.
The unfolding and refolding kinetics of PurR at different glucose
(0.25–1.5 M) and urea concentrations (0–8 M) were performed
in a similar manner to determine the rate dependence on solvent viscosity.
A rolling-ball micro-viscometer (Lovis 2000 ME, Anton Paar) with a
built-in temperature controller was employed to measure the dynamic
viscosity of urea-glucose mixtures at 285 K.
Differential Scanning Calorimetry
A Microcal VP-DSC
microcalorimeter (Malvern Ltd.) coupled to an automated sample injector
was employed to measure heat capacity profiles. All samples were degassed
at room temperature prior to calorimetric measurements. Desalted protein
solutions of the wild-type (WT) PurR and the truncated variant (concentrations
∼25 to ∼100 μM) and buffers were scanned at a
rate of 1 K/min. Calorimetric cells were maintained under an excess
pressure of 60 psi to prevent boiling at high temperatures. Buffer–buffer
baselines before and after the protein scans were routinely acquired
to ensure there was little thermal drift. The resulting apparent heat
capacities were converted into absolute units following the method
of Sanchez-Ruiz and co-workers.[53]
Fluorescence
Lifetime Measurements
Time-dependent fluorescence
intensity decays of W37 in PurR were recorded in a ChronosBH (ISS
Inc.) spectrometer coupled to a Peltier temperature controller. The
excitation pulse (from a 300 nm LED) and emitted photons were passed
through UV grade Glan-Thompson polarizers set at 0 and 54.7°,
respectively, from the vertical z-axis. The instrument
response function was measured using LUDOX solution. The emitted photons
were passed through a 345 nm long-pass filter (SCHOTT) to minimize
scattering artifacts. All decay curves were recorded until the peak
count reached 104 or the total count approached 108. The traces were fitted to biexponential functions with the
χ2 values being <1.5 at all temperatures.
WSME Model
The native-centric Wako-Saitô-Muñoz-Eaton
(WSME) model, coarse-grained at the residue level,[54,55] was employed to derive the thermodynamic landscape of PurR. Briefly,
the WSME model assigns a residue conformational status 0 for unfolded
and 1 for folded residues, enabling every possible microstate to be
represented as strings of 1s and 0s. Instead of employing the version
that accounts 2 states (where N is the protein length), we employ an advanced version
of the WSME model[56,57] that includes contributions from
single and two stretches of folded residues (single and double sequence
approximations) while also allowing for interactions between the two
folded stretches, thus accounting for a total of 791,617 microstates.
The statistical weight of each of the microstates includes contributions
from van der Waals interactions (heavy-atom neighbors identified with
a 5 Å distance cutoff from the PDB file 1pru), electrostatics
(all-to-all native electrostatics with an effective dielectric constant
of 29 in the Debye–Hückel formalism), implicit solvation,
and excess conformational entropy of −6.1 J mol–1 K–1 per residue for nonhelical disordered residues
identified by STRIDE.[58] Proline at position
47 was assigned an entropic penalty of 0, given its limited backbone
flexibility. Heat capacity profiles were generated from derivatives
of the total partition function (Z), while free-energy
profiles and conformational landscapes were obtained by algorithmically
grouping microstates with a specific number of structured residues.
The final parameters, obtained by quantitatively reproducing the differential
scanning calorimetry (DSC) curve, are ξ = −106.5 ±
0.62 J mol–1 (van der Waals interaction energy per
native contact), ΔSconf = −16.5
± 0.12 J mol–1 K–1 per residue
for all residues except proline (entropic penalty for fixing a residue
in the native conformation), and a = 1.91 ±
0.01 kJ mol–1 K–1. The parameter a determines the intercept of the native heat capacity baseline
in the equation N = (a + 0.0067
× (T – 273.15))Mw/1000 where Mw is the molecular
weight of PurR (6282.2 g mol–1).
Coarse-Grained
Simulations
Molecular simulations were
run using the Karanicolas and Brooks structure-based model that is
coarse-grained to the level of Cαs.[59] In this model, the potential energy is defined as the sum
of terms for bonds, angles, torsions, and nonbonded interactions.
The terms for bonds and angles are harmonic with the equilibrium values
corresponding to those between pairs of Cα beads.
Propensities for the dihedral energy terms are derived from a statistical
analysis of structures in the PDB. Finally, favorable nonbonded terms
are included for pairs of beads corresponding to amino acids whose
heavy atoms are “in contact” (i.e., their distance is
under a cutoff of 4.5 Å) in the reference native structures.
In addition to the prescription by Karanicolas and Brooks, we incorporate
the effects of electrostatic and non-native interactions following
Kim and Hummer (KH), as before.[60,61] Interactions between
charged residues are defined using a Debye–Hückel potentialwhere q and q are the
net amino acid charges, D is the dielectric constant,
and ξ is the screening length. Non-native interactions are described
using a sequence-dependent Lennard-Jones potential that replaces the
excluded volume term in the Karanicolas and Brooks model. Further
details on the models can be found elsewhere.[60,61]We generated simulation models using the PDB structure for
PurR (1pru).
Simulations were run at temperatures ranging between 270 and 360 K
using a Langevin integrator with a friction coefficient of 0.2 ps–1 and a time step of 10 fs using Gromacs 4.0.5.[62] To analyze the data, we project the resulting
trajectories on the fraction of native contacts, Q, and combine the information from multiple temperatures using the
weighted histogram analysis method.[63]
Results and Discussion
Equilibrium Thermodynamics and Slow-Folding
Kinetics of PurR
The equilibrium thermal unfolding of WT
PurR DBD (hereon referred
to as PurR) as monitored by far-UV CD at 222 nm exhibits the characteristic
sigmoidal profile expected of two-state systems with well-defined
pre- and post-transition regions (Figure A). The data can be fit to a two-state model
with a Tm of 323.5 ± 0.5 K and ΔHm of 149.7 ± 9.5 kJ mol–1 (error bars represent 95% confidence here and throughout the text),
similar to most mesophilic proteins. PurR being a small helical domain
has a long-range order (LRO[64]) of just
∼0.39; this is indicative of local interactions dominating
the contact energetics and predicts that PurR should fold relatively
fast in the microsecond timescale. However, stopped-flow kinetics
points to well-defined kinetic phases described by single exponential
functions with the kinetic amplitudes matching the populations from
equilibrium measurements (inset to Figures B and S2C). A
chevronlike behavior is observed at all experimental temperatures
with an extrapolated folding rate constant in the absence of urea
of just ∼71 s–1 at 285 K that increases to
∼347 s–1 at 310 K (Figures B and S2A). PurR,
therefore, falls well below the expected range in the plot of LRO
versus folding rate (Figure C). Moreover, changing the conditions (increasing temperatures
or IS to 0.5 M) affects the folding rate only marginally (Figures C and S2B). Thus, though PurR exhibits an apparent
two-state equilibrium, it folds significantly slower than the expectation
from LRO predictions, arguing for specific sequence effects determining
its folding behavior. In the next few sections, we systematically
explore the likely reasons for slow folding combining experiments
and simulations.
Figure 2
Slow Folding of PurR. (A) Thermal unfolding profile of
PurR as
monitored by far-UV CD at 222 nm plotted in mean residue ellipticity
units of deg cm2 dmol–1. The vertical
dashed line signals the melting temperature from a two-state fit (red
curve), while U and F represent unfolded and folded baselines, respectively.
(B) Folding kinetics from stopped-flow experiments with open and filled
circles representing the measured refolding and unfolding relaxation
rates, respectively. Inset: relative kinetic amplitudes (circles)
and equilibrium populations derived from a two-state model (red curve).
(C) Correlation between LRO and folding rates for a database of proteins[65] (black circles). Blue, green, and red stars
signal the PurR folding rate constants at 285, 298, and 310 K, respectively,
at 43 mM IS, pH 7.0. Blue and green circles are the folding rates
of PurR at 285 and 298 K at 500 mM IS, pH 7.0.
Slow Folding of PurR. (A) Thermal unfolding profile of
PurR as
monitored by far-UV CD at 222 nm plotted in mean residue ellipticity
units of deg cm2 dmol–1. The vertical
dashed line signals the melting temperature from a two-state fit (red
curve), while U and F represent unfolded and folded baselines, respectively.
(B) Folding kinetics from stopped-flow experiments with open and filled
circles representing the measured refolding and unfolding relaxation
rates, respectively. Inset: relative kinetic amplitudes (circles)
and equilibrium populations derived from a two-state model (red curve).
(C) Correlation between LRO and folding rates for a database of proteins[65] (black circles). Blue, green, and red stars
signal the PurR folding rate constants at 285, 298, and 310 K, respectively,
at 43 mM IS, pH 7.0. Blue and green circles are the folding rates
of PurR at 285 and 298 K at 500 mM IS, pH 7.0.
Minimal Apparent Internal Friction
One of the primary
origins of slow folding is internal friction. Although a rigorous
definition of internal friction has been elusive,[34,35,39] it is broadly accepted that its effects
are encapsulated within the folding diffusion coefficient D in Kramers rate theory (eq ). For practical purposes, internal friction effects
are quantified experimentally by measuring the kinetic rate constants
at varying concentrations of viscogens that increase the solvent viscosity
(η) and assuming that the folding free energy surface is unaltered.[66] If the dynamics are enslaved to friction from
the solvent, because the friction coefficient varies in proportion
to the viscosity, we can rewrite eq as[67]where A is a constant that
includes the contributions of the energy landscape to the pre-exponential
(i.e., the curvatures at the bottom of the well and the barrier top).
Under iso-stability conditions, if we divide the folding rate at a
reference viscosity η0 by that at a working viscosity η, that is, kf,0/kf, the exponential term
and A cancel out. Hence, the plot of kf,0/kf versus the ratio of
the viscosities η/η0 would follow a straight
line with a slope of 1 and 0 offset.[67] If
instead, internal friction (σ) needs to be invoked, then eq reduces to[67]and an insensitivity
to the viscosity that
translates to a non-zero offset and a slope lower than 1 in the plot
of kf,0/kf versus η/η0.To explore
this, we measured the folding relaxation rates of PurR at different
glucose (0–1.5 M, the viscogen) and urea concentrations. A
chevronlike behavior is evident at various glucose concentrations
(Figures S3 and S4) from which the rates
are extracted at two different iso-stability conditions of 8.4 and
4 kJ mol–1. The measured relative rates scale directly
with the relative solvent viscosity for PurR (Figure A), and the folding times (τ) exhibit
a near-linear dependence on the relative viscosity (Figure B). These results indicate
that the slow folding of PurR is not a consequence of internal friction,
at least as conventionally interpreted.
Figure 3
Minimal apparent internal
friction. (A) Plots of relative folding
rates (ordinate) versus relative viscosity at 285 K and at two different
iso-stability conditions of 8.4 and 4 kJ mol–1,
respectively (blue and red). For the iso-stability condition of 8.4
kJ mol–1, the urea concentration spans the range
0–3.25 M at different glucose concentrations, while for 4 kJ
mol–1 iso-stability, the urea concentration range
is 1.95–5.55 M. (B) Plot of the folding time (ordinate) against
relative viscosity at iso-stability conditions of 8.4 kJ mol–1. Red line is the expected linear dependence.
Minimal apparent internal
friction. (A) Plots of relative folding
rates (ordinate) versus relative viscosity at 285 K and at two different
iso-stability conditions of 8.4 and 4 kJ mol–1,
respectively (blue and red). For the iso-stability condition of 8.4
kJ mol–1, the urea concentration spans the range
0–3.25 M at different glucose concentrations, while for 4 kJ
mol–1 iso-stability, the urea concentration range
is 1.95–5.55 M. (B) Plot of the folding time (ordinate) against
relative viscosity at iso-stability conditions of 8.4 kJ mol–1. Red line is the expected linear dependence.
Marginal Thermodynamic Barriers
An alternate possibility
is that PurR folds over large free-energy barriers that in turn contribute
to its unusually slow folding. DSC is an ideal avenue for extracting
the thermodynamic barriers,[68] given the
fundamental connection between heat capacity and partition function,[69,70] which has been validated in several small single-domain proteins.[71,72] To probe if the folding thermodynamics of PurR is characterized
by large thermodynamic barriers, we measured the absolute heat capacity
profile of PurR from the dependence of the apparent heat capacity
on protein concentrations (Figure S5A).
The resulting DSC profile is broad, upshifted from the expected Freire
baseline, exhibits little pretransition, cannot be characterized by
a chemical two-state model (Figure S5B),
and therefore displays the features of marginal barrier systems (Figure A). The broad DSC
curve is not a consequence of the disordered C-terminal tail as the
PurR variant without the disordered tail does not change the overall
broadness of the DSC curve and the cooperativity as monitored by far-UV
CD or the relaxation rates (Figure S6).
The WSME model fits with detailed energetics reproduce the experimental
heat capacity profile of the WT PurR very well (see Methods for parameters). The one-dimensional free-energy profiles
as a function of the number of structured residues (the reaction coordinate)
reveal a downhill-like folding gradient at low temperatures and a
maximal thermodynamic barrier of just ∼3 kJ mol–1 at 329 K (Figure B). The probability distribution at 329 K is broad and without well-defined
unfolded or partially structured states (Figure C).
Figure 4
Marginal thermodynamic barriers. (A) Absolute
heat capacity of
PurR (circles) fit to the WSME model (red) together with the predicted
native baseline (black). The thermodynamic fluctuations of well-folded
proteins follow a specific trend with temperature and molecular weight
that is represented by the Freire baseline (green). The observed upward
shift of the experimental data is generally seen as evidence for large
thermodynamic fluctuations in the native ensemble. The vertical dashed
line signals the melting temperature from a two-state fit. (B,C) Predicted
free-energy profiles (panel B, in kJ mol–1) and
probability distributions (panel C) at 278 K (blue), 298 K (green),
310 K (red), and 329 K (black). Note the downhill-folding profiles
at low temperatures and the marginal barrier with multiple partially
structured states at 329 K. (D) Heat capacity curves derived from
coarse-grained simulations with native (Go̅) and native plus
non-native potentials (Go̅ + KH). (E,F) Free-energy profiles
(panel E, in kJ mol–1) and the corresponding density
distributions (panel F) as a function of the fraction of native contacts
for the Go̅ + KH potential at 270 K (blue), 280 K (green), 290
K (red), and 310 K (black). Note that 310 K represents the apparent
melting temperature in the relative energy scale of the coarse-grained
simulations.
Marginal thermodynamic barriers. (A) Absolute
heat capacity of
PurR (circles) fit to the WSME model (red) together with the predicted
native baseline (black). The thermodynamic fluctuations of well-folded
proteins follow a specific trend with temperature and molecular weight
that is represented by the Freire baseline (green). The observed upward
shift of the experimental data is generally seen as evidence for large
thermodynamic fluctuations in the native ensemble. The vertical dashed
line signals the melting temperature from a two-state fit. (B,C) Predicted
free-energy profiles (panel B, in kJ mol–1) and
probability distributions (panel C) at 278 K (blue), 298 K (green),
310 K (red), and 329 K (black). Note the downhill-folding profiles
at low temperatures and the marginal barrier with multiple partially
structured states at 329 K. (D) Heat capacity curves derived from
coarse-grained simulations with native (Go̅) and native plus
non-native potentials (Go̅ + KH). (E,F) Free-energy profiles
(panel E, in kJ mol–1) and the corresponding density
distributions (panel F) as a function of the fraction of native contacts
for the Go̅ + KH potential at 270 K (blue), 280 K (green), 290
K (red), and 310 K (black). Note that 310 K represents the apparent
melting temperature in the relative energy scale of the coarse-grained
simulations.To gain further insight into the
folding of PurR, we have run coarse-grained
simulations using the model by Karanicolas and Brooks.[59] To test for the role of non-native interactions
in modulating barrier height magnitudes, we supplemented the original
model with the transferable Kim−Hummer potential, which incorporates
non-native hydrophobic and electrostatic interactions. Interestingly,
the resulting thermogram is broader when non-native states are considered
compared to the pure Go̅-like potential (Figure D). A similar broad thermogram is observed
on different realizations of the energy function, where we separately
consider the influence of non-native interactions and charges (Figure S7). In all cases, we find little evidence
for large thermodynamic barriers, and the resulting barrier height
magnitudes (Figures E and S7) are consistent with the predictions
from the WSME model. This agreement across models with different degrees
of structural resolution and energy functions is strong evidence that
PurR folds over only small thermodynamic barriers (∼1 to 2 kBT) and that the slow-folding
rate has other molecular origins.
Non-Cooperative Unfolding
and Partially Structured States
One striking aspect of the
free-energy profiles is that they point
to three partially structured states (as the barriers are small, we
cannot classify them as true intermediates) that are populated en route to folded state in the WSME model and at least
one partially structured state in the coarse-grained simulations.
To further test for this feature from experiments, we performed multiprobe
spectroscopic measurements. We identify considerable differences in
melting temperatures of ∼5 K in PurR when analyzed by a two-state
model: Tm of 321.2 ± 1.2 from quantum
yield (QY) measurements on excitation at 295 nm (QY295)
(Figure S8A), 323.2 ± 1.0 from QY
measurements at 274 nm excitation (QY274) (Figure S8A), 323.5 ± 0.5 K from far-UV CD
(Figure A), and 326.9
± 0.3 K from DSC (Figure A). Remarkably, the wavelength of maximum fluorescence emission
(λmax) of the sole tryptophan (W37) displays a transition
point of 333 K (from a first-derivative analysis, as the baseline
for the unfolded state is not well defined) which is nearly 12 K higher
than the melting temperature obtained from QY measurements, that is,
from the same data set (Figure A). This is more apparent when one observes that λmax starts changing only at about 320 K, while the heat capacity
profile already displays large changes at this temperature.
Figure 5
Non-cooperative
unfolding thermodynamics from experiments and simulations.
(A) Changes in fluorescence emission maximum of PurR as a function
of temperature and the corresponding inflection point of 332 K (dashed
line). The melting temperature of 321 K from changes in fluorescence
intensity at 295 nm from the same experiment is shown as a continuous
black line for reference. (B) FLT amplitudes of PurR for the long
(blue) and short (red) lifetime components, together with the crossover
temperature of 332 K. (C) Predicted unfolding curves for the different
secondary structure elements (helix 1/H1, helix 2/H2, and helix 3/H3)
of PurR from the WSME model. The corresponding inflection points from
first-derivative analysis are marked at 325, 332, and 328 K in blue,
green, and red, respectively. (D) Temperature dependence of the fraction
of native contacts (Q) for contacts formed by the
different protein helices from coarse-grained simulations. (E) Overlay
of multiple snapshots corresponding to the partially structured state
observed in the simulations at a reaction coordinate value of Q ∼ 0.5 (colored tube). We show a transparent cartoon
representation of the native state for reference.
Non-cooperative
unfolding thermodynamics from experiments and simulations.
(A) Changes in fluorescence emission maximum of PurR as a function
of temperature and the corresponding inflection point of 332 K (dashed
line). The melting temperature of 321 K from changes in fluorescence
intensity at 295 nm from the same experiment is shown as a continuous
black line for reference. (B) FLT amplitudes of PurR for the long
(blue) and short (red) lifetime components, together with the crossover
temperature of 332 K. (C) Predicted unfolding curves for the different
secondary structure elements (helix 1/H1, helix 2/H2, and helix 3/H3)
of PurR from the WSME model. The corresponding inflection points from
first-derivative analysis are marked at 325, 332, and 328 K in blue,
green, and red, respectively. (D) Temperature dependence of the fraction
of native contacts (Q) for contacts formed by the
different protein helices from coarse-grained simulations. (E) Overlay
of multiple snapshots corresponding to the partially structured state
observed in the simulations at a reaction coordinate value of Q ∼ 0.5 (colored tube). We show a transparent cartoon
representation of the native state for reference.To validate this observation further, we performed fluorescence
lifetime (FLT) analysis of W37 located in helix 3 making long-range
interactions with helix 2; the advantage of this technique is that
the signals and the species population can be directly decoupled without
resorting to baselines. FLT measurements reveal two lifetimes for
W37, the longer one (∼7 ns) corresponding to the main-chain
conformation or side-chain orientation sensitive to the folded state
and the shorter one (∼1 ns) representing the unfolded conformation
(Figure S8B,C). The corresponding amplitudes
follow a clear sigmoidal pattern with the amplitude crossover at 332
K (Figure B), very
similar to the inflection point of fluorescence λmax changes. The large difference in apparent melting temperatures that
is consistent across different experimental probes thus suggests that
W37 is differentially sensitive to the folding environment, with the
fluorescence intensity (from which QY is calculated) and λmax sensitive to global and local unfolding events around the
tryptophan, respectively.These experimental observations are
strikingly captured by both
the WSME model and coarse-grained simulations. The WSME model predicts
that the individual helical elements exhibit differences in the overall
stability that translates to differences in the melting temperature
ranging from 325–332 K, with 332 K being the melting temperature
of helix 2 (Figure C). Coarse-grained molecular simulations paint a picture of the folding–unfolding
equilibrium qualitatively consistent with that of the WSME model.
In Figure D, we show
the average values of Q calculated for contacts involving
helix 1, 2, or 3 (we note that these may include some overlapping
pairs of interactions). Clearly, the melting of the interactions formed
by the helices is decoupled, with helix 1 being first to unfold, helix
3 being the closest to the average unfolding, and helix 2 being third.
We note that the rank order in the melting temperatures is exactly
the same as that from the WSME model. The decoupled unfolding manifests
as a partially structured state during (un)folding at a reaction coordinate
value of ∼0.5 (Figure E) characterized by a detached helix 1 that samples varied
conformations (Figure E). The emergence of this partially structured state results in different
conformational changes being probed preferentially at different temperatures
(Φ-values for these barrier crossing events calculated from
the simulations are shown in Figure S9).
At the same time, the unique peak in the heat capacity curve (Figure ) does not warrant
a description in terms of separate thermodynamic transitions.
Strong
Temperature Dependence on the Pre-Exponential to Folding
In this section, we explore the extent to which the pre-exponential
factor described in eq needs to change with temperature to account for the observed slow
folding. Since the thermodynamic free-energy profiles are available,
it should be possible to extract this pre-exponential factor to protein
folding (k0) by measuring the folding
relaxation rates at a range of temperatures, similar to earlier work
on fast-folding proteins.[73] Note that this
approach does not disentangle the different terms in k0 but serves to only identify its dependence on temperature.
We measured the folding relaxation rates of PurR at a range of temperatures
from 285–305 K and at a final urea concentration of 0.55 M
(near-native conditions; Figure A). The measured rates are surprisingly similar to
the sampling rate of an excited state in the disordered ensemble of
CytR[74] and the folding kinetics of an engineered
folded variant CytR A28V/A48M (double mutant, DM) that exhibits an
equally complex thermodynamic behavior with equilibrium melting temperature
differences of ∼10 K.[52]
Figure 6
Strong temperature
dependence on the pre-exponential factor to
folding. (A) Observed relaxation rates of PurR from stopped-flow experiments
at near-native conditions of 0.55 M urea (open red circles) starting
from protein denatured in 6 M urea. The corresponding rates from chevron
plots (Figures B and S2) are shown in filled red circles highlighting
the internal consistency across different experiments. The fit from
diffusive calculations on the free-energy profiles (Figure B) is shown as a red curve.
The observed relaxation rates for the sampling of an excited folded
conformation in the disordered ensemble of CytR is shown in green.
Black circles represent the relaxation rates for the folded variant
of CytR, termed the CytR DM (double mutant, A29V/A48M).[52] (B) Measurements of the pre-exponential factor
from experiments (blue and magenta) versus that estimated for the
folded PurR (red) and the disordered CytR (green).
Strong temperature
dependence on the pre-exponential factor to
folding. (A) Observed relaxation rates of PurR from stopped-flow experiments
at near-native conditions of 0.55 M urea (open red circles) starting
from protein denatured in 6 M urea. The corresponding rates from chevron
plots (Figures B and S2) are shown in filled red circles highlighting
the internal consistency across different experiments. The fit from
diffusive calculations on the free-energy profiles (Figure B) is shown as a red curve.
The observed relaxation rates for the sampling of an excited folded
conformation in the disordered ensemble of CytR is shown in green.
Black circles represent the relaxation rates for the folded variant
of CytR, termed the CytR DM (double mutant, A29V/A48M).[52] (B) Measurements of the pre-exponential factor
from experiments (blue and magenta) versus that estimated for the
folded PurR (red) and the disordered CytR (green).We extract the pre-exponential term by performing diffusive
calculations
on the free-energy profiles generated by the WSME model (thus accounting
for barrier effects) by solving the one-dimensional diffusion equation
using the rata-matrix approach of Hofrichter and co-workers.[75] We employ a phenomenological Arrhenius dependence
on k0 as Ae–, where Ea is the activation energy that approximates the landscape
roughness including solvent effects and changes in shape of the free-energy
profile with temperature.[73] The temperature
dependence is captured well only when k0 varies from ∼3000 s–1 at 285 K to ∼200,000
s–1 at 320 K (Figure B). It is pertinent to note that the former is just
an order of magnitude slower than the relaxation rates measured for
downhill-folding proteins while agreeing well with the downhill estimates
at higher temperatures.The activation energy is found to be
80 or 1.43 kJ mol–1 per residue, nearly 40% higher
than the average per residue dependence
of fast-folding proteins.[73] Interestingly,
the pre-exponential terms match with those of the disordered CytR
at low temperatures that displays a shallow, bumpy landscape from
experiments and simulations.[74]
Discussion
The conformational behavior of PurR is thus observed to be complex,
with some attributes from each of downhill (spread in melting temperatures,
marginal barriers, and downhill-folding profiles from simulations),
two-state folding (chevron kinetics and sigmoidal unfolding curves),
and multistate folding (partially structured states from simulations).
These features are captured well at a (semi-)quantitative level by
the WSME model and variants of coarse-grained simulations, with the
latter including non-native energetic effects. The simulations paint
a dynamic picture in which the protein molecule struggles to fit in
the helices together as it folds. Accordingly, the three helices exhibit
graded thermodynamic stability that in turn lowers the thermodynamic
cooperativity and barriers but promotes pockets of partial structure
all along the folding coordinate that manifests as differences in
melting temperatures and broad heat capacity profiles. In other words,
the different structural elements are only weakly coupled, that is,
exhibiting large sensitivity to perturbations, contributing to complex
underlying landscape. Interestingly, the homologue CytR is disordered
in solution, while the engineered folded variant of CytR with two
hydrophobic substitutions (A29V/A48M) exhibits very similar features:
broad DSC profile, differences in melting temperatures of 10 K, heterogeneous,
and slow folding.[52]What is unique
about the structure of PurR that contributes to
the conformationally heterogeneous unfolding, despite exhibiting slow
two-state like chevron kinetics? In this regard, it is known that
DBDs function in a highly complex environment around DNA arising from
counterion condensation. It is therefore likely that the structure
of PurR is evolutionarily selected for folding and function in the
vicinity of DNA and not in the conditions employed in the current
set of experiments (43 mM IS). Increasing the solvent IS to mimic
the environment around DNA (>2 M IS[76,77]) results in
an unusual feature where parts of the PurR structure are lost (less
negative ellipticity at 222 nm) despite the increased stability (Figure A). This hints at
a complex surface electrostatic feature involving a combination of
both stabilizing and destabilizing effects and partitioning of local-nonlocal
electrostatics, with one effect dominating over the other depending
on the conditions. The folding relaxation rates increase on increasing
the solvent IS but fold faster than the dead-time of the stopped flow-instrument
even at 298 K at IS greater than 0.5 M (Figure S2). In fact, our observations are consistent with single-molecule
experiments on α3D that point to conflicting electrostatic
interactions as the primary source of internal friction or a slower
pre-exponential to folding.[49] Recent statistical
mechanical modeling of PurR folding in the presence of DNA demonstrates
a progressive titling of its landscape toward the folded state in
the vicinity of DNA, providing hints that the folding landscape could
be fine-tuned by quinary interactions.[78]
Figure 7
Electrostatic
frustration and weak packing contribute to a shallow
folding landscape. (A) Thermal unfolding curves of PurR at different
IS conditions as monitored by far-UV CD at 222 nm and reported in
mean-residue ellipticity units of deg cm2 dmol–1. Note the loss of secondary structure at high IS (black and red)
despite an increase in the melting temperature. (B) Thermal unfolding
curves of PurR mutants V21A (red) and I40A (green).[52] (C) Two-dimensional conformational landscape of PurR predicted
by the WSME model at 329 K highlighting the lack of large thermodynamic
barriers between folded (F), unfolded (U), and the numerous partially
structured states (valleys in dark blue shade). nN and nC represent the number
of structured residues in the N- and C-terminals, respectively. nN includes the first two helices, while the
rest of the sequence falls in nC. (D,E)
Two-dimensional free-energy landscape (in units of kBT) of PurR from coarse-grained simulations
pointing to a complex conformational ensemble with multiple valleys
(light and dark blue contours). The coordinates employed are the fraction
of native contacts in helix 1 (QH1), helix 2 (QH2), and helix 3 (QH3).
Electrostatic
frustration and weak packing contribute to a shallow
folding landscape. (A) Thermal unfolding curves of PurR at different
IS conditions as monitored by far-UV CD at 222 nm and reported in
mean-residue ellipticity units of deg cm2 dmol–1. Note the loss of secondary structure at high IS (black and red)
despite an increase in the melting temperature. (B) Thermal unfolding
curves of PurR mutants V21A (red) and I40A (green).[52] (C) Two-dimensional conformational landscape of PurR predicted
by the WSME model at 329 K highlighting the lack of large thermodynamic
barriers between folded (F), unfolded (U), and the numerous partially
structured states (valleys in dark blue shade). nN and nC represent the number
of structured residues in the N- and C-terminals, respectively. nN includes the first two helices, while the
rest of the sequence falls in nC. (D,E)
Two-dimensional free-energy landscape (in units of kBT) of PurR from coarse-grained simulations
pointing to a complex conformational ensemble with multiple valleys
(light and dark blue contours). The coordinates employed are the fraction
of native contacts in helix 1 (QH1), helix 2 (QH2), and helix 3 (QH3).Additionally, protein engineering experiments highlight an unusual
packing thermodynamics with single-point mutations V21A and I40A strongly
destabilizing the protein (Figure B).[52] In fact, the V21A
mutation fully unfolds PurR, arguing for a weak hydrophobic effect
driving folding in this system. This observation is borne out by the
fact that a close family member, CytR, is disordered in solution and
is a mere two hydrophobic substitutions from being folded at low temperatures.[26,52]Taken together, the molecular origin of slow folding in PurR
seems
to be a combination of multiple sequence-structural features that
in turn affect the pre-exponential to folding and not because of large
thermodynamic barrier height. We find evidence for this from a steep
temperature dependence of relaxation rates and the extracted pre-exponential
terms that match with downhill folding proteins at higher temperatures
and are slower by an order of magnitude at lower temperatures. Additionally,
the free-energy landscape of PurR is complex when assuming two conformational
coordinates from the WSME model or coarse-grained simulations. In
the former, it is seen that nearly all conformations are equally likely
with a broad distribution of molten-globule-like conformations at
the midpoint with no well-defined funnel toward the folded state (Figure C). This is also
observed in two-dimensional probability density plots from coarse-grained
simulations, wherein partially structured states are populated irrespective
of the projections (Figure D,E; see also contact maps in Figure S10).Experiments on unfolded protein L under folding conditions
highlight
a dramatic slowdown in folding diffusion coefficient compared to unfolding
conditions,[79] while single-molecule experiments
point to large contributions from internal friction on even unfolded
and disordered proteins.[80] Given these
observations and since internal friction effects arise likely from
microscopic barriers to dihedral motions,[34] it is surprising to find that conventional measures of internal
friction fail to reveal any landscape roughness or apparent internal
friction in PurR. If the established techniques for “measuring”
internal friction are robust (Figure ), assuming a one-dimensional coordinate, it can therefore
be concluded that the folding landscape of PurR is characterized by
broad unfolded and folded basins with no well-defined transition-state
ensemble. In terms of Kramers theory of reaction rates, these features
hint at small curvatures (ω2) of unfolded well and
barrier top in PurR, which in turn slow down the folding rate and
not through a slow-folding diffusion coefficient. Experimental works
narrowing the barrier increase the folding rate by an order of magnitude,[81] thus suggesting that broader barriers or unfolded
wells would proportionately slow down folding. An alternate possibility
is that denaturant-stabilizer mixtures (as in urea-glucose use in
the current study) modulate both the intramolecular diffusion coefficient
and barriers in compensatory fashions, resulting in a linear relative
viscosity versus rate plot (Figure ). It is important to note that the extent of trapping
in the populated partially structured states could also be different
that necessitates the use of a coordinate dependent diffusion coefficient
or “heterogeneous friction.”[67] A two-dimensional landscape, on the other hand, could contribute
to complex distribution of folding fluxes, again requiring additional
slow diffusional terms. Experiments on mutants that populate one or
more of the partially structured states could be specifically employed
to probe for such coordinate-dependent effects.
Conclusions
In
summary, our experiments combined with simulations rule out
a large thermodynamic barrier in PurR and point to the slaving of
the dynamics to the shape of the underlying free-energy landscape
that appears as a strong temperature dependence on the pre-exponential,
slowing down folding. Structural, mutational, and thermodynamic analyses
of packing and electrostatic effects indicate that PurR is at the
threshold of disorder. We also find that studies combining scanning
calorimetry experiments, multiprobe spectroscopy, viscosity-dependent
kinetics, and thermodynamic modeling can provide an unparalleled view
on the underlying folding landscape of proteins. It still remains
to be seen if the slow folding determined by the slow pre-exponential
is a conserved feature of LacR/PurR family members. The suitability
of additional coordinates in explaining slow folding and the role
of DNA in smoothening the folding landscape or speeding up the folding
pre-exponential through its large negative electrostatic potential
also remain to be seen.
Authors: Stephen J DeCamp; Athi N Naganathan; Steven A Waldauer; Olgica Bakajin; Lisa J Lapidus Journal: Biophys J Date: 2009-09-16 Impact factor: 4.033
Authors: Beth G Wensley; Sarah Batey; Fleur A C Bone; Zheng Ming Chan; Nuala R Tumelty; Annette Steward; Lee Gyan Kwa; Alessandro Borgia; Jane Clarke Journal: Nature Date: 2010-02-04 Impact factor: 49.962
Authors: Dmitry M Korzhnev; Robert M Vernon; Tomasz L Religa; Alexandar L Hansen; David Baker; Alan R Fersht; Lewis E Kay Journal: J Am Chem Soc Date: 2011-06-28 Impact factor: 15.419
Authors: Athi N Naganathan; Peng Li; Raúl Perez-Jimenez; Jose M Sanchez-Ruiz; Victor Muñoz Journal: J Am Chem Soc Date: 2010-08-18 Impact factor: 15.419