Literature DB >> 32955882

Slow Folding of a Helical Protein: Large Barriers, Strong Internal Friction, or a Shallow, Bumpy Landscape?

Sandhyaa Subramanian1, Hemashree Golla1, Kalivarathan Divakar2, Adithi Kannan1, David de Sancho3,4, Athi N Naganathan1.   

Abstract

The rate at which a protein molecule folds is determined by opposing energetic and entropic contributions to the free energy that shape the folding landscape. Delineating the extent to which they impact the diffusional barrier-crossing events, including the magnitude of internal friction and barrier height, has largely been a challenging task. In this work, we extract the underlying thermodynamic and dynamic contributions to the folding rate of an unusually slow-folding helical DNA-binding domain, PurR, which shares the characteristics of ultrafast downhill-folding proteins but nonetheless appears to exhibit an apparent two-state equilibrium. We combine equilibrium spectroscopy, temperature-viscosity-dependent kinetics, statistical mechanical modeling, and coarse-grained simulations to show that the conformational behavior of PurR is highly heterogeneous characterized by a large spread in melting temperatures, marginal thermodynamic barriers, and populated partially structured states. PurR appears to be at the threshold of disorder arising from frustrated electrostatics and weak packing that in turn slows down folding due to a shallow, bumpy landscape and not due to large thermodynamic barriers or strong internal friction. Our work highlights how a strong temperature dependence on the pre-exponential could signal a shallow landscape and not necessarily a slow-folding diffusion coefficient, thus determining the folding timescales of even millisecond folding proteins and hints at possible structural origins for the shallow landscape.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32955882      PMCID: PMC7659034          DOI: 10.1021/acs.jpcb.0c05976

Source DB:  PubMed          Journal:  J Phys Chem B        ISSN: 1520-5207            Impact factor:   2.991


Introduction

The marginal stability of most proteins has its origins in the strength and nature of interactions including hydrophobic packing in the protein interior and surface electrostatics that can be both favorable and unfavorable. The requirements for marginal stability are manifold and can range from regulatory needs of the organism to efficient folding and function.[1−9] These conflicting requirements contribute to the evolution of protein sequences and determine not just the identity of the amino acid at a specific location in the sequence but also subtly influence the immediate environment around it, contributing to the coevolution of sites far in the protein sequence.[10−14] Such features and constraints are driven to an extreme in DNA-binding domains (DBDs) whose entire nucleic acid-binding face exhibits a net positive electrostatic potential to bind the polyanionic partner. In fact, many of the DBDs display complex thermodynamics in the apo form, which has been attributed to the population of partially structured states in solution.[15−26] The conformational complexity need not be restricted to DBDs but likely applies to any macromolecule that binds an oppositely charged counterpart. “Frustration,” even if minimized by evolutionary forces to guarantee efficient folding, is a universal feature of proteins and enzymes and is predicted to influence or in some cases even drive folding-function behaviors.[27,28] An outstanding question is the extent to which such frustration (from geometrical constraints, packing interactions, or electrostatics) influences the folding landscape of DBDs and thereby the folding speed. Assuming a simple one-dimensional free-energy profile, the rate of diffusional barrier crossing (k) according to Kramers rate theory[29] can be written aswhere ωU2 and ω*2 are the curvatures of the unfolded well and barrier top (transition state), respectively, while D is the folding diffusion coefficient and ΔG* is the free-energy barrier to folding at temperature T. Thus, the rate of folding is determined not just by the thermodynamic barrier height (ΔG*) but also the curvatures of the reactant and barrier top wells (ω2) and the folding diffusion coefficient (D), with latter terms being part of the pre-exponential factor, k0. The large residual frustration in DBDs can have an impact on the magnitude of thermodynamic free-energy barriers or on the pre-exponential factor to protein folding or both, thus governing the folding speed. The effect on the diffusion constant in the pre-exponential has variously been termed as internal friction or landscape roughness or ruggedness.[27,30−33] It arises from microscopic barriers to peptide bond rotations[34,35] and non-native interactions[36−38] that need to be broken to form native contacts, with the latter likely dominating the dynamics of single-domain proteins.[39] Such a feature can manifest as roughness both along and orthogonal to the folding coordinate, thus slowing down the folding rate.[40] Interestingly, many fast-folding proteins exhibit a dramatic slowing down of folding relaxation rates at low temperatures (∼5000–30,000 s–1 between 280–290 K) despite matching with theoretical speed limit expectations of ∼106 s–1 at 333 K.[41−45] Given that they exhibit downhill folding profiles under these conditions, the rates directly report on the pre-exponential factor to protein folding and are suggestive of a large temperature dependence on the pre-exponential term.[27] The large temperature dependence is conventionally interpreted as large internal friction that in turn slows down the folding diffusion coefficient (D in eq ). However, it is also possible that a shallow landscape or a weak curvature in the unfolded well or barrier top (ωU2 and ω*2 in eq ) could equally slow down the folding, with the degree of curvature changing with temperature. In this regard, millisecond folders are generally thought to fold slow because of large thermodynamic barriers (>3 RT). While this expectation holds true for many systems,[46] it is possible that some proteins exhibit enhanced frustration due to functional or regulatory constraints that slow down the pre-exponential factor, similar to the extent observed in downhill-folding proteins. Recent simulations highlight that the dynamics of α-helical systems are more sensitive to frustration than those of β-sheet proteins,[47] and this is borne out in the studies of R17 and α3D.[48,49] Interestingly, even β-sheet proteins exhibit an order of magnitude difference in the pre-exponential factor within members of the same family.[50] These observations raise questions on the extent to which the magnitude of the pre-exponential factor affects the folding speed of slow-folding α-helical proteins and whether one can extract the thermodynamic and dynamic contributions to the rate equation by studying them. We extend this question further and ask if the slow folding is due to large free-energy barriers separating the various substrates, strong internal friction, or the manifestation of broad unfolded wells and barrier tops (indicative of shallow landscapes with multiple minima). We answer these questions by probing the folding conformational landscape of the DBD of PurR (purine repressor), a 56-residue helical domain belonging to the LacR family of transcription regulators. PurR exhibits large electrostatic frustration throughout its solvent-exposed surface including the DNA-binding face (Figure A,B). This structural feature is surprisingly similar to its homologue CytR that is disordered in solution and folds in the presence of DNA[51] and unlike other related proteins of the same family (Figure S1). Such strong frustration is expected to not only destabilize the protein but also contribute to pockets of local structure that can impede access to the folded state during folding of the protein. Here, we combine experiments, statistical mechanical modeling, and coarse-grained simulations to show that PurR exhibits little apparent internal friction, thermodynamically uncoupled folding of structural elements, and downhill-like folding profiles with multiple partially structured states. Despite this, PurR folds slowly, hinting that the slow folding has its likely origins in a shallow landscape.
Figure 1

(A) Cartoon representation of PurR (PDB id 1pru) highlighting the DNA-binding helix (green) and the intrinsic fluorescence probes W37 (blue) and Y45 (red). The helical sequence boundaries are 5–11, 15–23, and 31–43 for helices 1, 2, and 3, respectively. (B) Electrostatic potential maps of PurR DNA-binding face (left) and the opposite face (right). The large positive potential on the DNA-binding face enables PurR to bind DNA.

(A) Cartoon representation of PurR (PDB id 1pru) highlighting the DNA-binding helix (green) and the intrinsic fluorescence probes W37 (blue) and Y45 (red). The helical sequence boundaries are 5–11, 15–23, and 31–43 for helices 1, 2, and 3, respectively. (B) Electrostatic potential maps of PurR DNA-binding face (left) and the opposite face (right). The large positive potential on the DNA-binding face enables PurR to bind DNA.

Methods

The DBDs of the transcriptional repressor PurR (corresponding to residues 1–57 of the protein with the sequence: MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVN) and its truncated variant (lacking the disordered C-terminal tail highlighted in bold) were overexpressed and purified as previously described.[26] Far-UV and near-UV circular dichroism (CD) and fluorescence experiments on PurR and its variants were performed as described before.[52] All experiments were carried out in pH 7.0, 20 mM sodium phosphate buffer [effective ionic strength (IS) of 43 mM], unless otherwise mentioned. Chemical denaturation experiments monitored by far-UV CD and fluorescence were performed at 285, 298, and 310 K at protein concentrations of ∼25 and ∼10 μM, respectively. The samples were incubated for 2 h in increasing concentrations of urea (0–8 M, in intervals of 0.5 M) in 20 mM sodium phosphate buffer, pH 7.0 before measurements.

Stopped-Flow Kinetics

The folding and unfolding traces of PurR and its mutants were recorded at 285, 298, and 310 K by fluorescence using a Chirascan SF3 Stopped Flow instrument (deadtime of ∼1–2 ms; Applied Photophysics Ltd.) coupled to a thermostated water bath as described earlier.[52] For temperature-dependent folding kinetics, denatured PurR in 6 M urea was refolded at 0.55 M urea (final protein concentration ∼10 μM) at temperatures 285–305 K in steps of 2.5 K. The folding traces were recorded by exciting the protein with a 280 nm light-emitting diode (LED). A thousand data points were collected for every scan, and six scans were recorded at an interval of every 1 min at every temperature. The unfolding and refolding kinetics of PurR at different glucose (0.25–1.5 M) and urea concentrations (0–8 M) were performed in a similar manner to determine the rate dependence on solvent viscosity. A rolling-ball micro-viscometer (Lovis 2000 ME, Anton Paar) with a built-in temperature controller was employed to measure the dynamic viscosity of urea-glucose mixtures at 285 K.

Differential Scanning Calorimetry

A Microcal VP-DSC microcalorimeter (Malvern Ltd.) coupled to an automated sample injector was employed to measure heat capacity profiles. All samples were degassed at room temperature prior to calorimetric measurements. Desalted protein solutions of the wild-type (WT) PurR and the truncated variant (concentrations ∼25 to ∼100 μM) and buffers were scanned at a rate of 1 K/min. Calorimetric cells were maintained under an excess pressure of 60 psi to prevent boiling at high temperatures. Buffer–buffer baselines before and after the protein scans were routinely acquired to ensure there was little thermal drift. The resulting apparent heat capacities were converted into absolute units following the method of Sanchez-Ruiz and co-workers.[53]

Fluorescence Lifetime Measurements

Time-dependent fluorescence intensity decays of W37 in PurR were recorded in a ChronosBH (ISS Inc.) spectrometer coupled to a Peltier temperature controller. The excitation pulse (from a 300 nm LED) and emitted photons were passed through UV grade Glan-Thompson polarizers set at 0 and 54.7°, respectively, from the vertical z-axis. The instrument response function was measured using LUDOX solution. The emitted photons were passed through a 345 nm long-pass filter (SCHOTT) to minimize scattering artifacts. All decay curves were recorded until the peak count reached 104 or the total count approached 108. The traces were fitted to biexponential functions with the χ2 values being <1.5 at all temperatures.

WSME Model

The native-centric Wako-Saitô-Muñoz-Eaton (WSME) model, coarse-grained at the residue level,[54,55] was employed to derive the thermodynamic landscape of PurR. Briefly, the WSME model assigns a residue conformational status 0 for unfolded and 1 for folded residues, enabling every possible microstate to be represented as strings of 1s and 0s. Instead of employing the version that accounts 2 states (where N is the protein length), we employ an advanced version of the WSME model[56,57] that includes contributions from single and two stretches of folded residues (single and double sequence approximations) while also allowing for interactions between the two folded stretches, thus accounting for a total of 791,617 microstates. The statistical weight of each of the microstates includes contributions from van der Waals interactions (heavy-atom neighbors identified with a 5 Å distance cutoff from the PDB file 1pru), electrostatics (all-to-all native electrostatics with an effective dielectric constant of 29 in the Debye–Hückel formalism), implicit solvation, and excess conformational entropy of −6.1 J mol–1 K–1 per residue for nonhelical disordered residues identified by STRIDE.[58] Proline at position 47 was assigned an entropic penalty of 0, given its limited backbone flexibility. Heat capacity profiles were generated from derivatives of the total partition function (Z), while free-energy profiles and conformational landscapes were obtained by algorithmically grouping microstates with a specific number of structured residues. The final parameters, obtained by quantitatively reproducing the differential scanning calorimetry (DSC) curve, are ξ = −106.5 ± 0.62 J mol–1 (van der Waals interaction energy per native contact), ΔSconf = −16.5 ± 0.12 J mol–1 K–1 per residue for all residues except proline (entropic penalty for fixing a residue in the native conformation), and a = 1.91 ± 0.01 kJ mol–1 K–1. The parameter a determines the intercept of the native heat capacity baseline in the equation N = (a + 0.0067 × (T – 273.15))Mw/1000 where Mw is the molecular weight of PurR (6282.2 g mol–1).

Coarse-Grained Simulations

Molecular simulations were run using the Karanicolas and Brooks structure-based model that is coarse-grained to the level of Cαs.[59] In this model, the potential energy is defined as the sum of terms for bonds, angles, torsions, and nonbonded interactions. The terms for bonds and angles are harmonic with the equilibrium values corresponding to those between pairs of Cα beads. Propensities for the dihedral energy terms are derived from a statistical analysis of structures in the PDB. Finally, favorable nonbonded terms are included for pairs of beads corresponding to amino acids whose heavy atoms are “in contact” (i.e., their distance is under a cutoff of 4.5 Å) in the reference native structures. In addition to the prescription by Karanicolas and Brooks, we incorporate the effects of electrostatic and non-native interactions following Kim and Hummer (KH), as before.[60,61] Interactions between charged residues are defined using a Debye–Hückel potentialwhere q and q are the net amino acid charges, D is the dielectric constant, and ξ is the screening length. Non-native interactions are described using a sequence-dependent Lennard-Jones potential that replaces the excluded volume term in the Karanicolas and Brooks model. Further details on the models can be found elsewhere.[60,61] We generated simulation models using the PDB structure for PurR (1pru). Simulations were run at temperatures ranging between 270 and 360 K using a Langevin integrator with a friction coefficient of 0.2 ps–1 and a time step of 10 fs using Gromacs 4.0.5.[62] To analyze the data, we project the resulting trajectories on the fraction of native contacts, Q, and combine the information from multiple temperatures using the weighted histogram analysis method.[63]

Results and Discussion

Equilibrium Thermodynamics and Slow-Folding Kinetics of PurR

The equilibrium thermal unfolding of WT PurR DBD (hereon referred to as PurR) as monitored by far-UV CD at 222 nm exhibits the characteristic sigmoidal profile expected of two-state systems with well-defined pre- and post-transition regions (Figure A). The data can be fit to a two-state model with a Tm of 323.5 ± 0.5 K and ΔHm of 149.7 ± 9.5 kJ mol–1 (error bars represent 95% confidence here and throughout the text), similar to most mesophilic proteins. PurR being a small helical domain has a long-range order (LRO[64]) of just ∼0.39; this is indicative of local interactions dominating the contact energetics and predicts that PurR should fold relatively fast in the microsecond timescale. However, stopped-flow kinetics points to well-defined kinetic phases described by single exponential functions with the kinetic amplitudes matching the populations from equilibrium measurements (inset to Figures B and S2C). A chevronlike behavior is observed at all experimental temperatures with an extrapolated folding rate constant in the absence of urea of just ∼71 s–1 at 285 K that increases to ∼347 s–1 at 310 K (Figures B and S2A). PurR, therefore, falls well below the expected range in the plot of LRO versus folding rate (Figure C). Moreover, changing the conditions (increasing temperatures or IS to 0.5 M) affects the folding rate only marginally (Figures C and S2B). Thus, though PurR exhibits an apparent two-state equilibrium, it folds significantly slower than the expectation from LRO predictions, arguing for specific sequence effects determining its folding behavior. In the next few sections, we systematically explore the likely reasons for slow folding combining experiments and simulations.
Figure 2

Slow Folding of PurR. (A) Thermal unfolding profile of PurR as monitored by far-UV CD at 222 nm plotted in mean residue ellipticity units of deg cm2 dmol–1. The vertical dashed line signals the melting temperature from a two-state fit (red curve), while U and F represent unfolded and folded baselines, respectively. (B) Folding kinetics from stopped-flow experiments with open and filled circles representing the measured refolding and unfolding relaxation rates, respectively. Inset: relative kinetic amplitudes (circles) and equilibrium populations derived from a two-state model (red curve). (C) Correlation between LRO and folding rates for a database of proteins[65] (black circles). Blue, green, and red stars signal the PurR folding rate constants at 285, 298, and 310 K, respectively, at 43 mM IS, pH 7.0. Blue and green circles are the folding rates of PurR at 285 and 298 K at 500 mM IS, pH 7.0.

Slow Folding of PurR. (A) Thermal unfolding profile of PurR as monitored by far-UV CD at 222 nm plotted in mean residue ellipticity units of deg cm2 dmol–1. The vertical dashed line signals the melting temperature from a two-state fit (red curve), while U and F represent unfolded and folded baselines, respectively. (B) Folding kinetics from stopped-flow experiments with open and filled circles representing the measured refolding and unfolding relaxation rates, respectively. Inset: relative kinetic amplitudes (circles) and equilibrium populations derived from a two-state model (red curve). (C) Correlation between LRO and folding rates for a database of proteins[65] (black circles). Blue, green, and red stars signal the PurR folding rate constants at 285, 298, and 310 K, respectively, at 43 mM IS, pH 7.0. Blue and green circles are the folding rates of PurR at 285 and 298 K at 500 mM IS, pH 7.0.

Minimal Apparent Internal Friction

One of the primary origins of slow folding is internal friction. Although a rigorous definition of internal friction has been elusive,[34,35,39] it is broadly accepted that its effects are encapsulated within the folding diffusion coefficient D in Kramers rate theory (eq ). For practical purposes, internal friction effects are quantified experimentally by measuring the kinetic rate constants at varying concentrations of viscogens that increase the solvent viscosity (η) and assuming that the folding free energy surface is unaltered.[66] If the dynamics are enslaved to friction from the solvent, because the friction coefficient varies in proportion to the viscosity, we can rewrite eq as[67]where A is a constant that includes the contributions of the energy landscape to the pre-exponential (i.e., the curvatures at the bottom of the well and the barrier top). Under iso-stability conditions, if we divide the folding rate at a reference viscosity η0 by that at a working viscosity η, that is, kf,0/kf, the exponential term and A cancel out. Hence, the plot of kf,0/kf versus the ratio of the viscosities η/η0 would follow a straight line with a slope of 1 and 0 offset.[67] If instead, internal friction (σ) needs to be invoked, then eq reduces to[67]and an insensitivity to the viscosity that translates to a non-zero offset and a slope lower than 1 in the plot of kf,0/kf versus η/η0. To explore this, we measured the folding relaxation rates of PurR at different glucose (0–1.5 M, the viscogen) and urea concentrations. A chevronlike behavior is evident at various glucose concentrations (Figures S3 and S4) from which the rates are extracted at two different iso-stability conditions of 8.4 and 4 kJ mol–1. The measured relative rates scale directly with the relative solvent viscosity for PurR (Figure A), and the folding times (τ) exhibit a near-linear dependence on the relative viscosity (Figure B). These results indicate that the slow folding of PurR is not a consequence of internal friction, at least as conventionally interpreted.
Figure 3

Minimal apparent internal friction. (A) Plots of relative folding rates (ordinate) versus relative viscosity at 285 K and at two different iso-stability conditions of 8.4 and 4 kJ mol–1, respectively (blue and red). For the iso-stability condition of 8.4 kJ mol–1, the urea concentration spans the range 0–3.25 M at different glucose concentrations, while for 4 kJ mol–1 iso-stability, the urea concentration range is 1.95–5.55 M. (B) Plot of the folding time (ordinate) against relative viscosity at iso-stability conditions of 8.4 kJ mol–1. Red line is the expected linear dependence.

Minimal apparent internal friction. (A) Plots of relative folding rates (ordinate) versus relative viscosity at 285 K and at two different iso-stability conditions of 8.4 and 4 kJ mol–1, respectively (blue and red). For the iso-stability condition of 8.4 kJ mol–1, the urea concentration spans the range 0–3.25 M at different glucose concentrations, while for 4 kJ mol–1 iso-stability, the urea concentration range is 1.95–5.55 M. (B) Plot of the folding time (ordinate) against relative viscosity at iso-stability conditions of 8.4 kJ mol–1. Red line is the expected linear dependence.

Marginal Thermodynamic Barriers

An alternate possibility is that PurR folds over large free-energy barriers that in turn contribute to its unusually slow folding. DSC is an ideal avenue for extracting the thermodynamic barriers,[68] given the fundamental connection between heat capacity and partition function,[69,70] which has been validated in several small single-domain proteins.[71,72] To probe if the folding thermodynamics of PurR is characterized by large thermodynamic barriers, we measured the absolute heat capacity profile of PurR from the dependence of the apparent heat capacity on protein concentrations (Figure S5A). The resulting DSC profile is broad, upshifted from the expected Freire baseline, exhibits little pretransition, cannot be characterized by a chemical two-state model (Figure S5B), and therefore displays the features of marginal barrier systems (Figure A). The broad DSC curve is not a consequence of the disordered C-terminal tail as the PurR variant without the disordered tail does not change the overall broadness of the DSC curve and the cooperativity as monitored by far-UV CD or the relaxation rates (Figure S6). The WSME model fits with detailed energetics reproduce the experimental heat capacity profile of the WT PurR very well (see Methods for parameters). The one-dimensional free-energy profiles as a function of the number of structured residues (the reaction coordinate) reveal a downhill-like folding gradient at low temperatures and a maximal thermodynamic barrier of just ∼3 kJ mol–1 at 329 K (Figure B). The probability distribution at 329 K is broad and without well-defined unfolded or partially structured states (Figure C).
Figure 4

Marginal thermodynamic barriers. (A) Absolute heat capacity of PurR (circles) fit to the WSME model (red) together with the predicted native baseline (black). The thermodynamic fluctuations of well-folded proteins follow a specific trend with temperature and molecular weight that is represented by the Freire baseline (green). The observed upward shift of the experimental data is generally seen as evidence for large thermodynamic fluctuations in the native ensemble. The vertical dashed line signals the melting temperature from a two-state fit. (B,C) Predicted free-energy profiles (panel B, in kJ mol–1) and probability distributions (panel C) at 278 K (blue), 298 K (green), 310 K (red), and 329 K (black). Note the downhill-folding profiles at low temperatures and the marginal barrier with multiple partially structured states at 329 K. (D) Heat capacity curves derived from coarse-grained simulations with native (Go̅) and native plus non-native potentials (Go̅ + KH). (E,F) Free-energy profiles (panel E, in kJ mol–1) and the corresponding density distributions (panel F) as a function of the fraction of native contacts for the Go̅ + KH potential at 270 K (blue), 280 K (green), 290 K (red), and 310 K (black). Note that 310 K represents the apparent melting temperature in the relative energy scale of the coarse-grained simulations.

Marginal thermodynamic barriers. (A) Absolute heat capacity of PurR (circles) fit to the WSME model (red) together with the predicted native baseline (black). The thermodynamic fluctuations of well-folded proteins follow a specific trend with temperature and molecular weight that is represented by the Freire baseline (green). The observed upward shift of the experimental data is generally seen as evidence for large thermodynamic fluctuations in the native ensemble. The vertical dashed line signals the melting temperature from a two-state fit. (B,C) Predicted free-energy profiles (panel B, in kJ mol–1) and probability distributions (panel C) at 278 K (blue), 298 K (green), 310 K (red), and 329 K (black). Note the downhill-folding profiles at low temperatures and the marginal barrier with multiple partially structured states at 329 K. (D) Heat capacity curves derived from coarse-grained simulations with native (Go̅) and native plus non-native potentials (Go̅ + KH). (E,F) Free-energy profiles (panel E, in kJ mol–1) and the corresponding density distributions (panel F) as a function of the fraction of native contacts for the Go̅ + KH potential at 270 K (blue), 280 K (green), 290 K (red), and 310 K (black). Note that 310 K represents the apparent melting temperature in the relative energy scale of the coarse-grained simulations. To gain further insight into the folding of PurR, we have run coarse-grained simulations using the model by Karanicolas and Brooks.[59] To test for the role of non-native interactions in modulating barrier height magnitudes, we supplemented the original model with the transferable Kim−Hummer potential, which incorporates non-native hydrophobic and electrostatic interactions. Interestingly, the resulting thermogram is broader when non-native states are considered compared to the pure Go̅-like potential (Figure D). A similar broad thermogram is observed on different realizations of the energy function, where we separately consider the influence of non-native interactions and charges (Figure S7). In all cases, we find little evidence for large thermodynamic barriers, and the resulting barrier height magnitudes (Figures E and S7) are consistent with the predictions from the WSME model. This agreement across models with different degrees of structural resolution and energy functions is strong evidence that PurR folds over only small thermodynamic barriers (∼1 to 2 kBT) and that the slow-folding rate has other molecular origins.

Non-Cooperative Unfolding and Partially Structured States

One striking aspect of the free-energy profiles is that they point to three partially structured states (as the barriers are small, we cannot classify them as true intermediates) that are populated en route to folded state in the WSME model and at least one partially structured state in the coarse-grained simulations. To further test for this feature from experiments, we performed multiprobe spectroscopic measurements. We identify considerable differences in melting temperatures of ∼5 K in PurR when analyzed by a two-state model: Tm of 321.2 ± 1.2 from quantum yield (QY) measurements on excitation at 295 nm (QY295) (Figure S8A), 323.2 ± 1.0 from QY measurements at 274 nm excitation (QY274) (Figure S8A), 323.5 ± 0.5 K from far-UV CD (Figure A), and 326.9 ± 0.3 K from DSC (Figure A). Remarkably, the wavelength of maximum fluorescence emission (λmax) of the sole tryptophan (W37) displays a transition point of 333 K (from a first-derivative analysis, as the baseline for the unfolded state is not well defined) which is nearly 12 K higher than the melting temperature obtained from QY measurements, that is, from the same data set (Figure A). This is more apparent when one observes that λmax starts changing only at about 320 K, while the heat capacity profile already displays large changes at this temperature.
Figure 5

Non-cooperative unfolding thermodynamics from experiments and simulations. (A) Changes in fluorescence emission maximum of PurR as a function of temperature and the corresponding inflection point of 332 K (dashed line). The melting temperature of 321 K from changes in fluorescence intensity at 295 nm from the same experiment is shown as a continuous black line for reference. (B) FLT amplitudes of PurR for the long (blue) and short (red) lifetime components, together with the crossover temperature of 332 K. (C) Predicted unfolding curves for the different secondary structure elements (helix 1/H1, helix 2/H2, and helix 3/H3) of PurR from the WSME model. The corresponding inflection points from first-derivative analysis are marked at 325, 332, and 328 K in blue, green, and red, respectively. (D) Temperature dependence of the fraction of native contacts (Q) for contacts formed by the different protein helices from coarse-grained simulations. (E) Overlay of multiple snapshots corresponding to the partially structured state observed in the simulations at a reaction coordinate value of Q ∼ 0.5 (colored tube). We show a transparent cartoon representation of the native state for reference.

Non-cooperative unfolding thermodynamics from experiments and simulations. (A) Changes in fluorescence emission maximum of PurR as a function of temperature and the corresponding inflection point of 332 K (dashed line). The melting temperature of 321 K from changes in fluorescence intensity at 295 nm from the same experiment is shown as a continuous black line for reference. (B) FLT amplitudes of PurR for the long (blue) and short (red) lifetime components, together with the crossover temperature of 332 K. (C) Predicted unfolding curves for the different secondary structure elements (helix 1/H1, helix 2/H2, and helix 3/H3) of PurR from the WSME model. The corresponding inflection points from first-derivative analysis are marked at 325, 332, and 328 K in blue, green, and red, respectively. (D) Temperature dependence of the fraction of native contacts (Q) for contacts formed by the different protein helices from coarse-grained simulations. (E) Overlay of multiple snapshots corresponding to the partially structured state observed in the simulations at a reaction coordinate value of Q ∼ 0.5 (colored tube). We show a transparent cartoon representation of the native state for reference. To validate this observation further, we performed fluorescence lifetime (FLT) analysis of W37 located in helix 3 making long-range interactions with helix 2; the advantage of this technique is that the signals and the species population can be directly decoupled without resorting to baselines. FLT measurements reveal two lifetimes for W37, the longer one (∼7 ns) corresponding to the main-chain conformation or side-chain orientation sensitive to the folded state and the shorter one (∼1 ns) representing the unfolded conformation (Figure S8B,C). The corresponding amplitudes follow a clear sigmoidal pattern with the amplitude crossover at 332 K (Figure B), very similar to the inflection point of fluorescence λmax changes. The large difference in apparent melting temperatures that is consistent across different experimental probes thus suggests that W37 is differentially sensitive to the folding environment, with the fluorescence intensity (from which QY is calculated) and λmax sensitive to global and local unfolding events around the tryptophan, respectively. These experimental observations are strikingly captured by both the WSME model and coarse-grained simulations. The WSME model predicts that the individual helical elements exhibit differences in the overall stability that translates to differences in the melting temperature ranging from 325–332 K, with 332 K being the melting temperature of helix 2 (Figure C). Coarse-grained molecular simulations paint a picture of the folding–unfolding equilibrium qualitatively consistent with that of the WSME model. In Figure D, we show the average values of Q calculated for contacts involving helix 1, 2, or 3 (we note that these may include some overlapping pairs of interactions). Clearly, the melting of the interactions formed by the helices is decoupled, with helix 1 being first to unfold, helix 3 being the closest to the average unfolding, and helix 2 being third. We note that the rank order in the melting temperatures is exactly the same as that from the WSME model. The decoupled unfolding manifests as a partially structured state during (un)folding at a reaction coordinate value of ∼0.5 (Figure E) characterized by a detached helix 1 that samples varied conformations (Figure E). The emergence of this partially structured state results in different conformational changes being probed preferentially at different temperatures (Φ-values for these barrier crossing events calculated from the simulations are shown in Figure S9). At the same time, the unique peak in the heat capacity curve (Figure ) does not warrant a description in terms of separate thermodynamic transitions.

Strong Temperature Dependence on the Pre-Exponential to Folding

In this section, we explore the extent to which the pre-exponential factor described in eq needs to change with temperature to account for the observed slow folding. Since the thermodynamic free-energy profiles are available, it should be possible to extract this pre-exponential factor to protein folding (k0) by measuring the folding relaxation rates at a range of temperatures, similar to earlier work on fast-folding proteins.[73] Note that this approach does not disentangle the different terms in k0 but serves to only identify its dependence on temperature. We measured the folding relaxation rates of PurR at a range of temperatures from 285–305 K and at a final urea concentration of 0.55 M (near-native conditions; Figure A). The measured rates are surprisingly similar to the sampling rate of an excited state in the disordered ensemble of CytR[74] and the folding kinetics of an engineered folded variant CytR A28V/A48M (double mutant, DM) that exhibits an equally complex thermodynamic behavior with equilibrium melting temperature differences of ∼10 K.[52]
Figure 6

Strong temperature dependence on the pre-exponential factor to folding. (A) Observed relaxation rates of PurR from stopped-flow experiments at near-native conditions of 0.55 M urea (open red circles) starting from protein denatured in 6 M urea. The corresponding rates from chevron plots (Figures B and S2) are shown in filled red circles highlighting the internal consistency across different experiments. The fit from diffusive calculations on the free-energy profiles (Figure B) is shown as a red curve. The observed relaxation rates for the sampling of an excited folded conformation in the disordered ensemble of CytR is shown in green. Black circles represent the relaxation rates for the folded variant of CytR, termed the CytR DM (double mutant, A29V/A48M).[52] (B) Measurements of the pre-exponential factor from experiments (blue and magenta) versus that estimated for the folded PurR (red) and the disordered CytR (green).

Strong temperature dependence on the pre-exponential factor to folding. (A) Observed relaxation rates of PurR from stopped-flow experiments at near-native conditions of 0.55 M urea (open red circles) starting from protein denatured in 6 M urea. The corresponding rates from chevron plots (Figures B and S2) are shown in filled red circles highlighting the internal consistency across different experiments. The fit from diffusive calculations on the free-energy profiles (Figure B) is shown as a red curve. The observed relaxation rates for the sampling of an excited folded conformation in the disordered ensemble of CytR is shown in green. Black circles represent the relaxation rates for the folded variant of CytR, termed the CytR DM (double mutant, A29V/A48M).[52] (B) Measurements of the pre-exponential factor from experiments (blue and magenta) versus that estimated for the folded PurR (red) and the disordered CytR (green). We extract the pre-exponential term by performing diffusive calculations on the free-energy profiles generated by the WSME model (thus accounting for barrier effects) by solving the one-dimensional diffusion equation using the rata-matrix approach of Hofrichter and co-workers.[75] We employ a phenomenological Arrhenius dependence on k0 as Ae–, where Ea is the activation energy that approximates the landscape roughness including solvent effects and changes in shape of the free-energy profile with temperature.[73] The temperature dependence is captured well only when k0 varies from ∼3000 s–1 at 285 K to ∼200,000 s–1 at 320 K (Figure B). It is pertinent to note that the former is just an order of magnitude slower than the relaxation rates measured for downhill-folding proteins while agreeing well with the downhill estimates at higher temperatures. The activation energy is found to be 80 or 1.43 kJ mol–1 per residue, nearly 40% higher than the average per residue dependence of fast-folding proteins.[73] Interestingly, the pre-exponential terms match with those of the disordered CytR at low temperatures that displays a shallow, bumpy landscape from experiments and simulations.[74]

Discussion

The conformational behavior of PurR is thus observed to be complex, with some attributes from each of downhill (spread in melting temperatures, marginal barriers, and downhill-folding profiles from simulations), two-state folding (chevron kinetics and sigmoidal unfolding curves), and multistate folding (partially structured states from simulations). These features are captured well at a (semi-)quantitative level by the WSME model and variants of coarse-grained simulations, with the latter including non-native energetic effects. The simulations paint a dynamic picture in which the protein molecule struggles to fit in the helices together as it folds. Accordingly, the three helices exhibit graded thermodynamic stability that in turn lowers the thermodynamic cooperativity and barriers but promotes pockets of partial structure all along the folding coordinate that manifests as differences in melting temperatures and broad heat capacity profiles. In other words, the different structural elements are only weakly coupled, that is, exhibiting large sensitivity to perturbations, contributing to complex underlying landscape. Interestingly, the homologue CytR is disordered in solution, while the engineered folded variant of CytR with two hydrophobic substitutions (A29V/A48M) exhibits very similar features: broad DSC profile, differences in melting temperatures of 10 K, heterogeneous, and slow folding.[52] What is unique about the structure of PurR that contributes to the conformationally heterogeneous unfolding, despite exhibiting slow two-state like chevron kinetics? In this regard, it is known that DBDs function in a highly complex environment around DNA arising from counterion condensation. It is therefore likely that the structure of PurR is evolutionarily selected for folding and function in the vicinity of DNA and not in the conditions employed in the current set of experiments (43 mM IS). Increasing the solvent IS to mimic the environment around DNA (>2 M IS[76,77]) results in an unusual feature where parts of the PurR structure are lost (less negative ellipticity at 222 nm) despite the increased stability (Figure A). This hints at a complex surface electrostatic feature involving a combination of both stabilizing and destabilizing effects and partitioning of local-nonlocal electrostatics, with one effect dominating over the other depending on the conditions. The folding relaxation rates increase on increasing the solvent IS but fold faster than the dead-time of the stopped flow-instrument even at 298 K at IS greater than 0.5 M (Figure S2). In fact, our observations are consistent with single-molecule experiments on α3D that point to conflicting electrostatic interactions as the primary source of internal friction or a slower pre-exponential to folding.[49] Recent statistical mechanical modeling of PurR folding in the presence of DNA demonstrates a progressive titling of its landscape toward the folded state in the vicinity of DNA, providing hints that the folding landscape could be fine-tuned by quinary interactions.[78]
Figure 7

Electrostatic frustration and weak packing contribute to a shallow folding landscape. (A) Thermal unfolding curves of PurR at different IS conditions as monitored by far-UV CD at 222 nm and reported in mean-residue ellipticity units of deg cm2 dmol–1. Note the loss of secondary structure at high IS (black and red) despite an increase in the melting temperature. (B) Thermal unfolding curves of PurR mutants V21A (red) and I40A (green).[52] (C) Two-dimensional conformational landscape of PurR predicted by the WSME model at 329 K highlighting the lack of large thermodynamic barriers between folded (F), unfolded (U), and the numerous partially structured states (valleys in dark blue shade). nN and nC represent the number of structured residues in the N- and C-terminals, respectively. nN includes the first two helices, while the rest of the sequence falls in nC. (D,E) Two-dimensional free-energy landscape (in units of kBT) of PurR from coarse-grained simulations pointing to a complex conformational ensemble with multiple valleys (light and dark blue contours). The coordinates employed are the fraction of native contacts in helix 1 (QH1), helix 2 (QH2), and helix 3 (QH3).

Electrostatic frustration and weak packing contribute to a shallow folding landscape. (A) Thermal unfolding curves of PurR at different IS conditions as monitored by far-UV CD at 222 nm and reported in mean-residue ellipticity units of deg cm2 dmol–1. Note the loss of secondary structure at high IS (black and red) despite an increase in the melting temperature. (B) Thermal unfolding curves of PurR mutants V21A (red) and I40A (green).[52] (C) Two-dimensional conformational landscape of PurR predicted by the WSME model at 329 K highlighting the lack of large thermodynamic barriers between folded (F), unfolded (U), and the numerous partially structured states (valleys in dark blue shade). nN and nC represent the number of structured residues in the N- and C-terminals, respectively. nN includes the first two helices, while the rest of the sequence falls in nC. (D,E) Two-dimensional free-energy landscape (in units of kBT) of PurR from coarse-grained simulations pointing to a complex conformational ensemble with multiple valleys (light and dark blue contours). The coordinates employed are the fraction of native contacts in helix 1 (QH1), helix 2 (QH2), and helix 3 (QH3). Additionally, protein engineering experiments highlight an unusual packing thermodynamics with single-point mutations V21A and I40A strongly destabilizing the protein (Figure B).[52] In fact, the V21A mutation fully unfolds PurR, arguing for a weak hydrophobic effect driving folding in this system. This observation is borne out by the fact that a close family member, CytR, is disordered in solution and is a mere two hydrophobic substitutions from being folded at low temperatures.[26,52] Taken together, the molecular origin of slow folding in PurR seems to be a combination of multiple sequence-structural features that in turn affect the pre-exponential to folding and not because of large thermodynamic barrier height. We find evidence for this from a steep temperature dependence of relaxation rates and the extracted pre-exponential terms that match with downhill folding proteins at higher temperatures and are slower by an order of magnitude at lower temperatures. Additionally, the free-energy landscape of PurR is complex when assuming two conformational coordinates from the WSME model or coarse-grained simulations. In the former, it is seen that nearly all conformations are equally likely with a broad distribution of molten-globule-like conformations at the midpoint with no well-defined funnel toward the folded state (Figure C). This is also observed in two-dimensional probability density plots from coarse-grained simulations, wherein partially structured states are populated irrespective of the projections (Figure D,E; see also contact maps in Figure S10). Experiments on unfolded protein L under folding conditions highlight a dramatic slowdown in folding diffusion coefficient compared to unfolding conditions,[79] while single-molecule experiments point to large contributions from internal friction on even unfolded and disordered proteins.[80] Given these observations and since internal friction effects arise likely from microscopic barriers to dihedral motions,[34] it is surprising to find that conventional measures of internal friction fail to reveal any landscape roughness or apparent internal friction in PurR. If the established techniques for “measuring” internal friction are robust (Figure ), assuming a one-dimensional coordinate, it can therefore be concluded that the folding landscape of PurR is characterized by broad unfolded and folded basins with no well-defined transition-state ensemble. In terms of Kramers theory of reaction rates, these features hint at small curvatures (ω2) of unfolded well and barrier top in PurR, which in turn slow down the folding rate and not through a slow-folding diffusion coefficient. Experimental works narrowing the barrier increase the folding rate by an order of magnitude,[81] thus suggesting that broader barriers or unfolded wells would proportionately slow down folding. An alternate possibility is that denaturant-stabilizer mixtures (as in urea-glucose use in the current study) modulate both the intramolecular diffusion coefficient and barriers in compensatory fashions, resulting in a linear relative viscosity versus rate plot (Figure ). It is important to note that the extent of trapping in the populated partially structured states could also be different that necessitates the use of a coordinate dependent diffusion coefficient or “heterogeneous friction.”[67] A two-dimensional landscape, on the other hand, could contribute to complex distribution of folding fluxes, again requiring additional slow diffusional terms. Experiments on mutants that populate one or more of the partially structured states could be specifically employed to probe for such coordinate-dependent effects.

Conclusions

In summary, our experiments combined with simulations rule out a large thermodynamic barrier in PurR and point to the slaving of the dynamics to the shape of the underlying free-energy landscape that appears as a strong temperature dependence on the pre-exponential, slowing down folding. Structural, mutational, and thermodynamic analyses of packing and electrostatic effects indicate that PurR is at the threshold of disorder. We also find that studies combining scanning calorimetry experiments, multiprobe spectroscopy, viscosity-dependent kinetics, and thermodynamic modeling can provide an unparalleled view on the underlying folding landscape of proteins. It still remains to be seen if the slow folding determined by the slow pre-exponential is a conserved feature of LacR/PurR family members. The suitability of additional coordinates in explaining slow folding and the role of DNA in smoothening the folding landscape or speeding up the folding pre-exponential through its large negative electrostatic potential also remain to be seen.
  77 in total

1.  Comparison between long-range interactions and contact order in determining the folding rate of two-state proteins: application of long-range order to folding rate prediction.

Authors:  M M Gromiha; S Selvaraj
Journal:  J Mol Biol       Date:  2001-06-29       Impact factor: 5.469

2.  Folding at the speed limit.

Authors:  Wei Yuan Yang; Martin Gruebele
Journal:  Nature       Date:  2003-05-08       Impact factor: 49.962

3.  Quantifying the roughness on the free energy landscape: entropic bottlenecks and protein folding rates.

Authors:  Leslie L Chavez; José N Onuchic; Cecilia Clementi
Journal:  J Am Chem Soc       Date:  2004-07-14       Impact factor: 15.419

4.  Solution structure of a protein denatured state and folding intermediate.

Authors:  T L Religa; J S Markson; U Mayor; S M V Freund; A R Fersht
Journal:  Nature       Date:  2005-10-13       Impact factor: 49.962

5.  Direct observation of downhill folding of lambda-repressor in a microfluidic mixer.

Authors:  Stephen J DeCamp; Athi N Naganathan; Steven A Waldauer; Olgica Bakajin; Lisa J Lapidus
Journal:  Biophys J       Date:  2009-09-16       Impact factor: 4.033

6.  Experimental evidence for a frustrated energy landscape in a three-helix-bundle protein family.

Authors:  Beth G Wensley; Sarah Batey; Fleur A C Bone; Zheng Ming Chan; Nuala R Tumelty; Annette Steward; Lee Gyan Kwa; Alessandro Borgia; Jane Clarke
Journal:  Nature       Date:  2010-02-04       Impact factor: 49.962

7.  Nonnative interactions in the FF domain folding pathway from an atomic resolution structure of a sparsely populated intermediate: an NMR relaxation dispersion study.

Authors:  Dmitry M Korzhnev; Robert M Vernon; Tomasz L Religa; Alexandar L Hansen; David Baker; Alan R Fersht; Lewis E Kay
Journal:  J Am Chem Soc       Date:  2011-06-28       Impact factor: 15.419

8.  Navigating the downhill protein folding regime via structural homologues.

Authors:  Athi N Naganathan; Peng Li; Raúl Perez-Jimenez; Jose M Sanchez-Ruiz; Victor Muñoz
Journal:  J Am Chem Soc       Date:  2010-08-18       Impact factor: 15.419

Review 9.  Interpreting protein/DNA interactions: distinguishing specific from non-specific and electrostatic from non-electrostatic components.

Authors:  Peter L Privalov; Anatoly I Dragan; Colyn Crane-Robinson
Journal:  Nucleic Acids Res       Date:  2010-11-10       Impact factor: 16.971

10.  Entropic Control of an Excited Folded-Like Conformation in a Disordered Protein Ensemble.

Authors:  Sneha Munshi; Divya Rajendran; Athi N Naganathan
Journal:  J Mol Biol       Date:  2018-06-07       Impact factor: 5.469

View more
  1 in total

Review 1.  The Wako-Saitô-Muñoz-Eaton Model for Predicting Protein Folding and Dynamics.

Authors:  Koji Ooka; Runjing Liu; Munehito Arai
Journal:  Molecules       Date:  2022-07-12       Impact factor: 4.927

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.