Anthony Hazel1, Christophe Chipot2, James C Gumbart1. 1. School of Physics, Georgia Institute of Technology , Atlanta, Georgia 30332, United States. 2. Beckman Institute, University of Illinois at Urbana-Champaign , Urbana, Illinois 61801, United States ; Laboratoire Associé International Centre National de la Recherche Scientifique - Unité Mixte de Recherche , N°7565, BP 70239, 54506 Vandœuvre-lès-Nancy, France.
Abstract
The determination of the folding dynamics of polypeptides and proteins is critical in characterizing their functions in biological systems. Numerous computational models and methods have been developed for studying structure formation at the atomic level. Due to its small size and simple structure, deca-alanine is used as a model system in molecular dynamics (MD) simulations. The free energy of unfolding in vacuum has been studied extensively using the end-to-end distance of the peptide as the reaction coordinate. However, few studies have been conducted in the presence of explicit solvent. Previous results show a significant decrease in the free energy of extended conformations in water, but the α-helical state is still notably favored over the extended state. Although sufficient in vacuum, we show that end-to-end distance is incapable of capturing the full complexity of deca-alanine folding in water. Using α-helical content as a second reaction coordinate, we deduce a more descriptive free-energy landscape, one which reveals a second energy minimum in the extended conformations that is of comparable free energy to the α-helical state. Equilibrium simulations demonstrate the relative stability of the extended and α-helical states in water as well as the transition between the two states. This work reveals both the necessity and challenge of determining a proper reaction coordinate to fully characterize a given process.
The determination of the folding dynamics of polypeptides and proteins is critical in characterizing their functions in biological systems. Numerous computational models and methods have been developed for studying structure formation at the atomic level. Due to its small size and simple structure, deca-alanine is used as a model system in molecular dynamics (MD) simulations. The free energy of unfolding in vacuum has been studied extensively using the end-to-end distance of the peptide as the reaction coordinate. However, few studies have been conducted in the presence of explicit solvent. Previous results show a significant decrease in the free energy of extended conformations in water, but the α-helical state is still notably favored over the extended state. Although sufficient in vacuum, we show that end-to-end distance is incapable of capturing the full complexity of deca-alanine folding in water. Using α-helical content as a second reaction coordinate, we deduce a more descriptive free-energy landscape, one which reveals a second energy minimum in the extended conformations that is of comparable free energy to the α-helical state. Equilibrium simulations demonstrate the relative stability of the extended and α-helical states in water as well as the transition between the two states. This work reveals both the necessity and challenge of determining a proper reaction coordinate to fully characterize a given process.
Folding
of proteins into organized three-dimensional structures
capable of fulfilling a biological function is determined by the sequence
of amino acids[1] and is believed to proceed
hierarchically.[2,3] The emergence of secondary-structure
elements constitutes an early event in the chronology of folding,[4] which prefaces the ultimate collapse into well-defined,
compact, functional entities. Formation of stretches of secondary
structure, the elementary bricks of the protein scaffold, therefore,
represents an important milestone on the folding pathway, and a convenient
framework to investigate the basic physical principles that underlie
protein folding—notably how do elements of secondary structure
nucleate and further propagate into an ordered structure, and to what
extent is the organization of the peptide chain collective.[5] Understanding this key biological process at
the theoretical level has greatly benefited from the recent development
of novel, dedicated computer architectures[6] and the unbridled race to model larger proteins over longer time
scales.[7,8] Brute-force simulations have now reached
a cruising speed that can fold proteins as large as 100 amino-acid
residues over tens to hundreds of microseconds,[9] still at the price of substantial computational effort.
A number of unbiased, all-atom simulations in explicit solvent have
proven successful to illuminate the hierarchical nature of folding,
shedding light on the possible pathways that connect a random coil
to a functional three-dimensional structure.[10−13]Substantially shorter importance-sampling[14,15] simulations relying on simpler models consisting of short, organized
peptide segments can, however, provide valuable insight into the physical
and evolutionary principles that govern the intricate conformational
transition of a disordered protein chain into a properly folded one.[5,16−20] Among suitable candidates of secondary-structure elements for biased
simulations are α-helices, the most prevalent motif observed
in proteins,[21] stabilized by intramolecular
interactions, notably through the formation of a hydrogen bond between
the carbonyl moiety of the ith residue and the amino
moiety of the i+4th residue. Owing to its noteworthy
propensity to form α-helices,[22] alanine
has been the amino acid of predilection in theoretical investigations
of conformational equilibria in short peptide segments. Alanine-rich
peptides flanked by titratable residues have also been utilized abundantly
at the experimental level[23] to decipher
the transition pathway from a disorganized chain to a nascent chain
to an ultimately folded α-helix. In particular, they were at
the center of a controversy on the existence of 310-helices,[24] a secondary-structure motif arising from the
formation of hydrogen bonds between the ith and the i+3-th residues of the peptide chain, conjectured to act
as an observable intermediate in the conformational transition toward
the α-helical state.Turning to importance-sampling simulations
naturally raises the
question of an appropriate choice of a transition coordinate, capable
of describing folding of the peptide chain into a well-ordered secondary
structure. Even for appreciably short segments, this choice remains
an intricate problem, deeply rooted in the large number of degrees
of freedom that vary concurrently as the peptide chain evolves toward
its native, organized conformation.[25] Much
of this intricacy lies in the multidimensionality of the true reaction
coordinate,[26−29] thwarting naive attempts to resort to a limited number of geometric
variables, often of low collectivity.[30] Fruitful application of collective-variable-based methods rests
in large measure upon the fragile hypothesis of time scale separation
of slow degrees of freedom, in connection with the reaction coordinate,
as well as all other hard, fast degrees of freedom. Mapping the free-energy
landscape that underlies the folding of a short peptide, therefore,
ultimately reduces to either selecting a few relevant collective variables
or throwing into the model a plethora of order parameters to describe
the multidimensional transition space.[31] The daunting nature of this task explains why biased simulations
of complex, intertwined conformational changes remain scarce.[32]In the present contribution, we revisit
the paradigmatic capped
decamer of alanine, henceforth referred to as deca-alanine. Deca-alanine
has served on various occasions as a methodological proof of concept,
in particular in nonequilibrium work simulations in conjunction with
the Jarzynski identity,[33] and equilibrium
free-energy calculations relying upon the application of a time-dependent
bias.[34,35] Notwithstanding their rudimentary character,
model peptides like deca-alanine offer valuable thermodynamic and
kinetic information on folding, under the assumption that a reasonable,
nonambiguous transition coordinate can be designed—which is
necessarily subservient to the length of the peptide chain. They also
help shed light on common shortcomings of importance-sampling simulations
of low-dimensionality, notably hidden barriers in orthogonal space,
and have proven useful for devising remedies.[35−38] Beyond their undeniable utility
in methodological developments, they are also sufficiently simple
to serve as models of the nascent chain in more realistic biological
applications, like the coupled folding–translocation occurring
in the SecY complex.[39]Here, we extend
the exploration of reversible extension of deca-alanine
in vacuo[34,35] by examining how the aqueous environment
reshapes the free-energy landscape that underlies folding. We find
that the range of conformational states explored in water is much
greater than in a vacuum, making end-to-end distance a highly degenerate
transition coordinate. However, by adding a second coordinate describing
the helicity of deca-alanine, we demonstrate that its folding pathway
in water is more intricate than in a vacuum.
Methods
Simulations
of deca-alanine (Ala10) were performed using
the 104-atom compact helical model used by Park et al.,[33] capped with an acetylated N-terminus and amidated
C-terminus, as a starting state, with all hydrogens defined explicitly.
For simulations in explicit water, the visualization and analysis
program VMD[40] was used to place deca-alanine
in a cube of 10 850 TIP3P[41] water molecules with dimensions 70 Å ×
70 Å × 70 Å for a total of 32 659 atoms. Molecular
dynamics simulations were carried out using NAMD 2.9[42] with the CHARMM all-atom force fields (CHARMM22/CMAP[43] and CHARMM36[44,45]). The temperature
was fixed at 300 K using Langevin dynamics; the pressure was kept
constant at 1 atm using the Langevin piston method.[46] The equations of motion were integrated using the RESPA
multiple time-step algorithm with a time step of 2 fs used for all
bonded interactions, 2 fs for short-range nonbonded interactions,
and 4 fs for long-range electrostatic interactions. Long-range electrostatic
interactions were calculated using the particle-mesh Ewald method.[47] Bonds involving hydrogen atoms were constrained
to their equilibrium length, employing the Rattle algorithm.[48]PMFs were calculated using both adaptive
biasing forces (ABF)[34,42,49] and umbrella sampling (US) with
the weighted histogram analysis method (WHAM),[50] utilizing the collective variables (colvars)
module of NAMD 2.9.[35] Two reaction coordinates
were defined: (ξ) the distance from the carbonyl carbon of the
backbone of the first residue to the carbonyl carbon of the last residue
and (α) the α-helical content of all 10 alanine residues
as defined in the colvars module of NAMD. The α colvar is calculated
using a scoring function for the backbone i, i + 4 hydrogen bonding, and the dihedral angles compared
to that of a pure α-helix, normalized between 0 and 1. The default
parameters for the α colvar as defined in the colvars module
were used in all simulations. For two-dimensional PMFs in water, replica-exchange
molecular dynamics[51] was utilized with
US (REMD-US) to increase the sampling efficiency of the entire conformational
space. Integration of the 2D PMF to obtain a 1D PMF was calculated
according to the following equation:[52]where β = (kBT)−1, kB is the Boltzmann
constant, T is the temperature, W(x, y) is the 2D PMF, w(x) the corresponding 1D PMF, and (x, y) is an arbitrary point in the collective-variable
space.Along the end-to-end distance coordinate, 20 US windows
centered
at ξ = 12.5 Å, 13.5 Å, ..., 31.5 Å were used
with a force constant of 5.0 kcal/Å2·mol for
each window. Along the α-helical content coordinate, nine US
windows centered at α = 0.1, 0.2, ..., 0.9 were used in vacuum
and 17 US windows centered at α = 0.1, 0.15, ..., 0.9 were used
in water with a force constant of 500.0 kcal/α2·mol
for each window. In vacuum, US windows were simulated for 5–10
ns per window for one-dimensional PMFs and 15 ns per window for two-dimensional
PMFs. In water, US windows were simulated for 5 ns per window for
one-dimensional PMFs and 20 ns per window for two-dimensional PMFs.
The first 1–2 ns were not included in the PMF calculations
to ensure the system was in equilibrium. ABF simulations were run
for 50–100 ns in total. All ABF simulations used a threshold
of 500 samples (“fullsamples”) prior to the application
of the bias, unless noted otherwise. Starting states along each reaction
coordinate were generated either using steered molecular dynamics
(SMD) or from one-dimensional unrestrained ABF trajectories.
Results
One-Dimensional
PMFs
To examine the efficacy of our
methods, we first determined the PMF of deca-alanine in vacuum using
the end-to-end distance reaction coordinate, denoted ξ. Using
both the US and ABF approaches (see Methods), we calculated the PMF with the CHARMM22/CMAP and CHARMM36 force
fields. The two approaches yield nearly identical free-energy profiles
for both force fields (Figure 1). Examination
of the simulation trajectories shows that both methods produced only
the accordion-like folding/refolding mechanism as shown in Figure 2, where unfolding begins at one end of the peptide
and propagates to the other end, suggesting a cooperative folding
mechanism.[53]
Figure 1
One-dimensional PMF of
Ala10 in vacuum using the distance
from the N-terminus carbonyl carbon to the C-terminus carbonyl carbon
as the reaction coordinate. Red lines represent calculations using
ABF, and blue lines represent calculations using US, with solid lines
utilizing the CHARMM36 force field and dashed lines utilizing the
CHARMM22/CMAP force field.
Figure 2
Unfolding of Ala10. This “accordion-like”
unfolding mechanism of Ala10 was generated using SMD by
pulling on the C-terminus Cα while
keeping the N-terminus Cα fixed.
Three snapshots of the peptide are shown in various stages of the
SMD simulations, drawn using the “licorice” representation
for all atoms and a cartoon representation for the backbone structure,
where the thick ribbon represents those residues which are in an α-helical
state. The top image represents the initial, minimized crystal structure
of Ala10 used as the starting state. The middle image represents
an intermediate state in which the peptide is partially extended while
the remaining portion of the peptide is still in an α-helical
state. The bottom image represents the fully extended state.
One-dimensional PMF of
Ala10 in vacuum using the distance
from the N-terminus carbonyl carbon to the C-terminus carbonyl carbon
as the reaction coordinate. Red lines represent calculations using
ABF, and blue lines represent calculations using US, with solid lines
utilizing the CHARMM36 force field and dashed lines utilizing the
CHARMM22/CMAP force field.Unfolding of Ala10. This “accordion-like”
unfolding mechanism of Ala10 was generated using SMD by
pulling on the C-terminus Cα while
keeping the N-terminus Cα fixed.
Three snapshots of the peptide are shown in various stages of the
SMD simulations, drawn using the “licorice” representation
for all atoms and a cartoon representation for the backbone structure,
where the thick ribbon represents those residues which are in an α-helical
state. The top image represents the initial, minimized crystal structure
of Ala10 used as the starting state. The middle image represents
an intermediate state in which the peptide is partially extended while
the remaining portion of the peptide is still in an α-helical
state. The bottom image represents the fully extended state.The results of the CHARMM36 force
field agree quite well with previously
reported PMFs.[54−57] There is only one stable conformation around ξ = 14.2 Å,
which corresponds to the pure α-helical state, and there is
an energy barrier of ∼25 kcal/mol from the helical to extended
state (Figure 1). While the CHARMM22/CMAP force
field does yield a minimum near the α-helical state, the entire
PMF is shifted by ∼2 Å toward lower end-to-end distances,
and the energy barrier between helical and extended states is slightly
higher (∼30 kcal/mol). The corrections to the CHARMM22/CMAP
force field added to the CHARMM36 force field are evident in the difference
in the folding PMFs for Ala10. Since CHARMM36 reproduces
the expected free-energy minimum for the α-helical state in
a vacuum[58] and significantly improves agreement
with helix-formation experiments,[44] from
here on we used solely the CHARMM36 force field.The ABF and
US simulations of Ala10 were then repeated
in explicit water. Both methods yield a relatively flat PMF, compared
to the vacuum PMF, along most of the reaction coordinate (Figure 3, thick, solid lines). The trajectories reveal that
there is no longer only the folding/refolding mechanism seen in vacuum;
instead, the peptide transitions between extended states and compact,
but nonhelical, states. These nonhelical states are characterized
by various hairpin structures (Figure 4). Figure 5 shows the prevalence of low-helical, compact states
in water as opposed to vacuum, which are “contaminating”
the PMF at low end-to-end distances.
Figure 3
One-dimensional PMF of Ala10 along the end-to-end distance
of the peptide calculated using ABF (top) and US (bottom). Top graph:
no additional restraints (thick, solid line), no restraints with 50 000
fullsamples (thick, dashed line), 8 Å-radial confinement (thin,
dashed line), 10 Å-radial confinement plus antihairpin restraint
(thin, dotted-dashed line). Bottom graph: no additional restraints
(thick, solid line), 10 Å-radial confinement (thin, dashed line),
10 Å-radial confinement plus antihairpin restraint (thin, dotted-dashed
line).
Figure 4
Representative set of compact, low-α-helical
content states
of Ala10 in water. The Ala10 peptide is shown
in the “licorice” representation with the backbone α-helical
content represented in orange by a cartoon representation. Water molecules
are not shown.
Figure 5
Scatter plot of states
from ABF simulations in vacuum (top) and
water (bottom). Top graph: scatter plot of states from 50 ns ABF simulation
in vacuum using the CHARMM36 force field. Bottom graph: scatter plot
of states from 100-ns ABF simulation in water with 10 Å-radial
confinement plus antihairpin restraint.
One-dimensional PMF of Ala10 along the end-to-end distance
of the peptide calculated using ABF (top) and US (bottom). Top graph:
no additional restraints (thick, solid line), no restraints with 50 000
fullsamples (thick, dashed line), 8 Å-radial confinement (thin,
dashed line), 10 Å-radial confinement plus antihairpin restraint
(thin, dotted-dashed line). Bottom graph: no additional restraints
(thick, solid line), 10 Å-radial confinement (thin, dashed line),
10 Å-radial confinement plus antihairpin restraint (thin, dotted-dashed
line).Representative set of compact, low-α-helical
content states
of Ala10 in water. The Ala10 peptide is shown
in the “licorice” representation with the backbone α-helical
content represented in orange by a cartoon representation. Water molecules
are not shown.Scatter plot of states
from ABF simulations in vacuum (top) and
water (bottom). Top graph: scatter plot of states from 50 ns ABF simulation
in vacuum using the CHARMM36 force field. Bottom graph: scatter plot
of states from 100-ns ABF simulation in water with 10 Å-radial
confinement plus antihairpin restraint.In order to enforce the α-helical folding/refolding
mechanism
observed in vacuum, multiple additional restraints were imposed. First,
the peptide backbone was confined to a cylindrical tube of radius
10 Å centered along the end-to-end distance vector. This confinement,
as well as a smaller tube of radius 8 Å, failed to prevent the
formation of compact nonhelical states, and the PMFs produced were
either unchanged or inconsistent (Figure 3,
dashed lines), likely because convergence was not achieved. An additional
antihairpin restraint, which prevents the backbone Cα’s from passing one another in relation
to the end-to-end distance vector, was also insufficient to produce
the simple folding/refolding mechanism (Figure 3, dotted-dashed lines). The persistent formation by Ala10 of these nonhelical, compact states from extended states reveals
a more dynamic folding process than that seen in a vacuum. Indeed,
previous simulations of Ala10 have shown that the disordered
and extended states are much more soluble in water than the α-helix.[59] Instead of running the 1D US simulations longer,
because the prevalence of compact, nonhelical states makes it difficult
to determine when convergence will be achieved, we switched to a 2D
description to ensure adequate sampling of these additional states.
Two-Dimensional PMFs
To examine the effects of compact,
nonhelical states on the free energy of Ala10 folding,
we calculated a two-dimensional PMF using US, with α-helical
content as a second reaction coordinate. We first verified this 2D
description by calculating the PMF in vacuum with umbrella sampling
(Figure 6, top). There is still only one minimum
in the pure α-helical state, as was seen in the 1D PMF. In addition,
we calculated the least free-energy path,[60] which finds the path of least free-energy difference between two
local minima on a 2D free-energy surface, from the α-helical
state to an extended state. There is close agreement between this
path (Figure 6, top inset) and the 1D PMF (Figure 1), suggesting that the folding/refolding mechanism
observed in the 1D biased simulations is in fact the primary mechanism
of folding for Ala10 in vacuum.
Figure 6
Two-dimensional PMF of
Ala10 in vacuum (top) and water
(bottom) using end-to-end distance and α-helical content as
the two reaction coordinates. Green line represents the least free-energy
path from the α-helical state to the extended state. The inset
shows the PMF along the least free-energy path, as projected onto
the end-to-end distance coordinate.
Two-dimensional PMF of
Ala10 in vacuum (top) and water
(bottom) using end-to-end distance and α-helical content as
the two reaction coordinates. Green line represents the least free-energy
path from the α-helical state to the extended state. The inset
shows the PMF along the least free-energy path, as projected onto
the end-to-end distance coordinate.On the basis of the successful application to the vacuum
case,
a 2D REMD-US (see Methods) simulation was
performed for Ala10 in water. One can immediately see that
a free-energy “trough” has appeared along a family of
extended, nonhelical states (Figure 6, bottom).
These states are also of free energy comparable to the pure α-helix,
differing by less than 1 kcal/mol, and the energy barrier between
the helical and extended states is now less than 4 kcal/mol. The least
free-energy path explores a wider range of extended states before
refolding compared with the vacuum case.One notable feature
of the 2D PMF is the appearance of “bands”
in the free energy along lines of constant helical content, which
were presumed to be indications of poor overlap between neighboring
windows when implementing the WHAM algorithm. The poor overlap was
confirmed by plotting the histograms (data not shown), and thus, the
number of windows along the α coordinate was increased from
9 to 17 (see Methods). However, the bands
still remained as seen in Figure 6, bottom
graph. These can be seen more explicitly in the PMF along the least
free-energy path, which shows five distinct local minima between the
α-helical state and the fully extended state (Figure 6, bottom inset). There is less than 1 kcal/mol difference
between PMFs calculated for 15 ns per window and 20 ns per window
for the entire range of our reaction coordinates, which suggests that
the PMFs are converged.To validate the path as well as our
choice of reaction coordinates,
we also made a rough estimate of the committor distribution for the
free-energy maximum.[61,62] Fifty separate conformations
were each used to initiate 50 10-ps simulations (2500 simulations
and 25 ns in total) with random velocities. The committor was judged
to be progressing to the extended state or retreating to the helical
state based on the values of ξ and α (see Figure 6, bottom) although full commitment to either minimum
would require time scales at least 100× as long (see below).
The resulting distribution is peaked at 0.5, with some bias toward
values greater than 0.5 (see Figure S1).
Overall, the behavior of the committor near the barrier suggests that
it is representative of a true transition state.
Equilibrium
Simulations
Equilibrium simulations of
deca-alanine in water were performed starting from different states
for 50 ns each to validate the 2D PMF and to better understand the
folding pathway. Starting from a state that is near the α-helical
minimum observed in the 2D PMF, in equilibrium the peptide initially
samples the region around the minimum. As the simulation progresses,
the peptide begins to unfold roughly along the least free-energy path,
and similar bands as seen in the PMF also appear in the histogram,
despite no biasing being applied (Figure 7).
The protein folds and refolds until finally becoming fully extended
near the end of the 50-ns simulation.
Figure 7
Histograms of 50 ns equilibrium simulations
for an α-helical
(top) and extended (bottom) starting states. Green lines represent
the previously determined least free-energy path.
Histograms of 50 ns equilibrium simulations
for an α-helical
(top) and extended (bottom) starting states. Green lines represent
the previously determined least free-energy path.Starting from an extended state, Ala10 explores
the
range of extended and compact nonhelical states predicted by the free-energy
trough seen in the 2D PMF. The peptide eventually folds into an α-helix
near the end of the simulation in much the same manner as in the unfolding
case. Examination of the hydrogen bonding of Ala10 with
itself and with water during the equilibrium simulations reveals that
the transition between helical and extended states occurs in ∼5
ns with the formation or breaking of ∼4 peptide–peptide
hydrogen bonds, with a corresponding decrease or increase in water–peptide
hydrogen bonds, respectively (Figure S2), in agreement with the results of Ozer et al.[63] We see similar results when examining the water–peptide
hydrogen bonds from the REMD-US trajectories, with an increase of
∼8–10 hydrogen bonds from folded to extended states
(Figure S3).Based on the success
of the two previous equilibrium simulations,
we ran 20 additional simulations to get better sampling of the folding
mechanism: 10 starting from the α-helical minimum and 10 starting
from the extended minimum. With only two exceptions, all initially
extended states also sampled the helical state during the 50 ns simulations,
while all initially α-helical states sampled the extended states
as well. The histograms for these simulations (Figure S4) are practically identical to one another, demonstrating
the reproducibility of the conformational equilibria predicted by
the PMFs.Examination of the hydrogen bonding between the ith residue and the i+4th residue for the
simulations
in which folding was observed reveals some cooperativity at the N-terminus,
where the formation of the Ala1–Ala5hydrogen bond initiates
a propagation of hydrogen bond formation toward the C-terminus as
the peptide folds from an extended state to an α-helical state
(Figure S5, left graphs). In contrast,
the C-terminus exhibits less cooperativity, with unfolding typically
beginning at the C-terminus and propagating in the opposite direction
of folding in the majority of our simulations (Figure S5, right graphs). On average, the N-terminal hydrogen
bonds persist longer than those on the C-terminal side while the protein
is in a folded conformation. These results are consistent with a previous
study which observed that the N-terminus of Ala10 slightly
favors the α-helical conformation over the β-sheet conformation,
whereas the opposite was observed for the C-terminus.[5] We do observe, however, cases in which the N-terminal hydrogen
bonds were broken while the C-terminal hydrogen bonds remained intact
(Figure S5, bottom right graph), although
these occurred much less frequently. The REMD-US trajectories of Ala10 in water yields a similar preference for N-terminal hydrogen
bond formation (Figure S6) near the α-helical
state. The N-terminal hydrogen bonds also persist for a longer portion
of the unfolding pathway (Figure 6, bottom
graph) than the C-terminal hydrogen bonds. In addition, we observed
very little 310-helical (i, i + 3) hydrogen bonding for both the REMD-US and equilibrium trajectories
(data not shown), confirming that the 310-helix is not
a folding intermediate. Thus, the folding pathway consists of the
breaking or formation of α-helical hydrogen bonds and not the
rearrangement of those hydrogen bonds into a 310-helical
structure.
1D PMF from Integration of 2D PMF
After validation
of the free energy minima observed in the 2D PMF of Ala10 folding in water by equilibrium simulations, we integrated out the
α coordinate according to eq 1 (see Methods) to generate a 1D PMF along the distance
coordinate (Figure 8). This integrated PMF
still yields the free-energy minimum observed for Ala10 in vacuum around ξ = 14.3 Å. The main difference between
the integrated PMF in Figure 8 and the PMF
in vacuum (Figure 1) is in the unfolding region
(ξ > 15 Å), with the PMF reduced by more than 20 kcal/mol
in the extended state. This reduction is comparable to that seen in
the previous unrestrained 1D PMFs calculated for Ala10 (see
Figure 3). However, by ensuring that Ala10 more fully explores its entire conformational space through
biasing of the additional α reaction coordinate, two free energy
minima are revealed in the compact states and extended states, respectively,
that were not found by biasing of the ξ reaction coordinate
alone. The appearance of these new minima supports the suggestion
that the previous 1D PMFs had not yet converged. Calculation of the
free-energy difference between these two minima establishes that the
compact state is slightly favored over the extended states (ΔG = −0.4 kcal/mol), with compact states defined as
ξ ≤ 16.75 Å, i.e., below the peak of the energy
barrier between the minima.
Figure 8
One-dimensional PMF of Ala10 in water
calculated by
integration of the 2D PMF using eq 1.
One-dimensional PMF of Ala10 in water
calculated by
integration of the 2D PMF using eq 1.
Discussion
The
free energy of folding for Ala10 in vacuum has been
used as a benchmark for many free-energy calculation methods, using
the end-to-end distance (ξ) of the peptide as the reaction coordinate.
This choice of reaction coordinate implicitly presumes a reversible,
accordion-like folding/refolding of the peptide. We observed using
both US and ABF that in vacuum, this presumption is indeed correct
and only the simple folding/unfolding mechanism predicted is found.
In the presence of water, however, the folding/unfolding mechanism
is much more complex, and biasing the refolding of Ala10 back into an α-helix from an extended state is nontrivial.Using multiple biasing methods—US and ABF—we discovered
that the water-solvated Ala10 will explore an extended
range of compact, nonhelical states before refolding back into an
α-helix. These compact, nonhelical states appeared to be “contaminating”
the low-ξ region of the 1D PMFs calculated from the two biasing
methods creating a relatively flat PMF compared to the PMF calculated
for Ala10 in vacuum (Figure 3).
Calculation of 2D PMFs for the entire (ξ, α) collective-variable
space revealed a new free-energy minimum in a family of extended states
not observed in vacuum, along with the α-helical minimum that
was originally observed in vacuum.In water, the free energy
of the extended states decreased significantly
compared to in vacuum, with the α-helical state less than 1
kcal/mol lower in energy than the extended states—a decrease
from more than 20 kcal/mol observed in vacuum. A barrier of ∼4
kcal/mol between the two energy minima is also observed in the PMF.
Previous studies of Ala10 in water have also shown a decrease
in the free-energy difference between extended and helical states.
For example, ABF was applied to a zwitterionic form of Ala10 with charged termini to calculate a 1D PMF in water, using the average
length of the i, i + 4 hydrogen
bonds of the backbone as a reaction coordinate.[39] That PMF shows a comparable free-energy barrier of ∼5
kcal/mol between the extended and helical states, with the relative
energies differing by ∼1 kcal/mol. The decrease in the free-energy
difference was not as pronounced in other studies utilizing adaptive
SMD[63] and US[64] applied to the end-to-end distance of Ala10 with neutral
termini. Additionally, neither study discovered the free-energy minimum
in the extended states. Levy et al. found that in hydrophilic environments,
the α-helix is actually destabilized relative to β-sheet
conformations, due to their high conformational entropy compared to
that of the α-helix.[58] Although we
observed a minimum in extended states rather than β-sheet conformations,
the extended states are similarly entropically favored over the α-helical
state and extend into the range of compact, nonhelical states (Figure 6). As a check on our 2D PMF, we performed 50 ns
equilibrium simulations of Ala10 in water starting from
α-helical and extended states (Figure 7). These simulations confirmed both minima observed in the 2D PMF.
Furthermore, transitions between the two minima demonstrated cooperativity
at the N-terminus and noncooperativity at the C-terminus, as expected.[5]Although alanine is used as a model for
protein folding since it
has the highest helix propensity of all amino acids, the folding mechanism
for Ala-based peptides is still not very well understood. Experiments
studying short Ala-based peptides have led to inconclusive or contradictory
observations for the stability of the α-helical state in water.
Rohl et al. observed that Ala-based peptides are the only stable helix
formers in water for the 20 common amino acids.[65] They postulated that reducing the extent of solvation of
the coiled backbone could increase stabilization of the helix. Experiments
performed by Blondelle et al. showed that peptides of the form Ac–KYAK–NH2 (10 ≤ n ≤ 14) coexisted as a β-sheet and α-helix
to varying extents.[66] However, it was later
observed that the stability of the helix in these peptides was due
to the solubility of the flanking Lys residues, and not the intrinsic
helix propensity of Ala.[67] This was followed
up by Spek et al. stating that although the increase in α-content
of KAK is an artifact of the flanking
Lys residues, Ala is intrinsically α-helix stabilizing.[68] So, although there is some discrepancy for the
stability of secondary structures for Ala-based peptides, the evidence
suggests that these peptides do not solely exist as an α-helix
in solution as one might suspect based on its strong preference for
the helical state in vacuum. Instead they exist in some combination
of secondary structures, including α-helices and β-sheets.
Indeed, more recently, NMR data for short polyalanine peptides (Ala3–7) show that these peptides exist primarily as polyproline
II (PPII) helix-like structures with very little population
of the α-helical structure.[69]Our results detail the stability, or lack thereof, of the α-helix
for short polyalanine peptides in solution. However, one should always
be aware of the limitations of the force fields utilized in MD simulations.
Best et al. examined a range of force fields and observed that they
overemphasized the α-helix structure compared with NMR coupling
parameters for the backbone (ϕ, ψ) dihedrals.[70] By reweighting the force fields based on these
(ϕ, ψ) coupling parameters,[69] they were able to yield good agreement with the population of the
α-helix, β-sheet, and PPII structures seen
in the NMR data. Although there was better agreement with NMR for
unblocked (ionic or zwitterionic) termini than with blocked (neutral)
termini, by using the reweighting for unblocked peptides, blocked
peptides were found to yield α-content of 12–23%. Similar
results were found using AGADIR,[71] an empirical
NMR-based algorithm that determines the α-helical propensity
of a peptide based on sequence, which yields a helical propensity
of ∼15% for Ac–Ala10–NH2 in water at 300 K.In this study, we have used the most recent
iteration of the CHARMM
force field, CHARMM36, introduced in 2012.[44,45] One of the major improvements of CHARMM36 is the correction of the
α-helical bias introduced into CHARMM22 by the backbone (ϕ,
ψ) dihedral CMAP potential. This improvement was achieved by
capturing the many-body effects not present in the original CMAP potential.
The CMAP potential was optimized to match NMR data for Ala5[69] and Ac-(AAQAA)3-NH2[72] in solution, and the side-chain χ1 and χ2 dihedrals were optimized to QM energy
surfaces. The result is a better balance between secondary structures,
particularly the α-helix and β-sheet, addressing the problem
posed originally by Best et al. in 2008.[70] In particular, the helical fraction of Ac-(AAQAA)3-NH2 produced by CHARMM36 more closely matches experiments[72] than other force fields (a reduction from 95%
for CHARMM22/CMAP to 21% for CHARMM36), as well as improved cooperativity
for α-helix and β-sheet formation. Thus, the helical fraction
of Ala10, as well as its folding mechanism, determined
using CHARMM36 should also have better agreement with experimental
results, particularly when compared with CHARMM22 and CHARMM22/CMAP,
which were utilized in previous unfolding simulations of Ala10 in water.[63,64] How the balance between conformational
changes in the presence of other peptides and proteins occurs remains
uncertain. While it is known that macromolecular crowding can have
a significant effect on protein folding in vivo,[73,74] a recent study on dipeptide aggregation demonstrated that simulations
still have some difficulty reproducing such effects quantitatively.[75]Finally, our work emphasizes the challenge
and necessity of choosing
relevant reaction coordinates to fully characterize a particular system.[76] For Ala10, the end-to-end distance
is no longer sufficient to capture the diversity of conformations
explored in water, thus making it a highly degenerate reaction coordinate
(Figure 4). Contributions from compact, nonhelical
states can produce a PMF that does not reveal what one intuitively
expects it to, namely the accordion-like folding/refolding mechanism
shown in Figure 2. By tracking the α-helical
content of Ala10 during biased folding simulations, we
found many states accessible to the peptide in water that were inaccessible
in vacuum. These states arise due to Ala10’s increased
flexibility in water, where a loss of intrapeptide hydrogen bonds
is compensated by an increase in peptide-waterhydrogen bonds (Figures S2, S3). Reaction coordinates suitable
for Ala10 in vacuum, therefore, may not be suitable in
water.[26,58] Since recent studies of Ala10 folding in water have only used the end-to-end distance to characterize
the folding pathway, they observe that the preference for the α-helical
state is still significant in water compared to the extended states.
However, we have shown that the helical and extended states are of
comparable stability (|ΔG| < 1 kcal/mol),
with both states transitioning from one to the other within the course
of 50 ns equilibrium simulations.
Authors: Robert B Best; Xiao Zhu; Jihyun Shim; Pedro E M Lopes; Jeetain Mittal; Michael Feig; Alexander D Mackerell Journal: J Chem Theory Comput Date: 2012-07-18 Impact factor: 6.006