Literature DB >> 23316122

w-REXAMD: A Hamiltonian Replica Exchange Approach to Improve Free Energy Calculations for Systems with Kinetically Trapped Conformations.

Mehrnoosh Arrar¹, Cesar Augusto F de Oliveira, Mikolai Fajer, William Sinko, J Andrew McCammon.

Abstract

Free energy governs the equilibrium extent of many biological processes. High barriers separating free energy minima often limit the sampling in molecular dynamics (MD) simulations, leading to inaccurate free energies. Here, we demonstrate enhanced sampling and improved free energy calculations, relative to conventional MD, using windowed accelerated MD within a Hamiltonian replica exchange framework (w-REXAMD). We show that for a case in which multiple conformations are separated by large free energy barriers, w-REXAMD is a useful enhanced sampling technique, without any necessary reweighting.

Entities: Chemical Disease Gene

Year: 2012 PMID： 23316122 PMCID： PMC3541756 DOI： 10.1021/ct300896h

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

The change in the Gibbs free energy is the thermodynamic property that drives equilibrium aspects of complex biological processes like the folding of proteins, drug binding, and molecular recognition of proteins in cell signaling pathways.[1−3] A great deal of progress has been made in the area of free energy calculations in recent years.[4−8] One of the obstacles in using computer simulation to calculate free energy changes, for example in the solvation of small molecules like ibuprofen, or in comparing protein–drug binding interactions, is the limit of conformational sampling.[9,10] A transformation of interest, perhaps changing a substitutent on a drug-like molecule, or a mutation in a protein active site, may involve a conformation change that is inaccessible on the time scale of the simulation. Without considering the mutation in both possible conformations, the calculated free energy change for the process is inaccurate. This sampling problem can be particularly difficult to solve if the two (or more) conformations are not known beforehand. Enhanced sampling techniques are often leveraged to mitigate the sampling problem.[11−13] Accelerated molecular dynamics (aMD)[14] is a Hamiltonian-modifying technique that has been used extensively in our group to simulate micro- to millisecond time scale conformational changes in biomolecules.[15−23] The original form of aMD adds a bias to potential energy minima, making barriers easier to cross, without defining a reaction coordinate a priori. While this method has been shown to be a useful tool in exploring the conformational space of large proteins, the simulation must be reweighted to recover the Boltzmann distribution. In this exponential reweighting, the conformations at energy minima make significantly larger contributions than all other conformations to the average value, and the effective number of data points is reduced, introducing statistical noise in ensemble averages. By using the original aMD equation in Hamiltonian replica exchange (REXAMD), Fajer et al. showed that the exponential reweighting problem could be ameliorated by simulating replicas at various states of acceleration, including an unaccelerated “ground” state, and collecting data only from the unaccelerated state.[24] Recently a new “windowed” aMD (w-aMD) method has been introduced in which the potential energy barriers are scaled down while preserving the original features of the landscape (eqs 1 and 2).[25] The advantage of this implementation relative to the original aMD implementation is that the structures already at energy minima remain on the unbiased potential energy surface, and energy barriers are scaled without affecting the overall shape of the energy landscape. While this method was shown to be an improvement on the previous barrier-lowering aMD method,[26] one limitation of the windowed aMD method is the introduction of additional parameters that determine the degree of acceleration: E1 and E2 define the window in which the dynamics are accelerated, while α1 and α2 are tuning parameters (see Supporting Information for details). Here, we implement the windowed aMD equation in a Hamiltonian replica exchange framework (w-REXAMD), in which multiple sets of acceleration parameters distinguish the Hamiltonians of exchanging replicas.where In our discussion of w-REXAMD simulations, a “state” refers to a particular set of aMD parameters introduced in eq 2. A “replica” refers to an evolving configuration which may exchange its state with that of its neighbor’s, according to the Metropolis criterion. In an analogous temperature replica exchange simulation, the “state” would be defined by the temperature, while a “replica” may visit different temperatures. We anticipate that the additional conformational sampling from multiple replicas at different acceleration states will enrich the sampling in the unaccelerated state ensemble. As with any replica exchange method though, we must acknowledge the increase in computational cost associated with the multiple replicas. To account for both the advantage in sampling simply from multiple independent simulations and the limitation of increased computational cost, we compare w-REXAMD to conventional MD in the same replica exchange framework, which we refer to as REGREX. In a REGREX simulation, multiple independent conventional MD replicas exchange their configurations at the same frequency at which exchanges are attempted in a w-REXAMD simulation. The states in a REGREX simulation are equivalent to each other, and each state contains equivalent portions of independent MD trajectories. One REGREX state, then, may also benefit from conformational space sampled in multiple replica trajectories but will still contain the same number of structures as one w-REXAMD state. We first compare the extent of conformational sampling in w-REXAMD and REGREX simulations of two systems: butane and cyclohexane, both in explicit water solvent. For the case of butane, conventional MD, and thus REGREX simulations, can exhaustively sample the easily characterizable conformational space of the system (Figure 1A) and provide a reference for a converged result. In the case of cyclohexane, however, a large (≈10 kcal/mol) barrier separates the two stable chair conformations (Figure 2A), making the interconversion between the two conformations, so-called “chair-flipping,” a rare event that occurs on the microsecond time scale.[27] Since cyclohexane is a model system in which the time scales are no longer accessible for reference REGREX simulations, we instead use umbrella sampling calculations as an appropriate reference to which we compare ensembles generated in w-REXAMD simulations.

Figure 1

Figure 2

Conformational sampling of cyclohexane. (A) Umbrella sampling along the ε coordinate shows a high barrier between chair conformations. (B) Observed probability of chair and boat-like conformations from w-REXAMD (black series), windowed-aMD (red series), and REGREX (magenta series) simulations. Error bars show standard deviation from three independent simulations. Green bars mark the expected probability from umbrella sampling calculation in A as a reference. (C) ε conformation coordinate during windowed-aMD (red series), w-REXAMD (black series), and REGREX (magenta series) shows interconversion of cyclohexane conformations.

Conformational sampling of butane. (A) A schematic of butane conformational landscape according to the C1–C2–C3–C4 dihedral angle (the cis and eclipsed (+) structures are shifted slightly so that the opposed groups are visible). Fractions of state ensembles in the trans (black series), gauche(−) (red series), gauche(+) (blue series), and eclipsed (green series) are shown for (B) REGREX, (C) w-REXAMD, and (D) w-aMD simulations. Total number of frames corresponds to 500 ps of simulation. Error bars show standard deviation from three independent simulations. Conformational sampling of cyclohexane. (A) Umbrella sampling along the ε coordinate shows a high barrier between chair conformations. (B) Observed probability of chair and boat-like conformations from w-REXAMD (black series), windowed-aMD (red series), and REGREX (magenta series) simulations. Error bars show standard deviation from three independent simulations. Green bars mark the expected probability from umbrella sampling calculation in A as a reference. (C) ε conformation coordinate during windowed-aMD (red series), w-REXAMD (black series), and REGREX (magenta series) shows interconversion of cyclohexane conformations. We then present an application of w-REXAMD to free energy calculations using an alchemical identity transformation. As mentioned earlier, multiple kinetically trapped conformations can be a major obstacle in free energy calculations. Here, we discuss a simple example of such asymmetry in the transformation of bromocyclohexane to itself. This identity transformation, in which the bromine is displaced from one carbon to another (shown in Figure 3), should result in a free energy change of zero and serves as a force-field independent benchmark. With the bromine substituent, the two chair conformations of the cyclohexane are no longer equivalent. The bromine occupies either an equatorial or axial position, and interconversion of these two conformers occurs on the microsecond time scale. In our alchemical self-transformation, the bromine in equatorial-substituted bromocyclohexane is displaced to the axial position at another carbon. The axial and equatorial conformers exist in equilibrium, so constructing the transformation in this way requires sampling of both conformers according to their thermodynamic probabilities in order to reach the analytical result of zero. On the typical nanosecond time scale of MD simulations, the axial and equatorial configurations are kinetically trapped, and the identity transformation is a good test case for the convergence problem encountered when multiple conformations are separated by high barriers. We show that the enhanced sampling in w-REXAMD simulations is useful in mitigating this convergence issue in the free energy calculation.

Figure 3

Thermodynamic cycle of alchemical identity transformation of bromocyclohexane. In alchemical transformations 1 and 2, the bromine substituent is displaced from C1 to C4, either from an equatorial to an axial position (transformation 1: “C1,eq-to-C4,ax”) or vice versa (transformation 2: “C1,ax-to-C4,eq”). Transformations 3 and 4 represent the slow conformational change isolating the two conformers in conventional MD. ΔG1 + ΔG4 = ΔG3 + ΔG2 = 0 kcal/mol.

Results and Discussion

Conformational Sampling

We distinguish between state trajectories (corresponding to the ensemble of structures sampled with a certain set of acceleration parameters) and replica trajectories (corresponding to the trajectory of a replica as it visited different states). We use the dihedral angle (C1–C2–C3–C4) of butane during the simulations to monitor the interconversion between gauche(∓) and trans conformations of butane (Figure 1A). In order to probe convergence in conformational sampling in butane, we clustered multiple independent unaccelerated state trajectories into gauche(+), gauche(−), or trans conformations and plot the cumulative fraction for each conformation during the time scale of the w-REXAMD, REGREX, and w-aMD simulations. In the w-REXAMD simulations, all conformation populations converge to fractional occupancies that lie within the error intervals reached in reference REGREX simulations (Figure 1B and C). The two equivalent gauche conformations quickly converge in the w-REXAMD simulations in considerably fewer steps than in the REGREX simulations. For the nonexchanging w-aMD simulations, the equivalent gauche conformations are sampled exhaustively (Figure 1D), but the less favorable eclipsed conformation is sampled to a greater extent, indicating the effectiveness of the windowed aMD equation in scaling down the barriers but illustrating the need for Boltzmann reweighting in order to attain information about the equilibrium ensemble. Unlike those of butane, the two stable conformations of cyclohexane are separated by a 10 kcal/mol free energy barrier, making interconversion between the two conformations a rare event.[27] Using a conformation coordinate ε (eq 3), which is a function of the six correlated cyclohexane dihedral angles (τ), we analyzed the sampled conformations of cyclohexane (chair1, boat-like, and chair2).[17] By clustering the ε coordinate over multiple trajectories, we were able to observe the fractional occupancies of the two chair conformations (ε ≈ ± 50) of cyclohexane. Chair-flipping cannot be observed on reasonable time scales of cMD (0.5 μs, data not shown), and REGREX simulations similarly only sample the initial chair conformation (Figure 2B, magenta series). The probability of each chair conformation for the unaccelerated state from w-REXAMD simulations agrees well with expected probability from reference umbrella calculations (Figure 2A and B, black series). The use of the w-aMD equation allows for enhanced sampling of cyclohexane conformations (Figure 2C, red series). Without the replica exchange framework, w-aMD simulations must be reweighted to obtain the same equilibrium populations (Figure 2B, red series). Although this reweighting is quite successful in recovering equilibrium populations for small systems such as these,[25] it has been shown to introduce statistical noise in ensemble averages.[28] This issue may be more prominent in more complex systems.

Alchemical Free Energy Calculations

In the previous section, we showed that w-REXAMD is a reasonable method to rapidly explore conformational space, particularly in the presence of high barriers separating relevant conformations. We expect, then, that w-REXAMD should also improve the convergence of free energy calculations. We consider the transformation of bromocyclohexane to itself, which has a free energy change of zero. Similar to cyclohexane, bromocyclohexane has a considerable barrier separating its two chair conformations, making sampling of the conformations a microsecond time scale event, presently inaccessible to routine alchemical free energy calculations. The presence of the bromine substituent creates a difference between the two chair conformations; one chair conformation is the axial conformer, while the other is the equatorial conformer. The equatorial conformer is thermodynamically favored, as it results in less steric hindrance between the bulky bromine and neighboring hydrogens, but bromocyclohexane exists in equilibrium between both conformers. To highlight the sampling problem, we let the two conformers be the initial and final states and considered the bromocyclohexane identity transformation, where the bromine is displaced from one carbon to another, in both directions: C1,eq-to-C4,ax, and C1,ax-to-C4,eq. In the C1,eq-to-C4,ax transformation, for example, the equatorial bromine at carbon 1 and an axial hydrogen at carbon 4 are decoupled from the rest of the system, while a bromine atom appears at the axial position on carbon 4 and a hydrogen atom appears at the equatorial position initially occupied by the bromine, at carbon 1 (Figure 3). Using the REGREX simulations at each of nine lambda intermediates along the alchemical pathway, we found that the observed free energy change for the transformation did not result in zero, and that it depended upon the directionality of the transformation (see Figure 4A). The REGREX C1,ax-to-C4,eq bromocyclohexane transformations converged to a value of approximately −0.2 kcal/mol (Figure 4A, red series), while the C1,eq-to-C4,ax self-transformations converged to approximately +0.3 kcal/mol (Figure 4A, blue series). Because of the presence of the high free energy barrier separating the two conformers, bromocyclohexane did not flip into its equatorial conformation during the alchemical transformation and instead remained exclusively in the axial conformation (see Figure 4B, blue series). In this case, where the relevant conformations are kinetically trapped, we find that the alchemical identity transformation results in a nonzero free energy change. Without any enhanced sampling, the alchemical self-transformations only complete either transformation 1 or 2 of the thermodynamic cycle in Figure 3, with ΔG1 = −ΔG2, within statistical uncertainty. In order for the analytical result to be reached with conventional MD, the conformational changes depicted by transformations 3 and 4 of the thermodynamic cycle in Figure 3 must also be completed. On the other hand, with the w-REXAMD method the initial and final states are able to sample both axial and equatorial conformations, and as a result the identity transformation quickly converges to the analytical result of 0 kcal/mol (Figure 4A, black series). It is important to note that for this small system, reweighting of the most accelerated w-aMD state trajectory also converges to the analytical result (Figure 4A, maroon series).

Figure 4

Free energy of identity transformation for bromocyclohexane. (A) REGREX transformations were done in both the C1,eq-to-C4,ax (blue series) and C1,ax-to-C4,eq (red series) directions. w-REXAMD transformations were performed for just the C1,eq-to-C4,ax direction. The free energy from the unaccelerated state of the w-REXAMD simulations is shown in the black series, and the reweighted most accelerated state is shown in maroon. Each series shows the average of three independent transformations, weighted by their statistical uncertainties. (B) Representative values for ε at λ = 0.7 for the transformations show that only one chair conformation is sampled in REGREX C1,eq-to-C4,ax (blue series) and C1,ax-to-C4,eq (red series) transformations, whereas both chair conformations are sampled in the w-REXAMD unaccelerated state of the C1,eq-to-C4,ax transformation (black series).

Conclusions

We presented an enhanced sampling method, w-REXAMD, which is capable of sampling conformations that would normally interchange on microsecond time scales. We showed that w-REXAMD could be used to improve alchemical free energy calculations, particularly when the conformation of the initial or final state is kinetically trapped. Similar to the use of accelerated molecular dynamics, an advantage of the w-REXAMD method is the gain in sampling without having to define any reaction coordinate a priori. The replica exchange framework offers the additional advantage of (1) avoiding any reweighting of data in obtaining equilibrium ensemble averages relevant for free energy calculations and (2) using multiple sets of acceleration parameters. To optimize the w-REXAMD method, future studies should focus on the distribution of the sets of acceleration parameters in order to both maximize sampling and retain efficient diffusion rates among replicas. The replica exchange framework does come with the need for additional parallel computing resources, and this need will be amplified as system size increases. In systems with multiple, slow-exchanging conformational states though, a common protocol is to calculate the change in free energy for some transformation, e.g., a mutation in an enzyme active site, for each constrained conformation separately, and then combine the transformations in a thermodynamic cycle to estimate the actual free energy of the mutation. We believe that in such cases, and particularly in cases where the conformational landscapes are not well characterized, w-REXAMD will be a useful enhanced sampling method.

Computational Methods

Hamiltonian Replica Exchange

Each thermodynamic state in the w-REXAMD simulation corresponds to a set of four parameters (E1, E2, α1, and α2) that determine the degree of acceleration of a replica at that state. Four replicas were used in each simulation, and the least accelerated state was always the unmodified Hamiltonian. Every 300 MD steps (0.3 ps), exchanges between adjacent replicas were attempted. A description of how parameters were selected can be found in the Supporting Information. Only torsional potential energy terms were accelerated in these test cases. The Amber ff99SB force field[29] was used for all simulations, with the TIP3P water model.[30]

Thermodynamic Integration

All atoms being coupled or decoupled according to the parameter λ were treated with soft-core potentials. Nine evenly spaced λ points were chosen between 0 and 1 for each transformation, and 1.5 ns of REGREX and w-REXAMD was run at each λ point (NVT) after an equilibration period of 150 ps (NPT). ⟨dV/dλ⟩λ from one state (the unaccelerated state in w-REXAMD simulations) was used for the free energy calculations. Free energies were calculated by numerically integrating the ⟨dV/dλ⟩λ values from 0 to 1 using the Trapezoid rule. The uncertainty associated with each of the nine ⟨dV/dλ⟩λ values was calculated aswhere N is the number of dV/dλ values used to compute the average and g is the statistical inefficiency, as defined by Chodera et al.,[31] which contains the integrated autocorrelation time τ (g = 1 + 2τ). The variance of the free energy from one transformation was estimated by propagating the uncertainty at each of the nine lambda values. Three independent C1,ax-to-C4,eq and C1,eq-to-C4,ax transformations were performed using REGREX, and three C1,eq-to-C4,ax transformations were done using w-REXAMD. The final free energies reported are averages weighted by each transformation’s uncertainty:where w = 1/δ2.

Umbrella Sampling

The umbrella sampling simulations were performed in NAMD[32] using the Amber ff99SB[29] force field. At each of 63 evenly spaced ε centers, ranging from ε = −62° to ε = 62°, a 2 ns MD simulation was performed with a harmonic potential centered at the value of ε. A force constant of 1.0 kcal/mol/deg2 was used for all simulations, and only the last 1.5 ns were used to construct the free energy profile. Initial structures were taken from pre-equilibrated structures from cyclohexane NVT simulations. The free energy profile was constructed using the python implementation of MBAR.[33]

28 in total

1. Accelerated molecular dynamics: a promising and efficient simulation method for biomolecules.

Authors: Donald Hamelberg; John Mongan; J Andrew McCammon
Journal: J Chem Phys Date: 2004-06-22 Impact factor: 3.488

Review 2. The many roles of computation in drug discovery.

Authors: William L Jorgensen
Journal: Science Date: 2004-03-19 Impact factor: 47.728

3. Predicting hydration free energies using all-atom molecular dynamics simulations and multiple starting conformations.

Authors: Pavel V Klimovich; David L Mobley
Journal: J Comput Aided Mol Des Date: 2010-04-06 Impact factor: 3.686

4. Relating kinetic rates and local energetic roughness by accelerated molecular-dynamics simulations.

Authors: Donald Hamelberg; Tongye Shen; J Andrew McCammon
Journal: J Chem Phys Date: 2005-06-22 Impact factor: 3.488

5. Comparison of multiple Amber force fields and development of improved protein backbone parameters.

Authors: Viktor Hornak; Robert Abel; Asim Okur; Bentley Strockbine; Adrian Roitberg; Carlos Simmerling
Journal: Proteins Date: 2006-11-15

6. A divide-and-conquer strategy to improve diffusion sampling in generalized ensemble simulations.

Authors: Donghong Min; Wei Yang
Journal: J Chem Phys Date: 2008-03-07 Impact factor: 3.488

7. Accelerating chemical reactions: exploring reactive free-energy surfaces using accelerated ab initio molecular dynamics.

Authors: Levi C T Pierce; Phineus R L Markwick; J Andrew McCammon; Nikos L Doltsinis
Journal: J Chem Phys Date: 2011-05-07 Impact factor: 3.488

8. Local elevation: a method for improving the searching properties of molecular dynamics simulation.

Authors: T Huber; A E Torda; W F van Gunsteren
Journal: J Comput Aided Mol Des Date: 1994-12 Impact factor: 3.686

Review 9. Statistical mechanics and molecular dynamics in evaluating thermodynamic properties of biomolecular recognition.

Authors: Jeff Wereszczynski; J Andrew McCammon
Journal: Q Rev Biophys Date: 2011-11-15 Impact factor: 5.318

10. Large-scale conformational changes of Trypanosoma cruzi proline racemase predicted by accelerated molecular dynamics simulation.

Authors: César Augusto F de Oliveira; Barry J Grant; Michelle Zhou; J Andrew McCammon
Journal: PLoS Comput Biol Date: 2011-10-13 Impact factor: 4.475

21 in total

1. Locally weighted histogram analysis and stochastic solution for large-scale multi-state free energy estimation.

Authors: Zhiqiang Tan; Junchao Xia; Bin W Zhang; Ronald M Levy
Journal: J Chem Phys Date: 2016-01-21 Impact factor: 3.488

2. Asynchronous Replica Exchange Software for Grid and Heterogeneous Computing.

Authors: Emilio Gallicchio; Junchao Xia; William F Flynn; Baofeng Zhang; Sade Samlalsingh; Ahmet Mentes; Ronald M Levy
Journal: Comput Phys Commun Date: 2015-11 Impact factor: 4.390

3. GPU-Accelerated Molecular Dynamics and Free Energy Methods in Amber18: Performance Enhancements and New Features.

Authors: Tai-Sung Lee; David S Cerutti; Dan Mermelstein; Charles Lin; Scott LeGrand; Timothy J Giese; Adrian Roitberg; David A Case; Ross C Walker; Darrin M York
Journal: J Chem Inf Model Date: 2018-09-25 Impact factor: 4.956

4. Roadmaps through free energy landscapes calculated using the multi-dimensional vFEP approach.

Authors: Tai-Sung Lee; Brian K Radak; Ming Huang; Kin-Yiu Wong; Darrin M York
Journal: J Chem Theory Comput Date: 2014-01-14 Impact factor: 6.006

5. Alchemical Binding Free Energy Calculations in AMBER20: Advances and Best Practices for Drug Discovery.

Authors: Tai-Sung Lee; Bryce K Allen; Timothy J Giese; Zhenyu Guo; Pengfei Li; Charles Lin; T Dwight McGee; David A Pearlman; Brian K Radak; Yujun Tao; Hsu-Chun Tsai; Huafeng Xu; Woody Sherman; Darrin M York
Journal: J Chem Inf Model Date: 2020-09-16 Impact factor: 4.956

6. Gaussian Accelerated Molecular Dynamics: Theory, Implementation, and Applications.

Authors: Yinglong Miao; J Andrew McCammon
Journal: Annu Rep Comput Chem Date: 2017-08-10

7. Large-scale asynchronous and distributed multidimensional replica exchange molecular simulations and efficiency analysis.

Authors: Junchao Xia; William F Flynn; Emilio Gallicchio; Bin W Zhang; Peng He; Zhiqiang Tan; Ronald M Levy
Journal: J Comput Chem Date: 2015-07-07 Impact factor: 3.376

8. Improving Prediction Accuracy of Binding Free Energies and Poses of HIV Integrase Complexes Using the Binding Energy Distribution Analysis Method with Flattening Potentials.

Authors: Junchao Xia; William Flynn; Ronald M Levy
Journal: J Chem Inf Model Date: 2018-07-03 Impact factor: 4.956

9. Massive-Scale Binding Free Energy Simulations of HIV Integrase Complexes Using Asynchronous Replica Exchange Framework Implemented on the IBM WCG Distributed Network.

Authors: Junchao Xia; William Flynn; Emilio Gallicchio; Keith Uplinger; Jonathan D Armstrong; Stefano Forli; Arthur J Olson; Ronald M Levy
Journal: J Chem Inf Model Date: 2019-02-22 Impact factor: 4.956

10. Unconstrained Enhanced Sampling for Free Energy Calculations of Biomolecules: A Review.

Authors: Yinglong Miao; J Andrew McCammon
Journal: Mol Simul Date: 2016-07-05 Impact factor: 2.178