We use thermodynamic integration (TI) and explicit solvent molecular dynamics (MD) simulation to estimate the absolute free energy of host-guest binding. In the unbound state, water molecules visit all of the internally accessible volume of the host, which is fully hydrated on all sides. Upon binding of an apolar guest, the toroidal host cavity is fully dehydrated; thus, during the intermediate λ stages along the integration, the hydration of the host fluctuates between hydrated and dehydrated states. Estimating free energies by TI can be especially challenging when there is a considerable difference in hydration between the two states of interest. We investigate these aspects using the popular TIP3P and TIP4P water models. TI free energy estimates through MD largely depend on water-related interactions, and water dynamics significantly affect the convergence of binding free energy calculations. Our results indicate that wetting/dewetting transitions play a major role in slowing the convergence of free energy estimation. We employ two alternative approaches-one analytical and the other empirically based on actual MD sampling-to correct for the standard state free energy. This correction is sizable (up to 4 kcal/mol), and the two approaches provide corrections that differ by about 1 kcal/mol. For the system considered here, the TIP4P water model combined with an analytical correction for the standard state free energy provides higher overall accuracy. This observation might be transferable to other systems in which water-related contributions dominate the binding process.
We use thermodynamic integration (TI) and explicit solvent molecular dynamics (MD) simulation to estimate the absolute free energy of host-guest binding. In the unbound state, water molecules visit all of the internally accessible volume of the host, which is fully hydrated on all sides. Upon binding of an apolar guest, the toroidal host cavity is fully dehydrated; thus, during the intermediate λ stages along the integration, the hydration of the host fluctuates between hydrated and dehydrated states. Estimating free energies by TI can be especially challenging when there is a considerable difference in hydration between the two states of interest. We investigate these aspects using the popular TIP3P and TIP4P water models. TI free energy estimates through MD largely depend on water-related interactions, and water dynamics significantly affect the convergence of binding free energy calculations. Our results indicate that wetting/dewetting transitions play a major role in slowing the convergence of free energy estimation. We employ two alternative approaches-one analytical and the other empirically based on actual MD sampling-to correct for the standard state free energy. This correction is sizable (up to 4 kcal/mol), and the two approaches provide corrections that differ by about 1 kcal/mol. For the system considered here, the TIP4P water model combined with an analytical correction for the standard state free energy provides higher overall accuracy. This observation might be transferable to other systems in which water-related contributions dominate the binding process.
Cucurbit[7]uril (CB[7]; Figure 1) is a synthetic
host that is attracting increasing interest for its ability to selectively
bind various neutral or positively charged aromatic guests and metal
complexes in aqueous solution.[1,2] CB[7] can achieve affinities
with some cationic guests that are greater than those typically measured
for protein–ligand complexes.[3,4] With their
recognition properties and ease of synthesis, members of the CB[n]
family have a wide range of potential applications including catalysis,
gas purification and waste-stream remediation, crystal engineering,
self-assembling and self-sorting systems, molecular machines, supramolecular
polymers, self-assembled monolayers, and gene transfection.[5] As a host for metal complexes, CB[7] has also
shown promise as a drug carrier for platinum chemotherapeutics such
as Oxaliplatin by improving stability and decreasing negative side
effects of the drug.[6] This auspicious molecular
carrier demonstrates low toxicity, efficient cellular internalization,
and delivered drug bioactivity nearly as effective as that of the
unbound drug in some cases,[7,8] suggesting that it could
be employed as a sophisticated drug delivery system.[9] In order to better utilize CB[7] in these applications,
it is important to understand its behavior within a water environment
and its molecular recognition of possible guests. Recent work on CB[7]
describes its molecular dynamics[10] and
affinity for several synthetic ferrocene and bicyclo[2.2.2]octane
based guest molecules (e.g., B2 shown in Figure 1c) both experimentally and computationally.[11−13] Even in seemingly
simple, small host–guest systems with relatively rigid and
symmetrical structures such as these (see Figure 1d), estimating the binding free energy can be quite challenging.
However, to our knowledge no study to date investigated the wetting/dewetting
events of the CB[7] system and the relevance of hydration in host–guest
binding.
Figure 1
Host and guest molecules studied. (a) Chemical representation of
glycoluril repeat unit constituting the host. (b) Side view of the
[n = 7] nonpolar host cucurbit[7]uril (CB[7]) structure.
(c) B2 nonpolar guest molecule. (d) Top view of CB[7]-B2 host–guest
complex in the structure used to initialize free energy calculations.
Host and guest molecules studied. (a) Chemical representation of
glycoluril repeat unit constituting the host. (b) Side view of the
[n = 7] nonpolar host cucurbit[7]uril (CB[7]) structure.
(c) B2 nonpolar guest molecule. (d) Top view of CB[7]-B2 host–guest
complex in the structure used to initialize free energy calculations.The importance of wetting/dewetting transitions
in receptor or
model host cavities has recently received much attention already.[14−18] One of the most important consequences of these studies was the
demonstration that noncovalent binding can largely depend on water-related
contributions, not—as sometimes assumed—primarily on
direct host–guest interaction. In particular, while the dehydration
of the ligand molecules was shown to be the driving factor for binding,
removal of solvent fluctuations can result in entropic penalties that
are sizable compared with the binding free energy.[14,16] However, our understanding of solvation effects and their thermodynamic
role in molecular recognition of more realistic systems remains quite
poor.[15,18−20]Here, we study
CB[7] hydration dynamics and its role upon binding
of B2, a recently designed bicyclo[2.2.2]octane apolar guest.[21] We investigate the dependence of hydration properties
using the popular TIP3P and TIP4P water models[22] with thermodynamic integration (TI) and explicit solvent
molecular dynamics (MD) simulation. Overall, the closest match of
estimated host–guest absolute binding free energy against experimental
data (within 0.3 kcal/mol) is achieved using the TIP4P water model.
However, both TIP3P and TIP4P water models lead to qualitatively similar
host hydration in the bound and unbound states, and along the unphysical
λ state intermediates. For the unphysical λ points at
which significant host wetting/dewetting transitions occur, we observe
remarkably large (∂V)/(∂λ) fluctuations
along the simulation time, ranging up to 140 kcal/mol and 130 kcal/mol
for TIP3P and TIP4P water models, with standard deviations up to 22
kcal/mol and 19 kcal/mol, respectively. Furthermore, these fluctuations
directly correlate with changes in host hydration, indicating that
wetting/dewetting effects play a major role in the host–guest
binding processes and, as a result, in determining free energy estimates.
The work presented herein underscores the importance of water-related
contributions and their convergence in free energy calculations of
noncovalent binding.
Computational Details
Molecular Models
Initial coordinates of the host–guest
complexes[21] were previously generated with
the Vdock[23] program from the CB[7] crystal
structure.[6,24] The general Amber force field (GAFF) was
used to describe all intramolecular energy contributions.[25,26] The partial charges of CB[7] and the bicyclo[2.2.2]octane (B2) were
calculated using the RESP program.[27,28] To ensure
symmetry, charge equivalence was enforced on each one of the seven
units of the cucurbituril host (CB[7]). The molecular electrostatic
potential was calculated at the HF/6-31G* level. Lennard-Jones parameters
for B2 and CB[7] were assigned as previously described by Moghaddam
et al.[12] These host–guest complexes
were solvated in cubic boxes with a buffer region of 12 Å.
Molecular Dynamics Simulations
All simulations were
performed using a modified version of AMBER[29,30] that enables the calculation of absolute binding free energies with
restraining and soft-core potentials. For both B2 and B2-CB[7], decoupling
of the guest partial charges (ele) occurred first at 11 λ values
(0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0). Decoupling
of the van der Waals (vdW) energy terms utilized 19 different λ
values (0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.65, 0.7, 0.75, 0.775,
0.8, 0.825, 0.85, 0.875, 0.9, 0.925, 0.95, 1.0). All simulations were
performed using rectangular periodic boundary conditions with isotropic
position scaling in the isothermic–isobaric (NPT) ensemble,
with a pressure reference of 1 atm and a relaxation time of 0.5 ps.
The system was kept at the reference temperature of 300 K using the
Langevin thermostat[31] (collision frequency
of 20 ps–1), and Newton’s equations of motion
were integrated using the leapfrog algorithm[32] with a MD time step of 0.001 ps. Using the sander Amber module, long-range electrostatics interactions were handled
with the particle-mesh Ewald (PME) procedure and long-range van der
Waals interactions approximated by a continuum model.[29] We refer the reader to ref (29) for additional computational details.
Thermodynamic Integration
The change in free energy
between two states, A and B, can be estimated using standard thermodynamic
integration (TI)[33] aswith V(λ) representing
the potential energy from a single trajectory of the system as a function
of the coupling parameter λ, and with ⟨...⟩ denoting
the cumulative backward average at the given λ point. The thermodynamic
perturbation in this work went from state A (λ = 0), in which
the guest (G) experiences full interactions with the host (H) and
the water environment, to state B (λ = 1), in which the guest-related
interaction energy vanishes. In order to improve phase-space sampling
and avoid free energy singularities, the soft-core potential of Zacharias
et al.[30,34] was employed for all guest atoms (scalpha = 0.5).[29,35] Equation 1 was integrated numerically using the trapezoidal rule.
Statistical Analysis of Uncertainties
A simulation
standard error σsim(t) of the time-varying
potential energy derivative at a given λ can be calculated aswith T being the total number
of block averages[36] throughout the single ith trajectory or all N concatenated independent
trajectories. ((∂V(λ))/(∂λ))λ denotes the potential
energy derivative, block-averaged at time t, and
⟨(∂V(λ))/(∂λ)⟩λ is the ensemble average over the entire simulation
time at a given λ. As an example, σsim(t) uncertainties are reported as error bars for ⟨(∂V(λ))/(∂λ)⟩λ vs λ in Figure 2 (vertical
bars). Then, a corresponding free-energy uncertainty can be obtained
asWe define a criterion, τsim, for convergence monitoring of all TI simulations and automated
decisions for termination of individual λ runs. This criterion
is based on the backward cumulative averages of the ((∂V(λ))/(∂λ))λ terms. This reads
Figure 2
Thermodynamic integration along the λ
coupling parameter.
vdW-decoupling TI curves of (a) guest and (b) host–guest system
simulations in TIP3P (red) and TIP4P (blue). ⟨∂V(λ)/∂λ⟩λ values and corresponding uncertainties, σ sim(t), are shown at each λ sampled.
Thermodynamic integration along the λ
coupling parameter.
vdW-decoupling TI curves of (a) guest and (b) host–guest system
simulations in TIP3P (red) and TIP4P (blue). ⟨∂V(λ)/∂λ⟩λ values and corresponding uncertainties, σ sim(t), are shown at each λ sampled.Examples of the variation of this quantity backward
in simulation
time are given in Figure 3 and Table SI-1. A criterion of τsim < 0.1
was employed to enforce proper convergence and automatically decide
when to terminate TI runs. The term proposed in eq 4 is a well-behaved quantity that tends to zero for increasingly
long trajectories.
Figure 3
Convergence monitoring of ⟨∂V(λ)/∂λ⟩λ, the potential energy derivative backward average,
across time with different criteria, τsim, during B2-CB[7] vdW
decoupling simulations where the highest ∂V/∂λ
fluctuations occur using either TIP3P (left) or TIP4P (right) water
models (λ = 0.8 and λ = 0.825, respectively).
Convergence monitoring of ⟨∂V(λ)/∂λ⟩λ, the potential energy derivative backward average,
across time with different criteria, τsim, during B2-CB[7] vdW
decoupling simulations where the highest ∂V/∂λ
fluctuations occur using either TIP3P (left) or TIP4P (right) water
models (λ = 0.8 and λ = 0.825, respectively).
Separation of Thermodynamic States
Corrections for
the standard state free energy[37,38] and separation of the
thermodynamic states of interest were obtained using a harmonic restraining
potential between the guest center of mass, rG, and the host center of mass, rH, to confine the guest to the region within the host cavity Vcavity, with a force constant kh of 2.5 kcal/mol. This Vcavity was either estimated empirically from a 10 ns MD trajectory, according
toor defined analytically as[39]for comparison. Equation 5 or 6 was then utilized to find the standard-state
free energyfor guest transfer from this restricted Vcavity volume to the bulk volume V° (as described in ref (40)).
Results and Discussion
In Figure 2, we include the following ⟨(∂V(λ))/(∂λ)⟩λ vs λ TI curves for the van der Waals decoupling
steps: (i) solvated guest vdW decoupled along λ in the unbound
state (Figure 2a) and (ii) the solvated guest
vdW decoupled along λ in the bound state (Figure 2b). Although ⟨(∂V)/(∂λ)⟩
at most λ values shows very good convergence (see also Supporting Information Figure SI-1), we observe
a region of λ space from 0.7 to 0.875 with noticeably higher
uncertainty and where convergence is considerably hindered within
the host–guest vdW decoupling simulations. This narrow region
is also where the ⟨(∂V(λ))/(∂λ)⟩λ vs λ TI curve changes most dramatically in the B2-CB[7] vdW
transformation (Figure 2b).In order
to observe the dependence of ⟨(∂V(λ))/(∂λ)⟩
on total simulation time, we use a convergence criterion based on
backward block-averaging. Figure 3 depicts
the simulation convergence monitoring used in this work (see Computational Details) at the λ points with
the largest statistical uncertainty for either TIP3P or TIP4P. These
results further confirm that the convergence of ⟨(∂V(λ))/(∂λ)⟩λ is considerably impeded at λ values within this
region of higher uncertainty (e.g λ = 0.8 for TIP3P and λ
= 0.825 for TIP4P). Interestingly, the shape of the TI curves for
the vdW decoupling simulations display remarkable dependence on the
water model employed (Figure 2). The large
dip in ⟨(∂V(λ))/(∂λ)⟩λ is noticeably
dissimilar in both width and location for the B2-CB[7] trajectories
(Figure 2b).Nevertheless, integration
of each CB[7]-B2 bound vdW decoupling
curve produces comparable differences in free energy contribution
for TIP3P and TIP4P (ΔGvdW of 11.2
and 12.3 kcal/mol, respectively). Similar results were also obtained
for the unbound B2 guest, with a ΔGvdW of −4.87 kcal/mol and −6.61 kcal/mol for TIP3P and
TIP4P, respectively (Table 1). Figure SI-1 displays the TI curves calculated from the electrostatic
decoupling steps. Not surprisingly, the electrostatic contribution
of free energy change between the two water models is even smaller
(ΔGele estimates for the unbound
state in TIP3P and TIP4P differ by 0.8 kcal/mol and those for bound
state by only 0.4 kcal/mol).
Table 1
Absolute Host–Guest Binding
Free Energy Estimatesa
correction
term for potential bias from host–guest restraint
ΔGcalcd° [kcal/mol]e
water model
system
ΔGele[kcal/mol]
ΔGvdW[kcal/mol]
empiricalb(avg)c
analyticald
empirical
analytical
ΔGexptl°[12][kcal/mol]
TIP3P
B2
–21.6
± 0.22
–4.87 ± 0.44
Volume [Å3]
–6.4 (0.076)
n/a
–12.0 ±0.9
–11.2 ±0.9
–13.4 ± 0.1
B2-CB[7]
–22.4 ± 0.23
11.2 ± 0.77
correction [kcal/mol]
–3.3 (−6.0)
–4.1
TIP4P
B2
–20.8 ± 0.33
–6.61 ± 0.60
Volume [Å3]
–6.2
(0.071)
n/a
–14.4 ± 1.0
–13.7
± 1.0
B2-CB[7]
–22.0
± 0.34
12.3 ± 0.82
correction [kcal/mol]
–3.3 (−6.0)
–4.1
Uncertainties were calculated as
in eq 3. See also Figure.SI-1 and Figure 2 for raw ΔGele and ΔGvdW data,
respectively.
From eq 5,
using maximum rG–rH center of mass (COM)-COM distance (n/a for B2 alone).
From eq 5 using
average rG–rH center of mass (COM)-COM distance (n/a for B2 alone).
From eq 6;
no restraint used for B2.
Absolute binding free energy calculation
derived from thermodynamic integration cycle (decoupled partial charges
and decoupled van der Waals terms) with correction for standard state:
(ΔGB2_ele + ΔGB2_vdW) – (ΔGB2-CB[7]_ele + ΔGB2-CB[7]_vdW + B2-CB[7] correction term for potential bias from restraint).
Uncertainties were calculated as
in eq 3. See also Figure.SI-1 and Figure 2 for raw ΔGele and ΔGvdW data,
respectively.From eq 5,
using maximum rG–rH center of mass (COM)-COM distance (n/a for B2 alone).From eq 5 using
average rG–rH center of mass (COM)-COM distance (n/a for B2 alone).From eq 6;
no restraint used for B2.Absolute binding free energy calculation
derived from thermodynamic integration cycle (decoupled partial charges
and decoupled van der Waals terms) with correction for standard state:
(ΔGB2_ele + ΔGB2_vdW) – (ΔGB2-CB[7]_ele + ΔGB2-CB[7]_vdW + B2-CB[7] correction term for potential bias from restraint).To calculate the final free energy values according
to eq 7, either an empirical or an analytical
correction
was used to take into account the standard state for ΔG. The empirical volumes were estimated from the total spherical
space of radius (rG–rH) sampled by the guest in the bound state simulations.
By using the maximum distance between the guest center of mass (COM)
and the host COM (rG–rH) in eq 5, the empirical correction
energy was estimated to be −3.3 kcal/mol for both TIP3P and
TIP4P (Table 1). If instead we use the average rG–rH during
sampling, we estimate a correction of −6.0 kcal/mol using either
water model (Table 1). Using eq 6, the calculated analytical restraining bias correction term
for both water models was −4.1 kcal/mol. We hope that the average rG–rH correction
term would converge (with further sampling in the host–guest
MD simulation) to the same value as the analytical correction term,
but we believe that the difference in free energy values here might
be attributable to underestimating the volume sampled by the guest
using this average value and overestimating it using the maximum rG–rH sampled.The absolute binding free energy estimate derived from simulations
in TIP4P (best estimate: −13.7 kcal/mol) is slightly closer
to the experimental value (−13.4 kcal/mol) than
from those in TIP3P (best estimate: −12.0 kcal/mol). Taken
as a whole, our calculations with TIP4P in combination with either
correction term and the parameters used for the CB[7]-B2 host–guest
system provide a more accurate estimate of absolute binding free energy.
Our results do suggest that the TIP4P water model may describe the
binding thermodynamics more accurately compared with the TIP3P model
by bringing our calculated value significantly closer to the experimental
value, but further studies would be necessary to confirm this.These results are in good agreement with a previous study that
showed that the TIP4P model describes water properties significantly
more accurately compared with the TIP3P water model.[41] Due to the important role of the solvation/desolvation
of the CB[7] and B2 during the binding event, we decided to further
investigate the relationship between slow simulation convergence and
host hydration with both water models.In Figures 4 and 5, we examine the average water
density surfaces throughout selected
trajectories with TIP3P and TIP4P. Because both host–guest
water model systems reach their ⟨(∂V(λ))/(∂λ)⟩λ minimum on the vdW TI curve at λ = 0.85 (Figure 2b), in Figure 4 we include
the average water density surfaces at this point in addition to those
for the unbound (λ = 1.0) and bound (λ = 0.0) cases, with
densities shown at ∼0.95 times bulk TIP3P and ∼1.1 times
bulk TIP4P. In Figure 5, we display higher
density water surfaces of the B2-bound case, with densities shown
at ∼1.3 times TIP3P bulk water density and ∼1.5 times
TIP4P bulk water density.
Figure 4
Average hydration surfaces of 50 ns vdW host–guest
simulations
for unbound (λ = 1.0; top) and bound (λ = 0.0; bottom)
thermodynamic states. Corresponding maps are also shown for the λ
= 0.85 intermediate unphysical state (middle). Hydration maps for
TIP3P (red wireframe, left of vertical slice) and TIP4P (blue wireframe,
right of vertical slice) water models were generated using the average
density of the water oxygen atom on a grid (0.3 Å resolution)
after superimposing MD snapshots
using all atoms of the host (shown as licorice representation here).
The surface isovalues represent water densities that are ∼1.1
times TIP4P bulk water density and ∼0.95 times TIP3P bulk water
density.
Figure 5
Average hydration surfaces (as in Figure 4) for bound host–guest vdW simulation (λ = 0.0)
thermodynamic
state with TIP3P (red wireframe, left side) and TIP4P (blue wireframe,
right side) water oxygen atom densities shown in and around the host
structure (licorice representation) from topview (left) and sideview
(right) for each at ∼1.5 times TIP4P bulk water density and
∼1.3 times TIP3P bulk water density.
Average hydration surfaces of 50 ns vdW host–guest
simulations
for unbound (λ = 1.0; top) and bound (λ = 0.0; bottom)
thermodynamic states. Corresponding maps are also shown for the λ
= 0.85 intermediate unphysical state (middle). Hydration maps for
TIP3P (red wireframe, left of vertical slice) and TIP4P (blue wireframe,
right of vertical slice) water models were generated using the average
density of the wateroxygen atom on a grid (0.3 Å resolution)
after superimposing MD snapshots
using all atoms of the host (shown as licorice representation here).
The surface isovalues represent water densities that are ∼1.1
times TIP4P bulk water density and ∼0.95 times TIP3P bulk water
density.Average hydration surfaces (as in Figure 4) for bound host–guest vdW simulation (λ = 0.0)
thermodynamic
state with TIP3P (red wireframe, left side) and TIP4P (blue wireframe,
right side) wateroxygen atom densities shown in and around the host
structure (licorice representation) from topview (left) and sideview
(right) for each at ∼1.5 times TIP4P bulk water density and
∼1.3 times TIP3P bulk water density.At λ = 1, the average water density from
our 50 ns simulation
was highest inside of the host. The average volume surfaces of the
TIP4P wateroxygen atoms are shown here as a few small clusters, around
and between the nitrogen atoms of the host, and three toroidal shapes
with the largest ring in the center of the host and two others near
the top and bottom (Figure 4), which is consistent
with other models of the host alone.[42] Interestingly,
at this same density value, TIP3P waters display average surfaces
that appear as more of a cylinder of ordered wateroxygen atoms inside
and more of a seven-petal flower shape outside the host but show some
similar density clusters near the host nitrogen atoms.Both
water models still show similar ordering near the host nitrogen
atoms at λ = 0.85, but we do observe some different volume density
patterns (Figure 4). TIP4P waters still form
ring-like average density surfaces, but the center toroid is smaller
and those on the outside of the host become more concave than in the
unbound (λ = 1) case. Here, we also observe an additional concave
flower-shaped surface of TIP3P waters at this density (as at λ
= 1) appearing on both ends of the host. However, while TIP3P water
molecules do occupy the host at this λ, the surfaces do not
appear inside the host at this high-density value, unlike the TIP4P
and unbound TIP3P cases.In the B2-bound host case (λ
= 0.0), again we see unique
surface patterns, even between the two water models. The TIP4P surface
almost completely covers the outside of the host, but TIP3P surfaces
only appear as large clusters on the outside of the host and as fuller
flower shapes near the host oxygen atoms (Figure 4). As expected though, no water densities are seen within
the host cavity because the guest vdW parameters remain fully coupled
in this case. Higher TIP4P density surfaces appear just as small clusters
circling the host and as two toroids, one at each end of the host,
while TIP3P surfaces form only small circular clusters along these
outside rings but do not form solid toroids there yet (Figure 5). As expected here, where the guest occupies the
host, still no water densities appear within its cavity. The unique
hydration maps observed for TIP4P and TIP3P models at the same, specific
density value are expected, owing to both the different mixed interactions
of these water models with the same host molecule as well as the well-known
structural disparities between the water models themselves.Figure 6 displays both the (∂V)/(∂λ) values (Figure 6a,c) and the number of waters inside the host (Figure 6 b,d) along simulation time for several λ
points. All waters within 4 Å of both the very top and bottom
atoms of the B2 core were measured at each frame of the 50 ns vdW
host–guest complex trajectories and compared to the (∂V)/(∂λ) measured at that time point. Clearly,
two distinctive energy extremes are represented by hydrated and dehydrated
states (Figure 6). Frequent fluctuations can
occur between the two hydration states when the guest vdW decoupling
is sufficiently high to allow for increased host hydration, but still
low enough that the host cannot remain stably hydrated throughout
the rest of the simulation. The (∂V)/(∂λ)
of the host–guest system at several of the higher λ’s
near 0.825 fluctuate between two very different hydration extremes
with periods of several nanoseconds during vdW TI simulations (Figure 6). The frequency of wetting/dewetting transitions
varies depending on the λ point considered, with highest frequencies
of up to 10–15 events occurring in under 10 ns of simulation.
This behavior is the major factor slowing free energy convergence.
The slowest simulations to converge were at λ = 0.8 for TIP3P
and at λ = 0.825 in TIP4P as these experienced the most frequent
and extreme host hydration fluctuations, and therefore energy fluctuations,
throughout their trajectories (Figure 6).
We note that apparent convergence would be misleadingly attributed
to the simulations considering MD periods shorter than the wetting/dewetting
transition time scales (see Figure 3 and Supporting Information Table SI-1 for an example).
Figure 6
Fluctuations
of (a,c) ∂V/∂λ
values and (b,d) number of water molecules in the CB[7] host system
along time for the TIP3P (red, a,b) and TIP4P (blue, c,d) water models
during 50 ns host–guest vdW simulations. The physical process
of host–guest binding goes from the unbound state (λ
= 1) to the bound state (λ = 0), while absolute binding free
energies were effectively calculated conveniently in the opposite
direction by decoupling of the ligand interactions (from λ =
0 to λ = 1).
Fluctuations
of (a,c) ∂V/∂λ
values and (b,d) number of water molecules in the CB[7] host system
along time for the TIP3P (red, a,b) and TIP4P (blue, c,d) water models
during 50 ns host–guest vdW simulations. The physical process
of host–guest binding goes from the unbound state (λ
= 1) to the bound state (λ = 0), while absolute binding free
energies were effectively calculated conveniently in the opposite
direction by decoupling of the ligand interactions (from λ =
0 to λ = 1).Because host hydration plays such a major role
in the B2-CB[7]
vdW decoupling simulations and free energy estimates, we include Table 2, which reveals some additional key differences
in the TIP3P and TIP4P host wetting events during our simulations
at λ’s where fluctuations occurred most. There is a striking
yet conceivable difference in both the percent of time that the host
is hydrated as well as the mean number of waters inside the host from
λ = 0.775 to 0.825 between the simulations using the two different
water models, with more TIP3P molecules occupying the host cavity
than TIP4P during these simulations. However, at the unbound (λ
= 1.0) and bound (λ = 0.0) states, the two models behave quite
similarly with the host stably hydrated or dehydrated throughout most
if not all of the simulation, respectively. Additionally, at λ
= 0.8 our TIP3P simulation shows up to seven waters inside the host,
while that of TIP4P fits only a maximum of six inside. The time to
reach the first maximum hydration event and the mean number of waters
inside the host are also included for each case. Overall, this analysis
does confirm the opposite behavior at the two λ end points though,
with a hydrated cavity at λ = 1 where the guest is fully decoupled
(Figures 4 and 6b),
representing the unbound state, and a dehydrated host cavity at λ
= 0 where the guest is fully coupled, representing the bound state
(Figures 4, 5, and 6b).
Table 2
Host Hydration During the End Point
and Unphysical States Employed for Thermodynamic Integrationa
host hydration features
water model
λ =
0.0
λ = 0.775
λ = 0.8
λ
= 0.825
λ = 0.85
λ = 0.875
λ
= 1.0
time hydrated
(%)
TIP3P
0
29.5
59.0
87.8
93.8
96.7
97.0
TIP4P
0
8.02
18.3
62.1
86.2
88.8
97.7
time to first max. hydration event
(ns)
TIP3P
n/a
6.47
35.0
45.7
10.2
6.88
12.7
TIP4P
n/a
21.0
3.40
2.26
2.02
2.00
1.44
mean no. of water molecules inside
the host
TIP3P
0
0.6±1
1.4±1
2.4±1
2.6±1
2.9±1
2.7±1
TIP4P
0
0.1±0.5
0.4±1
1.8±2
3.0±2
3.3±2
3.0±1
maximum no. of water molecules inside
the host
TIP3P
0
6
7
7
7
7
7
TIP4P
0
6
6
7
7
7
7
minimum no. of water molecules inside the host
TIP3P
0
0
0
0
0
0
0
TIP4P
0
0
0
0
0
0
0
Results are summarized for the λ
states at which largest wetting/dewetting transitions occur and compared
to end point states.
Results are summarized for the λ
states at which largest wetting/dewetting transitions occur and compared
to end point states.Figure 7 further confirms the
correlation
of system free energy and host hydration seen with both water models
(Figure 6, see also Supporting
Information). Here, we look at the ⟨∂V/∂λ⟩ vs the number of TIP4P or TIP3P
waters in the host cavity at λ = 0.85, a point where the host
is usually hydrated with up to seven waters but on average holds about
three waters (Table 2). Although the standard
deviations of (∂V)/(∂λ) are large,
there is a clear trend toward lower energy with an increasing number
of water molecules using both models, which is consistent across all
λ’s where the host cavity is capable of hydration (see
Figure SI-2 ). At the λ end point
states representing the unbound and bound host, we observe very different,
stable behavior and no such correlation (Figure 6 and Figure SI-2). The range of
(∂V)/(∂λ) values is smaller
(∼20 kcal/mol) at both end points, with the unbound case showing
a steadier ⟨∂V/∂λ⟩
at or near 1.2 kcal/mol, regardless of the number of waters inside
the host, and the bound case (where no waters ever fit inside the
host) averaging around 0.2 kcal/mol (Figure SI-2). Clearly, this system presents a fascinating yet challenging behavior
for TI absolute binding free energy calculations using explicit water
and demonstrates how important the role of wetting and dewetting events
is in the accuracy and convergence of free energy calculations.
Figure 7
Correlation
between ⟨∂V/∂λ⟩
values and number of waters within host for λ = 0.85 vdW simulation
with TIP3P and TIP4P water models. Vertical error bars are the standard
deviation values.
Correlation
between ⟨∂V/∂λ⟩
values and number of waters within host for λ = 0.85 vdW simulation
with TIP3P and TIP4P water models. Vertical error bars are the standard
deviation values.
Conclusions
Thermodynamic integration was used to calculate
the absolute binding
free energy of a high affinity host–guest system in explicit
solvent. Using both TIP3P and TIP4P water models, our estimations
correspond remarkably to experimental calculations. Although both
models proved to be adequate for our studies of this host–guest
system, our simulations in TIP4P did lead us to an absolute binding
free energy estimate gratifyingly closer to the experimental value.
We observe a toroidal average water density surface first appearing
inside the host in the unbound state (λ = 1.0) or ordered around
the top and bottom host oxygen molecules in the guest-bound state
(λ = 0.0), with fascinating patterns of intermediate water ordering
seen both inside and out of the host at varying λ values in
between. Significant changes in (∂V)/(∂λ)
occur along molecular dynamics simulations as the guest van der Waals
interactions are decoupled along λ space using both water models.
The dynamic behavior of host wetting and dewetting events is directly
correlated with these large energy fluctuations of the system. For
both water models, we see a decrease in (∂V)/(∂λ) with increasing water molecules inside the host
during van der Waals simulations with sufficient guest decoupling.
The work described here emphasizes the importance of wetting/dewetting
transitions and their influence on free energy estimation for the
host–guest system considered in this study. We believe that
the wetting transitions observed are relevant in more complex biological
scenarios as well, including protein–ligand binding.[18] However, it is also expected that a protein
cavity, which is accessible from the bulk from one side only, might
have quite largely different hydration properties than a host guest
system, such as CB[7], as the latter is open to bulk from two symmetric
openings. Our study is, to our knowledge, the first that addressed
these slow water transitions in a host–guest system.
Authors: Da Ma; Gaya Hettiarachchi; Duc Nguyen; Ben Zhang; James B Wittenberg; Peter Y Zavalij; Volker Briken; Lyle Isaacs Journal: Nat Chem Date: 2012-04-15 Impact factor: 24.427
Authors: Yunjie Zhao; Damian P Buck; David L Morris; Mohammad H Pourgholami; Anthony I Day; J Grant Collins Journal: Org Biomol Chem Date: 2008-11-06 Impact factor: 3.876
Authors: Andrea Rizzi; Travis Jensen; David R Slochower; Matteo Aldeghi; Vytautas Gapsys; Dimitris Ntekoumes; Stefano Bosisio; Michail Papadourakis; Niel M Henriksen; Bert L de Groot; Zoe Cournia; Alex Dickson; Julien Michel; Michael K Gilson; Michael R Shirts; David L Mobley; John D Chodera Journal: J Comput Aided Mol Des Date: 2020-01-27 Impact factor: 3.686
Authors: Andrea Rizzi; Steven Murkli; John N McNeill; Wei Yao; Matthew Sullivan; Michael K Gilson; Michael W Chiu; Lyle Isaacs; Bruce C Gibb; David L Mobley; John D Chodera Journal: J Comput Aided Mol Des Date: 2018-11-10 Impact factor: 3.686
Authors: Pei Meng Woi; Maizathul Akmam A Bakar; Ahmad Nazmi Rosli; Vannajan Sanghiran Lee; Mohd Rais Ahmad; Sharifuddin Zain; Yatimah Alias Journal: J Mol Model Date: 2014-04-27 Impact factor: 1.810
Authors: Lauren Wickstrom; Nanjie Deng; Peng He; Ahmet Mentes; Crystal Nguyen; Michael K Gilson; Tom Kurtzman; Emilio Gallicchio; Ronald M Levy Journal: J Mol Recognit Date: 2015-08-10 Impact factor: 2.137
Authors: Yiğitcan Eken; Prajay Patel; Thomas Díaz; Michael R Jones; Angela K Wilson Journal: J Comput Aided Mol Des Date: 2018-09-17 Impact factor: 3.686