Literature DB >> 26633245

Multiple Binding Poses in the Hydrophobic Cavity of Bee Odorant Binding Protein AmelOBP14.

Abstract

In the first step of olfaction, odorants are bound and solubilized by small globular odorant binding proteins (OBPs) which shuttle them to the membrane of a sensory neuron. Low ligand affinity and selectivity at this step enable the recognition of a wide range of chemicals. Honey bee Apis mellifera's OBP14 (AmelOBP14) binds different plant odorants in a largely hydrophobic cavity. In long molecular dynamics simulations in the presence and absence of ligand eugenol, we observe a highly dynamic C-terminal region which forms one side of the ligand-binding cavity, and the ligand drifts away from its crystallized orientation. Hamiltonian replica exchange simulations, allowing exchanges of conformations sampled by the real ligand with those sampled by a noninteracting dummy molecule and several intermediates, suggest an alternative, quite different ligand pose which is adopted immediately and which is stable in long simulations. Thermodynamic integration yields binding free energies which are in reasonable agreement with experimental data.

Entities: Chemical

Mesh：

Substances：

Year: 2015 PMID： 26633245 PMCID： PMC4695918 DOI： 10.1021/acs.jcim.5b00673

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Introduction

Insects depend strongly on olfactory cues both for communication and for orientation. A good understanding of their olfactory systems is important both in agriculture and in the control of insect-mediated diseases. Odorants are usually volatile and poorly soluble molecules. When they enter the aqueous lymph of the sensilla on an insect’s antennae they bind to odorant binding proteins (OBPs),[1,2] which shuttle them to the membrane-bound odorant receptors (ORs) of a sensory neuron. The signal is then relayed to the OR either directly by the ligand or by the ligand-bound OBP.[3] OBPs are small soluble proteins highly concentrated in the sensillar lymph.[1,2] Their role is not completely understood: In addition to shuttling the odorant and participation in signal relay, they might also serve to sequester and control the amount of free odorants in the extracellular space.[3] One group of OBPs are the pheromone binding proteins (PBPs), which are tuned to one specific ligand or a narrow range of related compounds. The so-called general OBPs (gOBPs) on the other hand are characterized by low ligand specificity and affinities in the micromolar range. They bind odorants in a combinatorial manner.[4,5] The same ligand can be bound—with different affinities—by more than one OBP of a species, and each OBP can bind several different ligands. In addition, multiple ligand binding poses have been proposed in some cases. For example, NMR data for ASP2 (antennal-specific protein 2) from the honey bee indicates that the hydrophobic ligand IBMP (2-isobutyl-3-methoxypyrazine) can bind in two alternative stable conformations.[6] Crystal structures of ASP1 suggest that its cavity can accommodate ligand 9-ODA (9-keto-2(E)-decenoic acid) in different orientations and in one or two copies or accompanied by additional ligands.[7] The structure of insect OBPs comprises six conserved α-helices. Typically three interlinked disulfide bonds (classical OBPs) make them very stable,[8] while also classes with only two (C-minus OBPs) and several more (plus-C OBPs) disulfide bridges exist. Changes in the protein conformation upon ligand binding have been found in pheromone binding proteins from the moth Bombyx mori (BmorPBP) and the honey bee Apis mellifera (AmelASP1).[7,9−11] In addition, an influence of pH on the structure of the C-terminus and thereby on the uptake or release of the ligand has been proposed.[7,11−18] OBP14 from Apis mellifera (AmelOBP14), one of 21 OBPs in the honey bee, is found in the larvae and in the mandibular glands of hive bees, as well as in the antennae.[19,20] Crystal structures show the 119 amino acids long protein folded into 7 α-helices[21] (Figure A). With five cysteines and two disulfide bridges, it belongs to the C-minus class of OBPs. Introduction of a third disulfide bridge as found in classical OBPs does not disturb the conformation at all,[21] while increasing the melting temperature by 10 °C.[22]

Figure 1

(A) Crystal structure of eugenol-bound AmelOBP14 (colored cartoon representation, PDB-ID 3S0E) superimposed on the apo crystal structure (gray cartoon representation, PDB-ID 3S0A). Internal cavities of the ligand-bound structure are shown as transparent gray surfaces, those of the apo structure are highly similar and omitted for clarity. The two disulfide bridges are shown in black. The N-terminus of the protein is in blue, the C-terminus in red. (B) The chemical structure of eugenol. (C) Close-up of eugenol in the binding cavity. A network of hydrogen bonds (yellow dotted lines) connects the ligand and a conserved water molecule (red sphere) to side chains in helix 2 (green), 6 (orange), and 7 (red). AmelOBP14 has been shown to bind semiochemicals (chemicals that convey information between organisms) like eugenol (4-allyl-2-methoxyphenol; Figure B) and other related phenyl compounds, but also linear molecules like citralva.[20,21] Crystal structures of AmelOBP14 have been published with three different ligands, which all fit into the largely hydrophobic and completely closed-off cavity without distorting it strongly.[21] Dissociation constants in the nano- to micromolar range have been determined experimentally for several ligands. Those that were found to bind more strongly positively affect protein thermal stability.[22,23] In addition, two forms of apo structures were reported, which do not differ much in terms of overall protein shape, neither from each other nor from the structures with a ligand bound.[21] In the current work, we set out to quantify the binding free energy of eugenol to a model of AmelOBP14. Eugenol is one of the biological ligands of AmelOBP14 with a comparably high affinity,[20,21,24] and one of the few for which a crystal structure is available. Initial simulations led us to believe that the binding orientation observed in the crystal structure is not the only stable conformation and we have performed extensive molecular dynamics simulations to investigate the binding orientations of eugenol to AmelOBP14. We study changes in the protein conformation and ligand orientation as well as the influence of active-site waters during several microseconds of MD simulations. The ligand binding free energy is computed using the double-decoupling method.[25,26] In this approach, the nonbonded interactions of the ligand are turned off in a number of discrete steps, typically described by a coupling parameter. We employ a Hamiltonian replica exchange setup to connect the various intermediate states. This approach has the added advantage that alternative conformations of the ligand, which are readily sampled when the compound is decoupled from the protein, are mixed into the simulations of the real ligand–protein complex in a physically relevant way. This way alternative binding poses that are not observed in regular MD may be sampled efficiently.[27] We propose that the protein conformation in solution is rather dynamic, and alternative binding orientations of eugenol compared to the X-ray structure may play a significant role in binding. Even though the spacious and largely hydrophobic cavity, moderate ligand binding affinities, and promiscuous binding make AmelOBP14 a challenging model for computational methods, the computed binding free energies are in reasonable agreement with experimental estimates.

Methods

We performed multiple MD simulations of AmelOBP14 in the presence and absence of its ligand eugenol. The influence of cavity water and an alternative ligand orientation was studied in further sets of simulations. Free energies were extracted from sets of simulations with varying ligand Hamiltonians, both without and with replica exchange (“RE” and “TI” simulations, respectively). All performed simulations involving the protein amount to a total of 16.66 μs of simulation time and are listed in Table .

Table 1

Overview of Performed Simulations of AmelOBP14

simulation name	#	ligand	length [ns]a	sim. engine	cavity waterb	ligand orientationc
EOL	1–4	EOL	4 × 200	gromos	+	crystal
	5–6	EOL	2 × 1000	gromacs	+	crystal
	7–8	EOL	2 × 850	gromacs	+	crystal
	9–12	EOL	4 × 200	gromacs	+	crystal
EOL-nw	1–4	EOL	4 × 200	gromos		crystal
EOL-re	1–2	EOL	2 × 200	gromos	+	RE
	3–4	EOL	2 × 1000	gromacs	+	RE
	5–6	EOL	2 × 700	gromacs	+	RE
apo	1–4		4 × 200	gromos	+	crystal
	5–6		2 × 1000	gromacs	+	crystal
	7–8		2 × 700	gromacs	+	crystal
RE	1–2	EOL	2 × 20 (× 16)	gromos	+	crystal
RE-nw	1–2	EOL	2 × 20 (× 16)	gromos		crystal
TI	1–2	EOL	2 × 20 (× 16)	gromos	+	crystal
TI-nw	1–2	EOL	2 × 20 (× 16)	gromos		crystal

The numbers in brackets indicate the replicas, i.e., parallel simulations with varying ligand Hamiltonian.

The + indicates that water molecules in addition to the one that is observable in the crystal structure were present in the cavity at the start of the simulations.

Ligand orientation at the start of the simulation, which was either the one from the crystal structure (“crystal”) or the dominant structure found in the RE simulations.

The numbers in brackets indicate the replicas, i.e., parallel simulations with varying ligand Hamiltonian. The + indicates that water molecules in addition to the one that is observable in the crystal structure were present in the cavity at the start of the simulations. Ligand orientation at the start of the simulation, which was either the one from the crystal structure (“crystal”) or the dominant structure found in the RE simulations. Initial MD simulations over 200 ns and replica exchange simulations were performed using the GROMOS11 biomolecular simulation package (http://www.gromos.net).[28] Longer MD simulations up to 1 μs were run with GROMACS 5.0.4.[29,30] Parameter set 54A8 of the GROMOS force field[31] was used in all cases. AmelOBP14 crystal structures in the apo form (PDB ID: 3S0A) and in complex with eugenol (PDB ID: 3S0E) were retrieved from the PDB data bank. Force field parameters for eugenol were derived by analogy to similar functional groups in the force field and are provided in the Supporting Information. Hydrogen atoms were added to the crystal structures according to geometric criteria followed by a short energy minimization using the steepest-descent algorithm. The proteins were placed in a rectangular periodic box and solvated with roughly 4700 simple point charge (SPC) water molecules.[32] The system was relaxed by a further steepest descent energy minimization with position restraints on the solute atoms, and three Na+ ions were added to neutralize the system. This corresponds to a concentration of about 30 mM and is in the range of experimental buffer concentrations.[21,24] For the “nw” simulations (“no water”), all water molecules inside the ligand-binding cavity, except the one which is observed in the crystal structure, were manually removed at this step. All production simulations were preceded by an equilibration period which started with initial random velocities generated from a Maxwell–Boltzmann distribution at 50 K. From there, the systems were heated up to 300 K in six steps of 20 ps, while concomitantly, position restraints on the solute atoms were reduced from 2.5 × 104 to 0.0 kJ mol–1 nm–2. At the end of this equilibration period, roto-translational constraints on the solute were introduced to keep its longest axis aligned with the longest box edge, and 300 ps of simulation were run before starting the production runs. The parallel runs of apo, EOL, and EOL-nw plain MD simulations started from equilibrated systems derived from different initial random velocities. For the plain MD EOL-re simulations, snapshots from the replica exchange (RE) simulations in which the ligand adopts an alternative pose (see below) served as starting configurations. All simulations were performed with a time step of 2 fs using the leapfrog algorithm. Temperature and pressure (1 atm) were maintained using the weak-coupling scheme with coupling times τ = 0.1 ps and τ = 0.5 ps (GROMOS) or 1.6 ps (GROMACS) and an estimated isothermal compressibility of 4.575 × 10–4 (kJ mol–1 nm–3)−1.[33] Bond lengths were constrained to their ideal values using the SHAKE algorithm[34] (GROMOS) or the LINCS algorithm[35] (GROMACS). Long-range electrostatic interactions beyond a cutoff of 1.4 nm were truncated and approximated via a generalized reaction field approach with a dielectric permittivity of 61.[36] RE simulations were started from the same equilibrated structures as the plain MD simulations, and the simulation parameters were kept the same except for the absence of pressure coupling and the fact that the ligand was coupled to a separate temperature bath. This was necessary because the ligand in the dummy state cools down dramatically due to the absence of interactions, and this low temperature is propagated via the replica exchanges into the real ligand state. Each RE simulation consisted of 16 parallel simulations (replicas) whose Hamiltonian is a combination of the Hamiltonian of the real ligand EOL and of the noninteracting dummy according to H(λ) = λ2HDUM + (1 – λ2)HEOL(λ), where λ is a coupling parameter ranging from 0 to 1. In that way, H(λ) at λ = 0 corresponds to the Hamiltonian of the real ligand, HEOL(0), and H(λ) at λ = 1 to the Hamiltonian of the dummy, HDUM. The λ-dependence of HEOL(λ) introduces the soft-core potential for all values of λ ≠ 0 to avoid singularities.[37] At regular time intervals (every 2 ps), exchanges of the Hamiltonian between neighboring replicas were attempted and accepted with a probability depending on the energy difference according to the Metropolis criterion. A simulation of the free ligand in a box of 1082 SPC water molecules was set up in the same way with 20 replicas. In the protein-bound RE simulations, a distance restraint preventing the decoupled ligand from leaving the cavity was also coupled to λ (force constant at λ = 0: 0 kJ mol–1; maximal force constant at λ = 1: 400 kJ mol–1). It was applied between the center of geometry of the ligand’s ring and the center of geometry of the Cα atoms of four carefully chosen protein residues (L6, I32, I68, and Q105), which falls into the center of the ligand binding cavity and robustly stays there during the simulations. We also performed thermodynamic integration simulations without replica exchange (“TI”-simulations in Table ) but with otherwise identical settings and lengths. Simulations at every λ value were started after a 100 ps equilibration period, the starting coordinates were taken from the end of the equilibration period of the preceding value of λ. Binding free energies were determined by the double decoupling method through the thermodynamic cycle depicted in Figure .[38,39] The free energy difference between the real and the dummy state (ΔGLIG→DUM) was calculated using thermodynamic integration according to eq .

Figure 2

Thermodynamic cycle to calculate the binding free energy (ΔGbind) using the double decoupling approach. ΔGLIG→DUMfree and ΔLIG→DUMbound are the free energies of decoupling the ligand from its surrounding in the free and bound states. ΔGrestr is the free energy of restraining a dummy ligand to a certain volume inside the protein using a harmonic distance restraint. The angular brackets indicate an ensemble average obtained at λ. was written out every 0.2 ps during the simulations. Statistical error estimates on the ensemble averages were calculated from block averaging[40] and were integrated numerically to yield the error values for the free energies. The free energy component due to the introduction of the distance restraint (ΔGrestr) was accounted for analytically.[26] Including a standard state correction, this becomeswhere V0 is the standard state volume (1.661 nm3), kB is the Boltzmann constant, T is the temperature, and K is the force constant of the harmonic distance restraint. Coordinate trajectories were written out every 2 ps and analyzed using tools from the GROMOS++ package.[41] Secondary structure elements were classified according to the DSSP rules.[42] If not stated otherwise, atom positional root-mean-square deviations (RMSDs) of backbone heavy atoms (Cα, N, C) were determined with respect to the energy minimized crystal structures after superposition of centers of mass and a rotational least-squares superposition.[43] Time traces of the number of water molecules were derived from the radial distribution function of water molecules within a distance of 0.7 nm from the center of the ligand binding cavity. Cluster analysis was performed using pairwise RMSDs after superposition of protein backbone atoms. Structures with RMSDs smaller than a defined cutoff of 0.2 nm are considered structural neighbors. The structure with the highest number of neighbors is the central member structure of the first cluster. After removing all structures belonging to the first cluster from the pool, the procedure is repeated until all structures are assigned to clusters.[44] Water density inside the cavity throughout all snapshots assigned to one cluster was determined using GROMOS++ program iondens with a grid spacing of 0.1 nm. Cavity volumes were calculated using trj_cavity.[45] Structures were visualized using Pymol (www.pymol.org; The PyMOL Molecular Graphics System, Version 1.4.1 Schroedinger, LLC).

Results and Discussion

Protein Structure and Stability

The protein conformation diverges from the crystal structure during the simulations. The atom-positional root-mean-square deviations with respect to the energy minimized crystal structure as well as the root-mean-square fluctuations for representative simulations are given in Figures S1 and S2 of the Supporting Information. To analyze the specific changes in the protein that lead to the divergence from the crystal structure, we did an analysis of conformational clusters,[44,46] where structures with a pairwise RMSD of the backbone atoms of less than 0.2 nm are clustered together. Convergence of the simulations was confirmed by monitoring the total number of distinct protein conformations observed during individual simulations with increasing simulation time.[47] The number of clusters is plotted in Figure S3, leveling off to approximately 10 clusters. Figure shows the overall population of the biggest clusters and the time series of their occurrence for all plain MD simulations together. The first cluster (in blue) was enforced around the crystal structure conformation, which is left sooner or later in most simulations, but also revisited (Figure B). The most instable substructure is the short C-terminal helix 7 which loses its α-helical conformation in all simulations. The long helix 1 is observed to bend toward or away from helix 2, thereby either making space for helix 7 or occupying space which was previously occupied by helix 7. A movement of the substructure composed of helix 2 and the adjacent loop region away from helix 1 is seen in some structures as well as a disruption of helix 2.

Figure 3

Analysis of conformational clusters obtained with a 0.2 nm backbone atom RMSD cutoff. (A) Overall occurrence and time series of the clustered conformations (clusters with an occurrence <1% were left out in the top bargraph). Cluster 0 (in blue) was enforced around the crystal structure. (B) Average structures of clusters 0–5 using the same color code as in A, overlaid on the crystal structure (black). The thickness of the tube representation corresponds to the local RMSF in all the snapshots assigned to the respective cluster. Some divergence from the initial conformation is not unexpected when going from the crystal structure environment to aqueous solution in the simulations. The C-terminal helix 7, which is the most unstable in our simulations, is engaged in crystal contacts in the X-ray structure, which could have a stabilizing effect. In other crystal structures of insect OBPs with a C-terminus of similar length, it does not form a helix but is extended along the protein surface or entering the cavity, examples are Drosophila melanogaster LUSH (PDB-ID: 1OOF),[48]Culex quinquefasciatus CquiOBP1 (3OGN),[49]Anopheles gambiae AgamOBP4 (3Q8I),[50] and AgamOBP20 (4F7F).[51] In moth pheromone binding protein BmorPBP, a pH dependent switch has been proposed, where the C-terminus enters the cavity and forms a helix under acidic conditions.[12,13] However, the conformation seems to be influenced also by other factors, as later the protein was crystallized in the same conformation in a ligand-free state at neutral pH.[10,52] Multiple conformations are also observed in the C-terminus of AmelASP1; depending on pH and the presence of the ligand, it is found more or less deep in the cavity or part of a domain-swapped dimer.[7,11] The studies on BmorPBP and AmelASP1 suggest that those OBPs change conformation upon ligand binding. AmelOBP14, on the other hand, has been crystallized in the apo-form and with several ligands without big changes to overall protein conformation or the conformation of the C-terminus[21] (PDB-ID 3S0A vs 3S0E backbone RMSD: 0.07 nm, overall RMSD: 0.13 nm). In our simulations, however, there are significant differences between the apo and ligand-bound form. When no ligand is present, the cavity closes very quickly, which is obvious not only in the first nanoseconds of all apo simulations but also when the ligand leaves the cavity as is observed in simulation EOL2 and intermittently also in EOL7 at around 500 ns (Figure ).

Figure 4

Time series of the volume of the largest cavity detected by trj_cavity[45] in selected plain MD simulations of AmelOBP14 with and without ligand EOL. Because cavity volumes fluctuate strongly between single snapshots, the data have been moving average smoothed with a window size of 20. The apoprotein has been crystallized in two forms (in different crystallization conditions and with different space groups), without big differences in conformation. In the structure we used as a starting point for our apo simulations (PDB-ID: 3S0A), the cavity is similar in size as the one in the EOL-bound structure (PDB-ID: 3S0E). Also in the second apo crystal structure (PDB-ID: 3S0F), it is only slightly smaller. The deposited density maps show electron density inside the cavity of both apo structures, which was interpreted as water molecules. However, the density in the 3S0A-apo structure resembles very closely the density of ligand citralva (Figure A and B), and the one in the 3S0F-apo structure can also be interpreted as a five-membered ring like imidazole (Figure C), which was part of the crystallization buffer. The presence of these molecules in the cavity of the apo crystal structures would explain our observations in the apo simulations. Apo-OBP14 might be more different from the ligand-bound form than suggested by the crystal structures. The closure of the active site may be explained by the loss of active site water as outlined in the next section.

Figure 5

Electron density inside the cavity of AmelOBP14 crystal structures. (A) apo crystal form 1 (PDB-ID: 3S0A), (B) with ligand citralva (PDB-ID: 3S0D), (C) apo crystal form 2 (PDB-ID: 3S0F), and (D) with ligand eugenol (PDB-ID: 3S0E). Suggested water molecules are shown as red spheres. 2Fo–Fc maps (1σ) are shown in blue; Fo–Fc maps (3/–3σ) are shown in green and red, respectively.

Active Site Water

For a long time, the role of water molecules in the active site of protein structures has been underestimated in efforts to describe protein dynamics and protein–ligand interactions computationally.[39,53] The amount of water molecules in an active site can be readily monitored from molecular simulations.[46,54] In the EOL-bound X-ray structure, only one water molecule inside the protein cavity is so stably bound that it is observable in the electron density of the crystal structure. It connects the hydroxy group of EOL to M113:N, V31:O, and S108:O in a network of hydrogen bonds (Figure C). This crystal water is also present in the X-ray structure of the apo protein, where also six additional water molecules were assigned to the electron density. During the solvation of the protein in the simulation box, further water molecules are added in the empty space of the cavity, leading to a total of four water molecules when EOL is present and about 15 when there is no ligand. During the first 10–20 ns of the simulations started from these systems, most of the water molecules leave, and on average only one remains (Figure ). This is not unexpected after the observed rapid decrease in cavity volume in the apo simulations. Considering the predominantly hydrophobic nature of the cavity and ligand, it makes sense that water molecules relocate into the more favorable environment in the bulk solution.

Figure 6

(A) Number of water molecules in the cavity during long MD simulations in the presence (top: EOL5–8, middle: EOL-re3–6) and absence (bottom, apo5–8) of the ligand. Each panel shows the time series for four independent simulations in different shades of the same color. (B) Histogram depiction of the number of water molecules in the cavity over all EOL, EOL-re, and apo simulations, respectively. The water molecules that stay inside the cavity longest are observed close to the position of the crystal water, in a kind of pocket underneath the flap which contains helix 2 (green in Figure ), close to the kink between the ligand and helix 6 and 7 (orange and red). Also, later in the long simulations, when for short periods water molecules reenter the cavity (peaks in Figure A), usually it is in this location. In an additional set of four simulations (“nw”-simulations) started with no other water molecules in the cavity except for the one observed in the crystal structure, water molecules enter in some cases for short periods of time or also the last water molecule leaves. All in all, the picture is very comparable to the simulations started with a water-filled cavity, indicating that an equilibrium is reached in this respect. Comparing the EOL-bound simulations with the apo simulations, it appears that the apo structure has less affinity for active site water. If anything, the EOL tends to recruit water molecules rather than there being a stable active site water as suggested by the X-ray structures. In the replica exchange simulations, water molecules leave most rapidly in the intermediate replicas, where the ligand-surrounding interactions are tuned down (Figure ). When the interactions are completely off (“dummy ligand”), water tends to linger a little longer, while in the replica with the strongest interactions (toward the “real ligand”) two to three water molecules remain until the end of the 20 ns. In all plain MD simulations with EOL, only one water is left after the same amount of time (Figure ). Those water molecules that stay in the RE simulations are found in the same pocket between the ligand and helix 2 as in the plain MD simulations.

Figure 7

Number of water molecules in the cavity during RE simulations. In a color gradient from green to gray, the data for the 16 replica is shown. The top and bottom panels show data for two independent sets of simulations (RE1 and RE2). The water molecules that stay in the real ligand state might be trapped in between EOL and protein, while in the dummy state there is no obstacle that prevents them from leaving. However, exchanges between the replica runs are observed reasonably often (the lowest switching efficiency is between λ = 0.6 and 0.7 with about 10%, all other steps usually switch in 20% to 100% of the cases), so it seems surprising that the trapped water molecules are not propagated to other runs and then able to leave. It seems that water molecules leave the fastest in the intermediate states (light gray in Figure ), which might be a result of the decreased interactions with the ligand, while it still takes up space in the cavity, which it does less and less as the interactions are disappearing completely. In the two RE simulations started without cavity water—except for the one observed in the crystal structure—the number of water molecules stays around 1 for the whole 20 ns (data not shown).

Ligand Poses

To get a view of the ligand behavior during the simulations, we clustered ligand conformations within an RMSD cutoff of 0.2 nm after superimposing the protein backbone of the structures. Figures and 9 show the overall population of the orientational clusters, the time series of their occurrence, and representative structures, color-coded accordingly.

Figure 8

Figure 9

Central member structures of ligand conformational clusters using the same color code as in Figure . Colored spheres indicate grid points which are occupied by water molecules in more than 15% of snapshots assigned to the corresponding cluster (the grid points are rainbow-colored with a minimum (blue) of 15 and a maximum (red) of 70%). In all snapshots, the protein has been brought into the same orientation by superimposing the backbone atoms on the AmelOBP14 crystal structure. Hydrogen bonds which occur in more than 30% of the snapshots of a cluster are indicated by yellow dashed lines. A superposition and additional views of the first three clusters are provided in Figure S4 in the Supporting Information.

Analysis of ligand conformational clusters obtained with an RMSD cutoff of 0.2 nm for the heavy atoms of the ligand when atoms of the protein backbone are superimposed. Top: overall occurrence of the conformations represented by the clusters. Bottom: time series of their occurrence. Cluster 0 (in blue) was enforced around the crystal structure. The corresponding structure snapshots are found in Figure . Central member structures of ligand conformational clusters using the same color code as in Figure . Colored spheres indicate grid points which are occupied by water molecules in more than 15% of snapshots assigned to the corresponding cluster (the grid points are rainbow-colored with a minimum (blue) of 15 and a maximum (red) of 70%). In all snapshots, the protein has been brought into the same orientation by superimposing the backbone atoms on the AmelOBP14 crystal structure. Hydrogen bonds which occur in more than 30% of the snapshots of a cluster are indicated by yellow dashed lines. A superposition and additional views of the first three clusters are provided in Figure S4 in the Supporting Information. In the EOL-bound crystal structure, the electron density defines the position and orientation of the ligand very well (Figure D). The hydroxy group of EOL forms hydrogen bonds to S108:OG (helix 6) and L111:O (helix 7) and a water molecule (Figure C). Together with the adjacent backbone of T112, these are the only hydrophilic groups inside the cavity of the AmelOBP14 crystal structure. The hydrophobic tail of EOL points to the center of the hydrophobic cavity (Figure , cluster 0). In our simulations, the ligand shows extensive flexibility and freedom to adopt different poses (Figure ). In many of them, the hydroxy group still forms hydrogen bonds to the protein, most commonly to residues in helix 7, but also helix 2 and helix 1. Several of the conformations where the ligand moves toward the space that is occupied by helix 7 (red in the figures) in the crystal structure are clearly only accessible because of the conformational changes and flexibility of this helix. An interdependence between the ligand conformations and the conformation of the protein, in particular helix 7, which provides H-bond donors and acceptors, would therefore not be surprising. While there is no obvious correlation between the changes of the ligand conformation and the loss of α-helical conformation of helix 7 or the protein backbone RMSD or backbone conformational clusters in general, such a correlation is observed by the concurrence of the protein backbone conformation of cluster 2 in Figure and the ligand conformational cluster 3 (Figure ). While the cavity is completely closed-off in the crystal structure, the ligand seems close to leaving the cavity in several cases. It leaves for good only in simulation EOL2 (after about 140 ns), but also in simulation EOL7 it is found outside the cavity for short periods of time after which it returns to conformations visualized by cluster 4 or 7 in Figure . A comparison of the 200 ns EOL and EOL-nw simulations suggests that the presence or absence of water molecules in the cavity at the start of the simulations has an influence on the stability of the most crystal-like ligand orientation. In the nw-simulations, it is retained much longer than in the simulations started with cavity water even though the number of cavity waters is the same in the two sets of simulations after the first 20 ns (Figure ). One possible explanation might be that when the water molecules pass the ligand on their way out of the cavity, they give it some impulse to move away from the crystal conformation. Possibly without this kind of impulse, in the simulations which are started without cavity water, the ligand is more inert and stays in the original orientation for a longer time. However, with time the ligand samples similar conformations in both cases. We also clustered ligand conformations in the RE simulations in the same manner as described for the plain MDs, considering only snapshots from the first replica, which has the full ligand-surrounding interactions. We obtain a different set of conformations than for the long MD simulations which were started from the crystal structure conformation. The by far most favored conformation in the two RE simulations started with cavity water is not similar to any sampled conformations in the MDs (“RE-conformation,” cluster 2 in Figure ). Its hydroxy group is still interacting with hydrophilic groups in helix 7, but the ring has flipped so that the methoxy group is turned down toward helix 2 (green helix in Figure ) and toward the additional 1–2 water molecules trapped in the cavity in the simulations started with water in the cavity. In the RE-nw simulations, the ligand is at first predominantly oriented in a similar way to that in the major cluster observed in the plain MD simulations (Figure ). However, also here the RE conformation is observed and becomes more prominent with time. In all the RE simulations, the ligand moves away remarkably quickly (within 1 ns) from the initial orientation, which is the one found in the crystal structure. When plain MD simulations are started with a ligand in the orientation of the most populated cluster of the RE simulations (i.e., like cluster 2 in Figure ), the ligand is stable in this orientation for hundreds of nanoseconds (Figure , EOL-re simulations). Only in two of the longer simulations, the ring flips to move the methoxy group up and orient itself more like in the crystal structure. The alternative pose observed in the RE simulations seems at least as stable in regular MD as the crystal structure pose which is abandoned frequently for alternative binding poses. We conclude that multiple binding modes of eugenol should be taken into account. Coordinates of the central member structures of the first few clusters are available in the Supporting Information.

Free Energy of Binding

By gradually turning off the interactions of the ligand with its surroundings in RE simulations of the unbound and protein-bound state (double decoupling), we can derive the binding free energy of EOL in the AmelOBP14 cavity using the thermodynamic cycle depicted in Figure . For the free ligand, ΔGEOL→DUM converges very quickly (Figure S5 in the Supporting Information), and the simulation was not prolonged beyond 5 ns. In the protein-bound state, convergence is much slower, as seen in the drift of the ΔGEOL→DUM values in Figure . Convergence seems to be reached faster in the simulations without the additional water in the cavity: within 10 ns, both RE-nw simulations reach values of ΔGEOL→DUM within each other’s error bounds and do not change drastically anymore. In the simulations with cavity water, fluctuations in ΔGEOL→DUM are stronger and a slight drift remains for simulation RE1.

Figure 10

Cumulative timeseries of ΔGEOL→DUM showing its convergence with time in four independent RE simulations.

Cumulative timeseries of ΔGEOL→DUM showing its convergence with time in four independent RE simulations. All simulations converge to reasonably similar values for the binding free energies which are in the range of experimental data (Table ). Binding affinities are available from fluorescence measurements following the replacement of a fluorescent probe[20,21] and electrical measurements on a reduced graphene oxide (rGO) transistor device.[24] Our data tend more toward the latter, more directly measured values.

Table 2

Comparison of Binding Free Energies from RE Simulations and Experimental Measurementsa

	ΔG_EOL→DUM	ΔG_restraint	ΔG_bind
free
EOL	52.3 ± 0.7
protein-bound
RE1	95.9 ± 1.6	–13.4	–30.2
RE2	99.0 ± 1.9	–13.4	–33.2
RE-nw1	94.4 ± 1.6	–13.4	–28.7
RE-nw2	95.3 ± 1.2	–13.4	–29.6
experimental
rGO transistor[24]			–25.3
fluorescence replacement[21]			–40.5

All free energies are in kJ mol–1.

All free energies are in kJ mol–1. We also calculated free energies in the same manner from sets of simulations without replica exchange (“TI”-simulations). There is a bigger spread in the data, errors on the binding free energy are larger, and the resulting curves for thermodynamic integration are less smooth (Figure S6 and Table S1 in the Supporting Information). Likely, this is a result of sudden changes in preferred ligand orientation when going from one intermediate simulation to the next, which can be observed in time series of the ligand root-mean-square deviation (Figure S7 in the Supporting Information).

Conclusion

The main advantage of the double decoupling approach with Hamiltonian replica exchange MD is the possibility to overcome barriers in the replicas with decreased ligand-surrounding interactions, yielding more varied ligand poses which can then be propagated back into the real ligand state via replica exchange. The ligand conformation observed in the AmelOBP14 crystal structure is not particularly stable in our simulations. On the one hand, in the plain MD simulations, the ligand drifts away from the original orientation to a variety of different poses, which can even lead to ligand release. This is not surprising, considering that the ligand forms hydrogen bonds to the C-terminal helix 7, which is highly unstable, and it is close to other parts of the protein which undergo larger changes during long simulations. On the other hand, in the RE simulations, the ligand in the real state immediately finds a very stable orientation where a flipping of the ring has moved the methoxy group to the other side of the cavity, while the hydroxy group still keeps its hydrogen bond. The fact that this ligand pose is never observed in plain MD simulations started from the crystal structure but is quite stable if the simulations are started from this new ligand orientation indicates that the two conformations are separated by a considerable energy barrier. The movement of the ligand to more distant positions with alternative H-bonding partners along with changes in the protein in the long simulations starting from the crystal structure might furthermore reduce the probability of the ligand to find a way to this newly identified orientation, so that we do not observe it even during extensive simulation times. In the RE simulations, the ligand overcomes the barrier very quickly, while its environment is bound to stay more similar because of the intrinsically shorter time scales. Our results suggest that a ligand orientation different from the crystal structure is favorable in the solvated protein–ligand complex, which is not easily accessible without enhanced sampling techniques. An experimental validation of our predictions would be possible through elaborate NMR experiments. OBPs are not highly selective and, like A. mellifera OBP14, many can fit a range of different ligands into their cavities.[8] The spacious hydrophobic cavity with a hydrophilic anchor only on one side seems well suited to accommodate multiple possible binding poses. As observed previously for other promiscuous proteins, a versatile binding site allows for not only the accommodation of various ligands but also that of different binding poses.[55,56] The crystal structure of AmelOBP14 shows very well-defined electron density for eugenol in the cavity (Figure D). However, in the absence of conformational restrictions induced by crystal contacts in helix 7, the ligand might be free to adopt a much wider range of different orientations within the cavity.

45 in total

1. NMR characterization of a pH-dependent equilibrium between two folded solution conformations of the pheromone-binding protein from Bombyx mori.

Authors: F Damberger; L Nikonova; R Horst; G Peng; W S Leal; K Wüthrich
Journal: Protein Sci Date: 2000-05 Impact factor: 6.725

2. Simulations of apo and holo-fatty acid binding protein: structure and dynamics of protein, ligand and internal water.

Authors: Dirk Bakowies; Wilfred F van Gunsteren
Journal: J Mol Biol Date: 2002-01-25 Impact factor: 5.469

3. Efficient Characterization of Protein Cavities within Molecular Simulation Trajectories: trj_cavity.

Authors: Teresa Paramo; Alexandra East; Diana Garzón; Martin B Ulmschneider; Peter J Bond
Journal: J Chem Theory Comput Date: 2014-05-13 Impact factor: 6.006

Review 4. The role of water molecules in computational drug design.

Authors: Stephanie B A de Beer; Nico P E Vermeulen; Chris Oostenbrink
Journal: Curr Top Med Chem Date: 2010 Impact factor: 3.295

5. A novel mechanism of ligand binding and release in the odorant binding protein 20 from the malaria mosquito Anopheles gambiae.

Authors: Brian P Ziemba; Emma J Murphy; Hannah T Edlin; David N M Jones
Journal: Protein Sci Date: 2012-11-29 Impact factor: 6.725

6. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit.

Authors: Sander Pronk; Szilárd Páll; Roland Schulz; Per Larsson; Pär Bjelkmar; Rossen Apostolov; Michael R Shirts; Jeremy C Smith; Peter M Kasson; David van der Spoel; Berk Hess; Erik Lindahl
Journal: Bioinformatics Date: 2013-02-13 Impact factor: 6.937

7. Selective and pH-dependent binding of a moth pheromone to a pheromone-binding protein.

Authors: Walter S Leal; Angela M Chen; Melissa L Erickson
Journal: J Chem Ecol Date: 2005-09-28 Impact factor: 2.626

8. Structure of a specific alcohol-binding site defined by the odorant binding protein LUSH from Drosophila melanogaster.

Authors: Schoen W Kruse; Rui Zhao; Dean P Smith; David N M Jones
Journal: Nat Struct Biol Date: 2003-07-27

9. Pheromone binding and inactivation by moth antennae.

Authors: R G Vogt; L M Riddiford
Journal: Nature Date: 1981 Sep 10-16 Impact factor: 49.962

10. Queen bee pheromone binding protein pH-induced domain swapping favors pheromone release.

Authors: Marion E Pesenti; Silvia Spinelli; Valérie Bezirard; Loïc Briand; Jean-Claude Pernollet; Valérie Campanacci; Mariella Tegoni; Christian Cambillau
Journal: J Mol Biol Date: 2009-05-28 Impact factor: 5.469

4 in total

1. Eugenol prevents amyloid formation of proteins and inhibits amyloid-induced hemolysis.

Authors: Kriti Dubey; Bibin G Anand; Dolat Singh Shekhawat; Karunakar Kar
Journal: Sci Rep Date: 2017-02-01 Impact factor: 4.379

2. Larvicidal activity of novel anthraquinone analogues and their molecular docking studies.

Authors: Keerthana Selvaraj; Daoud Ali; Saud Alarifi; Sathish Kumar Chidambaram; Surendrakumar Radhakrishnan; Idhayadhulla Akbar
Journal: Saudi J Biol Sci Date: 2020-09-21 Impact factor: 4.219

3. Tyrosinase-mediated synthesis of larvicidal active 1,5-diphenyl pent-4-en-1-one derivatives against Culex quinquefasciatus and investigation of their ichthyotoxicity.

Authors: SathishKumar Chidambaram; Daoud Ali; Saud Alarifi; Raman Gurusamy; SurendraKumar Radhakrishnan; Idhayadhulla Akbar
Journal: Sci Rep Date: 2021-10-20 Impact factor: 4.379

4. Deciphering the Odorant Binding, Activation, and Discrimination Mechanism of Dhelobp21 from Dastarus Helophoroides.

Authors: Guang-Qiang Yu; Dong-Zhen Li; Yu-Lin Lu; Ya-Qi Wang; De-Xin Kong; Man-Qun Wang
Journal: Sci Rep Date: 2018-09-10 Impact factor: 4.379

4 in total