Literature DB >> 22050600

Induced fit or conformational selection? The role of the semi-closed state in the maltose binding protein.

Denis Bucher¹, Barry J Grant, J Andrew McCammon.

Abstract

A full characterization of the thermodynamic forces underlying ligand-associated conformational changes in proteins is essential for understanding and manipulating diverse biological processes, including transport, signaling, and enzymatic activity. Recent experiments on the maltose binding protein (MBP) have provided valuable data about the different conformational states implicated in the ligand recognition process; however, a complete picture of the accessible pathways and the associated changes in free energy remains elusive. Here we describe results from advanced accelerated molecular dynamics (aMD) simulations, coupled with adaptively biased force (ABF) and thermodynamic integration (TI) free energy methods. The combination of approaches allows us to track the ligand recognition process on the microsecond time scale and provides a detailed characterization of the protein's dynamic and the relative energy of stable states. We find that an induced-fit (IF) mechanism is most likely and that a mechanism involving both a conformational selection (CS) step and an IF step is also possible. The complete recognition process is best viewed as a "Pac Man" type action where the ligand is initially localized to one domain and naturally occurring hinge-bending vibrations in the protein are able to assist the recognition process by increasing the chances of a favorable encounter with side chains on the other domain, leading to a population shift. This interpretation is consistent with experiments and provides new insight into the complex recognition mechanism. The methods employed here are able to describe IF and CS effects and provide formally rigorous means of computing free energy changes. As such, they are superior to conventional MD and flexible docking alone and hold great promise for future development and applications to drug discovery.

Entities: Chemical Disease Gene Species

Mesh：

Substances：

Year: 2011 PMID： 22050600 PMCID： PMC3226325 DOI： 10.1021/bi201481a

Source DB: PubMed Journal: Biochemistry ISSN： 0006-2960 Impact factor: 3.162

Interactions between proteins and small molecules are central to life-sustaining biochemical processes and the action of many important therapeutic agents. However, the detailed mechanisms of these processes are often poorly understood and can involve a variety of motions on a wide range of time scales.[1,2] For instance, many large proteins are built from multiple domains with ligand binding sites localized at the interdomain clefts. The presence of bound substrates often stabilizes a closed conformation of the cleft, while the absence of a ligand favors an open conformation. These conformational transitions play critical roles in the ligand recognition processes and are essential for diverse functions, including transport, signaling, and enzymatic activity.(3) Motions between stable protein states can expand or contract a binding site, yielding distinct chemical interactions with ligands.[4−8] This inherent protein flexibility presents an important complication for computer-based approaches in drug discovery, as it is effectively harder to hit a moving target.[9,10] Conceptually, the realization that proteins can undergo conformational changes during or after ligand binding has led to the replacement of the traditional view of proteins as rigid objects in the so-called “lock-and-key” model(11) and to the development of the “induced-fit” (IF)(12) and “conformational selection” (CS)[4,13,14] models of molecular recognition (Figure 1). Both IF and CS mechanisms incorporate the idea that proteins are free to explore energy landscapes that often display multiple stable conformational states in equilibrium.[15−17] The CS model describes a scenario in which the unbound protein preexists in an ensemble of conformations. The IF model describes structural plasticity in the protein that is mainly ligand-induced. However, it is important to note that these models are not mutually exclusive. Indeed, a survey of the recent literature indicates that many recognition processes can also involve some elements of both CS and IF mechanisms.[5,18−20] Improving descriptions of both IF and CS effects in computer-based drug discovery is a particularly important current challenge.[6,21,22]

Figure 1

Three possible mechanisms for a protein–ligand association coupled to a conformational change. (A) The apoprotein is sufficiently flexible to exhibit open-to-closed motions, and the role of the ligand is to stabilize the closed form (CS model). (B) The unliganded protein is unable to reach the closed form, and structural plasticity is ligand-dependent (IF model). (C) A mixture of both CS and IF steps is required to reach the closed conformation. Theoretical studies of bacterial periplasmic binding proteins (PBPs) can provide valuable information about the key roles of conformational changes occurring during ligand recognition. PBPs are expressed by Gram-negative bacteria and function to transport a variety of nutrient molecules, such as amino acids, ions, vitamins, and carbohydrates.(23) Upon ligand binding, the two domains of PBPs are able to undergo hinge-bending motions to sequester ligands inside the interdomain cleft. This action has been dubbed a “Venus-flytrap mechanism”, because of its resemblance to the traps on the carnivorous plant that close only when stimulated by prey.(24) Similar folds exist in a number of important human drug targets, including glutamate receptors.(25) In addition, modified PBPs are being researched as glucose nanosensors that could benefit more than 100 million diabetes patients worldwide.(26) Many crystallographic structures of PBPs have been determined and provide a rich source of information about their different stable conformational states. However, static X-ray structures do not always provide information about the high-energy conformers that can be used for ligand recognition. For this reason, the detailed recognition mechanism is often unknown, and different approaches have been used to complement crystallographic studies and provide descriptions of the protein’s intrinsic flexibility, such as nuclear magnetic resonance (NMR) spectroscopy,[7,27] fluorescence,(28) and molecular dynamics (MD) simulations, as reviewed in recent publications.[8,29,30] Previously, we have studied the hinge-bending motion of the maltose binding protein (MBP) in atomic detail.(8) MBP is part of the maltose/maltodextrin system of Escherichia coli, which is responsible for the uptake and efficient catabolism of maltodextrins. Our simulation study of apo MBP showed the existence of two stable apo states: the open (O) and semi-closed (S) states (Figure 2). Paramagnetic relaxation enhancement rates were back-calculated from the simulations and found to be consistent with previous NMR experiments.(7) Furthermore, a similar structure for the S state was later produced by Kondo et al, using a different force field,(31) and both structures were found to be comparable to a published NMR structure [Protein Data Bank (PDB) entry 2H25],(32) with a root-mean-square deviation (rmsd) of <2 Å. In addition, the study helped uncover the existence of a motion in the balancing interface region that is coupled to the O → S transition. However, the role of the S state during the ligand recognition process remains unknown, and its study could lead to a deeper understanding of recognition processes, as several other PBPs also appear to display partially closed states, such as LAO-BP(31) and RBP.(29)

Figure 2

Side and top views of MBP, showing the size and shape of the binding site that is modulated by the conformational state of the protein. Red indicates that the residues are close to the center of mass of the protein, where the binding site is located. The two domains (the N-terminal domain, NTD, and the C-terminal domain, CTD) are connected via a short helix and a two-stranded β-sheet that form an interdomain hinge region. X-ray structures are shown for the unliganded open state (PDB entry 1OMP)(38) and the liganded closed state (PDB entry 3MBP).(37) The semi-closed structure is taken from our previous computer simulations.(6)

In this paper, we extend our initial study of MBP to explore the ligand recognition mechanism in holo MBP and study the role of the S state during ligand recognition. In principle, conventional MD (cMD) simulations could be used to study the exchanges between conformational states in hinge-bending proteins; however, these conformational transitions typically consist of structural rearrangements on spatial scales of 10–100 Å, and the transition time for such rearrangement can be in microseconds or even milliseconds. These slow time scales render difficult the calculation of converged properties for domain motions, because current computer power often limits MD simulations to the submicrosecond time scale. Here, the accelerated MD (aMD)(33) method is used to enhance the sampling by 1–2 orders of magnitude and to allow the full characterization of the microsecond hinge-bending motions of MBP. In addition to performing cMD and aMD simulations, we computed free energy profiles with the adaptive biased force (ABF)[34,35] method to determine the relative energy of different conformational states. Finally, the binding affinity for a maltotriose ligand is determined by performing thermodynamic integration (TI), using the double-decoupling method.(36) The combined results of these calculations provide an unprecedented characterization of the thermodynamic properties underlying the recognition mechanism in MBP. Side and top views of MBP, showing the size and shape of the binding site that is modulated by the conformational state of the protein. Red indicates that the residues are close to the center of mass of the protein, where the binding site is located. The two domains (the N-terminal domain, NTD, and the C-terminal domain, CTD) are connected via a short helix and a two-stranded β-sheet that form an interdomain hinge region. X-ray structures are shown for the unliganded open state (PDB entry 1OMP)(38) and the liganded closed state (PDB entry 3MBP).(37) The semi-closed structure is taken from our previous computer simulations.(6)

Methods

The liganded and unliganded crystal structures of MBP (PDB entries 3MBP(37) and 1OMP,(38) respectively) were used as the basis of our simulation systems. The 3MBP structure corresponds to the closed conformation in the presence of maltotriose. It was determined at a resolution of 1.70 Å. Topology files have been prepared with the tleap module of the AMBER10 package,(39) and the ff03 force field,(40) with TIP3P water.(41) TI calculations were conducted with AMBER code, and ABF, aMD, and cMD simulations were conducted with the NAMD2 code.[42,43] Simulations were conducted in the NVT ensemble, using a time step of 2.0 fs in combination with the SHAKE algorithm.(44) The temperature was regulated with a Langevin thermostat, using a damping coefficient of 5 ps–1. Additional simulation parameters and system details are as described in ref (8). All the simulations are summarized in Table 1.

Table 1

Summary of the Simulations Performeda

simulation name	type (EQ, ABF, TI, aMD)	no. and simulation time (ns)
open-apo-EQ	EQ	2 × 50b
open-apo-aMD	aMD	2 × 50b
open-holo-EQ	EQ	10 × 40
open-holo-aMD	aMD	10 × 40
open-holo-TI	TI	10 × 6
semi-apo-EQ	EQ	2 × 50b
semi-holo-EQ	EQ	10 × 40
semi-holo-aMD	aMD	10 × 40
semi-holo-TI	TI	10 × 6
closed-holo-TI	TI	10 × 6
closed-holo-ABF1	ABF	1 × 50
closed-holo-ABF2	ABF	20 × 20
closed-apo-ABF3	ABF	20 × 20

Abbreviations: EQ, equilibrium; ABF, adaptive biased force; TI, thermodynamic integration; aMD, accelerated MD. The total simulation time was 4.1 μs.

Simulations discussed in ref (8).

Abbreviations: EQ, equilibrium; ABF, adaptive biased force; TI, thermodynamic integration; aMD, accelerated MD. The total simulation time was 4.1 μs. Simulations discussed in ref (8). To study the conformational change in holo MBP, 10 aMD and 10 cMD simulations were conducted for 40 ns each, starting from liganded O and S structures. The aMD method was able to accelerate the sampling of hinge-bending motions by facilitating variations in torsional angles. The following parameters were used: [Eb – V0, α] = [1300 kcal/mol, 260 kcal/mol]. The boost parameter, Eb, was chosen to be as high as the highest energy barriers, so that the bias potential, ΔV, is activated most of the time. The parameter α, which controls the roughness of the biased potential, was chosen so that the average ΔV remains around ∼40 kcal/mol during the simulations. At this acceleration level, it was possible to enhance significantly the hinge-bending motions, and sampling of side chain orientations, while maintaining the integrity of the protein secondary structure. ABF simulations were performed to generate a good pose for liganded O and S structures and to compute free energy profiles for the O → C transitions in apo and holo MBP. The basic idea of the ABF method is to measure the mean force along some chosen reaction coordinate (RC) and use it to help overcome free energy barriers and provide estimates of the free energy. On the basis of our previous experience with aMD simulations of apo MBP,(8) the radius of gyration (Rg) calculated for all α-carbons was found to be a good RC for studying the conformational change. To open the liganded protein, the RC was sampled between values of 20 and 23 Å, using 500 samples prior to application of the bias force, and a bin width of 0.1 Å. A single ABF simulation of 50 ns was able to scan the three stable states of MBP (Figure 3a). To ascertain the accuracy of the liganded S and O structures, we compared unliganded structures using structural parameters such as the rmsd and the interdomain closure angle (θ). While a single ABF simulation can in principle provide a free energy estimate along the RC, the efficiency of the sampling is limited by the rate at which the system can diffuse along the RC. To accelerate the convergence of the free energy calculations, the RC was divided into overlapping windows with a width of 0.2, for values of Rg between 20.6 and 22.4 Å. Statistics were accumulated by running 20 independent ABF simulations in parallel for both apo and holo MBP, for 20 ns each, to obtain a converged profile within 0.5 kcal/mol. The resulting free energy profile is shown in Figure 4b as a function of the interdomain closure angle (θ), as it was found to increase nearly linearly with the RC.

Figure 3

Single ABF simulation initiated from the liganded C state to generate models for the liganded O and S states. After equilibration (∼2 ns), the structures of the binding site were studied. With the exception of E153 and E111 (red), all polar residues making H-bonds with maltotriose (yellow) belong to the NTD (blue). Sugar-stacking residues (green) all belong to the CTD, except for W62.

Figure 4

Free energy calculations. (a) Evolution of TI estimates toward a converged value [ΔGAB(t) – ΔGAB,conv], as a function of the sampling per λ point, for the binding affinity of maltotriose with MBP in the O, S, and C states. (b) Free energy profiles computed with the ABF method used to display the ΔGAB values obtained from TI calculations. Empty circles denote data for the apo form and filled circles data for the holo form. The plot shows that a “population shift” occurs in the stability of different conformations in the presence of a ligand.

TI calculations, based on the double-decoupling scheme, were performed to study the role of the S state in ligand recognition, using maltotriose as the ligand. In this scheme, the free energy simulations involve gradually turning off the electrostatic and van der Waals interaction of the ligand molecule from the rest of the system. This involves two sets of simulations: the transfer of the ligand from bulk water to the gas phase, ΔG°2, and the transfer of the ligand from the binding site pocket in the complex to the gas phase, ΔG°1. During the latter part of the simulation, the ligand is weakly coupled to the protein and would have to explore the entire simulation box for ΔG°1 to converge. A practical solution to this problem is to use a harmonic potential that can constrain the ligand position during the calculations and facilitate the proper convergence of the free energy. To choose an appropriate force constant k, an analytic formula based on the equipartition theorem is given by Gilson et al.:(36)where ∂r2 is is the atomic fluctuation of the constrained atom or center of mass of the ligand during the course of a MD simulation with the fully unperturbed potential function, R is the molar gas constant, and T = 300 K. In this case, a harmonic potential of 1 kcal mol–1 Å–2 was chosen, and the free energy was calculated using the appropriate correction,(36) aswhere C0 is the standard concentration and the volume element, VI, can be calculated as The binding free energy of maltotriose was computed for MBP in the O, S, and C states. The ligand was decoupled from its environment in several steps characterized by a λ value between 0 and 1. The derivative of the potential energy with λ was sampled for each λ and integrated using the trapezoid rule to compute the free energy difference. The electrostatic interaction was first decoupled in 11 steps, with λ values of 0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, and 1.0. The van der Waals (vdW) part of the interaction energy was then decoupled in 21 steps (every 0.05), using soft-core potentials.[45,46] Each point was equilibrated prior to the production run by 0.5 ns of MD simulation and run for at least 5 ns. The convergence of the calculations was ascertained by running five independent simulations for selected λ points starting from different initial coordinates. In addition, a time series of an error estimate, σsim(t), was back-calculated at the end of the runs by comparing the current estimate of ∂H/∂λ at time t and the converged ∂H/∂λ(47) (see also Figure 4a):where T is the total number of block averages containing uncorrelated points, [∂H(λ)/∂λ]λ denotes the Hamiltonian derivative, block-averaged at time t, and [∂H(λ)/∂λ]λ is the ensemble average over the entire simulation time at a given λ. The error bars calculated in this way were found to be converged within ∼0.5 kcal/mol for sampling times longer than 5 ns per λ (Figure 4a). In addition to thermodynamic factors, kinetic factors such as the rate of exchange between stable states can affect the recognition rate. To study the impact of conformational exchange on association kinetics, we proposed a model of the conformational dynamics between the O and S states based on the two-state model introduced by Szabo et al.(48) for studying the bimolecular association rate in the presence of stochastic gating. In this model, the biomolecular association rate, kG, is calculated aswhere kUG is the steady-state ungated biomolecular association rate to the protein fixed in its most favorable conformations, k1 and k2 are the closing and opening rates, respectively, (k1 + k2)−1 is the domain gating period, and κ̃(s) is the Laplace transform of s. For MBP, the C state is not visited in the apo form[7,8,31] and kG corresponds to the rate of O ⇔ S transitions, whereas kUG corresponds to the recognition rate for a system initially fixed in the O state. Using this model, the time for the O to S exchange becomes the domain gating period, τex, and eq 5 is reduced to two limiting cases depending on the relative values of τex relative to the characteristic diffusion relaxation time, τD:(48) Equations 6 and 7 represent substrate association in the presence of “slow” and “fast” exchange, respectively, and imply that domain motion interferes with the substrate binding kinetics only in the slow exchange limit (τex ≫ τD). In these equations, Dlig is the sum of the translational diffusion coefficients of the ligand and protein, which can be approximated as the diffusion coefficient of the ligand. The collision distance of the system, rc, can be approximated as the sum of the smallest radii of spheres that contain each molecule. To obtain an estimate of the average time needed for a transition between O and S states, τex, we used the Smoluchowski diffusion equation, assuming a constant diffusion coefficient. The diffusion coefficient of the protein, Dprot, was estimated with the Hydropro method,(49) as the sum of the diffusion coefficient of the two domains, setting the temperature to 298.15 K, the viscosity to that of water at 298.15 K (0.890 cP), and the bead radii of the protein to 3.2 Å. For the simplest case of a flat free energy profile, the time required for the diffusion of the two domains becomes τ0 = L2/2Dprot,(50) or ∼25 ns. However, for MBP, the free energy profile is not flat and a better estimate for τex includes a penalty caused by the roughness of the profile. This stochastic diffusion process was modeled as a Markov chain, with discrete states corresponding to successive milestones on the free energy profile. Such a Markov random walk satisfies the condition of detailed balance under equilibrium conditions in the absence of net flux, and the stochastic evolution of the system obeys the Smoluchowski equation.(51) The Smoluchowski equation was discretized in space,(52) using the equation k = k0 exp[−1/2(β/ΔG)], where β = 1/kBT, k0 is the rate for a simple diffusion between two states i and j, and ΔG is their free energy difference.

Results

Below we first report the results of free energy calculations exploring the binding affinity of MBP for the maltotriose ligand when the protein is in the O, S, and C states. Next, we present results on kinetic aspects of the recognition process. Finally, simulations of the conformational change that provide a complete picture of the recognition process are described.

Free Energy Calculations

Although kinetic factors can influence the yields of different molecular complexes in cellular and other nonequilibrium environments, the primary factors that are used in the analysis of molecular recognition can be described by thermodynamics.(53) To explore the initial interactions between maltotriose and MBP, we built computational models of the liganded O and S states (see Methods). After equilibration, inspection of trajectories initiated from the O, S, and C ligand-bound structures indicated that maltotriose interacts more favorably with active site residues in the C state. Details of these interactions have been described previously in detail[54,55] and involve principally the hydroxyl groups of the sugar making hydrogen bonds with the NTD domain of the protein (D14, N12, and E65) and aromatic residues on the CTD (W230, Y155, W340, and Y341) that stack favorably with the sugar ring of the ligand (Figure 3). Single ABF simulation initiated from the liganded C state to generate models for the liganded O and S states. After equilibration (∼2 ns), the structures of the binding site were studied. With the exception of E153 and E111 (red), all polar residues making H-bonds with maltotriose (yellow) belong to the NTD (blue). Sugar-stacking residues (green) all belong to the CTD, except for W62. Interestingly, in the bound O state, several favorable interactions with both NTD and CTD are lost because of the extended size of the binding site. In particular, the sugar stacking residues on the CTD (W230, Y155, W340, and Y341) are not favorably positioned. In contrast, the sugar stacking residues in the S state can interact more favorably with the ligand. Residues Y155 and W340 form interdomain H-bonds, between W340 (CTD) and D65 (NTD), and between the backbone carbonyl of E111 and Y155.(8) These H-bonds help orient the sugar-stacking side chains so that they can interact favorably with the ligand, effectively locking the protein structure in the S state. TI calculations were used to quantify the difference in binding affinity for maltotriose in the O, S, and C states. The standard Free Energy, ΔGAB, was calculated as the energy required for removing the ligand from the binding site and moving it into the solvent. The calculations indicated that the O state displays only a weak affinity for the ligand (ΔGAB = −3.3 kcal/mol), whereas the S conformation leads to a more favorable binding energy (ΔGAB = −7.5 kcal/mol). The C state was found to display the highest affinity for maltotriose (ΔGAB = −17.7 kcal/mol). The strong binding in the C structure, when compared to that in the O and S structures, reflects the multivalent attachment of the maltotriose ligand to the receptor that includes multiple additional interactions with the NTD, such as electrostatic interactions employing the OH groups of the glucose units. When compared to the ligand in bulk, electrostatic interactions were found to play an important role in the high ligand affinity for the C state (ΔGelec values of −8.2, −1.0, and −0.2 kcal/mol for the C, S, and O states, respectively), whereas vdW interactions caused by improved sugar stacking partly explain the difference in affinity between the O and S states (ΔGvdW values of −9.5, −6.5, and −3.1 kcal/mol for the C, S, and O states, respectively). As the C state is not visited by apo MBP,[7,8,31] these results suggest that the S state is the most reactive state for ligand recognition. The binding affinities can be compared to the affinity constants determined for MBP from the association and dissociation rates (kon/koff) and measured with stopped-flow fluorescence spectroscopy.(56) In these experiments, kon is understood to be the rate of binding to the protein (which can be in the O or S state) whereas koff is much smaller and corresponds to the rate of dissociation from the C state after the conformational change. The association constant is calculated via the equation Kexp = kon/koff, and the ligand association free energy isIn our TI calculations, the time scale sampled by the simulations does not allow for the relaxation of the apoprotein from the Capo state into the more stable Oapo state. This was confirmed by computing the rmsd and the radius of gyration of the final structure. Therefore, the computed dissociation free energy for the C state corresponds only to the following transformation: The final protein conformation (Capo) is high in energy, and the protein relaxation energy can be calculated with the ABF method or estimated via NMR spectroscopy:(57)Combining eqs 10 and 11 and changing the sign of the free energy provide an estimate of the free energy of ligand association (eq 12), which is in good agreement with experiments (eq 9): To provide a complete view of the free energy profiles, we used ABF simulations to determine the energy requirements for opening and closing MBP. The apo and holo curves were aligned using the value computed with TI for the maltotriose binding affinity in the C state. The two ABF curves were found to correctly reproduce the ligand binding energy of the O and S states, also computed with TI (Figure 4b and Table 2). The C apo state was found to be high in energy in the apo from, consistent with the observation that this state is not visited by apo MBP. In addition, the plot highlights the existence of a ligand-induced population shift toward the closed structure and indicates a barrier-less transition from the O form to the C form in holo MBP.

Table 2

Free Energies (kilocalories per mole) of Protein–Ligand Association for Maltotriose

conformation	ΔG from TI	ΔG from ABF
open	–3.3 ± 0.5	–2.9 ± 0.5
semi-closed	–7.5 ± 0.5	–7.0 ± 0.5
closed	–17.7 ± 0.5	not available

Kinetic Aspects of the Recognition Process

Although informative, the free energy profiles presented above do not tell us whether the S state is actually used by MBP during ligand recognition. In practice, the biological activity of different conformational states can be determined by several factors, in particular (I) the binding free energy, (II) the occupancy, or population, of the different states, (III) the accessibility of the binding site, (IV) the rate at which the different states can exchange, (V) the ligand concentration, and (VI) the possibility of allosteric or cooperative interactions with the ligand. Kinetic effects could be important for MBP, as the S state displays a high affinity for the ligand but has a low occupancy (∼5%(7)). If the characteristic diffusion time of the ligand is similar to the time required for the O ⇔ S transitions, then the ligand is likely to be presented to both states and hence will be able to preferably bind the most favorable state (i.e., the S state). However, if the exchange between the two states is slow compared to ligand diffusion, the extent to which the minor S state will be able to compensate for its small equilibrium probability is limited, and the rate of the recognition will be influenced by the rate of the conformational transition (eq 6). A rough estimate of the rate of exchange between the O and S states was calculated on the basis of the computed free energy profiles, and the Smoluchowski diffusion equation (see Methods). The diffusion coefficient for the two domains of MBP was assumed to be constant at a Dprot value of ∼2 × 10–6 cm2/s (=2 × Å2/ns), which was estimated with the Hydropro method.(49) The two rigid domains have to move by a total distance L of ∼10 Å during the O → S transition, and for the simplest case of a flat free energy profile, the time required for such diffusion (τ0 = L2/2D) is 25 ns. In MBP, however, the free energy profile is not flat, and the S state is slightly higher in energy by ∼2 kcal/mol.(7) In addition, there is an energy barrier of ∼3.7 kcal/mol that was calculated by the ABF method. With this free energy profile, the time scale obtained for the O → S transitions, τex, is estimated to be ∼200 ns. Alternatively, a slightly different estimate for the free energy barrier (∼7 kcal/mol) has been obtained by Kondo et al.,(31) who used umbrella sampling. The small discrepancy between the two profiles may be due to the difference in the computational models or the fact that these authors used sampling times 40 times shorter (0.5 ns) than the sampling times per window used here (20 ns). For instance, Kondo et al did not report observing the motion of the balancing interface during O → S transitions, which suggests that the shorter sampling times may have led to a slight overestimate of the actual energy barrier. On the basis of their profile, our estimate for τex is ∼2 μs. Therefore, a conservative range of values for the time scale for the O ⇔ S transitions is between ∼200 ns and ∼2 μs. This estimate is consistent with our observation that the O → S transition in apo MBP occurs in aMD simulations(8) but is not observed in 50 ns cMD simulations. This estimate also lies within the larger bracket of possible time scale (20 ns to 20 μs) indicated by NMR experiments.(7) The time scale of the O → S transition should be compared to the characteristic diffusion time for the ligand, which was also estimated. Dlig was calculated with the Stokes–Einstein relation as kBT/6πμr, where r is the ligand radius (∼8 Å), μ is the viscosity of water (0.001 in SI units), and rc (eq 8) is set to ∼20 Å. With these values, τD is roughly ∼16 ns. Therefore, τex is found to be at least 10 times slower than τD, which implies that the protein conformational change is in the slow exchange mode (eq 6), a result consistent with the measured first-order kinetics.(56) In summary, the evaluation of kinetic factors suggests that the S state is unlikely to compensate fully for its low relative occupancy of ∼5%. Instead, the free energy profile (Figure 4b) indicates that the O → S conformational change is fast and generally occurs within nanoseconds, but only after the ligand interacts with the O state.

Standard and Accelerated MD Simulations

To confirm the scenario outlined above, we performed direct simulations of liganded MBP. Previous MD simulations(58) have indicated that the O → C conformational change of liganded MBP occurs within ∼30 ns. This suggests that the presence of a ligand at the interdomain cleft can accelerate the transition from the O state to the S state, possibly by assisting the reorganization of relevant side chains and/or by establishing bridges linking the two domains. To explore these questions, we conducted 10 cMD and 10 aMD simulations for 40 ns each, starting from liganded O and S states. During the aMD simulations, eight of the 10 trajectories showed the conformational transitions to the C structure (Figure 5). In the cMD runs, two conformational transitions to the C form were observed. This contrasts with our previous simulations of the apo state with cMD,(8) where no O → S transitions could be observed on a similar time scale (40 ns). Thus, simulations show that the ligand is able to accelerate significantly the rate of the conformational change. This result is consistent with the free energy profiles presented above and with our estimates for the kinetic rates.

Figure 5

Representative trajectories. (a) Two aMD trajectories are shown that were started from the liganded O state and evolved toward the C state (blue). Instead, in standard MD simulations, only one closing event was observed, and most simulations did not reach the C state within 40 ns, as shown here for one trajectory (orange). (b) Most cMD simulations remained in their original state (orange), whereas most aMD simulations rapidly evolved toward the most stable structure. Two such transitions are shown here, O → S (green) and S → C (blue). (c) Schematic drawing of the two different conformational changes in MBP, showing the variation of the interdomain angle as a function of the number of new protein–ligand contacts in the binding site. The ligand recognition is found to proceed in the same way in all productive trajectories, which can be described as follows. (1) The ligand is initially interacting with the sugar-stacking residues of the CTD (Y155 and W340) and with E111, which belongs to the hinge region. (2) The sugar-stacking residues of the CTD improve their positions to better stabilize the ligand, and hydrogen bonds form between Y155 and the carbonyl group of E111 and between W340 and D65, which locks the protein in the S state. During this process, the tip of the balancing interface moves into solution. (3) Favorable electrostatic interactions with negatively charged ligands of the NTD, in particular E44 and D65, exert a force pulling the NTD toward the CTD. (4) A global conformational change occurs, reducing the interdomain angle from ∼145° to ∼135° and the radius of gyration from ∼21.4 to ∼20.6 Å, to reach the closed form of MBP. Another interesting question about the conformational change is whether local changes in the binding site occur before or after the global structural change in the protein. Simulations suggest that the O → S and S → C transitions behave differently in that respect. In the case of the O → S transition, the number of protein–ligand contacts in the binding site (Figure 3) increases only after the conformational change is completed. For example, simulations show that the O → S transition is less dependent on the presence of the ligand (Figure 5c), as it can also occur in apo MBP.(8) Moreover, the reverse S → O conformational change is also observed in simulations of holo MBP (Figure 5a) and does not appear to be prevented by the presence of the ligand. In contrast, the S → C transition is unidirectional and found to be mostly driven by the interactions of the ligand with charged side chains of the NTD (such as E44 and D65), which occur in the simulations prior to the global conformational change.

Discussion

These results indicate that an IF model best describes the ligand recognition mechanism in MBP. The rate of O to S exchange in apo MBP appears to be too far below that required for compensating fully for the small occupancy of the S state. Given that the S state has a population of only ∼5%, it is likely that initial ligand recognition involves predominantly the O state. However, we have shown that the S state displays high affinity for the ligand. This conclusion is consistent with the experiments of Telmer and Shilton, who have reported that a point mutation in the balancing interface region of MBP (Figure 2) can improve recognition by a factor of ∼200.(59) This result is explained by our observation that such mutations can induce a population shift toward the S state.(8) These forces are not electrostatic in nature but nevertheless develop into long-range (>20 Å) influences. Similarly, remote mutations in viral proteins, such as HIV-1 protease, can lead to population shifts and drug resistance without modifying the catalytic abilities of the enzyme.(60) The presence of two stable states in MBP may allow for interactions with more diverse ligands, such as, for example, sugars with one to seven glucose units. In agreement with this proposal, less promiscuous PBPs, such as GlnBP, do not appear to possess a partially closed conformation,(61) while more promiscuous PBPs, such as LAO-BP, appear to possess a semi-closed apo conformation.(31) Interestingly, PBPs are able to create a selective binding site without displaying a molecular surface complementary to the ligand on a subangstrom level. Gerstein et al first hypothesized that in most hinge-bending proteins the open and closed states are only slightly different in energy and are in dynamic equilibrium at room temperature,(3) pointing toward a CS mechanism for most PBPs. However, recent studies have indicated that different recognition scenarios may operate within the PBP superfamily (Figure 1). For instance, a CS mechanism has been proposed for the ferric binding protein (FBP),(62) the ribose binding protein (RBP),(29) the glucose/galactose binding protein (GGBP),(63) and the choline binding protein (ChoX).(64) This mechanism has also been proposed for the l-lysine l-arginine l-ornithine binding protein (LAO-BP) transition,(65) although more recent theoretical studies point toward possible mechanisms involving both CS and IF steps.[31,66] In contrast, the glutamine binding protein (GlnBP) is believed to be less flexible than other PBPs, and NMR paramagnetic relaxation experiments suggest an IF mechanism.(61) In addition to the protein intrinsic flexibility, the IF mechanism is also generally more likely to prevail over the CS mechanism when the ligand affinity is high or the ligand concentration is high.(20) Findings from this study support a recognition mechanism for MBP that involves two possible pathways: either a pure IF mechanism or a mechanism involving both a CS step and an IF step. The IF mechanism is found to dominate, as MBP exists predominantly in the O state (95% occupancy), and the time scale of the conformational change in apo MBP is slow compared to the ligand diffusion rate (∼200 ns to 2 μs vs ∼16 ns). In fact, a similar IF mechanism may also occur in other PBPs that appear to possess minor semi-closed states, such as LAO-BP(31) and RBP,(29) as CS effects are typically slower than IF adaptations, due to the fact that they involve global transitions between protein states separated by energy barriers. Our calculations of the free energy show that the ligand induces a population shift in MBP that helps drive the system toward the closed conformation. The similarities we observe between all simulated pathways suggest that the shape of the free energy well is analogous to the funnel model used to describe protein folding pathways. The complete recognition process is best viewed as a “Pac Man” type action where the ligand is initially localized to one domain and naturally occurring hinge-bending vibrations in the protein are able to assist the recognition process by increasing the chances of a favorable encounter with side chains on the other domain, leading to a population shift. Furthermore, ligand binding to one domain also allostrerically affects the linker and interdomain interface (see also ref (67)). Indeed, the aMD simulations could uncover the detailed mechanism of the population shift and indicated that the ligand is able to create a bridge between the CTD and charged residues of the NTD. As new methods are being developed to incorporate descriptions of CS and IF effects in computer-assisted drug discovery, there are good prospects for the rational design of inhibitors targeting flexible proteins. For instance, when comparing several popular ligand docking tools for carbohydrates, Agostino et al. found that most codes can reproduce accurately the ligand binding geometry (pose) of experimentally determined structures when using X-ray structures that were determined with the ligand bound (thereby explicitly incorporating CS and IF effects(68)). However, when using the flexible version of these docking algorithms, the poses were more unreliable as indicated by large rmsds from the known complex. In practice, ligand-bound structures are often not available, and it remains a significant challenge to model accurately IF and CS effects. As shown here for MBP, free energy calculations coupled with aMD simulations can help in the assessment of the role of CS and IF mechanisms, as well as in determining the role of high-energy states in molecular recognition. The aMD simulations were found to enhance the sampling by 1 or 2 orders of magnitude, allowing for the exploration of slow hinge-bending motions. The combination of methods provided key details about protein dynamics, and the relative free energy difference between accessible states. In this regard, these methods appear far superior to cMD and flexible docking alone. As such, they hold great promise for future development and applications to drug discovery.

57 in total

Review 1. The Venus flytrap of periplasmic binding proteins: an ancient protein module present in multiple drug receptors.

Authors: C B Felder; R C Graul; A Y Lee; H P Merkle; W Sadee
Journal: AAPS PharmSci Date: 1999

Review 2. Molecular dynamics simulations of biomolecules.

Authors: Martin Karplus; J Andrew McCammon
Journal: Nat Struct Biol Date: 2002-09

3. Crystallographic evidence of a large ligand-induced hinge-twist motion between the two domains of the maltodextrin binding protein involved in active transport and chemotaxis.

Authors: A J Sharff; L E Rodseth; J C Spurlino; F A Quiocho
Journal: Biochemistry Date: 1992-11-10 Impact factor: 3.162

4. A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations.

Authors: Yong Duan; Chun Wu; Shibasish Chowdhury; Mathew C Lee; Guoming Xiong; Wei Zhang; Rong Yang; Piotr Cieplak; Ray Luo; Taisung Lee; James Caldwell; Junmei Wang; Peter Kollman
Journal: J Comput Chem Date: 2003-12 Impact factor: 3.376

5. A new strategy for structure determination of large proteins in solution without deuteration.

Authors: Yingqi Xu; Yu Zheng; Jing-Song Fan; Daiwen Yang
Journal: Nat Methods Date: 2006-11 Impact factor: 28.547

6. The energy landscapes and motions of proteins.

Authors: H Frauenfelder; S G Sligar; P G Wolynes
Journal: Science Date: 1991-12-13 Impact factor: 47.728

7. Hinge-bending in L-arabinose-binding protein. The "Venus's-flytrap" model.

Authors: B Mao; M R Pear; J A McCammon; F A Quiocho
Journal: J Biol Chem Date: 1982-02-10 Impact factor: 5.157

8. Atomic structures of periplasmic binding proteins and the high-affinity active transport systems in bacteria.

Authors: F A Quiocho
Journal: Philos Trans R Soc Lond B Biol Sci Date: 1990-01-30 Impact factor: 6.237

9. Insights into the conformational equilibria of maltose-binding protein by analysis of high affinity mutants.

Authors: Patrick G Telmer; Brian H Shilton
Journal: J Biol Chem Date: 2003-06-06 Impact factor: 5.157

10. Perturbation-response scanning reveals ligand entry-exit mechanisms of ferric binding protein.

Authors: Canan Atilgan; Ali Rana Atilgan
Journal: PLoS Comput Biol Date: 2009-10-23 Impact factor: 4.475

33 in total

1. Substrate transport activation is mediated through second periplasmic loop of transmembrane protein MalF in maltose transport complex of Escherichia coli.

Authors: Tomas Jacso; Erwin Schneider; Bernd Rupp; Bernd Reif
Journal: J Biol Chem Date: 2012-03-26 Impact factor: 5.157

2. Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites.

Authors: Gregory R Bowman; Phillip L Geissler
Journal: Proc Natl Acad Sci U S A Date: 2012-07-02 Impact factor: 11.205

3. Coevolutionary signals across protein lineages help capture multiple protein conformations.

Authors: Faruck Morcos; Biman Jana; Terence Hwa; José N Onuchic
Journal: Proc Natl Acad Sci U S A Date: 2013-12-02 Impact factor: 11.205

4. Implicit ligand theory: rigorous binding free energies and thermodynamic expectations from molecular docking.

Authors: David D L Minh
Journal: J Chem Phys Date: 2012-09-14 Impact factor: 3.488

5. A single-molecule dissection of ligand binding to a protein with intrinsic dynamics.

Authors: Eunkyung Kim; Sanghwa Lee; Aram Jeon; Jung Min Choi; Hee-Seung Lee; Sungchul Hohng; Hak-Sung Kim
Journal: Nat Chem Biol Date: 2013-03-17 Impact factor: 15.040

6. Both protein dynamics and ligand concentration can shift the binding mechanism between conformational selection and induced fit.

Authors: Nicholas Greives; Huan-Xiang Zhou
Journal: Proc Natl Acad Sci U S A Date: 2014-06-30 Impact factor: 11.205

7. Scalable molecular dynamics on CPU and GPU architectures with NAMD.

Authors: James C Phillips; David J Hardy; Julio D C Maia; John E Stone; João V Ribeiro; Rafael C Bernardi; Ronak Buch; Giacomo Fiorin; Jérôme Hénin; Wei Jiang; Ryan McGreevy; Marcelo C R Melo; Brian K Radak; Robert D Skeel; Abhishek Singharoy; Yi Wang; Benoît Roux; Aleksei Aksimentiev; Zaida Luthey-Schulten; Laxmikant V Kalé; Klaus Schulten; Christophe Chipot; Emad Tajkhorshid
Journal: J Chem Phys Date: 2020-07-28 Impact factor: 3.488

8. Perspective: pre-chemistry conformational changes in DNA polymerase mechanisms.

Authors: Tamar Schlick; Karunesh Arora; William A Beard; Samuel H Wilson
Journal: Theor Chem Acc Date: 2012-11-23 Impact factor: 1.702

9. dNTP-dependent conformational transitions in the fingers subdomain of Klentaq1 DNA polymerase: insights into the role of the "nucleotide-binding" state.

Authors: Paul J Rothwell; William J Allen; Evangelos Sisamakis; Stanislav Kalinin; Suren Felekyan; Jerker Widengren; Gabriel Waksman; Claus A M Seidel
Journal: J Biol Chem Date: 2013-03-22 Impact factor: 5.157

10. Ligand concentration regulates the pathways of coupled protein folding and binding.

Authors: Kyle G Daniels; Nam K Tonthat; David R McClure; Yu-Chu Chang; Xin Liu; Maria A Schumacher; Carol A Fierke; Scott C Schmidler; Terrence G Oas
Journal: J Am Chem Soc Date: 2014-01-09 Impact factor: 15.419