| Literature DB >> 31750313 |
Francesco Delfino1,2, Yuri Porozov1,3, Eugene Stepanov4,5, Gaik Tamazian6, Valentina Tozzini2.
Abstract
Transitions between different conformational states are ubiquitous in proteins, being involved in signaling, catalysis, and other fundamental activities in cells. However, modeling those processes is extremely difficult, due to the need of efficiently exploring a vast conformational space in order to seek for the actual transition path for systems whose complexity is already high in the stable states. Here we report a strategy that simplifies this task attacking the complexity on several sides. We first apply a minimalist coarse-grained model to Calmodulin, based on an empirical force field with a partial structural bias, to explore the transition paths between the apo-closed state and the Ca-bound open state of the protein. We then select representative structures along the trajectory based on a structural clustering algorithm and build a cleaned-up trajectory with them. We finally compare this trajectory with that produced by the online tool MinActionPath, by minimizing the action integral using a harmonic network model, and with that obtained by the PROMPT morphing method, based on an optimal mass transportation-type approach including physical constraints. The comparison is performed both on the structural and energetic level, using the coarse-grained and the atomistic force fields upon reconstruction. Our analysis indicates that this method returns trajectories capable of exploring intermediate states with physical meaning, retaining a very low computational cost, which can allow systematic and extensive exploration of the multi-stable proteins transition pathways.Entities:
Keywords: PROMPT; classical molecular dynamics; coarse grained models; minimal action path; proteins conformational transitions; transition path sampling
Year: 2019 PMID: 31750313 PMCID: PMC6843051 DOI: 10.3389/fmolb.2019.00104
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Figure 1The model system. (A) The atomistic representation of the protein chain (side chains are omitted for clarity) (B) coarse grained representation. In both cases the internal variables are reported. (C) The apo-closed form (named A) and the calcium-bound open form (named B) of Calmodulin (pdb codes: 1WRZ, 1EXR).
Functional forms (first and second columns) and parameterization (third column) of the MCG FF.
| Restrains | |||
| rcut = 8.5 Å | |||
| r0 = 9.5 Å |
An illustration of the statistics-based parameterization procedure is also reported in the plots. Upper plot: The dots represent the inverse bond angle fluctuations as a function of the bond angle, evaluated using atomistic simulations of different test proteins (yellow a globular protein, blue the calmodulin itself, different symbols for different runs). This curve can be fitted as damped sin (cyan line). Assuming statistical equilibrium one has an angle dependent effective elastic constant from the equation k′ = k.
Figure 2Simulations results from Langevin dynamics at 300 K, γ = 8 ps−1 (A) and 130 K, γ = 2 ps−1 (B). Temperature (upper plots), total and potential energies (central plot) and σ are reported along the simulations from A to B (using FFB and starting from configuration A, green lines), and from B to A (using FFA and starting from configuration B, red lines). For the 300 K simulation also the running averages are reported for the potential energy as yellow and blue lines, respectively. (C,D) Scatter plot of the LD simulations (same color coding as previous) compared with MAP and PROMPT paths evaluation (color coding as in the legend of C). The connected dots are the representative elements of the PP clustering procedure. Sample configurations are reported in colors corresponding to the lines and their approximate location in the plots are indicated by arrows.
Figure 3Simulation data analysis and comparison with PROMPT and MAP (A) Potential energy vs. σ along the simulations at 300 K (dotted lines) and at 130 K (dashed lines), with the FFA (red) and FFB (green) force fields (scales for FFA and FFB are shifted of 3 Kcal/mole to align the activated state as explained in the text. Both scales are reported on the left and right axis, in colors corresponding to the FF they refer to). Colored dot with error bars are averages over subsets of structures classified by σ intervals (errorbars correspond to standard deviations of data from average values). Representative closed (σ = 0) and open (σ = 1) structures are reported under the plot. (B) Potential energies vs. σ evaluated over the representative structures of the clusters outputted by PP procedure. Squares connected by solid lines: representatives optimized by the PP procedure (filled = from the 300 K simulation, empty = from the 130 K simulations, red with FFA, green with FFB). Circles connected by dashed/dotted lines: same as previous, but evaluated over a trajectory of structures extracted from the simulations, the nearest to the optimal ones. (Same color and empty/filled code as for squares; shift of scales as in A). (C) Comparison of the 130 K “optimal” energies with energies of trajectories from MAP (cyan) and PROMPT (magenta) evaluated with FFA (dotted) and FFB (dashed). Representative structures of the activated states are reported in corresponding colors. Same scale shift as in (A); the vertical scales are broken to zoom over the low energies. (D) Potential energy evaluated with the atomistic FF over the same trajectories as in (C) (same color coding). Representative structures are reported in corresponding colors; the Ramachandran plot of the activated states of PROMPT and MAP are reported (yellow squared dots superimposed to the standard map in colors). Both in (C,D) the area with distorted sheets in the activated state of PROMPT is highlighted with a yellow circle.