| Literature DB >> 36099639 |
José Ramón López-Blanco1, Yves Dehouck2, Ugo Bastolla2, Pablo Chacón1.
Abstract
We propose and validate a novel method to efficiently explore local protein loop conformations based on a new formalism for constrained normal mode analysis (NMA) in internal coordinates. The manifold of possible loop configurations imposed by the position and orientation of the fixed loop ends is reduced to an orthogonal set of motions (or modes) encoding concerted rotations of all the backbone dihedral angles. We validate the sampling power on a set of protein loops with highly variable experimental structures and demonstrate that our approach can efficiently explore the conformational space of closed loops. We also show an acceptable resemblance of the ensembles around equilibrium conformations generated by long molecular simulations and constrained NMA on a set of exposed and diverse loops. In comparison with other methods, the main advantage is the lack of restrictions on the number of dihedrals that can be altered simultaneously. Furthermore, the method is computationally efficient since it only requires the diagonalization of a tiny matrix, and the modes of motions are energetically contextualized by the elastic network model, which includes both the loop and the neighboring residues.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36099639 PMCID: PMC9516680 DOI: 10.1021/acs.jcim.2c00870
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 6.162
Figure 1Constrained normal modes of the 66–76 loop in the structure of the Bordetella bronchiseptica hydrolase (PDB: 3IRS, chain C). The conformational ensembles (rainbow colored) represent the motions encoded in each mode. They are generated by perturbing the loop conformation along a given mode direction until reaching an RMSD of 3 Å from the initial loop structure (light blue backbone stick representation). The displayed ensembles include intermediate conformations, every 0.25 Å along the pathway. The rainbow colors indicate the amplitude and direction, from positive (yellow) to negative (red). Only the 3 lowest and the 2 highest frequency modes are displayed here, but the motions along all 17 modes are visualized in Movie S1.
Mean Values of the Initial and Final Morphing RMSD
| data set | RMSD | RMSD | RMSD | RMSD | |
|---|---|---|---|---|---|
| 392 | 2.5 ± 2.5 Å | 0.55 ± 0.33 Å | 0.51 ± 0.31 Å | 0.44 ± 0.31 Å | |
| 184 | 4.2 ± 2.2 Å | 0.76 ± 0.28 Å | 0.69 ± 0.27 Å | 0.63 ± 0.27 Å | |
| 80 | 4.3 ± 2.1 Å | 0.85 ± 0.29 Å | 0.73 ± 0.26 Å | 0.71 ± 0.31 Å | |
| 104 | 4.2 ± 2.3 Å | 0.69 ± 0.27 Å | 0.65 ± 0.26 Å | 0.58 ± 0.23 Å | |
| REMD | 15 | 10.4 ± 3.3 Å | 0.71 ± 0.18 Å | 0.71 ± 0.17 Å | 0.64 ± 0.25 Å |
Mean and standard deviation of the initial RMSD between pairs of experimentally observed loop conformations, on the full dataset and three subsets (see Section ).
Mean and standard deviation of the final RMSD between target and reconstructed loop conformation.
The procedure is performed after the superposition of the loop-flanking residues, rather than the complete structures. The flanking residues are the anchors and two more on either side of the loop.
The ω backbone dihedrals are considered as degrees of freedom, along φ and ψ.
Figure 2Two illustrative morphing cases: (1) on the left: transition of loops 81–93 of the PPPK kinase, from 3hsz (gray) to 3ht0 (orange); (2) on the right: transition of loops 317–337 of the MopE protein, from 2vov (gray) to 2vox (orange). The corresponding final morphed conformations are represented in blue.
REMD and NMA Sampling Space Comparation
| RMSD | modes | 90%var | Bf | RMSD | NMA | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MD | NMA | MD | NMA | γ90% | Z90% | avg | NMA | MD | time | speed | ||
| 2eaq | 1.5 | 1.4 | 2 | 8 | 0.64 | 602 | 0.85 | 1.51 | 1.15 | 0.99 | 37.8 | 1.2 |
| 5w0g | 1.0 | 1.3 | 6 | 5 | 0.69 | 184 | 0.56 | 0.75 | 0.48 | 0.32 | 27.2 | 1.6 |
| 2ns0 | 0.7 | 1.3 | 9 | 6 | 0.69 | 196 | 0.90 | 0.57 | 0.44 | 0.32 | 28.6 | 1.2 |
| 4dpb | 0.8 | 1.3 | 7 | 6 | 0.73 | 271 | 0.75 | 0.50 | 0.42 | 0.27 | 30.5 | 2.0 |
| 5nod | 1.0 | 1.4 | 5 | 4 | 0.80 | 183 | 0.92 | 1.18 | 0.47 | 0.40 | 26.1 | 1.5 |
| 6elm | 1.6 | 1.3 | 7 | 6 | 0.70 | 559 | 0.93 | 1.49 | 0.96 | 0.48 | 32.2 | 1.7 |
| 3bv8 | 0.5 | 1.3 | 14 | 5 | 0.78 | 111 | 0.97 | 0.30 | 0.26 | 0.25 | 31.8 | 1.5 |
| 5e9p | 2.8 | 2.3 | 3 | 5 | 0.71 | 47 | 0.98 | 2.34 | 1.08 | 0.31 | 31.7 | 1.8 |
| 4bpf | 1.3 | 1.3 | 4 | 5 | 0.45 | 48 | 0.90 | 0.57 | 0.44 | 0.26 | 35.2 | 1.5 |
| 6fmb | 1.2 | 1.3 | 8 | 6 | 0.74 | 116 | 0.89 | 0.99 | 0.55 | 0.41 | 41.3 | 0.9 |
| 5k2l | 0.8 | 1.3 | 10 | 5 | 0.74 | 279 | 0.93 | 0.55 | 0.33 | 0.21 | 31.7 | 1.3 |
| 3k3v | 3.5 | 2.4 | 5 | 5 | 0.67 | 38 | 0.97 | 1.74 | 1.32 | 0.84 | 36.7 | 0.9 |
| 3fdr | 1.1 | 1.1 | 7 | 9 | 0.80 | 231 | 0.78 | 0.62 | 0.48 | 0.45 | 55.0 | 3.1 |
| 4qy7 | 1.3 | 1.2 | 5 | 5 | 0.65 | 144 | 0.60 | 0.81 | 0.69 | 0.46 | 37.3 | 5.0 |
| 3dkm | 3.6 | 3.4 | 4 | 3 | 0.73 | 107 | 0.89 | 3.63 | 2.46 | 2.40 | 39.8 | 1.7 |
| Avg | 1.5 | 1.5 | 6.4 | 5.5 | 0.70 | 207 | 0.85 | 1.13 | 0.78 | 0.56 | 34.8 | 1.8 |
Backbone RMSD deviation from the average reference loop sampled by REMD.
Number of eigenvectors needed to explain 90% of the variance.
Corresponding similarity indexes and Z-scores obtained with 90% variance.
B-factor Spearman correlations.
Minimum of RMSD deviation with respect to the crystal conformation.
Required time for the sampling 10K loops, and speed factor with respect to RCD sampling.