| Literature DB >> 25883538 |
Steffen Lindert, J Andrew McCammon.
Abstract
Many excellent methods exist that incorporate cryo-electron microscopy (cryoEM) data to constrain computational protein structure prediction and refinement. Previously, it was shown that iteration of two such orthogonal sampling and scoring methods – Rosetta and molecular dynamics (MD) simulations – facilitated exploration of conformational space in principle. Here, we go beyond a proof-of-concept study and address significant remaining limitations of the iterative MD–Rosetta protein structure refinement protocol. Specifically, all parts of the iterative refinement protocol are now guided by medium-resolution cryoEM density maps, and previous knowledge about the native structure of the protein is no longer necessary. Models are identified solely based on score or simulation time. All four benchmark proteins showed substantial improvement through three rounds of the iterative refinement protocol. The best-scoring final models of two proteins had sub-Ångstrom RMSD to the native structure over residues in secondary structure elements. Molecular dynamics was most efficient in refining secondary structure elements and was thus highly complementary to the Rosetta refinement which is most powerful in refining side chains and loop regions.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25883538 PMCID: PMC4393324 DOI: 10.1021/ct500995d
Source DB: PubMed Journal: J Chem Theory Comput ISSN: 1549-9618 Impact factor: 6.006
Figure 1Flowchart of the iterative cryoEM-guided Rosetta–MD protocol. For each of the proteins, the starting model for the first round of the protocol was the best scoring model from a previous benchmark. The protocol started with an MDFF run, followed by a Rosetta run, repeated twice: MDFF1–Rosetta1–MDFF2–Rosetta2–MDFF3–Rosetta3. The best scoring models after the third round of Rosetta was picked as the final model.
Figure 2Summary of the results of all rounds of the iterative MDFF–Rosetta refinement. The RMSD values over all protein residues with respect to the native structure in all three rounds of iterative cryoEM-guided protein structure refinement are shown. The first panel (black dot, labeled initial model) shows the top-scoring model of the last round of Rosetta-only refinement in (15e). A trend line has been added to facilitate comparison to the initial model. For the MDFF simulations, final models after each of the two separate steps (MD simulation with low density forces (red) and subsequent minimization with high forces (blue)) are shown. For each of the three rounds of Rosetta refinement, the RMSD values of the top scoring model (thick black dot) and the subsequent four best-scoring models (four thin black dots) are shown. A blue bar indicates the range of the RMSDs of all models built during that round of Rosetta refinement (irrespective of their individual scores), with the low-RMSD end of the bar corresponding to the lowest-RMSD model built. The high-RMSD end of the bar is beyond the plotting limit of 3 Å for all proteins and rounds.
RMSDs of the Models Built with Respect to Native Structure over All Residues and over All Residues in Secondary Structure Elements (in Parentheses)I
| protein | start | MDFF1 | Rosetta1 | MDFF2 | Rosetta2 | MDFF3 | Rosetta3 | best |
|---|---|---|---|---|---|---|---|---|
| 1X91 | 1.82(1.19) | 1.42(0.88) | 1.53(0.94) | 1.38(0.89) | 1.37(0.85) | 1.41(0.92) | 1.29(0.79) | 0.98(0.74) |
| 1DVO | 2.50(1.65) | 2.18(1.28) | 2.16(1.19) | 1.56(0.99) | 1.89(1.01) | 2.14(1.17) | 2.14(0.98) | 1.54(1.01) |
| 1ICX | 2.65(2.14) | 1.90(1.45) | 2.05(1.72) | 1.75(1.38) | 2.21(1.87) | 1.84(1.39) | 1.80(1.37) | 1.80(1.37) |
| 2FD5 | 2.16(1.46) | 1.89(1.17) | 2.03(1.26) | 1.94(1.19) | 2.07(1.31) | 1.96(1.27) | 2.01(1.38) | 1.55(1.29) |
RMSDs of the starting models.
RMSDs of the models after 1 ns of the first round of MDFF (step 1, density map scaling factor 0.3).
RMSDs of the top scoring model after the first round of Rosetta refinement.
RMSDs of the models after 1 ns of the second round of MDFF (step 1, density map scaling factor 0.3).
RMSDs of the top scoring model after the second round of Rosetta refinement.
RMSDs of the models after 1 ns of the third round of MDFF (step 1, density map scaling factor 0.3).
RMSDs of the top scoring model after the third round of Rosetta refinement.
RMSDs of the best Rosetta models ever built during the iterative MD–Rosetta refinement protocol.
All RMSDs shown are in Å.
Figure 3Lowest RMSD models after three rounds of iterative MDFF–Rosetta refinement for 1X91 (panels A and C) and 1ICX (panels B and D). The native structure is shown in gold, while the model is shown in turquois. A, B) Ribbon backbone representation of the proteins. The overall structure within secondary structure elements has been recovered in the models. C, D) Stereoviews of side chain non-hydrogen coordinates are shown in addition to ribbon backbone representation. Most side chain conformations within the interface of secondary structure elements have been built correctly.
Figure 4Model quality evolution of 1X91 during the three rounds of MD. The RMSD of the MD structure with respect to the native model is shown for all protein residues (blue) and for residues in secondary structure elements (green). RMSDs of specific reference models are displayed by vertical lines: the full length RMSD of the starting model (black line), the RMSD over SSEs of the starting model (dashed black line), the full length RMSD of the best scoring model from the previous Rosetta round (red line), and the RMSD over SSEs of the best scoring model from the previous Rosetta round (dashed red line). For the first round of MD, the red and black lines coincide. Success in this stage of the protocol is characterized by the blue line breaking through the red line (corresponding to MD sampling lower RMSD models than the best scoring model seen in the last Rosetta round) and the green line breaking through the dashed red line.
Figure 5RMSD vs score plots for all four benchmark proteins. The first panel (blue, labeled round3) shows the results of the last round of Rosetta-only refinement in ref (15e). The other three panels show the results for the first (green), second (yellow), and third (red) Rosetta round of the iterative MDFF–Rosetta protocol, respectively. The native structure, relaxed in the Rosetta force field, is shown in all panels (black). The best-scoring structure in each of the three rounds of the iterative MDFF–Rosetta protocol is shown as a solid black dot.