| Literature DB >> 27847872 |
Alberto Perez1, Joseph A Morrone1, Emiliano Brini1, Justin L MacCallum2, Ken A Dill3.
Abstract
We report a key proof of principle of a new acceleration method [Modeling Employing Limited Data (MELD)] for predicting protein structures by molecular dynamics simulation. It shows that such Boltzmann-satisfying techniques are now sufficiently fast and accurate to predict native protein structures in a limited test within the Critical Assessment of Structure Prediction (CASP) community-wide blind competition.Entities:
Keywords: Protein folding; molecular dynamics; replica exchange; simulations; structure prediction
Mesh:
Substances:
Year: 2016 PMID: 27847872 PMCID: PMC5106196 DOI: 10.1126/sciadv.1601274
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 1Hydrophobic core instructive.
(A) Hydrophobic residues in the native structure of protein G. (B) True native contacts that we seek (green lines) and the many possible hydrophobic contacts that must be considered in searching for the native structure (red lines).
Fig. 2MELD performance in CASP11.
MELD predictions submitted prospectively are shown in blue. Experimental structures are shown in red. Below the structures are Hubbard plots (cumulative α-carbon accuracy), where each line assesses the quality of one submitted model of the target, and where the blue line is the MELD result. The best predictions are given by “elbows” that pass most closely through the bottom-right corner. (A) MELD predictions of three proteins given only sequence data. (B) MELD prediction of 212-residue target with simulated unassigned NMR data. (C) MELD prediction in the presence and absence of data.
Fig. 3Cluster populations are indicative of success.
(A) Clustering of all conformations for protein T0816. (A) Conformational clustering occurs most strongly at low replica index (low temperature) and in near-native structures. Circle sizes represent cluster populations. The circle center coordinates denote the average RMSD and average replica index of the structures belonging to that cluster. Unfolded clusters tend to have low population. (B) Summary of top cluster populations for tested T0 targets. The populations converge to a single dominant native-like structure for the three protein targets we tried that were less than 100 residues long. Modeling longer chains did not converge to a single population.
Summary of NMR-like data provided for Ts targets.
| Ts761 | 237 | 2 | 3867 | 29,210 | 828 | 5183 | 6 | 8.6 |
| Ts763 | 131 | 2 | 2537 | 11,516 | 693 | 2619 | 5 | 3.0 |
| Ts785 | 112 | 3 | 1072 | 4,009 | 210 | 510 | 4 | 5.1 |
| Ts800 | 212 | 1 | 2251 | 19,759 | 489 | 3313 | 6 | 3.2 |
| Ts802 | 118 | 3 | 900 | 2,014 | 223 | 475 | 7 | 2.0 |
| Ts810 | 113 | 1 | 1174 | 3,627 | 129 | 317 | 4 | 5.3 |
| Ts818 | 134 | 1 | 873 | 2,228 | 149 | 426 | 14 | 4.5 |
| Ts824 | 110 | ? | 867 | 1,600 | 182 | 365 | 12 | 1.8 |
| Ts826 | 201 | 1 | 2531 | 23,959 | 152 | 816 | 8 | 11.9 |
| Ts832 | 209 | 1 | 2146 | 17,630 | 274 | 2346 | 5 | 4.0 |