| Literature DB >> 29650980 |
Shahrooz Zarbafian1, Mohammad Moghadasi2, Athar Roshandelpoor2, Feng Nan2, Keyong Li2, Pirooz Vakli2,1, Sandor Vajda3, Dima Kozakov4, Ioannis Ch Paschalidis5,6,7,8.
Abstract
We propose a novel stochastic global optimization algorithm with applications to the refinement stage of protein docking prediction methods. Our approach can process conformations sampled from multiple clusters, each roughly corresponding to a different binding energy funnel. These clusters are obtained using a density-based clustering method. In each cluster, we identify a smooth "permissive" subspace which avoids high-energy barriers and then underestimate the binding energy function using general convex polynomials in this subspace. We use the underestimator to bias sampling towards its global minimum. Sampling and subspace underestimation are repeated several times and the conformations sampled at the last iteration form a refined ensemble. We report computational results on a comprehensive benchmark of 224 protein complexes, establishing that our refined ensemble significantly improves the quality of the conformations of the original set given to the algorithm. We also devise a method to enhance the ensemble from which near-native models are selected.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29650980 PMCID: PMC5955889 DOI: 10.1038/s41598-018-23982-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The near-native energy landscape of the 2YVJ complex using ClusPro conformations.
Figure 2The flowchart of the SSDU procedure.
Figure 3The x-axes of these plots list 156 out of 224 protein complexes that have either ClusPro or SSDU non-zero CAPRI Acceptable (or better) quality solutions. The complexes are sorted by the number of ClusPro counts and the y-axis shows the number of Acceptable (or better) quality solutions out of an ensemble of 1000 or 1500 conformations for enzymes/antibodies and other types, respectively, produced by ClusPro, or refined by SDU and SSDU.
Figure 4The x-axes of these plots list 110 out of 224 protein complexes that have either ClusPro or SSDU non-zero CAPRI Medium (or better) quality solutions. The complexes are sorted by the number of ClusPro counts and the y-axis shows the number of Medium (or better) quality solutions out of an ensemble of 1000 or 1500 conformations for enzymes/antibodies and other types, respectively, produced by ClusPro, or refined by SDU and SSDU.
Figure 5The x-axes of these plots list 29 out of 224 protein complexes that have either ClusPro or SSDU non-zero CAPRI High quality solutions. The complexes are sorted by the number of ClusPro counts and the y-axis shows the number of High quality solutions out of an ensemble of 1000 or 1500 conformations for enzymes/antibodies and other types, respectively, produced by ClusPro, or refined by SDU and SSDU.
Percentage improvement of Acceptable (or better), Medium (or better) and High quality solutions by SSDU versus SDU and ClusPro for a benchmark of 224 complexes. Note that for each of the entries in the table, complexes with zero number of solutions for both ClusPro and SDU/SSDU are removed.
| Benchmark | SSDU vs. ClusPro | SSDU vs. SDU | |
|---|---|---|---|
| Acceptable (or better) | Average | 24.62% | 21.31% |
| Total | 53.14% | 30.37% | |
| Medium (or better) | Average | 53.26% | 58.25% |
| Total | 132.69% | 112.43% | |
| High | Average | 410.71% | 405.88% |
| Total | 424.93% | 157.06% |
Figure 6The percentage of increase in the number of Acceptable/Medium/High quality solutions among the top 3, 5 and 10 clusters achieved by SSDU over ClusPro.