| Literature DB >> 17729286 |
Cinque S Soto1, Marc Fasnacht, Jiang Zhu, Lucy Forrest, Barry Honig.
Abstract
We describe a fast and accurate protocol, LoopBuilder, for the prediction of loop conformations in proteins. The procedure includes extensive sampling of backbone conformations, side chain addition, the use of a statistical potential to select a subset of these conformations, and, finally, an energy minimization and ranking with an all-atom force field. We find that the Direct Tweak algorithm used in the previously developed LOOPY program is successful in generating an ensemble of conformations that on average are closer to the native conformation than those generated by other methods. An important feature of Direct Tweak is that it checks for interactions between the loop and the rest of the protein during the loop closure process. DFIRE is found to be a particularly effective statistical potential that can bias conformation space toward conformations that are close to the native structure. Its application as a filter prior to a full molecular mechanics energy minimization both improves prediction accuracy and offers a significant savings in computer time. Final scoring is based on the OPLS/SBG-NP force field implemented in the PLOP program. The approach is also shown to be quite successful in predicting loop conformations for cases where the native side chain conformations are assumed to be unknown, suggesting that it will prove effective in real homology modeling applications. (c) 2007 Wiley-Liss, Inc.Entities:
Mesh:
Substances:
Year: 2008 PMID: 17729286 PMCID: PMC2553011 DOI: 10.1002/prot.21612
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Loop Prediction Accuracy of Published Methods
| RMSD (Å) | ||||||
|---|---|---|---|---|---|---|
| Loop length | Modeller | LOOPY | RAPPER | Rosetta | PLOP | PLOP II |
| 8 | 2.5 | 1.45 | 2.28 | 1.45 | 0.84 | NA |
| 9 | 3.5 | 2.68 | 2.41 | NA | 1.28 | NA |
| 10 | 3.5 | 2.21 | 3.48 | NA | 1.22 | NA |
| 11 | 5.5 | 3.52 | 4.94 | NA | 1.63 | 1.00 |
| 12 | 6.0 | 3.42 | 4.99 | 3.62 | 2.28 | 1.15 |
| 13 | 6.5 | NA | NA | NA | NA | 1.25 |
Data taken from Figure 9 of Fiser et al.14
Data taken from Table I of Xiang et al.8
Data taken from Table III of de Bakker et al.15
Data taken from Tables IV and VV of Rohl et al.13
Data taken from Table IV of Jacobson et al.9
Data taken from Table II of Zhu and Pincus et al.12
Estimated Time in Minutes Required to Generate 10,000 Closed and Sterically Feasible Loop Conformations
| Tusable | |||
|---|---|---|---|
| Algorithm | Eight | Eleven | Twelve |
| Random Tweak | 1.99 | 8.47 | 10.17 |
| CCD | 159.46 | 511.10 | 527.77 |
| Wriggling | 5.67 | 28.50 | 22.50 |
| PLOP-build | 3.39 | 35.00 | 71.67 |
| Direct Tweak | 34.00 | 73.44 | 75.65 |
| LOOPYbb | 22.86 | 62.21 | 59.15 |
See Equation 1.
The implementation of Canutescu and Dunbrack24 is about seven times faster.
Numbers of Cases Where Scoring Functions Rank the Native Loop as Lowest in Energy for Loop Ensembles Generated With LOOPY
| Scoring functions | ||||
|---|---|---|---|---|
| Loop length | DFIRE | LOOPY-sVDW+ | RAPDF | |
| 8 | 63 | 48 | 18 | 17 |
| 9 | 56 | 37 | 26 | 20 |
| 10 | 40 | 28 | 18 | 10 |
| 11 | 54 | 35 | 26 | 13 |
| 12 | 40 | 28 | 23 | 13 |
| 13 | 40 | 32 | 23 | 8 |
Number of loop targets studied.
Zhu et al.30 implementation of the DFIRE statistical potential.
Modified softened van der Waals scoring function.26
RAPDF18 statistical potential.
Average and Median Prediction Accuracies Using Loop Ensembles Generated With LOOPY
| Average (median) prediction accuracy | ||||
|---|---|---|---|---|
| Loop length | LOOPY | LOOPY/PLOP | DFIRE | Loop builder |
| 8 | 1.89 (1.59) | 1.96 (1.72) | 1.69 (1.40) | 1.31 (0.97) |
| 9 | 2.71 (2.04) | 3.67 (3.69) | 2.52 (1.97) | 1.88 (1.17) |
| 10 | 2.42 (2.18) | 3.40 (3.16) | 2.41 (2.22) | 1.93 (1.64) |
| 11 | 3.02 (2.48) | 4.36 (3.66) | 3.43 (2.68) | 2.50 (1.95) |
| 12 | 3.15 (2.71) | 4.11 (3.95) | 3.15 (2.74) | 2.65 (2.41) |
| 13 | 4.44 (3.46) | 5.84 (5.68) | 4.35 (3.63) | 3.74 (2.85) |
LOOPY prediction.
Prediction based on a PLOP energy minimization of the 50 low energy loop conformations according to LOOPY.
Prediction based on a DFIRE ranking of the loops generated using LOOPY.
Prediction obtained from LoopBuiulder which applies a PLOP energy minimization to the 50 low energy loop conformations selected by DFIRE.
Performance Characteristics of Loop Closure Procedures
| Loop lengths | ||||||
|---|---|---|---|---|---|---|
| 8 | 11 | 12 | ||||
| Algorithm | fVDW | RMSDmin | fVDW | RMSDmin | fVDW | RMSDmin |
| Random Tweak | 0.19 | 1.22 | 0.06 | 2.22 | 0.06 | 2.64 |
| CCD | 0.17 | 1.20 | 0.05 | 2.11 | 0.05 | 2.57 |
| Wriggling | 0.14 | 1.43 | 0.03 | 2.24 | 0.04 | 2.68 |
| PLOP-build | 0.17 | 0.99 | 0.02 | 2.18 | 0.01 | 2.69 |
| Direct Tweak | 0.82 | 0.69 | 0.74 | 1.20 | 0.78 | 1.48 |
| LOOPYbb | 0.83 | 0.89 | 0.66 | 1.51 | 0.69 | 1.80 |
Fraction of closed and sterically feasible loop conformations.
RMSD averaged over loop conformations from each ensemble with the smallest RMSD to native.
Implementation of Xiang et al.8,26
Implementation of Zhu et al.30
In-house implementation of the Wriggling algorithm.25
Dihedral angle based build up procedure of Jacobson et al.9 obtained from the author.
Implementation of Xiang et al.8,26
Figure 1Box plot for various RMSD values obtained from different scoring functions. See text for details.
Figure 2The lowest RMSD to native conformation as a function of the number of top scoring loops (RMSDBest) according to DFIRE. The curves represent averages taken over each loop length.
Average and Median Loop Prediction Accuracies Obtained With Loop Builder Using Both Native and Repacked Side Chains
| Average (median) prediction accuracy | |||
|---|---|---|---|
| Loop length | Native | Repack | Repack |
| 8 | 1.31 (0.97) | 1.37 (1.17) | 1.17 (0.79) |
| 9 | 1.88 (1.17) | 1.99 (1.53) | 1.69 (0.91) |
| 10 | 1.93 (1.73) | 2.22 (1.90) | 1.82 (1.48) |
| 11 | 2.50 (1.95) | 2.94 (2.69) | 2.52 (2.28) |
| 12 | 2.65 (2.41) | 3.21 (2.81) | 2.71 (2.28) |
Ensemble sizes of 1000 for eight, 2000 for nine, and 5000 for ten, eleven, and twelve-residue loops.
Ensemble size of 10,000 loop conformations was used for all loop lengths.
Prediction Accuracies for 8 and 12-Residue Data Set of Rohl et al. Using LoopBuilder With Repacked Side Chains
| Average (median) prediction accuracy | |||
|---|---|---|---|
| Loop length | LoopBuilder | LoopBuilder | Rohl |
| 8 | 1.63 (1.14) | 1.35 (0.99) | 1.46 (1.20) |
| 12 | 3.70 (2.77) | 3.54 (3.11) | 3.56 (3.28) |
Ensemble size of 1000 for eight and 5000 for twelve-residue loops.
Ensemble size of 10,000.
Average and median prediction accuracies for Rohl et al.13 were computed over the same set of loop targets considered in Columns 2 and 3.