| Literature DB >> 25411785 |
Martin Carlsen1, Patrice Koehl2, Peter Røgen1.
Abstract
Knowledge-based potentials are energy functions derived from the analysis of databases of protein structures and sequences. They can be divided into two classes. Potentials from the first class are based on a direct conversion of the distributions of some geometric properties observed in native protein structures into energy values, while potentials from the second class are trained to mimic quantitatively the geometric differences between incorrectly folded models and native structures. In this paper, we focus on the relationship between energy and geometry when training the second class of knowledge-based potentials. We assume that the difference in energy between a decoy structure and the corresponding native structure is linearly related to the distance between the two structures. We trained two distance-based knowledge-based potentials accordingly, one based on all inter-residue distances (PPD), while the other had the set of all distances filtered to reflect consistency in an ensemble of decoys (PPE). We tested four types of metric to characterize the distance between the decoy and the native structure, two based on extrinsic geometry (RMSD and GTD-TS*), and two based on intrinsic geometry (Q* and MT). The corresponding eight potentials were tested on a large collection of decoy sets. We found that it is usually better to train a potential using an intrinsic distance measure. We also found that PPE outperforms PPD, emphasizing the benefits of capturing consistent information in an ensemble. The relevance of these results for the design of knowledge-based potentials is discussed.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25411785 PMCID: PMC4239004 DOI: 10.1371/journal.pone.0109335
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Properties of the different protein decoy sets used in this study.
| Decoy set | Nprot h | Nres h | Ndecoys h | RMSD | MT | GDT-TS | Q |
| Titan-HRD | 142 | 127 (35) | 854 (119) | 2.4 (0.5) | 2.7 (1) | 0.76 (0.1) | 0.85 (0.04) |
| Titan-HRD*
| 1155 | 111 (35) | 994 (138) | 2.6 (0.6) | 2.7 (1) | 0.75 (0.1) | 0.85 (0.04) |
| TASSER Set II | 55 | 80 (17) | 438 (98) | 6.3 (1.5) | 9.3 (3.2) | 0.54 (0.05) | 0.77 (0.03) |
| hg Structal | 28 | 150 (7) | 29 (0) | 4.1 (1.2) | 4.4 (1.5) | 0.71 (0.07) | 0.85 (0.04) |
| 4-state | 7 | 64 (4.9) | 664 (15) | 5.2 (1.4)) | 8.3 (2.9) | 0.53 (0.11) | 0.75 (0.05) |
| fisa | 4 | 60 (10) | 500 (0.4) | 7.5 (1.8) | 8.6 (1.7) | 0.47 (0.06) | 0.75 (0.06) |
| fisa CASP3 | 5 | 88 (15) | 1437 (390) | 12 (1.6) | 21 (4.1) | 0.3 (0.03) | 0.67 (0.02) |
| lmds | 10 | 53 (10) | 433 (79) | 7.7 (1.1) | 12 (2.6) | 0.46 (0.04) | 0.72 (0.03) |
| lattice ssfit | 8 | 71 (10) | 1997 (1.5) | 9.9 (1.0) | 17 (2.4) | 0.3 (0.03) | 0.64 (0.02) |
| Rosetta-All | 41 | 82 (25)) | 999 (0.5) | 12 (1.4) | 29 (5.6) | 0.27 (0.03) | 0.61 (0.02) |
| Rosetta-Tsai | 29 | 63 (9.4) | 1862 (43) | 7.4 (2.1) | 11 (3.9) | 0.46 (0.08) | 0.73 (0.04) |
| Rosetta-Baker | 57 | 88 (20) | 100 (0) | 8.5 (1.4) | 15 (3.3) | 0.45 (0.05) | 0.76 (0.03) |
| CASP5 | 41 | 202 (78) | 117 (41) | 13 (3.7) | 29 (14) | 0.38 (0.12) | 0.68 (0.08) |
| CASP6 | 39 | 172 (71) | 216 (34) | 13 (4.9) | 27 (16) | 0.39 (0.12) | 0.70 (0.08) |
| CASP7 | 64 | 183 (80) | 349 (40) | 10 (3.4) | 17 (10) | 0.47 (0.11) | 0.75 (0.07) |
| CASP8 | 77 | 187 (81) | 334 (67) | 8.8 (3.1) | 13 (8.6) | 0.54 (0.11) | 0.79 (0.06) |
| CASP9 | 81 | 180 (81) | 402 (95) | 11 (4.9) | 19 (14) | 0.49 (0.12) | 0.77 (0.07) |
| CASP-HRD | 109 | 188 (79) | 192 (72) | 2.8 (0.4) | 2.2 (0.6) | 0.76 (0.03) | 0.89 (0.02) |
| CASP10-stage1 | 93 | 232 (102) | 18 (1.9) | 13 (4.3) | 20 (9.4) | 0.46 (0.08) | 0.76 (0.05) |
| CASP10-stage2 | 93 | 232 (102) | 132 (7.6) | 11 (3.7) | 17 (8.2) | 0.55 (0.03) | 0.80 (0.03) |
| TSA TM>0.5 | 242 | 179 (77) | 291 (119) | 6.3 (2.67) | 9.4 (5.5) | 0.63 (0.09) | 0.82 (0.05) |
| TSA TM <0.5 | 303 | 110 (48) | 602 (436) | 12 (3.9) | 23 (12) | 0.34 (0.1) | 0.68 (0.07) |
Training set (Titan HRD) and test set (Titan HRD*) from the Titan High resolution decoy set [20], available at http://titan.princeton.edu/2010-10-11/Decoys/.
Tasser Set II is a structurally non-redundant set of protein structures and decoys derived with the program TASSER. It is available at http://zhanglab.ccmb.med.umich.edu/decoys/.
Decoy sets from the Decoys ‘R’ us repository http://dd.compbio.washington.edu.
Different decoy Rosetta-based decoy sets (see text for details), available at http://depts.washington.edu/bakerpg/decoys/.
Collection of models from the successive CASP5 to CASP9 experiments, available from the CASP web site http://predictioncenter.org. CASP-HRD is a high resolution subset of the union of the five sets CASP5 to CASP9, which includes models that have a TM-score larger than 0.5 and a RMSD less than 4 Å to the native structures.
The Stage_1 and Stage_2 decoy sets used in the CASP10 quality assessment category, available from the CASP web site http://predictioncenter.org. For details on how these sets are prepared, see [48].
All high and low resolution targets (TSA TM-score>0.5)/(TSA TM-score <0.5) are listed in Files S1 and S2 respectively found in the supporting information.
Nprot is the number of different proteins in the dataset, Nres is the average number of residues computed over all proteins in a dataset, and Ndecoys is the average number of decoys per proteins, averaged over the dataset. RMSD, MT, GDT-TS, and Q are the distance measures between the decoys and the corresponding native structures, averaged over all decoys and all proteins. We provide both the average values and the average mean absolute deviations (in parenthesis).
Correlations between the four distance measures.
| Distance | |||||
| Test set | Distance | RMSD | MT | GDT-TS* | Q* |
| Titan-HRD | RMSD | 1 | 0.92 (0.06) | 0.92 (0.04) | 0.87 (0.08) |
| MT | 0.92 (0.06) | 1 | 0.92 (0.03) | 0.94 (0.03) | |
| GDT-TS* | 0.92 (0.04) | 0.92 (0.03) | 1 | 0.95 (0.03) | |
| Q* | 0.87 (0.08) | 0.94 (0.03) | 0.95 (0.03) | 1 | |
| CASP-HRD | RMSD | 1 | 0.74 (0.16) | 0.73 (0.14) | 0.6 (0.19) |
| MT | 0.74 (0.16) | 1 | 0.72 (0.13) | 0.83 (0.07) | |
| GDT-TS* | 0.73 (0.14) | 0.72 (0.13) | 1 | 0.74 (0.13) | |
| Q* | 0.6 (0.19) | 0.83 (0.07) | 0.74 (0.13) | 1 | |
| CASP10-stage1 | RMSD | 1 | 0.83 (0.16) | 0.71 (0.24) | 0.68 (0.24) |
| MT | 0.83 (0.16) | 1 | 0.73 (0.2) | 0.82 (0.14) | |
| GDT-TS* | 0.71 (0.24) | 0.73 (0.2) | 1 | 0.86 (0.12) | |
| Q* | 0.68 (0.24) | 0.82 (0.14) | 0.86 (0.12) | 1 | |
| CASP10-stage2 | RMSD | 1 | 0.78 (0.16) | 0.51 (0.22) | 0.49 (0.19) |
| MT | 0.78 (0.16) | 1 | 0.52 (0.2) | 0.69 (0.14) | |
| GDT-TS* | 0.51 (0.22) | 0.52 (0.2) | 1 | 0.64 (0.17) | |
| Q* | 0.49 (0.19) | 0.69 (0.14) | 0.64 (0.17) | 1 | |
| TSA | RMSD | 1 | 0.92 (0.06) | 0.8 (0.15) | 0.82 (0.11) |
| MT | 0.92 (0.06) | 1 | 0.78 (0.14) | 0.85(0.08) | |
| TM-score> 0.5 | GDT-TS* | 0.8 (0.15) | 0.78 (0.14) | 1 | 0.89 (0.12) |
| Q* | 0.82 (0.11) | 0.85 (0.08) | 0.89 (0.12) | 1 | |
| TSA | RMSD | 1 | 0.8 (0.12) | 0.59 (0.24) | 0.56 (0.18) |
| MT | 0.8 (0.12) | 1 | 0.54 (0.2) | 0.68(0.14) | |
| TM-score <0.5 | GDT-TS* | 0.59 (0.24) | 0.54 (0.2) | 1 | 0.67 (0.22) |
| Q* | 0.56 (0.18) | 0.68 (0.14) | 0.67 (0.22) | 1 | |
Pearson's correlation coefficient Corr(d,d) between the two distance measures d 1 and d 2. We provide both the average value and the mean absolute deviation (in parenthesis) over the data set considered.
Comparing the best models picked by different distance measures.
| Distance | |||||
| Test set | Distance | RMSD | MT | GDT-TS* | Q* |
| Titan-HRD | RMSD | 1 | 0.88 (0.12) | 0.91 (0.09) | 0.76 (0.17) |
| MT | 0.94 (0.06) | 1 | 0.92 (0.08) | 0.91 (0.07) | |
| GDT-TS* | 0.96 (0.04) | 0.94 (0.07) | 1 | 0.91 (0.08) | |
| Q* | 0.87 (0.09) | 0.92 (0.07) | 0.89 (0.09) | 1 | |
| CASP-HRD | RMSD | 1 | 0.71 (0.26) | 0.79 (0.22) | 0.49 (0.38) |
| MT | 0.76 (0.22) | 1 | 0.76 (0.22) | 0.76 (0.23) | |
| GDT-TS* | 0.8 (0.22) | 0.68 (0.27) | 1 | 0.48 (0.39) | |
| Q* | 0.57 (0.33) | 0.81 (0.16) | 0.66 (0.24) | 1 | |
| CASP10-stage1 | RMSD | 1 | 0.81 (0.24) | 0.75 (0.31) | 0.79 (0.23) |
| MT | 0.9 (0.13) | 1 | 0.85 (0.19) | 0.94 (0.09) | |
| GDT-TS* | 0.79 (0.24) | 0.78 (0.24) | 1 | 0.82 (0.2) | |
| Q* | 0.78 (0.22) | 0.88 (0.14) | 0.8 (0.23) | 1 | |
| CASP10-stage2 | RMSD | 1 | 0.76 (0.22) | 0.71 (0.3) | 0.63 (0.29) |
| MT | 0.83 (0.18) | 1 | 0.73 (0.24) | 0.83 (0.19) | |
| GDT-TS* | 0.73 (0.26) | 0.65 (0.24) | 1 | 0.59 (0.29) | |
| Q* | 0.62 (0.29) | 0.82 (0.18) | 0.62 (0.23) | 1 | |
| TSA | RMSD | 1 | 0.9 (0.11) | 0.84 (0.19) | 0.81 (0.18) |
| MT | 0.94 (0.07) | 1 | 0.88 (0.14) | 0.92 (0.09) | |
| TM-score> 0.5 | GDT-TS* | 0.85 (0.16) | 0.79 (0.21) | 1 | 0.73 (0.24) |
| Q* | 0.79 (0.18) | 0.89 (0.11) | 0.81 (0.16) | 1 | |
| TSA | RMSD | 1 | 0.83 (0.19) | 0.73 (0.27) | 0.71 (0.27) |
| MT | 0.87 (0.14) | 1 | 0.74 (0.27) | 0.88 (0.14) | |
| TM-score <0.5 | GDT-TS* | 0.74 (0.27) | 0.7 (0.27) | 1 | 0.67 (0.27) |
| Q* | 0.68 (0.27) | 0.85 (0.16) | 0.68 (0.27) | 1 | |
R-score R(d,d) between the two distance measures d 1 and d 1. We provide both the average value and the mean absolute deviation (in parenthesis) over the data set considered.
Figure 1Showing nine different types of residue pair interactions for our single model method PPD (continuous lines) and our consensus method PPE (dotted lines) when trained on RMSD (blue), MT(red), GDT-TS(green) and Q(black).
Energy-distance correlations.
| PPD | PPE | RAPDF | GOAP | AMBER | ||||||||
| Training distance | Training distance | |||||||||||
| Decoy set | Test Distance | RMSD | MT | GDT-TS* | Q* | RMSD | MT | GDT-TS* | Q* | |||
| Titan-HRD | RMSD | 0.77 (0.05) | 0.82 (0.04) | 0.81 (0.05) | 0.79 (0.04) | 0.81 (0.05) | 0.88 (0.03) | 0.85 (0.03) | 0.82 (0.04) | 0.5 (0.14) | 0.64(0.11) | 0.01 (0.02) |
| MT | 0.82 (0.04) | 0.89 (0.03) | 0.87 (0.03) | 0.87 (0.03) | 0.86 (0.04) | 0.95 (0.01) | 0.92 (0.02) | 0.89 (0.02) | 0.47 (0.16) | 0.63(0.11) | 0.01 (0.02) | |
| GDT-TS* | 0.83 (0.06) | 0.91 (0.02) | 0.91 (0.02) | 0.9 (0.03) | 0.86 (0.04) | 0.93 (0.02) | 0.95 (0.01) | 0.92 (0.02) | 0.43 (0.18) | 0.63(0.12) | 0.001 (0.02) | |
| Q* | 0.82 (0.05) | 0.92 (0.2) | 0.92 (0.02) | 0.93 (0.02) | 0.87 (0.03) | 0.95 (0.01) | 0.97 (0.01) | 0.95 (0.01) | 0.37 (0.22) | 0.57(0.13) | 0.002 (0.02) | |
| CASP-HRD | RMSD | 0.32 (0.16) | 0.31 (0.16) | 0.31 (0.16) | 0.26 (0.16) | 0.42 (0.16) | 0.51 (0.17) | 0.51 (0.18) | 0.45 (0.17) | 0.31(0.15) | 0.3 (0.15) | 0.01 (0) |
| MT | 0.45 (0.11) | 0.49 (0.11) | 0.49 (0.12) | 0.43 (0.12) | 0.55 (0.13) | 0.69 (0.11) | 0.68 (0.13) | 0.61 (0.13) | 0.37 (0.14) | 0.41(0.13) | 0.02 (0) | |
| GDT-TS* | 0.38 (0.14) | 0.39 (0.13) | 0.39 (0.13) | 0.33 (0.14) | 0.51 (0.15) | 0.6 (0.14) | 0.65 (0.14) | 0.58 (0.17) | 0.39 (0.14) | 0.43(0.11) | 0.02 (0) | |
| Q* | 0.46 (0.12) | 0.6(0.11) | 0.64 (0.1) | 0.57 (0.1) | 0.54 (0.12) | 0.69 (0.12) | 0.75 (0.12) | 0.71 (0.1) | 0.32 (0.15) | 0.42(0.12) | 0.02 (0) | |
| CASP10-stage1 | RMSD | 0.44 (0.22) | 0.53 (0.18) | 0.53 (0.18) | 0.48 (0.2) | 0.5 (0.22) | 0.54 (0.22) | 0.52 (0.21) | 0.5 (0.21) | 0.18(0.24) | 0.32 (0.26) | −0.03 (0) |
| MT | 0.47 (0.19) | 0.61(0.17) | 0.62 (0.17) | 0.55 (0.18) | 0.56 (0.16) | 0.63 (0.13) | 0.61 (0.13) | 0.57 (0.13) | 0.13 (0.22 | 0.34(0.24) | −0.06 (0) | |
| GDT-TS* | 0.4 (0.21) | 0.49 (0.23) | 0.51 (0.2) | 0.43 (0.21) | 0.57 (0.21) | 0.63 (0.16) | 0.63 (0.16) | 0.59 (0.2) | 0.22 (0.28) | 0.4(0.2) | −0.05 (0) | |
| Q* | 0.51 (0.22) | 0.63(0.16) | 0.63 (0.16) | 0.56 (0.18) | 0.68 (0.12) | 0.75 (0.06) | 0.75 (0.07) | 0.72 (0.1) | 0.24 (0.3) | 0.41(0.2) | −0.05 (0) | |
| CASP10-stage2 | RMSD | 0.33 (0.18) | 0.34 (0.2) | 0.34 (0.2) | 0.31 (0.2) | 0.31 (0.18) | 0.37 (0.19) | 0.34 (0.16) | 0.3 (0.21) | 0.15(0.13) | 0.2 (0.14) | −0.005 (0) |
| MT | 0.42 (0.16) | 0.49 (0.16) | 0.49 (0.15) | 0.45 (0.14) | 0.4 (0.16) | 0.5 (0.19) | 0.48 (0.15) | 0.42 (0.16) | 0.19 (0.14) | 0.29(0.14) | 0.003 (0) | |
| GDT-TS* | 0.31 (0.15) | 0.29 (0.14) | 0.29 (0.13) | 0.25 (0.14) | 0.37 (0.16) | 0.42 (0.19) | 0.44 (0.18) | 0.38 (0.18) | 0.29 (0.14) | 0.37(0.17) | 0.007 (0) | |
| Q* | 0.45 (0.22) | 0.56(0.14) | 0.58 (0.12) | 0.52 (0.15) | 0.51 (0.18) | 0.62 (0.17) | 0.66 (0.14) | 0.62 (0.15) | 0.28(0.13) | 0.41(0.13) | −0.006 (0) | |
| TSA | RMSD | 0.62 (0.12) | 0.62 (0.13) | 0.63 (0.13) | 0.59 (0.14) | 0.74 (0.09) | 0.8 (0.07) | 0.78 (0.08) | 0.73 (0.09) | 0.5 (0.14) | 0.58(0.13) | 0.02 (0.01) |
| MT | 0.65 (0.11) | 0.69 (0.1) | 0.7 (0.1) | 0.65 (0.12) | 0.75 (0.08) | 0.83 (0.06) | 0.8 (0.06) | 0.74 (0.07) | 0.5 (0.16) | 0.58(0.12) | 0.03 (0.01) | |
| TM-score > 0.5 | GDT-TS* | 0.6 (0.15) | 0.59 (0.14) | 0.6 (0.13) | 0.54 (0.16) | 0.78 (0.06) | 0.85 (0.04) | 0.84 (0.04) | 0.79 (0.06) | 0.61 (0.11) | 0.7(0.1) | 0.03 (0.01) |
| Q* | 0.69 (0.11) | 0.71 (0.09) | 0.72 (0.1) | 0.68 (0.1) | 0.87 (0.04) | 0.94 (0.02) | 0.93 (0.02) | 0.9 (0.03) | 0.57 (0.13) | 0.67(0.11) | 0.03 (0.01) | |
| TSA | RMSD | 0.3 (0.16) | 0.34 (0.18) | 0.34 (0.18) | 0.32 (0.18) | 0.29 (0.23) | 0.36 (0.27) | 0.34 (0.27) | 0.29 (0.23) | 0.16 (0.13) | 0.25(0.15) | 0 (0.01) |
| MT | 0.38 (0.14) | 0.47 (0.15) | 0.47 (0.15) | 0.45 (0.17) | 0.34 (0.23) | 0.45 (0.24) | 0.41 (0.23) | 0.35 (0.22) | 0.19 (0.14) | 0.29(0.13) | −0.003(0.01) | |
| TM-score < 0.5 | GDT-TS* | 0.27 (0.19) | 0.27 (0.2) | 0.28 (0.19) | 0.24 (0.18) | 0.36 (0.29) | 0.44 (0.33) | 0.42 (0.32) | 0.36 (0.29) | 0.26 (0.16) | 0.33(0.19) | 0.004 (0.02) |
| Q* | 0.41 (0.17) | 0.47 (0.16) | 0.46 (0.17) | 0.45 (0.15) | 0.53 (0.19) | 0.63 (0.18) | 0.61 (0.18) | 0.57 (0.19) | 0.23 (0.17) | 0.3(0.19) | −0.004 (0.02) | |
All-atom statistical distance-based potential [49].
All-atom orientation-dependent statistical potential [7].
The semi-empirical physical potential AMBER99SB-ILDN[50].
PPD and PPE have been trained on the distance measure d 1 and tested against the distance measure d 2.
Average value, and mean absolute deviation (in parenthesis) over the data set.
Energy-distance Rvalues.
| PPD | PPE | RAPDF | GOAP | AMBER | ||||||||
| Training distance | Training distance | |||||||||||
| Decoy set | Test Distance | RMSD | MT | GDT-TS* | Q* | RMSD | MT | GDT-TS* | Q* | |||
| Titan-HRD | RMSD | 0.57 (0.17) | 0.52 (0.17) | 0.51 (0.19) | 0.48 (0.2) | 0.61 (0.14) | 0.63 (0.17) | 0.62 (0.14) | 0.61 (0.15) | 0.43 (0.18) | 0.56 (0.15) | 0.26 (0.47) |
| MT | 0.73 (0.1) | 0.71 (0.11) | 0.69 (0.13) | 0.69 (0.15) | 0.76 (0.11) | 0.8 (0.11) | 0.79 (0.12) | 0.8 (0.1) | 0.5 (0.18) | 0.6(0.17) | 0.23 (0.53) | |
| GDT-TS* | 0.78 (0.08) | 0.74 (0.09) | 0.74 (0.08) | 0.73 (0.09) | 0.82 (0.07) | 0.86 (0.06) | 0.86 (0.07) | 0.86 (0.07) | 0.52 (0.22) | 0.67(0.12) | 0.18 (0.58) | |
| Q* | 0.73 (0.11) | 0.76 (0.12) | 0.73 (0.1) | 0.77 (0.1) | 0.77 (0.09) | 0.84 (0.09) | 0.85 (0.09) | 0.86 (0.08) | 0.37 (0.19) | 0.53(0.17) | 0.16 (0.51) | |
| CASP-HRD | RMSD | 0.19 (0.31) | 0.00 (0.42) | −0.04 (0.48) | 0.03 (0.4) | 0.27 (0.3) | 0.33 (0.31) | 0.34 (0.27) | 0.3 (0.31) | 0.14 (0.37) | 0.22(0.37) | −0.11 (0.34) |
| MT | 0.31 (0.26) | 0.24 (0.3) | 0.14 (0.31) | 0.16 (0.36) | 0.38 (0.24) | 0.43 (0.22) | 0.43 (0.21) | 0.38 (0.25) | 0.24 (0.4) | 0.46(0.32) | −0.09 (0.47) | |
| GDT-TS* | 0.14 (0.3) | −0.09 (0.4) | −0.08 (0.42) | −0.06 (0.43) | 0.28 (0.23) | 0.31 (0.25) | 0.32 (0.24) | 0.28 (0.24) | 0.12 (0.44) | 0.34(0.33) | −0.22 (0.38) | |
| Q* | 0.27 (0.25) | 0.35 (0.33) | 0.34 (0.32) | 0.33 (0.31) | 0.28 (0.2) | 0.39 (0.24) | 0.4 (0.25) | 0.41 (0.26) | 0.13 (0.4) | 0.43(0.26) | −0.17(0.42) | |
| CASP10-stage1 | RMSD | 0.55 (0.23) | 0.53 (0.3) | 0.57(0.26) | 0.48 (0.39) | 0.5 (0.29) | 0.52 (0.26) | 0.53 (0.26) | 0.51 (0.27) | 0.32 (0.34) | 0.44(0.26) | 0.13 (0.6) |
| MT | 0.69(0.1) | 0.7(0.12) | 0.72(0.12) | 0.64(0.14) | 0.58(0.15) | 0.62(0.12) | 0.63(0.13) | 0.6(0.15) | 0.37(0.28) | 0.52(0.2) | 0.16(0.6) | |
| GDT-TS* | 0.52(0.32) | 0.47(0.39) | 0.52(0.37) | 0.42(0.43) | 0.43(0.4) | 0.46(0.37) | 0.51(0.33) | 0.46(0.36) | 0.27(0.44) | 0.53(0.22) | 0.15(0.37) | |
| Q* | 0.6(0.24) | 0.64(0.15) | 0.67(0.14) | 0.57(0.19) | 0.57(0.22) | 0.6(0.18) | 0.63(0.17) | 0.61(0.18) | 0.32(0.38) | 0.47(0.31) | 0.19(0.5) | |
| CASP10-stage2 | RMSD | 0.38(0.29) | 0.23(0.36) | 0.29(0.3) | 0.26(0.35) | 0.36(0.31) | 0.35(0.32) | 0.32(0.32) | 0.35(0.34) | 0.29(0.28) | 0.39(0.29) | 0.11(0.42) |
| MT | 0.55(0.23) | 0.46(0.31) | 0.5(0.29) | 0.49(0.33) | 0.45(0.32) | 0.47(0.32) | 0.47(0.32) | 0.48(0.3) | 0.44(0.32) | 0.52(0.24) | 0.17(0.45) | |
| GDT-TS* | 0.23(0.35) | 0.11(0.33) | 0.14(0.36) | 0.14(0.34) | 0.25(0.32) | 0.23(0.28) | 0.29(0.32) | 0.25(0.32) | 0.23(0.3) | 0.39(0.3) | 0.01(0.27) | |
| Q* | 0.45(0.32) | 0.46(0.31) | 0.51(0.29) | 0.48(0.3) | 0.44(0.3) | 0.47(0.24) | 0.51(0.27) | 0.53(0.27) | 0.33(0.28) | 0.41(0.27) | 0.13(0.43) | |
| TSA | RMSD | 0.47 (0.24) | 0.22 (0.41) | 0.21 (0.41) | 0.22 (0.41) | 0.69 (0.14) | 0.69(0.13) | 0.69 (0.13) | 0.68 (0.14) | 0.44 (0.25) | 0.59(0.19) | 0.24 (0.38) |
| MT | 0.62 (0.14) | 0.4 (0.24) | 0.39 (0.27) | 0.41 (0.24) | 0.8 (0.08) | 0.81 (0.07) | 0.82 (0.08) | 0.8 (0.08) | 0.54 (0.18) | 0.74(0.1) | 0.32 (0.33) | |
| TM-score> 0.5 | GDT-TS* | 0.29 (0.28) | 0.09 (0.43) | 0.08 (0.47) | 0.09 (0.45) | 0.58 (0.16) | 0.6 (0.16) | 0.61 (0.16) | 0.57 (0.18) | 0.37 (0.31) | 0.56(0.41) | 0.09 (0.26) |
| Q* | 0.49 (0.19) | 0.32 (0.3) | 0.3 (0.33) | 0.33 (0.28) | 0.68 (0.14) | 0.72 (0.14) | 0.74 (0.13) | 0.74 (0.13) | 0.38 (0.29) | 0.59(0.43) | 0.14 (0.2) | |
| TSA | RMSD | 0.16 (0.35) | 0.19 (0.3) | 0.19 (0.34) | 0.16 (0.32) | 0.26 (0.35) | 0.33 (0.37) | 0.32 (0.36) | 0.29 (0.35) | 0.19 (0.35) | 0.27(0.4) | 0.04 (0.41) |
| MT | 0.27 (0.34) | 0.38 (0.3) | 0.39 (0.33) | 0.37 (0.29) | 0.36 (0.33) | 0.44 (0.31) | 0.42 (0.32) | 0.41 (0.31) | 0.28 (0.34) | 0.4(0.33) | 0.04 (0.46) | |
| TM-score <0.5 | GDT-TS* | 0.07 (0.3) | 0.07 (0.28) | 0.1 (0.3) | 0.06 (0.29) | 0.26 (0.3) | 0.32 (0.3) | 0.32 (0.3) | 0.29 (0.3) | 0.19 (0.3) | 0.28(0.36) | 0.05 (0.3) |
| Q* | 0.18 (0.35) | 0.28 (0.34) | 0.29 (0.35) | 0.3 (0.31) | 0.42 (0.27) | 0.5 (0.27) | 0.49 (0.29) | 0.49 (0.27) | 0.19 (0.29) | 0.31(0.32) | 0.01 (0.31) | |
All-atom statistical distance-based potential [49].
All-atom orientation-dependent statistical potential [7].
The semi-empirical physical potential AMBER99SB-ILDN [50].
PPD and PPE have been trained on the distance measure d 1 and tested against the distance measure d 2.
Average value, and mean absolute deviation (in parenthesis) over the data set.
Figure 2Energy-distance correlations as a function of the quality of the decoy set.
For each decoy set in Titan-HRD, CASP-HRD, and TSA (a total of 797 sets), we plot the correlation Corr(E, d 1) as a function of the mean value of d 1 over the decoy set, where E is either the PPD energy (red, plus sign +) or the PPE energy (black, cross sign x) trained on the set Titan-HRD with the distance measure d 1, and d 1 is one of the fourth distance measures considered, namely RMSD (panel A), MT (panel B), GDT-TS* (panel C), and Q* (panel D). The corresponding running means computed over 20 equidistant intervals for PPD (red, solid line) and PPE (black, dashed line) are shown. Clearly, the quality of the correlation energy-distance decreases as the diversity of the decoy set increases.
Figure 3R scores versus Energy-distance correlations.
For each decoy set in Titan-HRD, CASP-HRD, and TSA, we plot the R score as a function of the correlation coefficient , where E is either the PPD energy (red, plus sign +) or the PPE energy (black, cross sign x) trained on the set Titan-HRD with the distance measure d 1, and d 1 is one of the fourth distance measures considered, namely RMSD (panel A), MT (panel B), GDT-TS* (panel C), and Q* (panel D). The corresponding running means computed over 20 equidistant intervals for PPD (red, solid line) and PPE (black, dashed line) are shown. Note that compares the best decoy picked based on the energy value E with the decoy closest to the native structure according to the distance measure d 1. There is a clear correlation between these two values for all four distance measures.
Assessing the best decoys selected by energy functions on different decoy datasets.
| Best | PPD | PPE | RAPDF | AMBER | GOAP | ||
| Titan-HRD | RMSD | 1.1(0.21) | 1.7(0.29) | 1.6(0.27) | 1.9(0.4) | 2.1(0.55) | 1.7(0.3) |
| MT | 0.75(0.22) | 1.4(0.38) | 1.2(0.38) | 1.8(0.62) | 2.3(0.89) | 1.6(0.54) | |
| GDT-TS | 0.94(0.02) | 0.89(0.03) | 0.92(0.03) | 0.85(0.05) | 0.8(0.09) | 0.88(0.03) | |
| Q | 0.94(0.01) | 0.92(0.02) | 0.93(0.02) | 0.88(0.03) | 0.86(0.04) | 0.89(0.03) | |
| 4-state | RMSD | 1.1(0.1) | 3.8(0.44) | 2.2(0.21) | 2.1(0.22) | 3.6(1.5) | 1.6(0.24) |
| MT | 0.9(0.33) | 5.8(2.1) | 1.2(0.31) | 2.6(0.52) | 6.2(3.4) | 1.5(0.38) | |
| GDT-TS | 0.91(0.03) | 0.55(0.06) | 0.86(0.08) | 0.8(0.04) | 0.67(0.1) | 0.86(0.04) | |
| Q | 0.94(0.02) | 0.75(0.03) | 0.92(0.02) | 0.87(0.04) | 0.79(0.1) | 0.9(0.02) | |
| fisa | RMSD | 3.7(0.76) | 5.7(0.78) | 6.5(1.4) | 4.4(0.72) | 8.5(1.5) | 4.5(0.45) |
| MT | 3.8(1.5) | 7.9(3.7) | 5.5(2) | 5.4(2.5) | 10(4.2) | 4.9(1.9) | |
| GDT-TS | 0.65(0.07) | 0.51(0.14) | 0.54(0.08) | 0.6(0.06) | 0.46(0.08) | 0.59(0.06) | |
| Q | 0.82(0.02) | 0.79(0.03) | 0.78(0.03) | 0.78(0.02) | 0.73(0.02) | 0.79(0.03) | |
| fisa CASP3 | RMSD | 6(2) | 12(1.6) | 12(2.4) | 12(4) | 12(1) | 11(1.6) |
| MT | 8.7(4.2) | 21(7.3) | 17(8.2) | 23(4.5) | 19(2.5) | 18(7.7) | |
| GDT-TS | 0.47(0.12) | 0.32(0.01) | 0.34(0.02) | 0.32(0.02) | 0.29(0.01) | 0.33(0.04) | |
| Q | 0.76(0.06) | 0.72(0.04) | 0.72(0.04) | 0.68(0.04) | 0.67(0.07) | 0.69(0.06) | |
| hg Structal | RMSD | 1.9(0.5) | 2.6(1) | 2.5(0.56) | 2.2(0.5) | 3.3(0.71) | 2.4(0.6) |
| MT | 1.8(0.3) | 2.5(0.61) | 3(0.28) | 2.4(0.35) | 3.7(0.8) | 2.7(0.28) | |
| GDT-TS | 0.86(0.06) | 0.82(0.14) | 0.85(0.07) | 0.84(0.07) | 0.77(0.08) | 0.84(0.08) | |
| Q | 0.93(0.03) | 0.92(0.03) | 0.92(0.03) | 0.92(0.04) | 0.89(0.03) | 0.92(0.04) | |
| lmds | RMSD | 5.7(0.33) | 9.9(0.72) | 9.8(0.89) | 9.8(0.92) | 10(0.61) | 10(0.65) |
| MT | 8(0.78) | 14(3.7) | 17(5.5) | 16(2.5) | 19(1.6) | 19(4.5) | |
| GDT-TS | 0.45(0.04) | 0.29(0.04) | 0.32(0.05) | 0.31(0.05) | 0.28(0.03) | 0.3(0.03) | |
| Q | 0.74(0.02) | 0.67(0.05) | 0.67(0.04) | 0.65(0.04) | 0.63(0.03) | 0.63(0.05) | |
| lattice ssfit | RMSD | 3.8(0.46) | 7.6(1.3) | 7.4(1.6) | 7.7(1.9) | 8(2.6) | 8.5(1.2) |
| MT | 5.2(2.2) | 9.8(4.5) | 10(5.1) | 11(4.8) | 12(6.4) | 12(5) | |
| GDT-TS | 0.62(0.06) | 0.45(0.07) | 0.48(0.07) | 0.49(0.12) | 0.45(0.07) | 0.44(0.04) | |
| Q | 0.8(0.06) | 0.74(0.07) | 0.75(0.04) | 0.74(0.05) | 0.72(0.07) | 0.72(0.06) | |
| CASP5 | RMSD | 6.7(2.9) | 13(6.2) | 11(6) | 10(5.2) | 11(6.1) | 10(5.2) |
| MT | 8.5(4.2) | 20(11) | 18(7.7) | 20(12) | 22(11) | 20(9.9) | |
| GDT-TS | 0.58(0.19) | 0.36(0.17) | 0.48(0.24) | 0.44(0.19) | 0.46(0.21) | 0.5(0.23) | |
| Q | 0.82(0.09) | 0.72(0.13) | 0.77(0.09) | 0.7(0.1)2 | 0.72(0.13) | 0.75(0.1) | |
| CASP6 | RMSD | 4.8(1.5) | 10(5.1) | 11(4.5) | 9.7(5.1) | 12(5.9) | 8(3.1) |
| MT | 5.4(1.9) | 23(11) | 18(4.3) | 19(5.7) | 24(15) | 12(4) | |
| GDT-TS | 0.64(0.14) | 0.33(0.14) | 0.52(0.18) | 0.49(0.27) | 0.38(0.17) | 0.54(0.17) | |
| Q | 0.85(0.06) | 0.7(0.07) | 0.79(0.08) | 0.75(0.11) | 0.68(0.11) | 0.79(0.09) | |
| CASP7 | RMSD | 4.5(1.8) | 8.8(4.9) | 7.1(3.1) | 7.9(3.9) | 11(5.1) | 7.8(3.4) |
| MT | 3.8(1.6) | 9.5(3.8) | 6.6(2.8) | 10(4.3) | 18(9.1) | 8.3(3.5) | |
| GDT-TS | 0.66(0.13) | 0.49(0.21) | 0.56(0.14) | 0.56(0.17) | 0.43(0.21) | 0.58(0.13) | |
| Q | 0.88(0.04) | 0.81(0.08) | 0.85(0.06) | 0.81(0.06) | 0.74(0.08) | 0.82(0.06) | |
| CASP8 | RMSD | 4.1(1.3) | 7.4(2.7) | 6.4(1.8) | 9.8(5.5) | 9.7(5.2) | 7.5(3.1) |
| MT | 3.2(1.3) | 10(4.1) | 6.1(2.2) | 14(6.4) | 15(8) | 8.7(2.7) | |
| GDT-TS | 0.7(0.1) | 0.53(0.17) | 0.63(0.13) | 0.51(0.22) | 0.47(0.19) | 0.61(0.16) | |
| Q | 0.89(0.04) | 0.81(0.07) | 0.85(0.06) | 0.79(0.09) | 0.75(0.1) | 0.83(0.07) | |
| CASP9 | RMSD | 4.8(1.4) | 9.7(4.9) | 7.5(2.6) | 9.4(4.7) | 9.8(4.9) | 8.2(3.3) |
| MT | 3.7(1.3) | 14(7.8) | 7.3(2.6) | 12(3.9) | 13(5.2) | 8.5(2.6) | |
| GDT-TS | 0.68(0.1) | 0.4(0.15) | 0.6(0.12) | 0.52(0.19) | 0.51(0.21) | 0.57(0.15) | |
| Q | 0.88(0.04) | 0.75(0.12) | 0.85(0.04) | 0.8(0.09) | 0.79(0.09) | 0.83(0.07) | |
| TASSER Set II | RMSD | 3.2(1) | 5.6(1.9) | 5.2(1.4) | 5.2(1.6) | 6.4(2.1) | 5.4(1.9) |
| MT | 3.7(1.2) | 7.1(2.3) | 6.6(2.5) | 6.7(2.2) | 11(5.4) | 6.8(2.2) | |
| GDT-TS | 0.69(0.09) | 0.57(0.12) | 0.59(0.12) | 0.59(0.1) | 0.52(0.13) | 0.59(0.12) | |
| Q | 0.85(0.05) | 0.81(0.06) | 0.82(0.06) | 0.8(0.05) | 0.75(0.08) | 0.79(0.05) | |
| Rosetta-All | RMSD | 6.4(1.3) | 11(2.1) | 11(2.2) | 12(3.2) | 16(4.7) | 11(2.6) |
| MT | 12(4.7) | 22(8.9) | 26(7.2) | 25(9.9) | 47(14) | 24(7.5) | |
| GDT-TS | 0.41(0.06) | 0.29(0.04) | 0.29(0.04) | 0.28(0.04) | 0.25(0.04) | 0.28(0.04) | |
| Q | 0.72(0.06) | 0.64(0.07) | 0.64(0.07) | 0.62(0.07) | 0.59(0.07) | 0.62(0.07) | |
| Rosetta-Baker | RMSD | 4.7(2.2) | 7.5(3.4) | 8.4(4.3) | 7.6(4.1) | 8.2(2.7) | 6.9(3.6) |
| MT | 6.9(3) | 13(7.5) | 15(9.2) | 13(7.8) | 13(6.8) | 11(5.4) | |
| GDT-TS | 0.6(0.21) | 0.47(0.15) | 0.46(0.13) | 0.48(0.15) | 0.46(0.13) | 0.5(0.18) | |
| Q | 0.84(0.08) | 0.77(0.1) | 0.77(0.11) | 0.77(0.01) | 0.76(0.07) | 0.79(0.09) | |
| Rosetta-Tsai | RMSD | 2.8(0.8) | 6.9(2.8) | 5(1.4) | 7.3(1.8) | 5.7(2) | 6.2(2.1) |
| MT | 3(1.1) | 9.3(4.9) | 5.5(2.2) | 10(5.8) | 8.3(3) | 7.1(2.3) | |
| GDT-TS | 0.72(0.08) | 0.47(0.08) | 0.59(0.09) | 0.45(0.06) | 0.54(0.09) | 0.52(0.12) | |
| Q | 0.86(0.05) | 0.77(0.08) | 0.81(0.08) | 0.74(0.07) | 0.77(0.07) | 0.77(0.06) | |
| CASP-HRD | RMSD | 2(0.56) | 2.6(0.73) | 2.6(0.71) | 2.6(0.76) | 2.8(0.7) | 2.6(0.74) |
| MT | 1.1(0.5) | 2(0.8) | 1.7(0.61) | 2(0.83) | 2.3(0.11) | 1.7(0.79) | |
| GDT-TS | 0.83(0.07) | 0.75(0.06) | 0.78(0.07) | 0.77(0.07) | 0.75(0.06) | 0.78(0.07) | |
| Q | 0.93(0.03) | 0.9(0.03) | 0.91(0.03) | 0.89(0.04) | 0.88(0.04) | 0.91(0.03) | |
| CASP10-stage1 | RMSD | 4.7(1.2) | 6.6(2.3) | 7.1(2.4) | 8.2(3.2) | 12(4.7) | 7.3(2.4) |
| MT | 4(2) | 6.6(2) | 8(1.6) | 10(3.5) | 20(3.4) | 7.9(2.2) | |
| GDT-TS | 0.71(0.1) | 0.62(0.13) | 0.63(0.15) | 0.57(0.16) | 0.55(0.19) | 0.63(0.14) | |
| Q | 0.88(0.04) | 0.84(0.04) | 0.85(0.04) | 0.82(0.06) | 0.8(0.06) | 0.84(0.06) | |
| CASP10-stage2 | RMSD | 4(1.1) | 5.9(1.7) | 5.9(1.6) | 6.7(1.9) | 9(2.2) | 6.2(1.6) |
| MT | 3.1(1) | 5.4(1.6) | 5.8(1.6) | 6.6(1.8) | 14(1.9) | 5.4(1.5) | |
| GDT-TS | 0.73(0.09) | 0.64(0.14) | 0.66(0.11) | 0.65(0.12) | 0.65(0.11) | 0.67(0.11) | |
| Q | 0.89(0.04) | 0.86(0.03) | 0.87(0.04) | 0.86(0.05) | 0.85(0.04) | 0.86(0.04) |
All-atom statistical distance-based potential [49].
The semi-empirical physical potential AMBER99SB-ILDN [50].
All-atom orientation-dependent statistical potential [7].
Average value, and mean absolute deviation (in parenthesis) over the data set.
Only ensembles who contains a decoy with a GDT–TS> 0.4 are included. Compare with Figure 2 in [48].