| Literature DB >> 20226048 |
Dmitry Rykunov1, Andras Fiser.
Abstract
BACKGROUND: Scoring functions, such as molecular mechanic forcefields and statistical potentials are fundamentally important tools in protein structure modeling and quality assessment.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20226048 PMCID: PMC2853469 DOI: 10.1186/1471-2105-11-128
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance of various statistical potentials on models of CASP5-8 experiments.
| Scoring function | models only | native included | |||
|---|---|---|---|---|---|
| Averagea | Ranked 1b | Averagec | Raw averaged | Ranked 1e | |
| QMEAN6 | 2.87 | 85 | 1.71 | 3.26 | 113 |
| QMEAN-all_atom | 3.59 | 74 | 1.71 | 2.9 | 119 |
| QMEAN-SSE_agree | 3.74 | 62 | 3.72 | 9.62 | 39 |
| QMEAN-ACC_agree | 4.04 | 40 | 3.78 | 8.83 | 48 |
| RF_CB_SRS_OD | 4.16 | 61 | 2.08 | 3.6 | 110 |
| RF_CB_OD | 4.62 | 62 | 2.00 | 3.65 | 111 |
| RF_HA_SRS | 4.65 | 49 | 1.38 | 1.66 | 137 |
| RF_CB_SRS | 4.72 | 56 | 2.18 | 3.46 | 114 |
| OPUS_CA | 4.72 | 79 | 5.13 | 9.93 | 55 |
| VSCORE-combined | 4.79 | 53 | 2.20 | 3.79 | 117 |
| QMEAN-pairwise | 4.80 | 54 | 3.15 | 5.86 | 85 |
| Rosetta | 5.01 | 57 | 4.09 | 8.03 | 68 |
| Dong-pair | 5.01 | 58 | 6.32 | 14.41 | 4 |
| RF_CB | 5.06 | 52 | 2.46 | 4.31 | 106 |
| VSCORE-pair | 5.08 | 54 | 1.85 | 2.81 | 128 |
| PROSA-combined | 5.11 | 57 | 3.38 | 6.27 | 87 |
| OPUS_PSP | 5.39 | 54 | 2.99 | 4.11 | 118 |
| RF_HA | 5.44 | 62 | 2.78 | 4.37 | 112 |
| DOPE | 5.77 | 54 | 3.27 | 5.97 | 95 |
| dFIRE | 6.03 | 50 | 5.69 | 11.8 | 33 |
| PROSA-pair | 6.03 | 56 | 3.54 | 6.02 | 95 |
| QMEAN-torsion | 6.71 | 45 | 3.24 | 4.66 | 114 |
| Shortle2006 | 6.85 | 35 | 1.79 | 2.54 | 129 |
| Liang_geometric | 6.88 | 44 | 2.48 | 3.94 | 114 |
| QMEAN-solvation | 7.32 | 33 | 6.27 | 10.87 | 54 |
| Shortle2005 | 7.73 | 42 | 3.39 | 5.19 | 109 |
| Floudas-CM | 7.75 | 38 | 7.05 | 12.77 | 42 |
| Floudas-Ca | 7.79 | 33 | 8.36 | 16.01 | 10 |
| NAMD 1000 | 8.06 | 24 | 4.96 | 8.56 | 78 |
| Melo-ANOLEA | 9.62 | 19 | 5.19 | 8.37 | 86 |
| PC2CA | 9.75 | 19 | 5.06 | 8.35 | 85 |
| Melo-NL | 9.99 | 14 | 5.85 | 9.45 | 80 |
| NAMD 1 | 11.91 | 5 | 10.98 | 18.04 | 24 |
| Random¶ | 9.72 | 13.9 | 10.1 | 10.1 | 8.3 |
a The average rank (over 143 decoy sets) in the absence of native structures.
b The number of sets when the best model was ranked as first, in the absence of native structures.
c The average rank when native structures are present.
d The average rank when native structures are present, calculated without compensation for the gap in ranking between experimental structure and first model (see text).
e The number of sets when the best model was ranked as first when native structures are present.
¶Expected random values were generated by picking a wining model from the decoy sets randomly. Average values over 1000 random trials are shown.
Figure 1Statistical significance of rank distributions obtained with different methods. Only p-values obtained with one-tailed Wilcoxon test above 0.05 are shown. Alternative hypothesis in the test was that location of the distribution obtained with scoring function in a row is lower than the one in the column. Interaction centers are color coded in row titles as follows: black is used for residue-based potentials, blue is used for atom-based potentials, and red designates composite scoring functions. Certain interaction types are color coded in column titles, specifically, scoring functions making use of solvent accessibility (green), torsion angle dependence (blue), secondary structure dependence (red), and orientation-dependence (in italic).
Figure 2Performance of different potentials as a function of the quality of the best available model. Average rank calculated by (A) composite functions, (B) heavy-atom based functions, and (C) residue-based potential functions for targets having best model with GDT_TS better than 95.0 (11 targets), between 87.5 and 95.0 (25 targets), between 80.0 and 87.5 (32 targets), between 72.5 and 80.0 (47 targets) and between 65.0 and 72.5 (28 targets).
Figure 3Definition of residue orientation used to derive potentials. (A) interaction i → j is considered "parallel" if the scalar product of Cα-Cβ vectors for residues i (vector a) and j (vector b) is positive; (B) interaction i → j is considered "antiparallel i facing j" if scalar product of a and b is non-positive and scalar product of vector a and vector from Cα atom of residue i to Cα atom of residue j (vector c) is positive; (C) interaction i → j is considered as "antiparallel i pointing away from j" if both a·b and a·c scalar products are non-positive.
Figure 4Superimposition of models of different quality with the experimental solution structure. Experimental structure of the CASP8 target T0502 (violet) and its models, (A) METATASSER_TS5, GDT_TS = 80.357, (B) 3Dpro_TS4, GDT_TS = 60.204, and (C) panther_server_TS2, GDT_TS = 44.643 are shown as Cα traces. Those parts of the models, where the experimental positions are not known are colored white. Parts of models deviating from experimentally determined positions less than 4 Å are colored green, and the rest is colored bronze. Cα-Cα pseudobonds longer than 3.9 Å are shown thin. This plot has been generated using MOLSCRIPT software [63].