| Literature DB >> 11545673 |
S Cristobal1, A Zemla, D Fischer, L Rychlewski, A Elofsson.
Abstract
BACKGROUND: Prediction of protein structures is one of the fundamental challenges in biology today. To fully understand how well different prediction methods perform, it is necessary to use measures that evaluate their performance. Every two years, starting in 1994, the CASP (Critical Assessment of protein Structure Prediction) process has been organized to evaluate the ability of different predictors to blindly predict the structure of proteins. To capture different features of the models, several measures have been developed during the CASP processes. However, these measures have not been examined in detail before. In an attempt to develop fully automatic measures that can be used in CASP, as well as in other type of benchmarking experiments, we have compared twenty-one measures. These measures include the measures used in CASP3 and CASP2 as well as have measures introduced later. We have studied their ability to distinguish between the better and worse models submitted to CASP3 and the correlation between them.Entities:
Mesh:
Year: 2001 PMID: 11545673 PMCID: PMC55330 DOI: 10.1186/1471-2105-2-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Description of measures.
| Name | CASP-name | Type | Measure | Reference |
| Murzin | - | Manual | - | [ |
| crn | CRN | Global | Å/N | [ |
| arms | ARms | Global | Å | [ |
| cspc | CSpc | Global | % | [ |
| csnc | CSns | Global | % | [ |
| ccrct | CCrct | Global | N | [ |
| GDT | GDT TS | Alignment dependent | S(N) | [ |
| MaxSub | - | Alignment dependent | S(Å,N) | [ |
| LGscore | - | Alignment dependent | S(Å,N) | this work |
| S | - | Alignment dependent | S(Å,N) | this work |
| sf0 | sf0 | Alignment independent | N | [ |
| sf4 | sf0+sf4 | Alignment independent | N | [ |
| align | ALIGN A4 P | Alignment independent | % | [ |
| LGA | - | Alignment independent | S(Å,N) | this work |
| eqr1 | eqr | Alignment independent | N | [ |
| LGscore2 | - | Alignment independent | S(Å,N) | this work |
| acrct | ACrct | Template based | N | [ |
| aspc | ASpc | Template based | % | [ |
| asp4 | ASp4 | Template based | % | [ |
| covr | Covr | Template based | % | [ |
| sclen | SClen | Template based | % | [ |
Å = Rmsd in Ångstrm. N = Number of residues. % = Fraction of residues. S(N) = Score dependent on number of residues. S(Å,N) = Score dependent on quality and number of residues.
Figure 1The four types of measures: In this example the evaluation starts with a model (right) and the correct structure of a protein (left). The left part of the model is quite good, but a large loop is inserted which results in a shift in the alignment in the central part of the model. Some residues in the model are not aligned to the template. This creates a shift that results in the right part of the model being correctly aligned again. A global measure (A) would use the complete model and compare it with the complete correct structure and probably not score this model very well. As shown in the (B) an alignment dependent measure would only consider the first and last fragments as correct. Measures that use an alignment independent approach (C) first do a structural alignment and then and the most significant fragment. The shifted residues in the center of the model would be included in the evaluation. In template based measures (D) the template used to build the model is used for comparison. In our example the template extends to the right of the model and it also has one loop that is longer so that there is a gap in the model that is not shown. The correct structure is superimposed on the template and the resulting alignment is compared to the alignment of the model.
Figure 2The two most significant axes from a principle component analysis of all measures after model based normalization are shown. Each measure is represented by a cross. In both figures two global measures, arms and crn, are excluded.
Figure 3The two most significant axes from a principle component analysis of all measures after target based normalization are shown. Each measure is represented by a cross. In both figures two global measures, arms and crn, are excluded.
Fraction of TOP models scoring
| Measure | A | A-B | A-E | ALL (A-F) |
| crn | 0.20 | 0.25 | 0.40 | 0.39 |
| arms | 0.10 | 0.25 | 0.30 | 0.30 |
| cspc | 0.30 | 0.30 | 0.47 | 0.47 |
| csns | 0.20 | 0.30 | 0.49 | 0.51 |
| ccrct | 0.30 | 0.45 | 0.43 | 0.44 |
| GDT | 0.30 | 0.35 | 0.36 | 0.29 |
| MaxSub | 0.40 | 0.50 | 0.60 | 0.39 |
| LGscore | 0.30 | 0.45 | 0.60 | 0.46 |
| S | 0.49 | 0.43 | ||
| sf0 | 0.40 | 0.50 | 0.58 | 0.52 |
| sf4 | 0.40 | 0.40 | 0.51 | |
| align | 0.30 | 0.45 | 0.51 | 0.47 |
| LGA | 0.30 | 0.30 | ||
| eqr1 | 0.20 | 0.35 | 0.55 | 0.69 |
| LGscore2 | 0.30 | 0.30 | 0.51 | 0.62 |
| acrct | 0.40 | 0.45 | 0.55 | 0.43 |
| aspc | 0.30 | 0.45 | 0.55 | 0.43 |
| asp4 | 0.30 | 0.50 | 0.58 | 0.45 |
| covr | 0.20 | 0.30 | 0.58 | 0.49 |
| sclen | 0.10 | 0.30 | 0.57 | 0.48 |
All measures for best T0046 models.
| T0046 | Murzin | crn | arms | cspc | csns | ccrct | GDT | MaxSub | LGscore | S | sf0 | sf4 | align | LGA | eqr1 | LGscore2 | acrct | aspc | asp4 | covr | sclen |
| 061 | 6.00 | -0.85 | -8.48 | 31.11 | 23.73 | 70.00 | 37.19 | 0.00 | 1.81 | 0.37 | 40.00 | 75.00 | 63.03 | 3.16 | 75.00 | 1.91 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 074 | 5.00 | -1.20 | -12.70 | 33.33 | 27.12 | 80.00 | 34.88 | 0.22 | 2.36 | 0.35 | 37.00 | 58.00 | 47.06 | 3.00 | 69.00 | 3.72 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 212 | 4.00 | -0.79 | -6.62 | 50.99 | 34.92 | 103.00 | 33.83 | 0.25 | 3.03 | 0.26 | 34.00 | 53.00 | 44.54 | 2.98 | 66.00 | 2.51 | 21.00 | 25.00 | 66.67 | 71.43 | 76.00 |
| 003 | 3.00 | -0.95 | -8.16 | 46.55 | 27.46 | 81.00 | 29.62 | 0.22 | 2.28 | 0.28 | 31.00 | 60.00 | 47.06 | 2.66 | 68.00 | 2.23 | 21.00 | 24.42 | 69.77 | 69.77 | 72.00 |
| 085 | 1.00 | -1.33 | -14.60 | 25.11 | 20.00 | 59.00 | 19.75 | 0.00 | 1.09 | 0.16 | 22.00 | 22.00 | 7.56 | 2.37 | 53.00 | 2.17 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 005 | 1.00 | -1.80 | -21.47 | 20.61 | 15.93 | 47.00 | 20.80 | 0.00 | 0.86 | 0.19 | 14.00 | 20.00 | 1.68 | 2.78 | 68.00 | 2.49 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 217 | 1.00 | -1.26 | -10.81 | 23.56 | 15.25 | 45.00 | 20.17 | 0.00 | 0.68 | 0.17 | 24.00 | 24.00 | 2.52 | 2.79 | 65.00 | 2.97 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 053 | 1.00 | -1.47 | -17.47 | 12.34 | 12.88 | 38.00 | 19.12 | 0.00 | 0.56 | 0.18 | 21.00 | 21.00 | 21.01 | 2.49 | 66.00 | 2.18 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 224 | 1.00 | -1.84 | -14.88 | 9.47 | 6.10 | 18.00 | 15.76 | 0.00 | 0.28 | 0.13 | 10.00 | 10.00 | 19.33 | 2.40 | 49.00 | 2.99 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 033 | 1.00 | -1.53 | -15.90 | 4.71 | 3.05 | 9.00 | 17.44 | 0.00 | 0.20 | 0.16 | 11.00 | 11.00 | 4.20 | 2.39 | 57.00 | 2.19 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 273 | 1.00 | -1.42 | -16.92 | 9.79 | 7.80 | 23.00 | 17.02 | 0.00 | 0.18 | 0.13 | 7.00 | 7.00 | 6.72 | 2.40 | 50.00 | 2.37 | 0.00 | 0.00 | 0.00 | 60.50 | 72.00 |
| 090 | 1.00 | -1.38 | -13.22 | 10.09 | 7.46 | 22.00 | 16.39 | 0.00 | 0.08 | 0.15 | 11.00 | 18.00 | 5.88 | 2.46 | 48.00 | 2.55 | 0.00 | 0.00 | 0.00 | 60.42 | 62.00 |
| 072 | 1.00 | -1.49 | -108.00 | 54.26 | 3.81 | 2.71 | 16.17 | 0.00 | 0.05 | 0.15 | 11.00 | 26.00 | 23.53 | 2.18 | 61.00 | 1.31 | 22.34 | 5.00 | 5.32 | 4.41 | 0.00 |
| 023 | 1.00 | -1.47 | -17.44 | 3.08 | 2.03 | 6.00 | 17.23 | 0.00 | 0.04 | 0.12 | 12.00 | 12.00 | 20.17 | 2.30 | 56.00 | 2.48 | 0.00 | 0.00 | 0.00 | 63.44 | 67.00 |
| 166 | 1.00 | -1.71 | -10.10 | 33.57 | 15.93 | 47.00 | 18.28 | 0.00 | 0.04 | 0.13 | 10.00 | 10.00 | 0.00 | 2.21 | 46.00 | 2.39 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 176 | 1.00 | -1.28 | -12.39 | 19.71 | 13.90 | 41.00 | 16.39 | 0.00 | 0.03 | 0.14 | 9.00 | 10.00 | 0.00 | 2.34 | 44.00 | 2.83 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 017 | 1.00 | -1.93 | -14.06 | 12.41 | 6.10 | 18.00 | 16.38 | 0.00 | 0.01 | 0.14 | 11.00 | 27.00 | 20.17 | 2.07 | 53.00 | 1.68 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 035 | 0.00 | -1.35 | -16.04 | 22.94 | 16.95 | 50.00 | 23.95 | 0.00 | 1.78 | 0.24 | 31.00 | 31.00 | 0.84 | 1.94 | 44.00 | 1.58 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 045 | 0.00 | -1.38 | -16.33 | 17.20 | 9.15 | 27.00 | 17.86 | 0.00 | 1.10 | 0.17 | 20.00 | 20.00 | 0.00 | 1.30 | 32.00 | 1.74 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 060 | 0.00 | -1.14 | -13.54 | 10.13 | 8.14 | 24.00 | 20.80 | 0.00 | 0.99 | 0.17 | 19.00 | 19.00 | 0.00 | 1.13 | 33.00 | 1.82 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 179 | 0.00 | -1.15 | -12.32 | 11.49 | 9.15 | 27.00 | 22.48 | 0.00 | 0.73 | 0.19 | 15.00 | 20.00 | 22.69 | 1.78 | 51.00 | 3.13 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 028 | 0.00 | -1.70 | -19.51 | 9.04 | 5.08 | 15.00 | 16.17 | 0.00 | 0.11 | 0.15 | 17.00 | 17.00 | 0.84 | 2.09 | 57.00 | 1.98 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 076 | 0.00 | -1.05 | -11.70 | 10.89 | 9.49 | 28.00 | 19.96 | 0.00 | 0.11 | 0.19 | 15.00 | 25.00 | 0.00 | 1.99 | 54.00 | 2.95 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 222 | 0.00 | -1.29 | -14.82 | 1.66 | 1.69 | 5.00 | 18.28 | 0.00 | 0.09 | 0.16 | 16.00 | 17.00 | 1.68 | 2.02 | 44.00 | 2.16 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 105 | 0.00 | -1.46 | -16.22 | 0.80 | 0.68 | 2.00 | 16.17 | 0.00 | 0.08 | 0.15 | 8.00 | 8.00 | 21.01 | 2.06 | 51.00 | 2.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| 266 | 0.00 | -1.59 | -18.91 | 8.40 | 3.39 | 10.00 | 14.71 | 0.00 | 0.08 | 0.12 | 5.00 | 5.00 | 0.00 | 1.10 | 29.00 | 1.38 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Figure 4Comparison between manual assessors ranking and average ranking from all other non-global measures using model based normalization. The groups that differ most between the manual ranking and the average ranking are shown. It should be noted that the official manual ranking used slightly different targets than we used here, therefore the manual ranking is not identical with the official CASP3 ranking this is to ensure that exactly the same targets were used in the comparison between the manual and automatic measures.