| Literature DB >> 20025768 |
Xin Deng1, Jesse Eickholt, Jianlin Cheng.
Abstract
BACKGROUND: Disordered regions are segments of the protein chain which do not adopt stable structures. Such segments are often of interest because they have a close relationship with protein expression and functionality. As such, protein disorder prediction is important for protein structure prediction, structure determination and function annotation.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20025768 PMCID: PMC3087350 DOI: 10.1186/1471-2105-10-436
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Results for protein disorder predictors that participated in CASP8 on 117 targets.
| Disorder Predictor | ROC Score | Weighed Score | Total Res. | Pos. Sens. | Pos. Spec. | Neg. Sens. | Neg. Spec. | Sens. Prod. | F-meas. |
|---|---|---|---|---|---|---|---|---|---|
| MULTICOM | 0.879 | 6.89 | 25148 | 0.532 | 0.619 | 0.973 | 0.962 | 0.518 | 0.572 |
| DISOclust | 0.862 | 7.822 | 24021 | 0.753 | 0.248 | 0.813 | 0.976 | 0.612 | 0.373 |
| fais-server | 0.861 | 6.613 | 24021 | 0.522 | 0.533 | 0.963 | 0.961 | 0.503 | 0.528 |
| MULTICOM-CMFR* | 0.859 | 8.06 | 25431 | 0.724 | 0.299 | 0.859 | 0.974 | 0.622 | 0.423 |
| 3Dpro* | 0.855 | 4.826 | 23934 | 0.378 | 0.727 | 0.988 | 0.949 | 0.373 | 0.497 |
| GS-MetaServer | 0.852 | 5.356 | 24976 | 0.41 | 0.711 | 0.986 | 0.953 | 0.404 | 0.52 |
| mariner1* | 0.846 | 7.469 | 23148 | 0.622 | 0.396 | 0.923 | 0.968 | 0.574 | 0.484 |
| Distill-Punch* | 0.846 | 0.798 | 23311 | 0.067 | 0.75 | 0.998 | 0.93 | 0.067 | 0.123 |
| Metaprdos | 0.842 | 6.431 | 22363 | 0.505 | 0.539 | 0.966 | 0.961 | 0.488 | 0.522 |
| Distill-Punch* | 0.839 | 0.823 | 23730 | 0.063 | 0.881 | 0.999 | 0.93 | 0.063 | 0.118 |
| CBRC_POODLE* | 0.839 | 6.259 | 24021 | 0.522 | 0.403 | 0.937 | 0.96 | 0.489 | 0.455 |
| GeneSilicoMeta | 0.839 | 6.338 | 24976 | 0.489 | 0.627 | 0.976 | 0.959 | 0.478 | 0.55 |
| DISOPRED* | 0.831 | 6.497 | 24021 | 0.522 | 0.485 | 0.955 | 0.961 | 0.498 | 0.503 |
| Biomine* | 0.825 | 6.394 | 23472 | 0.517 | 0.47 | 0.952 | 0.96 | 0.492 | 0.493 |
| OnD-CRF* | 0.82 | 5.214 | 24021 | 0.47 | 0.31 | 0.914 | 0.955 | 0.429 | 0.373 |
| Spritz* | 0.791 | 5.431 | 24021 | 0.434 | 0.514 | 0.966 | 0.954 | 0.42 | 0.471 |
| GSMetaDisorder | 0.772 | 6.31 | 24504 | 0.579 | 0.289 | 0.88 | 0.961 | 0.51 | 0.386 |
| Distill* | 0.758 | 5.637 | 24021 | 0.671 | 0.173 | 0.738 | 0.965 | 0.495 | 0.276 |
| LEE-SERVER* | 0.741 | 3.595 | 21955 | 0.31 | 0.449 | 0.968 | 0.943 | 0.3 | 0.367 |
Results of our evaluation of all the protein disorder predictors ranked by ROC Score (AUC). Total Res. represents the total number of residues predicted. Pos. Spec. and Neg. Spec. are the positive and negative specificities. Pos. Sens. and Neg. Sens. are the positive and negative sensitivities. F-meas. is the F-Measure of the positive sensitivity and positive specificity and Sens. Prod. is the product of the positive sensitivity and negative sensitivity. * denotes ab initio methods. MULTICOM-CMFR is the predictor name of PreDisorder in CASP8.
Results for protein disorder predictors that participated in CASP8 on the 20 NMR targets (T0437, T0460, T0462, T0464, T0466, T0467, T0468, T0469, T0471, T0472, T0473, T0474, T0475, T0476, T0480, T0482, T0484, T0492, T0498, T0499).
| Disorder Predictor | ROC Score | Weighed Score | Total Res. | Sens. | Spec. | Sens. Prod. | F-measure. |
|---|---|---|---|---|---|---|---|
| GSMetaDisorder | 0.800 | 3.801 | 1654 | 0.786 | 0.700 | 0.550 | 0.082 |
| MULTICOM-CMFR* | 0.792 | 4.393 | 1895 | 0.816 | 0.700 | 0.572 | 0.125 |
| fais-server | 0.769 | 5.407 | 1601 | 0.510 | 0.874 | 0.446 | 0.186 |
| OnD-CRF* | 0.761 | 5.671 | 1601 | 0.429 | 0.925 | 0.397 | 0.226 |
| MULTICOM | 0.760 | 5.283 | 1895 | 0.469 | 0.878 | 0.412 | 0.155 |
| Biomine* | 0.753 | 4.753 | 1489 | 0.465 | 0.845 | 0.393 | 0.139 |
| CBRC_POODLE* | 0.744 | 3.864 | 1601 | 0.490 | 0.774 | 0.379 | 0.113 |
| DISOPRED* | 0.741 | 5.458 | 1601 | 0.531 | 0.870 | 0.462 | 0.188 |
| Metaprdos | 0.731 | 5.971 | 1601 | 0.449 | 0.938 | 0.421 | 0.263 |
| 3Dpro* | 0.729 | 6.115 | 1411 | 0.458 | 0.948 | 0.434 | 0.312 |
| Distill* | 0.720 | 3.042 | 1601 | 0.694 | 0.636 | 0.441 | 0.105 |
| GS-MetaServer | 0.715 | 5.525 | 1775 | 0.367 | 0.933 | 0.343 | 0.197 |
| DISOclust | 0.709 | 3.011 | 1601 | 0.571 | 0.682 | 0.390 | 0.098 |
| Distill-Punch* | 0.703 | 4.107 | 1601 | 0.000 | 0.986 | 0.000 | 0.000 |
| GeneSilicoMeta | 0.703 | 5.540 | 1775 | 0.469 | 0.897 | 0.421 | 0.185 |
| Spritz* | 0.679 | 4.772 | 1601 | 0.224 | 0.943 | 0.212 | 0.149 |
| Mariner1* | 0.666 | 4.636 | 1601 | 0.592 | 0.788 | 0.466 | 0.143 |
| LEE-SERVER* | 0.654 | 5.486 | 1684 | 0.349 | 0.932 | 0.325 | 0.176 |
* denotes ab initio methods.
Results for protein disorder predictors that participated in CASP8 on the 97 x-ray targets.
| Disorder Predictor | ROC Score | Weighed Score | Total Res. | Sens. | Spec. | Sens. Prod. | F-measure. |
|---|---|---|---|---|---|---|---|
| MULTICOM | 0.887 | 7.021 | 23253 | 0.534 | 0.981 | 0.523 | 0.610 |
| DISOclust | 0.868 | 8.165 | 22420 | 0.758 | 0.823 | 0.624 | 0.397 |
| fais-server | 0.867 | 6.699 | 22420 | 0.523 | 0.969 | 0.506 | 0.555 |
| MULTICOM-CMFR* | 0.865 | 8.355 | 23536 | 0.721 | 0.873 | 0.630 | 0.455 |
| 3Dpro* | 0.862 | 4.745 | 22523 | 0.375 | 0.990 | 0.372 | 0.507 |
| GS-MetaServer | 0.861 | 5.343 | 23201 | 0.411 | 0.991 | 0.407 | 0.541 |
| Mariner1* | 0.857 | 7.679 | 21547 | 0.623 | 0.933 | 0.582 | 0.518 |
| Distill-Punch* | 0.856 | 0.554 | 21710 | 0.069 | 0.999 | 0.069 | 0.128 |
| CBRC_POODLE* | 0.848 | 6.430 | 22420 | 0.523 | 0.949 | 0.496 | 0.493 |
| GeneSilicoMeta | 0.848 | 6.399 | 23201 | 0.490 | 0.982 | 0.481 | 0.579 |
| Metaprdos | 0.846 | 6.467 | 20762 | 0.507 | 0.968 | 0.491 | 0.536 |
| DISOPRED* | 0.837 | 6.571 | 22420 | 0.521 | 0.961 | 0.501 | 0.528 |
| Biomine* | 0.829 | 6.506 | 21983 | 0.519 | 0.959 | 0.498 | 0.522 |
| OnD-CRF* | 0.822 | 5.181 | 22420 | 0.471 | 0.913 | 0.430 | 0.379 |
| Spritz* | 0.800 | 5.478 | 22420 | 0.440 | 0.968 | 0.426 | 0.486 |
| GSMetaDisorder | 0.778 | 6.492 | 22850 | 0.576 | 0.894 | 0.515 | 0.417 |
| Distill* | 0.761 | 5.822 | 22420 | 0.670 | 0.746 | 0.499 | 0.289 |
| LEE-SERVER* | 0.749 | 3.438 | 20271 | 0.309 | 0.971 | 0.300 | 0.379 |
* denotes ab initio methods.
Figure 1ROC curves of CASP8 predictors (ordered by ROC score) on the CASP8 dataset which consisted of 117 protein targets.
Figure 2Example output from PreDisorder showing probability of disorder for each residue in a sequence (CASP8 target T0470). The red curve represents predicted disorder probabilities. The green curve denotes real disorder annotations (1 - disorder; 0 - not disorder).