| Literature DB >> 18430742 |
Abstract
MOTIVATION: Order and Disorder prediction using Conditional Random Fields (OnD-CRF) is a new method for accurately predicting the transition between structured and mobile or disordered regions in proteins. OnD-CRF applies CRFs relying on features which are generated from the amino acids sequence and from secondary structure prediction. Benchmarking results based on CASP7 targets, and evaluation with respect to several CASP criteria, rank the OnD-CRF model highest among the fully automatic server group. AVAILABILITY: http://babel.ucmp.umu.se/ond-crf/Entities:
Mesh:
Substances:
Year: 2008 PMID: 18430742 PMCID: PMC2387219 DOI: 10.1093/bioinformatics/btn132
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Comparing OnD-CRF with prediction methods that participated in CASP7
| Method | AUC | ACC | ||||
|---|---|---|---|---|---|---|
| CASP7 Automatic Server Group | ||||||
| OnD-CRFa | 0.839 | 0.688 | 0.813 | 0.560 | 0.750 | 0.501 |
| DISproa | 0.822 | 0.597 | 0.854 | 0.510 | 0.726 | 0.451 |
| GeneSilicoMetaServerd | 0.804 | 0.527 | 0.912 | 0.481 | 0.720 | 0.440 |
| BIME@NTU_serva | 0.798 | 0.591 | 0.839 | 0.496 | 0.715 | 0.430 |
| DISOPREDa | 0.837 | 0.425 | 0.953 | 0.405 | 0.689 | 0.378 |
| Distilla | 0.724 | 0.558 | 0.788 | 0.440 | 0.673 | 0.346 |
| MBI-NTU-serva | 0.796 | 0.327 | 0.971 | 0.318 | 0.649 | 0.298 |
| DRIPPREDb | 0.758 | 0.383 | 0.908 | 0.348 | 0.646 | 0.291 |
| CASP7 Human Expert Group | ||||||
| ISTZORANb | 0.860 | 0.725 | 0.837 | 0.607 | 0.781 | 0.562 |
| faisa | 0.844 | 0.556 | 0.924 | 0.514 | 0.740 | 0.481 |
| CBRC-DRa | 0.850 | 0.454 | 0.966 | 0.439 | 0.710 | 0.420 |
| BIME@NTUc | 0.804 | 0.536 | 0.883 | 0.473 | 0.710 | 0.419 |
| IUPredb | 0.777 | 0.396 | 0.947 | 0.375 | 0.672 | 0.343 |
| CBRC-DP_DRa | 0.704 | 0.338 | 0.971 | 0.328 | 0.655 | 0.309 |
| Okab | 0.609 | 0.280 | 0.937 | 0.262 | 0.609 | 0.218 |
| Softberrya | 0.704 | 0.201 | 0.971 | 0.195 | 0.586 | 0.172 |
The entries are sorted with respect to the weighted score Sw.
Number of predicted targets: a96; b95; c94; d93; AUC: Area Under ROC Curve (Bordoli et al., 2007); Sens = TP/(TP + FN); Sspec = TN/(TN + FP); Sprod = Ssens × Sspec; ACC = (Ssens + Sspec)/2; Sw = (WdisorderNTP – WorderNFP +WorderNTN – WdisorderNFN)/(WdisorderNdisorder + WorderNorder).
Fig. 1.OnD-CRF Prediction analysis for CRK. The blue curve represents the predicted disorder probability at each amino acid position. The horizontal red line at 0.05 probability, represents the boundary between order and disorder. The NMR structures of the three CRK domains are shown above the graph. Their boundaries are marked as magenta, green and blue bars, respectively, and overlap with the mostly ordered regions of the OnD-CRF prediction. Note the accurately predicted flexible ‘DE loop’ in the SH2 domain between residues 65–85 (dashed line).