Alessandro Vullo, Oscar Bortolami, Gianluca Pollastri, Silvio C E Tosatto.
Abstract
Intrinsically disordered proteins contain long stretches of polypeptide chain that, in the absence of binding partners, do not adopt a single native structure composed of stable secondary and tertiary structure. Predicting intrinsically disordered regions from sequence is of increasing interest, as many such regions are being discovered in complete genome sequences and important functional roles are being associated with them. We have developed a machine learning approach, Spritz, based on two support vector machines (SVMs) to discriminate disordered regions from sequence. The SVMs are trained and benchmarked on two sets, representing long and short disordered regions. A preliminary version of Spritz was shown to perform consistently well at the recent biennial CASP-6 experiment [Critical Assessment of Techniques for Protein Structure Prediction (CASP), 2004]. The fully developed Spritz method is freely available as a web server at http://distill.ucd.ie/spritz/ and http://protein.cribi.unipd.it/spritz/.
Year: 2006 PMID: 16844983 PMCID: PMC1538873 DOI: 10.1093/nar/gkl166
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1. The Spritz web server interface.
Results of the SVMs trained on the SD (short disorder) and LD (long disorder) datasets
| | C | AUC | Q2 (5% FPR) |
|---|---|---|---|
| SD | | | |
| 5-fold CV | 0.44 | 0.82 | 93.2 |
| CASP-6 | 0.44 | 0.79* | 91.5 |
| LD | 0.14 | 0.6 | 53.9 |
| LD | | | |
| 5-fold CV | 0.59 | 0.85 | 72.2 |
| CASP-6 | 0.35 | 0.85 | 91.5 |
| SD | 0.21 | 0.8 | 92.6 |
| CaspIta | | | |
| CASP-6 | 0.41 | 0.73 | 93.2 |
| VSL-1 | | | |
| CASP-6 | 0.32 | 0.88 | 82.4 |
*An asterisk indicates a probable underestimation of the AUC due to insufficient data in subintervals of [0,1]. The official results for two top-scoring CASP-6 groups (CaspIta and VSL-1) are shown for comparison.
Figure 2. ROC curve of both the long disorder (SVM LD) and the short disorder (SVM SD) experts as computed from the CASP-6 targets.
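The AUC and Q2 (5% FPR) figures reported above are derived from per-residue prediction scores. The following is a minimal sketch of how such quantities can be computed, not the authors' evaluation code. It assumes that each residue carries a score where higher means more likely disordered, that labels are 1 for disordered and 0 for ordered, and that Q2 denotes the percentage of residues classified correctly. The function names and the toy data are hypothetical and chosen for illustration only.

```python
from typing import List, Tuple


def sweep_thresholds(scores: List[float], labels: List[int]) -> List[Tuple[float, int, int]]:
    """Return (threshold, true positives, false positives) for each distinct
    score, from the strictest threshold to the most permissive one.
    A residue is called disordered when its score is >= the threshold."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    counts = []
    tp = fp = 0
    i = 0
    while i < len(order):
        threshold = scores[order[i]]
        # Residues with tied scores cross the threshold together.
        while i < len(order) and scores[order[i]] == threshold:
            if labels[order[i]] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        counts.append((threshold, tp, fp))
    return counts


def roc_points(scores: List[float], labels: List[int]) -> List[Tuple[float, float]]:
    """Return ROC points as (false positive rate, true positive rate)."""
    pos = sum(labels)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one disordered and one ordered residue")
    points = [(0.0, 0.0)]
    for _, tp, fp in sweep_thresholds(scores, labels):
        points.append((fp / neg, tp / pos))
    return points


def auc_trapezoid(points: List[Tuple[float, float]]) -> float:
    """Area under the ROC curve by the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


def q2_at_fpr(scores: List[float], labels: List[int], max_fpr: float = 0.05) -> float:
    """Percentage of correctly classified residues at the most permissive
    threshold whose false positive rate does not exceed max_fpr."""
    pos = sum(labels)
    neg = len(labels) - pos
    if neg == 0:
        raise ValueError("need at least one ordered residue")
    best_tp, best_fp = 0, 0  # fallback: every residue predicted as ordered
    for _, tp, fp in sweep_thresholds(scores, labels):
        if fp / neg > max_fpr:
            break  # fp never decreases, so later thresholds also exceed the cap
        best_tp, best_fp = tp, fp
    correct = best_tp + (neg - best_fp)
    return 100.0 * correct / len(labels)


# Toy data (hypothetical): eight residues, three of them disordered.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 0, 0, 0]

print("AUC:", auc_trapezoid(roc_points(toy_scores, toy_labels)))
print("Q2 at 5% FPR:", q2_at_fpr(toy_scores, toy_labels, max_fpr=0.05))
```

With only a handful of distinct scores the ROC curve has few points, so the trapezoidal estimate is coarse; a real evaluation would pool scores over all residues of the test targets.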