| Literature DB >> 18366622 |
Joshua Hecker1, Jack Y Yang, Jianlin Cheng.
Abstract
BACKGROUND: Many protein regions and some entire proteins have no definite tertiary structure, existing instead as dynamic, disorder ensembles under different physiochemical circumstances. Identification of these protein disorder regions is important for protein production, protein structure prediction and determination, and protein function annotation. A number of different disorder prediction software and web services have been developed since the first predictor was designed by Dunker's lab in 1997. However, most of the software packages use a pre-defined threshold to select ordered or disordered residues. In many situations, users need to choose ordered or disordered residues at different sensitivity and specificity levels.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366622 PMCID: PMC2386074 DOI: 10.1186/1471-2164-9-S1-S9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sensitivity and specificity over varying thresholds
| Threshold | 0.01 | 0.05 | 0.15 | 0.25 | 0.35 | 0.45 | 0.50 | 0.55 | 0.65 | 0.75 | 0.85 | 0.95 | 0.99 |
| Sensitivity | 0.98 | 0.85 | 0.72 | 0.64 | 0.57 | 0.51 | 0.49 | 0.46 | 0.39 | 0.32 | 0.24 | 0.13 | 0.01 |
| Specificity | 0.07 | 0.25 | 0.50 | 0.62 | 0.69 | 0.76 | 0.79 | 0.81 | 0.85 | 0.88 | 0.90 | 0.97 | 1.00 |
Figure 1Sensitivity and specificity over a varying decision threshold from 0.01 to 0.99, in steps of 0.01.
Figure 2Sensitivity vs. specificity over varying threshold
Figure 3Example output from modified DISpro. Displays probability of disorder for each residue in a sequence.
Figure 4ROC curves of eight predictors on the CASP7 dataset consisted of 95 protein targets.
The ROC scores of eight predictors on the CASP7 dataset
| Predictor | ROC score |
| DISpro | 0.864 |
| DISOPRED | 0.862 |
| GeneSilico | 0.851 |
| MBI | 0.839 |
| BIME | 0.834 |
| DRIP-PRED | 0.804 |
| Distill | 0.757 |
| ProfBval | 0.710 |
Figure 5Frequency of lengths of disordered regions.