| Literature DB >> 16549034 |
Yu Xue1, Ao Li, Lirong Wang, Huanqing Feng, Xuebiao Yao.
Abstract
BACKGROUND: As a reversible and dynamic post-translational modification (PTM) of proteins, phosphorylation plays essential regulatory roles in a broad spectrum of the biological processes. Although many studies have been contributed on the molecular mechanism of phosphorylation dynamics, the intrinsic feature of substrates specificity is still elusive and remains to be delineated.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16549034 PMCID: PMC1435943 DOI: 10.1186/1471-2105-7-163
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The outline of the training and procedure of PPSP.
The performances of self-consistency, Jack-knife validation and n -fold (4-, 6-, 8-, 10-fold in this work) cross-validation for four well-studied PKs of PKA, CK2, ATM and S6K. The n- fold cross-validation has been performed for the large data sets (N ≥ 30).
| 90.11 | 91.70 | 83.21 | 90.01 | 93.02 | 94.06 | 92.85 | 97.97 | ||
| 90.11 | 90.46 | 83.21 | 88.44 | 86.05 | 91.89 | 92.86 | 91.05 | ||
| 4- | 90.11 | 90.43 | 81.02 | 87.90 | 86.37 | 90.14 | N/A | N/A | |
| 6- | 90.11 | 90.52 | 81.02 | 88.34 | 86.37 | 90.60 | N/A | N/A | |
| 8- | 90.11 | 90.45 | 81.75 | 88.48 | 86.05 | 90.65 | N/A | N/A | |
| 10- | 90.11 | 90.48 | 81.75 | 88.22 | 86.05 | 91.39 | N/A | N/A | |
| 173 | 142 | 43 | 14 | ||||||
| 8, 408 | 5, 332 | 2, 048 | 683 | ||||||
The self-consistency performance and Jack-knife validation for four novel PKs of TRK, mTOR, SyK and MET/RON.
| 92.31 | 97.40 | 93.33 | 96.46 | 92.59 | 91.98 | 89.47 | 95.90 | ||
| 84.62 | 96.10 | 93.33 | 91.27 | 77.79 | 86.79 | 73.68 | 91.80 | ||
| 13 | 14 | 27 | 19 | ||||||
| 77 | 433 | 251 | 122 | ||||||
With the default cut-off of PPSP, the percentile of the sites predicted to be potential true positive hits is listed. Both random ennea-peptides and data sets from human proteome have been computed, separately.
| S | T | Y | S | T | Y | |
| 11.75% | 2.20% | 14.61% | 3.20% | |||
| 9.18% | 3.18% | 12.60% | 5.65% | |||
| 8.42% | 1.96% | 8.95% | 2.13% | |||
| 14.72% | 3.71% | 14.90% | 3.89% | |||
| 5.95% | 7.09% | 8.20% | 9.14% | |||
| 3.59% | 3.94% | |||||
| 7.00% | 9.74% | |||||
| 13.37% | 13.77% | |||||
Figure 2the distribution of risk difference of random and human proteome data set of PKA-specific site prediction is diagramed in Figure 2. A. Distribution of Risk Difference of random data set (serine) of PKA-specific site prediction. B. Distribution of Risk Difference of random data set (threonine) of PKA-specific site prediction. C. Distribution of Risk Difference of human proteome data set (serine) of PKA-specific site prediction. D. Distribution of Risk Difference of Human proteome data set (threonine) of PKA-specific site prediction.
The prediction performance of Scansite, NetPhosK, KinasePhos and GPS for four well-studied PKs of PKA, CK2, ATM and S6K.
| Defaulta | 90.11 | 91.7 | 0.3841 | 83.21 | 90.01 | 0.3596 | 93.02 | 94.06 | 0.4627 | 92.85 | 97.97 | 0.6618 | |
| Highb | 21.98 | 99.96 | 0.4450 | 10.95 | 99.86 | 0.2655 | 18.6 | 99.8 | 0.3443 | N/A | N/A | N/A | |
| Medium | 44.51 | 99.39 | 0.5084 | 27.01 | 99.11 | 0.3342 | 25.58 | 98.89 | 0.2756 | N/A | N/A | N/A | |
| Low | 47.8 | 98.29 | 0.4041 | 54.02 | 96.34 | 0.3684 | 51.16 | 94.89 | 0.2739 | N/A | N/A | N/A | |
| Default | 79.12 | 90.65 | 0.3165 | 82.48 | 89.43 | 0.3464 | 86.01 | 98.51 | 0.6786 | 82.35 | 97.14 | 0.5404 | |
| 90% (Sp)d | 90.72 | 91.3 | 0.3783 | 72.53 | 91.58 | 0.3384 | 88.37 | 87.8 | 0.3137 | N/A | N/A | N/A | |
| 95% (Sp) | 89.18 | 94.62 | 0.4595 | 64.58 | 94.93 | 0.3806 | 88.37 | 92.14 | 0.3893 | N/A | N/A | N/A | |
| 100% (Sp) | 76.8 | 98.47 | 0.6154 | 54.86 | 98.66 | 0.5222 | 86.05 | 96.89 | 0.5497 | N/A | N/A | N/A | |
| Default | 88.88 | 90.57 | 0.3564 | 82.99 | 87.59 | 0.3210 | 90.86 | 89.55 | 0.3498 | 94.9 | 91.34 | 0.3964 | |
a. The default parameters are employed for PPSP, NetPhosK and GPS.
b. ScanSite 2.0 has three thresholds for prediction, including high, medium and low stringencies.
c. N/A – not available.
d. KinasePhos has paid attention to prediction specificity with three cut-off values, as 90%, 95% and 100%.
The experimental verified vs. predicted CK2-specific phosphorylation sites of Bluetongue virus (BTV) nonstructural protein 2 (NS2), Drosophila transcription factor GAGA and human Calmodulin protein.
| S249, S259 | S378, S388 | T79, S81, S101, T117 | ||
| T247, | T123, S335, S337, S380, | T5, | ||
| T87, T88, S204, T247, | T5, T28, T44, T62, | |||
| high | N/A | N/A | ||
| medium | N/A | |||
| low | T88, S182, T247, | T385, T394 | ||
| 90% (Sp) | T247, | S240, S241, S339, | T5, T44, | |
| 95% (Sp) | S240, S339, | T5, T44, | ||
| 100% (Sp) | S339, S521 | T5, | ||
| T123, S337, S339, S385, S386, | T5, T44, | |||
Figure 3The prediction results of Bluetongue virus (BTV) nonstructural protein 2 (NS2), Drosophila transcription factor GAGA and human Calmodulin protein with PPSP. Figure 3A – prediction results of NS2; Figure 3B – prediction results of GAGA; Figure 3C – prediction results of Calmodulin.
Figure 4The diagram of potential phosphorylation sites of human RasGrf1 (Q13972) and TID1 (Q96EY1) by TRK.