| Literature DB >> 15980458 |
Hsien-Da Huang1, Tzong-Yi Lee, Shih-Wei Tzeng, Jorng-Tzong Horng.
Abstract
KinasePhos is a novel web server for computationally identifying catalytic kinase-specific phosphorylation sites. The known phosphorylation sites from public domain data sources are categorized by their annotated protein kinases. Based on the profile hidden Markov model, computational models are learned from the kinase-specific groups of the phosphorylation sites. After evaluating the learned models, the model with highest accuracy was selected from each kinase-specific group, for use in a web-based prediction tool for identifying protein phosphorylation sites. Therefore, this work developed a kinase-specific phosphorylation site prediction tool with both high sensitivity and specificity. The prediction tool is freely available at http://KinasePhos.mbc.nctu.edu.tw/.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15980458 PMCID: PMC1160232 DOI: 10.1093/nar/gki471
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The flow of the proposed scheme.
Figure 2The KinasePhos web interface.
The selected models learned and used in the web server
| Residues | Protein kinases | Score threshold | Precision | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|---|---|
| Serine | S_PKA | −4.5 | 0.85 | 0.91 | 0.84 | 0.88 |
| S_PKC | −4.5 | 0.87 | 0.77 | 0.88 | 0.82 | |
| S_PKG (27) | −9.5 | 0.94 | 0.96 | 0.93 | 0.95 | |
| S_PKB (37) | −6.5 | 0.88 | 0.76 | 0.89 | 0.82 | |
| S_CaM-II (37) | −8.0 | 0.84 | 0.76 | 0.86 | 0.81 | |
| S_CKI (30) | −7.0 | 0.82 | 0.65 | 0.86 | 0.76 | |
| S_CKII | −3.5 | 0.95 | 0.79 | 0.96 | 0.87 | |
| S_cdc2 (43) | −10 | 0.94 | 0.94 | 0.94 | 0.94 | |
| S_MAPK (27) | −6.0 | 0.97 | 0.77 | 0.97 | 0.87 | |
| S_CDK | −6.5 | 0.83 | 0.87 | 0.82 | 0.85 | |
| S_ATM (38) | −8.0 | 0.92 | 0.87 | 0.92 | 0.90 | |
| S_IKK (32) | −8.0 | 0.75 | 0.75 | 0.75 | 0.75 | |
| Average | 0.88 | 0.84 | 0.88 | 0.86 | ||
| Threonine | T_PKA (19) | −7.0 | 0.97 | 0.94 | 0.97 | 0.95 |
| T_PKC (37) | −8.5 | 0.85 | 0.83 | 0.85 | 0.84 | |
| T_CKII (17) | −9.0 | 0.79 | 0.98 | 0.75 | 0.86 | |
| T_cdc2 (23) | −9.5 | 1.00 | 0.95 | 1.00 | 0.97 | |
| T_MAPK (15) | −9.5 | 1.00 | 1.00 | 1.00 | 1.00 | |
| T_CDK (35) | −6.5 | 0.94 | 0.86 | 0.94 | 0.90 | |
| Average | 0.91 | 0.92 | 0.91 | 0.91 | ||
| Tyrosine | Y_EGFR (30) | −5.5 | 0.89 | 0.83 | 0.89 | 0.86 |
| Y_INSR (16) | −9.5 | 0.82 | 0.78 | 0.83 | 0.80 | |
| Y_Src (28) | −5.0 | 0.86 | 0.81 | 0.87 | 0.84 | |
| Y_Abl (27) | −2.0 | 0.93 | 0.48 | 0.96 | 0.72 | |
| Y_Syk (22) | −8.5 | 0.83 | 0.91 | 0.82 | 0.86 | |
| Y_Jak | −3.5 | 0.91 | 0.66 | 0.93 | 0.80 | |
| Average | 0.86 | 0.81 | 0.87 | 0.84 |
aThe dataset is clustered by MDD.
The prediction accuracy comparison between NetPhos, DISPHOS, rBPNN and KinasePhos
| Residue types | NetPhos | DISPHOS | rBPNN | KinasePhos |
|---|---|---|---|---|
| Serine | 0.69 | 0.75 | No data | 0.86 |
| Threonine | 0.72 | 0.80 | No data | 0.91 |
| Tyrosine | 0.61 | 0.82 | No data | 0.84 |
| Total or average | 0.67 | 0.79 | 0.87 | 0.87 |