| Literature DB >> 23941207 |
Liang Zou1, Mang Wang, Yi Shen, Jie Liao, Ao Li, Minghui Wang.
Abstract
BACKGROUND: Dynamic protein phosphorylation is an essential regulatory mechanism in various organisms. In this capacity, it is involved in a multitude of signal transduction pathways. Kinase-specific phosphorylation data lay the foundation for reconstruction of signal transduction networks. For this reason, precise annotation of phosphorylated proteins is the first step toward simulating cell signaling pathways. However, the vast majority of kinase-specific phosphorylation data remain undiscovered and existing experimental methods and computational phosphorylation site (P-site) prediction tools have various limitations with respect to addressing this problem.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23941207 PMCID: PMC3765618 DOI: 10.1186/1471-2105-14-247
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Prediction performance of models with different single-side window sizes . (A) The escalating trend for AUC with the improvement of m. The slope of the left side is larger than that of the right. (B) The optimal m for kinases is diverse. Sensitivity was evaluated when the corresponding specificity was greater than or equal to 99%.
Figure 2Difference of amino acid distributions in positive and negative data. Panels (A) and (B) represent distinct amino acid distribution patterns in CK2 alpha’s positive and negative datasets, respectively. Panels (C) and (D) represent different amino acid distribution patterns in CDC2’s positive and negative datasets, respectively. The X-axis represents the single side window size m.
Figure 3Performance of two sequence encoding strategies: CMS and binary encoding. (A) Performance of CK2 alpha models using the CMS and binary encoding strategies. (B) Performance of CDC2 models using CMS and binary encoding strategies. The red lines represent the CMS method and the black lines represent the binary method.
Figure 4Comparing with kinase-specific P-site prediction tools: KinasePhos2.0, Musite, and GPS2.1 at high specificities. Panel (A) depicts the performance of the tool in CK2 alpha kinase and (B) illustrates the performance in CDC2 kinase. The ROC curves of PKIS are plotted in red solid lines.
Comparison of PKIS with kinase-specific P-site prediction tools on testing data
| Erk2 (MAPK1) | 13.9% | 97.6% | 5.7% | 97.2% | 4.4% | 97.4% | 3.8% | 97.4% | 13.9% | 97.6% |
| p38a (MAPK14) | 13.5% | 97.3% | 0.0% | 96.3% | 8.1% | 96.6% | 0.0% | 97.3% | 5.4% | 97.3% |
| CK2alpha | 60.7% | 99.1% | 58.3% | 99.0% | 49.1% | 99.1% | 35.6% | 99.1% | 53.4% | 99.0% |
| CDC2 | 37.5% | 93.3% | 12.5% | 92.0% | 0.0% | 90.3% | 0.0% | 93.2% | 12.5% | 93.2% |
| PKCa | 37.3% | 99.8% | 0.0% | 99.4% | 0.0% | 99.6% | 1.7% | 99.4% | 10.2% | 99.7% |
| SYK | 45.0% | 93.0% | 25.0% | 93.0% | NA | NA | 35.0% | 94.4% | 45.0% | 93.0% |
| LCK | 40.0% | 97.4% | 26.7% | 92.1% | 6.7% | 93.4% | 20.0% | 96.1% | 40.0% | 97.4% |
| FYN | 23.5% | 94.6% | 11.8% | 94.6% | 5.9% | 90.5% | 23.5% | 94.6% | 23.5% | 94.6% |
NA: This tool could not predict whether a residue had been phosphorylated by the corresponding kinase or not.
Significant KEGG pathways enriched in the combined dataset
| Natural-killer-cell-mediated cytotoxicity | 9 (2) | 2.71E-08 | 1.27E-06 |
| Fc-gamma-R-mediated phagocytosis | 6 (1) | 3.29E-05 | 7.72E-04 |
| Fc epsilon RI signaling pathway | 5 (1) | 2.52E-04 | 2.95E-03 |
| B cell receptor signaling pathway 1 | 5 (2) | 2.16E-04 | 3.38E-03 |
| Pathogenic | 4 (2) | 1.54E-03 | 1.44E-02 |
| ErbB signaling pathway | 4 (0) | 5.15E-03 | 3.97E-02 |
1 Term not found when only kinase-specific phosphorylated data was used in Phospho.ELM.
2 The number of Syk’s substrates predicted by PKIS at high specificity.