| Literature DB >> 21703007 |
Tzong-Yi Lee1, Neil Arvin Bretaña, Cheng-Tsung Lu.
Abstract
BACKGROUND: Protein phosphorylation catalyzed by kinases plays crucial regulatory roles in intracellular signal transduction. Due to the difficulty in performing high-throughput mass spectrometry-based experiment, there is a desire to predict phosphorylation sites using computational methods. However, previous studies regarding in silico prediction of plant phosphorylation sites lack the consideration of kinase-specific phosphorylation data. Thus, we are motivated to propose a new method that investigates different substrate specificities in plant phosphorylation sites.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21703007 PMCID: PMC3228547 DOI: 10.1186/1471-2105-12-261
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1System flow of this study.
Data statistics of the experimentally verified phosphorylation sites collected from TAIR9 protein database.
| Phosphorylated residue | Type of Data | Original data | Non-redundant data | Balanced data |
|---|---|---|---|---|
| positive | 2516 | 2506 | 2506 | |
| negative | 97965 | 87877 | 2506 | |
| positive | 382 | 378 | 378 | |
| negative | 51434 | 10402 | 378 | |
| positive | 108 | 108 | 108 | |
| negative | 26405 | 1681 | 108 | |
Two Sample Logo in plant phosphoserine, phosphothreonine, and phosphotyrosine.
| Phosphorylated Residue | Number of Non-redundant data | Entropy Plot of Sequence Logo |
|---|---|---|
| 2506 | ||
| 378 | ||
| 108 | ||
The substrate site specificity and predictive performance in nine MDD-clustered subgroups of phosphoserine.
| Group | Number of data | Entropy plot of substrate motif | HMMER bit score | Pre | Sn | Sp | Acc |
|---|---|---|---|---|---|---|---|
| 624 | -3 | 89.8% | 90.6% | 89.7% | 90.2% | ||
| 786 | -4 | 76.6% | 71.9% | 78.1% | 75.0% | ||
| 355 | -7 | 72.1% | 68.9% | 73.2% | 71.0% | ||
| 230 | -6 | 90.3% | 93.0% | 90.0% | 91.5% | ||
| 77 | -12 | 76.9% | 75.0% | 74.3% | 74.5% | ||
| 93 | -10 | 90.2% | 88.0% | 90.4% | 89.1% | ||
| 109 | -8 | 90.8% | 83.3% | 91.7% | 87.5% | ||
| 61 | -12 | 77.7% | 85.0% | 80.2% | 82.7% | ||
| 171 | -6 | 79.6% | 83.1% | 78.5% | 80.8% | ||
The substrate site specificity and predictive performance in six MDD-clustered subgroups of phosphothreonine.
| Group | Number of data | Entropy plot of substrate motif | HMMER bit score | Pre | Sn | Sp | Acc |
|---|---|---|---|---|---|---|---|
| 43 | -16 | 79.4% | 74.1% | 81.1% | 77.7% | ||
| 88 | -9 | 89.4% | 91.9% | 88.6% | 90.3% | ||
| 77 | -9 | 82.1% | 80.3% | 81.9% | 81.1% | ||
| 34 | -15 | 78.6% | 76.6% | 77.1% | 76.5% | ||
| 42 | -13 | 79.4% | 71.1% | 81.1% | 75.9% | ||
| 94 | -8 | 74.9% | 61.0% | 79.8% | 70.5% | ||
The substrate site specificity and predictive performance in six MDD-clustered subgroups of phosphotyrosine.
| Group | Number of data | Entropy plot of substrate motif | HMMER bit score | Pre | Sn | Sp | Acc |
|---|---|---|---|---|---|---|---|
| 11 | -16 | 93.3% | 90.0% | 90.0% | 90.0% | ||
| 7 | -14 | 80.0% | 80.0% | 100% | 90.0% | ||
| 15 | -18 | 85.0% | 90.0% | 86.6% | 85.0% | ||
| 20 | -17 | 90.0% | 90.0% | 90.0% | 90.0% | ||
| 16 | -16 | 90.0% | 86.6% | 88.3% | 87.1% | ||
| 39 | -16 | 93.3% | 92.1% | 92.5% | 92.4% | ||
The comparison of five-fold cross-validation between single HMM and MDD-clustered HMMs.
| Method | Phosphorylated residue | Pre | Sn | Sp | Acc |
|---|---|---|---|---|---|
| S | 49.5% | 58.6% | 70.0% | 66.2% | |
| T | 45.4% | 60.5% | 63.4% | 62.5% | |
| Y | 75.4% | 90.6% | 84.7% | 86.6% | |
| S | 82.6% | 82.0% | 82.9% | 82.4% | |
| T | 80.6% | 75.8% | 81.6% | 78.6% | |
| Y | 88.6% | 88.1% | 91.2% | 89.0% | |
The comprison of independent testing.
| Method | Phosphorylated residue | Pre (%) | Sn (%) | Sp (%) | Acc (%) |
|---|---|---|---|---|---|
| S | 52.6% | 50.6% | 77.2% | 68.3% | |
| T | 50.0% | 20.9% | 89.5% | 66.6% | |
| Y | 36.3% | 28.5% | 75.0% | 59.5% | |
| S | 74.2% | 81.4% | 71.6% | 76.5% | |
| T | 71.8% | 77.1% | 69.7% | 73.4% | |
| Y | 72.8% | 83.7% | 68.7% | 76.2% | |
| S | 55.5% | 61.9% | 50.6% | 56.2% | |
| T | 55.0% | 57.1% | 53.3% | 55.2% | |
| Y | 4.0% | 14.2% | 78.5% | 46.4% | |
| S | 58.6% | 71.6% | 49.3% | 60.5% | |
| T | 47.5% | 36.1% | 59.6% | 47.8% | |
| Y | 44.4% | 28.5% | 64.2% | 46.4% | |
| S | 66.8% | 60.6% | 69.3% | 65.0% | |
| T | 47.5% | 36.1% | 61.2% | 48.6% | |
| Y | 57.1% | 47.5% | 64.2% | 55.8% | |
The MDD-clustered subgroups matched to the well-known kinase groups in non-plant organisms of Phospho.ELM.
| Group | Number of data | Entropy plot of motif | Matched kinase | Entropy plot of kinase motif |
|---|---|---|---|---|
| 624 | MAPK | |||
| CDK | ||||
| 786 | CK2 | |||
| 230 | CAMK2 | |||
| 43 | MAPK | |||
| CDK | ||||
| 77 | CK2 | |||
Figure 2Cases study of (A) protein AT5G05040.1 from .