| Literature DB >> 25324856 |
Antonio Palmeri1, Fabrizio Ferrè1, Manuela Helmer-Citterich1.
Abstract
Phosphate plays a chemically unique role in shaping cellular signaling of all current living systems, especially eukaryotes. Protein phosphorylation has been studied at several levels, from the near-site context, both in sequence and structure, to the crowded cellular environment, and ultimately to the systems-level perspective. Despite the tremendous advances in mass spectrometry and efforts dedicated to the development of ad hoc highly sophisticated methods, phosphorylation site inference and associated kinase identification are still unresolved problems in kinome biology. The sequence and structure of the substrate near-site context are not sufficient alone to model the in vivo phosphorylation rules, and they should be integrated with orthogonal information in all possible applications. Here we provide an overview of the different contexts that contribute to protein phosphorylation, discussing their potential impact in phosphorylation site annotation and in predicting kinase-substrate specificity.Entities:
Keywords: cellular signaling; kinase-peptide specificity; kinase-substrate specificity; phosphorylation context; phosphorylation prediction; signaling networks; substrate recruitment
Year: 2014 PMID: 25324856 PMCID: PMC4179730 DOI: 10.3389/fgene.2014.00315
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Computational methods for kinase-specific phosphorylation site prediction.
| Scansite | PSSM | OPL experimental data | Yaffe et al., | nr | 62 | nr | nr | nr | |
| Scansite 2.0 | PSSM | OPL experimental data | Obenauer et al., | nr | 62 | nr | nr | nr | |
| Predikin 1.0 | Structural | PhosphoBase | Brinkworth et al., | nr | All kinases with structural homologs | nr | Up to 70% of known | nr | |
| PhosphoBase sites in top 5% | |||||||||
| NetphosK | Artificial Neural Network | PhosphoBase | Blom et al., | YES | 6 | nr | 0.22–0.61 MCC depending on the kinase | 3-fold | |
| GPS | BLOSUM62 similarity + Markov Clustering | phosphoELM | Zhou et al., | YES | 52 | nr | 91.8 Sn, 85.0 Sp (PKA) | nr | |
| 94,4 Sn, 97,1 Sp (Aurora-B) | |||||||||
| KinasePhos 1.0 | HMM | PhosphoBase + SwissProt | Huang et al., | Partial | 18 | nr | 0.87 | k-fold | |
| PPSP | Bayesian | phosphoELM | Xue et al., | YES | 68 | 40–90% | 90.1Sn 91.7Sp (PKA) 83.2 Sn 90.0Sp (CK2) 93.0Sn 94.1 Sp (ATM) 92.8 Sn 98.0 Sp (S6K) | Leave-one-out | |
| pkaPS | Structural | phosphoELM | Neuberger et al., | YES | 1 | 30% | 95.8 Sn 93.5 Sp (PKA) | Neighbor-jacknife | |
| Cluster-based | phosphoELM + MitoCheck | Moses et al., | YES | 1 | nr | nr | |||
| KinasePhos 2.0 | SVM | phosphoELM + SwissProt | Wong et al., | 4 | YES | 91 | Leave-one-out | ||
| NetworKIN | Artificial Neural Network/PSSM + PPIN context | phosphoELM | Linding et al., | YES | 222 kinases | YES | nr | YES | |
| NetPhorest | Artificial Neural Network/PSSM | phosphoELM | Miller et al., | YES | 222 kinases | YES | 0.58–1 AUC | YES | |
| PhoScan | PSSM | phosphoELM | Li et al., | 3 | At high-stringency cutoffs, ~50% Sn up to 99%. Sp At low-stringency cutoffs, ~90% Sn and Sp for CDK, PKA, CK2 | nr | |||
| MetaPredPS | MetaPredictor | phosphoELM + PhosphoSite + SwissProt | Wan et al., | YES | 15 | YES | 0.86 (CDK) | YES | |
| 0.89 (CK2) | |||||||||
| 0.85 (PKA) | |||||||||
| 0.78 (PKC) | |||||||||
| Predikin 2.0 | Structural with HMM-based kinase identification | phosphoELM + SwissProt | Saunders et al., | YES | All kinases with structural homologs | nr | 0.86–0.88 AUC (S/T) | 10-fold | |
| 0.66–0.76 AUC (Y) | |||||||||
| GPS 2.0 | Optimized BLOSUM62 similarity | phosphoELM | Xue et al., | YES | 70 | YES | 83.1 Sn 95.0 Sp (PKA) | Leave-one-out | |
| 100.0 Sn 94.0 Sp (ATM) 78.0 Sn 95.2 Sp (CDC2) 54.0 Sn 95.3 Sp (Src) | |||||||||
| CRPhos | Conditional Random Fields | phosphoELM | Dang et al., | YES | 4 | YES | 0.93 AUC (CK2) | k-fold | |
| 0.96 AUC (PKA) | |||||||||
| 0.97 AUC (CDK) | |||||||||
| 0.91 AUC (PKC) | |||||||||
| Phos3D | SVM using structural information | phosphoELM | Durek et al., | YES | 6 | YES | 0.73 (Ser kinases) | YES | |
| 0.69 (Thr kinases) | |||||||||
| 0.67 (Tyr kinases) | |||||||||
| Musite | SVM | phosphoELM + SwissProt + PhosphoPep + PhosphAt | Gao et al., | YES | 13 in 6 organisms | 50% | 1.7–85.5 Sn, 99.9–90.0 Sp (PKA) | k-fold | |
| 6.1–83.6 Sn, 99.9–90.0 Sp (CK2) | |||||||||
| 0.9–81.9 Sn, 99.9-90.0 Sp (MAPK) | |||||||||
| GPS 2.1 | Optimized BLOSUM62 similarity | phodphoELM | Xue et al., | Partial | 70 | nr | 70.73 as average on a set of kinase families | nr | |
| PrediKin | Structural with HMM-based kinase identification | phosphoELM + SwissProt | Ellis and Kobe, | YES | All kinases with structural homologs | nr | 0.86–0.88 AUC (S/T) | nr | |
| 0.66–0.76 AUC (Y) | |||||||||
| but lower Frobenius distance for PWMs | |||||||||
| RegPhos | SVM + PPI + subcellular localization | dbPTM | Lee et al., | Partial | ca. 100 | YES | 87.7 (PKC) | 5-fold | |
| 92.1 (PIKK) | |||||||||
| 92.8 (CDK) | |||||||||
| 91.9 (INSR) | |||||||||
| ConDens | Conservation of Local Motif Density | No need for training dataset | Lai et al., | YES | All kinases with known motifs | Does not apply | 0.79 AUC | Does not apply | |
| PKIS | SVM | phosphoELM | Zou et al., | YES | 56 | YES | 13.9 Sn 97.6 Sp (Erk2) | Leave-one-out | |
| 13.5 Sn 97.3 Sp (p38a) | |||||||||
| 60.7 Sn (0.1 Sp (CK2 α) | |||||||||
| 37.5 Sn 93.3 Sp (CDC2) | |||||||||
| 37.3 Sn 99.8 Sp (PKCα) | |||||||||
| 45.0 Sn 93.0 Sp (SYK) | |||||||||
| 40.0 Sn 97.4 Sp (LCK) | |||||||||
| 23.5 Sn 94.6 Sp (FYN) | |||||||||
| RegPhos 2.0 | SVM + PPI + subcellular localization | dbPTM | Huang et al., | 122 | nr | nr | nr | ||
| SVM | phosphoELM | Xu et al., | YES | 7 | 70% | nr | 10-fold | ||
| phos_pred | Random Forest | phosphoELM | Fan et al., | YES | 54 | YES | 68.1–83.4 Sn, 99.1–97.2 Sp (CK2α) | nr | |
| 68.9–82.2 Sn, 100–95.6 Sp (GSK3B) | |||||||||
| 71.5–83.5 Sn, 99.1–98.3 Sp (MAPK1) | |||||||||
| 83.8–88.0 Sn, 99.1–98.3 Sp (MAPK3) etc. |
The methods that are currently available for predicting kinase-specific phosphorylation. nr, not reported.
Impossible to benchmark, because no other methods were available at that time.
Figure 1Specificity levels in Protein Phosphorylation. (A) Peptide specificity in a tyrosine kinase, Insulin Receptor (IR), a proline-directed kinase, Cyclin-dependent kinase 2 (CDK2), and a serine threonine kinase, cAMP-dependent protein kinase catalytic subunit alpha (PKA). Peptide preferences for each kinase are represented as sequence logos (top). The binding pockets of the three kinases have been visualized with UCSF Chimera, and the surfaces colored according to their electrostatic potential: red, positive; blue, negative; white, neutral (bottom). The structures from left to right show IR in complex with a peptide (pdb 1IR3), CDK2 in complex with a substrate peptide and cyclin A (pdb 1QMZ), which contributes to peptide specificity with a negative charged surface shown in the upper right of the figure, and PKA in complex with a peptide inhibitor (pdb 3FJQ). (B) Substrate recruitment. The kinase-substrate complexes concentration can be locally increased with docking motifs, protein interaction domains, and scaffold proteins. As an example of a docking motif, MAPK p38 bound to the docking site on its nuclear substrate MEF2A is shown on the left, colored in purple (pdb 1LEW). The protein interaction domains SH3 and SH2 domains in Src are fundamental for Src activation (inactive Src: pdb 2SRC), as shown in the cartoon in the middle (Xu et al., 1999). MAPK Fus3 in complex with a Ste5 peptide (pdb 2F49) is shown on the right.
Figure 2Methodologies for the experimental identification of phosphorylation sites. In the inner circle the techniques for the detection of phosphorylation sites are reported, while the outer circle displays the major techniques for dissecting kinase-substrates interactions, both at the level of direct determination of kinase-substrate interaction (kinase activity assays, OPL, MS with ATP analogs, structural data, western blot, optogenetics) and at the contextual information generation level, i.e., the methodologies that allow the identification of interacting domain preferences, domain-peptide interactions, etc. (Y2H, Y3H, phage display, structural data, western blot, optogenetics). In the field of PTM identification, future advancements in MS will allow the monitoring of multiple PTMs co-modulation, while for kinase-substrate interactions, the use of ATP analogs coupled with MS/MS is currently the most promising high-throughput technique to link kinases to their substrates in vivo. Y2H/Y3H: yeast 2/3 hybrid system (Y3H could be deployed for the study of scaffold proteins-mediated interactions).