| Literature DB >> 28458782 |
Martina Audagnotto1, Matteo Dal Peraro1.
Abstract
Post-translational modifications (PTMs) occur in almost all proteins and play an important role in numerous biological processes by significantly affecting proteins' structure and dynamics. Several computationpan>al approaches have been developed to study PTMs (e.g., phosphorylationpan>, sumoylationpan> or palmitoylationpan>) showing the importance of these techniques in predicting modified sites that can be further investigated with experimental approaches. In this review, we summarize some of the available online platforms and their contribution in the study of PTMs. Moreover, we discuss the emerging capabilities of molecular modeling and simulation that are able to complement these bioinformatics methods, providing deeper molecular insights into the biological function of post-translational modified proteins.Entities:
Year: 2017 PMID: 28458782 PMCID: PMC5397102 DOI: 10.1016/j.csbj.2017.03.004
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Schematic representation of PTMs discussed in this review.
Schematic relationship between PTMs and their implication in biological functions.
PTM prediction webservers. Abbreviations: artificial neuronal network (ANN); support vector machine (SVM); random forest method (RFM); Hidden Markov model (HMM); weight matrix (WM); group based phosphorylation scoring method (GPS); binary profile of patterns (BPP); composition profile of patterns (CPP); PSSM profile of patterns (PPM); average surface accessibility (ASA); neuronal network (NN); knowledge-based (KB); conditional random field (CRF); group-based prediction (GBP); binary profile bayesian (BPB); information gain (IG); Bayesian discriminant (BD); enrichment based method (EBM); binary-relative adaptive binomial score Bayesian (Bi-BSP); logistic regression model (LRM); synthetic minority oversampling technique (SMOT); Markov chain clustering (MCC); particle swarm optimization (PSO); genetic variability (GV); position frequency matrix (PFM); covariance discriminant algorithm (CD): machine learning (ML).
| PTM type | Covalent attachment of small chemical groups | Year | Description | Method | Information | |
|---|---|---|---|---|---|---|
| Phosphorylation | NetPhos 3.1 | 1999 | K-specific and K-independent | ANN | Prediction based on 17 different kinases | |
| Scansite | 2003 | K-specific | WM | Identification of short protein sequence motifs that are recognized by modular signaling domains or mediated specific interaction with proteins | ||
| PhosphoSitePlus | 2004 | K-specific | – | Repository of human and mouse phosphorylation sites | ||
| GPS | 2005 | K-specific | GPS | Prediction based on 71 PK groups ( | ||
| KinasePhos 2.0 | 2007 | K-specific | SVM | SVM coupled with protein coupling pattern | ||
| PhosphoELM | 2010 | K-independent | – | Repository of | ||
| PPRED | 2010 | K-independent | SVM | Prediction based on evolutionary information | ||
| PhosPhortholog | 2015 | K-independent | – | Database for cross-species comparison | ||
| Glycosylation | bigPI | 1999 | GPI-anchor | KB | Prediction for protozoa and metazoa | |
| O-GlycBase | 1999 | O-glycosylated | – | Repository of O-glycosylated proteins based on protein sequence database and scientific literature | ||
| GlycoMod | 2001 | N-,O-glycosylated | Experimental determined | Match between the experimentally determined masses and the predicted protease (SWISSPROT and TrEMBL databases) | ||
| YinOYang | 2001 | N-,C-,O-glycosylated | NN | Prediction based on eukaryotes protein sequences | ||
| NetNGlyC | 2002 | N-glycosylated | NN | Prediction for procaryotes | ||
| GlyProt | 2005 | N-glycosylated | SWEET-II | 3D model of glycoproteins based on a PDB structure without attached glycans | ||
| GPP | 2008 | N-,C-,O-glycosylated | RF | Prediction of glycosylation sites and the propensity of association with modified residues | ||
| NGlycPred | 2012 | N-glycosylated | RF | Combination of different structure and residues pattern information | ||
| GLYCOPP | 2012 | N-,O-glycosylated | SVM | Prediction based on different approaches (BPP, CPP, PPP, ASA + BPP) | ||
| NetOGlyC | 2013 | O-glycosylated | NN | Prediction for prokaryotes | ||
| S-nitrosylation | GlycoMine | 2015 | N-,C-,O-glycosylated | RF | Determination of the features important for glycosylation site specificity | |
| GPS-SNO | 2010 | SNO sites | GBP | Prediction of putative SNO based on a database of 504 experimentally verified SNO | ||
| Methylation | iSNO-PseAAC | 2013 | SNO sites | CRF | Identification of nitrosylated protein on an independent data set (731 experimentally verified SNO and 810 experimentally non verified SNO) | |
| MeMo | 2006 | R-,L-methylated | SVM | Prediction based on orthogonal binary coding scheme for representing protein sequence fragments | ||
| BPB-PPMS | 2009 | R-,L-methylated | BPB and SVM | Prediction based on experimental data | ||
| MASA | 2009 | K-,R-,E-,N-methylated | SVM | Prediction based on structural information (SASA and secondary structures) | ||
| PMes | 2012 | R-,K-methylated | SVM | Prediction based on physiochemical properties (VdW volume, position weight aminoacid, composition, solvent, SASA) | ||
| MethK | 2014 | K-methylated histone | SVM | Differentiation between K-methylated Histone and K-methylated non-Histone | ||
| iMethyl-PseAAC | 2014 | R-,K-methylated | SVM | Prediction based on physiochemical properties, sequence evolution, biochemical and structural disorder information | ||
| N-acetylation | PSSMe | 2016 | R-,L-methylated | IGF | Prediction based on species-specific models | |
| NetAcet | 2004 | Nα-acetylated | NN | Prediction for yeast and mammalian | ||
| PAIL | 2006 | Nε-,K-acetylated | BD | Prediction based on dataset of 246 acetylated substrates | ||
| N-Ace | 2010 | K-,A-,G-,M-,S- and T-acetylated | SVM | Prediction based on physiochemical properties | ||
| ASEB | 2012 | K-acetylated | EBM | Prediction based on protein-protein interaction information | ||
| BRABSB-PHKA | 2012 | K-acetylated | Bi-BSB | Prediction for human-specific lysine acetylated sites | ||
| PSKacePred | 2012 | K-acetylated | SVM | Prediction based on amynoacid composition, evolutionary similarity and physiochemical properties | ||
| LAceP | 2014 | K-acetylated | LRM | Prediction based on physiochemical properties | ||