| Literature DB >> 28813517 |
Pawan Kumar1, Joseph Joy1, Ashutosh Pandey1, Dinesh Gupta1.
Abstract
Protein methylation is an important Post-Translational Modification (PTMs) of proteins. Arginine methylation carries out and regulates several important biological functions, including gene regulation and signal transduction. Experimental identification of arginine methylation site is a daunting task as it is costly as well as time and labour intensive. Hence reliable prediction tools play an important task in rapid screening and identification of possible methylation sites in proteomes. Our preliminary assessment using the available prediction methods on collected data yielded unimpressive results. This motivated us to perform a comprehensive data analysis and appraisal of features relevant in the context of biological significance, that led to the development of a prediction tool PRmePRed with better performance. The PRmePRed perform reasonably well with an accuracy of 84.10%, 82.38% sensitivity, 83.77% specificity, and Matthew's correlation coefficient of 66.20% in 10-fold cross-validation. PRmePRed is freely available at http://bioinfo.icgeb.res.in/PRmePRed/.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28813517 PMCID: PMC5557562 DOI: 10.1371/journal.pone.0183318
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The relationship between different evaluation parameters and feature subsets.
A) The relationship between the Accuracy and number of features. B) The relationship between the Sensitivity and number of features. C) The relationship between the Specificity and number of features. D) The relationship between the MCC and number of features.
Comparisons with best models of different window lengths.
| Window Length | MCC | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|
| WL_19 (100) | 0.662 | 84.10% | 82.38% | 83.77% |
| WL_23 (150) | 0.606 | 80.93% | 80.36% | 80.25% |
| WL_27 (150) | 0.605 | 81.07% | 80.00% | 80.49% |
| WL_31 (200) | 0.629 | 81.35% | 80.22% | 82.45% |
| WL_35 (250) | 0.641 | 82.01% | 80.28% | 83.77% |
Comparison of PRmePRed with other prediction methods.
| Method (yr. developed) | Algorithm | MCC | Accuracy | Sensitivity | Specificity |
|---|---|---|---|---|---|
| MeMo (Chen et al. 2006)[ | SVM | 0.462 | 0.6839 | 0.3811 | 0.987 |
| MASA (Shien et al. 2009)[ | SVM | 0.411 | 0.6503 | 0.3095 | 0.991 |
| BPB-PPMS (Shao et al. 2009)[ | SVM | 0.253 | 0.5601 | 0.1202 | 1.000 |
| PMeS (Shi et al. 2012)[ | SVM | 0.159 | 0.5756 | 0.4253 | 0.726 |
| iMethyl-PseAAC (2014)[ | SVM | 0.302 | 0.5866 | 0.1768 | 0.997 |
| PSSMe (Wen et al. 2016) [ | SVM | 0.444 | 0.7162 | 0.6003 | 0.832 |
| MePred-RF (Wei et al. 2017) [ | RF | 0.462 | 0.6908 | 0.4095 | 0.972 |
| PRmePRed (2017) | SVM | 0.737 | 0.8683 | 0.8709 | 0.866 |
Fig 2ROC curve for SVM classifier with different datasets.
A) ROC curve for SVM classifier with training set. B) ROC curve for SVM classifier with test set. C) ROC curve for SVM classifier with independent set.
Fig 3Comparisons with other classifiers based on evaluation parameters.