| Literature DB >> 24977164 |
Wang-Ren Qiu1, Xuan Xiao2, Wei-Zhong Lin1, Kuo-Chen Chou3.
Abstract
Before becoming the native proteins during the biosynthesis, their polypeptide chains created by ribosome's translating mRNA will undergo a series of "product-forming" steps, such as cutting, folding, and posttranslational modification (PTM). Knowledge of PTMs in proteins is crucial for dynamic proteome analysis of various human diseases and epigenetic inheritance. One of the most important PTMs is the Arg- or Lys-methylation that occurs on arginine or lysine, respectively. Given a protein, which site of its Arg (or Lys) can be methylated, and which site cannot? This is the first important problem for understanding the methylation mechanism and drug development in depth. With the avalanche of protein sequences generated in the postgenomic age, its urgency has become self-evident. To address this problem, we proposed a new predictor, called iMethyl-PseAAC. In the prediction system, a peptide sample was formulated by a 346-dimensional vector, formed by incorporating its physicochemical, sequence evolution, biochemical, and structural disorder information into the general form of pseudo amino acid composition. It was observed by the rigorous jackknife test and independent dataset test that iMethyl-PseAAC was superior to any of the existing predictors in this area.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24977164 PMCID: PMC4054830 DOI: 10.1155/2014/947416
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Schematic drawing to show the involvement of the Arg-methylation and Lys-methylation in gene regulation (adapted from [13] with permission).
Figure 2Schematic drawing to show the Chou's peptide formulation for studying (a) Arg-methylation and (b) Lys-methylation (adapted from [32, 33] with permission).
The metrics rates obtained by the jackknife test on the Arg-methylation system, where the positive dataset contains 185 samples (see Online Supporting Information S3), while the negative dataset consists of seven subsets with each containing 185 samples except for the 6th subset that contains 186 samples (see Online Supporting Information S5).
| Negative subset | Acc (%) | MCC | Sn (%) | Sp (%) |
|---|---|---|---|---|
| 1 | 72.16 | 0.45 | 64.32 | 80.00 |
| 2 | 78.65 | 0.57 | 79.46 | 77.84 |
| 3 | 73.24 | 0.47 | 66.49 | 80.00 |
| 4 | 78.71 | 0.57 | 78.38 | 79.03 |
| 5 | 77.30 | 0.55 | 70.81 | 83.78 |
| 6 | 75.68 | 0.51 | 75.68 | 75.68 |
| 7 | 77.57 | 0.56 | 67.57 | 87.57 |
|
| ||||
| Average | 76.19 | 0.53 | 71.81 | 80.56 |
Figure 3A semiscreenshot to show the top page of iMethyl-PseAAC. Its web-site address is at http://www.jci-bioinfo.cn/iMethyl-PseAAC.
The metrics rates obtained by the jackknife test on the Lys-methylation system, where the positive dataset contains 226 samples (see Online Supporting Information S4), while the negative dataset consists of seven subsets with each containing 217 samples except for the 5th subset that contains 216 samples (see Online Supporting Information S6).
| Negative subset | Acc (%) | MCC | Sn (%) | Sp (%) |
|---|---|---|---|---|
| 1 | 73.36 | 0.47 | 72.57 | 74.19 |
| 2 | 65.01 | 0.30 | 63.27 | 66.82 |
| 3 | 71.11 | 0.42 | 70.35 | 71.89 |
| 4 | 67.27 | 0.35 | 63.72 | 70.97 |
| 5 | 73.76 | 0.48 | 73.01 | 74.54 |
| 6 | 72.23 | 0.45 | 64.16 | 80.65 |
| 7 | 72.46 | 0.45 | 73.01 | 71.89 |
|
| ||||
| Average | 70.74 | 0.42 | 68.58 | 72.99 |
Comparison of iMethyl-PseAAC with the existing web-server predictors when tested for identifying Arg-methylation sites by the independent dataset (see Online Supporting Information S7).
| Predictor | Acc (%) | MCC | Sn (%) | Sp (%) |
|---|---|---|---|---|
| PMeSa | 76.00 | 0.45 | 70.00 | 78.18 |
| BPB-PPMSb | 93.33 | 0.83 | 85.00 | 96.36 |
| iMethyl-PseAAC | 97.33 | 0.94 | 100.00 | 96.36 |
aFrom [23].
bFrom [20].
Comparison of iMethyl-PseAAC with the existing web-server predictors when tested for identifying Lys-methylation sites by the independent dataset (see Online Supporting Information S8).
| Predictor | Acc (%) | MCC | Sn (%) | Sp (%) |
|---|---|---|---|---|
| PMeSa | 65.00 | 0.35 | 78.57 | 57.69 |
| BPB-PPMSb | 70.00 | 0.36 | 64.29 | 73.08 |
| iMethyl-PseAAC | 75.00 | 0.60 | 100.00 | 61.54 |
aSee footnote a of Table 3.
bSee footnote b of Table 3.