| Literature DB >> 27314023 |
Ji-Yong An1, Fan-Rong Meng1, Zhu-Hong You2, Yu-Hong Fang1, Yu-Jun Zhao1, Ming Zhang1.
Abstract
We propose a novel computational method known as RVM-LPQ that combines the Relevance Vector Machine (RVM) model and Local Phase Quantization (LPQ) to predict PPIs from protein sequences. The main improvements are the results of representing protein sequences using the LPQ feature representation on a Position Specific Scoring Matrix (PSSM), reducing the influence of noise using a Principal Component Analysis (PCA), and using a Relevance Vector Machine (RVM) based classifier. We perform 5-fold cross-validation experiments on Yeast and Human datasets, and we achieve very high accuracies of 92.65% and 97.62%, respectively, which is significantly better than previous works. To further evaluate the proposed method, we compare it with the state-of-the-art support vector machine (SVM) classifier on the Yeast dataset. The experimental results demonstrate that our RVM-LPQ method is obviously better than the SVM-based method. The promising experimental results show the efficiency and simplicity of the proposed method, which can be an automatic decision support tool for future proteomics research.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27314023 PMCID: PMC4893571 DOI: 10.1155/2016/4783801
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The flow chart of the proposed method.
5-fold cross-validation results shown by using our proposed method on the Yeast dataset.
| Testing set | Ac (%) | Sn (%) | Pe (%) | MCC (%) |
|---|---|---|---|---|
| 1 | 92.76 | 92.73 | 92.79 | 86.56 |
| 2 | 93.79 | 93.27 | 93.41 | 88.34 |
| 3 | 91.28 | 92.12 | 90.43 | 84.08 |
| 4 | 92.27 | 92.02 | 92.50 | 85.72 |
| 5 | 93.17 | 93.02 | 93.32 | 87.27 |
| Average | 92.65 ± 0.95 | 92.63 ± 0.55 | 92.67 ± 1.40 | 86.40 ± 1.61 |
5-fold cross-validation results shown by using our proposed method on the Human dataset.
| Testing set | Ac (%) | Sn (%) | Pe (%) | MCC (%) |
|---|---|---|---|---|
| 1 | 98.10 | 98.99 | 97.25 | 96.27 |
| 2 | 97.67 | 99.49 | 96.02 | 95.45 |
| 3 | 97.37 | 99.25 | 95.55 | 94.87 |
| 4 | 97.24 | 98.96 | 95.72 | 94.63 |
| 5 | 99.26 | 99.22 | 99.31 | 98.54 |
| Average | 97.92 ± 0.81 | 99.18 ± 0.21 | 96.77 ± 1.57 | 95.95 ± 1.58 |
5-fold cross-validation results shown by using our proposed method on the Yeast dataset.
| Testing set | Ac (%) | Sn (%) | Sp (%) | MCC (%) |
|---|---|---|---|---|
| SVM + PSSM + LPQ | ||||
| 1 | 85.96 | 84.77 | 87.13 | 75.86 |
| 2 | 84.18 | 82.86 | 85.43 | 73.33 |
| 3 | 85.52 | 84.10 | 86.97 | 75.22 |
| 4 | 85.29 | 84.12 | 86.47 | 74.91 |
| 5 | 85.76 | 86.16 | 88.45 | 75.55 |
| Average | 85.34 ± 0.69 | 84.40 ± 1.20 | 86.89 ± 1.09 | 74.97 ± 0.98 |
|
| ||||
| RVM + PSSM + LPQ | ||||
| 1 | 92.76 | 92.73 | 92.79 | 86.56 |
| 2 | 93.79 | 93.27 | 93.41 | 88.34 |
| 3 | 91.28 | 92.12 | 90.43 | 84.08 |
| 4 | 92.27 | 92.02 | 92.50 | 85.72 |
| 5 | 93.17 | 93.02 | 93.32 | 87.27 |
|
|
|
|
|
|
Figure 2Comparison of ROC curves performed between RVM and SVM on the Yeast dataset.
Predicting ability of different methods on the Yeast dataset.
| Model | Testing set | Ac (%) | Sn (%) | Pe (%) | MCC (%) |
|---|---|---|---|---|---|
| Guo et al.'s work [ | ACC | 89.33 ± 2.67 | 89.93 ± 3.60 | 88.77 ± 6.16 | N/A |
| AC | 87.36 ± 1.38 | 87.30 ± 4.68 | 87.82 ± 4.33 | N/A | |
|
| |||||
| Zhou et al.'s work [ | SVM + LD | 88.56 ± 0.33 | 87.37 ± 0.22 | 89.50 ± 0.60 | 77.15 ± 0.68 |
|
| |||||
| Yang et al.'s work [ | Cod1 | 75.08 ± 1.13 | 75.81 ± 1.20 | 74.75 ± 1.23 | N/A |
| Cod2 | 80.04 ± 1.06 | 76.77 ± 0.69 | 82.17 ± 1.35 | N/A | |
| Cod3 | 80.41 ± 0.47 | 78.14 ± 0.90 | 81.66 ± 0.99 | N/A | |
| Cod4 | 86.15 ± 1.17 | 81.03 ± 1.74 | 90.24 ± 1.34 | N/A | |
|
| |||||
| You et al.'s work [ | PCA-EELM | 87.00 ± 0.29 | 86.15 ± 0.43 | 87.59 ± 0.32 | 77.36 ± 0.44 |
|
| |||||
|
|
|
|
|
|
|
Predicting ability of different methods on the Human dataset.
| Model | Ac (%) | Sn (%) | Pe (%) | MCC (%) |
|---|---|---|---|---|
| LDA + RF [ | 96.4 | 94.2 | N/A | 92.8 |
| LDA + RoF [ | 95.7 | 97.6 | N/A | 91.8 |
| LDA + SVM [ | 90.7 | 89.7 | N/A | 81.3 |
| AC + RF [ | 95.5 | 94.0 | N/A | 91.4 |
| AC + RoF [ | 95.1 | 93.3 | N/A | 91.0 |
| AC + SVM [ | 89.3 | 94.0 | N/A | 79.2 |
|
|
|
|
|
|