| Literature DB >> 28155718 |
Yu-An Huang1, Zhu-Hong You2, Xing Chen3, Gui-Ying Yan4.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) are essential to most biological processes. Since bioscience has entered into the era of genome and proteome, there is a growing demand for the knowledge about PPI network. High-throughput biological technologies can be used to identify new PPIs, but they are expensive, time-consuming, and tedious. Therefore, computational methods for predicting PPIs have an important role. For the past years, an increasing number of computational methods such as protein structure-based approaches have been proposed for predicting PPIs. The major limitation in principle of these methods lies in the prior information of the protein to infer PPIs. Therefore, it is of much significance to develop computational methods which only use the information of protein amino acids sequence.Entities:
Keywords: Continuous wavelet transform; Protein sequence; Protein-protein interactions; Sparse representation based classifier
Mesh:
Year: 2016 PMID: 28155718 PMCID: PMC5260127 DOI: 10.1186/s12918-016-0360-6
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
5-fold cross validation result obtained in predicting Yeast PPIs dataset
| Test set | Accu.(%) | Prec.(%) | Sen.(%) | MCC(%) | AUC(%) |
|---|---|---|---|---|---|
| 1 | 93.43 | 96.98 | 89.93 | 87.70 | 97.70 |
| 2 | 92.27 | 95.01 | 89.39 | 85.71 | 96.99 |
| 3 | 92.36 | 96.62 | 87.64 | 85.81 | 97.39 |
| 4 | 92.62 | 95.65 | 89.19 | 89.30 | 97.09 |
| 5 | 91.83 | 95.10 | 87.95 | 84.94 | 96.80 |
| Average | 92.50 ± 0.59 | 95.87 ± 0.89 | 88.82 ± 0.98 | 86.09 ± 1.02 | 97.20 ± 0.35 |
5-fold cross validation result obtained in predicting H.pylori PPIs dataset
| Test set | Accu.(%) | Prec.(%) | Sen.(%) | MCC(%) | AUC(%) |
|---|---|---|---|---|---|
| 1 | 85.03 | 82.18 | 90.67 | 74.28 | 92.36 |
| 2 | 83.30 | 78.25 | 91.12 | 71.91 | 91.33 |
| 3 | 84.34 | 80.00 | 90.46 | 73.44 | 91.84 |
| 4 | 84.17 | 82.99 | 89.27 | 72.83 | 92.04 |
| 5 | 84.59 | 78.85 | 91.11 | 73.79 | 91.96 |
| Average | 84.28 ± 0.64 | 80.45 ± 2.07 | 90.54 ± 0.77 | 73.25 ± 0.92 | 91.91 ± 0.37 |
Fig. 1The flowchart for the feature extraction process
Fig. 2ROC curves from proposed method result for Saccharomyces cerevisiae PPIs dataset
Fig. 3ROC curves from proposed method result for H.pylori PPIs dataset
5-fold cross validation result obtained in predicting Human PPIs dataset
| Classification model | Testing set | Accu.(%) | Prec.(%) | Sen.(%) | MCC(%) |
|---|---|---|---|---|---|
| WSRC | 1 | 95.53 | 99.14 | 91.17 | 91.35 |
| 2 | 95.89 | 98.61 | 92.59 | 92.06 | |
| 3 | 95.22 | 99.19 | 91.09 | 90.86 | |
| 4 | 95.83 | 98.74 | 92.31 | 91.94 | |
| 5 | 95.22 | 99.04 | 91.08 | 90.85 | |
| Average | 95.54 ± 0.32 | 98.95 ± 0.25 | 91.65 ± 0.74 | 91.41 ± 0.58 | |
| SVM | 1 | 87.68 | 87.60 | 85.64 | 78.26 |
| 2 | 87.56 | 88.04 | 85.18 | 78.10 | |
| 3 | 87.68 | 88.66 | 86.14 | 78.38 | |
| 4 | 90.07 | 89.54 | 89.31 | 82.05 | |
| 5 | 87.63 | 89.92 | 84.05 | 78.23 | |
| Average | 88.13 ± 1.09 | 88.75 ± 0.98 | 86.06 ± 1.97 | 79.00 ± 1.71 |
Fig. 4ROC curves from proposed method result for Human PPIs dataset
Fig. 5ROC curves from SVM-based method result for Human PPIs dataset
Performance comparison of different methods on the Yeast dataset
| Model | Method | Accu.(%) | Prec.(%) | Sen.(%) | MCC(%) |
|---|---|---|---|---|---|
| Guos’ work [ | ACC | 89.33 ± 2.67 | 88.87 ± 6.16 | 89.93 ± 3.68 | N/A |
| AC | 87.36 ± 1.38 | 87.82 ± 4.33 | 87.30 ± 4.68 | N/A | |
| Zhous’ work [ | SVM + LD | 88.56 ± 0.33 | 89.50 ± 0.60 | 87.37 ± 0.22 | 77.15 ± 0.68 |
| Yangs’ work [ | Cod1 | 75.08 ± 1.13 | 74.75 ± 1.23 | 75.81 ± 1.20 | N/A |
| Cod2 | 80.04 ± 1.06 | 82.17 ± 1.35 | 76.77 ± 0.69 | N/A | |
| Cod3 | 80.41 ± 0.47 | 81.86 ± 0.99 | 78.14 ± 0.90 | N/A | |
| Cod4 | 86.15 ± 1.17 | 90.24 ± 1.34 | 81.03 ± 1.74 | N/A | |
| Proposed method | WSRC | 92.50 ± 0.59 | 95.87 ± 0.89 | 88.82 ± 0.98 | 86.09 ± 1.02 |
Performance comparison of different methods on the H.pylori dataset
| Model | Accu.(%) | Prec.(%) | Sen.(%) | MCC(%) |
|---|---|---|---|---|
| Phylogenetic booststrap [ | 75.80 | 80.20 | 69.80 | N/A |
| HKNN [ | 84.00 | 84.00 | 86.00 | N/A |
| Signature products [ | 83.40 | 85.70 | 79.90 | N/A |
| Boosting [ | 79.52 | 81.69 | 80.37 | 70.64 |
| Proposed method | 84.28 | 80.45 | 90.54 | 73.25 |