| Literature DB >> 31881833 |
Zhan-Heng Chen1,2, Zhu-Hong You3,4, Li-Ping Li1, Yan-Bin Wang1, Yu Qiu1,2, Peng-Wei Hu5.
Abstract
BACKGROUND: Identification of protein-protein interactions (PPIs) is crucial for understanding biological processes and investigating the cellular functions of genes. Self-interacting proteins (SIPs) are those in which more than two identical proteins can interact with each other and they are the specific type of PPIs. More and more researchers draw attention to the SIPs detection, and several prediction model have been proposed, but there are still some problems. Hence, there is an urgent need to explore a efficient computational model for SIPs prediction.Entities:
Keywords: Finite impulse response filter; PSSM; Random projection; Self-interacting proteins
Mesh:
Substances:
Year: 2019 PMID: 31881833 PMCID: PMC6933882 DOI: 10.1186/s12864-019-6301-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Results measured by RP-FIRF method on human dataset with 5-fold cross-validation
| Testing set | Acc (%) | Sen (%) | PE (%) | MCC (%) |
|---|---|---|---|---|
| 1 | 98.10 | 76.84 | 100.00 | 86.77 |
| 2 | 97.76 | 74.51 | 100.00 | 85.28 |
| 3 | 97.70 | 71.63 | 100.00 | 83.59 |
| 4 | 98.01 | 73.05 | 100.00 | 84.57 |
| 5 | 97.87 | 76.28 | 100.00 | 86.34 |
| Average | 97.89 ± 0.17 | 74.46 ± 2.18 | 100.00 ± 0.00 | 85.31 ± 1.29 |
Results measured by RP-FIRF method on yeast dataset with 5-fold cross-validation
| Testing set | Acc (%) | Sen (%) | PE (%) | MCC (%) |
|---|---|---|---|---|
| 1 | 97.43 | 78.01 | 99.10 | 86.65 |
| 2 | 97.35 | 77.08 | 100.00 | 86.51 |
| 3 | 97.35 | 75.57 | 99.00 | 85.22 |
| 4 | 97.51 | 78.32 | 100.00 | 87.28 |
| 5 | 97.11 | 76.16 | 100.00 | 85.87 |
| Average | 97.35 ± 0.15 | 77.03 ± 1.17 | 99.62 ± 0.52 | 86.31 ± 0.79 |
Comparison results of RP and SVM with FIRF feature vectors on yeast dataset
| Testing set | Acc (%) | Sen (%) | PE (%) | MCC (%) |
|---|---|---|---|---|
| SVM + PSSM+FIRF | ||||
| 1 | 92.36 | 32.62 | 100.00 | 54.81 |
| 2 | 89.15 | 6.25 | 100.00 | 23.59 |
| 3 | 94.21 | 45.04 | 100.00 | 65.04 |
| 4 | 93.65 | 44.76 | 100.00 | 64.62 |
| 5 | 92.21 | 35.76 | 100.00 | 57.31 |
| Average | 92.32 ± 1.96 | 32.89 ± 15.86 | 100.00 ± 0.00 | 53.07 ± 17.08 |
| RP + PSSM+FIRF | ||||
| 1 | 97.43 | 78.01 | 99.10 | 86.65 |
| 2 | 97.35 | 77.08 | 100.00 | 86.51 |
| 3 | 97.35 | 75.57 | 99.00 | 85.22 |
| 4 | 97.51 | 78.32 | 100.00 | 87.28 |
| 5 | 97.11 | 76.16 | 100.00 | 85.87 |
| Average | 97.35 ± 0.15 | 77.03 ± 1.17 | 99.62 ± 0.52 | 86.31 ± 0.79 |
Comparison results of RP and SVM with FIRF feature vectors on human dataset
| Testing set | Acc (%) | Sen (%) | PE (%) | MCC (%) |
|---|---|---|---|---|
| SVM + PSSM+FIRF | ||||
| 1 | 96.32 | 55.09 | 100.00 | 72.78 |
| 2 | 95.94 | 53.92 | 100.00 | 71.85 |
| 3 | 96.37 | 55.32 | 100.00 | 72.95 |
| 4 | 96.78 | 56.25 | 100.00 | 73.73 |
| 5 | 95.66 | 51.60 | 100.00 | 70.18 |
| Average | 96.21 ± 0.43 | 54.44 ± 1.79 | 100.00 ± 0.00 | 72.30 ± 1.36 |
| RP + PSSM+FIRF | ||||
| 1 | 98.10 | 76.84 | 100.00 | 86.77 |
| 2 | 97.76 | 74.51 | 100.00 | 85.28 |
| 3 | 97.70 | 71.63 | 100.00 | 83.59 |
| 4 | 98.01 | 73.05 | 100.00 | 84.57 |
| 5 | 97.87 | 76.28 | 100.00 | 86.34 |
| Average | 97.89 ± 0.17 | 74.46 ± 2.18 | 100.00 ± 0.00 | 85.31 ± 1.29 |
Fig. 1Comparison of ROC curves between RP and SVM on human dataset
Fig. 2Comparison of ROC curves between RP and SVM on yeast dataset
Performance results between RP-FIRF model and the other methods on yeast dataset
| Model | Acc (%) | Sp (%) | Sen (%) | MCC (%) | AUC |
|---|---|---|---|---|---|
| SLIPPER [ | 71.90 | 72.18 | 69.72 | 28.42 | 0.7723 |
| DXECPPI [ | 87.46 | 94.93 | 29.44 | 28.25 | 0.6934 |
| PPIevo [ | 66.28 | 87.46 | 60.14 | 18.01 | 0.6728 |
| LocFuse [ | 66.66 | 68.10 | 55.49 | 15.77 | 0.7087 |
| CRS [ | 72.69 | 74.37 | 59.58 | 23.68 | 0.7115 |
| SPAR [ | 76.96 | 80.02 | 53.24 | 24.84 | 0.7455 |
| Proposed method | 97.35 | 99.96 | 77.03 | 86.31 | 0.8896 |
Performance results between RP-FIRF model and the other methods on human dataset
| Model | Acc (%) | Sp (%) | Sen (%) | MCC (%) | AUC |
|---|---|---|---|---|---|
| SLIPPER [ | 91.10 | 95.06 | 47.26 | 41.97 | 0.8723 |
| DXECPPI [ | 30.90 | 25.83 | 87.08 | 8.25 | 0.5806 |
| PPIevo [ | 78.04 | 25.82 | 87.83 | 20.82 | 0.7329 |
| LocFuse [ | 80.66 | 80.50 | 50.83 | 20.26 | 0.7087 |
| CRS [ | 91.54 | 96.72 | 34.17 | 36.33 | 0.8196 |
| SPAR [ | 92.09 | 97.40 | 33.33 | 38.36 | 0.8229 |
| Proposed method | 97.89 | 100.00 | 74.46 | 85.31 | 0.8842 |
Confusion Matrix
| Predict | |||
|---|---|---|---|
| Negative | Positive | ||
| Actual | Negative | TN | FN |
| Positive | FP | TP | |