| Literature DB >> 25474679 |
Zhu-Hong You, Lin Zhu, Chun-Hou Zheng, Hong-Jie Yu, Su-Ping Deng, Zhen Ji.
Abstract
BACKGROUND: Identifying protein-protein interactions (PPIs) is essential for elucidating protein functions and understanding the molecular mechanisms inside the cell. However, the experimental methods for detecting PPIs are both time-consuming and expensive. Therefore, computational prediction of protein interactions are becoming increasingly popular, which can provide an inexpensive way of predicting the most likely set of interactions at the entire proteome scale, and can be used to complement experimental approaches. Although much progress has already been achieved in this direction, the problem is still far from being solved and new approaches are still required to overcome the limitations of the current prediction models.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25474679 PMCID: PMC4271571 DOI: 10.1186/1471-2105-15-S15-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of the prediction performance by the proposed method and some state-of-the-art works on the yeast dataset.
|
| Test set | |||||
|---|---|---|---|---|---|---|
| Our | 1 | 91.66 | 92.40 | 92.00 | 85.28 | 97.15 |
| 2 | 90.49 | 92.01 | 91.11 | 83.79 | 96.91 | |
| 3 | 91.13 | 91.05 | 91.19 | 83.94 | 96.97 | |
| 4 | 90.41 | 91.48 | 91.01 | 83.64 | 97.07 | |
| 5 | 89.64 | 92.76 | 91.47 | 84.38 | 97.23 | |
| Average | ||||||
| Guos' work | ACC | 89.93 ± 3.68 | 88.87 ± 6.16 | 89.33 ± 2.67 | N/A | N/A |
| AC | 87.30 ± 4.68 | 87.82 ± 4.33 | 87.36 ± 1.38 | N/A | N/A | |
| Zhous' | SVM+LD | 87.37 ± 0.22 | 89.50 ± 0.60 | 88.56 ± 0.33 | 77.15 ± 0.68 | 95.07 ± 0.39 |
| Yangs' | Cod1 | 75.81 ± 1.20 | 74.75 ± 1.23 | 75.08 ± 1.13 | N/A | N/A |
| Cod2 | 76.77 ± 0.69 | 82.17 ± 1.35 | 80.04 ± 1.06 | N/A | N/A | |
| Cod3 | 78.14 ± 0.90 | 81.86 ± 0.99 | 80.41 ± 0.47 | N/A | N/A | |
| Cod4 | 81.03 ± 1.74 | 90.24 ± 1.34 | 86.15 ± 1.17 | N/A | N/A | |
Here, N/A means not available.
Performance comparison of different methods on the H
| Methods | SN (%) | PE (%) | ACC (%) | MCC (%) |
|---|---|---|---|---|
| Phylogenetic bootstrap | 69.8 | 80.2 | 75.8 | N/A |
| Ensemble of HKNN | 86.7 | 85 | N/A | |
| Signature products | 79.9 | 85.7 | 83.4 | N/A |
| Boosting | 80.37 | 81.69 | 79.52 | 70.64 |
| HKNN | 84 | 84 | N/A | |
| Proposed method | 83.24 | 84.91 |
Figure 1The Schematic diagram for constructing continuous and discontinuous descriptor regions for a hypothetical protein sequence using 4-bit binary form. Each protein sequence is divided into 14 (2^4-2) sub-sequences (regions S1 - S14) of varying length to represent multiple overlapping continuous or discontinuous segments.
Division of amino acids into seven groups based on the dipoles and volumes of the side chains.
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | Group 6 | Group 7 |
|---|---|---|---|---|---|---|
| A,G,V | C | M,S,T,Y | F,I,L,P | H,N,Q,W | K,R | C |
Figure 2Sequence of a hypothetic protein indicating the construction of composition, transition and distribution descriptors of a protein region.