| Literature DB >> 23815620 |
Zhu-Hong You1, Ying-Ke Lei, Lin Zhu, Junfeng Xia, Bing Wang.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) play crucial roles in the execution of various cellular processes and form the basis of biological mechanisms. Although large amount of PPIs data for different species has been generated by high-throughput experimental techniques, current PPI pairs obtained with experimental methods cover only a fraction of the complete PPI networks, and further, the experimental methods for identifying PPIs are both time-consuming and expensive. Hence, it is urgent and challenging to develop automated computational methods to efficiently and accurately predict PPIs.Entities:
Mesh:
Year: 2013 PMID: 23815620 PMCID: PMC3654889 DOI: 10.1186/1471-2105-14-S8-S10
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The prediction performance comparison of PCA-EELM with PCA-SVM
| Classification Model | Test set | Sens. (%) | Prec. (%) | Accu. (%) | MCC (%) | Testing Time (Seconds) |
|---|---|---|---|---|---|---|
| PCA-EELM | 1 | 86.60 | 87.32 | 87.03 | 77.42 | 45.1482 |
| 2 | 85.97 | 87.34 | 86.77 | 77.04 | 45.4615 | |
| 3 | 85.95 | 87.68 | 86.95 | 77.30 | 46.7825 | |
| 4 | 86.60 | 88.10 | 87.47 | 78.07 | 43.2015 | |
| 5 | 85.64 | 87.52 | 86.73 | 76.97 | 44.1237 | |
| Average | ||||||
| PCA-SVM | 1 | 81.76 | 82.86 | 82.16 | 70.69 | 52.6035 |
| 2 | 85.77 | 81.65 | 83.37 | 72.25 | 51.7143 | |
| 3 | 87.52 | 80.67 | 83.10 | 71.78 | 51.7143 | |
| 4 | 85.77 | 82.44 | 83.77 | 72.79 | 51.8547 | |
| 5 | 85.48 | 79.09 | 81.74 | 70.09 | 51.4335 | |
| Average | . | |||||
Performance comparison of different methods on the H.pylori dataset. Here, N/A means not available.
| Methods | SN (%) | PE (%) | ACC (%) | MCC (%) |
|---|---|---|---|---|
| Phylogenetic bootstrap | 69.8 | 80.2 | 75.8 | N/A |
| HKNN | 86 | 84 | 84 | N/A |
| Signature products | 79.9 | 85.7 | 83.4 | N/A |
| Ensemble of HKNN | 86.7 | 85 | 86.6 | N/A |
| Boosting | 80.37 | 81.69 | 79.52 | 70.64 |
| Proposed method |
Figure 1The architecture of the proposed PCA-EELM protein interaction prediction method.
The original values of the six physicochemical properties for each amino acid
| Amino acid | H | VSC | P1 | P2 | SASA | NCISC |
|---|---|---|---|---|---|---|
| A | 0.62 | 27.5 | 8.1 | 0.046 | 1.181 | 0.007187 |
| C | 0.29 | 44.6 | 5.5 | 0.128 | 1.461 | -0.03661 |
| D | -0.9 | 40 | 13 | 0.105 | 1.587 | -0.02382 |
| E | -0.74 | 62 | 12.3 | 0.151 | 1.862 | 0.006802 |
| F | 1.19 | 115.5 | 5.2 | 0.29 | 2.228 | 0.037552 |
| G | 0.48 | 0 | 9 | 0 | 0.881 | 0.179052 |
| H | -0.4 | 79 | 10.4 | 0.23 | 2.025 | -0.01069 |
| I | 1.38 | 93.5 | 5.2 | 0.186 | 1.81 | 0.021631 |
| K | -1.5 | 100 | 11.3 | 0.219 | 2.258 | 0.017708 |
| L | 1.06 | 93.5 | 4.9 | 0.186 | 1.931 | 0.051672 |
| M | 0.64 | 94.1 | 5.7 | 0.221 | 2.034 | 0.002683 |
| N | -0.78 | 58.7 | 11.6 | 0.134 | 1.655 | 0.005392 |
| P | 0.12 | 41.9 | 8 | 0.131 | 1.468 | 0.239531 |
| Q | -0.85 | 80.7 | 10.5 | 0.18 | 1.932 | 0.049211 |
| R | -2.53 | 105 | 10.5 | 0.291 | 2.56 | 0.043587 |
| S | -0.18 | 29.3 | 9.2 | 0.062 | 1.298 | 0.004627 |
| T | -0.05 | 51.3 | 8.6 | 0.108 | 1.525 | 0.003352 |
| V | 1.08 | 71.5 | 5.9 | 0.14 | 1.645 | 0.057004 |
| W | 0.81 | 145.5 | 5.4 | 0.409 | 2.663 | 0.037977 |
| Y | 0.26 | 117.3 | 6.2 | 0.298 | 2.368 | 0.023599 |
H, hydrophobicity; VSC, volume of side chains; P1, polarity; P2, polarizability;
SASA, solvent accessible surface area; NCISC, net charge index of side chains
Division of amino acids based on the dipoles and volumes of the side chains
| Group | |
|---|---|
| 1 | A, G, V |
| 2 | C |
| 3 | D, E |
| 4 | F, I, L, P |
| 5 | H, N, Q, W |
| 6 | K, R |
| 7 | M, S, T, Y |
Figure 2The 10 regions (A-J) used by the Local Descriptor technique for a theoretical protein sequence. The regions A-D and E-F are obtained by dividing the entire sequence into four equal regions and into two equal regions respectively. Region G represents the central 50% of the sequence. Regions H, I and J are the first, final and central 75% of the sequence.
Figure 3The structure of ELM model