| Literature DB >> 35307006 |
Michela Quadrini1,2, Sebastian Daberdaku3,4, Carlo Ferrari3.
Abstract
BACKGROUND: Protein-protein interactions have pivotal roles in life processes, and aberrant interactions are associated with various disorders. Interaction site identification is key for understanding disease mechanisms and design new drugs. Effective and efficient computational methods for the PPI prediction are of great value due to the overall cost of experimental methods. Promising results have been obtained using machine learning methods and deep learning techniques, but their effectiveness depends on protein representation and feature selection.Entities:
Keywords: Graph convolutional networks; Hierarchical representation; Protein–protein interaction
Mesh:
Substances:
Year: 2022 PMID: 35307006 PMCID: PMC8934516 DOI: 10.1186/s12859-022-04624-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
HQI8 indices
| Entry name | Description |
|---|---|
| BLAM930101 | Alpha helix propensity of position 44 in T4 lysozyme |
| BIOV880101 | Information value for accessibility; average fraction 35% |
| MAXF760101 | Normalized frequency of alpha-helix |
| TSAJ990101 | Volumes including the crystallographic waters using the ProtOr |
| NAKH920108 | AA composition of MEM of multi-spanning proteins |
| CEDJ970104 | Composition of amino acids in intracellular protein (percent) |
| LIFS790101 | Conformational preference for all beta-strands |
| MIYS990104 | Optimized relative partition energies - method C |
Fig. 1Adjacency graph construction for a given set of proteins
Fig. 2PPI interface residue classification with a semi-supervised GCN framework
Measures of F1 score, classification accuracy, precision, recall, MCC and ROC-AUC obtained on the test set of the ligands classes
| A class—ligands | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F1 | Accuracy | Precision | Recall | MCC | AUC-ROC | |||||||
| b | u | b | u | b | u | b | u | b | u | b | u | |
| Residue Sequence | 0.208 | 0.271 | 0.731 | 0.480 | 0.180 | 0.170 | 0.261 | 0.749 | 0.054 | 0.118 | 0.588 | 0.613 |
| Contact Map 6 Å | 0.266 | 0.259 | 0.297 | 0.302 | 0.158 | 0.155 | 0.919 | 0.943 | 0.074 | 0.108 | 0.704 | |
| Contact Map 8 Å | 0.254 | 0.238 | 0.201 | 0.180 | 0.149 | 0.140 | 0.981 | 0.979 | 0.027 | 0.019 | 0.649 | 0.713 |
| Contact Map 10 Å | 0.252 | 0.240 | 0.199 | 0.224 | 0.148 | 0.141 | 0.966 | 0.939 | 0.028 | 0.030 | 0.625 | 0.697 |
| Contact Map 12 Å | 0.259 | 0.252 | 0.299 | 0.289 | 0.154 | 0.151 | 0.848 | 0.883 | 0.036 | 0.060 | 0.589 | 0.664 |
| Hierarchical Representation 6Å | 0.282 | 0.264 | 0.445 | 0.344 | 0.174 | 0.159 | 0.806 | 0.918 | 0.127 | 0.117 | 0.635 | 0.661 |
| Hierarchical Representation 8Å | 0.272 | 0.243 | 0.340 | 0.192 | 0.163 | 0.143 | 0.896 | 0.975 | 0.099 | 0.028 | 0.633 | 0.670 |
| Hierarchical Representation 10Å | 0.272 | 0.243 | 0.380 | 0.204 | 0.164 | 0.143 | 0.839 | 0.978 | 0.091 | 0.038 | 0.624 | 0.685 |
| Hierarchical Representation 12Å | 0.267 | 0.237 | 0.340 | 0.161 | 0.160 | 0.139 | 0.850 | 0.983 | 0.060 | 0.003 | 0.611 | 0.684 |
| Daberdaku | 0.093 | 0.097 | 0.811 | 0.059 | 0.067 | 0.052 | 0.182 | 0.987 | 0.019 | -0.016 | 0.538 | 0.473 |
| SPPIDER | 0.630 | 0.575 | ||||||||||
| NPS-HomPPI | 0.610 | 0.626 | ||||||||||
| PrISE | 0.622 | 0.569 | ||||||||||
Measures of F1 score, classification accuracy, precision, recall, MCC and ROC-AUC obtained on the test set of the receptors classes
| A class—receptors | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F1 | Accuracy | Precision | Recall | MCC | AUC-ROC | |||||||
| Residue Sequence | 0.328 | 0.345 | 0.851 | 0.857 | 0.243 | 0.265 | 0.534 | 0.548 | 0.287 | 0.306 | 0.845 | 0.835 |
| Contact Map 6 Å | 0.622 | 0.629 | 0.947 | 0.930 | 0.631 | 0.526 | 0.665 | 0.864 | 0.608 | 0.631 | 0.975 | 0.980 |
| Contact Map 8 Å | 0.630 | 0.620 | 0.950 | 0.925 | 0.658 | 0.502 | 0.660 | 0.889 | 0.620 | 0.624 | 0.974 | 0.978 |
| Contact Map 10 Å | 0.578 | 0.628 | 0.920 | 0.940 | 0.467 | 0.602 | 0.823 | 0.724 | 0.576 | 0.617 | 0.970 | 0.975 |
| Contact Map 12 Å | 0.552 | 0.580 | 0.932 | 0.920 | 0.523 | 0.483 | 0.645 | 0.801 | 0.535 | 0.574 | 0.966 | 0.967 |
| Hierarchical Representation 6Å | 0.657 | 0.592 | 0.952 | 0.910 | 0.667 | 0.447 | 0.701 | 0.951 | 0.647 | 0.609 | 0.971 | 0.978 |
| Hierarchical Representation 8Å | 0.647 | 0.665 | 0.950 | 0.945 | 0.663 | 0.608 | 0.690 | 0.803 | 0.637 | 0.659 | 0.976 | 0.980 |
| Hierarchical Representation 10Å | 0.619 | 0.632 | 0.952 | 0.927 | 0.718 | 0.510 | 0.595 | 0.907 | 0.615 | 0.638 | 0.975 | 0.978 |
| Hierarchical Representation 12Å | 0.619 | 0.604 | 0.948 | 0.918 | 0.656 | 0.474 | 0.647 | 0.910 | 0.611 | 0.612 | 0.975 | 0.977 |
| Daberdaku | 0.272 | 0.274 | 0.862 | 0.876 | 0.166 | 0.169 | 0.917 | 0.883 | 0.346 | 0.341 | 0.954 | 0.939 |
| SPPIDER | 0.773 | 0.754 | ||||||||||
| NPS-HomPPI | 0.796 | 0.780 | ||||||||||
| PrISE | 0.770 | 0.758 | ||||||||||
Number of complexes and in each class of the Dataset
| Class | Data partition | Complex | Positive | Negative | ||
|---|---|---|---|---|---|---|
| b (%) | u (%) | b (%) | u (%) | |||
| Train | 8 | 8 | 9 | 92 | 91 | |
| Validation | 3 | 9 | 10 | 91 | 90 | |
| Test | 7 | 8 | 10 | 92 | 90 | |
| Train | 9 | 14 | 16 | 86 | 84 | |
| Validation | 3 | 14 | 15 | 86 | 85 | |
| Test | 8 | 14 | 15 | 86 | 85 | |
| Train | 4 | 9 | 11 | 91 | 89 | |
| Validation | 3 | 8 | 9 | 92 | 91 | |
| Test | 5 | 8 | 9 | 92 | 91 | |
| Train | 4 | 13 | 13 | 87 | 87 | |
| Validation | 3 | 14 | 13 | 86 | 87 | |
| Test | 4 | 16 | 16 | 84 | 84 | |
| Train | 18 | 15 | 15 | 85 | 85 | |
| Validation | 12 | 15 | 16 | 85 | 84 | |
| Test | 14 | 15 | 16 | 85 | 84 | |
| Train | 16 | 29 | 33 | 71 | 67 | |
| Validation | 12 | 32 | 34 | 68 | 66 | |
| Test | 16 | 30 | 32 | 70 | 68 | |
| Train | 14 | 13 | 13 | 87 | 87 | |
| Validation | 3 | 12 | 12 | 88 | 88 | |
| Test | 9 | 11 | 11 | 89 | 89 | |
| Train | 10 | 20 | 21 | 80 | 79 | |
| Validation | 5 | 26 | 22 | 74 | 78 | |
| Test | 11 | 25 | 22 | 75 | 78 | |
| Train | 7 | 9 | 12 | 91 | 88 | |
| Validation | 3 | 10 | 12 | 90 | 88 | |
| Test | 7 | 11 | 12 | 89 | 88 | |
| Train | 7 | 25 | 21 | 75 | 79 | |
| Validation | 3 | 25 | 23 | 75 | 77 | |
| Test | 6 | 22 | 21 | 78 | 79 | |
| Train | 8 | 9 | 9 | 91 | 91 | |
| Validation | 3 | 10 | 9 | 90 | 91 | |
| Test | 8 | 12 | 12 | 88 | 88 | |
| Train | 9 | 24 | 24 | 76 | 76 | |
| Validation | 2 | 22 | 21 | 78 | 79 | |
| Test | 7 | 19 | 20 | 81 | 80 | |
| Train | 10 | 14 | 13 | 86 | 87 | |
| Validation | 4 | 12 | 11 | 88 | 89 | |
| Test | 9 | 13 | 14 | 87 | 86 | |
| Train | 9 | 23 | 23 | 77 | 77 | |
| Validation | 5 | 21 | 21 | 79 | 79 | |
| Test | 10 | 23 | 24 | 77 | 76 | |
| Train | 17 | 16 | 15 | 84 | 85 | |
| Validation | 11 | 15 | 15 | 85 | 85 | |
| Test | 19 | 14 | 13 | 86 | 87 | |
| Train | 16 | 18 | 18 | 82 | 82 | |
| Validation | 14 | 19 | 20 | 81 | 80 | |
| Test | 20 | 20 | 21 | 80 | 81 | |
Positive examples are residue pairs that participate in the interface, negative examples are pairs that do not
Fig. 3Average Receiver Operating Characteristic curve comparison of the proposed PPI interface prediction method by using hierarchical representation, contact map and sequence as protein representation with different thresholds (6Å, 8Å, 10Å, 12Å) for each protein class
Fig. 4Average Precision-Recall curve comparison of the proposed PPI interface prediction method by using hierarchical representation, contact map and sequence as protein representation with different thresholds (6Å, 8Å, 10Å, 12Å) for each protein class
Measures of F1 score, classification accuracy, precision, recall, MCC and ROC-AUC obtained on the test set of the ligands classes
| F1 | Accuracy | Precision | Recall | MCC | AUC-ROC | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| b | u | b | u | b | u | b | u | b | u | b | u | |
| 2-layer architecture—ligands | ||||||||||||
| Contact Map 6 Å | 0.390 | 0.364 | 0.247 | 0.232 | 0.247 | 0.229 | 1.00 | 1.00 | 0.00 | 0.014 | 0.836 | 0.831 |
| Contact Map 8 Å | 0.390 | 0.356 | 0.247 | 0.230 | 0.247 | 0.228 | 1.00 | 1.00 | 0.00 | 0.010 | 0.768 | 0.779 |
| Contact Map 10 Å | 0.390 | 0.366 | 0.247 | 0.235 | 0.247 | 0.229 | 1.00 | 1.00 | 0.00 | 0.012 | 0.811 | 0.765 |
| Contact Map 12 Å | 0.390 | 0.364 | 0.247 | 0.228 | 0.247 | 0.228 | 1.00 | 1.00 | 0.00 | 0.00 | 0.745 | 0.662 |
| Hierarchical Representation 6Å | 0.390 | 0.385 | 0.247 | 0.290 | 0.247 | 0.243 | 1.00 | 1.00 | 0.00 | 0.09 | 0.790 | 0.779 |
| Hierarchical Representation 8Å | 0.390 | 0.382 | 0.247 | 0.283 | 0.247 | 0.241 | 1.00 | 1.00 | 0.00 | 0.08 | 0.830 | 0.815 |
| Hierarchical Representation 10Å | 0.390 | 0.375 | 0.247 | 0.263 | 0.247 | 0.239 | 1.00 | 1.00 | 0.00 | 0.06 | 0.852 | 0.858 |
| Hierarchical Representation 12Å | 0.390 | 0.364 | 0.247 | 0.228 | 0.247 | 0.228 | 1.00 | 1.00 | 0.00 | 0.00 | 0.871 | 0.861 |