| Literature DB >> 26681483 |
Yerukala Sathipati Srinivasulu, Jyun-Rong Wang, Kai-Ti Hsu, Ming-Ju Tsai, Phasit Charoenkwan, Wen-Lin Huang, Hui-Ling Huang, Shinn-Ying Ho.
Abstract
BACKGROUND: Protein-protein interactions (PPIs) are involved in various biological processes, and underlying mechanism of the interactions plays a crucial role in therapeutics and protein engineering. Most machine learning approaches have been developed for predicting the binding affinity of protein-protein complexes based on structure and functional information. This work aims to predict the binding affinity of heterodimeric protein complexes from sequences only.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26681483 PMCID: PMC4682391 DOI: 10.1186/1471-2105-16-S18-S14
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Performance results of SVM-BAC using 162 training and 54 test complexes.
| Method | Training | SEN | SPE | AUC | Test | SEN | SPE | AUC |
|---|---|---|---|---|---|---|---|---|
| SVM-BAC | 85.80% | 0.888 | 0.827 | 0.86 | 83.33% (54) | 0.888 | 0.777 | 0.82 |
| SMO [ | 76.1% | 0.756 | 0.767 | 0.76 | 83.3% (30) | 0.813 | 0.857 | 0.84 |
SEN (Sensitivity), SPE (Specificity) and AUC (Area under the ROC curve)
Figure 1ROC curve for the SVM-BAC performance evaluation. The area under the ROC curve (AUC) is 0.86 using the training dataset.
The 14 physicochemical properties identified by SVM-BAC.
| Rank | Aaindex_ID | Description | MED |
|---|---|---|---|
| 1 | GUYH850105 | Apparent partition energies calculated from Chothia index [ | 35.18 |
| 2 | SNEP660104 | Principal component IV [ | 32.71 |
| 3 | RACS820113 | Value of theta (i) (Rackovsky-Scheraga, 1982) [ | 31.48 |
| 4 | MITS020101 | Amphiphilicity index (Mitaku et al., 2002) [ | 31.48 |
| 5 | MAXF760103 | Normalized frequency of zeta R (Maxfield-Scheraga, 1976) [ | 27.77 |
| 6 | CIDH920104 | Normalized hydrophobicity scales for alpha/beta-proteins (Cid et al., 1992) [ | 19.13 |
| 7 | AURR980119 | Normalized positional residue frequency at helix termini C"' (Aurora-Rose, 1998) [ | 17.90 |
| 8 | TANS770103 | Normalized frequency of extended structure (Tanaka-Scheraga, 1977) [ | 16.66 |
| 9 | CHOP780101 | Normalized frequency of beta turn (Chou-Fasman,1978a) [ | 12.96 |
| 10 | PALJ810107 | Normalized frequency of alpha-helix in all-alpha class (Palau et al., 1981) [ | 12.96 |
| 11 | QIAN880116 | Weights for beta-sheet at the window position of -4 (Qian-Sejnowski, 1988) [ | 12.96 |
| 12 | PALJ810110 | Normalized frequency of beta-sheet in all-beta class (Palau et al., 1981) [ | 10.49 |
| 13 | TAKK010101 | Side-chain contribution to protein stability (kJ/mol) (Takano-Yutani, 2001) [ | 9.25 |
| 14 | Nm-Protein | Average medium-range contacts folding [ | 4.32 |
Figure 2Difference accuracies of individual physicochemical properties using knock-out analysis.
Estimation performance of SVR using Jackknife test on 200 heterodimeric complexes
| Estimation method | Features (sequence descriptors) | Coefficient correlation ( | Mean absolute error ( |
|---|---|---|---|
| SVR | 14 | 0.34 | 1.4 |
Figure 3Estimation performance of jackknife test using the SVR-based method for 200 heterodimeric complexes.
Figure 4Mean absolute error of .
Amino acid composition (AAC) of high binding affinity (HBA) and low binding affinity (LBA) complexes and three physicochemical properties.
| Amino acid | HBA_AAC (%) | LBA_AAC (%) | Composition difference (%) | aGUYH850105 | bSNEP660104 | cCHOP780101 |
|---|---|---|---|---|---|---|
| Ala | 7 | 6.8 | 0.2 | -0.27 | -0.062 | 0.66 |
| Arg | 4.2 | 4.8 | -0.6 | 2 | -0.167 | 0.95 |
| Asn | 4.8 | 4.5 | 0.3 | 0.61 | 0.166 | 1.56 |
| Asp | 5.2 | 5.9 | -0.7 | 0.5 | -0.079 | 1.46 |
| Cys | 2.6 | 1.7 | 0.9 | -0.23 | 0.38 | 1.19 |
| Glu | 3.9 | 4.2 | -0.3 | 1 | -0.025 | 0.98 |
| Gln | 5.7 | 7 | -1.3 | 0.33 | -0.184 | 0.74 |
| Gly | 7.8 | 6.6 | 1.2 | -0.22 | -0.017 | 1.56 |
| His | 2.1 | 2.3 | -0.2 | 0.37 | 0.056 | 0.95 |
| Ile | 4.7 | 5.3 | -0.6 | -0.8 | -0.309 | 0.47 |
| Leu | 8.2 | 9 | -0.8 | -0.44 | -0.264 | 0.59 |
| Lys | 5.7 | 6.4 | -0.7 | 1.17 | -0.371 | 1.01 |
| Met | 1.7 | 2.3 | -0.6 | -0.31 | 0.077 | 0.6 |
| Phe | 3.5 | 4 | -0.5 | -0.55 | 0.074 | 0.6 |
| Pro | 4.9 | 4.7 | 0.2 | 0.36 | -0.036 | 1.52 |
| Ser | 8.6 | 6.7 | 1.9 | 0.17 | 0.47 | 1.43 |
| Thr | 6.7 | 6 | 0.7 | 0.18 | 0.348 | 0.96 |
| Trp | 1.7 | 1.4 | 0.3 | 0.05 | 0.05 | 0.96 |
| Tyr | 3.9 | 3.5 | 0.4 | 0.48 | 0.22 | 1.14 |
| Val | 7 | 6.9 | 0.1 | -0.65 | -0.212 | 0.5 |
aGUYH850105 = Apparent partition energies.
bSNEP660104 = Principal component analysis IV.
cCHOP780101 = Normalized frequencies of beta turn.
Figure 5Surface hydrophobicity of 1MAH. The color of the surfaces represents the level of hydrophobicity. The blue, white, and brown colors represent low, mediate, and high hydrophobicity, respectively. (a) Secondary structures of 1MAH (b) Surface hydrophobicity of 1MAH. Protein structures are drawn using Discovery studio 4.0.
Figure 6The numbers of hydrogen bonds at interfaces in heterodimeric complexes. X-axis denotes the identification numbers of high and low binding affinity complexes. Y-axis denotes the number of hydrogen bonds at interfaces in a protein complex.
Figure 7Structure of PROMMP-2/TIMP-2 COMPLEX (PDB code 1GXD). Left: View of the enzyme-inhibitor complex complete structure. Right: A close-up view of type-2 beta turn from the whole complex structure and arrangement of amino acids shown as balls-and-stick model.