| Literature DB >> 28424069 |
Tanchanok Wisitponchai1,2, Watshara Shoombuatong3, Vannajan Sanghiran Lee4,5, Kuntida Kitidee6,7, Chatchai Tayapiwatana8,9.
Abstract
BACKGROUND: Computational analysis of protein-protein interaction provided the crucial information to increase the binding affinity without a change in basic conformation. Several docking programs were used to predict the near-native poses of the protein-protein complex in 10 top-rankings. The universal criteria for discriminating the near-native pose are not available since there are several classes of recognition protein. Currently, the explicit criteria for identifying the near-native pose of ankyrin-protein complexes (APKs) have not been reported yet.Entities:
Keywords: AnkPlex; Ankyrin-protein complexes; Decision tree; Logistic regression model; Machine learning methods; Near-native docking pose
Mesh:
Substances:
Year: 2017 PMID: 28424069 PMCID: PMC5395911 DOI: 10.1186/s12859-017-1628-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The molecular architecture of ankyrin and its three recognition areas, as shown in ribbon style. a Amino acid sequence of an internal repeat [7] in which the recognition residues are shown in three colours. b The ribbon style of an internal repeat of ankyrin related to the above sequence. c The structure of the 3 internal domains of ankyrin flanked by the N-cap and C-cap. The recognition area consisted of six variable residues [7] (red and blue are positioned on the helix and turn, respectively) and two constant amino acids (green) on the second domain
Fig. 2The flowchart system for the proposed AnkPlex
Number of docking poses classified as near-native and non-near-native in Ank-TRN (C1-C7) and Ank-TEST (U1 and U2)
| Complex | Near-native | Non-near-native | ||||
|---|---|---|---|---|---|---|
| PSCa | PSC + DE + ELECb | PSC + DE + ELECb | ||||
| 54Kp | 54Kp | regKp | 2Kp | 54Kp | regKp | |
| C1 | 81 | 157 | 136 | 83 | 53,843 | 6,626 |
| C2 | 4 | 72 | 53 | 19 | 53,921 | 4,865 |
| C3 | 125 | 194 | 179 | 54 | 53,806 | 7,055 |
| C4 | 83 | 131 | 104 | 74 | 53,869 | 4,450 |
| C5 | 31 | 101 | 59 | 8 | 53,891 | 8,565 |
| C6 | 84 | 42 | 28 | 18 | 53,958 | 6,177 |
| C7 | 35 | 119 | 110 | 74 | 53,881 | 6,596 |
| Total | 443 | 816 | 669 | 330 | 384,169 | 44,334 |
| Mean | 63.29 | 116.57 | 95.57 | 47.14 | 53,881 | 6,333.43 |
| std. | 41.43 | 51.05 | 52.58 | 31.49 | 49.74 | 1,377.39 |
| U1 | NDc | 83 | 83 | 57 | NDc | 6,833 |
| U2 | NDc | 183 | 32 | 13 | NDc | 7,183 |
aZDOCK was calculated by PSC alone. bZDOCK was calculated by combining PSC, DE, and ELEC. cThe data were not used for analysis
Summary of statistical analysis of near-native and non-near-native poses of ankyrin-target complexes
| Feature | Near-native | Non-near-native |
|
|---|---|---|---|
| ZDock | 36.73 ± 6.32 | 33.41 ± 4.87 | <0.001 |
| ZRankElec | 8.22 ± 16.56 | 29.17 ± 22.41 | <0.001 |
| ZRank | −54.03 ± 25.84 | −21.82 ± 31.33 | <0.001 |
| ZRankSolv | 3.66 ± 6.99 | 7.88 ± 10.10 | <0.001 |
| ZRankVdw | −65.91 ± 20.44 | −58.87 ± 20.98 | <0.001 |
| E_vdw1 | −56.21 ± 57.81 | −47.46 ± 101.79 | <0.001 |
| E_elec1 | -1.05 ± 2.18 | -0.94 ± 2.37 | 0.18 |
| E_vdw2 | −70.01 ± 42.72 | −74.33 ± 43.00 | 0.01 |
| E_elec2 | −18.21 ± 8.85 | −8.18 ± 10.25 | <0.001 |
| E_sol | 4.43 ± 6.26 | 9.07 ± 8.87 | <0.001 |
| E_RDock | −11.96 ± 9.40 | 1.72 ± 12.10 | <0.001 |
Comparison of performances of 10 top-ranking OLMDT among various types of features and datasets in terms of 10-fold cross-validation
| Rank | OLMDT | PRES(%) | REC(%) | ACC(%) | #PPTRN |
|---|---|---|---|---|---|
| 1 | ABDEHIJK_g14 | 82.10 ± 8.49 | 70.58 ± 15.03 | 6.96 ± 4.82 | 6 |
| 2 | ADEFHIK_g25 | 81.64 ± 9.08 | 72.05 ± 14.65 | 7.05 ± 5.06 | 6 |
| 3 | ABDEGHIJK_g14 | 81.14 ± 8.83 | 73.66 ± 16.61 | 6.83 ± 4.68 | 6 |
| 4 | BCEGHIJK_g13 | 81.04 ± 8.37 | 72.88 ± 14.37 | 6.03 ± 3.31 | 6 |
| 5 | AEFGHJK_g36 | 81.03 ± 8.71 | 73.85 ± 13.05 | 6.91 ± 5.17 | 6 |
| 6 | ABFIJK_g16 | 80.84 ± 8.48 | 69.17 ± 14.68 | 6.33 ± 4.32 | 6 |
| 7 | ABDEFGIJK_g25 | 80.84 ± 7.98 | 75.54 ± 8.88 | 6.41 ± 3.80 | 6 |
| 8 | ABIJK_g16 | 80.82 ± 8.43 | 69.17 ± 14.68 | 6.31 ± 4.31 | 6 |
| 9 | ABCGHJK_g13 | 80.80 ± 7.86 | 73.55 ± 12.44 | 6.30 ± 3.98 | 6 |
| 10 | ABDEFGIK_g25 | 80.78 ± 8.09 | 75.67 ± 9.12 | 6.38 ± 3.73 | 6 |
11 feathers (A, B, C, D, E, F, G, H, I, J, K) are ZDock, ZRankElec, ZRank, ZRankSolv, ZRankVdw, E_vdw1, E_elec1, E_vdw2, E_elec2, E_sol, E_RDock
Comparison of performances of 10 top-ranking OLMLG among various Types of features and datasets in terms of 10-fold cross-validation
| Rank | OLMLG | PRES(%) | REC(%) | ACC(%) | #PPTRN |
|---|---|---|---|---|---|
| 1 | CDFGJ_g10 | 74.76 ± 11.62 | 67.83 ± 14.80 | 4.91 ± 3.75 | 6 |
| 2 | CDFJ_g10 | 74.75 ± 11.59 | 67.66 ± 15.08 | 4.90 ± 3.75 | 6 |
| 3 | CDJ_g10 | 74.74 ± 11.55 | 67.83 ± 14.80 | 4.91 ± 3.76 | 6 |
| 4 | CDGJ_g10 | 74.72 ± 11.55 | 67.83 ± 14.80 | 4.91 ± 3.76 | 6 |
| 5 | BCDEFGJ_g10 | 74.64 ± 10.30 | 68.06 ± 17.77 | 4.47 ± 2.86 | 4 |
| 6 | BCDEGHJ_g10 | 74.60 ± 10.41 | 68.27 ± 17.78 | 4.48 ± 2.86 | 4 |
| 7 | BCDEFGHJ_g10 | 74.54 ± 10.17 | 68.45 ± 17.85 | 4.46 ± 2.85 | 4 |
| 8 | BCDEFHJ_g10 | 74.51 ± 10.20 | 68.69 ± 17.58 | 4.47 ± 2.84 | 4 |
| 9 | ACDGJ_g36 | 74.48 ± 11.91 | 68.96 ± 15.52 | 4.94 ± 3.80 | 5 |
| 10 | BCDEGJ_g41 | 74.48 ± 10.78 | 67.93 ± 16.77 | 4.57 ± 3.21 | 4 |
11 feathers (A, B, C, D, E, F, G, H, I, J, K) are ZDock, ZRankElec, ZRank, ZRankSolv, ZRankVdw, E_vdw1, E_elec1, E_vdw2, E_elec2, E_sol, E_RDock
Fig. 3Characteristics of the optimal AnkPlex
Comparison of performances of AnkPlex with single learning method and ZDOCK programa
| Method | Number of TP docking poses (rank) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| C1 | C2 | C3 | C4 | C5 | C6 | C7 | U1 | U2 | |
| 2Kp(ZDOCK) | 0 (13) | 1 (10) | 6 (1) | 7 (1) | 1 (10) | 1 (10) | 8 (1) | 0 (14) | 0 (104) |
| OLMDT (ABEHIJ_g56) | 1 (4) | 1 (4) | 8 (1) | 7 (1) | 0 (14) | 2 (1) | 7 (1) | 0 (13) | 0 (160) |
| OLMLG (CDFGHJ_g30) | 2 (1) | 1 (3) | 8 (1) | 10 (1) | 1 (9) | 0 (19) | 7 (1) | 1 (6) | 1 (5) |
| AnkPlex | 2 (1) | 1 (3) | 8 (1) | 10 (1) | 1 (9) | 3 (2) | 7 (1) | 1 (6) | 1 (5) |
aThe number of TP docking poses is the summation of the TP docking poses found in 10 top-ranking poses, where the maximum and the minimum are 10 and 0, respectively. The rank is denoted by the order in which the first TP docking poses are found. For example, on C6, AnkPlex yields three TP docking poses on the top 10 ranking poses, and the orders of the three TP docking poses are 2, 8, and 9. Thus, the rank of AnkPlex on C6 is 2
Number of near-native poses in 10 top-ranking poses obtained from ZDOCK program with 2Kp
| Feature | Number of near-native poses in 10 top-ranking poses | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| C1 | C2 | C3 | C4 | C5 | C6 | C7 | #PPTRN | U1 | U2 | #PPTEST | |
| ZDock | 0 | 0 | 0 | 3 | 0 | 0 | 4 | 2 | 0 | 0 | 0 |
| ZRankElec | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| ZRank | 0 | 1 | 6 | 7 | 1 | 1 | 8 | 6 | 0 | 0 | 0 |
| ZRankSolv | 0 | 0 | 0 | 7 | 1 | 0 | 0 | 2 | 0 | 0 | 0 |
| ZRankVdw | 0 | 0 | 4 | 0 | 0 | 3 | 1 | 3 | 0 | 0 | 0 |
| E_vdw1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| E_elec1 | 0 | 0 | 0 | 0 | 0 | 7 | 0 | 1 | 0 | 0 | 0 |
| E_vdw2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| E_elec2 | 0 | 0 | 0 | 0 | 0 | 6 | 0 | 1 | 0 | 0 | 0 |
| E_sol | 0 | 0 | 0 | 9 | 1 | 0 | 0 | 2 | 0 | 0 | 0 |
| E_RDock | 8 | 0 | 0 | 4 | 0 | 0 | 5 | 3 | 0 | 0 | 0 |
Fig. 4The correlation coefficients between the dot products of the features and their weights of the best-ranking LGS for the near-native poses in the nine ankyrin-protein complexes