| Literature DB >> 34113650 |
Xiao Wang1, Sean T Flannery1, Daisuke Kihara1,2.
Abstract
Physical interactions of proteins play key functional roles in many important cellular processes. To understand molecular mechanisms of such functions, it is crucial to determine the structure of protein complexes. To complement experimental approaches, which usually take a considerable amount of time and resources, various computational methods have been developed for predicting the structures of protein complexes. In computational modeling, one of the challenges is to identify near-native structures from a large pool of generated models. Here, we developed a deep learning-based approach named Graph Neural Network-based DOcking decoy eValuation scorE (GNN-DOVE). To evaluate a protein docking model, GNN-DOVE extracts the interface area and represents it as a graph. The chemical properties of atoms and the inter-atom distances are used as features of nodes and edges in the graph, respectively. GNN-DOVE was trained, validated, and tested on docking models in the Dockground database and further tested on a combined dataset of Dockground and ZDOCK benchmark as well as a CAPRI scoring dataset. GNN-DOVE performed better than existing methods, including DOVE, which is our previous development that uses a convolutional neural network on voxelized structure models.Entities:
Keywords: deep learning; docking model evaluation; graph neural networks; protein docking; protein structure prediction
Year: 2021 PMID: 34113650 PMCID: PMC8185212 DOI: 10.3389/fmolb.2021.647915
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Dockground dataset splits for training and testing GNN.
| Fold | PDB ID |
|---|---|
| 1 | 1A2K, 1E96 (1he1, 1he8, 1wq1), 1F6M, 1MA9 (2btf), 1G20, 1KU6, 1T6G, 1UGH, 1YVB, 2CKH, 3PRO |
| 2 | 1AKJ (1p7q, 2bnq), 1DFJ, 1NBF (1r4m, 1xd3, 2bkr), 1GPW, 1HXY, 1U7F, 1UEX, 1ZY8, 2GOO, 1EWY |
| 3 | 1AVW (1 |
| 4 | 1BVN (1tmq), 1F51, 1FM9, 1A2Y (1g6v, 1gpq, 1jps, 1wej, 1l9b, 1s6v), 1W1I, 2A5T, 3FAP |
There are in total 29 representative targets shown in the upper case; targets in the lower case in a parenthesis indicate that they belong to the same group.
FIGURE 1Framework of GNN-DOVE. GNN-DOVE extracts the interface region of protein complex and further reconstructs graph with/without intermolecular interactions as input, then outputs the probability that indicates if the input structure is acceptable or not. (A) Overall logical steps of the pipeline. (B) Architecture of the GNN network with the gated graph attention mechanism.
Atom features.
| Features | Representation |
|---|---|
| Atom type | C, N, O, S, H (one hot) |
| The degree (connections) of atom | 0, 1, 2, 3, 4, 5 (one hot) |
| The number of connected hydrogen atoms | 0, 1, 2, 3, 4 (one hot) |
| The number of implicit valence electrons | 0, 1, 2, 3, 4, 5 (one hot) |
| Aromatic | 0 or 1 |
FIGURE 4Comparison of iRMSD, lRMSD, and fnat. For each method, the best value among the top 10 scored decoys was plotted. (A) Comparison against all eight methods. (B) Comparison against DOVE.
FIGURE 2Performance on the Dockground dataset. GNN-DOVE was compared with DOVE and seven other scoring methods. (A) The panel shows the fraction of target complexes among the 58 complexes in the benchmark set for which a method selected at least one acceptable model (within top x scored models). (B) Considering the complexes are grouped into 29 groups, we also compared the hit rate of different methods based on the group classification. The hit rates for complexes in each group were averaged and then re-averaged over the 29 groups. (C) Results when 46 complex groups were considered that were formed with interface similarity. The hit rates for complexes in each group were averaged and then re-averaged over the 46 groups.
FIGURE 3The hit rate is shown for each fold in the cross validation on the Dockground dataset. Protein complexes in the test set of each fold are listed in Table 1. In the same way as Figure 2A, a hit rate was computed for individual complexes separately and averaged over the complexes. (A) The hit rate of the fold 1 test set. The model was trained on the fold 2, 3, and 4 subsets. (B) The fold 2 test set. (C) The fold 3 test set. (D) The fold 4 test set.
FIGURE 5t-SNE plots of decoy selection. Decoys from all the testing target complexes in the four different folds in the cross-testing are plotted, which in total include 580 correct decoys (black circles) and 5,591 incorrect decoys (gray stars). Encoded features of those decoys are taken from the output of the last fully connected layer of GNN, which is a vector of 128 elements. To visualize the different embedding, we use t-SNE to project them into a 2D space. The four panels correspond to the embedding of models on the four-fold testing sets.
FIGURE 6Examples of decoys with an acceptable quality but not selected within the top 10 by DOVE. Two subunits docked are shown in cyan and light brown, and the interface regions of the two subunits are presented in the stick representation and in blue and green, respectively. To highlight the missed atoms from the input cube of DOVE, they are shown in red spheres. (A) A medium-quality decoy for 1bui. iRMSD: 2.54 Å, lRMSD: 2.93 Å, fnat: 0.551. (B) A medium-quality decoy for 1g20. iRMSD: 2.14 Å, lRMSD: 3.86 Å, fnat: 0.453.
Performance on the Dockground+ZDOCK testing dataset.
| ID | GNN-DOVE | GOAP | ITScore | ZRANK | ZRANK2 | IRAD | Total |
|---|---|---|---|---|---|---|---|
| 1AK4 | 1 | 10 | 1 | 1 | 7 | 0 | 179 |
| 1AY7 | 8 | 0 | 3 | 9 | 8 | 8 | 176 |
| 1EER | 0 | 0 | 0 | 0 | 3 | 0 | 41 |
| 1GLA | 5 | 1 | 0 | 8 | 4 | 8 | 165 |
| 1HCF | 9 | 0 | 8 | 3 | 3 | 7 | 183 |
| 1JIW | 3 | 0 | 2 | 0 | 1 | 2 | 106 |
| 1JTG | 8 | 0 | 10 | 10 | 0 | 10 | 177 |
| 1KAC | 7 | 0 | 5 | 8 | 2 | 6 | 183 |
| 1KTZ | 0 | 1 | 0 | 1 | 3 | 0 | 77 |
| 1MAH | 9 | 0 | 8 | 9 | 0 | 9 | 179 |
| 2MTA | 7 | 0 | 4 | 9 | 0 | 9 | 186 |
| 2VDB | 9 | 1 | 9 | 7 | 2 | 6 | 173 |
| 3D5S | 7 | 0 | 10 | 6 | 1 | 5 | 156 |
| 1BUH (1) | 3 | 8 | 9 | 6 | 4 | 9 | 183 |
| 1FQ1 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
| 1JWH (1) | 6 | 6 | 7 | 6 | 2 | 8 | 171 |
| 2OZA (1) | 1 | 0 | 1 | 0 | 0 | 0 | 19 |
| 1EFN (2) | 1 | 0 | 0 | 4 | 3 | 4 | 130 |
| 1GCQ (2) | 2 | 9 | 0 | 1 | 8 | 4 | 142 |
| Hit rate | 0.842 | 0.368 | 0.684 | 0.789 | 0.737 | 0.737 | — |
| Group HR | 0.867 | 0.333 | 0.717 | 0.833 | 0.767 | 0.767 |
In the ID column, the number in a parentheses indicates which group the target belongs to. Thus, four complexes belong to the same similarity group, and the other two belong to another group. The rest of the complexes are single entry groups. Group HR indicates the group hit rate. In Group HR, the fraction of complexes within each group that have at least one hit (acceptable model) within the top 10 ranks was first computed, and then averaged across all the groups. The total column indicates the total number of acceptable docking models for a given target.
Performance on the CAPRI scoring dataset.
| ID | GNN-DOVE | GOAP | ITScore | ZRANK | ZRANK2 | IRAD | Total |
|---|---|---|---|---|---|---|---|
| (T29) | 2/0/0 | 1/0/0 | 0/0/0 | 0/0/0 | 2/2/0 | 1/1/0 | 167/78/2 |
| (T30) | 1/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 2/0/0 |
| T32 | 0/0/0 | 1/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 15/3/0 |
| T35 | 1/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 3/0/0 |
| (T37) | 0/0/0 | 1/0/0 | 3/0/1 | 1/0/0 | 4/1/0 | 4/1/0 | 99/46/11 |
| T39 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 4/3/0 |
| (T40) | 4/4/0 | 1/0/1 | 7/3/4 | 1/1/0 | 9/8/1 | 3/3/0 | 588/206/193 |
| T41 | 5/0/0 | 4/2/2 | 1/1/0 | 4/0/0 | 2/0/0 | 3/0/0 | 371/120/2 |
| T46 | 1/0/0 | 0/0/0 | 0/0/0 | 5/0/0 | 6/0/0 | 6/0/0 | 24/0/0 |
| T47 | 9/4/5 | 10/0/10 | 2/1/0 | 9/5/4 | 9/3/5 | 10/2/7 | 611/307/278 |
| T50 | 6/0/0 | 0/0/0 | 4/1/0 | 0/0/0 | 2/0/0 | 2/0/0 | 133/36/0 |
| T53 | 2/2/0 | 7/6/0 | 3/0/0 | 1/0/0 | 7/3/0 | 4/2/0 | 130/17/0 |
| (T54) | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 0/0/0 | 19/1/0 |
| Hit | 9/3/1 | 7/2/3 | 6/4/2 | 6/2/1 | 8/5/2 | 8/5/1 | 13/10/5 |
| Hit-NR | 6/2/1 | 4/2/2 | 4/3/0 | 4/1/1 | 5/2/1 | 5/2/1 | 8/7/2 |
The IDs in parentheses are those which have structure or sequence similarity to one of the complexes used in training. Results for a complex by a method have three numbers separated by /. The first number is the number of decoys selected within the top 10 ranked models, which has an acceptable or better quality. The second and third numbers are the number of models with medium or higher quality, and the number of high-quality models. The numbers in the total column indicate the total number of decoys in the three quality classifications in the decoy set of each target. The last two rows report the summary of the performance. Three numbers are the number of targets where the method identified at least one acceptable or higher-quality models, at least one medium- or higher-quality models, or at least one high-quality model, respectively. The hit row lists the results when all 13 targets were considered. Hit-NR only considers targets that are not in parentheses.
Ranking of GNN-DOVE among other scorer groups on the CAPRI scoring dataset.
| Group | Performance | # Submitted targets | |
|---|---|---|---|
| All | Nonredundant | ||
| iScore | 9/6/2 | 6/5/1 | 13 (8) |
| GNN-DOVE | 9/3/1 | 6/2/1 | 13 (8) |
| GraphRank | 8/4/1 | 5/3/1 | 13 (8) |
| Bates | 8/4/1 | 5/2/0 | 10 (5) |
| Bonvin | 8/3/2 | 5/2/1 | 9 (5) |
| Weng | 8/2/3 | 5/2/1 | 9 (6) |
| Zou | 7/1/4 | 5/1/2 | 9 (6) |
| Wang | 6/3/2 | 4/2/1 | 6 (4) |
| Fernandez-Recio | 5/3/2 | 4/4/1 | 8 (7) |
| Elber | 5/1/1 | 4/1/0 | 5 (4) |
| Wolfson | 4/0/1 | 1/0/0 | 5 (2) |
| Camacho | 3/1/2 | 1/1/1 | 5 (2) |
Results of the existing methods were taken from Table 2 of the article by Geng et al. (2020). The numbers in the nonredundant column only considered targets in Table 4 that are not in the parentheses. The last column shows the number of targets that each group has submitted their prediction among the 13 targets listed in Table 4. The numbers in parentheses report the number of submitted targets among those which do not have similarity to the training set we used (i.e., discarding the targets in parentheses in Table 4).