| Literature DB >> 35419466 |
Nicolas Renaud1, Yong Jung2, Vasant Honavar2,3, Cunliang Geng1,4, Alexandre M J J Bonvin4, Li C Xue4,5.
Abstract
Computational docking is a promising tool to model three-dimensional (3D) structures of protein-protein complexes, which provides fundamental insights of protein functions in the cellular life. Singling out near-native models from the huge pool of generated docking models (referred to as the scoring problem) remains as a major challenge in computational docking. We recently published iScore, a novel graph kernel based scoring function. iScore ranks docking models based on their interface graph similarities to the training interface graph set. iScore uses a support vector machine approach with random-walk graph kernels to classify and rank protein-protein interfaces. Here, we present the software for iScore. The software provides executable scripts that fully automate the computational workflow. In addition, the creation and analysis of the interface graph can be distributed across different processes using Message Passing interface (MPI) and can be offloaded to GPUs thanks to dedicated CUDA kernels.Entities:
Keywords: Graph kernel functions; MPI; Position-specific scoring matrix (PSSM); Protein–protein docking; Scoring; Support vector machines
Year: 2020 PMID: 35419466 PMCID: PMC9005067 DOI: 10.1016/j.softx.2020.100462
Source DB: PubMed Journal: SoftwareX
Fig. 1.Computational workflow of iScore during the training of a SVM model and during the utilization of pre-trained model to rank new PPIs.
Fig. 3.Visualization of the connection graph of PDB ID: 1IRA using iScore.h5x with the PyMol molecular viewer. All interface residues are colored differently following a rainbow color palette to facilitate their identification. The residues that are not part of the interface are represented as thin gray lines. The connection between interface residues are shown as white lines.
Fig. 2.(a) Scaling of iScore.train.mpi and iScore.predict.mpi with respect to the number of MPI processes. The training and testing set contained 234 and 599 conformations respectively. (b) Average run time on CPU (Intel Xeon E5-2650 v4 @ 2.20 GHz) and GPU (Nvidia GeForce GTX 1080 Ti) for the calculation of RWGK for two graphs containing n nodes and 3n edges.
Fig. 4.Comparison of the hit rates obtained by iScore and HADDOCK for four CAPRI targets.
Performance of iScore and HADDOCK scoring functions on four CAPRI test cases. The number in bracket represents the number of near-native conformations for each case. The number in each column represents the number of near-native conformations in the top 10, top 50 and top 100 conformations predicted by the two methods.
| iScore | HADDOCK | |||||
|---|---|---|---|---|---|---|
| Top 10 | Top 50 | Top 100 | Top 10 | Top 50 | Top 100 | |
| T32 (15) | 6 | 9 | 10 | 0 | 0 | 0 |
| T41 (371) | 8 | 48 | 97 | 1 | 24 | 46 |
| T47 (611) | 10 | 50 | 99 | 10 | 50 | 100 |
| T50 (133) | 0 | 4 | 10 | 1 | 9 | 14 |
Code metadata
| Current code version | 0.2.0 |
| Permanent link to code/repository used for this code version |
|
| Legal Code License | Apache-2.0 |
| Code versioning system used | git |
| Software code languages, tools, and services used | python, MPI, CUDA. |
| Compilation requirements, operating environments & dependencies | numpy, libSVM, pdb2sql, h5x, pytest, biopython, mpi4py, numpy, scipy, h5py, matplotlib |
| If available Link to developer documentation/manual |
|
| Support email for questions |
|