| Literature DB >> 35524575 |
Charles A Santana1,2, Sandro C Izidoro3, Raquel C de Melo-Minardi1,2, Jonathan D Tyzack4, António J M Ribeiro4, Douglas E V Pires5, Janet M Thornton4, Sabrina de A Silveira6.
Abstract
Proteins are essential macromolecules for the maintenance of living systems. Many of them perform their function by interacting with other molecules in regions called binding sites. The identification and characterization of these regions are of fundamental importance to determine protein function, being a fundamental step in processes such as drug design and discovery. However, identifying such binding regions is not trivial due to the drawbacks of experimental methods, which are costly and time-consuming. Here we propose GRaSP-web, a web server that uses GRaSP (Graph-based Residue neighborhood Strategy to Predict binding sites), a residue-centric method based on graphs that uses machine learning to predict putative ligand binding site residues. The method outperformed 6 state-of-the-art residue-centric methods (MCC of 0.61). Also, GRaSP-web is scalable as it takes 10-20 seconds to predict binding sites for a protein complex (the state-of-the-art residue-centric method takes 2-5h on the average). It proved to be consistent in predicting binding sites for bound/unbound structures (MCC 0.61 for both) and for a large dataset of multi-chain proteins (4500 entries, MCC 0.61). GRaSPWeb is freely available at https://grasp.ufv.br.Entities:
Year: 2022 PMID: 35524575 PMCID: PMC9252730 DOI: 10.1093/nar/gkac323
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.GRaSP-web workflow. In (A) we see an input protein. It is modeled as a neighborhood graph in (B), which is encoded as a feature vector (C). A training dataset is built with data extracted from the BioLip database in (D). Binding site residues are predicted by an ensemble classifier in (E). In (F), the results are presented: potential binding sites, suggested ligands and ranked feature importance. This figure was inspired by aCSM workflow (19).
Figure 2.GRaSP-web results page. Binding site residues are presented coupled with confidence scores in (A). A molecular viewer shows in (B) binding site residues clustered as potential binding sites. A set of ligands are suggested for each binding site in (C). The relative importance of descriptors is presented in (D).
Figure 3.Binding site residues predicted by GRaSP-web (in orange) for the multiple chain protein structure of alpha-chymotrypsin (PDB: 2CHA).
Figure 4.Protein structure of HIV protease. (A) Superposition of the HIV protease structures 4PHV (bound state in beige) and 3PHV (unbound state in magenta). (B, C) Prediction performed by GRaSP-web in orange in both conformational states of HIV protease.