Literature DB >> 22411892

A comparative assessment of ranking accuracies of conventional and machine-learning-based scoring functions for protein-ligand binding affinity prediction.

Hossam M Ashtawy1, Nihar R Mahapatra.   

Abstract

Accurately predicting the binding affinities of large sets of protein-ligand complexes efficiently is a key challenge in computational biomolecular science, with applications in drug discovery, chemical biology, and structural biology. Since a scoring function (SF) is used to score, rank, and identify drug leads, the fidelity with which it predicts the affinity of a ligand candidate for a protein's binding site has a significant bearing on the accuracy of virtual screening. Despite intense efforts in developing conventional SFs, which are either force-field based, knowledge-based, or empirical, their limited ranking accuracy has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we explore a range of novel SFs employing different machine-learning (ML) approaches in conjunction with a variety of physicochemical and geometrical features characterizing protein-ligand complexes. We assess the ranking accuracies of these new ML-based SFs as well as those of conventional SFs in the context of the 2007 and 2010 PDBbind benchmark data sets on both diverse and protein-family-specific test sets. We also investigate the influence of the size of the training data set and the type and number of features used on ranking accuracy. Within clusters of protein-ligand complexes with different ligands bound to the same target protein, we find that the best ML-based SF is able to rank the ligands correctly based on their experimentally determined binding affinities 62.5 percent of the time and identify the top binding ligand 78.1 percent of the time. For this SF, the Spearman correlation coefficient between ranks of ligands ordered by predicted and experimentally determined binding affinities is 0.771. Given the challenging nature of the ranking problem and that SFs are used to screen millions of ligands, this represents a significant improvement over the best conventional SF we studied, for which the corresponding ranking performance values are 57.8 percent, 73.4 percent, and 0.677.

Mesh:

Substances:

Year:  2012        PMID: 22411892     DOI: 10.1109/TCBB.2012.36

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  14 in total

1.  Deep neural network affinity model for BACE inhibitors in D3R Grand Challenge 4.

Authors:  Bo Wang; Ho-Leung Ng
Journal:  J Comput Aided Mol Des       Date:  2020-01-08       Impact factor: 3.686

2.  Exploring fragment-based target-specific ranking protocol with machine learning on cathepsin S.

Authors:  Yuwei Yang; Jianing Lu; Chao Yang; Yingkai Zhang
Journal:  J Comput Aided Mol Des       Date:  2019-11-15       Impact factor: 3.686

3.  TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.

Authors:  Zixuan Cang; Guo-Wei Wei
Journal:  PLoS Comput Biol       Date:  2017-07-27       Impact factor: 4.475

4.  AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2019-07-01       Impact factor: 4.956

5.  Benchmarking methods and data sets for ligand enrichment assessment in virtual screening.

Authors:  Jie Xia; Ermias Lemma Tilahun; Terry-Elinor Reid; Liangren Zhang; Xiang Simon Wang
Journal:  Methods       Date:  2014-12-03       Impact factor: 3.608

6.  Convolutional neural network scoring and minimization in the D3R 2017 community challenge.

Authors:  Jocelyn Sunseri; Jonathan E King; Paul G Francoeur; David Ryan Koes
Journal:  J Comput Aided Mol Des       Date:  2018-07-10       Impact factor: 3.686

7.  BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.

Authors:  Hossam M Ashtawy; Nihar R Mahapatra
Journal:  BMC Bioinformatics       Date:  2015-02-23       Impact factor: 3.169

8.  Different combinations of atomic interactions predict protein-small molecule and protein-DNA/RNA affinities with similar accuracy.

Authors:  Raquel Dias; Bryan Kolazckowski
Journal:  Proteins       Date:  2015-09-23

9.  Machine-learning scoring functions for identifying native poses of ligands docked to known and novel proteins.

Authors:  Hossam M Ashtawy; Nihar R Mahapatra
Journal:  BMC Bioinformatics       Date:  2015-04-17       Impact factor: 3.169

10.  Multipose binding in molecular docking.

Authors:  Kalina Atkovska; Sergey A Samsonov; Maciej Paszkowski-Rogacz; M Teresa Pisabarro
Journal:  Int J Mol Sci       Date:  2014-02-14       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.