Literature DB >> 34708263

Using diverse potentials and scoring functions for the development of improved machine-learned models for protein-ligand affinity and docking pose prediction.

Omar N A Demerdash1.   

Abstract

The advent of computational drug discovery holds the promise of significantly reducing the effort of experimentalists, along with monetary cost. More generally, predicting the binding of small organic molecules to biological macromolecules has far-reaching implications for a range of problems, including metabolomics. However, problems such as predicting the bound structure of a protein-ligand complex along with its affinity have proven to be an enormous challenge. In recent years, machine learning-based methods have proven to be more accurate than older methods, many based on simple linear regression. Nonetheless, there remains room for improvement, as these methods are often trained on a small set of features, with a single functional form for any given physical effect, and often with little mention of the rationale behind choosing one functional form over another. Moreover, it is not entirely clear why one machine learning method is favored over another. In this work, we endeavor to undertake a comprehensive effort towards developing high-accuracy, machine-learned scoring functions, systematically investigating the effects of machine learning method and choice of features, and, when possible, providing insights into the relevant physics using methods that assess feature importance. Here, we show synergism among disparate features, yielding adjusted R2 with experimental binding affinities of up to 0.871 on an independent test set and enrichment for native bound structures of up to 0.913. When purely physical terms that model enthalpic and entropic effects are used in the training, we use feature importance assessments to probe the relevant physics and hopefully guide future investigators working on this and other computational chemistry problems.
© 2021. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Entities:  

Keywords:  Binding affinity; Binding-pose prediction; Docking; Machine learning; Rescoring

Mesh:

Substances:

Year:  2021        PMID: 34708263     DOI: 10.1007/s10822-021-00423-4

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  63 in total

1.  The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures.

Authors:  Renxiao Wang; Xueliang Fang; Yipin Lu; Shaomeng Wang
Journal:  J Med Chem       Date:  2004-06-03       Impact factor: 7.446

2.  The PDBbind database: methodologies and updates.

Authors:  Renxiao Wang; Xueliang Fang; Yipin Lu; Chao-Yie Yang; Shaomeng Wang
Journal:  J Med Chem       Date:  2005-06-16       Impact factor: 7.446

3.  Benchmarking sets for molecular docking.

Authors:  Niu Huang; Brian K Shoichet; John J Irwin
Journal:  J Med Chem       Date:  2006-11-16       Impact factor: 7.446

4.  Comparative assessment of scoring functions on a diverse test set.

Authors:  Tiejun Cheng; Xun Li; Yan Li; Zhihai Liu; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2009-04       Impact factor: 4.956

5.  Classification of current scoring functions.

Authors:  Jie Liu; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2015-02-19       Impact factor: 4.956

6.  Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

Authors:  Zhihai Liu; Minyi Su; Li Han; Jie Liu; Qifan Yang; Yan Li; Renxiao Wang
Journal:  Acc Chem Res       Date:  2017-02-09       Impact factor: 22.384

7.  PDB-wide collection of binding data: current status of the PDBbind database.

Authors:  Zhihai Liu; Yan Li; Li Han; Jie Li; Jie Liu; Zhixiong Zhao; Wei Nie; Yuchen Liu; Renxiao Wang
Journal:  Bioinformatics       Date:  2014-10-09       Impact factor: 6.937

8.  Comparative assessment of scoring functions on an updated benchmark: 1. Compilation of the test set.

Authors:  Yan Li; Zhihai Liu; Jie Li; Li Han; Jie Liu; Zhixiong Zhao; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2014-06-02       Impact factor: 4.956

9.  Prediction of drug binding affinities by comparative binding energy analysis.

Authors:  A R Ortiz; M T Pisabarro; F Gago; R C Wade
Journal:  J Med Chem       Date:  1995-07-07       Impact factor: 7.446

10.  Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking.

Authors:  Michael M Mysinger; Michael Carchia; John J Irwin; Brian K Shoichet
Journal:  J Med Chem       Date:  2012-07-05       Impact factor: 7.446

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.