Literature DB >> 27859414

Improving scoring-docking-screening powers of protein-ligand scoring functions using random forest.

Cheng Wang1, Yingkai Zhang1,2.   

Abstract

The development of new protein-ligand scoring functions using machine learning algorithms, such as random forest, has been of significant interest. By efficiently utilizing expanded feature sets and a large set of experimental data, random forest based scoring functions (RFbScore) can achieve better correlations to experimental protein-ligand binding data with known crystal structures; however, more extensive tests indicate that such enhancement in scoring power comes with significant under-performance in docking and screening power tests compared to traditional scoring functions. In this work, to improve scoring-docking-screening powers of protein-ligand docking functions simultaneously, we have introduced a Δvina RF parameterization and feature selection framework based on random forest. Our developed scoring function Δvina RF20 , which employs 20 descriptors in addition to the AutoDock Vina score, can achieve superior performance in all power tests of both CASF-2013 and CASF-2007 benchmarks compared to classical scoring functions. The Δvina RF20 scoring function and its code are freely available on the web at: https://www.nyu.edu/projects/yzhang/DeltaVina.
© 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

Entities:  

Keywords:  docking; machine learning; protein-ligand binding affinity; random forest; scoring function

Mesh:

Substances:

Year:  2016        PMID: 27859414      PMCID: PMC5140681          DOI: 10.1002/jcc.24667

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  67 in total

1.  Assessing scoring functions for protein-ligand interactions.

Authors:  Philippe Ferrara; Holger Gohlke; Daniel J Price; Gerhard Klebe; Charles L Brooks
Journal:  J Med Chem       Date:  2004-06-03       Impact factor: 7.446

2.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy.

Authors:  Richard A Friesner; Jay L Banks; Robert B Murphy; Thomas A Halgren; Jasna J Klicic; Daniel T Mainz; Matthew P Repasky; Eric H Knoll; Mee Shelley; Jason K Perry; David E Shaw; Perry Francis; Peter S Shenkin
Journal:  J Med Chem       Date:  2004-03-25       Impact factor: 7.446

3.  General and targeted statistical potentials for protein-ligand interactions.

Authors:  Wijnand T M Mooij; Marcel L Verdonk
Journal:  Proteins       Date:  2005-11-01

4.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes.

Authors:  Richard A Friesner; Robert B Murphy; Matthew P Repasky; Leah L Frye; Jeremy R Greenwood; Thomas A Halgren; Paul C Sanschagrin; Daniel T Mainz
Journal:  J Med Chem       Date:  2006-10-19       Impact factor: 7.446

5.  Comparative assessment of scoring functions on a diverse test set.

Authors:  Tiejun Cheng; Xun Li; Yan Li; Zhihai Liu; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2009-04       Impact factor: 4.956

Review 6.  Machine learning: Trends, perspectives, and prospects.

Authors:  M I Jordan; T M Mitchell
Journal:  Science       Date:  2015-07-17       Impact factor: 47.728

Review 7.  Virtual screening strategies in drug discovery: a critical review.

Authors:  A Lavecchia; C Di Giovanni
Journal:  Curr Med Chem       Date:  2013       Impact factor: 4.530

8.  Low-Quality Structural and Interaction Data Improves Binding Affinity Prediction via Random Forest.

Authors:  Hongjian Li; Kwong-Sak Leung; Man-Hon Wong; Pedro J Ballester
Journal:  Molecules       Date:  2015-06-12       Impact factor: 4.411

9.  NNScore 2.0: a neural-network receptor-ligand scoring function.

Authors:  Jacob D Durrant; J Andrew McCammon
Journal:  J Chem Inf Model       Date:  2011-11-03       Impact factor: 4.956

10.  Pharmacophore-based similarity scoring for DOCK.

Authors:  Lingling Jiang; Robert C Rizzo
Journal:  J Phys Chem B       Date:  2014-10-10       Impact factor: 2.991

View more
  49 in total

1.  Generative network complex (GNC) for drug discovery.

Authors:  Christopher Grow; Kaifu Gao; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Commun Inf Syst       Date:  2019

2.  Convex-PL: a novel knowledge-based potential for protein-ligand interactions deduced from structural databases using convex optimization.

Authors:  Maria Kadukova; Sergei Grudinin
Journal:  J Comput Aided Mol Des       Date:  2017-09-18       Impact factor: 3.686

3.  Nonparametric chemical descriptors for the calculation of ligand-biopolymer affinities with machine-learning scoring functions.

Authors:  Edelmiro Moman; Maria A Grishina; Vladimir A Potemkin
Journal:  J Comput Aided Mol Des       Date:  2019-11-14       Impact factor: 3.686

4.  Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions.

Authors:  Jianing Lu; Xuben Hou; Cheng Wang; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2019-10-31       Impact factor: 4.956

5.  Exploring fragment-based target-specific ranking protocol with machine learning on cathepsin S.

Authors:  Yuwei Yang; Jianing Lu; Chao Yang; Yingkai Zhang
Journal:  J Comput Aided Mol Des       Date:  2019-11-15       Impact factor: 3.686

6.  AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  J Chem Inf Model       Date:  2019-07-01       Impact factor: 4.956

Review 7.  A guide to machine learning for biologists.

Authors:  Joe G Greener; Shaun M Kandathil; Lewis Moffat; David T Jones
Journal:  Nat Rev Mol Cell Biol       Date:  2021-09-13       Impact factor: 94.444

Review 8.  A review of mathematical representations of biomolecular data.

Authors:  Duc Duy Nguyen; Zixuan Cang; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-02-26       Impact factor: 3.676

9.  Protein-ligand docking using FFT based sampling: D3R case study.

Authors:  Dzmitry Padhorny; David R Hall; Hanieh Mirzaei; Artem B Mamonov; Mohammad Moghadasi; Andrey Alekseenko; Dmitri Beglov; Dima Kozakov
Journal:  J Comput Aided Mol Des       Date:  2017-11-03       Impact factor: 3.686

10.  Three-Dimensional Convolutional Neural Networks and a Cross-Docked Data Set for Structure-Based Drug Design.

Authors:  Paul G Francoeur; Tomohide Masuda; Jocelyn Sunseri; Andrew Jia; Richard B Iovanisci; Ian Snyder; David R Koes
Journal:  J Chem Inf Model       Date:  2020-09-10       Impact factor: 4.956

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.