Literature DB >> 27490034

Improving AutoDock Vina Using Random Forest: The Growing Accuracy of Binding Affinity Prediction by the Effective Exploitation of Larger Data Sets.

Hongjian Li1, Kwong-Sak Leung1, Man-Hon Wong1, Pedro J Ballester2,3.   

Abstract

There is a growing body of evidence showing that machine learning regression results in more accurate structure-based prediction of protein-ligand binding affinity. Docking methods that aim at optimizing the affinity of ligands for a target rely on how accurate their predicted ranking is. However, despite their proven advantages, machine-learning scoring functions are still not widely applied. This seems to be due to insufficient understanding of their properties and the lack of user-friendly software implementing them. Here we present a study where the accuracy of AutoDock Vina, arguably the most commonly-used docking software, is strongly improved by following a machine learning approach. We also analyse the factors that are responsible for this improvement and their generality. Most importantly, with the help of a proposed benchmark, we demonstrate that this improvement will be larger as more data becomes available for training Random Forest models, as regression models implying additive functional forms do not improve with more training data. We discuss how the latter opens the door to new opportunities in scoring function development. In order to facilitate the translation of this advance to enhance structure-based molecular design, we provide software to directly re-score Vina-generated poses and thus strongly improve their predicted binding affinity. The software is available at http://istar.cse.cuhk.edu.hk/rf-score-3.tgz and http://crcm. marseille.inserm.fr/fileadmin/rf-score-3.tgz.
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Docking; Drug lead optimization; Machine learning

Mesh:

Year:  2015        PMID: 27490034     DOI: 10.1002/minf.201400132

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  53 in total

Review 1.  Open source molecular modeling.

Authors:  Somayeh Pirhadi; Jocelyn Sunseri; David Ryan Koes
Journal:  J Mol Graph Model       Date:  2016-07-30       Impact factor: 2.518

2.  Graphene oxide as selective transporter of flavonols for physiological target DNA: A two-color fluorescence approach.

Authors:  Bidisha Sengupta; Justin Coleman; John Johnson; Manliang Feng
Journal:  Spectrochim Acta A Mol Biomol Spectrosc       Date:  2019-02-12       Impact factor: 4.098

3.  Improving scoring-docking-screening powers of protein-ligand scoring functions using random forest.

Authors:  Cheng Wang; Yingkai Zhang
Journal:  J Comput Chem       Date:  2016-11-17       Impact factor: 3.376

4.  Optimization and Evaluation of Site-Identification by Ligand Competitive Saturation (SILCS) as a Tool for Target-Based Ligand Optimization.

Authors:  Vincent D Ustach; Sirish Kaushik Lakkaraju; Sunhwan Jo; Wenbo Yu; Wenjuan Jiang; Alexander D MacKerell
Journal:  J Chem Inf Model       Date:  2019-05-08       Impact factor: 4.956

5.  Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions.

Authors:  Jianing Lu; Xuben Hou; Cheng Wang; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2019-10-31       Impact factor: 4.956

6.  DG-GL: Differential geometry-based geometric learning of molecular datasets.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2019-02-07       Impact factor: 2.747

7.  Deep neural network affinity model for BACE inhibitors in D3R Grand Challenge 4.

Authors:  Bo Wang; Ho-Leung Ng
Journal:  J Comput Aided Mol Des       Date:  2020-01-08       Impact factor: 3.686

8.  TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions.

Authors:  Zixuan Cang; Guo-Wei Wei
Journal:  PLoS Comput Biol       Date:  2017-07-27       Impact factor: 4.475

Review 9.  A review of mathematical representations of biomolecular data.

Authors:  Duc Duy Nguyen; Zixuan Cang; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-02-26       Impact factor: 3.676

10.  Interaction with specific HSP90 residues as a scoring function: validation in the D3R Grand Challenge 2015.

Authors:  Diogo Santos-Martins
Journal:  J Comput Aided Mol Des       Date:  2016-08-22       Impact factor: 3.686

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.