Literature DB >> 20236947

A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking.

Pedro J Ballester1, John B O Mitchell.   

Abstract

MOTIVATION: Accurately predicting the binding affinities of large sets of diverse protein-ligand complexes is an extremely challenging task. The scoring functions that attempt such computational prediction are essential for analysing the outputs of molecular docking, which in turn is an important technique for drug discovery, chemical biology and structural biology. Each scoring function assumes a predetermined theory-inspired functional form for the relationship between the variables that characterize the complex, which also include parameters fitted to experimental or simulation data and its predicted binding affinity. The inherent problem of this rigid approach is that it leads to poor predictivity for those complexes that do not conform to the modelling assumptions. Moreover, resampling strategies, such as cross-validation or bootstrapping, are still not systematically used to guard against the overfitting of calibration data in parameter estimation for scoring functions.
RESULTS: We propose a novel scoring function (RF-Score) that circumvents the need for problematic modelling assumptions via non-parametric machine learning. In particular, Random Forest was used to implicitly capture binding effects that are hard to model explicitly. RF-Score is compared with the state of the art on the demanding PDBbind benchmark. Results show that RF-Score is a very competitive scoring function. Importantly, RF-Score's performance was shown to improve dramatically with training set size and hence the future availability of more high-quality structural and interaction data is expected to lead to improved versions of RF-Score. CONTACT: pedro.ballester@ebi.ac.uk; jbom@st-andrews.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20236947      PMCID: PMC3524828          DOI: 10.1093/bioinformatics/btq112

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Consideration of molecular weight during compound selection in virtual target-based database screening.

Authors:  Yongping Pan; Niu Huang; Sam Cho; Alexander D MacKerell
Journal:  J Chem Inf Comput Sci       Date:  2003 Jan-Feb

3.  Assessing scoring functions for protein-ligand interactions.

Authors:  Philippe Ferrara; Holger Gohlke; Daniel J Price; Gerhard Klebe; Charles L Brooks
Journal:  J Med Chem       Date:  2004-06-03       Impact factor: 7.446

4.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy.

Authors:  Richard A Friesner; Jay L Banks; Robert B Murphy; Thomas A Halgren; Jasna J Klicic; Daniel T Mainz; Matthew P Repasky; Eric H Knoll; Mee Shelley; Jason K Perry; David E Shaw; Perry Francis; Peter S Shenkin
Journal:  J Med Chem       Date:  2004-03-25       Impact factor: 7.446

5.  General and targeted statistical potentials for protein-ligand interactions.

Authors:  Wijnand T M Mooij; Marcel L Verdonk
Journal:  Proteins       Date:  2005-11-01

6.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes.

Authors:  Richard A Friesner; Robert B Murphy; Matthew P Repasky; Leah L Frye; Jeremy R Greenwood; Thomas A Halgren; Paul C Sanschagrin; Daniel T Mainz
Journal:  J Med Chem       Date:  2006-10-19       Impact factor: 7.446

7.  Molecular docking for substrate identification: the short-chain dehydrogenases/reductases.

Authors:  Angelo D Favia; Irene Nobeli; Fabian Glaser; Janet M Thornton
Journal:  J Mol Biol       Date:  2007-11-01       Impact factor: 5.469

8.  y-Randomization and its variants in QSPR/QSAR.

Authors:  Christoph Rücker; Gerta Rücker; Markus Meringer
Journal:  J Chem Inf Model       Date:  2007-09-20       Impact factor: 4.956

9.  Comparative assessment of scoring functions on a diverse test set.

Authors:  Tiejun Cheng; Xun Li; Yan Li; Zhihai Liu; Renxiao Wang
Journal:  J Chem Inf Model       Date:  2009-04       Impact factor: 4.956

10.  Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes.

Authors:  M D Eldridge; C W Murray; T R Auton; G V Paolini; R P Mee
Journal:  J Comput Aided Mol Des       Date:  1997-09       Impact factor: 3.686

View more
  144 in total

1.  Experimental versus predicted affinities for ligand binding to estrogen receptor: iterative selection and rescoring of docked poses systematically improves the correlation.

Authors:  James S Wright; James M Anderson; Hooman Shadnia; Tony Durst; John A Katzenellenbogen
Journal:  J Comput Aided Mol Des       Date:  2013-08-24       Impact factor: 3.686

Review 2.  Open source molecular modeling.

Authors:  Somayeh Pirhadi; Jocelyn Sunseri; David Ryan Koes
Journal:  J Mol Graph Model       Date:  2016-07-30       Impact factor: 2.518

3.  Characterizing Protein-Ligand Binding Using Atomistic Simulation and Machine Learning: Application to Drug Resistance in HIV-1 Protease.

Authors:  Troy W Whitfield; Debra A Ragland; Konstantin B Zeldovich; Celia A Schiffer
Journal:  J Chem Theory Comput       Date:  2020-01-16       Impact factor: 6.006

4.  A comparative study of family-specific protein-ligand complex affinity prediction based on random forest approach.

Authors:  Yu Wang; Yanzhi Guo; Qifan Kuang; Xuemei Pu; Yue Ji; Zhihang Zhang; Menglong Li
Journal:  J Comput Aided Mol Des       Date:  2014-12-20       Impact factor: 3.686

5.  Biased Docking for Protein-Ligand Pose Prediction.

Authors:  Juan Pablo Arcon; Adrián G Turjanski; Marcelo A Martí; Stefano Forli
Journal:  Methods Mol Biol       Date:  2021

Review 6.  Receptor-ligand molecular docking.

Authors:  Isabella A Guedes; Camila S de Magalhães; Laurent E Dardenne
Journal:  Biophys Rev       Date:  2013-12-21

Review 7.  A review of mathematical representations of biomolecular data.

Authors:  Duc Duy Nguyen; Zixuan Cang; Guo-Wei Wei
Journal:  Phys Chem Chem Phys       Date:  2020-02-26       Impact factor: 3.676

8.  Characterization of small molecule binding. I. Accurate identification of strong inhibitors in virtual screening.

Authors:  Bo Ding; Jian Wang; Nan Li; Wei Wang
Journal:  J Chem Inf Model       Date:  2013-01-09       Impact factor: 4.956

9.  Novel small molecule binders of human N-glycanase 1, a key player in the endoplasmic reticulum associated degradation pathway.

Authors:  Bharath Srinivasan; Hongyi Zhou; Sreyoshi Mitra; Jeffrey Skolnick
Journal:  Bioorg Med Chem       Date:  2016-08-13       Impact factor: 3.641

10.  Neural-Network Scoring Functions Identify Structurally Novel Estrogen-Receptor Ligands.

Authors:  Jacob D Durrant; Kathryn E Carlson; Teresa A Martin; Tavina L Offutt; Christopher G Mayne; John A Katzenellenbogen; Rommie E Amaro
Journal:  J Chem Inf Model       Date:  2015-09-04       Impact factor: 4.956

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.