Literature DB >> 33758923

Machine learning on ligand-residue interaction profiles to significantly improve binding affinity prediction.

Beihong Ji1, Xibing He1, Jingchen Zhai1, Yuzhao Zhang1, Viet Hoang Man1, Junmei Wang1.   

Abstract

Structure-based virtual screenings (SBVSs) play an important role in drug discovery projects. However, it is still a challenge to accurately predict the binding affinity of an arbitrary molecule binds to a drug target and prioritize top ligands from an SBVS. In this study, we developed a novel method, using ligand-residue interaction profiles (IPs) to construct machine learning (ML)-based prediction models, to significantly improve the screening performance in SBVSs. Such a kind of the prediction model is called an IP scoring function (IP-SF). We systematically investigated how to improve the performance of IP-SFs from many perspectives, including the sampling methods before interaction energy calculation and different ML algorithms. Using six drug targets with each having hundreds of known ligands, we conducted a critical evaluation on the developed IP-SFs. The IP-SFs employing a gradient boosting decision tree (GBDT) algorithm in conjunction with the MIN + GB simulation protocol achieved the best overall performance. Its scoring power, ranking power and screening power significantly outperformed the Glide SF. First, compared with Glide, the average values of mean absolute error and root mean square error of GBDT/MIN + GB decreased about 38 and 36%, respectively. Second, the mean values of squared correlation coefficient and predictive index increased about 225 and 73%, respectively. Third, more encouragingly, the average value of the areas under the curve of receiver operating characteristic for six targets by GBDT, 0.87, is significantly better than that by Glide, which is only 0.71. Thus, we expected IP-SFs to have broad and promising applications in SBVSs.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  binding affinity prediction; machine learning (ML); machine learning-based scoring function (ML-based SF); scoring function (SF); scoring power; structure-based virtual screening (SBVS)

Mesh:

Substances:

Year:  2021        PMID: 33758923      PMCID: PMC8425425          DOI: 10.1093/bib/bbab054

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  69 in total

1.  Predicting protein-ligand binding affinities using novel geometrical descriptors and machine-learning methods.

Authors:  Wei Deng; Curt Breneman; Mark J Embrechts
Journal:  J Chem Inf Comput Sci       Date:  2004 Mar-Apr

2.  Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald.

Authors:  Romelia Salomon-Ferrer; Andreas W Götz; Duncan Poole; Scott Le Grand; Ross C Walker
Journal:  J Chem Theory Comput       Date:  2013-08-20       Impact factor: 6.006

3.  Development of quantitative structure-binding affinity relationship models based on novel geometrical chemical descriptors of the protein-ligand interfaces.

Authors:  Shuxing Zhang; Alexander Golbraikh; Alexander Tropsha
Journal:  J Med Chem       Date:  2006-05-04       Impact factor: 7.446

Review 4.  What is a support vector machine?

Authors:  William S Noble
Journal:  Nat Biotechnol       Date:  2006-12       Impact factor: 54.908

Review 5.  Prediction of protein-ligand interactions. Docking and scoring: successes and gaps.

Authors:  Andrew R Leach; Brian K Shoichet; Catherine E Peishoff
Journal:  J Med Chem       Date:  2006-10-05       Impact factor: 7.446

6.  Binding affinity prediction for protein-ligand complexes based on β contacts and B factor.

Authors:  Qian Liu; Chee Keong Kwoh; Jinyan Li
Journal:  J Chem Inf Model       Date:  2013-11-05       Impact factor: 4.956

7.  Classical scoring functions for docking are unable to exploit large volumes of structural and interaction data.

Authors:  Hongjian Li; Jiangjun Peng; Pavel Sidorov; Yee Leung; Kwong-Sak Leung; Man-Hon Wong; Gang Lu; Pedro J Ballester
Journal:  Bioinformatics       Date:  2019-10-15       Impact factor: 6.937

8.  Improved Generalized Born Solvent Model Parameters for Protein Simulations.

Authors:  Hai Nguyen; Daniel R Roe; Carlos Simmerling
Journal:  J Chem Theory Comput       Date:  2013-04-09       Impact factor: 6.006

9.  Binding affinity prediction with property-encoded shape distribution signatures.

Authors:  Sourav Das; Michael P Krein; Curt M Breneman
Journal:  J Chem Inf Model       Date:  2010-02-22       Impact factor: 4.956

10.  ChEMBL web services: streamlining access to drug discovery data and utilities.

Authors:  Mark Davies; Michał Nowotka; George Papadatos; Nathan Dedman; Anna Gaulton; Francis Atkinson; Louisa Bellis; John P Overington
Journal:  Nucleic Acids Res       Date:  2015-04-16       Impact factor: 16.971

View more
  2 in total

1.  In silico binding affinity prediction for metabotropic glutamate receptors using both endpoint free energy methods and a machine learning-based scoring function.

Authors:  Jingchen Zhai; Xibing He; Yuchen Sun; Zhuoya Wan; Beihong Ji; Shuhan Liu; Song Li; Junmei Wang
Journal:  Phys Chem Chem Phys       Date:  2022-08-03       Impact factor: 3.945

Review 2.  Structure-based protein-ligand interaction fingerprints for binding affinity prediction.

Authors:  Debby D Wang; Moon-Tong Chan; Hong Yan
Journal:  Comput Struct Biotechnol J       Date:  2021-11-25       Impact factor: 7.271

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.