Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Can machine learning consistently improve the scoring power of classical scoring functions? Insights into the role of machine learning in scoring functions.

Literature DB >> 31982914

Can machine learning consistently improve the scoring power of classical scoring functions? Insights into the role of machine learning in scoring functions.

Chao Shen, Ye Hu, Zhe Wang, Xujun Zhang, Haiyang Zhong, Gaoang Wang, Xiaojun Yao, Lei Xu, Dongsheng Cao, Tingjun Hou.

Abstract

How to accurately estimate protein-ligand binding affinity remains a key challenge in computer-aided drug design (CADD). In many cases, it has been shown that the binding affinities predicted by classical scoring functions (SFs) cannot correlate well with experimentally measured biological activities. In the past few years, machine learning (ML)-based SFs have gradually emerged as potential alternatives and outperformed classical SFs in a series of studies. In this study, to better recognize the potential of classical SFs, we have conducted a comparative assessment of 25 commonly used SFs. Accordingly, the scoring power was systematically estimated by using the state-of-the-art ML methods that replaced the original multiple linear regression method to refit individual energy terms. The results show that the newly-developed ML-based SFs consistently performed better than classical ones. In particular, gradient boosting decision tree (GBDT) and random forest (RF) achieved the best predictions in most cases. The newly-developed ML-based SFs were also tested on another benchmark modified from PDBbind v2007, and the impacts of structural and sequence similarities were evaluated. The results indicated that the superiority of the ML-based SFs could be fully guaranteed when sufficient similar targets were contained in the training set. Moreover, the effect of the combinations of features from multiple SFs was explored, and the results indicated that combining NNscore2.0 with one to four other classical SFs could yield the best scoring power. However, it was not applicable to derive a generic target-specific SF or SF combination.

Keywords: ML-based SF; binding affinity; machine learning (ML); scoring function (SF); scoring power

Year: 2021 PMID： 31982914 DOI： 10.1093/bib/bbz173

Source DB: PubMed Journal: Brief Bioinform ISSN： 1467-5463 Impact factor: 11.622

Keyword Cloud
Cited

7 in total

4. Machine-learning scoring functions trained on complexes dissimilar to the test set already outperform classical counterparts on a blind benchmark.

Authors: Hongjian Li; Gang Lu; Kam-Heung Sze; Xianwei Su; Wai-Yee Chan; Kwong-Sak Leung
Journal: Brief Bioinform Date: 2021-11-05 Impact factor: 11.622

5. Prediction of Binding Free Energy of Protein-Ligand Complexes with a Hybrid Molecular Mechanics/Generalized Born Surface Area and Machine Learning Method.

Authors: Lina Dong; Xiaoyang Qu; Yuan Zhao; Binju Wang
Journal: ACS Omega Date: 2021-11-21

Review 6. Progress and Impact of Latin American Natural Product Databases.

Authors: Alejandro Gómez-García; José L Medina-Franco
Journal: Biomolecules Date: 2022-08-30

7. XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein-Ligand Scoring and Ranking.

Authors: Lina Dong; Xiaoyang Qu; Binju Wang
Journal: ACS Omega Date: 2022-06-13

7 in total

Can machine learning consistently improve the scoring power of classical scoring functions? Insights into the role of machine learning in scoring functions.

1. Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning.

2. Lin_F9: A Linear Empirical Scoring Function for Protein-Ligand Docking.

3. Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.

4. Machine-learning scoring functions trained on complexes dissimilar to the test set already outperform classical counterparts on a blind benchmark.

5. Prediction of Binding Free Energy of Protein-Ligand Complexes with a Hybrid Molecular Mechanics/Generalized Born Surface Area and Machine Learning Method.

Review 6. Progress and Impact of Latin American Natural Product Databases.

7. XLPFE: A Simple and Effective Machine Learning Scoring Function for Protein-Ligand Scoring and Ranking.