Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Accuracy or novelty: what can we gain from target-specific machine-learning-based scoring functions in virtual screening?

Literature DB >> 33418562

Accuracy or novelty: what can we gain from target-specific machine-learning-based scoring functions in virtual screening?

Chao Shen¹, Gaoqi Weng¹, Xujun Zhang¹, Elaine Lai-Han Leung², Xiaojun Yao², Jinping Pang¹, Xin Chai¹, Dan Li¹, Ercheng Wang¹, Dongsheng Cao³, Tingjun Hou¹.

Abstract

Machine-learning (ML)-based scoring functions (MLSFs) have gradually emerged as a promising alternative for protein-ligand binding affinity prediction and structure-based virtual screening. However, clouds of doubts have still been raised against the benefits of this novel type of scoring functions (SFs). In this study, to benchmark the performance of target-specific MLSFs on a relatively unbiased dataset, the MLSFs trained from three representative protein-ligand interaction representations were assessed on the LIT-PCBA dataset, and the classical Glide SP SF and three types of ligand-based quantitative structure-activity relationship (QSAR) models were also utilized for comparison. Two major aspects in virtual screening campaigns, including prediction accuracy and hit novelty, were systematically explored. The calculation results illustrate that the tested target-specific MLSFs yielded generally superior performance over the classical Glide SP SF, but they could hardly outperform the 2D fingerprint-based QSAR models. Although substantial improvements could be achieved by integrating multiple types of protein-ligand interaction features, the MLSFs were still not sufficient to exceed MACCS-based QSAR models. In terms of the correlations between the hit ranks or the structures of the top-ranked hits, the MLSFs developed by different featurization strategies would have the ability to identify quite different hits. Nevertheless, it seems that target-specific MLSFs do not have the intrinsic attributes of a traditional SF and may not be a substitute for classical SFs. In contrast, MLSFs can be regarded as a new derivative of ligand-based QSAR models. It is expected that our study may provide valuable guidance for the assessment and further development of target-specific MLSFs.

Entities: Chemical

Keywords: hit novelty; machine learning; prediction accuracy; target-specific scoring function; virtual screening

Year: 2021 PMID： 33418562 DOI： 10.1093/bib/bbaa410

Source DB: PubMed Journal: Brief Bioinform ISSN： 1467-5463 Impact factor: 11.622

Keyword Cloud
Cited

4 in total

Review 1. Delta Machine Learning to Improve Scoring-Ranking-Screening Performances of Protein-Ligand Scoring Functions.

Authors: Chao Yang; Yingkai Zhang
Journal: J Chem Inf Model Date: 2022-05-17 Impact factor: 6.162

Review 2. Structure-based virtual screening for PDL1 dimerizers: Evaluating generic scoring functions.

Authors: Viet-Khoa Tran-Nguyen; Saw Simeon; Muhammad Junaid; Pedro J Ballester
Journal: Curr Res Struct Biol Date: 2022-06-09

Review 3. Resources and computational strategies to advance small molecule SARS-CoV-2 discovery: lessons from the pandemic and preparing for future health crises.

Authors: Natesh Singh; Bruno O Villoutreix
Journal: Comput Struct Biotechnol J Date: 2021-04-26 Impact factor: 7.271

Review 4. Protein-Ligand Docking in the Machine-Learning Era.

Authors: Chao Yang; Eric Anthony Chen; Yingkai Zhang
Journal: Molecules Date: 2022-07-18 Impact factor: 4.927

4 in total