Literature DB >> 32568385

The impact of compound library size on the performance of scoring functions for structure-based virtual screening.

Louison Fresnais, Pedro J Ballester.   

Abstract

Larger training datasets have been shown to improve the accuracy of machine learning (ML)-based scoring functions (SFs) for structure-based virtual screening (SBVS). In addition, massive test sets for SBVS, known as ultra-large compound libraries, have been demonstrated to enable the fast discovery of selective drug leads with low-nanomolar potency. This proof-of-concept was carried out on two targets using a single docking tool along with its SF. It is thus unclear whether this high level of performance would generalise to other targets, docking tools and SFs. We found that screening a larger compound library results in more potent actives being identified in all six additional targets using a different docking tool along with its classical SF. Furthermore, we established that a way to improve the potency of the retrieved molecules further is to rank them with more accurate ML-based SFs (we found this to be true in four of the six targets; the difference was not significant in the remaining two targets). A 3-fold increase in average hit rate across targets was also achieved by the ML-based SFs. Lastly, we observed that classical and ML-based SFs often find different actives, which supports using both types of SFs on those targets.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  big data; docking; drug design; machine learning; virtual screening

Year:  2021        PMID: 32568385     DOI: 10.1093/bib/bbaa095

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  5 in total

Review 1.  Delta Machine Learning to Improve Scoring-Ranking-Screening Performances of Protein-Ligand Scoring Functions.

Authors:  Chao Yang; Yingkai Zhang
Journal:  J Chem Inf Model       Date:  2022-05-17       Impact factor: 6.162

Review 2.  Structure-based virtual screening for PDL1 dimerizers: Evaluating generic scoring functions.

Authors:  Viet-Khoa Tran-Nguyen; Saw Simeon; Muhammad Junaid; Pedro J Ballester
Journal:  Curr Res Struct Biol       Date:  2022-06-09

3.  OptNCMiner: a deep learning approach for the discovery of natural compounds modulating disease-specific multi-targets.

Authors:  Seo Hyun Shin; Seung Man Oh; Jung Han Yoon Park; Ki Won Lee; Hee Yang
Journal:  BMC Bioinformatics       Date:  2022-06-07       Impact factor: 3.307

4.  New machine learning and physics-based scoring functions for drug discovery.

Authors:  Isabella A Guedes; André M S Barreto; Diogo Marinho; Eduardo Krempser; Mélaine A Kuenemann; Olivier Sperandio; Laurent E Dardenne; Maria A Miteva
Journal:  Sci Rep       Date:  2021-02-04       Impact factor: 4.379

5.  Target-Specific Machine Learning Scoring Function Improved Structure-Based Virtual Screening Performance for SARS-CoV-2 Drugs Development.

Authors:  Muhammad Tahir Ul Qamar; Xi-Tong Zhu; Ling-Ling Chen; Laila Alhussain; Maha A Alshiekheid; Abdulrahman Theyab; Mohammad Algahtani
Journal:  Int J Mol Sci       Date:  2022-09-20       Impact factor: 6.208

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.