Literature DB >> 30624935

Hit Dexter 2.0: Machine-Learning Models for the Prediction of Frequent Hitters.

Conrad Stork1, Ya Chen1, Martin Šícho1,2, Johannes Kirchmair1,3,4.   

Abstract

Assay interference caused by small molecules continues to pose a significant challenge for early drug discovery. A number of rule-based and similarity-based approaches have been derived that allow the flagging of potentially "badly behaving compounds", "bad actors", or "nuisance compounds". These compounds are typically aggregators, reactive compounds, and/or pan-assay interference compounds (PAINS), and many of them are frequent hitters. Hit Dexter is a recently introduced machine learning approach that predicts frequent hitters independent of the underlying physicochemical mechanisms (including also the binding of compounds based on "privileged scaffolds" to multiple binding sites). Here we report on the development of a second generation of machine learning models which now covers both primary screening assays and confirmatory dose-response assays. Protein sequence clustering was newly introduced to minimize the overrepresentation of structurally and functionally related proteins. The models correctly classified compounds of large independent test sets as (highly) promiscuous or nonpromiscuous with Matthews correlation coefficient (MCC) values of up to 0.64 and area under the receiver operating characteristic curve (AUC) values of up to 0.96. The models were also utilized to characterize sets of compounds with specific biological and physicochemical properties, such as dark chemical matter, aggregators, compounds from a high-throughput screening library, drug-like compounds, approved drugs, potential PAINS, and natural products. Among the most interesting outcomes is that the new Hit Dexter models predict the presence of large fractions of (highly) promiscuous compounds among approved drugs. Importantly, predictions of the individual Hit Dexter models are generally in good agreement and consistent with those of Badapple, an established statistical model for the prediction of frequent hitters. The new Hit Dexter 2.0 web service, available at http://hitdexter2.zbh.uni-hamburg.de , not only provides user-friendly access to all machine learning models presented in this work but also to similarity-based methods for the prediction of aggregators and dark chemical matter as well as a comprehensive collection of available rule sets for flagging frequent hitters and compounds including undesired substructures.

Mesh:

Substances:

Year:  2019        PMID: 30624935     DOI: 10.1021/acs.jcim.8b00677

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  10 in total

1.  Machine Learning in Drug Discovery: A Review.

Authors:  Suresh Dara; Swetha Dhamercherla; Surender Singh Jadav; Ch Madhu Babu; Mohamed Jawed Ahsan
Journal:  Artif Intell Rev       Date:  2021-08-11       Impact factor: 9.588

Review 2.  Applications of Deep-Learning in Exploiting Large-Scale and Heterogeneous Compound Data in Industrial Pharmaceutical Research.

Authors:  Laurianne David; Josep Arús-Pous; Johan Karlsson; Ola Engkvist; Esben Jannik Bjerrum; Thierry Kogej; Jan M Kriegl; Bernd Beck; Hongming Chen
Journal:  Front Pharmacol       Date:  2019-11-05       Impact factor: 5.810

3.  Identification of Compounds That Interfere with High-Throughput Screening Assay Technologies.

Authors:  Laurianne David; Jarrod Walsh; Noé Sturm; Isabella Feierberg; J Willem M Nissink; Hongming Chen; Jürgen Bajorath; Ola Engkvist
Journal:  ChemMedChem       Date:  2019-09-19       Impact factor: 3.466

4.  Discovery of novel selective PI3Kγ inhibitors through combining machine learning-based virtual screening with multiple protein structures and bio-evaluation.

Authors:  Jingyu Zhu; Kan Li; Lei Xu; Yanfei Cai; Yun Chen; Xinling Zhao; Huazhong Li; Gang Huang; Jian Jin
Journal:  J Adv Res       Date:  2021-04-20       Impact factor: 10.479

Review 5.  On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach.

Authors:  Sangsoo Lim; Sangseon Lee; Yinhua Piao; MinGyu Choi; Dongmin Bang; Jeonghyeon Gu; Sun Kim
Journal:  Comput Struct Biotechnol J       Date:  2022-08-05       Impact factor: 6.155

Review 6.  Progress and Impact of Latin American Natural Product Databases.

Authors:  Alejandro Gómez-García; José L Medina-Franco
Journal:  Biomolecules       Date:  2022-08-30

7.  Evolving scenario of big data and Artificial Intelligence (AI) in drug discovery.

Authors:  Manish Kumar Tripathi; Abhigyan Nath; Tej P Singh; A S Ethayathulla; Punit Kaur
Journal:  Mol Divers       Date:  2021-06-23       Impact factor: 3.364

8.  Improved correction of F508del-CFTR biogenesis with a folding facilitator and an inhibitor of protein ubiquitination.

Authors:  Jennifer L Goeckeler-Fried; Rajiah Aldrin Denny; Disha Joshi; Clare Hill; Mads B Larsen; Annette N Chiang; Raymond A Frizzell; Peter Wipf; Eric J Sorscher; Jeffrey L Brodsky
Journal:  Bioorg Med Chem Lett       Date:  2021-07-08       Impact factor: 2.940

Review 9.  Systematic review on the application of machine learning to quantitative structure-activity relationship modeling against Plasmodium falciparum.

Authors:  Osondu Everestus Oguike; Chikodili Helen Ugwuishiwu; Caroline Ngozi Asogwa; Charles Okeke Nnadi; Wilfred Ofem Obonga; Anthony Amaechi Attama
Journal:  Mol Divers       Date:  2022-01-22       Impact factor: 3.364

10.  Fine-tuning of a generative neural network for designing multi-target compounds.

Authors:  Thomas Blaschke; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2021-05-28       Impact factor: 4.179

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.