| Literature DB >> 32525674 |
Kevin McCloskey1, Eric A Sigel2, Steven Kearnes1, Ling Xue2, Xia Tian2, Dennis Moccia2,3, Diana Gikunju2, Sana Bazzaz2, Betty Chan2, Matthew A Clark2, John W Cuozzo2, Marie-Aude Guié2, John P Guilinger2, Christelle Huguet2, Christopher D Hupp2, Anthony D Keefe2, Christopher J Mulhern2, Ying Zhang2, Patrick Riley1.
Abstract
DNA-encoded small molecule libraries (DELs) have enabled discovery of novel inhibitors for many distinct protein targets of therapeutic value. We demonstrate a new approach applying machine learning to DEL selection data by identifying active molecules from large libraries of commercial and easily synthesizable compounds. We train models using only DEL selection data and apply automated or automatable filters to the predictions. We perform a large prospective study (∼2000 compounds) across three diverse protein targets: sEH (a hydrolase), ERα (a nuclear receptor), and c-KIT (a kinase). The approach is effective, with an overall hit rate of ∼30% at 30 μM and discovery of potent compounds (IC50 < 10 nM) for every target. The system makes useful predictions even for molecules dissimilar to the original DEL, and the compounds identified are diverse, predominantly drug-like, and different from known ligands. This work demonstrates a powerful new approach to hit-finding.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32525674 DOI: 10.1021/acs.jmedchem.0c00452
Source DB: PubMed Journal: J Med Chem ISSN: 0022-2623 Impact factor: 7.446