Alexander L Perryman1, Jimmy S Patel1, Riccardo Russo2, Eric Singleton2, Nancy Connell2, Sean Ekins3, Joel S Freundlich4,5. 1. Department of Pharmacology, Physiology and Neuroscience, and Medicine, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. 2. Division of Infectious Diseases, Department of Medicine, and the Ruy V. Lourenço Center for the Study of Emerging and Re-emerging Pathogens, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. 3. Collaborations Pharmaceuticals, Inc., Main Campus Drive Lab 3510, Raleigh, North Carolina,, 27606, USA. 4. Department of Pharmacology, Physiology and Neuroscience, and Medicine, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. freundjs@rutgers.edu. 5. Division of Infectious Diseases, Department of Medicine, and the Ruy V. Lourenço Center for the Study of Emerging and Re-emerging Pathogens, Rutgers University-New Jersey Medical School, Medical Sciences Building, I-503, 185 South Orange Ave, Newark, NJ, 07103, USA. freundjs@rutgers.edu.
Abstract
PURPOSE: To advance translational research of potential therapeutic small molecules against infectious microbes, the compounds must display a relative lack of mammalian cell cytotoxicity. Vero cell cytotoxicity (CC50) is a common initial assay for this metric. We explored the development of naïve Bayesian models that can enhance the probability of identifying non-cytotoxic compounds. METHODS: Vero cell cytotoxicity assays were identified in PubChem, reformatted, and curated to create a training set with 8741 unique small molecules. These data were used to develop Bayesian classifiers, which were assessed with internal cross-validation, external tests with a set of 193 compounds from our laboratory, and independent validation with an additional diverse set of 1609 unique compounds from PubChem. RESULTS: Evaluation with independent, external test and validation sets indicated that cytotoxicity Bayesian models constructed with the ECFP_6 descriptor were more accurate than those that used FCFP_6 fingerprints. The best cytotoxicity Bayesian model displayed predictive power in external evaluations, according to conventional and chance-corrected statistics, as well as enrichment factors. CONCLUSIONS: The results from external tests demonstrate that our novel cytotoxicity Bayesian model displays sufficient predictive power to help guide translational research. To assist the chemical tool and drug discovery communities, our curated training set is being distributed as part of the Supplementary Material. Graphical Abstract Naive Bayesian models have been trained with publically available data and offer a useful tool for chemical biology and drug discovery to select for small molecules with a high probability of exhibiting acceptably low Vero cell cytotoxicity.
PURPOSE: To advance translational research of potential therapeutic small molecules against infectious microbes, the compounds must display a relative lack of mammalian cell cytotoxicity. Vero cell cytotoxicity (CC50) is a common initial assay for this metric. We explored the development of naïve Bayesian models that can enhance the probability of identifying non-cytotoxic compounds. METHODS:Vero cell cytotoxicity assays were identified in PubChem, reformatted, and curated to create a training set with 8741 unique small molecules. These data were used to develop Bayesian classifiers, which were assessed with internal cross-validation, external tests with a set of 193 compounds from our laboratory, and independent validation with an additional diverse set of 1609 unique compounds from PubChem. RESULTS: Evaluation with independent, external test and validation sets indicated that cytotoxicity Bayesian models constructed with the ECFP_6 descriptor were more accurate than those that used FCFP_6 fingerprints. The best cytotoxicity Bayesian model displayed predictive power in external evaluations, according to conventional and chance-corrected statistics, as well as enrichment factors. CONCLUSIONS: The results from external tests demonstrate that our novel cytotoxicity Bayesian model displays sufficient predictive power to help guide translational research. To assist the chemical tool and drug discovery communities, our curated training set is being distributed as part of the Supplementary Material. Graphical Abstract Naive Bayesian models have been trained with publically available data and offer a useful tool for chemical biology and drug discovery to select for small molecules with a high probability of exhibiting acceptably low Vero cell cytotoxicity.
Authors: J N Weinstein; T G Myers; P M O'Connor; S H Friend; A J Fornace; K W Kohn; T Fojo; S E Bates; L V Rubinstein; N L Anderson; J K Buolamwini; W W van Osdol; A P Monks; D A Scudiero; E A Sausville; D W Zaharevitz; B Bunow; V N Viswanadhan; G S Johnson; R E Wittes; K D Paull Journal: Science Date: 1997-01-17 Impact factor: 47.728
Authors: Lewis H Mervin; Qing Cao; Ian P Barrett; Mike A Firth; David Murray; Lisa McWilliams; Malcolm Haddrick; Mark Wigglesworth; Ola Engkvist; Andreas Bender Journal: ACS Chem Biol Date: 2016-09-13 Impact factor: 5.100
Authors: Scott G Franzblau; Mary Ann DeGroote; Sang Hyun Cho; Koen Andries; Eric Nuermberger; Ian M Orme; Khisimuzi Mdluli; Iñigo Angulo-Barturen; Thomas Dick; Veronique Dartois; Anne J Lenaerts Journal: Tuberculosis (Edinb) Date: 2012-08-30 Impact factor: 3.131
Authors: Janaina Cruz Pereira; Samer S Daher; Kimberley M Zorn; Matthew Sherwood; Riccardo Russo; Alexander L Perryman; Xin Wang; Madeleine J Freundlich; Sean Ekins; Joel S Freundlich Journal: Pharm Res Date: 2020-07-13 Impact factor: 4.200
Authors: Kimberley M Zorn; Shengxi Sun; Cecelia L McConnon; Kelley Ma; Eric K Chen; Daniel H Foil; Thomas R Lane; Lawrence J Liu; Nelly El-Sakkary; Danielle E Skinner; Sean Ekins; Conor R Caffrey Journal: ACS Infect Dis Date: 2021-01-12 Impact factor: 5.084
Authors: Sean Ekins; Ana C Puhl; Kimberley M Zorn; Thomas R Lane; Daniel P Russo; Jennifer J Klein; Anthony J Hickey; Alex M Clark Journal: Nat Mater Date: 2019-04-18 Impact factor: 43.841
Authors: Melina Mottin; Lindsay K Caesar; David Brodsky; Nathalya C M R Mesquita; Ketllyn Zagato de Oliveira; Gabriela Dias Noske; Bruna K P Sousa; Paulo R P S Ramos; Hannah Jarmer; Bonnie Loh; Kimberley M Zorn; Daniel H Foil; Pedro M Torres; Rafael V C Guido; Glaucius Oliva; Frank Scholle; Sean Ekins; Nadja B Cech; Carolina H Andrade; Scott M Laster Journal: Bioorg Chem Date: 2022-01-31 Impact factor: 5.307
Authors: Jimmy S Patel; Javiera Norambuena; Hassan Al-Tameemi; Yong-Mo Ahn; Alexander L Perryman; Xin Wang; Samer S Daher; James Occi; Riccardo Russo; Steven Park; Matthew Zimmerman; Hsin-Pin Ho; David S Perlin; Véronique Dartois; Sean Ekins; Pradeep Kumar; Nancy Connell; Jeffrey M Boyd; Joel S Freundlich Journal: ACS Infect Dis Date: 2021-08-03 Impact factor: 5.578