Literature DB >> 29672063

Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.

Thomas Lane1,2, Daniel P Russo1,3, Kimberley M Zorn1, Alex M Clark4, Alexandru Korotcov5, Valery Tkachenko5, Robert C Reynolds6, Alexander L Perryman7, Joel S Freundlich7,8, Sean Ekins1.   

Abstract

Tuberculosis is a global health dilemma. In 2016, the WHO reported 10.4 million incidences and 1.7 million deaths. The need to develop new treatments for those infected with Mycobacterium tuberculosis ( Mtb) has led to many large-scale phenotypic screens and many thousands of new active compounds identified in vitro. However, with limited funding, efforts to discover new active molecules against Mtb needs to be more efficient. Several computational machine learning approaches have been shown to have good enrichment and hit rates. We have curated small molecule Mtb data and developed new models with a total of 18,886 molecules with activity cutoffs of 10 μM, 1 μM, and 100 nM. These data sets were used to evaluate different machine learning methods (including deep learning) and metrics and to generate predictions for additional molecules published in 2017. One Mtb model, a combined in vitro and in vivo data Bayesian model at a 100 nM activity yielded the following metrics for 5-fold cross validation: accuracy = 0.88, precision = 0.22, recall = 0.91, specificity = 0.88, kappa = 0.31, and MCC = 0.41. We have also curated an evaluation set ( n = 153 compounds) published in 2017, and when used to test our model, it showed the comparable statistics (accuracy = 0.83, precision = 0.27, recall = 1.00, specificity = 0.81, kappa = 0.36, and MCC = 0.47). We have also compared these models with additional machine learning algorithms showing Bayesian machine learning models constructed with literature Mtb data generated by different laboratories generally were equivalent to or outperformed deep neural networks with external test sets. Finally, we have also compared our training and test sets to show they were suitably diverse and different in order to represent useful evaluation sets. Such Mtb machine learning models could help prioritize compounds for testing in vitro and in vivo.

Entities:  

Keywords:  deep learning; drug discovery; machine learning; support vector machine; tuberculosis

Mesh:

Substances:

Year:  2018        PMID: 29672063      PMCID: PMC6167198          DOI: 10.1021/acs.molpharmaceut.8b00083

Source DB:  PubMed          Journal:  Mol Pharm        ISSN: 1543-8384            Impact factor:   4.939


  75 in total

1.  Open Source Bayesian Models. 2. Mining a "Big Dataset" To Create and Validate Models with ChEMBL.

Authors:  Alex M Clark; Sean Ekins
Journal:  J Chem Inf Model       Date:  2015-06-03       Impact factor: 4.956

2.  Pyrazinamide inhibits the eukaryotic-like fatty acid synthetase I (FASI) of Mycobacterium tuberculosis.

Authors:  O Zimhony; J S Cox; J T Welch; C Vilchèze; W R Jacobs
Journal:  Nat Med       Date:  2000-09       Impact factor: 53.440

Review 3.  The Next Era: Deep Learning in Pharmaceutical Research.

Authors:  Sean Ekins
Journal:  Pharm Res       Date:  2016-09-06       Impact factor: 4.200

4.  Resazurin microtiter assay plate: simple and inexpensive method for detection of drug resistance in Mycobacterium tuberculosis.

Authors:  Juan-Carlos Palomino; Anandi Martin; Mirtha Camacho; Humberto Guerra; Jean Swings; Françoise Portaels
Journal:  Antimicrob Agents Chemother       Date:  2002-08       Impact factor: 5.191

5.  Antituberculosis activity of the molecular libraries screening center network library.

Authors:  Joseph A Maddry; Subramaniam Ananthan; Robert C Goldman; Judith V Hobrath; Cecil D Kwong; Clinton Maddox; Lynn Rasmussen; Robert C Reynolds; John A Secrist; Melinda I Sosa; E Lucile White; Wei Zhang
Journal:  Tuberculosis (Edinb)       Date:  2009-09-26       Impact factor: 3.131

6.  Computational models for drug inhibition of the human apical sodium-dependent bile acid transporter.

Authors:  Xiaowan Zheng; Sean Ekins; Jean-Pierre Raufman; James E Polli
Journal:  Mol Pharm       Date:  2009 Sep-Oct       Impact factor: 4.939

7.  AZOrange - High performance open source machine learning for QSAR modeling in a graphical programming environment.

Authors:  Jonna C Stålring; Lars A Carlsson; Pedro Almeida; Scott Boyer
Journal:  J Cheminform       Date:  2011-07-28       Impact factor: 5.514

8.  Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets.

Authors:  Alex M Clark; Krishna Dole; Anna Coulon-Spektor; Andrew McNutt; George Grass; Joel S Freundlich; Robert C Reynolds; Sean Ekins
Journal:  J Chem Inf Model       Date:  2015-06-03       Impact factor: 4.956

9.  Machine learning methods in chemoinformatics.

Authors:  John B O Mitchell
Journal:  Wiley Interdiscip Rev Comput Mol Sci       Date:  2014-09-01

10.  New target prediction and visualization tools incorporating open source molecular fingerprints for TB Mobile 2.0.

Authors:  Alex M Clark; Malabika Sarker; Sean Ekins
Journal:  J Cheminform       Date:  2014-08-04       Impact factor: 5.514

View more
  40 in total

1.  Machine Learning Platform to Discover Novel Growth Inhibitors of Neisseria gonorrhoeae.

Authors:  Janaina Cruz Pereira; Samer S Daher; Kimberley M Zorn; Matthew Sherwood; Riccardo Russo; Alexander L Perryman; Xin Wang; Madeleine J Freundlich; Sean Ekins; Joel S Freundlich
Journal:  Pharm Res       Date:  2020-07-13       Impact factor: 4.200

Review 2.  Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling.

Authors:  Linlin Zhao; Heather L Ciallella; Lauren M Aleksunes; Hao Zhu
Journal:  Drug Discov Today       Date:  2020-07-11       Impact factor: 7.851

3.  Doing it All - How Families are Reshaping Rare Disease Research.

Authors:  Sean Ekins; Ethan O Perlstein
Journal:  Pharm Res       Date:  2018-08-16       Impact factor: 4.200

4.  High-throughput screening and Bayesian machine learning for copper-dependent inhibitors of Staphylococcus aureus.

Authors:  Alex G Dalecki; Kimberley M Zorn; Alex M Clark; Sean Ekins; Whitney T Narmore; Nichole Tower; Lynn Rasmussen; Robert Bostwick; Olaf Kutsch; Frank Wolschendorf
Journal:  Metallomics       Date:  2019-03-20       Impact factor: 4.526

5.  Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI).

Authors:  Eni Minerali; Daniel H Foil; Kimberley M Zorn; Thomas R Lane; Sean Ekins
Journal:  Mol Pharm       Date:  2020-06-08       Impact factor: 4.939

Review 6.  Molecule Property Analyses of Active Compounds for Mycobacterium tuberculosis.

Authors:  Vadim Makarov; Elena Salina; Robert C Reynolds; Phyo Phyo Kyaw Zin; Sean Ekins
Journal:  J Med Chem       Date:  2020-04-20       Impact factor: 7.446

7.  A Machine Learning Strategy for Drug Discovery Identifies Anti-Schistosomal Small Molecules.

Authors:  Kimberley M Zorn; Shengxi Sun; Cecelia L McConnon; Kelley Ma; Eric K Chen; Daniel H Foil; Thomas R Lane; Lawrence J Liu; Nelly El-Sakkary; Danielle E Skinner; Sean Ekins; Conor R Caffrey
Journal:  ACS Infect Dis       Date:  2021-01-12       Impact factor: 5.084

8.  Comparing Machine Learning Models for Aromatase (P450 19A1).

Authors:  Kimberley M Zorn; Daniel H Foil; Thomas R Lane; Wendy Hillwalker; David J Feifarek; Frank Jones; William D Klaren; Ashley M Brinkman; Sean Ekins
Journal:  Environ Sci Technol       Date:  2020-11-19       Impact factor: 9.028

9.  Comparison of Machine Learning Models for the Androgen Receptor.

Authors:  Kimberley M Zorn; Daniel H Foil; Thomas R Lane; Wendy Hillwalker; David J Feifarek; Frank Jones; William D Klaren; Ashley M Brinkman; Sean Ekins
Journal:  Environ Sci Technol       Date:  2020-10-21       Impact factor: 9.028

10.  Identification of early liver toxicity gene biomarkers using comparative supervised machine learning.

Authors:  Brandi Patrice Smith; Loretta Sue Auvil; Michael Welge; Colleen Bannon Bushell; Rohit Bhargava; Navin Elango; Kamin Johnson; Zeynep Madak-Erdogan
Journal:  Sci Rep       Date:  2020-11-05       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.