Literature DB >> 24968215

Are bigger data sets better for machine learning? Fusing single-point and dual-event dose response data for Mycobacterium tuberculosis.

Sean Ekins1, Joel S Freundlich, Robert C Reynolds.   

Abstract

Tuberculosis is a major, neglected disease for which the quest to find new treatments continues. There is an abundance of data from large phenotypic screens in the public domain against Mycobacterium tuberculosis (Mtb). Since machine learning methods can learn from past data, we were interested in addressing whether more data builds better models. We now describe using Bayesian machine learning to assess whether we can improve our models by combining the large quantities of single-point data with the much smaller (higher quality) dual-event data sets, which use both dose-response data for both whole-cell antitubercular activity and Vero cell cytotoxicity. We have evaluated 12 models ranging from different single-point, dual-event dose-response, single-point and dual-event dose-response as well as combined data sets for three distinct data sets from the same laboratory. We used a fourth data set of active and inactive compounds from the same group as well as a smaller set of 177 active compounds from GlaxoSmithKline as test sets. Our data suggest combining single-point with dual-event dose-response data does not diminish the internal or external predictive ability of the models based on the receiver operator curve (ROC) for these models (internal ROC range 0.83-0.91, external ROC range 0.62-0.83) compared to the orders of magnitude smaller dual-event models (internal ROC range 0.6-0.83 and external ROC 0.54-0.83). In conclusion, models developed with 1200-5000 compounds appear to be as predictive as those generated with 25 000-350 000 molecules. Our results have implications for justifying further high-throughput screening versus focused testing based on model predictions.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24968215      PMCID: PMC4951206          DOI: 10.1021/ci500264r

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  55 in total

1.  Analysis of pharmacology data and the prediction of adverse drug reactions and off-target effects from chemical structure.

Authors:  Andreas Bender; Josef Scheiber; Meir Glick; John W Davies; Kamal Azzaoui; Jacques Hamon; Laszlo Urban; Steven Whitebread; Jeremy L Jenkins
Journal:  ChemMedChem       Date:  2007-06       Impact factor: 3.466

2.  Depletion of antibiotic targets has widely varying effects on growth.

Authors:  Jun-Rong Wei; Vidhya Krishnamoorthy; Kenan Murphy; Jee-Hyun Kim; Dirk Schnappinger; Tom Alber; Christopher M Sassetti; Kyu Y Rhee; Eric J Rubin
Journal:  Proc Natl Acad Sci U S A       Date:  2011-02-22       Impact factor: 11.205

Review 3.  Why are membrane targets discovered by phenotypic screens and genome sequencing in Mycobacterium tuberculosis?

Authors:  Robert C Goldman
Journal:  Tuberculosis (Edinb)       Date:  2013-09-18       Impact factor: 3.131

Review 4.  Computational databases, pathway and cheminformatics tools for tuberculosis drug discovery.

Authors:  Sean Ekins; Joel S Freundlich; Inhee Choi; Malabika Sarker; Carolyn Talcott
Journal:  Trends Microbiol       Date:  2010-12-02       Impact factor: 17.079

5.  A collaborative database and computational models for tuberculosis drug discovery.

Authors:  Sean Ekins; Justin Bradford; Krishna Dole; Anna Spektor; Kellan Gregory; David Blondeau; Moses Hohman; Barry A Bunin
Journal:  Mol Biosyst       Date:  2010-02-09

6.  Antituberculosis activity of the molecular libraries screening center network library.

Authors:  Joseph A Maddry; Subramaniam Ananthan; Robert C Goldman; Judith V Hobrath; Cecil D Kwong; Clinton Maddox; Lynn Rasmussen; Robert C Reynolds; John A Secrist; Melinda I Sosa; E Lucile White; Wei Zhang
Journal:  Tuberculosis (Edinb)       Date:  2009-09-26       Impact factor: 3.131

7.  In silico activity profiling reveals the mechanism of action of antimalarials discovered in a high-throughput screen.

Authors:  David Plouffe; Achim Brinker; Case McNamara; Kerstin Henson; Nobutaka Kato; Kelli Kuhen; Advait Nagle; Francisco Adrián; Jason T Matzen; Paul Anderson; Tae-Gyu Nam; Nathanael S Gray; Arnab Chatterjee; Jeff Janes; S Frank Yan; Richard Trager; Jeremy S Caldwell; Peter G Schultz; Yingyao Zhou; Elizabeth A Winzeler
Journal:  Proc Natl Acad Sci U S A       Date:  2008-06-25       Impact factor: 11.205

Review 8.  A medicinal chemists' guide to the unique difficulties of lead optimization for tuberculosis.

Authors:  Véronique Dartois; Clifton E Barry
Journal:  Bioorg Med Chem Lett       Date:  2013-07-12       Impact factor: 2.823

9.  Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence.

Authors:  S T Cole; R Brosch; J Parkhill; T Garnier; C Churcher; D Harris; S V Gordon; K Eiglmeier; S Gas; C E Barry; F Tekaia; K Badcock; D Basham; D Brown; T Chillingworth; R Connor; R Davies; K Devlin; T Feltwell; S Gentles; N Hamlin; S Holroyd; T Hornsby; K Jagels; A Krogh; J McLean; S Moule; L Murphy; K Oliver; J Osborne; M A Quail; M A Rajandream; J Rogers; S Rutter; K Seeger; J Skelton; R Squares; S Squares; J E Sulston; K Taylor; S Whitehead; B G Barrell
Journal:  Nature       Date:  1998-06-11       Impact factor: 49.962

Review 10.  Comprehensive analysis of methods used for the evaluation of compounds against Mycobacterium tuberculosis.

Authors:  Scott G Franzblau; Mary Ann DeGroote; Sang Hyun Cho; Koen Andries; Eric Nuermberger; Ian M Orme; Khisimuzi Mdluli; Iñigo Angulo-Barturen; Thomas Dick; Veronique Dartois; Anne J Lenaerts
Journal:  Tuberculosis (Edinb)       Date:  2012-08-30       Impact factor: 3.131

View more
  20 in total

Review 1.  Molecule Property Analyses of Active Compounds for Mycobacterium tuberculosis.

Authors:  Vadim Makarov; Elena Salina; Robert C Reynolds; Phyo Phyo Kyaw Zin; Sean Ekins
Journal:  J Med Chem       Date:  2020-04-20       Impact factor: 7.446

Review 2.  Collaborative drug discovery for More Medicines for Tuberculosis (MM4TB).

Authors:  Sean Ekins; Anna Coulon Spektor; Alex M Clark; Krishna Dole; Barry A Bunin
Journal:  Drug Discov Today       Date:  2016-11-22       Impact factor: 7.851

Review 3.  The Next Era: Deep Learning in Pharmaceutical Research.

Authors:  Sean Ekins
Journal:  Pharm Res       Date:  2016-09-06       Impact factor: 4.200

4.  Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery.

Authors:  Thomas Lane; Daniel P Russo; Kimberley M Zorn; Alex M Clark; Alexandru Korotcov; Valery Tkachenko; Robert C Reynolds; Alexander L Perryman; Joel S Freundlich; Sean Ekins
Journal:  Mol Pharm       Date:  2018-04-26       Impact factor: 4.939

5.  Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets.

Authors:  Alexandru Korotcov; Valery Tkachenko; Daniel P Russo; Sean Ekins
Journal:  Mol Pharm       Date:  2017-11-13       Impact factor: 4.939

6.  Mycobacterium abscessus drug discovery using machine learning.

Authors:  Alan A Schmalstig; Kimberley M Zorn; Sebastian Murcia; Andrew Robinson; Svetlana Savina; Elena Komarova; Vadim Makarov; Miriam Braunstein; Sean Ekins
Journal:  Tuberculosis (Edinb)       Date:  2022-01-20       Impact factor: 3.131

7.  Predicting Mouse Liver Microsomal Stability with "Pruned" Machine Learning Models and Public Data.

Authors:  Alexander L Perryman; Thomas P Stratton; Sean Ekins; Joel S Freundlich
Journal:  Pharm Res       Date:  2015-09-28       Impact factor: 4.200

8.  Bigger data, collaborative tools and the future of predictive drug discovery.

Authors:  Sean Ekins; Alex M Clark; S Joshua Swamidass; Nadia Litterman; Antony J Williams
Journal:  J Comput Aided Mol Des       Date:  2014-06-19       Impact factor: 3.686

9.  Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery.

Authors:  Thomas R Lane; Daniel H Foil; Eni Minerali; Fabio Urbina; Kimberley M Zorn; Sean Ekins
Journal:  Mol Pharm       Date:  2020-12-16       Impact factor: 4.939

10.  Quantum Machine Learning Algorithms for Drug Discovery Applications.

Authors:  Kushal Batra; Kimberley M Zorn; Daniel H Foil; Eni Minerali; Victor O Gawriljuk; Thomas R Lane; Sean Ekins
Journal:  J Chem Inf Model       Date:  2021-05-25       Impact factor: 6.162

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.