Literature DB >> 30114914

Comparing Multiple Machine Learning Algorithms and Metrics for Estrogen Receptor Binding Prediction.

Daniel P Russo1,2, Kimberley M Zorn1, Alex M Clark3, Hao Zhu2, Sean Ekins1.   

Abstract

Many chemicals that disrupt endocrine function have been linked to a variety of adverse biological outcomes. However, screening for endocrine disruption using in vitro or in vivo approaches is costly and time-consuming. Computational methods, e.g., quantitative structure-activity relationship models, have become more reliable due to bigger training sets, increased computing power, and advanced machine learning algorithms, such as multilayered artificial neural networks. Machine learning models can be used to predict compounds for endocrine disrupting capabilities, such as binding to the estrogen receptor (ER), and allow for prioritization and further testing. In this work, an exhaustive comparison of multiple machine learning algorithms, chemical spaces, and evaluation metrics for ER binding was performed on public data sets curated using in-house cheminformatics software (Assay Central). Chemical features utilized in modeling consisted of binary fingerprints (ECFP6, FCFP6, ToxPrint, or MACCS keys) and continuous molecular descriptors from RDKit. Each feature set was subjected to classic machine learning algorithms (Bernoulli Naive Bayes, AdaBoost Decision Tree, Random Forest, Support Vector Machine) and Deep Neural Networks (DNN). Models were evaluated using a variety of metrics: recall, precision, F1-score, accuracy, area under the receiver operating characteristic curve, Cohen's Kappa, and Matthews correlation coefficient. For predicting compounds within the training set, DNN has an accuracy higher than that of other methods; however, in 5-fold cross validation and external test set predictions, DNN and most classic machine learning models perform similarly regardless of the data set or molecular descriptors used. We have also used the rank normalized scores as a performance-criteria for each machine learning method, and Random Forest performed best on the validation set when ranked by metric or by data sets. These results suggest classic machine learning algorithms may be sufficient to develop high quality predictive models of ER activity.

Entities:  

Keywords:  Bayesian; deep learning; estrogen receptor; machine learning; support vector machine

Mesh:

Substances:

Year:  2018        PMID: 30114914      PMCID: PMC6181119          DOI: 10.1021/acs.molpharmaceut.8b00546

Source DB:  PubMed          Journal:  Mol Pharm        ISSN: 1543-8384            Impact factor:   4.939


  65 in total

1.  Analysis of pharmacology data and the prediction of adverse drug reactions and off-target effects from chemical structure.

Authors:  Andreas Bender; Josef Scheiber; Meir Glick; John W Davies; Kamal Azzaoui; Jacques Hamon; Laszlo Urban; Steven Whitebread; Jeremy L Jenkins
Journal:  ChemMedChem       Date:  2007-06       Impact factor: 3.466

2.  Open Source Bayesian Models. 2. Mining a "Big Dataset" To Create and Validate Models with ChEMBL.

Authors:  Alex M Clark; Sean Ekins
Journal:  J Chem Inf Model       Date:  2015-06-03       Impact factor: 4.956

3.  Deep-Learning-Based Drug-Target Interaction Prediction.

Authors:  Ming Wen; Zhimin Zhang; Shaoyu Niu; Haozhi Sha; Ruihan Yang; Yonghuan Yun; Hongmei Lu
Journal:  J Proteome Res       Date:  2017-03-13       Impact factor: 4.466

Review 4.  Endocrine disrupting chemicals targeting estrogen receptor signaling: identification and mechanisms of action.

Authors:  Erin K Shanle; Wei Xu
Journal:  Chem Res Toxicol       Date:  2010-11-05       Impact factor: 3.739

Review 5.  Deep learning for computational chemistry.

Authors:  Garrett B Goh; Nathan O Hodas; Abhinav Vishnu
Journal:  J Comput Chem       Date:  2017-03-08       Impact factor: 3.376

6.  Predictive Modeling of Estrogen Receptor Binding Agents Using Advanced Cheminformatics Tools and Massive Public Data.

Authors:  Kathryn Ribay; Marlene T Kim; Wenyi Wang; Daniel Pinolini; Hao Zhu
Journal:  Front Environ Sci       Date:  2016-03-08

Review 7.  The Next Era: Deep Learning in Pharmaceutical Research.

Authors:  Sean Ekins
Journal:  Pharm Res       Date:  2016-09-06       Impact factor: 4.200

8.  CDK-Taverna: an open workflow environment for cheminformatics.

Authors:  Thomas Kuhn; Egon L Willighagen; Achim Zielesny; Christoph Steinbeck
Journal:  BMC Bioinformatics       Date:  2010-03-29       Impact factor: 3.169

9.  Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets.

Authors:  Alex M Clark; Krishna Dole; Anna Coulon-Spektor; Andrew McNutt; George Grass; Joel S Freundlich; Robert C Reynolds; Sean Ekins
Journal:  J Chem Inf Model       Date:  2015-06-03       Impact factor: 4.956

Review 10.  A Curated Database of Rodent Uterotrophic Bioactivity.

Authors:  Nicole C Kleinstreuer; Patricia C Ceger; David G Allen; Judy Strickland; Xiaoqing Chang; Jonathan T Hamm; Warren M Casey
Journal:  Environ Health Perspect       Date:  2015-10-02       Impact factor: 9.031

View more
  47 in total

Review 1.  Déjà vu: Stimulating open drug discovery for SARS-CoV-2.

Authors:  Sean Ekins; Melina Mottin; Paulo R P S Ramos; Bruna K P Sousa; Bruno Junior Neves; Daniel H Foil; Kimberley M Zorn; Rodolpho C Braga; Megan Coffee; Christopher Southan; Ana C Puhl; Carolina Horta Andrade
Journal:  Drug Discov Today       Date:  2020-04-19       Impact factor: 7.851

Review 2.  Advancing computer-aided drug discovery (CADD) by big data and data-driven machine learning modeling.

Authors:  Linlin Zhao; Heather L Ciallella; Lauren M Aleksunes; Hao Zhu
Journal:  Drug Discov Today       Date:  2020-07-11       Impact factor: 7.851

3.  High-throughput screening and Bayesian machine learning for copper-dependent inhibitors of Staphylococcus aureus.

Authors:  Alex G Dalecki; Kimberley M Zorn; Alex M Clark; Sean Ekins; Whitney T Narmore; Nichole Tower; Lynn Rasmussen; Robert Bostwick; Olaf Kutsch; Frank Wolschendorf
Journal:  Metallomics       Date:  2019-03-20       Impact factor: 4.526

4.  Comparing Machine Learning Algorithms for Predicting Drug-Induced Liver Injury (DILI).

Authors:  Eni Minerali; Daniel H Foil; Kimberley M Zorn; Thomas R Lane; Sean Ekins
Journal:  Mol Pharm       Date:  2020-06-08       Impact factor: 4.939

5.  A Machine Learning Strategy for Drug Discovery Identifies Anti-Schistosomal Small Molecules.

Authors:  Kimberley M Zorn; Shengxi Sun; Cecelia L McConnon; Kelley Ma; Eric K Chen; Daniel H Foil; Thomas R Lane; Lawrence J Liu; Nelly El-Sakkary; Danielle E Skinner; Sean Ekins; Conor R Caffrey
Journal:  ACS Infect Dis       Date:  2021-01-12       Impact factor: 5.084

Review 6.  Big Data and Artificial Intelligence Modeling for Drug Discovery.

Authors:  Hao Zhu
Journal:  Annu Rev Pharmacol Toxicol       Date:  2019-09-13       Impact factor: 13.820

7.  Comparing Machine Learning Models for Aromatase (P450 19A1).

Authors:  Kimberley M Zorn; Daniel H Foil; Thomas R Lane; Wendy Hillwalker; David J Feifarek; Frank Jones; William D Klaren; Ashley M Brinkman; Sean Ekins
Journal:  Environ Sci Technol       Date:  2020-11-19       Impact factor: 9.028

8.  Application of Artificial Intelligence Methods to Pharmacy Data for Cancer Surveillance and Epidemiology Research: A Systematic Review.

Authors:  Andrew E Grothen; Bethany Tennant; Catherine Wang; Andrea Torres; Bonny Bloodgood Sheppard; Glenn Abastillas; Marina Matatova; Jeremy L Warner; Donna R Rivera
Journal:  JCO Clin Cancer Inform       Date:  2020-11

9.  Comparison of Machine Learning Models for the Androgen Receptor.

Authors:  Kimberley M Zorn; Daniel H Foil; Thomas R Lane; Wendy Hillwalker; David J Feifarek; Frank Jones; William D Klaren; Ashley M Brinkman; Sean Ekins
Journal:  Environ Sci Technol       Date:  2020-10-21       Impact factor: 9.028

10.  Machine Learning Models for Estrogen Receptor Bioactivity and Endocrine Disruption Prediction.

Authors:  Kimberley M Zorn; Daniel H Foil; Thomas R Lane; Daniel P Russo; Wendy Hillwalker; David J Feifarek; Frank Jones; William D Klaren; Ashley M Brinkman; Sean Ekins
Journal:  Environ Sci Technol       Date:  2020-09-15       Impact factor: 9.028

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.