| Literature DB >> 32857505 |
Kimberley M Zorn1, Daniel H Foil1, Thomas R Lane1, Daniel P Russo2, Wendy Hillwalker3, David J Feifarek3, Frank Jones3, William D Klaren3, Ashley M Brinkman3, Sean Ekins1.
Abstract
The U.S. Environmental Protection Agency (EPA) periodically releases in vitro data across a variety of targets, including the estrogen receptor (ER). In 2015, the EPA used these data to construct mathematical models of ER agonist and antagonist pathways to prioritize chemicals for endocrine disruption testing. However, mathematical models require in vitro data prior to predicting estrogenic activity, but machine learning methods are capable of prospective prediction from the molecular structure alone. The current study describes the generation and evaluation of Bayesian machine learning models grouped by the EPA's ER agonist pathway model using multiple data types with proprietary software, Assay Central. External predictions with three test sets of in vitro and in vivo reference chemicals with agonist activity classifications were compared to previous mathematical model publications. Training data sets were subjected to additional machine learning algorithms and compared with rank normalized scores of internal five-fold cross-validation statistics. External predictions were found to be comparable or superior to previous studies published by the EPA. When assessing six additional algorithms for the training data sets, Assay Central performed similarly at a reduced computational cost. This study demonstrates that machine learning can prioritize chemicals for future in vitro and in vivo testing of ER agonism.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32857505 PMCID: PMC8194504 DOI: 10.1021/acs.est.0c03982
Source DB: PubMed Journal: Environ Sci Technol ISSN: 0013-936X Impact factor: 9.028