Literature DB >> 33513003

Trade-off Predictivity and Explainability for Machine-Learning Powered Predictive Toxicology: An in-Depth Investigation with Tox21 Data Sets.

Leihong Wu1, Ruili Huang2, Igor V Tetko3,4, Zhonghua Xia3, Joshua Xu1, Weida Tong1.   

Abstract

Selecting a model in predictive toxicology often involves a trade-off between prediction performance and explainability: should we sacrifice the model performance to gain explainability or vice versa. Here we present a comprehensive study to assess algorithm and feature influences on model performance in chemical toxicity research. We conducted over 5000 models for a Tox21 bioassay data set of 65 assays and ∼7600 compounds. Seven molecular representations as features and 12 modeling approaches varying in complexity and explainability were employed to systematically investigate the impact of various factors on model performance and explainability. We demonstrated that end points dictated a model's performance, regardless of the chosen modeling approach including deep learning and chemical features. Overall, more complex models such as (LS-)SVM and Random Forest performed marginally better than simpler models such as linear regression and KNN in the presented Tox21 data analysis. Since a simpler model with acceptable performance often also is easy to interpret for the Tox21 data set, it clearly was the preferred choice due to its better explainability. Given that each data set had its own error structure both for dependent and independent variables, we strongly recommend that it is important to conduct a systematic study with a broad range of model complexity and feature explainability to identify model balancing its predictivity and explainability.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33513003      PMCID: PMC8867471          DOI: 10.1021/acs.chemrestox.0c00373

Source DB:  PubMed          Journal:  Chem Res Toxicol        ISSN: 0893-228X            Impact factor:   3.739


  29 in total

1.  New publicly available chemical query language, CSRML, to support chemotype representations for application to data mining and modeling.

Authors:  Chihae Yang; Aleksey Tarkhov; Jörg Marusczyk; Bruno Bienfait; Johann Gasteiger; Thomas Kleinoeder; Tomasz Magdziarz; Oliver Sacher; Christof H Schwab; Johannes Schwoebel; Lothar Terfloth; Kirk Arvidson; Ann Richard; Andrew Worth; James Rathman
Journal:  J Chem Inf Model       Date:  2015-02-19       Impact factor: 4.956

2.  Mold(2), molecular descriptors from 2D structures for chemoinformatics and toxicoinformatics.

Authors:  Huixiao Hong; Qian Xie; Weigong Ge; Feng Qian; Hong Fang; Leming Shi; Zhenqiang Su; Roger Perkins; Weida Tong
Journal:  J Chem Inf Model       Date:  2008-06-20       Impact factor: 4.956

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Comparative Study of Multitask Toxicity Modeling on a Broad Chemical Space.

Authors:  Sergey Sosnin; Dmitry Karlov; Igor V Tetko; Maxim V Fedorov
Journal:  J Chem Inf Model       Date:  2019-01-23       Impact factor: 4.956

5.  The Tox21 10K Compound Library: Collaborative Chemistry Advancing Toxicology.

Authors:  Ann M Richard; Ruili Huang; Suramya Waidyanatha; Paul Shinn; Bradley J Collins; Inthirany Thillainadarajah; Christopher M Grulke; Antony J Williams; Ryan R Lougee; Richard S Judson; Keith A Houck; Mahmoud Shobair; Chihae Yang; James F Rathman; Adam Yasgar; Suzanne C Fitzpatrick; Anton Simeonov; Russell S Thomas; Kevin M Crofton; Richard S Paules; John R Bucher; Christopher P Austin; Robert J Kavlock; Raymond R Tice
Journal:  Chem Res Toxicol       Date:  2020-11-03       Impact factor: 3.739

6.  High lipophilicity and high daily dose of oral medications are associated with significant risk for drug-induced liver injury.

Authors:  Minjun Chen; Jürgen Borlak; Weida Tong
Journal:  Hepatology       Date:  2013-05-27       Impact factor: 17.425

Review 7.  Drug-induced liver injury severity and toxicity (DILIst): binary classification of 1279 drugs by human hepatotoxicity.

Authors:  Shraddha Thakkar; Ting Li; Zhichao Liu; Leihong Wu; Ruth Roberts; Weida Tong
Journal:  Drug Discov Today       Date:  2019-11-01       Impact factor: 7.851

8.  Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information.

Authors:  Iurii Sushko; Sergii Novotarskyi; Robert Körner; Anil Kumar Pandey; Matthias Rupp; Wolfram Teetz; Stefan Brandmaier; Ahmed Abdelaziz; Volodymyr V Prokopenko; Vsevolod Y Tanchuk; Roberto Todeschini; Alexandre Varnek; Gilles Marcou; Peter Ertl; Vladimir Potemkin; Maria Grishina; Johann Gasteiger; Christof Schwab; Igor I Baskin; Vladimir A Palyulin; Eugene V Radchenko; William J Welsh; Vladyslav Kholodovych; Dmitriy Chekmarev; Artem Cherkasov; Joao Aires-de-Sousa; Qing-You Zhang; Andreas Bender; Florian Nigsch; Luc Patiny; Antony Williams; Valery Tkachenko; Igor V Tetko
Journal:  J Comput Aided Mol Des       Date:  2011-06-10       Impact factor: 3.686

9.  Development of Decision Forest Models for Prediction of Drug-Induced Liver Injury in Humans Using A Large Set of FDA-approved Drugs.

Authors:  Huixiao Hong; Shraddha Thakkar; Minjun Chen; Weida Tong
Journal:  Sci Rep       Date:  2017-12-11       Impact factor: 4.379

10.  Extended Functional Groups (EFG): An Efficient Set for Chemical Characterization and Structure-Activity Relationship Studies of Chemical Compounds.

Authors:  Elena S Salmina; Norbert Haider; Igor V Tetko
Journal:  Molecules       Date:  2015-12-23       Impact factor: 4.411

View more
  2 in total

Review 1.  A review of explainable and interpretable AI with applications in COVID-19 imaging.

Authors:  Jordan D Fuhrman; Naveena Gorre; Qiyuan Hu; Hui Li; Issam El Naqa; Maryellen L Giger
Journal:  Med Phys       Date:  2021-12-07       Impact factor: 4.506

2.  Class imbalance learning with Bayesian optimization applied in drug discovery.

Authors:  Shenmin Guan; Ning Fu
Journal:  Sci Rep       Date:  2022-02-08       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.