Interpretation of machine learning models using Shapley values: application to compound potency and multi-target activity predictions.

Raquel Rodríguez-Pérez, Jürgen Bajorath.

Abstract

Difficulties in interpreting machine learning (ML) models and their predictions limit both the practical applicability of ML in pharmaceutical research and confidence in it. There is a need for model-agnostic approaches that aid in the interpretation of ML models regardless of their complexity and that are also applicable to deep neural network (DNN) architectures and model ensembles. To these ends, the SHapley Additive exPlanations (SHAP) methodology has recently been introduced. The SHAP approach enables the identification and prioritization of features that determine compound classification and activity prediction using any ML model. Herein, we further extend the evaluation of the SHAP methodology by investigating a variant for the exact calculation of Shapley values for decision tree methods and systematically comparing this variant with the model-independent SHAP method in compound activity and potency value predictions. Moreover, new applications of the SHAP analysis approach are presented, including the interpretation of DNN models for the generation of multi-target activity profiles and of ensemble regression models for potency prediction.
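
To make the notion of a Shapley value concrete, the following minimal Python sketch computes exact Shapley values for a single prediction by enumerating every feature subset. It is an illustration only and not the authors' implementation or the SHAP library: the function `shapley_values`, the toy model `toy_model`, and the all-zero baseline are hypothetical, and features outside a coalition are simply set to the baseline value. Because the enumeration is exponential in the number of features, it is only practical for very small inputs, which is why approximate model-independent estimators and tree-specific exact algorithms are of interest.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one instance by brute-force subset enumeration.

    Assumption: a feature that is "absent" from a coalition is replaced by its
    baseline value. This is a simplification chosen for illustration.
    """
    n = len(x)

    def value(coalition):
        # Model output when only the features in `coalition` keep their real values.
        z = [x[j] if j in coalition else baseline[j] for j in range(n)]
        return predict(z)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for size in range(n):
            # Shapley weight |S|! (n - |S| - 1)! / n! for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                total += weight * (value(s | {i}) - value(s))
        phi.append(total)
    return phi


def toy_model(z):
    # Hypothetical "model" over three binary fingerprint bits:
    # bit 0 acts alone, bits 1 and 2 only contribute together.
    return 2.0 * z[0] + 1.0 * z[1] * z[2]


x = [1, 1, 1]          # compound with all three features present
baseline = [0, 0, 0]   # reference input with all features absent
phi = shapley_values(toy_model, x, baseline)
print(phi)
```

In this toy case the interaction term is split evenly between the two features that jointly produce it, and the contributions sum to the difference between the prediction for `x` and the prediction for the baseline, which is the additivity property that SHAP explanations rely on.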

Keywords:  Black box character; Compound activity; Compound potency prediction; Feature importance; Machine learning; Model interpretation; Multi-target modeling; Shapley values; Structure–activity relationships

Year: 2020; PMID: 32361862; PMCID: PMC7449951; DOI: 10.1007/s10822-020-00314-0

Source DB: PubMed; Journal: J Comput Aided Mol Des; ISSN: 0920-654X; Impact factor: 3.686


