Literature DB >> 34039411

Benchmarks for interpretation of QSAR models.

Mariia Matveieva1, Pavel Polishchuk2.   

Abstract

Interpretation of QSAR models is useful to understand the complex nature of biological or physicochemical processes, guide structural optimization or perform knowledge-based validation of QSAR models. Highly predictive models are usually complex and their interpretation is non-trivial. This is particularly true for modern neural networks. Various approaches to interpretation of these models exist. However, it is difficult to evaluate and compare performance and applicability of these ever-emerging methods. Herein, we developed several benchmark data sets with end-points determined by pre-defined patterns. These data sets are purposed for evaluation of the ability of interpretation approaches to retrieve these patterns. They represent tasks with different complexity levels: from simple atom-based additive properties to pharmacophore hypothesis. We proposed several quantitative metrics of interpretation performance. Applicability of benchmarks and metrics was demonstrated on a set of conventional models and end-to-end graph convolutional neural networks, interpreted by the previously suggested universal ML-agnostic approach for structural interpretation. We anticipate these benchmarks to be useful in evaluation of new interpretation approaches and investigation of decision making of complex "black box" models.

Entities:  

Keywords:  Atom contributions; Benchmark data set; Graph convolutional neural networks; Interpretability metrics; QSAR model interpretation; Synthetic data set

Year:  2021        PMID: 34039411     DOI: 10.1186/s13321-021-00519-x

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


  16 in total

1.  Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery.

Authors:  Xin Yang; Yifei Wang; Ryan Byrne; Gisbert Schneider; Shengyong Yang
Journal:  Chem Rev       Date:  2019-07-11       Impact factor: 60.622

2.  Interpretation of QSAR Models by Coloring Atoms According to Changes in Predicted Activity: How Robust Is It?

Authors:  Robert P Sheridan
Journal:  J Chem Inf Model       Date:  2019-03-04       Impact factor: 4.956

3.  Using attribution to decode binding mechanism in neural network models for chemistry.

Authors:  Kevin McCloskey; Ankur Taly; Federico Monti; Michael P Brenner; Lucy J Colwell
Journal:  Proc Natl Acad Sci U S A       Date:  2019-05-24       Impact factor: 11.205

Review 4.  Interpretation of Quantitative Structure-Activity Relationship Models: Past, Present, and Future.

Authors:  Pavel Polishchuk
Journal:  J Chem Inf Model       Date:  2017-10-13       Impact factor: 4.956

5.  Interpretation of Compound Activity Predictions from Complex Machine Learning Models Using Local Approximations and Shapley Values.

Authors:  Raquel Rodríguez-Pérez; Jürgen Bajorath
Journal:  J Med Chem       Date:  2019-09-26       Impact factor: 7.446

6.  A self-attention based message passing neural network for predicting molecular lipophilicity and aqueous solubility.

Authors:  Bowen Tang; Skyler T Kramer; Meijuan Fang; Yingkun Qiu; Zhen Wu; Dong Xu
Journal:  J Cheminform       Date:  2020-02-21       Impact factor: 5.514

7.  Unmasking Clever Hans predictors and assessing what machines really learn.

Authors:  Sebastian Lapuschkin; Stephan Wäldchen; Alexander Binder; Grégoire Montavon; Wojciech Samek; Klaus-Robert Müller
Journal:  Nat Commun       Date:  2019-03-11       Impact factor: 14.919

8.  Vulnerability of the Oceanic Whitetip Shark to Pelagic Longline Fisheries.

Authors:  Mariana Travassos Tolotti; Pascal Bach; Fábio Hazin; Paulo Travassos; Laurent Dagorn
Journal:  PLoS One       Date:  2015-10-22       Impact factor: 3.240

9.  Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods.

Authors:  Sereina Riniker; Gregory A Landrum
Journal:  J Cheminform       Date:  2013-09-24       Impact factor: 5.514

10.  Revealing cytotoxic substructures in molecules using deep learning.

Authors:  Henry E Webel; Talia B Kimber; Silke Radetzki; Martin Neuenschwander; Marc Nazaré; Andrea Volkamer
Journal:  J Comput Aided Mol Des       Date:  2020-04-16       Impact factor: 3.686

View more
  2 in total

1.  Prediction and Screening Model for Products Based on Fusion Regression and XGBoost Classification.

Authors:  Jiaju Wu; Linggang Kong; Ming Yi; Qiuxian Chen; Zheng Cheng; Hongfu Zuo; Yonghui Yang
Journal:  Comput Intell Neurosci       Date:  2022-07-31

2.  Improving QSAR Modeling for Predictive Toxicology using Publicly Aggregated Semantic Graph Data and Graph Neural Networks.

Authors:  Joseph D Romano; Yun Hao; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2022
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.