Literature DB >> 25137527

Introduction of a methodology for visualization and graphical interpretation of Bayesian classification models.

Jenny Balfer1, Jürgen Bajorath.   

Abstract

Supervised machine learning models are widely used in chemoinformatics, especially for the prediction of new active compounds or targets of known actives. Bayesian classification methods are among the most popular machine learning approaches for the prediction of activity from chemical structure. Much work has focused on predicting structure-activity relationships (SARs) on the basis of experimental training data. By contrast, only a few efforts have thus far been made to rationalize the performance of Bayesian or other supervised machine learning models and better understand why they might succeed or fail. In this study, we introduce an intuitive approach for the visualization and graphical interpretation of naïve Bayesian classification models. Parameters derived during supervised learning are visualized and interactively analyzed to gain insights into model performance and identify features that determine predictions. The methodology is introduced in detail and applied to assess Bayesian modeling efforts and predictions on compound data sets of varying structural complexity. Different classification models and features determining their performance are characterized in detail. A prototypic implementation of the approach is provided.

Mesh:

Year:  2014        PMID: 25137527     DOI: 10.1021/ci500410g

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  5 in total

1.  Interpretation of machine learning models using shapley values: application to compound potency and multi-target activity predictions.

Authors:  Raquel Rodríguez-Pérez; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2020-05-02       Impact factor: 3.686

2.  Bayesian models trained with HTS data for predicting β-haematin inhibition and in vitro antimalarial activity.

Authors:  Kathryn J Wicht; Jill M Combrinck; Peter J Smith; Timothy J Egan
Journal:  Bioorg Med Chem       Date:  2014-12-20       Impact factor: 3.641

3.  Explaining Support Vector Machines: A Color Based Nomogram.

Authors:  Vanya Van Belle; Ben Van Calster; Sabine Van Huffel; Johan A K Suykens; Paulo Lisboa
Journal:  PLoS One       Date:  2016-10-10       Impact factor: 3.240

4.  Implicit-descriptor ligand-based virtual screening by means of collaborative filtering.

Authors:  Raghuram Srinivas; Pavel V Klimovich; Eric C Larson
Journal:  J Cheminform       Date:  2018-11-22       Impact factor: 5.514

5.  Explainable machine learning predictions of dual-target compounds reveal characteristic structural features.

Authors:  Christian Feldmann; Maren Philipps; Jürgen Bajorath
Journal:  Sci Rep       Date:  2021-11-03       Impact factor: 4.379

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.