Literature DB >> 23504478

QSAR with experimental and predictive distributions: an information theoretic approach for assessing model quality.

David J Wood1, Lars Carlsson, Martin Eklund, Ulf Norinder, Jonna Stålring.   

Abstract

We propose that quantitative structure-activity relationship (QSAR) predictions should be explicitly represented as predictive (probability) distributions. If both predictions and experimental measurements are treated as probability distributions, the quality of a set of predictive distributions output by a model can be assessed with Kullback-Leibler (KL) divergence: a widely used information theoretic measure of the distance between two probability distributions. We have assessed a range of different machine learning algorithms and error estimation methods for producing predictive distributions with an analysis against three of AstraZeneca's global DMPK datasets. Using the KL-divergence framework, we have identified a few combinations of algorithms that produce accurate and valid compound-specific predictive distributions. These methods use reliability indices to assign predictive distributions to the predictions output by QSAR models so that reliable predictions have tight distributions and vice versa. Finally we show how valid predictive distributions can be used to estimate the probability that a test compound has properties that hit single- or multi- objective target profiles.

Entities:  

Mesh:

Year:  2013        PMID: 23504478      PMCID: PMC3639359          DOI: 10.1007/s10822-013-9639-5

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  24 in total

Review 1.  Multi-parameter optimization: identifying high quality compounds with a balance of properties.

Authors:  Matthew D Segall
Journal:  Curr Pharm Des       Date:  2012       Impact factor: 3.116

2.  Three useful dimensions for domain applicability in QSAR models using random forest.

Authors:  Robert P Sheridan
Journal:  J Chem Inf Model       Date:  2012-03-09       Impact factor: 4.956

3.  Applicability domains for classification problems: Benchmarking of distance to models for Ames mutagenicity set.

Authors:  Iurii Sushko; Sergii Novotarskyi; Robert Körner; Anil Kumar Pandey; Artem Cherkasov; Jiazhong Li; Paola Gramatica; Katja Hansen; Timon Schroeter; Klaus-Robert Müller; Lili Xi; Huanxiang Liu; Xiaojun Yao; Tomas Öberg; Farhad Hormozdiari; Phuong Dao; Cenk Sahinalp; Roberto Todeschini; Pavel Polishchuk; Anatoliy Artemenko; Victor Kuz'min; Todd M Martin; Douglas M Young; Denis Fourches; Eugene Muratov; Alexander Tropsha; Igor Baskin; Dragos Horvath; Gilles Marcou; Christophe Muller; Alexander Varnek; Volodymyr V Prokopenko; Igor V Tetko
Journal:  J Chem Inf Model       Date:  2010-10-29       Impact factor: 4.956

4.  Similarity to molecules in the training set is a good discriminator for prediction accuracy in QSAR.

Authors:  Robert P Sheridan; Bradley P Feuston; Vladimir N Maiorov; Simon K Kearsley
Journal:  J Chem Inf Comput Sci       Date:  2004 Nov-Dec

Review 5.  Focus on success: using a probabilistic approach to achieve an optimal balance of compound properties in drug discovery.

Authors:  Matt D Segall; Alan P Beresford; Joelle Mr Gola; Dan Hawksley; Mike H Tarbit
Journal:  Expert Opin Drug Metab Toxicol       Date:  2006-04       Impact factor: 4.481

6.  Application of belief theory to similarity data fusion for use in analog searching and lead hopping.

Authors:  Steven W Muchmore; Derek A Debe; James T Metz; Scott P Brown; Yvonne C Martin; Philip J Hajduk
Journal:  J Chem Inf Model       Date:  2008-04-17       Impact factor: 4.956

Review 7.  Strategies for the generation, validation and application of in silico ADMET models in lead generation and optimization.

Authors:  Matthew Paul Gleeson; Dino Montanari
Journal:  Expert Opin Drug Metab Toxicol       Date:  2012-07-31       Impact factor: 4.481

8.  pH-dependent bidirectional transport of weakly basic drugs across Caco-2 monolayers: implications for drug-drug interactions.

Authors:  Sibylle Neuhoff; Anna-Lena Ungell; Ismael Zamora; Per Artursson
Journal:  Pharm Res       Date:  2003-08       Impact factor: 4.200

9.  Critical assessment of QSAR models of environmental toxicity against Tetrahymena pyriformis: focusing on applicability domain and overfitting by variable selection.

Authors:  Igor V Tetko; Iurii Sushko; Anil Kumar Pandey; Hao Zhu; Alexander Tropsha; Ester Papa; Tomas Oberg; Roberto Todeschini; Denis Fourches; Alexandre Varnek
Journal:  J Chem Inf Model       Date:  2008-08-26       Impact factor: 4.956

10.  The C1C2: a framework for simultaneous model selection and assessment.

Authors:  Martin Eklund; Ola Spjuth; Jarl Es Wikberg
Journal:  BMC Bioinformatics       Date:  2008-09-02       Impact factor: 3.169

View more
  5 in total

1.  Assessment of uncertainty in chemical models by Bayesian probabilities: Why, when, how?

Authors:  Ullrika Sahlin
Journal:  J Comput Aided Mol Des       Date:  2014-12-10       Impact factor: 3.686

2.  Time dependent analysis of assay comparability: a novel approach to understand intra- and inter-site variability over time.

Authors:  Susanne Winiwarter; Brian Middleton; Barry Jones; Paul Courtney; Bo Lindmark; Ken M Page; Alan Clark; Claire Landqvist
Journal:  J Comput Aided Mol Des       Date:  2015-02-20       Impact factor: 3.686

3.  Using beta binomials to estimate classification uncertainty for ensemble models.

Authors:  Robert D Clark; Wenkel Liang; Adam C Lee; Michael S Lawless; Robert Fraczkiewicz; Marvin Waldman
Journal:  J Cheminform       Date:  2014-06-22       Impact factor: 5.514

4.  Prediction of the potency of mammalian cyclooxygenase inhibitors with ensemble proteochemometric modeling.

Authors:  Isidro Cortes-Ciriano; Daniel S Murrell; Gerard Jp van Westen; Andreas Bender; Thérèse E Malliavin
Journal:  J Cheminform       Date:  2015-01-16       Impact factor: 5.514

5.  How accurately can we predict the melting points of drug-like compounds?

Authors:  Igor V Tetko; Yurii Sushko; Sergii Novotarskyi; Luc Patiny; Ivan Kondratov; Alexander E Petrenko; Larisa Charochkina; Abdullah M Asiri
Journal:  J Chem Inf Model       Date:  2014-12-09       Impact factor: 4.956

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.