Literature DB >> 15554660

Similarity to molecules in the training set is a good discriminator for prediction accuracy in QSAR.

Robert P Sheridan1, Bradley P Feuston, Vladimir N Maiorov, Simon K Kearsley.   

Abstract

How well can a QSAR model predict the activity of a molecule not in the training set used to create the model? A set of retrospective cross-validation experiments using 20 diverse in-house activity sets were done to find a good discriminator of prediction accuracy as measured by root-mean-square difference between observed and predicted activity. Among the measures we tested, two seem useful: the similarity of the molecule to be predicted to the nearest molecule in the training set and/or the number of neighbors in the training set, where neighbors are those more similar than a user-chosen cutoff. The molecules with the highest similarity and/or the most neighbors are the best-predicted. This trend holds true for narrow training sets and, to a lesser degree, for many diverse training sets and does not depend on which QSAR method or descriptor is used. One may define the similarity using a different descriptor than that used for the QSAR model. The similarity dependence for diverse training sets is somewhat unexpected. It appears to be greater for those data sets where the association of similar activities vs similar structures (as encoded in the Patterson plot) is stronger. We propose a way to estimate the reliability of the prediction of an arbitrary chemical structure on a given QSAR model, given the training set from which the model was derived.

Year:  2004        PMID: 15554660     DOI: 10.1021/ci049782w

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  35 in total

1.  Prediction of beta-strand packing interactions using the signature product.

Authors:  W Michael Brown; Shawn Martin; Joseph P Chabarek; Charlie Strauss; Jean-Loup Faulon
Journal:  J Mol Model       Date:  2005-12-07       Impact factor: 1.810

2.  Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays.

Authors:  Rajarshi Guha; Stephan C Schürer
Journal:  J Comput Aided Mol Des       Date:  2008-02-19       Impact factor: 3.686

3.  Analysis and use of fragment-occurrence data in similarity-based virtual screening.

Authors:  Shereena M Arif; John D Holliday; Peter Willett
Journal:  J Comput Aided Mol Des       Date:  2009-06-18       Impact factor: 3.686

4.  A probabilistic method to report predictions from a human liver microsomes stability QSAR model: a practical tool for drug discovery.

Authors:  Ignacio Aliagas; Alberto Gobbi; Timothy Heffron; Man-Ling Lee; Daniel F Ortwine; Mark Zak; S Cyrus Khojasteh
Journal:  J Comput Aided Mol Des       Date:  2015-02-24       Impact factor: 3.686

Review 5.  Effective absorption modeling in relative bioavailability study risk assessment.

Authors:  John P Rose
Journal:  AAPS J       Date:  2012-09-11       Impact factor: 4.009

6.  Industrial applications of in silico ADMET.

Authors:  Bernd Beck; Tim Geppert
Journal:  J Mol Model       Date:  2014-06-28       Impact factor: 1.810

7.  Evaluation and calibration of high-throughput predictions of chemical distribution to tissues.

Authors:  Robert G Pearce; R Woodrow Setzer; Jimena L Davis; John F Wambaugh
Journal:  J Pharmacokinet Pharmacodyn       Date:  2017-10-14       Impact factor: 2.745

8.  Prediction of Cytochrome P450 Profiles of Environmental Chemicals with QSAR Models Built from Drug-like Molecules.

Authors:  Hongmao Sun; Henrike Veith; Menghang Xia; Christopher P Austin; Raymond R Tice; Ruili Huang
Journal:  Mol Inform       Date:  2012-10-11       Impact factor: 3.353

9.  Estimation of the applicability domain of kernel-based machine learning models for virtual screening.

Authors:  Nikolas Fechner; Andreas Jahn; Georg Hinselmann; Andreas Zell
Journal:  J Cheminform       Date:  2010-03-11       Impact factor: 5.514

10.  Evaluation of computational docking to identify pregnane X receptor agonists in the ToxCast database.

Authors:  Sandhya Kortagere; Matthew D Krasowski; Erica J Reschly; Madhukumar Venkatesh; Sridhar Mani; Sean Ekins
Journal:  Environ Health Perspect       Date:  2010-06-17       Impact factor: 9.031

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.