Literature DB >> 18556228

Fingerprint-based clustering applied to define a QSAR model use radius.

D G Sprous1.   

Abstract

In ongoing research, QSAR has been a tool applied to evaluate compound qualities associated with skin permeability and membership in either a druglike class or specific nondruglike type classes. A need that arose from this pursuit was to know the boundaries of the QSAR models within which molecules could be analyzed. To satisfy this need, a method of QSAR model validation was developed which moves away from the simple declaration of correlation to a description of expected correlation as a function of similarity to the training set. This extension of the "validation" and "predictive" concepts to include a border is referred to henceforth as the QSAR model use radius. By defining this metric, it is possible to select for models which have predictivity exterior to their training sets. The heart of this approach is the common use of division into training sets and test sets to demonstrate an ability to successfully predict outside of the training set. The new rigor introduced is to repetitively cluster and systematically increase the permitted dissimilarity within those clusters. The training sets are assembled by taking one and only one compound from each cluster at a specific level of permitted dissimilarity. The QSAR model is developed over these training sets and applied to predict the remaining compounds. In this manner, it is possible to point where there is adequate similarity to predict a compound and where there is not. This method is especially useful for large, chemically redundant systems of greater than 250 compounds where leave-one-out crossvalidation is of limited use. To illustrate this technique, the results of defining the use radius for (a) a skin permeability model (based on 276 compounds), (b) a drug compound and "safe" compound partition (3000 compounds) and (c) a kinase inhibitor and drug compound partition ( approximately 1300 compounds) are discussed.

Mesh:

Substances:

Year:  2008        PMID: 18556228     DOI: 10.1016/j.jmgm.2008.04.009

Source DB:  PubMed          Journal:  J Mol Graph Model        ISSN: 1093-3263            Impact factor:   2.518


  3 in total

1.  2D binary QSAR modeling of LPA3 receptor antagonism.

Authors:  James I Fells; Ryoko Tsukahara; Jianxiong Liu; Gabor Tigyi; Abby L Parrill
Journal:  J Mol Graph Model       Date:  2010-03-07       Impact factor: 2.518

2.  Pesticides, cosmetics, drugs: identical and opposite influences of various molecular features as measures of endpoints similarity and dissimilarity.

Authors:  Andrey A Toropov; Alla P Toropova; Marco Marzo; Edoardo Carnesecchi; Gianluca Selvestrel; Emilio Benfenati
Journal:  Mol Divers       Date:  2020-04-23       Impact factor: 2.943

3.  An efficient piecewise linear model for predicting activity of caspase-3 inhibitors.

Authors:  Loghman Firoozpour; Khadijeh Sadatnezhad; Sholeh Dehghani; Eslam Pourbasheer; Alireza Foroumadi; Abbas Shafiee; Massoud Amanlou
Journal:  Daru       Date:  2012-09-10       Impact factor: 3.117

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.