Literature DB >> 14768902

Validation subset selections for extrapolation oriented QSPAR models.

Csaba Szántai-Kis1, István Kövesdi, György Kéri, László Orfi.   

Abstract

One of the most important features of QSPAR models is their predictive ability. The predictive ability of QSPAR models should be checked by external validation. In this work we examined three different types of external validation set selection methods for their usefulness in in-silico screening. The usefulness of the selection methods was studied in such a way that: 1) We generated thousands of QSPR models and stored them in 'model banks'. 2) We selected a final top model from the model banks based on three different validation set selection methods. 3) We predicted large data sets, which we called 'chemical universe sets', and calculated the corresponding SEPs. The models were generated from small fractions of the available water solubility data during a GA Variable Subset Selection procedure. The external validation sets were constructed by random selections, uniformly distributed selections or by perimeter-oriented selections. We found that the best performing models on the perimeter-oriented external validation sets usually gave the best validation results when the remaining part of the available data was overwhelmingly large, i.e., when the model had to make a lot of extrapolations. We also compared the top final models obtained from external validation set selection methods in three independent and different sizes of 'chemical universe sets'.

Entities:  

Mesh:

Year:  2003        PMID: 14768902     DOI: 10.1023/b:modi.0000006538.99122.00

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  9 in total

1.  Estimation of aqueous solubility for a diverse set of organic compounds based on molecular topology

Authors: 
Journal:  J Chem Inf Comput Sci       Date:  2000-05

2.  A fuzzy ARTMAP based on quantitative structure-property relationships (QSPRs) for predicting aqueous solubility of organic compounds.

Authors:  D Yaffe; Y Cohen; G Espinosa; A Arenas; F Giralt
Journal:  J Chem Inf Comput Sci       Date:  2001 Sep-Oct

3.  Prediction of aqueous solubility of organic compounds by the general solubility equation (GSE).

Authors:  Y Ran; N Jain; S H Yalkowsky
Journal:  J Chem Inf Comput Sci       Date:  2001 Sep-Oct

4.  Estimating the water solubilities of crystalline compounds from their chemical structures alone.

Authors:  J W McFarland; A Avdeef; C M Berger; O A Raevsky
Journal:  J Chem Inf Comput Sci       Date:  2001 Sep-Oct

5.  Prediction of aqueous solubility of heteroatom-containing organic compounds from molecular structure.

Authors:  N R McElroy; P C Jurs
Journal:  J Chem Inf Comput Sci       Date:  2001 Sep-Oct

6.  Predictive QSAR modeling based on diversity sampling of experimental datasets for the training and test set selection.

Authors:  Alexander Golbraikh; Alexander Tropsha
Journal:  Mol Divers       Date:  2002       Impact factor: 2.943

Review 7.  Reliability of logP predictions based on calculated molecular descriptors: a critical review.

Authors:  D Eros; I Kövesdi; L Orfi; K Takács-Novák; Gy Acsády; Gy Kéri
Journal:  Curr Med Chem       Date:  2002-10       Impact factor: 4.530

8.  Comparison of predictive ability of water solubility QSPR models generated by MLR, PLS and ANN methods.

Authors:  Dániel Erös; György Kéri; István Kövesdi; Csaba Szántai-Kis; György Mészáros; László Orfi
Journal:  Mini Rev Med Chem       Date:  2004-02       Impact factor: 3.862

9.  Aqueous solubility prediction of drugs based on molecular topology and neural network modeling.

Authors:  J Huuskonen; M Salo; J Taskinen
Journal:  J Chem Inf Comput Sci       Date:  1998 May-Jun
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.