Literature DB >> 15729854

Validation tools for variable subset regression.

Knut Baumann1, Nikolaus Stiefl.   

Abstract

Variable selection is applied frequently in QSAR research. Since the selection process influences the characteristics of the finally chosen model, thorough validation of the selection technique is very important. Here, a validation protocol is presented briefly and two of the tools which are part of this protocol are introduced in more detail. The first tool, which is based on permutation testing, allows to assess the inflation of internal figures of merit (such as the cross-validated prediction error). The other tool, based on noise addition, can be used to determine the complexity and with it the stability of models generated by variable selection. The obtained statistical information is important in deciding whether or not to trust the predictive abilities of a specific model. The graphical output of the validation tools is easily accessible and provides a reliable impression of model performance. Among others, the tools were employed to study the influence of leave-one-out and leave-multiple-out cross-validation on model characteristics. Here, it was confirmed that leave-multiple-out cross-validation yields more stable models. To study the performance of the entire validation protocol, it was applied to eight different QSAR data sets with default settings. In all cases internal and external model performance was good, indicating that the protocol serves its purpose quite well.

Mesh:

Year:  2004        PMID: 15729854     DOI: 10.1007/s10822-004-4071-5

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  19 in total

1.  Global 3D-QSAR methods: MS-WHIM and autocorrelation.

Authors:  E Gancia; G Bravi; P Mascagni; A Zaliani
Journal:  J Comput Aided Mol Des       Date:  2000-03       Impact factor: 3.686

2.  An Introduction to Model Selection.

Authors: 
Journal:  J Math Psychol       Date:  2000-03       Impact factor: 2.223

3.  Mapping property distributions of molecular surfaces: algorithm and evaluation of a novel 3D quantitative structure-activity relationship technique.

Authors:  Nikolaus Stiefl; Knut Baumann
Journal:  J Med Chem       Date:  2003-04-10       Impact factor: 7.446

4.  Evaluation of extended parameter sets for the 3D-QSAR technique MaP: implications for interpretability and model quality exemplified by antimalarially active naphthylisoquinoline alkaloids.

Authors:  Nikolaus Stiefl; Gerhard Bringmann; Christian Rummey; Knut Baumann
Journal:  J Comput Aided Mol Des       Date:  2003 May-Jun       Impact factor: 3.686

5.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins.

Authors:  R D Cramer; D E Patterson; J D Bunce
Journal:  J Am Chem Soc       Date:  1988-08-01       Impact factor: 15.419

6.  Self-organizing molecular field analysis: a tool for structure-activity studies.

Authors:  D D Robinson; P J Winn; P D Lyne; W G Richards
Journal:  J Med Chem       Date:  1999-02-25       Impact factor: 7.446

7.  Comparative molecular field analysis (CoMFA) and docking studies of non-nucleoside HIV-1 RT inhibitors (NNIs).

Authors:  M L Barreca; A Carotti; A Carrieri; A Chimirri; A M Monforte; M P Calace; A Rao
Journal:  Bioorg Med Chem       Date:  1999-11       Impact factor: 3.641

8.  Change correlations in structure-activity studies using multiple regression analysis.

Authors:  J G Topliss; R J Costello
Journal:  J Med Chem       Date:  1972-10       Impact factor: 7.446

9.  Modeling of poly(ADP-ribose)polymerase (PARP) inhibitors. Docking of ligands and quantitative structure-activity relationship analysis.

Authors:  G Costantino; A Macchiarulo; E Camaioni; R Pellicciari
Journal:  J Med Chem       Date:  2001-11-08       Impact factor: 7.446

10.  Measurement of soluble thrombomodulin in sera from various clinical stages of diabetic nephropathy.

Authors:  H Rinno; T Kuramoto; T Iijima; M Yagame; Y Tomino
Journal:  J Clin Lab Anal       Date:  1996       Impact factor: 2.352

View more
  2 in total

1.  Reliable estimation of prediction errors for QSAR models under model uncertainty using double cross-validation.

Authors:  Désirée Baumann; Knut Baumann
Journal:  J Cheminform       Date:  2014-11-26       Impact factor: 5.514

2.  Identification of electronic and structural descriptors of adenosine analogues related to inhibition of leishmanial glyceraldehyde-3-phosphate dehydrogenase.

Authors:  Norka B H Lozano; Rafael F Oliveira; Karen C Weber; Kathia M Honorio; Rafael V Guido; Adriano D Andricopulo; Albérico B F Da Silva
Journal:  Molecules       Date:  2013-04-29       Impact factor: 4.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.