Literature DB >> 15729853

QSAR modeling based on the bias/variance compromise: a harmonious and parsimonious approach.

John H Kalivas1, Joel B Forrester, Heather A Seipel.   

Abstract

Modeling quantitative structure-activity relationships (QSAR) is considered with an emphasis on prediction. An abundance of methods are available to develop such models. Using a harmonious approach that balances the bias and variance of predictions, the best calibration models are identified relative to the bias and variance criteria used. Criteria utilized to determine the adequacy of models are the root mean square error of calibration (RMSEC) and validation (RMSEV), respective R2 values, and the norm of the regression vector. QSAR data from the literature are used to demonstrate concepts. For these data sets and criteria used, it is suggested that models obtained by ridge regression (RR) are more harmonious and parsimonious than models obtained by partial least squares (PLS) and principal component regression (PCR) when the data is mean-centered. The most harmonious RR models have the best bias/variance tradeoff, reflected by the smallest RMSEC, RMSEV, and regression vector norms and the largest calibration and validation R2 values. The most parsimonious RR models have the smallest effective rank.

Entities:  

Mesh:

Year:  2004        PMID: 15729853     DOI: 10.1007/s10822-004-4063-5

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  4 in total

1.  Development of quantitative structure-activity relationship and classification models for a set of carbonic anhydrase inhibitors.

Authors:  Brian E Mattioni; Peter C Jurs
Journal:  J Chem Inf Comput Sci       Date:  2002 Jan-Feb

2.  Assessment of pareto calibration, stability, and wavelength selection.

Authors:  Kelly J Anderson; John H Kalivas
Journal:  Appl Spectrosc       Date:  2003-03       Impact factor: 2.388

3.  Prediction of protein retention times in anion-exchange chromatography systems using support vector regression.

Authors:  Minghu Song; Curt M Breneman; Jinbo Bi; N Sukumar; Kristin P Bennett; Steven Cramer; Nihal Tugcu
Journal:  J Chem Inf Comput Sci       Date:  2002 Nov-Dec

4.  Prediction of dihydrofolate reductase inhibition and selectivity using computational neural networks and linear discriminant analysis.

Authors:  Brian E Mattioni; Peter C Jurs
Journal:  J Mol Graph Model       Date:  2003-03       Impact factor: 2.518

  4 in total
  1 in total

1.  Statistical variation in progressive scrambling.

Authors:  Robert D Clark; Peter C Fox
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.