Literature DB >> 12653524

Assessing model fit by cross-validation.

Douglas M Hawkins1, Subhash C Basak, Denise Mills.   

Abstract

When QSAR models are fitted, it is important to validate any fitted model-to check that it is plausible that its predictions will carry over to fresh data not used in the model fitting exercise. There are two standard ways of doing this-using a separate hold-out test sample and the computationally much more burdensome leave-one-out cross-validation in which the entire pool of available compounds is used both to fit the model and to assess its validity. We show by theoretical argument and empiric study of a large QSAR data set that when the available sample size is small-in the dozens or scores rather than the hundreds, holding a portion of it back for testing is wasteful, and that it is much better to use cross-validation, but ensure that this is done properly.

Year:  2003        PMID: 12653524     DOI: 10.1021/ci025626i

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  93 in total

1.  Computational analysis of HIV-1 protease protein binding pockets.

Authors:  Gene M Ko; A Srinivas Reddy; Sunil Kumar; Barbara A Bailey; Rajni Garg
Journal:  J Chem Inf Model       Date:  2010-10-25       Impact factor: 4.956

2.  Usefulness of graphical invariants in quantitative structure-activity correlations of tuberculostatic drugs of the isonicotinic acid hydrazide type.

Authors:  Manish C Bagchi; Bhim C Maiti; Denise Mills; Subhash C Basak
Journal:  J Mol Model       Date:  2003-12-23       Impact factor: 1.810

3.  Virtual generation of agents against Mycobacterium tuberculosis. A QSAR study.

Authors:  Emili Besalú; Robert Ponec; Jesus Vicente de Julián-Ortiz
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

4.  A QSAR study of radical scavenging antioxidant activity of a series of flavonoids using DFT based quantum chemical descriptors--the importance of group frontier electron density.

Authors:  Ananda Sarkar; Tapas Ranjan Middya; Atish Dipnakar Jana
Journal:  J Mol Model       Date:  2011-11-13       Impact factor: 1.810

5.  QSAR classification of metabolic activation of chemicals into covalently reactive species.

Authors:  Chin Yee Liew; Chuen Pan; Andre Tan; Ke Xin Magneline Ang; Chun Wei Yap
Journal:  Mol Divers       Date:  2012-02-28       Impact factor: 2.943

6.  Toward better QSAR/QSPR modeling: simultaneous outlier detection and variable selection using distribution of model features.

Authors:  Dongsheng Cao; Yizeng Liang; Qingsong Xu; Yifeng Yun; Hongdong Li
Journal:  J Comput Aided Mol Des       Date:  2010-11-13       Impact factor: 3.686

7.  A DXA-based mathematical model predicts midthigh muscle mass from magnetic resonance imaging in typically developing children but not in those with quadriplegic cerebral palsy.

Authors:  Christopher M Modlesky; Matthew L Cavaiola; Jarvis J Smith; David A Rowe; David L Johnson; Freeman Miller
Journal:  J Nutr       Date:  2010-10-27       Impact factor: 4.798

8.  QSAR studies on a number of pyrrolidin-2-one antiarrhythmic arylpiperazinyls.

Authors:  Alicja Nowaczyk; Katarzyna Kulig
Journal:  Med Chem Res       Date:  2011-01-07       Impact factor: 1.965

Review 9.  Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors:  Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal:  Chem Rev       Date:  2020-06-10       Impact factor: 60.622

10.  Statistical variation in progressive scrambling.

Authors:  Robert D Clark; Peter C Fox
Journal:  J Comput Aided Mol Des       Date:  2004 Jul-Sep       Impact factor: 3.686

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.