Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 External validation and prediction employing the predictive squared correlation coefficient test set activity mean vs training set activity mean.

Literature DB >> 18954136

External validation and prediction employing the predictive squared correlation coefficient test set activity mean vs training set activity mean.

Gerrit Schüürmann¹, Ralf-Uwe Ebert, Jingwen Chen, Bin Wang, Ralph Kühne.

Abstract

The external prediction capability of quantitative structure-activity relationship (QSAR) models is often quantified using the predictive squared correlation coefficient, q (2). This index relates the predictive residual sum of squares, PRESS, to the activity sum of squares, SS, without postprocessing of the model output, the latter of which is automatically done when calculating the conventional squared correlation coefficient, r (2). According to the current OECD guidelines, q (2) for external validation should be calculated with SS referring to the training set activity mean. Our present findings including a mathematical proof demonstrate that this approach yields a systematic overestimation of the prediction capability that is triggered by the difference between the training and test set activity means. Example calculations with three regression models and data sets taken from literature show further that for external test sets, q (2) based on the training set activity mean may become even larger than r (2). As a consequence, we suggest to always use the test set activity mean when quantifying the external prediction capability through q (2) and to revise the respective OECD guidance document accordingly. The discussion includes a comparison between r (2) and q (2) value ranges and the q (2) statistics for cross-validation.

Mesh：

Year: 2008 PMID： 18954136 DOI： 10.1021/ci800253u

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Keyword Cloud
Cited

48 in total

1. Fragment-guided approach to incorporating structural information into a CoMFA study: BACE-1 as an example.

Authors: Lívia Barros Salum; Napoleão Fonseca Valadares
Journal: J Comput Aided Mol Des Date: 2010-07-27 Impact factor: 3.686

2. Computational predictive models for P-glycoprotein inhibition of in-house chalcone derivatives and drug-bank compounds.

Authors: Trieu-Du Ngo; Thanh-Dao Tran; Minh-Tri Le; Khac-Minh Thai
Journal: Mol Divers Date: 2016-07-18 Impact factor: 2.943

3. Modeling bioconcentration factor (BCF) using mechanistically interpretable descriptors computed from open source tool "PaDEL-Descriptor".

Authors: Subrata Pramanik; Kunal Roy
Journal: Environ Sci Pollut Res Int Date: 2013-10-30 Impact factor: 4.223

4. The importance of molecular structures, endpoints' values, and predictivity parameters in QSAR research: QSAR analysis of a series of estrogen receptor binders.

Authors: Jiazhong Li; Paola Gramatica
Journal: Mol Divers Date: 2009-11-17 Impact factor: 2.943

5. Application of electron conformational-genetic algorithm approach to 1,4-dihydropyridines as calcium channel antagonists: pharmacophore identification and bioactivity prediction.

Authors: Nazmiye Geçen; Emin Sarıpınar; Ersin Yanmaz; Kader Sahin
Journal: J Mol Model Date: 2011-03-31 Impact factor: 1.810

6. Nonlinear QSAR modeling for predicting cytotoxicity of ionic liquids in leukemia rat cell line: an aid to green chemicals designing.

Authors: Shikha Gupta; Nikita Basant; Kunwar P Singh
Journal: Environ Sci Pollut Res Int Date: 2015-04-28 Impact factor: 4.223

7. Modeling the binding affinity of structurally diverse industrial chemicals to carbon using the artificial intelligence approaches.

Authors: Shikha Gupta; Nikita Basant; Premanjali Rai; Kunwar P Singh
Journal: Environ Sci Pollut Res Int Date: 2015-07-11 Impact factor: 4.223

8. Hormone activity of hydroxylated polybrominated diphenyl ethers on human thyroid receptor-beta: in vitro and in silico investigations.

Authors: Fei Li; Qing Xie; Xuehua Li; Na Li; Ping Chi; Jingwen Chen; Zijian Wang; Ce Hao
Journal: Environ Health Perspect Date: 2010-05 Impact factor: 9.031

9. QSPR Modeling of Bioconcentration Factors of Nonionic Organic Compounds.

Authors: Omar Deeb; Padmakar V Khadikar; Mohammad Goodarzi
Journal: Environ Health Insights Date: 2010-07-06

10. Assessment and validation of the CAESAR predictive model for bioconcentration factor (BCF) in fish.

Authors: Anna Lombardo; Alessandra Roncaglioni; Elena Boriani; Chiara Milan; Emilio Benfenati
Journal: Chem Cent J Date: 2010-07-29 Impact factor: 4.215