Literature DB >> 22721530

Real external predictivity of QSAR models. Part 2. New intercomparable thresholds for different validation criteria and the need for scatter plot inspection.

Nicola Chirico1, Paola Gramatica.   

Abstract

The evaluation of regression QSAR model performance, in fitting, robustness, and external prediction, is of pivotal importance. Over the past decade, different external validation parameters have been proposed: Q(F1)(2), Q(F2)(2), Q(F3)(2), r(m)(2), and the Golbraikh-Tropsha method. Recently, the concordance correlation coefficient (CCC, Lin), which simply verifies how small the differences are between experimental data and external data set predictions, independently of their range, was proposed by our group as an external validation parameter for use in QSAR studies. In our preliminary work, we demonstrated with thousands of simulated models that CCC is in good agreement with the compared validation criteria (except r(m)(2)) using the cutoff values normally applied for the acceptance of QSAR models as externally predictive. In this new work, we have studied and compared the general trends of the various criteria relative to different possible biases (scale and location shifts) in external data distributions, using a wide range of different simulated scenarios. This study, further supported by visual inspection of experimental vs predicted data scatter plots, has highlighted problems related to some criteria. Indeed, if based on the cutoff suggested by the proponent, r(m)(2) could also accept not predictive models in two of the possible biases (location, location plus scale), while in the case of scale shift bias, it appears to be the most restrictive. Moreover, Q(F1)(2) and Q(F2)(2) showed some problems in one of the possible biases (scale shift). This analysis allowed us to also propose recalibrated, and intercomparable for the same data scatter, new thresholds for each criterion in defining a QSAR model as really externally predictive in a more precautionary approach. An analysis of the results revealed that the scatter plot of experimental vs predicted external data must always be evaluated to support the statistical criteria values: in some cases high statistical parameter values could hide models with unacceptable predictions.

Year:  2012        PMID: 22721530     DOI: 10.1021/ci300084j

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  56 in total

1.  Computational predictive models for P-glycoprotein inhibition of in-house chalcone derivatives and drug-bank compounds.

Authors:  Trieu-Du Ngo; Thanh-Dao Tran; Minh-Tri Le; Khac-Minh Thai
Journal:  Mol Divers       Date:  2016-07-18       Impact factor: 2.943

2.  Estimation of influential points in any data set from coefficient of determination and its leave-one-out cross-validated counterpart.

Authors:  Gergely Tóth; Zsolt Bodai; Károly Héberger
Journal:  J Comput Aided Mol Des       Date:  2013-10-20       Impact factor: 3.686

3.  Structure-activity analysis of harmful algae inhibition by congeneric compounds: case studies of fatty acids and thiazolidinediones.

Authors:  Haomin Huang; Xi Xiao; Jiyan Shi; Yingxu Chen
Journal:  Environ Sci Pollut Res Int       Date:  2014-02-25       Impact factor: 4.223

4.  Modeling and Prediction of Solvent Effect on Human Skin Permeability using Support Vector Regression and Random Forest.

Authors:  Hiromi Baba; Jun-ichi Takahara; Fumiyoshi Yamashita; Mitsuru Hashida
Journal:  Pharm Res       Date:  2015-06-02       Impact factor: 4.200

5.  Modeling the binding affinity of structurally diverse industrial chemicals to carbon using the artificial intelligence approaches.

Authors:  Shikha Gupta; Nikita Basant; Premanjali Rai; Kunwar P Singh
Journal:  Environ Sci Pollut Res Int       Date:  2015-07-11       Impact factor: 4.223

6.  Concentration-dependent adsorption of organic contaminants by graphene nanosheets: quantum-mechanical models.

Authors:  Suman Lata
Journal:  J Mol Model       Date:  2021-01-25       Impact factor: 1.810

7.  QSAR modeling for predicting mutagenic toxicity of diverse chemicals for regulatory purposes.

Authors:  Nikita Basant; Shikha Gupta
Journal:  Environ Sci Pollut Res Int       Date:  2017-04-24       Impact factor: 4.223

8.  Acute aquatic toxicity of organic solvents modeled by QSARs.

Authors:  A Levet; C Bordes; Y Clément; P Mignon; C Morell; H Chermette; P Marote; P Lantéri
Journal:  J Mol Model       Date:  2016-11-09       Impact factor: 1.810

9.  In silico prediction of the developmental toxicity of diverse organic chemicals in rodents for regulatory purposes.

Authors:  Nikita Basant; Shikha Gupta; Kunwar P Singh
Journal:  Toxicol Res (Camb)       Date:  2016-02-29       Impact factor: 3.524

10.  Modeling the toxicity of chemical pesticides in multiple test species using local and global QSTR approaches.

Authors:  Nikita Basant; Shikha Gupta; Kunwar P Singh
Journal:  Toxicol Res (Camb)       Date:  2015-12-10       Impact factor: 3.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.