Literature DB >> 26099013

Beware of R(2): Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models.

D L J Alexander1, A Tropsha2, David A Winkler3,4,5,6.   

Abstract

The statistical metrics used to characterize the external predictivity of a model, i.e., how well it predicts the properties of an independent test set, have proliferated over the past decade. This paper clarifies some apparent confusion over the use of the coefficient of determination, R(2), as a measure of model fit and predictive power in QSAR and QSPR modeling. R(2) (or r(2)) has been used in various contexts in the literature in conjunction with training and test data for both ordinary linear regression and regression through the origin as well as with linear and nonlinear regression models. We analyze the widely adopted model fit criteria suggested by Golbraikh and Tropsha ( J. Mol. Graphics Modell. 2002 , 20 , 269 - 276 ) in a strict statistical manner. Shortcomings in these criteria are identified, and a clearer and simpler alternative method to characterize model predictivity is provided. The intent is not to repeat the well-documented arguments for model validation using test data but rather to guide the application of R(2) as a model fit statistic. Examples are used to illustrate both correct and incorrect uses of R(2). Reporting the root-mean-square error or equivalent measures of dispersion, which are typically of more practical importance than R(2), is also encouraged, and important challenges in addressing the needs of different categories of users such as computational chemists, experimental scientists, and regulatory decision support specialists are outlined.

Entities:  

Mesh:

Year:  2015        PMID: 26099013      PMCID: PMC4530125          DOI: 10.1021/acs.jcim.5b00206

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  11 in total

1.  Robust QSAR models using Bayesian regularized neural networks.

Authors:  F R Burden; D A Winkler
Journal:  J Med Chem       Date:  1999-08-12       Impact factor: 7.446

2.  Beware of q2!

Authors:  Alexander Golbraikh; Alexander Tropsha
Journal:  J Mol Graph Model       Date:  2002-01       Impact factor: 2.518

3.  Are free energy calculations useful in practice? A comparison with rapid scoring functions for the p38 MAP kinase protein system.

Authors:  D A Pearlman; P S Charifson
Journal:  J Med Chem       Date:  2001-10-11       Impact factor: 7.446

4.  Exhaustive QSPR studies of a large diverse set of ionic liquids: how accurately can we predict melting points?

Authors:  Alexandre Varnek; Natalia Kireeva; Igor V Tetko; Igor I Baskin; Vitaly P Solov'ev
Journal:  J Chem Inf Model       Date:  2007-03-24       Impact factor: 4.956

5.  Trends and plot methods in MLR studies.

Authors:  Emili Besalú; Jesus V de Julian-Ortiz; Lionello Pogliani
Journal:  J Chem Inf Model       Date:  2007-04-25       Impact factor: 4.956

Review 6.  Bayesian regularization of neural networks.

Authors:  Frank Burden; Dave Winkler
Journal:  Methods Mol Biol       Date:  2008

7.  Three-dimensional quantitative similarity-activity relationships (3D QSiAR) from SEAL similarity matrices.

Authors:  H Kubinyi; F A Hamprecht; T Mietzner
Journal:  J Med Chem       Date:  1998-07-02       Impact factor: 7.446

8.  Some case studies on application of "r(m)2" metrics for judging quality of quantitative structure-activity relationship predictions: emphasis on scaling of response data.

Authors:  Kunal Roy; Pratim Chakraborty; Indrani Mitra; Probir Kumar Ojha; Supratik Kar; Rudra Narayan Das
Journal:  J Comput Chem       Date:  2013-01-08       Impact factor: 3.376

9.  The rm2 metrics and regression through origin approach: reliable and useful validation tools for predictive QSAR models (Commentary on 'Is regression through origin useful in external validation of QSAR models?').

Authors:  Kunal Roy; Supratik Kar
Journal:  Eur J Pharm Sci       Date:  2014-05-29       Impact factor: 4.384

10.  Is regression through origin useful in external validation of QSAR models?

Authors:  Ali Shayanfar; Shadi Shayanfar
Journal:  Eur J Pharm Sci       Date:  2014-04-08       Impact factor: 4.384

View more
  80 in total

1.  Prediction of Oswestry Disability Index (ODI) using PROMIS-29 in a national sample of lumbar spine surgery patients.

Authors:  Jacquelyn S Pennings; Clinton J Devin; Inamullah Khan; Mohamad Bydon; Anthony L Asher; Kristin R Archer
Journal:  Qual Life Res       Date:  2019-06-06       Impact factor: 4.147

2.  Untargeted LC-MS metabolomic studies of Asteraceae species to discover inhibitors of Leishmania major dihydroorotate dehydrogenase.

Authors:  Lucas A Chibli; Annylory L Rosa; Maria Cristina Nonato; Fernando B Da Costa
Journal:  Metabolomics       Date:  2019-04-04       Impact factor: 4.290

3.  Shrimp count size: GC/MS-based metabolomics approach and quantitative descriptive analysis (QDA) reveal the importance of size in white leg shrimp (Litopenaeus vannamei).

Authors:  Safira Latifa Erlangga Putri; Gede Suantika; Magdalena Lenny Situmorang; Josephine Christina; Corazon Nikijuluw; Sastia Prama Putri; Eiichiro Fukusaki
Journal:  Metabolomics       Date:  2021-01-29       Impact factor: 4.290

4.  QSAR modeling for predicting mutagenic toxicity of diverse chemicals for regulatory purposes.

Authors:  Nikita Basant; Shikha Gupta
Journal:  Environ Sci Pollut Res Int       Date:  2017-04-24       Impact factor: 4.223

5.  Development of predictive QSAR models for Vibrio fischeri toxicity of ionic liquids and their true external and experimental validation tests.

Authors:  Rudra Narayan Das; Tânia E Sintra; João A P Coutinho; Sónia P M Ventura; Kunal Roy; Paul L A Popelier
Journal:  Toxicol Res (Camb)       Date:  2016-07-07       Impact factor: 3.524

Review 6.  Toward a systematic exploration of nano-bio interactions.

Authors:  Xue Bai; Fang Liu; Yin Liu; Cong Li; Shenqing Wang; Hongyu Zhou; Wenyi Wang; Hao Zhu; David A Winkler; Bing Yan
Journal:  Toxicol Appl Pharmacol       Date:  2017-03-24       Impact factor: 4.219

7.  Modeling the pH and temperature dependence of aqueousphase hydroxyl radical reaction rate constants of organic micropollutants using QSPR approach.

Authors:  Shikha Gupta; Nikita Basant
Journal:  Environ Sci Pollut Res Int       Date:  2017-09-16       Impact factor: 4.223

Review 8.  Sparse QSAR modelling methods for therapeutic and regenerative medicine.

Authors:  David A Winkler
Journal:  J Comput Aided Mol Des       Date:  2018-02-14       Impact factor: 3.686

9.  Extra precision glide docking, free energy calculation and molecular dynamics studies of 1,2-diarylethane derivatives as potent urease inhibitors.

Authors:  Sheetal Gupta; A V Bajaj
Journal:  J Mol Model       Date:  2018-08-29       Impact factor: 1.810

10.  Cross-domain correlates of cannabis use disorder severity among young adults.

Authors:  Randi Melissa Schuster; Maya Hareli; Amelia D Moser; Kelsey Lowman; Jodi Gilman; Christine Ulysse; David Schoenfeld; A Eden Evins
Journal:  Addict Behav       Date:  2019-01-23       Impact factor: 3.913

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.