Literature DB >> 33749419

Sample-size dependence of validation parameters in linear regression models and in QSAR.

D Kovács1, P Király1, G Tóth1.   

Abstract

The dependence of statistical validation parameters was investigated on the size of the sample taken in fit of multivariate linear curves. We observed that R2 and related internal parameters were misleading as they overestimated the goodness-of-fit of models at small sample size. Cross-validation metrics showed correct trends. It was possible to scale the leave-one-out and the leave-many-out results close to identical by correcting the degrees of freedom of the models. y and x-randomized validation parameters were calculated and the methods provided close to identical results. We suggest to use the simplest methods in both cases. The external parameters followed correct trends with respect to the sample size, but their sensitivity differed. We plotted the Roy-Ojha metrics in 2D and we coloured them with respect to other external parameters to provide an easy classification of models. The rank correlations were calculated between the performance parameters. Up to a sample size, goodness-of-fit and robustness were distinguishable, but above a certain sample size, the parameters were redundant. The external-internal pairs were weakly correlated. Our data show that all the three aspects of validation are necessary at small sample sizes, but the internal check of robustness is not informative above a given sample size.

Keywords:  Coefficient of determination; cross-validation; goodness-of-fit; predictivity; robustness

Mesh:

Year:  2021        PMID: 33749419     DOI: 10.1080/1062936X.2021.1890208

Source DB:  PubMed          Journal:  SAR QSAR Environ Res        ISSN: 1026-776X            Impact factor:   3.000


  1 in total

1.  Response surface approach to optimize temperature, pH and time on antioxidant properties of wild bush (Plectranthus rugosus) honey from high altitude region (Kashmir Valley) of India.

Authors:  Gulzar Ahmad Nayik; Vikas Nanda; Beenish Zohra; B N Dar; Mohammad Javed Ansari; Sami Al Obaid; Otilia Bobis
Journal:  Saudi J Biol Sci       Date:  2021-10-25       Impact factor: 4.219

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.