Literature DB >> 20075479

Sensitivity analysis of kappa-fold cross validation in prediction error estimation.

Juan Diego Rodríguez1, Aritz Pérez, Jose Antonio Lozano.   

Abstract

In the machine learning field, the performance of a classifier is usually measured in terms of prediction error. In most real-world problems, the error cannot be exactly calculated and it must be estimated. Therefore, it is important to choose an appropriate estimator of the error. This paper analyzes the statistical properties, bias and variance, of the kappa-fold cross-validation classification error estimator (kappa-cv). Our main contribution is a novel theoretical decomposition of the variance of the kappa-cv considering its sources of variance: sensitivity to changes in the training set and sensitivity to changes in the folds. The paper also compares the bias and variance of the estimator for different values of kappa. The experimental study has been performed in artificial domains because they allow the exact computation of the implied quantities and we can rigorously specify the conditions of experimentation. The experimentation has been performed for two classifiers (naive Bayes and nearest neighbor), different numbers of folds, sample sizes, and training sets coming from assorted probability distributions. We conclude by including some practical recommendation on the use of kappa-fold cross validation.

Year:  2010        PMID: 20075479     DOI: 10.1109/TPAMI.2009.187

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  106 in total

1.  Estimating national-scale ground-level PM25 concentration in China using geographically weighted regression based on MODIS and MISR AOD.

Authors:  Wei You; Zengliang Zang; Lifeng Zhang; Yi Li; Weiqi Wang
Journal:  Environ Sci Pollut Res Int       Date:  2016-01-16       Impact factor: 4.223

2.  CT imaging markers to improve radiation toxicity prediction in prostate cancer radiotherapy by stacking regression algorithm.

Authors:  Shayan Mostafaei; Hamid Abdollahi; Shiva Kazempour Dehkordi; Isaac Shiri; Abolfazl Razzaghdoust; Seyed Hamid Zoljalali Moghaddam; Afshin Saadipoor; Fereshteh Koosha; Susan Cheraghi; Seied Rabi Mahdavi
Journal:  Radiol Med       Date:  2019-09-24       Impact factor: 3.469

3.  Incorporating neurophysiological concepts in mathematical thermoregulation models.

Authors:  Boris R M Kingma; M J Vosselman; A J H Frijns; A A van Steenhoven; W D van Marken Lichtenbelt
Journal:  Int J Biometeorol       Date:  2013-01-27       Impact factor: 3.787

4.  Classification based hypothesis testing in neuroscience: Below-chance level classification rates and overlooked statistical properties of linear parametric classifiers.

Authors:  Hamidreza Jamalabadi; Sarah Alizadeh; Monika Schönauer; Christian Leibold; Steffen Gais
Journal:  Hum Brain Mapp       Date:  2016-03-26       Impact factor: 5.038

5.  Drug design by machine-trained elastic networks: predicting Ser/Thr-protein kinase inhibitors' activities.

Authors:  Cyrus Ahmadi Toussi; Javad Haddadnia; Chérif F Matta
Journal:  Mol Divers       Date:  2020-03-28       Impact factor: 2.943

6.  Deriving alternative criteria sets for alcohol use disorders using statistical optimization: Results from the National Survey on Drug Use and Health.

Authors:  Cassandra L Boness; Jordan E Stevens; Douglas Steinley; Timothy Trull; Kenneth J Sher
Journal:  Exp Clin Psychopharmacol       Date:  2018-12-17       Impact factor: 3.157

7.  Toward more efficient diagnostic criteria sets and rules: The use of optimization approaches in addiction science.

Authors:  Jordan E Stevens; Douglas Steinley; Yoanna E McDowell; Cassandra L Boness; Timothy J Trull; Christopher S Martin; Kenneth J Sher
Journal:  Addict Behav       Date:  2019-02-05       Impact factor: 3.913

8.  Using Complete Enumeration to Derive "One-Size-Fits-All" Versus "Subgroup-Specific" Diagnostic Rules for Substance Use Disorder.

Authors:  Cassandra L Boness; Jordan E Loeffelman; Douglas Steinley; Timothy Trull; Kenneth J Sher
Journal:  Assessment       Date:  2020-02-10

9.  Development and cross-validation of prognostic models to assess the treatment effect of cisplatin/pemetrexed chemotherapy in lung adenocarcinoma patients.

Authors:  Wenjun Mou; Zhaoqi Liu; Yuan Luo; Meng Zou; Chao Ren; Chunyan Zhang; Xinyu Wen; Yong Wang; Yaping Tian
Journal:  Med Oncol       Date:  2014-08-14       Impact factor: 3.064

10.  EEG markers predictive of epilepsy risk in pediatric cerebral malaria - A feasibility study.

Authors:  Archana A Patel; Ali Jannati; Sameer C Dhamne; Monica Sapuwa; Elizabeth Kalanga; Maitreyi Mazumdar; Gretchen L Birbeck; Alexander Rotenberg
Journal:  Epilepsy Behav       Date:  2020-11-21       Impact factor: 2.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.