Literature DB >> 9744903

Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

.   

Abstract

This article reviews five approximate statistical tests for determining whether one learning algorithm outperforms another on a particular learning task. These tests are compared experimentally to determine their probability of incorrectly detecting a difference when no difference exists (type I error). Two widely used statistical tests are shown to have high probability of type I error in certain situations and should never be used: a test for difference of two proportions and a paired-differences t test based on taking several random train-test splits. A third test, a paired-differences t test based on 10-fold cross-validation, exhibits somewhat elevated probability of type I error. A fourth test, McNemar's test, is shown to have low type I error. The fifth test is a new test, 5 x 2 cv, based on five iterations of twofold cross-validation. Experiments show that this test also has acceptable type I error. The article also measures the power (ability to detect algorithm differences when they do exist) of these tests. The cross-validated t test is the most powerful. The 5 x 2 cv test is shown to be slightly more powerful than McNemar's test. The choice of the best test is determined by the computational cost of running the learning algorithm. For algorithms that can be executed only once, McNemar's test is the only test with acceptable type I error. For algorithms that can be executed 10 times, the 5 x 2 cv test is recommended, because it is slightly more powerful and because it directly measures variation due to the choice of training set.

Entities:  

Year:  1998        PMID: 9744903     DOI: 10.1162/089976698300017197

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  237 in total

1.  System identification applied to a visuomotor task: near-optimal human performance in a noisy changing task.

Authors:  R J Baddeley; H A Ingram; R C Miall
Journal:  J Neurosci       Date:  2003-04-01       Impact factor: 6.167

2.  Optimizing area under the ROC curve using semi-supervised learning.

Authors:  Shijun Wang; Diana Li; Nicholas Petrick; Berkman Sahiner; Marius George Linguraru; Ronald M Summers
Journal:  Pattern Recognit       Date:  2015-01-01       Impact factor: 7.740

3.  A fully-automatic caudate nucleus segmentation of brain MRI: application in volumetric analysis of pediatric attention-deficit/hyperactivity disorder.

Authors:  Laura Igual; Joan Carles Soliva; Antonio Hernández-Vela; Sergio Escalera; Xavier Jiménez; Oscar Vilarroya; Petia Radeva
Journal:  Biomed Eng Online       Date:  2011-12-05       Impact factor: 2.819

4.  Finding related sentence pairs in MEDLINE.

Authors:  Larry H Smith; W John Wilbur
Journal:  Inf Retr Boston       Date:  2010-01-23       Impact factor: 2.293

5.  High-throughput prediction of protein antigenicity using protein microarray data.

Authors:  Christophe N Magnan; Michael Zeller; Matthew A Kayala; Adam Vigil; Arlo Randall; Philip L Felgner; Pierre Baldi
Journal:  Bioinformatics       Date:  2010-10-07       Impact factor: 6.937

Review 6.  Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors:  Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal:  Chem Rev       Date:  2020-06-10       Impact factor: 60.622

7.  Classification of pallidal oscillations with increasing parkinsonian severity.

Authors:  Allison T Connolly; Alicia L Jensen; Kenneth B Baker; Jerrold L Vitek; Matthew D Johnson
Journal:  J Neurophysiol       Date:  2015-04-15       Impact factor: 2.714

8.  Rough set rule induction for suitability assessment.

Authors:  Patricia A Berger
Journal:  Environ Manage       Date:  2004-10       Impact factor: 3.266

9.  Using classification models for the generation of disease-specific medications from biomedical literature and clinical data repository.

Authors:  Liqin Wang; Peter J Haug; Guilherme Del Fiol
Journal:  J Biomed Inform       Date:  2017-04-20       Impact factor: 6.317

10.  Utilizing ECG-Based Heartbeat Classification for Hypertrophic Cardiomyopathy Identification.

Authors:  Quazi Abidur Rahman; Larisa G Tereshchenko; Matthew Kongkatong; Theodore Abraham; M Roselle Abraham; Hagit Shatkay
Journal:  IEEE Trans Nanobioscience       Date:  2015-04-24       Impact factor: 2.935

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.