Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Literature DB >> 9744903

Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms.

Abstract

This article reviews five approximate statistical tests for determining whether one learning algorithm outperforms another on a particular learning task. These tests are compared experimentally to determine their probability of incorrectly detecting a difference when no difference exists (type I error). Two widely used statistical tests are shown to have high probability of type I error in certain situations and should never be used: a test for difference of two proportions and a paired-differences t test based on taking several random train-test splits. A third test, a paired-differences t test based on 10-fold cross-validation, exhibits somewhat elevated probability of type I error. A fourth test, McNemar's test, is shown to have low type I error. The fifth test is a new test, 5 x 2 cv, based on five iterations of twofold cross-validation. Experiments show that this test also has acceptable type I error. The article also measures the power (ability to detect algorithm differences when they do exist) of these tests. The cross-validated t test is the most powerful. The 5 x 2 cv test is shown to be slightly more powerful than McNemar's test. The choice of the best test is determined by the computational cost of running the learning algorithm. For algorithms that can be executed only once, McNemar's test is the only test with acceptable type I error. For algorithms that can be executed 10 times, the 5 x 2 cv test is recommended, because it is slightly more powerful and because it directly measures variation due to the choice of training set.

Entities: Disease

Year: 1998 PMID： 9744903 DOI： 10.1162/089976698300017197

Source DB: PubMed Journal: Neural Comput ISSN： 0899-7667 Impact factor: 2.026

Keyword Cloud
Cited

237 in total

1. System identification applied to a visuomotor task: near-optimal human performance in a noisy changing task.

Authors: R J Baddeley; H A Ingram; R C Miall
Journal: J Neurosci Date: 2003-04-01 Impact factor: 6.167

2. Optimizing area under the ROC curve using semi-supervised learning.

Authors: Shijun Wang; Diana Li; Nicholas Petrick; Berkman Sahiner; Marius George Linguraru; Ronald M Summers
Journal: Pattern Recognit Date: 2015-01-01 Impact factor: 7.740

3. A fully-automatic caudate nucleus segmentation of brain MRI: application in volumetric analysis of pediatric attention-deficit/hyperactivity disorder.

Authors: Laura Igual; Joan Carles Soliva; Antonio Hernández-Vela; Sergio Escalera; Xavier Jiménez; Oscar Vilarroya; Petia Radeva
Journal: Biomed Eng Online Date: 2011-12-05 Impact factor: 2.819

4. Finding related sentence pairs in MEDLINE.

Authors: Larry H Smith; W John Wilbur
Journal: Inf Retr Boston Date: 2010-01-23 Impact factor: 2.293

5. High-throughput prediction of protein antigenicity using protein microarray data.

Authors: Christophe N Magnan; Michael Zeller; Matthew A Kayala; Adam Vigil; Arlo Randall; Philip L Felgner; Pierre Baldi
Journal: Bioinformatics Date: 2010-10-07 Impact factor: 6.937

Review 6. Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors: Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal: Chem Rev Date: 2020-06-10 Impact factor: 60.622

7. Classification of pallidal oscillations with increasing parkinsonian severity.

Authors: Allison T Connolly; Alicia L Jensen; Kenneth B Baker; Jerrold L Vitek; Matthew D Johnson
Journal: J Neurophysiol Date: 2015-04-15 Impact factor: 2.714

8. Rough set rule induction for suitability assessment.

Authors: Patricia A Berger
Journal: Environ Manage Date: 2004-10 Impact factor: 3.266

9. Using classification models for the generation of disease-specific medications from biomedical literature and clinical data repository.

Authors: Liqin Wang; Peter J Haug; Guilherme Del Fiol
Journal: J Biomed Inform Date: 2017-04-20 Impact factor: 6.317

10. Utilizing ECG-Based Heartbeat Classification for Hypertrophic Cardiomyopathy Identification.

Authors: Quazi Abidur Rahman; Larisa G Tereshchenko; Matthew Kongkatong; Theodore Abraham; M Roselle Abraham; Hagit Shatkay
Journal: IEEE Trans Nanobioscience Date: 2015-04-24 Impact factor: 2.935