Literature DB >> 31612223

Better-than-chance classification for signal detection.

Jonathan D Rosenblatt1, Yuval Benjamini2, Roee Gilron3, Roy Mukamel4, Jelle J Goeman5.   

Abstract

The estimated accuracy of a classifier is a random quantity with variability. A common practice in supervised machine learning, is thus to test if the estimated accuracy is significantly better than chance level. This method of signal detection is particularly popular in neuroimaging and genetics. We provide evidence that using a classifier's accuracy as a test statistic can be an underpowered strategy for finding differences between populations, compared to a bona fide statistical test. It is also computationally more demanding than a statistical test. Via simulation, we compare test statistics that are based on classification accuracy, to others based on multivariate test statistics. We find that the probability of detecting differences between two distributions is lower for accuracy-based statistics. We examine several candidate causes for the low power of accuracy-tests. These causes include: the discrete nature of the accuracy-test statistic, the type of signal accuracy-tests are designed to detect, their inefficient use of the data, and their suboptimal regularization. When the purpose of the analysis is the evaluation of a particular classifier, not signal detection, we suggest several improvements to increase power. In particular, to replace V-fold cross-validation with the Leave-One-Out Bootstrap.
© The Author 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Keywords:  High dimension; Multivariate testing; Neuroimaging; Statistical genetics; Supervised learning

Year:  2021        PMID: 31612223      PMCID: PMC8036001          DOI: 10.1093/biostatistics/kxz035

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  15 in total

1.  Permutation tests for classification: towards statistical significance in image-based studies.

Authors:  Polina Golland; Bruce Fischl
Journal:  Inf Process Med Imaging       Date:  2003-07

2.  Exact testing with random permutations.

Authors:  Jesse Hemerik; Jelle Goeman
Journal:  Test (Madr)       Date:  2017-11-30       Impact factor: 2.345

3.  Information-based functional brain mapping.

Authors:  Nikolaus Kriegeskorte; Rainer Goebel; Peter Bandettini
Journal:  Proc Natl Acad Sci U S A       Date:  2006-02-28       Impact factor: 11.205

4.  Calculating confidence intervals for prediction error in microarray classification using resampling.

Authors:  Wenyu Jiang; Sudhir Varma; Richard Simon
Journal:  Stat Appl Genet Mol Biol       Date:  2008-03-01

5.  Simulation-based hypothesis testing of high dimensional means under covariance heterogeneity.

Authors:  Jinyuan Chang; Chao Zheng; Wen-Xin Zhou; Wen Zhou
Journal:  Biometrics       Date:  2017-03-31       Impact factor: 2.571

6.  Statistical inference and multiple testing correction in classification-based multi-voxel pattern analysis (MVPA): random permutations and cluster size control.

Authors:  Johannes Stelzer; Yi Chen; Robert Turner
Journal:  Neuroimage       Date:  2012-10-04       Impact factor: 6.556

Review 7.  Machine learning classifiers and fMRI: a tutorial overview.

Authors:  Francisco Pereira; Tom Mitchell; Matthew Botvinick
Journal:  Neuroimage       Date:  2008-11-21       Impact factor: 6.556

8.  An fMRI-based neurologic signature of physical pain.

Authors:  Tor D Wager; Lauren Y Atlas; Martin A Lindquist; Mathieu Roy; Choong-Wan Woo; Ethan Kross
Journal:  N Engl J Med       Date:  2013-04-11       Impact factor: 91.245

9.  Shrinkage-based diagonal discriminant analysis and its applications in high-dimensional data.

Authors:  Herbert Pang; Tiejun Tong; Hongyu Zhao
Journal:  Biometrics       Date:  2009-12       Impact factor: 2.571

10.  The human voice areas: Spatial organization and inter-individual variability in temporal and extra-temporal cortices.

Authors:  Cyril R Pernet; Phil McAleer; Marianne Latinus; Krzysztof J Gorgolewski; Ian Charest; Patricia E G Bestelmeyer; Rebecca H Watson; David Fleming; Frances Crabbe; Mitchell Valdes-Sosa; Pascal Belin
Journal:  Neuroimage       Date:  2015-06-24       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.