| Literature DB >> 19259405 |
A-L Boulesteix1, C Strobl, T Augustin, M Daumer.
Abstract
For the last eight years, microarray-based class prediction has been the subject of numerous publications in medicine, bioinformatics and statistics journals. However, in many articles, the assessment of classification accuracy is carried out using suboptimal procedures and is not paid much attention. In this paper, we carefully review various statistical aspects of classifier evaluation and validation from a practical point of view. The main topics addressed are accuracy measures, error rate estimation procedures, variable selection, choice of classifiers and validation strategy.Entities:
Keywords: accuracy measures; classification; conditional and unconditional error rate; error rate estimation; gene expression; high-dimensional data; validation data; variable selection
Year: 2008 PMID: 19259405 PMCID: PMC2623308 DOI: 10.4137/cin.s408
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351