Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Is cross-validation valid for small-sample microarray classification?

Literature DB >> 14960464

Is cross-validation valid for small-sample microarray classification?

Ulisses M Braga-Neto¹, Edward R Dougherty.

Abstract

MOTIVATION: Microarray classification typically possesses two striking attributes: (1) classifier design and error estimation are based on remarkably small samples and (2) cross-validation error estimation is employed in the majority of the papers. Thus, it is necessary to have a quantifiable understanding of the behavior of cross-validation in the context of very small samples.
RESULTS: An extensive simulation study has been performed comparing cross-validation, resubstitution and bootstrap estimation for three popular classification rules-linear discriminant analysis, 3-nearest-neighbor and decision trees (CART)-using both synthetic and real breast-cancer patient data. Comparison is via the distribution of differences between the estimated and true errors. Various statistics for the deviation distribution have been computed: mean (for estimator bias), variance (for estimator precision), root-mean square error (for composition of bias and variance) and quartile ranges, including outlier behavior. In general, while cross-validation error estimation is much less biased than resubstitution, it displays excessive variance, which makes individual estimates unreliable for small samples. Bootstrap methods provide improved performance relative to variance, but at a high computational cost and often with increased bias (albeit, much less than with resubstitution).

Entities: Disease Gene Species

Mesh：

Year: 2004 PMID： 14960464 DOI： 10.1093/bioinformatics/btg419

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

124 in total

1. Age-specific profiles of tissue-level composition and mechanical properties in murine cortical bone.

Authors: Mekhala Raghavan; Nadder D Sahar; David H Kohn; Michael D Morris
Journal: Bone Date: 2012-01-20 Impact factor: 4.398

2. Shift-invariant discrete wavelet transform analysis for retinal image classification.

Authors: April Khademi; Sridhar Krishnan
Journal: Med Biol Eng Comput Date: 2007-10-23 Impact factor: 2.602

3. Decorrelation of the true and estimated classifier errors in high-dimensional settings.

Authors: Blaise Hanczar; Jianping Hua; Edward R Dougherty
Journal: EURASIP J Bioinform Syst Biol Date: 2007

4. Quantification of the impact of feature selection on the variance of cross-validation error estimation.

Authors: Yufei Xiao; Jianping Hua; Edward R Dougherty
Journal: EURASIP J Bioinform Syst Biol Date: 2007

5. Validation of computational methods in genomics.

Authors: Edward R Doughtery; Hua Jianping; Michael L Bittner
Journal: Curr Genomics Date: 2007-03 Impact factor: 2.236

6. Combining multiple microarray studies using bootstrap meta-analysis.

Authors: Andrea B Barrett; John H Phan; May D Wang
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2008

7. Which is better: holdout or full-sample classifier design?

Authors: Marcel Brun; Qian Xu; Edward R Dougherty
Journal: EURASIP J Bioinform Syst Biol Date: 2008

8. Multiple-rule bias in the comparison of classification rules.

Authors: Mohammadmahdi R Yousefi; Jianping Hua; Edward R Dougherty
Journal: Bioinformatics Date: 2011-05-05 Impact factor: 6.937

9. Highly polygenic architecture of antidepressant treatment response: Comparative analysis of SSRI and NRI treatment in an animal model of depression.

Authors: Karim Malki; Maria Grazia Tosto; Héctor Mouriño-Talín; Sabela Rodríguez-Lorenzo; Oliver Pain; Irfan Jumhaboy; Tina Liu; Panos Parpas; Stuart Newman; Artem Malykh; Lucia Carboni; Rudolf Uher; Peter McGuffin; Leonard C Schalkwyk; Kevin Bryson; Mark Herbster
Journal: Am J Med Genet B Neuropsychiatr Genet Date: 2016-10-01 Impact factor: 3.568

10. Emerging translational bioinformatics: knowledge-guided biomarker identification for cancer diagnostics.

Authors: John H Phan; Qiqin Yin-Goen; Andrew N Young; May D Wang
Journal: Conf Proc IEEE Eng Med Biol Soc Date: 2009