Literature DB >> 14960464

Is cross-validation valid for small-sample microarray classification?

Ulisses M Braga-Neto1, Edward R Dougherty.   

Abstract

MOTIVATION: Microarray classification typically possesses two striking attributes: (1) classifier design and error estimation are based on remarkably small samples and (2) cross-validation error estimation is employed in the majority of the papers. Thus, it is necessary to have a quantifiable understanding of the behavior of cross-validation in the context of very small samples.
RESULTS: An extensive simulation study has been performed comparing cross-validation, resubstitution and bootstrap estimation for three popular classification rules-linear discriminant analysis, 3-nearest-neighbor and decision trees (CART)-using both synthetic and real breast-cancer patient data. Comparison is via the distribution of differences between the estimated and true errors. Various statistics for the deviation distribution have been computed: mean (for estimator bias), variance (for estimator precision), root-mean square error (for composition of bias and variance) and quartile ranges, including outlier behavior. In general, while cross-validation error estimation is much less biased than resubstitution, it displays excessive variance, which makes individual estimates unreliable for small samples. Bootstrap methods provide improved performance relative to variance, but at a high computational cost and often with increased bias (albeit, much less than with resubstitution).

Entities:  

Mesh:

Year:  2004        PMID: 14960464     DOI: 10.1093/bioinformatics/btg419

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  124 in total

1.  Age-specific profiles of tissue-level composition and mechanical properties in murine cortical bone.

Authors:  Mekhala Raghavan; Nadder D Sahar; David H Kohn; Michael D Morris
Journal:  Bone       Date:  2012-01-20       Impact factor: 4.398

2.  Shift-invariant discrete wavelet transform analysis for retinal image classification.

Authors:  April Khademi; Sridhar Krishnan
Journal:  Med Biol Eng Comput       Date:  2007-10-23       Impact factor: 2.602

3.  Decorrelation of the true and estimated classifier errors in high-dimensional settings.

Authors:  Blaise Hanczar; Jianping Hua; Edward R Dougherty
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

4.  Quantification of the impact of feature selection on the variance of cross-validation error estimation.

Authors:  Yufei Xiao; Jianping Hua; Edward R Dougherty
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

5.  Validation of computational methods in genomics.

Authors:  Edward R Doughtery; Hua Jianping; Michael L Bittner
Journal:  Curr Genomics       Date:  2007-03       Impact factor: 2.236

6.  Combining multiple microarray studies using bootstrap meta-analysis.

Authors:  Andrea B Barrett; John H Phan; May D Wang
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2008

7.  Which is better: holdout or full-sample classifier design?

Authors:  Marcel Brun; Qian Xu; Edward R Dougherty
Journal:  EURASIP J Bioinform Syst Biol       Date:  2008

8.  Multiple-rule bias in the comparison of classification rules.

Authors:  Mohammadmahdi R Yousefi; Jianping Hua; Edward R Dougherty
Journal:  Bioinformatics       Date:  2011-05-05       Impact factor: 6.937

9.  Highly polygenic architecture of antidepressant treatment response: Comparative analysis of SSRI and NRI treatment in an animal model of depression.

Authors:  Karim Malki; Maria Grazia Tosto; Héctor Mouriño-Talín; Sabela Rodríguez-Lorenzo; Oliver Pain; Irfan Jumhaboy; Tina Liu; Panos Parpas; Stuart Newman; Artem Malykh; Lucia Carboni; Rudolf Uher; Peter McGuffin; Leonard C Schalkwyk; Kevin Bryson; Mark Herbster
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2016-10-01       Impact factor: 3.568

10.  Emerging translational bioinformatics: knowledge-guided biomarker identification for cancer diagnostics.

Authors:  John H Phan; Qiqin Yin-Goen; Andrew N Young; May D Wang
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2009
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.