Literature DB >> 24288447

On Kolmogorov Asymptotics of Estimators of the Misclassification Error Rate in Linear Discriminant Analysis.

Amin Zollanvari1, Marc G Genton.   

Abstract

We provide a fundamental theorem that can be used in conjunction with Kolmogorov asymptotic conditions to derive the first moments of well-known estimators of the actual error rate in linear discriminant analysis of a multivariate Gaussian model under the assumption of a common known covariance matrix. The estimators studied in this paper are plug-in and smoothed resubstitution error estimators, both of which have not been studied before under Kolmogorov asymptotic conditions. As a result of this work, we present an optimal smoothing parameter that makes the smoothed resubstitution an unbiased estimator of the true error. For the sake of completeness, we further show how to utilize the presented fundamental theorem to achieve several previously reported results, namely the first moment of the resubstitution estimator and the actual error rate. We provide numerical examples to show the accuracy of the succeeding finite sample approximations in situations where the number of dimensions is comparable or even larger than the sample size.

Entities:  

Keywords:  Double asymptotics; Error estimation; Kolmogorov asymptotic analysis; Plug-in error; Resubstitution; Smoothed resubstitution; True error

Year:  2013        PMID: 24288447      PMCID: PMC3840470          DOI: 10.1007/s13171-013-0029-9

Source DB:  PubMed          Journal:  Sankhya Ser A        ISSN: 0976-836X


  10 in total

1.  Prediction of cancer outcome with microarrays: a multiple random validation strategy.

Authors:  Stefan Michiels; Serge Koscielny; Catherine Hill
Journal:  Lancet       Date:  2005 Feb 5-11       Impact factor: 79.321

Review 2.  Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting.

Authors:  Alain Dupuy; Richard M Simon
Journal:  J Natl Cancer Inst       Date:  2007-01-17       Impact factor: 13.506

3.  Validation of computational methods in genomics.

Authors:  Edward R Doughtery; Hua Jianping; Michael L Bittner
Journal:  Curr Genomics       Date:  2007-03       Impact factor: 2.236

4.  Some examples of discrimination.

Authors:  C A B SMITH
Journal:  Ann Eugen       Date:  1947-06

5.  On dimensionality, sample size, classification error, and complexity of classification algorithm in pattern recognition.

Authors:  S Raudys; V Pikelis
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1980-03       Impact factor: 6.226

6.  Statistical properties of error estimators in performance assessment of recognition systems.

Authors:  J Kittler; P A Devijver
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  1982-02       Impact factor: 6.226

7.  Identification of combination gene sets for glioma classification.

Authors:  Seungchan Kim; Edward R Dougherty; Ilya Shmulevich; Kenneth R Hess; Stanley R Hamilton; Jeffrey M Trent; Gregory N Fuller; Wei Zhang
Journal:  Mol Cancer Ther       Date:  2002-11       Impact factor: 6.261

8.  Estimation of error rates in discriminant analysis with selection of variables.

Authors:  S M Snapinn; J D Knoke
Journal:  Biometrics       Date:  1989-03       Impact factor: 2.571

9.  On the epistemological crisis in genomics.

Authors:  Edward R Dougherty
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236

10.  Expression profiling to predict the clinical behaviour of ovarian cancer fails independent evaluation.

Authors:  Olivier Gevaert; Frank De Smet; Toon Van Gorp; Nathalie Pochet; Kristof Engelen; Frederic Amant; Bart De Moor; Dirk Timmerman; Ignace Vergote
Journal:  BMC Cancer       Date:  2008-01-22       Impact factor: 4.430

  10 in total
  1 in total

Review 1.  High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries.

Authors:  Amin Zollanvari
Journal:  Cancer Inform       Date:  2016-04-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.