
Many are called, but few are chosen. Feature selection and error estimation in high dimensional spaces.

Helene Schulerud, Fritz Albregtsen.

Abstract

We address the problems of feature selection and error estimation when the number of possible feature candidates is large and the number of training samples is limited. A Monte Carlo study has been performed to illustrate the problems when using stepwise feature selection and discriminant analysis. The simulations demonstrate that in order to find the correct features, the necessary ratio of number of training samples to feature candidates is not a constant. It depends on the number of feature candidates, training samples and the Mahalanobis distance between the classes. Moreover, the leave-one-out error estimate may be a highly biased error estimate when feature selection is performed on the same data as the error estimation. It may even indicate complete separation of the classes, while no real difference between the classes exists. However, if feature selection and leave-one-out error estimation are performed in one process, an unbiased error estimate is achieved, but with high variance. The holdout error estimate gives a reliable estimate with low variance, depending on the size of the test set.
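The bias described in the abstract can be reproduced with a small simulation. The following is a minimal sketch, not the authors' Monte Carlo setup. It assumes pure-noise data (no real class difference), and it substitutes a simple univariate ranking for stepwise selection and a nearest-class-mean rule for discriminant analysis. All function names and the sizes (20 samples, 200 candidate features, 5 selected) are hypothetical choices made for illustration.

```python
import random

def class_means(X, y, features):
    """Return {label: [mean of each listed feature]} for labels 0 and 1."""
    means = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        means[label] = [sum(r[f] for r in rows) / len(rows) for f in features]
    return means

def select_features(X, y, k):
    """Univariate ranking (a stand-in for stepwise selection): keep the k
    features with the largest absolute difference between class means."""
    all_features = range(len(X[0]))
    m = class_means(X, y, all_features)
    scores = [abs(a - b) for a, b in zip(m[0], m[1])]
    return sorted(all_features, key=lambda f: scores[f], reverse=True)[:k]

def predict(x, means, features):
    """Nearest-class-mean rule on the selected features."""
    dist = {
        label: sum((x[f] - mu) ** 2 for f, mu in zip(features, means[label]))
        for label in means
    }
    return min(dist, key=dist.get)

def loo_error(X, y, k, select_inside):
    """Leave-one-out error. With select_inside=False the features are chosen
    once on all samples (including the one later left out); with
    select_inside=True they are re-chosen within every fold."""
    fixed = None if select_inside else select_features(X, y, k)
    errors = 0
    for i in range(len(X)):
        X_tr, y_tr = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        feats = select_features(X_tr, y_tr, k) if select_inside else fixed
        means = class_means(X_tr, y_tr, feats)
        errors += int(predict(X[i], means, feats) != y[i])
    return errors / len(X)

# Assumed simulation sizes; the classes are identical by construction,
# so the true error rate of any classifier is 0.5.
rng = random.Random(0)
n_samples, n_candidates, k, n_trials = 20, 200, 5, 20
biased, unbiased = [], []
for _ in range(n_trials):
    X = [[rng.gauss(0.0, 1.0) for _ in range(n_candidates)]
         for _ in range(n_samples)]
    y = [i % 2 for i in range(n_samples)]
    biased.append(loo_error(X, y, k, select_inside=False))
    unbiased.append(loo_error(X, y, k, select_inside=True))

biased_mean = sum(biased) / n_trials
unbiased_mean = sum(unbiased) / n_trials
print("selection outside LOO loop:", biased_mean)
print("selection inside LOO loop: ", unbiased_mean)
```

Because the two classes are drawn from the same distribution, an honest estimate should sit near 0.5. Selecting features once on the full data set lets the left-out sample influence which features are used, which is the source of the optimistic estimate; repeating the selection inside each fold removes that leak, at the cost of the high trial-to-trial variance the abstract mentions.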


Year:  2004        PMID: 14757253     DOI: 10.1016/s0169-2607(03)00018-x

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  4 in total

1.  Alzheimer disease: quantitative structural neuroimaging for detection and prediction of clinical and structural changes in mild cognitive impairment.

Authors:  Linda K McEvoy; Christine Fennema-Notestine; J Cooper Roddey; Donald J Hagler; Dominic Holland; David S Karow; Christopher J Pung; James B Brewer; Anders M Dale
Journal:  Radiology       Date:  2009-02-06       Impact factor: 11.105

2.  Support vector regression-based QSAR models for prediction of antioxidant activity of phenolic compounds.

Authors:  Ying Shi
Journal:  Sci Rep       Date:  2021-04-22       Impact factor: 4.379

3.  Entropy-based adaptive nuclear texture features are independent prognostic markers in a total population of uterine sarcomas.

Authors:  Birgitte Nielsen; Tarjei Sveinsgjerd Hveem; Wanja Kildal; Vera M Abeler; Gunnar B Kristensen; Fritz Albregtsen; Håvard E Danielsen
Journal:  Cytometry A       Date:  2014-12-05       Impact factor: 4.355

4.  Chromatin changes predict recurrence after radical prostatectomy.

Authors:  Tarjei S Hveem; Andreas Kleppe; Ljiljana Vlatkovic; Elin Ersvær; Håkon Wæhre; Birgitte Nielsen; Marte Avranden Kjær; Manohar Pradhan; Rolf Anders Syvertsen; John Arne Nesheim; Knut Liestøl; Fritz Albregtsen; Håvard E Danielsen
Journal:  Br J Cancer       Date:  2016-04-28       Impact factor: 7.640

