Literature DB >> 25177107

Bias and Stability of Single Variable Classifiers for Feature Ranking and Selection.

Shobeir Fakhraei1, Hamid Soltanian-Zadeh2, Farshad Fotouhi3.   

Abstract

Feature rankings are often used for supervised dimension reduction especially when discriminating power of each feature is of interest, dimensionality of dataset is extremely high, or computational power is limited to perform more complicated methods. In practice, it is recommended to start dimension reduction via simple methods such as feature rankings before applying more complex approaches. Single Variable Classifier (SVC) ranking is a feature ranking based on the predictive performance of a classifier built using only a single feature. While benefiting from capabilities of classifiers, this ranking method is not as computationally intensive as wrappers. In this paper, we report the results of an extensive study on the bias and stability of such feature ranking method. We study whether the classifiers influence the SVC rankings or the discriminative power of features themselves has a dominant impact on the final rankings. We show the common intuition of using the same classifier for feature ranking and final classification does not always result in the best prediction performance. We then study if heterogeneous classifiers ensemble approaches provide more unbiased rankings and if they improve final classification performance. Furthermore, we calculate an empirical prediction performance loss for using the same classifier in SVC feature ranking and final classification from the optimal choices.

Entities:  

Keywords:  AdaBoost; Bias; Dimension Reduction; Feature Ranking; Feature Selection; K-Nearest Neighbors; Logistic Regression; Multilayer Perceptron; Naïve Bayes; Random Forests; Single Variable Classifier; Stability; Support Vector Machines

Year:  2014        PMID: 25177107      PMCID: PMC4144463          DOI: 10.1016/j.eswa.2014.05.007

Source DB:  PubMed          Journal:  Expert Syst Appl        ISSN: 0957-4174            Impact factor:   6.954


  1 in total

1.  A novel feature selection approach for biomedical data classification.

Authors:  Yonghong Peng; Zhiqing Wu; Jianmin Jiang
Journal:  J Biomed Inform       Date:  2009-07-30       Impact factor: 6.317

  1 in total
  2 in total

1.  Artery/vein classification of retinal vessels using classifiers fusion.

Authors:  Samra Irshad; Xiao-Xia Yin; Yanchun Zhang
Journal:  Health Inf Sci Syst       Date:  2019-11-08

2.  Machine Learning Prediction Models for In-Hospital Mortality After Transcatheter Aortic Valve Replacement.

Authors:  Dagmar F Hernandez-Suarez; Yeunjung Kim; Pedro Villablanca; Tanush Gupta; Jose Wiley; Brenda G Nieves-Rodriguez; Jovaniel Rodriguez-Maldonado; Roberto Feliu Maldonado; Istoni da Luz Sant'Ana; Cristina Sanina; Pedro Cox-Alomar; Harish Ramakrishna; Angel Lopez-Candales; William W O'Neill; Duane S Pinto; Azeem Latib; Abiel Roche-Lima
Journal:  JACC Cardiovasc Interv       Date:  2019-07-22       Impact factor: 11.195

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.