Literature DB >> 26327447

An experimental comparison of feature selection methods on two-class biomedical datasets.

P Drotár1, J Gazda2, Z Smékal3.   

Abstract

Feature selection is a significant part of many machine learning applications dealing with small-sample and high-dimensional data. Choosing the most important features is an essential step for knowledge discovery in many areas of biomedical informatics. The increased popularity of feature selection methods and their frequent utilisation raise challenging new questions about the interpretability and stability of feature selection techniques. In this study, we compared the behaviour of ten state-of-the-art filter methods for feature selection in terms of their stability, similarity, and influence on prediction performance. All of the experiments were conducted on eight two-class datasets from biomedical areas. While entropy-based feature selection appears to be the most stable, the feature selection techniques yielding the highest prediction performance are minimum redundance maximum relevance method and feature selection based on Bhattacharyya distance. In general, univariate feature selection techniques perform similarly to or even better than more complex multivariate feature selection techniques with high-dimensional datasets. However, with more complex and smaller datasets multivariate methods slightly outperform univariate techniques.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Keywords:  Classification performance; Feature selection; Multivariate FS; Stability; Univariate FS

Mesh:

Year:  2015        PMID: 26327447     DOI: 10.1016/j.compbiomed.2015.08.010

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  10 in total

1.  LassoNet: Neural Networks with Feature Sparsity.

Authors:  Ismael Lemhadri; Feng Ruan; Robert Tibshirani
Journal:  Proc Mach Learn Res       Date:  2021-04

2.  Robust clinical marker identification for diabetic kidney disease with ensemble feature selection.

Authors:  Xing Song; Lemuel R Waitman; Yong Hu; Alan S L Yu; David C Robbins; Mei Liu
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

3.  Feature Ranking in Predictive Models for Hospital-Acquired Acute Kidney Injury.

Authors:  Lijuan Wu; Yong Hu; Xiaoxiao Liu; Xiangzhou Zhang; Weiqi Chen; Alan S L Yu; John A Kellum; Lemuel R Waitman; Mei Liu
Journal:  Sci Rep       Date:  2018-11-23       Impact factor: 4.379

4.  Comparison of four variable selection methods to determine the important variables in predicting the prognosis of traumatic brain injury patients by support vector machine.

Authors:  Saeedeh Pourahmad; Soheila Rasouli-Emadi; Fatemeh Moayyedi; Hosseinali Khalili
Journal:  J Res Med Sci       Date:  2019-11-27       Impact factor: 1.852

5.  Cost-sensitive learning strategies for high-dimensional and imbalanced data: a comparative study.

Authors:  Barbara Pes; Giuseppina Lai
Journal:  PeerJ Comput Sci       Date:  2021-12-24

6.  Determination of biomarkers from microarray data using graph neural network and spectral clustering.

Authors:  Kun Yu; Weidong Xie; Linjie Wang; Shoujia Zhang; Wei Li
Journal:  Sci Rep       Date:  2021-12-13       Impact factor: 4.379

7.  Exploration of Potential miRNA Biomarkers and Prediction for Ovarian Cancer Using Artificial Intelligence.

Authors:  Farzaneh Hamidi; Neda Gilani; Reza Arabi Belaghi; Parvin Sarbakhsh; Tuba Edgünlü; Pasqualina Santaguida
Journal:  Front Genet       Date:  2021-11-25       Impact factor: 4.599

8.  Mapping the Corn Residue-Covered Types Using Multi-Scale Feature Fusion and Supervised Learning Method by Chinese GF-2 PMS Image.

Authors:  Wancheng Tao; Yi Dong; Wei Su; Jiayu Li; Fu Xuan; Jianxi Huang; Jianyu Yang; Xuecao Li; Yelu Zeng; Baoguo Li
Journal:  Front Plant Sci       Date:  2022-06-21       Impact factor: 6.627

9.  Data Integration-Possibilities of Molecular and Clinical Data Fusion on the Example of Thyroid Cancer Diagnostics.

Authors:  Alicja Płuciennik; Aleksander Płaczek; Agata Wilk; Sebastian Student; Małgorzata Oczko-Wojciechowska; Krzysztof Fujarewicz
Journal:  Int J Mol Sci       Date:  2022-10-06       Impact factor: 6.208

10.  IDRMutPred: predicting disease-associated germline nonsynonymous single nucleotide variants (nsSNVs) in intrinsically disordered regions.

Authors:  Jing-Bo Zhou; Yao Xiong; Ke An; Zhi-Qiang Ye; Yun-Dong Wu
Journal:  Bioinformatics       Date:  2020-12-22       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.