Literature DB >> 31751882

A new feature selection method based on symmetrical uncertainty and interaction gain.

Xiaohui Lin1, Chao Li2, Weijie Ren2, Xiao Luo2, Yanpeng Qi2.   

Abstract

Defining important information from complex biological data is of great significance in biological study. It is known that the physiological and pathological changes in an organism are usually influenced by molecule interactions. Analyzing biological data by fusing the evaluation of the individual molecules and molecule interactions could induce a more accurate and comprehensive understanding of the organism. This study proposes an Interaction Gain - Recursive Feature Elimination (IG-RFE) method which evaluates the feature importance by combining the relevance between feature and class label and the interaction among features. Symmetrical uncertainty is adopted to measure the relevance between feature and the class label. The average normalized interaction gain of feature f, every other features and the class label is calculated to reflect the interaction of feature f with other features in the feature set F. Based on the combination of symmetrical uncertainty and normalized interaction gain, less important features are removed iteratively. To show the performance of IG-RFE, it was compared with seven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS and SVM-RFE, on eleven public datasets. The experiment results showed the superiority of IG-RFE in accuracy, sensitivity, specificity and stability. Hence, integrating feature individual discriminative ability and the interaction among features could better evaluate feature importance in biological data analysis.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Keywords:  Biological data analysis; Feature selection; Interaction gain

Year:  2019        PMID: 31751882     DOI: 10.1016/j.compbiolchem.2019.107149

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  3 in total

1.  A prediction model of outcome of SARS-CoV-2 pneumonia based on laboratory findings.

Authors:  Gang Wu; Shuchang Zhou; Yujin Wang; Wenzhi Lv; Shili Wang; Ting Wang; Xiaoming Li
Journal:  Sci Rep       Date:  2020-08-20       Impact factor: 4.379

2.  A Feature Selection Algorithm Integrating Maximum Classification Information and Minimum Interaction Feature Dependency Information.

Authors:  Li Zhang
Journal:  Comput Intell Neurosci       Date:  2021-12-28

3.  Ranking Series of Cancer-Related Gene Expression Data by Means of the Superposing Significant Interaction Rules Method.

Authors:  Emili Besalú; Jesus Vicente De Julián-Ortiz
Journal:  Biomolecules       Date:  2020-09-08
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.