Literature DB >> 32339114

A new feature selection algorithm based on relevance, redundancy and complementarity.

Chao Li1, Xiao Luo1, Yanpeng Qi1, Zhenbo Gao1, Xiaohui Lin2.   

Abstract

Defining important information from biological data is critical for the study of disease diagnosis, drug efficacy and individualized treatment. Hence, the feature selection technique is widely applied. Many feature selection methods measure features based on relevance, redundancy and complementarity. Feature complementarity means that two features' cooperation can provide more information than the simple summation of their individual information. In this paper, we studied the feature selection technique and proposed a new feature selection algorithm based on relevance, redundancy and complementarity (FS-RRC). On selecting the feature subset, FS-RRC not only evaluates the feature relevance with the class label and the redundancy among the features but also evaluates the feature complementarity. If complementary features exist for a selected relevant feature, FS-RRC retains the feature with the largest complementarity to the selected feature subset. To show the performance of FS-RRC, it was compared with eleven efficient feature selection methods, MIFS, mRMR, CMIM, ReliefF, FCBF, PGVNS, MCRMCR, MCRMICR, RCDFS, SAFE and SVM-RFE on two synthetic datasets and fifteen public biological datasets. The experimental results showed the superiority of FS-RRC in accuracy, sensitivity, specificity, stability and time complexity. Hence, integrating feature individual discriminative ability, redundancy and complementarity can define more powerful feature subset for biological data analysis, and feature complementarity can help to study the biomedical phenomena more accurately.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Keywords:  Biological data analysis; Feature complementarity; Feature redundancy; Feature relevance; Feature selection

Mesh:

Year:  2020        PMID: 32339114     DOI: 10.1016/j.compbiomed.2020.103667

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  3 in total

1.  A Path-Based Feature Selection Algorithm for Enterprise Credit Risk Evaluation.

Authors:  Marui Du; Yue Ma; Zuoquan Zhang
Journal:  Comput Intell Neurosci       Date:  2022-05-09

Review 2.  Research progress of reduced amino acid alphabets in protein analysis and prediction.

Authors:  Yuchao Liang; Siqi Yang; Lei Zheng; Hao Wang; Jian Zhou; Shenghui Huang; Lei Yang; Yongchun Zuo
Journal:  Comput Struct Biotechnol J       Date:  2022-07-04       Impact factor: 6.155

3.  A Highly Discriminative Hybrid Feature Selection Algorithm for Cancer Diagnosis.

Authors:  Tarneem Elemam; Mohamed Elshrkawey
Journal:  ScientificWorldJournal       Date:  2022-08-09
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.