Literature DB >> 28113635

Feature Selection for Optimized High-Dimensional Biomedical Data Using an Improved Shuffled Frog Leaping Algorithm.

Bin Hu, Yongqiang Dai, Yun Su, Philip Moore, Xiaowei Zhang, Chengsheng Mao, Jing Chen, Lixin Xu.   

Abstract

High dimensional biomedical datasets contain thousands of features which can be used in molecular diagnosis of disease, however, such datasets contain many irrelevant or weak correlation features which influence the predictive accuracy of diagnosis. Without a feature selection algorithm, it is difficult for the existing classification techniques to accurately identify patterns in the features. The purpose of feature selection is to not only identify a feature subset from an original set of features [without reducing the predictive accuracy of classification algorithm] but also reduce the computation overhead in data mining. In this paper, we present our improved shuffled frog leaping algorithm which introduces a chaos memory weight factor, an absolute balance group strategy, and an adaptive transfer factor. Our proposed approach explores the space of possible subsets to obtain the set of features that maximizes the predictive accuracy and minimizes irrelevant features in high-dimensional biomedical data. To evaluate the effectiveness of our proposed method, we have employed the K-nearest neighbor method with a comparative analysis in which we compare our proposed approach with genetic algorithms, particle swarm optimization, and the shuffled frog leaping algorithm. Experimental results show that our improved algorithm achieves improvements in the identification of relevant subsets and in classification accuracy.

Entities:  

Mesh:

Year:  2016        PMID: 28113635     DOI: 10.1109/TCBB.2016.2602263

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  8 in total

1.  Supervised Nonnegative Matrix Factorization to Predict ICU Mortality Risk.

Authors:  Guoqing Chao; Chengsheng Mao; Fei Wang; Yuan Zhao; Yuan Luo
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2019-01-24

Review 2.  A review of swarm-based metaheuristic optimization techniques and their application to doubly fed induction generator.

Authors:  Kumeshan Reddy; Akshay K Saha
Journal:  Heliyon       Date:  2022-10-07

Review 3.  Data analysis methods for defining biomarkers from omics data.

Authors:  Chao Li; Zhenbo Gao; Benzhe Su; Guowang Xu; Xiaohui Lin
Journal:  Anal Bioanal Chem       Date:  2021-12-24       Impact factor: 4.142

4.  Classification of high dimensional biomedical data based on feature selection using redundant removal.

Authors:  Bingtao Zhang; Peng Cao
Journal:  PLoS One       Date:  2019-04-09       Impact factor: 3.240

5.  An Efficient hybrid filter-wrapper metaheuristic-based gene selection method for high dimensional datasets.

Authors:  Jamshid Pirgazi; Mohsen Alimoradi; Tahereh Esmaeili Abharian; Mohammad Hossein Olyaee
Journal:  Sci Rep       Date:  2019-12-09       Impact factor: 4.379

Review 6.  Review of Big Data Analytics, Artificial Intelligence and Nature-Inspired Computing Models towards Accurate Detection of COVID-19 Pandemic Cases and Contact Tracing.

Authors:  Israel Edem Agbehadji; Bankole Osita Awuzie; Alfred Beati Ngowi; Richard C Millham
Journal:  Int J Environ Res Public Health       Date:  2020-07-24       Impact factor: 3.390

7.  Feature Selection in High Dimensional Biomedical Data Based on BF-SFLA.

Authors:  Yongqiang Dai; Lili Niu; Linjing Wei; Jie Tang
Journal:  Front Neurosci       Date:  2022-04-18       Impact factor: 5.152

8.  Bio-inspired computation for big data fusion, storage, processing, learning and visualization: state of the art and future directions.

Authors:  Ana I Torre-Bastida; Josu Díaz-de-Arcaya; Eneko Osaba; Khan Muhammad; David Camacho; Javier Del Ser
Journal:  Neural Comput Appl       Date:  2021-08-03       Impact factor: 5.606

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.