Literature DB >> 28650835

A New Representation in PSO for Discretization-Based Feature Selection.

Binh Tran, Bing Xue, Mengjie Zhang, Bing Xue, Binh Tran, Mengjie Zhang.   

Abstract

In machine learning, discretization and feature selection (FS) are important techniques for preprocessing data to improve the performance of an algorithm on high-dimensional data. Since many FS methods require discrete data, a common practice is to apply discretization before FS. In addition, for the sake of efficiency, features are usually discretized individually (or univariate). This scheme works based on the assumption that each feature independently influences the task, which may not hold in cases where feature interactions exist. Therefore, univariate discretization may degrade the performance of the FS stage since information showing feature interactions may be lost during the discretization process. Initial results of our previous proposed method [evolve particle swarm optimization (EPSO)] showed that combining discretization and FS in a single stage using bare-bones particle swarm optimization (BBPSO) can lead to a better performance than applying them in two separate stages. In this paper, we propose a new method called potential particle swarm optimization (PPSO) which employs a new representation that can reduce the search space of the problem and a new fitness function to better evaluate candidate solutions to guide the search. The results on ten high-dimensional datasets show that PPSO select less than 5% of the number of features for all datasets. Compared with the two-stage approach which uses BBPSO for FS on the discretized data, PPSO achieves significantly higher accuracy on seven datasets. In addition, PPSO obtains better (or similar) classification performance than EPSO on eight datasets with a smaller number of selected features on six datasets. Furthermore, PPSO also outperforms the three compared (traditional) methods and performs similar to one method on most datasets in terms of both generalization ability and learning capacity.

Entities:  

Year:  2017        PMID: 28650835     DOI: 10.1109/TCYB.2017.2714145

Source DB:  PubMed          Journal:  IEEE Trans Cybern        ISSN: 2168-2267            Impact factor:   11.448


  5 in total

1.  A Cooperative Coevolutionary Approach to Discretization-Based Feature Selection for High-Dimensional Data.

Authors:  Yu Zhou; Junhao Kang; Xiao Zhang
Journal:  Entropy (Basel)       Date:  2020-06-01       Impact factor: 2.524

2.  A graph-based gene selection method for medical diagnosis problems using a many-objective PSO algorithm.

Authors:  Saeid Azadifar; Ali Ahmadi
Journal:  BMC Med Inform Decis Mak       Date:  2021-11-27       Impact factor: 2.796

3.  An adaptive and altruistic PSO-based deep feature selection method for Pneumonia detection from Chest X-rays.

Authors:  Rishav Pramanik; Sourodip Sarkar; Ram Sarkar
Journal:  Appl Soft Comput       Date:  2022-08-10       Impact factor: 8.263

4.  Improved Binary Grasshopper Optimization Algorithm for Feature Selection Problem.

Authors:  Gui-Ling Wang; Shu-Chuan Chu; Ai-Qing Tian; Tao Liu; Jeng-Shyang Pan
Journal:  Entropy (Basel)       Date:  2022-05-31       Impact factor: 2.738

5.  Feature Selection Based on Adaptive Particle Swarm Optimization with Leadership Learning.

Authors:  Zhiwei Ye; Yi Xu; Qiyi He; Mingwei Wang; Wanfang Bai; Hongwei Xiao
Journal:  Comput Intell Neurosci       Date:  2022-08-28
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.