| Literature DB >> 32441000 |
Ge Zhang1,2, Jincui Hou1, Jianlin Wang1,2, Chaokun Yan3,4, Junwei Luo5.
Abstract
Due to the presence of irrelevant or redundant data in microarray datasets, capturing potential patterns accurately and directly via existing models is difficult. Feature selection (FS) has become a necessary strategy to identify and screen out the most relevant attributes. However, the high dimensionality of microarray datasets poses a serious challenge to most existing FS algorithms. For this purpose, we propose a novel feature selection strategy in this paper, called IG-MBKH. A pre-screening method of feature ranking which is based on information gain (IG) and an improved binary krill herd (MBKH) algorithm are integrated in this strategy. When searching for feature subsets using MBKH, a hyperbolic tangent function, an adaptive transfer factor, and a chaos memory weight factor are introduced to facilitate a better searching the possible feature subsets. The results indicates that the IG-MBKH algorithm can achieve improvement in convergence, the number of features and classification accuracy when compared to the BKH, MBKH, and several newest algorithms. Furthermore, we evaluate the impact of different classifiers on the performance of the strategy we propose.Entities:
Keywords: Feature selection; Information gain; Microarray datasets; Modified binary krill herd algorithm
Mesh:
Year: 2020 PMID: 32441000 DOI: 10.1007/s12539-020-00372-w
Source DB: PubMed Journal: Interdiscip Sci ISSN: 1867-1462 Impact factor: 2.233