A centroid-based gene selection method for microarray data classification.

Shun Guo¹, Donghui Guo², Lifei Chen³, Qingshan Jiang⁴.

Abstract

Microarray data used for classification typically contain a large number of irrelevant and redundant features. In this paper, a new gene selection method is proposed that chooses the best subset of features for microarray data by removing the irrelevant and redundant ones. We formulate the selection problem as an L1-regularized optimization problem based on a newly defined linear discriminant analysis criterion. Instead of calculating the mean of the samples, a kernel-based approach is used to estimate the class centroid, which in turn defines both the between-class separability and the within-class compactness in the criterion. Theoretical analysis indicates that the global optimal solution of the L1-regularized criterion can be reached under a general condition, from which an efficient algorithm is derived that solves the feature selection problem in time linear in both the number of features and the number of samples. Experimental results on ten publicly available microarray datasets demonstrate that the proposed method performs effectively and competitively compared with state-of-the-art methods.
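
The abstract does not give the criterion or the algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a Gaussian kernel centred on the per-feature median as the centroid estimator, and an illustrative criterion (between-class minus within-class scatter per feature, with non-negative weights of at most unit length) chosen because its L1-penalized maximizer has a closed form. The names `kernel_centroid`, `select_genes`, `lam` and `bandwidth` are hypothetical.

```python
import numpy as np


def kernel_centroid(X, bandwidth=1.0):
    """Per-feature Gaussian-kernel-weighted centroid (assumed estimator)."""
    center = np.median(X, axis=0)
    weights = np.exp(-0.5 * ((X - center) / bandwidth) ** 2)
    total = weights.sum(axis=0)
    # Fall back to the plain mean if every weight underflows to zero.
    safe_total = np.where(total > 0, total, 1.0)
    weighted = (weights * X).sum(axis=0) / safe_total
    return np.where(total > 0, weighted, X.mean(axis=0))


def select_genes(X, y, lam=0.5, bandwidth=1.0):
    """Return a sparse, non-negative weight per feature (illustrative only).

    Maximizes w.score - lam * ||w||_1 subject to w >= 0 and ||w||_2 <= 1,
    whose solution is the normalized soft-thresholded score.
    """
    overall = kernel_centroid(X, bandwidth)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in np.unique(y):
        Xc = X[y == c]
        mu = kernel_centroid(Xc, bandwidth)
        between += len(Xc) * (mu - overall) ** 2   # separability
        within += ((Xc - mu) ** 2).sum(axis=0)     # compactness
    score = (between - within) / len(X)
    w = np.maximum(score - lam, 0.0)               # L1 soft-threshold
    norm = np.linalg.norm(w)
    return w / norm if norm > 0 else w


# Synthetic stand-in for expression data: 60 samples, 50 "genes",
# of which only the first three differ between the two classes.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 30)
X = rng.normal(size=(60, 50))
X[:, :3] += np.where(y[:, None] == 1, 2.0, -2.0)

w = select_genes(X, y, lam=0.5)
selected = np.flatnonzero(w)  # indices of genes with non-zero weight
```

Features whose score does not exceed `lam` receive a weight of exactly zero, which is how the L1 penalty performs the selection in this sketch. The cost grows roughly linearly with the number of samples times the number of features.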

Keywords: Class centroid; Classification; Gene selection; L1 regularization; Microarray data

Substances:
Biomarkers, Tumor

Year: 2016 PMID: 27056739 DOI: 10.1016/j.jtbi.2016.03.034

Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691

Cited

4 in total

1. Optimizing ANFIS using simulated annealing algorithm for classification of microarray gene expression cancer data.

2. Gene selection for microarray data classification via subspace learning and manifold regularization.

3. Feature selection for high-dimensional temporal data.

4. CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests.