Literature DB >> 23746736

Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data.

Yan Cui1, Chun-Hou Zheng, Jian Yang, Wen Sha.   

Abstract

Dimensionality reduction is necessary for gene expression data classification. In this paper, we propose a new method for reducing the dimensionality of gene expression data. First, based on a sparse representation, we developed a new criterion for characterizing the margin, which is called sparse maximum margin discriminant analysis (SMMDA); this approach can be used to find an optimal transform matrix such that the sparse margin is maximal in the transformed space. Second, using SMMDA, we present a new feature extraction method for gene expression data. Third, based on SMMDA, we propose a new discriminant gene selection method. During gene selection, we first found the one-dimensional projection of the gene expression data in the most separable direction using SMMDA. Then, we applied the sparse representation technique to regress the projection, and we obtained the relevance vector for the gene set. Discriminant genes were then selected according to this vector. Compared with the conventional method of maximum margin discriminant analysis, the proposed SMMDA method successfully avoids the difficulty of parameter selection. Extensive experiments using publicly available gene expression datasets showed that SMMDA is efficient for feature extraction and gene selection.
Copyright © 2013 Elsevier Ltd. All rights reserved.

Mesh:

Year:  2013        PMID: 23746736     DOI: 10.1016/j.compbiomed.2013.04.018

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  3 in total

Review 1.  Systematic Review of an Automated Multiclass Detection and Classification System for Acute Leukaemia in Terms of Evaluation and Benchmarking, Open Challenges, Issues and Methodological Aspects.

Authors:  M A Alsalem; A A Zaidan; B B Zaidan; M Hashim; O S Albahri; A S Albahri; Ali Hadi; K I Mohammed
Journal:  J Med Syst       Date:  2018-09-19       Impact factor: 4.460

2.  Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy.

Authors:  Saleh Shahbeig; Akbar Rahideh; Mohammad Sadegh Helfroush; Kamran Kazemi
Journal:  IET Syst Biol       Date:  2018-08       Impact factor: 1.615

3.  An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets.

Authors:  Arezo Torang; Paraag Gupta; David J Klinke
Journal:  BMC Bioinformatics       Date:  2019-08-22       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.