Literature DB >> 25549938

A fast gene selection method for multi-cancer classification using multiple support vector data description.

Jin Cao1, Li Zhang2, Bangjun Wang1, Fanzhang Li1, Jiwen Yang2.   

Abstract

For cancer classification problems based on gene expression, the data usually has only a few dozen sizes but has thousands to tens of thousands of genes which could contain a large number of irrelevant genes. A robust feature selection algorithm is required to remove irrelevant genes and choose the informative ones. Support vector data description (SVDD) has been applied to gene selection for many years. However, SVDD cannot address the problems with multiple classes since it only considers the target class. In addition, it is time-consuming when applying SVDD to gene selection. This paper proposes a novel fast feature selection method based on multiple SVDD and applies it to multi-class microarray data. A recursive feature elimination (RFE) scheme is introduced to iteratively remove irrelevant features, so the proposed method is called multiple SVDD-RFE (MSVDD-RFE). To make full use of all classes for a given task, MSVDD-RFE independently selects a relevant gene subset for each class. The final selected gene subset is the union of these relevant gene subsets. The effectiveness and accuracy of MSVDD-RFE are validated by experiments on five publicly available microarray datasets. Our proposed method is faster and more effective than other methods.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Gene expression data; Gene selection; Multi-class classification; Support vector data description; Support vector machine

Mesh:

Year:  2014        PMID: 25549938     DOI: 10.1016/j.jbi.2014.12.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  6 in total

Review 1.  Systematic Review of an Automated Multiclass Detection and Classification System for Acute Leukaemia in Terms of Evaluation and Benchmarking, Open Challenges, Issues and Methodological Aspects.

Authors:  M A Alsalem; A A Zaidan; B B Zaidan; M Hashim; O S Albahri; A S Albahri; Ali Hadi; K I Mohammed
Journal:  J Med Syst       Date:  2018-09-19       Impact factor: 4.460

2.  In silico markers: an evolutionary and statistical approach to select informative genes of human breast cancer subtypes.

Authors:  Shib Sankar Bhowmick; Debotosh Bhattacharjee; Luis Rato
Journal:  Genes Genomics       Date:  2019-04-19       Impact factor: 1.839

3.  Gene selection approach based on improved swarm intelligent optimisation algorithm for tumour classification.

Authors:  Cong Jin; Shu-Wei Jin
Journal:  IET Syst Biol       Date:  2016-06       Impact factor: 1.615

4.  A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.

Authors:  Lin Sun; Xianglin Kong; Jiucheng Xu; Zhan'ao Xue; Ruibing Zhai; Shiguang Zhang
Journal:  Sci Rep       Date:  2019-06-20       Impact factor: 4.379

5.  Elastic Correlation Adjusted Regression (ECAR) scores for high dimensional variable importance measuring.

Authors:  Yuan Zhou; Botao Fa; Ting Wei; Jianle Sun; Zhangsheng Yu; Yue Zhang
Journal:  Sci Rep       Date:  2021-12-02       Impact factor: 4.379

6.  A novel gene expression test method of minimizing breast cancer risk in reduced cost and time by improving SVM-RFE gene selection method combined with LASSO.

Authors:  Madhuri Gupta; Bharat Gupta
Journal:  J Integr Bioinform       Date:  2020-12-29
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.