Literature DB >> 24561345

A novel class dependent feature selection method for cancer biomarker discovery.

Wengang Zhou1, Julie A Dickerson2.   

Abstract

Identifying key biomarkers for different cancer types can improve diagnosis accuracy and treatment. Gene expression data can help differentiate between cancer subtypes. However the limitation of having a small number of samples versus a larger number of genes represented in a dataset leads to the overfitting of classification models. Feature selection methods can help select the most distinguishing feature sets for classifying different cancers. A new class dependent feature selection approach integrates the F-statistic, Maximum Relevance Binary Particle Swarm Optimization (MRBPSO) and Class Dependent Multi-category Classification (CDMC) system. This feature selection method combines filter and wrapper based methods. A set of highly differentially expressed genes (features) are pre-selected using the F statistic for each dataset as a filter for selecting the most meaningful features. MRBPSO and CDMC function as a wrapper to select desirable feature subsets for each class and classify the samples using those chosen class-dependent feature subsets. The performance of the proposed methods is evaluated on eight real cancer datasets. The results indicate that the class-dependent approaches can effectively identify biomarkers related to each cancer type and improve classification accuracy compared to class independent feature selection methods.
Copyright © 2014 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Binary particle swarm optimization; Cancer biomarker discovery; Class dependent multi-category classification; Feature selection; Support vector machine

Mesh:

Substances:

Year:  2014        PMID: 24561345     DOI: 10.1016/j.compbiomed.2014.01.014

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  3 in total

1.  A comparative study of improvements Pre-filter methods bring on feature selection using microarray data.

Authors:  Yingying Wang; Xiaomao Fan; Yunpeng Cai
Journal:  Health Inf Sci Syst       Date:  2014-10-16

2.  Evolutionary sequential genetic search technique-based cancer classification using fuzzy rough nearest neighbour classifier.

Authors:  Loganathan Meenachi; Srinivasan Ramakrishnan
Journal:  Healthc Technol Lett       Date:  2018-08-15

3.  The Cross-Entropy Based Multi-Filter Ensemble Method for Gene Selection.

Authors:  Yingqiang Sun; Chengbo Lu; Xiaobo Li
Journal:  Genes (Basel)       Date:  2018-05-17       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.