Literature DB >> 21241823

An efficient statistical feature selection approach for classification of gene expression data.

B Chandra1, Manish Gupta.   

Abstract

Classification of gene expression data plays a significant role in prediction and diagnosis of diseases. Gene expression data has a special characteristic that there is a mismatch in gene dimension as opposed to sample dimension. All genes do not contribute for efficient classification of samples. A robust feature selection algorithm is required to identify the important genes which help in classifying the samples efficiently. In order to select informative genes (features) based on relevance and redundancy characteristics, many feature selection algorithms have been introduced in the past. Most of the earlier algorithms require computationally expensive search strategy to find an optimal feature subset. Existing feature selection methods are also sensitive to the evaluation measures. The paper introduces a novel and efficient feature selection approach based on statistically defined effective range of features for every class termed as ERGS (Effective Range based Gene Selection). The basic principle behind ERGS is that higher weight is given to the feature that discriminates the classes clearly. Experimental results on well-known gene expression datasets illustrate the effectiveness of the proposed approach. Two popular classifiers viz. Nave Bayes Classifier (NBC) and Support Vector Machine (SVM) have been used for classification. The proposed feature selection algorithm can be helpful in ranking the genes and also is capable of identifying the most relevant genes responsible for diseases like leukemia, colon tumor, lung cancer, diffuse large B-cell lymphoma (DLBCL), prostate cancer.
Copyright © 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21241823     DOI: 10.1016/j.jbi.2011.01.001

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  14 in total

1.  Identification of tissue-specific tumor biomarker using different optimization algorithms.

Authors:  Shib Sankar Bhowmick; Debotosh Bhattacharjee; Luis Rato
Journal:  Genes Genomics       Date:  2018-12-08       Impact factor: 1.839

Review 2.  Contribution of bioinformatics prediction in microRNA-based cancer therapeutics.

Authors:  Jasjit K Banwait; Dhundy R Bastola
Journal:  Adv Drug Deliv Rev       Date:  2014-11-06       Impact factor: 15.470

3.  Automated Detection of Alzheimer's Disease Using Brain MRI Images- A Study with Various Feature Extraction Techniques.

Authors:  U Rajendra Acharya; Steven Lawrence Fernandes; Joel En WeiKoh; Edward J Ciaccio; Mohd Kamil Mohd Fabell; U John Tanik; V Rajinikanth; Chai Hong Yeong
Journal:  J Med Syst       Date:  2019-08-09       Impact factor: 4.460

4.  Gene expression feature selection for prostate cancer diagnosis using a two-phase heuristic-deterministic search strategy.

Authors:  Saleh Shahbeig; Akbar Rahideh; Mohammad Sadegh Helfroush; Kamran Kazemi
Journal:  IET Syst Biol       Date:  2018-08       Impact factor: 1.615

Review 5.  Data analysis methods for defining biomarkers from omics data.

Authors:  Chao Li; Zhenbo Gao; Benzhe Su; Guowang Xu; Xiaohui Lin
Journal:  Anal Bioanal Chem       Date:  2021-12-24       Impact factor: 4.142

6.  Win percentage: a novel measure for assessing the suitability of machine classifiers for biological problems.

Authors:  R Mitchell Parry; John H Phan; May D Wang
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

7.  An improved feature selection based on effective range for classification.

Authors:  Jianzhong Wang; Shuang Zhou; Yugen Yi; Jun Kong
Journal:  ScientificWorldJournal       Date:  2014-02-04

8.  A comparative analysis of swarm intelligence techniques for feature selection in cancer classification.

Authors:  Chellamuthu Gunavathi; Kandasamy Premalatha
Journal:  ScientificWorldJournal       Date:  2014-08-03

9.  Unsupervised gene selection using biological knowledge : application in sample clustering.

Authors:  Sudipta Acharya; Sriparna Saha; N Nikhil
Journal:  BMC Bioinformatics       Date:  2017-11-22       Impact factor: 3.169

10.  A New Strategy for Analyzing Time-Series Data Using Dynamic Networks: Identifying Prospective Biomarkers of Hepatocellular Carcinoma.

Authors:  Xin Huang; Jun Zeng; Lina Zhou; Chunxiu Hu; Peiyuan Yin; Xiaohui Lin
Journal:  Sci Rep       Date:  2016-08-31       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.