Literature DB >> 19182978

A two-stage feature selection method for gene expression data.

Li-Yeh Chuang1, Chao-Hsuan Ke, Hsueh-Wei Chang, Cheng-Hong Yang.   

Abstract

Microarray data referencing gene expression profiles provide valuable answers to a variety of problems, and contributes to advances in clinical medicine. Gene expression data typically has a high dimension and a small sample size. Generally, only relatively small numbers of gene expression data are strongly correlated with a certain phenotype. To analyze gene expression profiles correctly, feature (gene) selection is crucial for classification. Feature (gene) selection has certain advantages, such as effective extraction of genes that influence classification accuracy, elimination of irrelevant genes, and improvement of the classification accuracy calculation. In this paper, we propose a two-stage feature selection method, which uses information gain to implement a gene-ranking process, and combines an improved particle swarm optimization with the K-nearest neighbor method and support vector machine classifiers to calculate the classification accuracy. The experimental results show that the proposed method can effectively select relevant gene subsets, and achieves higher classification accuracy than previous studies.

Mesh:

Year:  2009        PMID: 19182978     DOI: 10.1089/omi.2008.0083

Source DB:  PubMed          Journal:  OMICS        ISSN: 1536-2310


  6 in total

1.  Performance Analysis of Deep Learning Models for Binary Classification of Cancer Gene Expression Data.

Authors:  Subhasree Majumder; Vipin Pal; Anju Yadav; Amitabha Chakrabarty
Journal:  J Healthc Eng       Date:  2022-03-09       Impact factor: 2.682

2.  Left ventricular global transcriptional profiling in human end-stage dilated cardiomyopathy.

Authors:  Dilek Colak; Namik Kaya; Jawaher Al-Zahrani; Albandary Al Bakheet; Paul Muiya; Editha Andres; John Quackenbush; Nduna Dzimiri
Journal:  Genomics       Date:  2009-03-28       Impact factor: 5.736

3.  A feature selection method based on multiple kernel learning with expression profiles of different types.

Authors:  Wei Du; Zhongbo Cao; Tianci Song; Ying Li; Yanchun Liang
Journal:  BioData Min       Date:  2017-02-02       Impact factor: 2.522

4.  Discovery of significant porcine SNPs for swine breed identification by a hybrid of information gain, genetic algorithm, and frequency feature selection technique.

Authors:  Kitsuchart Pasupa; Wanthanee Rathasamuth; Sissades Tongsima
Journal:  BMC Bioinformatics       Date:  2020-05-26       Impact factor: 3.169

5.  Integrated Left Ventricular Global Transcriptome and Proteome Profiling in Human End-Stage Dilated Cardiomyopathy.

Authors:  Dilek Colak; Ayodele A Alaiya; Namik Kaya; Nzioka P Muiya; Olfat AlHarazi; Zakia Shinwari; Editha Andres; Nduna Dzimiri
Journal:  PLoS One       Date:  2016-10-06       Impact factor: 3.240

6.  A framework model using multifilter feature selection to enhance colon cancer classification.

Authors:  Murad Al-Rajab; Joan Lu; Qiang Xu
Journal:  PLoS One       Date:  2021-04-16       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.