Literature DB >> 15607418

A robust hybrid between genetic algorithm and support vector machine for extracting an optimal feature gene subset.

Li Li1, Wei Jiang, Xia Li, Kathy L Moser, Zheng Guo, Lei Du, Qiuju Wang, Eric J Topol, Qing Wang, Shaoqi Rao.   

Abstract

Development of a robust and efficient approach for extracting useful information from microarray data continues to be a significant and challenging task. Microarray data are characterized by a high dimension, high signal-to-noise ratio, and high correlations between genes, but with a relatively small sample size. Current methods for dimensional reduction can further be improved for the scenario of the presence of a single (or a few) high influential gene(s) in which its effect in the feature subset would prohibit inclusion of other important genes. We have formalized a robust gene selection approach based on a hybrid between genetic algorithm and support vector machine. The major goal of this hybridization was to exploit fully their respective merits (e.g., robustness to the size of solution space and capability of handling a very large dimension of feature genes) for identification of key feature genes (or molecular signatures) for a complex biological phenotype. We have applied the approach to the microarray data of diffuse large B cell lymphoma to demonstrate its behaviors and properties for mining the high-dimension data of genome-wide gene expression profiles. The resulting classifier(s) (the optimal gene subset(s)) has achieved the highest accuracy (99%) for prediction of independent microarray samples in comparisons with marginal filters and a hybrid between genetic algorithm and K nearest neighbors.

Mesh:

Year:  2005        PMID: 15607418     DOI: 10.1016/j.ygeno.2004.09.007

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  23 in total

1.  Ranking analysis of microarray data: a powerful method for identifying differentially expressed genes.

Authors:  Yuan-De Tan; Myriam Fornage; Yun-Xin Fu
Journal:  Genomics       Date:  2006-09-18       Impact factor: 5.736

2.  Investigating the efficacy of nonlinear dimensionality reduction schemes in classifying gene and protein expression studies.

Authors:  George Lee; Carlos Rodriguez; Anant Madabhushi
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2008 Jul-Sep       Impact factor: 3.710

3.  Identification of disease-causing genes using microarray data mining and Gene Ontology.

Authors:  Azadeh Mohammadi; Mohammad H Saraee; Mansoor Salehi
Journal:  BMC Med Genomics       Date:  2011-01-26       Impact factor: 3.063

4.  Phenomics: the systematic study of phenotypes on a genome-wide scale.

Authors:  R M Bilder; F W Sabb; T D Cannon; E D London; J D Jentsch; D Stott Parker; R A Poldrack; C Evans; N B Freimer
Journal:  Neuroscience       Date:  2009-01-20       Impact factor: 3.590

5.  Pattern-driven neighborhood search for biclustering of microarray data.

Authors:  Wassim Ayadi; Mourad Elloumi; Jin-Kao Hao
Journal:  BMC Bioinformatics       Date:  2012-05-08       Impact factor: 3.169

6.  Towards precise classification of cancers based on robust gene functional expression profiles.

Authors:  Zheng Guo; Tianwen Zhang; Xia Li; Qi Wang; Jianzhen Xu; Hui Yu; Jing Zhu; Haiyun Wang; Chenguang Wang; Eric J Topol; Qing Wang; Shaoqi Rao
Journal:  BMC Bioinformatics       Date:  2005-03-17       Impact factor: 3.169

7.  Gene selection algorithms for microarray data based on least squares support vector machine.

Authors:  E Ke Tang; P N Suganthan; Xin Yao
Journal:  BMC Bioinformatics       Date:  2006-02-27       Impact factor: 3.169

8.  Discovery of Time-Delayed Gene Regulatory Networks based on temporal gene expression profiling.

Authors:  Xia Li; Shaoqi Rao; Wei Jiang; Chuanxing Li; Yun Xiao; Zheng Guo; Qingpu Zhang; Lihong Wang; Lei Du; Jing Li; Li Li; Tianwen Zhang; Qing K Wang
Journal:  BMC Bioinformatics       Date:  2006-01-18       Impact factor: 3.169

9.  A robust hybrid approach based on estimation of distribution algorithm and support vector machine for hunting candidate disease genes.

Authors:  Li Li; Hongmei Chen; Chang Liu; Fang Wang; Fangfang Zhang; Lihua Bai; Yihan Chen; Luying Peng
Journal:  ScientificWorldJournal       Date:  2013-02-07

10.  Tissue-based Alzheimer gene expression markers-comparison of multiple machine learning approaches and investigation of redundancy in small biomarker sets.

Authors:  Lena Scheubert; Mitja Luštrek; Rainer Schmidt; Dirk Repsilber; Georg Fuellen
Journal:  BMC Bioinformatics       Date:  2012-10-15       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.