Literature DB >> 16118263

Analysis of recursive gene selection approaches from microarray data.

Fan Li1, Yiming Yang.   

Abstract

MOTIVATION: Finding a small subset of most predictive genes from microarray for disease prediction is a challenging problem. Support vector machines (SVMs) have been found to be successful with a recursive procedure in selecting important genes for cancer prediction. However, it is not well understood how much of the success depends on the choice of the specific classifier and how much on the recursive procedure. We answer this question by examining multiple classifers [SVM, ridge regression (RR) and Rocchio] with feature selection in recursive and non-recursive settings on three DNA microarray datasets (ALL-AML Leukemia data, Breast Cancer data and GCM data).
RESULTS: We found recursive RR most effective. On the AML-ALL dataset, it achieved zero error rate on the test set using only three genes (selected from over 7000), which is more encouraging than the best published result (zero error rate using 8 genes by recursive SVM). On the Breast Cancer dataset and the two largest categories of the GCM dataset, the results achieved by recursive RR are also very encouraging. A further analysis of the experimental results shows that different classifiers penalize redundant features to different extent and this property plays an important role in the recursive feature selection process. RR classifier tends to penalize redundant features to a much larger extent than the SVM does. This may be the reason why recursive RR has a better performance in selecting genes.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16118263     DOI: 10.1093/bioinformatics/bti618

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  PUGSVM: a caBIG™ analytical tool for multiclass gene selection and predictive classification.

Authors:  Guoqiang Yu; Huai Li; Sook Ha; Ie-Ming Shih; Robert Clarke; Eric P Hoffman; Subha Madhavan; Jianhua Xuan; Yue Wang
Journal:  Bioinformatics       Date:  2010-12-24       Impact factor: 6.937

2.  Multicategory reclassification statistics for assessing improvements in diagnostic accuracy.

Authors:  Jialiang Li; Binyan Jiang; Jason P Fine
Journal:  Biostatistics       Date:  2012-11-28       Impact factor: 5.899

3.  Outcome prediction based on microarray analysis: a critical perspective on methods.

Authors:  Michalis Zervakis; Michalis E Blazadonakis; Georgia Tsiliki; Vasiliki Danilatou; Manolis Tsiknakis; Dimitris Kafetzopoulos
Journal:  BMC Bioinformatics       Date:  2009-02-07       Impact factor: 3.169

4.  Kernelized partial least squares for feature reduction and classification of gene microarray data.

Authors:  Walker H Land; Xingye Qiao; Daniel E Margolis; William S Ford; Christopher T Paquette; Joseph F Perez-Rogers; Jeffrey A Borgia; Jack Y Yang; Youping Deng
Journal:  BMC Syst Biol       Date:  2011-12-23

5.  Google goes cancer: improving outcome prediction for cancer patients by network-based ranking of marker genes.

Authors:  Christof Winter; Glen Kristiansen; Stephan Kersting; Janine Roy; Daniela Aust; Thomas Knösel; Petra Rümmele; Beatrix Jahnke; Vera Hentrich; Felix Rückert; Marco Niedergethmann; Wilko Weichert; Marcus Bahra; Hans J Schlitt; Utz Settmacher; Helmut Friess; Markus Büchler; Hans-Detlev Saeger; Michael Schroeder; Christian Pilarsky; Robert Grützmann
Journal:  PLoS Comput Biol       Date:  2012-05-17       Impact factor: 4.475

6.  Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.

Authors:  Satoshi Niijima; Satoru Kuhara
Journal:  BMC Bioinformatics       Date:  2006-12-25       Impact factor: 3.169

7.  Development and experimental test of support vector machines virtual screening method for searching Src inhibitors from large compound libraries.

Authors:  Bucong Han; Xiaohua Ma; Ruiying Zhao; Jingxian Zhang; Xiaona Wei; Xianghui Liu; Xin Liu; Cunlong Zhang; Chunyan Tan; Yuyang Jiang; Yuzong Chen
Journal:  Chem Cent J       Date:  2012-11-23       Impact factor: 4.215

8.  Selecting informative genes for discriminant analysis using multigene expression profiles.

Authors:  Xin Yan; Tian Zheng
Journal:  BMC Genomics       Date:  2008-09-16       Impact factor: 3.969

9.  Recursive cluster elimination (RCE) for classification and feature selection from gene expression data.

Authors:  Malik Yousef; Segun Jung; Louise C Showe; Michael K Showe
Journal:  BMC Bioinformatics       Date:  2007-05-02       Impact factor: 3.169

10.  Classification and biomarker identification using gene network modules and support vector machines.

Authors:  Malik Yousef; Mohamed Ketany; Larry Manevitz; Louise C Showe; Michael K Showe
Journal:  BMC Bioinformatics       Date:  2009-10-15       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.