Literature DB >> 18052912

Selection of biologically relevant genes with a wrapper stochastic algorithm.

Kim-Anh Lê Cao1, Olivier Gonçalves, Philippe Besse, Sébastien Gadat.   

Abstract

We investigate an important issue of a meta-algorithm for selecting variables in the framework of microarray data. This wrapper method starts from any classification algorithm and weights each variable (i.e. gene) relative to its efficiency for classification. An optimization procedure is then inferred which exhibits important genes for the studied biological process. Theory and application with the SVM classifier were presented in Gadat and Younes, 2007 and we extend this method with CART. The classification error rates are computed on three famous public databases (Leukemia, Colon and Prostate) and compared with those from other wrapper methods (RFE, lo norm SVM, Random Forests). This allows the assessment of the statistical relevance of the proposed algorithm. Furthermore, a biological interpretation with the Ingenuity Pathway Analysis software outputs clearly shows that the gene selections from the different wrapper methods raise very relevant biological information, compared to a classical filter gene selection with T-test.

Entities:  

Mesh:

Year:  2007        PMID: 18052912     DOI: 10.2202/1544-6115.1312

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  5 in total

1.  Novel multivariate methods for integration of genomics and proteomics data: applications in a kidney transplant rejection study.

Authors:  Oliver P Günther; Heesun Shin; Raymond T Ng; W Robert McMaster; Bruce M McManus; Paul A Keown; Scott J Tebbutt; Kim-Anh Lê Cao
Journal:  OMICS       Date:  2014-11

2.  Integrative mixture of experts to combine clinical factors and gene markers.

Authors:  Kim-Anh Lê Cao; Emmanuelle Meugnier; Geoffrey J McLachlan
Journal:  Bioinformatics       Date:  2010-03-11       Impact factor: 6.937

3.  Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems.

Authors:  Kim-Anh Lê Cao; Simon Boitard; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2011-06-22       Impact factor: 3.169

4.  Predicting qualitative phenotypes from microarray data - the Eadgene pig data set.

Authors:  Christèle Robert-Granié; Kim-Anh Lê Cao; Magali Sancristobal
Journal:  BMC Proc       Date:  2009-07-16

5.  Growth rate regulated genes and their wide involvement in the Lactococcus lactis stress responses.

Authors:  Clémentine Dressaire; Emma Redon; Helene Milhem; Philippe Besse; Pascal Loubière; Muriel Cocaign-Bousquet
Journal:  BMC Genomics       Date:  2008-07-21       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.