Literature DB >> 17646335

Gene selection via the BAHSIC family of algorithms.

Le Song1, Justin Bedo, Karsten M Borgwardt, Arthur Gretton, Alex Smola.   

Abstract

MOTIVATION: Identifying significant genes among thousands of sequences on a microarray is a central challenge for cancer research in bioinformatics. The ultimate goal is to detect the genes that are involved in disease outbreak and progression. A multitude of methods have been proposed for this task of feature selection, yet the selected gene lists differ greatly between different methods. To accomplish biologically meaningful gene selection from microarray data, we have to understand the theoretical connections and the differences between these methods. In this article, we define a kernel-based framework for feature selection based on the Hilbert-Schmidt independence criterion and backward elimination, called BAHSIC. We show that several well-known feature selectors are instances of BAHSIC, thereby clarifying their relationship. Furthermore, by choosing a different kernel, BAHSIC allows us to easily define novel feature selection algorithms. As a further advantage, feature selection via BAHSIC works directly on multiclass problems.
RESULTS: In a broad experimental evaluation, the members of the BAHSIC family reach high levels of accuracy and robustness when compared to other feature selection techniques. Experiments show that features selected with a linear kernel provide the best classification performance in general, but if strong non-linearities are present in the data then non-linear kernels can be more suitable. AVAILABILITY: Accompanying homepage is http://www.dbs.ifi.lmu.de/~borgward/BAHSIC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17646335     DOI: 10.1093/bioinformatics/btm216

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Investigating the efficacy of nonlinear dimensionality reduction schemes in classifying gene and protein expression studies.

Authors:  George Lee; Carlos Rodriguez; Anant Madabhushi
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2008 Jul-Sep       Impact factor: 3.710

2.  Incorporating EBO-HSIC with SVM for Gene Selection Associated with Cervical Cancer Classification.

Authors:  S Geeitha; M Thangamani
Journal:  J Med Syst       Date:  2018-10-06       Impact factor: 4.460

3.  TRAF6 and IRF7 control HIV replication in macrophages.

Authors:  Mélissa Sirois; Lynda Robitaille; Robin Allary; Mohak Shah; Christopher H Woelk; Jérôme Estaquier; Jacques Corbeil
Journal:  PLoS One       Date:  2011-11-28       Impact factor: 3.240

4.  Kernelized partial least squares for feature reduction and classification of gene microarray data.

Authors:  Walker H Land; Xingye Qiao; Daniel E Margolis; William S Ford; Christopher T Paquette; Joseph F Perez-Rogers; Jeffrey A Borgia; Jack Y Yang; Youping Deng
Journal:  BMC Syst Biol       Date:  2011-12-23

5.  A Method to Extract Feature Variables Contributed in Nonlinear Machine Learning Prediction.

Authors:  Mayumi Suzuki; Takuma Shibahara; Yoshihiro Muragaki
Journal:  Methods Inf Med       Date:  2020-05-07       Impact factor: 2.176

6.  SlimPLS: a method for feature selection in gene expression-based disease classification.

Authors:  Michael Gutkin; Ron Shamir; Gideon Dror
Journal:  PLoS One       Date:  2009-07-29       Impact factor: 3.240

7.  Sequence2Vec: a novel embedding approach for modeling transcription factor binding affinity landscape.

Authors:  Hanjun Dai; Ramzan Umarov; Hiroyuki Kuwahara; Yu Li; Le Song; Xin Gao
Journal:  Bioinformatics       Date:  2017-11-15       Impact factor: 6.937

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.