Literature DB >> 16220686

Multiple SVM-RFE for gene selection in cancer classification with expression data.

Kai-Bo Duan1, Jagath C Rajapakse, Haiying Wang, Francisco Azuaje.   

Abstract

This paper proposes a new feature selection method that uses a backward elimination procedure similar to that implemented in support vector machine recursive feature elimination (SVM-RFE). Unlike the SVM-RFE method, at each step, the proposed approach computes the feature ranking score from a statistical analysis of weight vectors of multiple linear SVMs trained on subsamples of the original training data. We tested the proposed method on four gene expression datasets for cancer classification. The results show that the proposed feature selection method selects better gene subsets than the original SVM-RFE and improves the classification accuracy. A Gene Ontology-based similarity assessment indicates that the selected subsets are functionally diverse, further validating our gene selection method. This investigation also suggests that, for gene expression-based cancer classification, average test error from multiple partitions of training and test sets can be recommended as a reference of performance quality.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16220686     DOI: 10.1109/tnb.2005.853657

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  71 in total

Review 1.  Classification algorithms for phenotype prediction in genomics and proteomics.

Authors:  Habtom W Ressom; Rency S Varghese; Zhen Zhang; Jianhua Xuan; Robert Clarke
Journal:  Front Biosci       Date:  2008-01-01

2.  DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.

Authors:  Farman Ali; Saeed Ahmed; Zar Nawab Khan Swati; Shahid Akbar
Journal:  J Comput Aided Mol Des       Date:  2019-05-23       Impact factor: 3.686

3.  DNA methylation profiling of medulloblastoma allows robust subclassification and improved outcome prediction using formalin-fixed biopsies.

Authors:  Edward C Schwalbe; Daniel Williamson; Janet C Lindsey; Dolores Hamilton; Sarra L Ryan; Hisham Megahed; Miklós Garami; Peter Hauser; Bożena Dembowska-Baginska; Danuta Perek; Paul A Northcott; Michael D Taylor; Roger E Taylor; David W Ellison; Simon Bailey; Steven C Clifford
Journal:  Acta Neuropathol       Date:  2013-01-05       Impact factor: 17.088

4.  Automatic Myonuclear Detection in Isolated Single Muscle Fibers Using Robust Ellipse Fitting and Sparse Representation.

Authors:  Hai Su; Fuyong Xing; Jonah D Lee; Charlotte A Peterson; Lin Yang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2014 Jul-Aug       Impact factor: 3.710

5.  Data mining approaches for genome-wide association of mood disorders.

Authors:  Mehdi Pirooznia; Fayaz Seifuddin; Jennifer Judy; Pamela B Mahon; James B Potash; Peter P Zandi
Journal:  Psychiatr Genet       Date:  2012-04       Impact factor: 2.458

Review 6.  Non-negative matrix factorization of multimodal MRI, fMRI and phenotypic data reveals differential changes in default mode subnetworks in ADHD.

Authors:  Ariana Anderson; Pamela K Douglas; Wesley T Kerr; Virginia S Haynes; Alan L Yuille; Jianwen Xie; Ying Nian Wu; Jesse A Brown; Mark S Cohen
Journal:  Neuroimage       Date:  2013-12-19       Impact factor: 6.556

7.  Revealing metabolite biomarkers for acupuncture treatment by linear programming based feature selection.

Authors:  Yong Wang; Qiao-Feng Wu; Chen Chen; Ling-Yun Wu; Xian-Zhong Yan; Shu-Guang Yu; Xiang-Sun Zhang; Fan-Rong Liang
Journal:  BMC Syst Biol       Date:  2012-07-16

Review 8.  Semantic similarity in biomedical ontologies.

Authors:  Catia Pesquita; Daniel Faria; André O Falcão; Phillip Lord; Francisco M Couto
Journal:  PLoS Comput Biol       Date:  2009-07-31       Impact factor: 4.475

9.  AdaBoost-based multiple SVM-RFE for classification of mammograms in DDSM.

Authors:  Sejong Yoon; Saejoon Kim
Journal:  BMC Med Inform Decis Mak       Date:  2009-11-03       Impact factor: 2.796

10.  Classification and feature selection algorithms for multi-class CGH data.

Authors:  Jun Liu; Sanjay Ranka; Tamer Kahveci
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.