
Feature Import Vector Machine: A General Classifier with Flexible Feature Selection.

Samiran Ghosh, Yazhen Wang.

Abstract

The support vector machine (SVM) and other reproducing kernel Hilbert space (RKHS) based classifier systems have drawn much attention recently because of their robustness and generalization capability. The general theme is to construct classifiers from the training data in a high-dimensional space using all available dimensions. The SVM achieves substantial data compression by selecting only the few observations that lie close to the boundary of the classifier function. However, when the number of observations is not very large (small n) but the number of dimensions/features is large (large p), the available features need not all be equally important for classification. Selecting a useful fraction of the available features may itself yield substantial data compression. In this paper we propose an algorithmic approach by which such an optimal set of features can be selected. In short, we reverse the traditional sequential observation selection strategy of the SVM into one of sequential feature selection. To achieve this, we modify the solution proposed by Zhu and Hastie (2005) in the context of the import vector machine (IVM) to select an optimal sub-dimensional model on which to build the final classifier with sufficient accuracy.
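The idea of sequential feature selection described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it swaps the kernel (RKHS) model for a plain linear, L2-penalized logistic regression fitted by gradient descent, and greedily adds the feature that most reduces the penalized loss. The function names (`fit_loss`, `forward_select`), the penalty `lam`, the stopping tolerance `tol`, and the toy data are all assumptions made for illustration.

```python
import math
import random

def _sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def _log_loss(margin):
    # Numerically stable log(1 + exp(-margin)).
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))

def fit_loss(X, y, cols, lam=0.1, steps=200, lr=0.5):
    """Fit L2-penalized logistic regression on columns `cols` by gradient
    descent and return the final penalized loss (labels in y are 0/1)."""
    n = len(X)
    w, b = [0.0] * len(cols), 0.0
    for _ in range(steps):
        gw, gb = [0.0] * len(cols), 0.0
        for row, label in zip(X, y):
            z = b + sum(wk * row[c] for wk, c in zip(w, cols))
            err = _sigmoid(z) - label
            gb += err
            for k, c in enumerate(cols):
                gw[k] += err * row[c]
        w = [wk - lr * (gk / n + lam * wk) for wk, gk in zip(w, gw)]
        b -= lr * gb / n
    data_term = sum(
        _log_loss((2 * label - 1) * (b + sum(wk * row[c] for wk, c in zip(w, cols))))
        for row, label in zip(X, y)
    ) / n
    return data_term + 0.5 * lam * sum(wk * wk for wk in w)

def forward_select(X, y, max_features, tol=1e-3):
    """Greedily add the feature that lowers the penalized loss the most;
    stop when the improvement falls below `tol` (hypothetical rule)."""
    selected, best = [], float("inf")
    remaining = list(range(len(X[0])))
    while remaining and len(selected) < max_features:
        loss, j = min((fit_loss(X, y, selected + [j]), j) for j in remaining)
        if best - loss < tol:
            break
        selected.append(j)
        remaining.remove(j)
        best = loss
    return selected

# Toy data (assumed): 40 observations, 5 features, only column 2 is informative.
rng = random.Random(0)
y = [i % 2 for i in range(40)]
X = [[rng.gauss(0, 1) for _ in range(5)] for _ in y]
for row, label in zip(X, y):
    row[2] = (1.0 if label else -1.0) + rng.gauss(0, 0.3)

selected = forward_select(X, y, max_features=2)
print(selected)
```

On this toy data the informative column 2 should be chosen first, since the remaining columns are pure noise. A kernel version would replace the linear score with a kernel expansion evaluated on the selected feature subset; that step is omitted here.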


Keywords:  Classification; Import Vector Machine; Radial Basis Function; Regularization; Reproducing Kernel Hilbert Space; Support Vector Machine

Year:  2015        PMID: 27081431      PMCID: PMC4829386          DOI: 10.1002/sam.11259

Source DB:  PubMed          Journal:  Stat Anal Data Min        ISSN: 1932-1864            Impact factor:   1.051


  8 in total

1.  RankGene: identification of diagnostic genes based on expression data.

Authors:  Yang Su; T M Murali; Vladimir Pavlovic; Michael Schaffer; Simon Kasif
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

2.  Systematic benchmarking of microarray data classification: assessing the role of non-linearity and dimensionality reduction.

Authors:  Nathalie Pochet; Frank De Smet; Johan A K Suykens; Bart L R De Moor
Journal:  Bioinformatics       Date:  2004-07-01       Impact factor: 6.937

3.  The generalized LASSO.

Authors:  Volker Roth
Journal:  IEEE Trans Neural Netw       Date:  2004-01

4.  Penalized logistic regression for detecting gene interactions.

Authors:  Mee Young Park; Trevor Hastie
Journal:  Biostatistics       Date:  2007-04-11       Impact factor: 5.899

5.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

6.  Predicting the clinical status of human breast cancer by using gene expression profiles.

Authors:  M West; C Blanchette; H Dressman; E Huang; S Ishida; R Spang; H Zuzan; J A Olson; J R Marks; J R Nevins
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-18       Impact factor: 11.205

7.  Gene mining: a novel and powerful ensemble decision approach to hunting for disease genes using microarray expression profiling.

Authors:  Xia Li; Shaoqi Rao; Yadong Wang; Binsheng Gong
Journal:  Nucleic Acids Res       Date:  2004-05-17       Impact factor: 16.971

8.  Coupled two-way clustering analysis of breast cancer and colon cancer gene expression data.

Authors:  Gad Getz; Hilah Gal; Itai Kela; Daniel A Notterman; Eytan Domany
Journal:  Bioinformatics       Date:  2003-06-12       Impact factor: 6.937

