Literature DB >> 32455423

Bayesian Non-linear Support Vector Machine for High-Dimensional Data with Incorporation of Graph Information on Features.

Wenli Sun1, Changgee Chang1, Qi Long1.   

Abstract

Support vector machine (SVM) is a popular classification method for analysis of high dimensional data such as genomics data. Recently a number of linear SVM methods have been developed to achieve feature selection through either frequentist regularization or Bayesian shrinkage, but the linear assumption may not be plausible for many real applications. In addition, recent work has demonstrated that incorporating known biological knowledge, such as those from functional genomics, into the statistical analysis of genomic data offers great promise of improved predictive accuracy and feature selection. Such biological knowledge can often be represented by graphs. In this article, we propose a novel knowledge-guided nonlinear Bayesian SVM approach for analysis of high-dimensional data. Our model uses graph information that represents the relationship among the features to guide feature selection. To achieve knowledge-guided feature selection, we assign an Ising prior to the indicators representing inclusion/exclusion of the features in the model. An efficient MCMC algorithm is developed for posterior inference. The performance of our method is evaluated and compared with several penalized linear SVM and the standard kernel SVM method in terms of prediction and feature selection in extensive simulation studies. Also, analyses of genomic data from a cancer study show that our method yields a more accurate prediction model for patient survival and reveals biologically more meaningful results than the existing methods.

Entities:  

Keywords:  Bayesian support vector machine; Gaussian process; Ising prior; Knowledge-guided; Pathway information

Year:  2020        PMID: 32455423      PMCID: PMC7243270          DOI: 10.1109/bigdata47090.2019.9006473

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Big Data


  17 in total

1.  Prediction of protein retention times in anion-exchange chromatography systems using support vector regression.

Authors:  Minghu Song; Curt M Breneman; Jinbo Bi; N Sukumar; Kristin P Bennett; Steven Cramer; Nihal Tugcu
Journal:  J Chem Inf Comput Sci       Date:  2002 Nov-Dec

2.  Variable selection for discriminant analysis with Markov random field priors for the analysis of microarray data.

Authors:  Francesco C Stingo; Marina Vannucci
Journal:  Bioinformatics       Date:  2010-12-14       Impact factor: 6.937

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

4.  penalizedSVM: a R-package for feature selection SVM classification.

Authors:  Natalia Becker; Wiebke Werft; Grischa Toedt; Peter Lichter; Axel Benner
Journal:  Bioinformatics       Date:  2009-04-27       Impact factor: 6.937

5.  Knowledge-Guided Bayesian Support Vector Machine for High-Dimensional Data with Application to Analysis of Genomics Data.

Authors:  Wenli Sun; Changgee Chang; Yize Zhao; Qi Long
Journal:  Proc IEEE Int Conf Big Data       Date:  2019-01-24

6.  Gene selection using support vector machines with non-convex penalty.

Authors:  Hao Helen Zhang; Jeongyoun Ahn; Xiaodong Lin; Cheolwoo Park
Journal:  Bioinformatics       Date:  2005-10-25       Impact factor: 6.937

7.  Network-based penalized regression with application to genomic data.

Authors:  Sunkyung Kim; Wei Pan; Xiaotong Shen
Journal:  Biometrics       Date:  2013-07-03       Impact factor: 2.571

8.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.

Authors:  Roel G W Verhaak; Katherine A Hoadley; Elizabeth Purdom; Victoria Wang; Yuan Qi; Matthew D Wilkerson; C Ryan Miller; Li Ding; Todd Golub; Jill P Mesirov; Gabriele Alexe; Michael Lawrence; Michael O'Kelly; Pablo Tamayo; Barbara A Weir; Stacey Gabriel; Wendy Winckler; Supriya Gupta; Lakshmi Jakkula; Heidi S Feiler; J Graeme Hodgson; C David James; Jann N Sarkaria; Cameron Brennan; Ari Kahn; Paul T Spellman; Richard K Wilson; Terence P Speed; Joe W Gray; Matthew Meyerson; Gad Getz; Charles M Perou; D Neil Hayes
Journal:  Cancer Cell       Date:  2010-01-19       Impact factor: 31.743

9.  Incorporating predictor network in penalized regression with application to microarray data.

Authors:  Wei Pan; Benhuai Xie; Xiaotong Shen
Journal:  Biometrics       Date:  2009-07-23       Impact factor: 2.571

10.  Elastic SCAD as a novel penalization method for SVM classification tasks in high-dimensional data.

Authors:  Natalia Becker; Grischa Toedt; Peter Lichter; Axel Benner
Journal:  BMC Bioinformatics       Date:  2011-05-09       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.