Literature DB >> 16111861

Bayesian approach to feature selection and parameter tuning for support vector machine classifiers.

Carl Gold1, Alex Holub, Peter Sollich.   

Abstract

A Bayesian point of view of SVM classifiers allows the definition of a quantity analogous to the evidence in probabilistic models. By maximizing this one can systematically tune hyperparameters and, via automatic relevance determination (ARD), select relevant input features. Evidence gradients are expressed as averages over the associated posterior and can be approximated using Hybrid Monte Carlo (HMC) sampling. We describe how a Nyström approximation of the Gram matrix can be used to speed up sampling times significantly while maintaining almost unchanged classification accuracy. In experiments on classification problems with a significant number of irrelevant features this approach to ARD can give a significant improvement in classification performance over more traditional, non-ARD, SVM systems. The final tuned hyperparameter values provide a useful criterion for pruning irrelevant features, and we define a measure of relevance with which to determine systematically how many features should be removed. This use of ARD for hard feature selection can improve classification accuracy in non-ARD SVMs. In the majority of cases, however, we find that in data sets constructed by human domain experts the performance of non-ARD SVMs is largely insensitive to the presence of some less relevant features. Eliminating such features via ARD then does not improve classification accuracy, but leads to impressive reductions in the number of features required, by up to 75%.

Entities:  

Mesh:

Year:  2005        PMID: 16111861     DOI: 10.1016/j.neunet.2005.06.044

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  5 in total

1.  A biomedical decision support system using LS-SVM classifier with an efficient and new parameter regularization procedure for diagnosis of heart valve diseases.

Authors:  Emre Comak; Ahmet Arslan
Journal:  J Med Syst       Date:  2010-06-04       Impact factor: 4.460

2.  Network Medicine: New Paradigm in the -Omics Era.

Authors:  Nancy Lan Guo
Journal:  Anat Physiol       Date:  2011-12-13

Review 3.  Network-based identification of biomarkers coexpressed with multiple pathways.

Authors:  Nancy Lan Guo; Ying-Wooi Wan
Journal:  Cancer Inform       Date:  2014-10-16

4.  Increasing the Accuracy of Hourly Multi-Output Solar Power Forecast with Physics-Informed Machine Learning.

Authors:  Daniel Vázquez Pombo; Henrik W Bindner; Sergiu Viorel Spataru; Poul Ejnar Sørensen; Peder Bacher
Journal:  Sensors (Basel)       Date:  2022-01-19       Impact factor: 3.576

5.  A Genetic Algorithm Based Support Vector Machine Model for Blood-Brain Barrier Penetration Prediction.

Authors:  Daqing Zhang; Jianfeng Xiao; Nannan Zhou; Mingyue Zheng; Xiaomin Luo; Hualiang Jiang; Kaixian Chen
Journal:  Biomed Res Int       Date:  2015-10-04       Impact factor: 3.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.