Literature DB >> 25258621

Optimal SVM parameter selection for non-separable and unbalanced datasets.

Peng Jiang1, Samy Missoum1, Zhao Chen2.   

Abstract

This article presents a study of three validation metrics used for the selection of optimal parameters of a support vector machine (SVM) classifier in the case of non-separable and unbalanced datasets. This situation is often encountered when the data is obtained experimentally or clinically. The three metrics selected in this work are the area under the ROC curve (AUC), accuracy, and balanced accuracy. These validation metrics are tested using computational data only, which enables the creation of fully separable sets of data. This way, non-separable datasets, representative of a real-world problem, can be created by projection onto a lower dimensional sub-space. The knowledge of the separable dataset, unknown in real-world problems, provides a reference to compare the three validation metrics using a quantity referred to as the "weighted likelihood". As an application example, the study investigates a classification model for hip fracture prediction. The data is obtained from a parameterized finite element model of a femur. The performance of the various validation metrics is studied for several levels of separability, ratios of unbalance, and training set sizes.

Entities:  

Keywords:  Cross validation; Non-separable and unbalanced datasets; Support vector machines; Validation metrics

Year:  2014        PMID: 25258621      PMCID: PMC4170691          DOI: 10.1007/s00158-014-1105-z

Source DB:  PubMed          Journal:  Struct Multidiscipl Optim        ISSN: 1615-147X            Impact factor:   4.542


  8 in total

1.  Gaussian processes for classification: mean-field algorithms.

Authors:  M Opper; O Winther
Journal:  Neural Comput       Date:  2000-11       Impact factor: 2.026

2.  Bounds on error expectation for support vector machines.

Authors:  V Vapnik; O Chapelle
Journal:  Neural Comput       Date:  2000-09       Impact factor: 2.026

3.  Basic principles of ROC analysis.

Authors:  C E Metz
Journal:  Semin Nucl Med       Date:  1978-10       Impact factor: 4.446

4.  Accuracy of finite element predictions in sideways load configurations for the proximal human femur.

Authors:  L Grassi; E Schileo; F Taddei; L Zani; M Juszczyk; L Cristofolini; M Viceconti
Journal:  J Biomech       Date:  2011-11-12       Impact factor: 2.712

Review 5.  On the modelling bone tissue fracture and healing of the bone tissue.

Authors:  Manuel Doblaré; José Manuel García
Journal:  Acta Cient Venez       Date:  2003

6.  Comparison of the elastic and yield properties of human femoral trabecular and cortical bone tissue.

Authors:  Harun H Bayraktar; Elise F Morgan; Glen L Niebur; Grayson E Morris; Eric K Wong; Tony M Keaveny
Journal:  J Biomech       Date:  2004-01       Impact factor: 2.712

7.  Comparison of the predicted and observed secondary structure of T4 phage lysozyme.

Authors:  B W Matthews
Journal:  Biochim Biophys Acta       Date:  1975-10-20

8.  Patient-centered yes/no prognosis using learning machines.

Authors:  I R König; J D Malley; S Pajevic; C Weimar; H-C Diener; A Ziegler
Journal:  Int J Data Min Bioinform       Date:  2008       Impact factor: 0.667

  8 in total
  1 in total

1.  Fusion of clinical and stochastic finite element data for hip fracture risk prediction.

Authors:  Peng Jiang; Samy Missoum; Zhao Chen
Journal:  J Biomech       Date:  2015-10-09       Impact factor: 2.712

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.