| Literature DB >> 25893896 |
Abstract
Choosing an appropriate kernel is very important and critical when classifying a new problem with Support Vector Machine. So far, more attention has been paid on constructing new kernels and choosing suitable parameter values for a specific kernel function, but less on kernel selection. Furthermore, most of current kernel selection methods focus on seeking a best kernel with the highest classification accuracy via cross-validation, they are time consuming and ignore the differences among the number of support vectors and the CPU time of SVM with different kernels. Considering the tradeoff between classification success ratio and CPU time, there may be multiple kernel functions performing equally well on the same classification problem. Aiming to automatically select those appropriate kernel functions for a given data set, we propose a multi-label learning based kernel recommendation method built on the data characteristics. For each data set, the meta-knowledge data base is first created by extracting the feature vector of data characteristics and identifying the corresponding applicable kernel set. Then the kernel recommendation model is constructed on the generated meta-knowledge data base with the multi-label classification method. Finally, the appropriate kernel functions are recommended to a new data set by the recommendation model according to the characteristics of the new data set. Extensive experiments over 132 UCI benchmark data sets, with five different types of data set characteristics, eleven typical kernels (Linear, Polynomial, Radial Basis Function, Sigmoidal function, Laplace, Multiquadric, Rational Quadratic, Spherical, Spline, Wave and Circular), and five multi-label classification methods demonstrate that, compared with the existing kernel selection methods and the most widely used RBF kernel function, SVM with the kernel function recommended by our proposed method achieved the highest classification performance.Entities:
Mesh:
Year: 2015 PMID: 25893896 PMCID: PMC4403820 DOI: 10.1371/journal.pone.0120455
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Framework of the kernel recommendation method.
Data characteristic measures.
| Simple, statistical and information theoretical measures | Problem complexity measures | ||
|---|---|---|---|
| Notation | Measures | Notation | Measures |
| F | Number of features | BL | Length of class boundary |
| OL | Number of outliers | 1NN | Nonlinearity of 1NN classifier |
| I | Number of instances | Fisher | Maximum Fisher’s discriminant ratio |
| C | Number of class labels | NLP | Nonlinearity of linear classifier by LP |
| MV | Number of missing values | AP | Average number of points per dimension |
|
| Number of binary features | RS | Percentage of retained adherence subsets |
| MI | Number of missing instances | RNN | Ratio of average intra/inter class nearest neighbor distances |
|
| Number of nominal and numeric features | ||
|
| Percentage of majority and minority class | ||
| Prc | Prctile | ||
|
| Variance | Model-based measures | |
|
| Kurtosis | ||
|
| Skewness | Notation | Measures |
| TM | Trim mean | TW | width of tree |
| HM | Harmonic mean | TH | height of tree |
| GM | Geometric mean | Nodes | number of nodes |
| CG | Center of gravity | Leaves | number of leaves |
| IQR | Interquartile range | longBranch | length of longest branches |
|
| Standard deviation | shortBranch | length of shortest branches |
| CC | Canonical correlation | minAttr | minimum occurrence of attributes |
|
| Correlation coefficient | meanBranch | mean of the length of each branch |
|
| Mean absolute deviation | maxAttr | maximum occurrence of attributes |
| NSR | Noise-signal ratio | maxLevel | maximum number of nodes at one level |
|
| Entropy of classes | meanLevel | mean of the number of nodes at one level |
|
| Mean entropy of variables | devLevel | standard deviation of the number of nodes |
| ENV | Equivalent number of variables | devBranch | standard deviation of the length of each branch |
|
| Maximum and minimum eigenvalue | meanAttr | mean of the number of occurrences of attributes |
|
| Mean mutual entropy of class and variables | devAttr | standard deviation of the number of occurrences of attributes |
Algorithm 1. PerformanceEvaluation().
|
| |
| SVM—the default classifier; | |
|
| |
|
| |
| 1 |
|
| 2 |
|
| 3 | randomize the order of |
| 4 | generate |
| 5 |
|
| 6 | TestData = bins[ |
| 7 | TrainingData = |
| 8 |
|
| 9 | ( |
| 10 | (AUC[ |
| 11 | Time[ |
| 12 |
|
| 13 | ARR[ |
| 14 | return |
Algorithm 2. ApplicableKernelIdentification().
|
| |
|
| |
|
| |
| 1 | ARR = |
| 2 |
|
| 3 |
|
| 4 |
|
| 5 | // |
| 6 |
|
| 7 |
|
| 8 | // |
| 9 |
|
| 10 |
|
Algorithm 3. ModelConstruction().
|
| |
|
| |
|
| |
|
| |
| 1 |
|
| 2 |
|
| 3 |
|
| 4 |
|
| 5 | //for the purpose of determining a optimal kernel function |
| 6 | Test = jackknife( |
| 7 | Training = |
| 8 | feaSubset = featureSelection(Training, |
| 9 |
|
| 10 |
|
| 11 | recModel[ |
| 12 | recKernel[ |
| 13 | recPerformance = evaluate(recKernel, actKernel); |
| 14 |
|
Description of the 132 data sets.
| ID | Name | Attributes | Instances | Classes | Source |
|---|---|---|---|---|---|
| 1 | anneal | 31 | 898 | 5 | UCI |
| 2 | audiology | 69 | 226 | 24 | UCI |
| 3 | autos | 25 | 205 | 6 | UCI |
| 4 | balance-scale | 4 | 625 | 3 | UCI |
| 5 | breast-cancer | 9 | 286 | 2 | UCI |
| 6 | breast-w | 9 | 699 | 2 | UCI |
| 7 | bridges_version1 | 11 | 105 | 6 | UCI |
| 8 | bridges_version2 | 11 | 105 | 6 | UCI |
| 9 | car | 6 | 1728 | 4 | UCI |
| 10 | cmc | 9 | 1473 | 3 | UCI |
| 11 | colic | 22 | 368 | 2 | UCI |
| 12 | credit-a | 15 | 690 | 2 | UCI |
| 13 | credit-g | 20 | 1000 | 2 | UCI |
| 14 | dermatology | 34 | 366 | 6 | UCI |
| 15 | diabetes | 8 | 768 | 2 | UCI |
| 16 | ecoli | 7 | 336 | 8 | UCI |
| 17 | flags | 28 | 194 | 8 | UCI |
| 18 | glass | 9 | 214 | 6 | UCI |
| 19 | haberman | 3 | 306 | 2 | UCI |
| 20 | hayes-roth_test | 3 | 28 | 3 | UCI |
| 21 | hayes-roth_train | 4 | 132 | 3 | UCI |
| 22 | heart-c | 13 | 303 | 2 | UCI |
| 23 | heart-h | 12 | 294 | 2 | UCI |
| 24 | heart-statlog | 13 | 270 | 2 | UCI |
| 25 | hepatitis | 19 | 155 | 2 | UCI |
| 26 | ionosphere | 33 | 351 | 2 | UCI |
| 27 | iris | 4 | 150 | 3 | UCI |
| 28 | kdd_synthetic_control | 60 | 600 | 6 | UCI |
| 29 | labor | 16 | 57 | 2 | UCI |
| 30 | liver-disorders | 6 | 345 | 2 | UCI |
| 31 | lung-cancer | 56 | 32 | 2 | UCI |
| 32 | lymph | 18 | 148 | 4 | UCI |
| 33 | mfeat-fourier | 76 | 2000 | 10 | UCI |
| 34 | mfeat-karhunen | 64 | 2000 | 10 | UCI |
| 35 | mfeat-morphological | 6 | 2000 | 10 | UCI |
| 36 | mfeat-zernike | 47 | 2000 | 10 | UCI |
| 37 | molecular-biology_promoters | 57 | 106 | 4 | UCI |
| 38 | monks-problems-1_test | 6 | 432 | 2 | UCI |
| 39 | monks-problems-1_train | 6 | 124 | 2 | UCI |
| 40 | monks-problems-2_test | 6 | 432 | 2 | UCI |
| 41 | monks-problems-2_train | 6 | 169 | 2 | UCI |
| 42 | monks-problems-3_test | 6 | 432 | 2 | UCI |
| 43 | monks-problems-3_train | 6 | 122 | 2 | UCI |
| 44 | postoperative-patient-data | 8 | 90 | 3 | UCI |
| 45 | primary-tumor | 17 | 339 | 21 | UCI |
| 46 | segment | 18 | 2310 | 7 | UCI |
| 47 | shuttle-landing-control | 6 | 15 | 2 | UCI |
| 48 | solar-flare_1 | 12 | 323 | 2 | UCI |
| 49 | solar-flare_2 | 11 | 1066 | 3 | UCI |
| 50 | sonar | 60 | 208 | 2 | UCI |
| 51 | soybean | 35 | 683 | 19 | UCI |
| 52 | spect_test | 22 | 187 | 2 | UCI |
| 53 | spect_train | 22 | 80 | 2 | UCI |
| 54 | spectf_test | 44 | 269 | 2 | UCI |
| 55 | spectf_train | 44 | 80 | 2 | UCI |
| 56 | spectrometer | 101 | 531 | 48 | UCI |
| 57 | sponge | 44 | 76 | 3 | UCI |
| 58 | tae | 5 | 151 | 3 | UCI |
| 59 | tic-tac-toe | 9 | 958 | 2 | UCI |
| 60 | vehicle | 18 | 846 | 4 | UCI |
| 61 | vote | 16 | 435 | 2 | UCI |
| 62 | vowel | 13 | 990 | 11 | UCI |
| 63 | wine | 13 | 178 | 3 | UCI |
| 64 | zoo | 16 | 101 | 7 | UCI |
| 65 | anneal.ORIG | 18 | 898 | 5 | UCI |
| 66 | australian | 14 | 690 | 2 | UCI |
| 67 | hypothyroid | 27 | 3772 | 4 | UCI |
| 68 | kr-vs-kp | 36 | 3196 | 2 | UCI |
| 69 | landsat_test | 36 | 2000 | 6 | UCI |
| 70 | landsat_train | 36 | 4435 | 6 | UCI |
| 71 | mfeat-factors | 216 | 2000 | 10 | UCI |
| 72 | mfeat-pixel | 240 | 2000 | 10 | UCI |
| 73 | mushroom | 21 | 8124 | 2 | UCI |
| 74 | nursery | 8 | 12960 | 5 | UCI |
| 75 | optdigits | 62 | 5620 | 10 | UCI |
| 76 | page-blocks | 10 | 5473 | 5 | UCI |
| 77 | pendigits | 16 | 10992 | 10 | UCI |
| 78 | sick | 27 | 3772 | 2 | UCI |
| 79 | spambase | 57 | 4601 | 2 | UCI |
| 80 | splice | 60 | 3190 | 3 | UCI |
| 81 | waveform-5000 | 40 | 5000 | 3 | UCI |
| 82 | ar3 | 29 | 63 | 2 | Promise |
| 83 | ar5 | 29 | 36 | 2 | Promise |
| 84 | usp05-ft | 14 | 72 | 6 | Promise |
| 85 | ar1 | 29 | 121 | 2 | Promise |
| 86 | ar4 | 29 | 107 | 2 | Promise |
| 87 | ar6 | 29 | 101 | 2 | Promise |
| 88 | cm1_req | 8 | 89 | 2 | Promise |
| 89 | jEdit_4.0_4.2 | 8 | 274 | 2 | Promise |
| 90 | jEdit_4.2_4.3 | 8 | 369 | 2 | Promise |
| 91 | jm1 | 21 | 10885 | 2 | Promise |
| 92 | kc1 | 21 | 2109 | 2 | Promise |
| 93 | kc1-class-level-binary | 86 | 145 | 2 | Promise |
| 94 | kc1-class-level-top5 | 86 | 145 | 2 | Promise |
| 95 | kc2 | 21 | 522 | 2 | Promise |
| 96 | kc3 | 39 | 458 | 2 | Promise |
| 97 | mc1 | 38 | 9466 | 2 | Promise |
| 98 | mc2 | 39 | 161 | 2 | Promise |
| 99 | mozilla4 | 5 | 15545 | 2 | Promise |
| 100 | mw1 | 37 | 403 | 2 | Promise |
| 101 | pc1 | 21 | 1109 | 2 | Promise |
| 102 | pc2 | 36 | 5589 | 2 | Promise |
| 103 | pc3 | 37 | 1563 | 2 | Promise |
| 104 | pc4 | 37 | 1458 | 2 | Promise |
| 105 | tae_trainPublic | 5 | 76 | 3 | Examples |
| 106 | Balance | 3 | 17 | 2 | DASL |
| 107 | Brainsize | 6 | 40 | 2 | DASL |
| 108 | Calories | 2 | 40 | 3 | DASL |
| 109 | Cars | 6 | 38 | 6 | DASL |
| 110 | Eggs | 3 | 48 | 2 | DASL |
| 111 | Fiber | 4 | 48 | 4 | DASL |
| 112 | FleaBeetles | 2 | 74 | 3 | DASL |
| 113 | Fridaythe13th | 5 | 61 | 12 | DASL |
| 114 | Hotdogs | 2 | 54 | 3 | DASL |
| 115 | LarynxCancer | 1 | 41 | 2 | DASL |
| 116 | PopularKids | 10 | 478 | 2 | DASL |
| 117 | Pottery | 5 | 26 | 4 | DASL |
| 118 | Companies | 6 | 79 | 9 | DASL |
| 119 | Michelson | 1 | 100 | 5 | DASL |
| 120 | db1-bf | 6 | 63 | 5 | Amirms |
| 121 | eucalyptus | 19 | 736 | 5 | Agricultural |
| 122 | grub-damage | 8 | 155 | 4 | Agricultural |
| 123 | pasture | 21 | 36 | 3 | Agricultural |
| 124 | squash-stored | 24 | 52 | 3 | Agricultural |
| 125 | squash-unstored | 23 | 52 | 3 | Agricultural |
| 126 | white-clover | 31 | 63 | 4 | Agricultural |
| 127 | ada_agnostic | 47 | 4562 | 2 | Agnostic-vs-Prior |
| 128 | ada_agnostic_train | 47 | 4147 | 2 | Agnostic-vs-Prior |
| 129 | ada_agnostic_valid | 44 | 415 | 2 | Agnostic-vs-Prior |
| 130 | ada_prior | 14 | 4562 | 2 | Agnostic-vs-Prior |
| 131 | ada_prior_train | 14 | 4147 | 2 | Agnostic-vs-Prior |
| 132 | ada_prior_valid | 14 | 415 | 2 | Agnostic-vs-Prior |
Fig 2Comparison of different recommendation methods in terms of HR.
Fig 3Comparison of different recommendation methods in terms of Precision.
Fig 4Comparison of different recommendation methods in terms of overall classification performance (ARR).
Fig 5The classification performance (ARR) of SVM with the real best kernel vs. with the recommended kernels.
Statistical test results of performance differences between our method and other methods.
| Performance metrics | Data Characteristics | Multi-label vs. Single-label | Multi-label vs AliKSM | ||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||
| Hit Rate | Statistics measures | 1.16E-13 | 3.13E-07 | 9.60E-10 | 2.37E-19 | 2.04E-12 | 4.53E-12 |
| Complexity measures | 1.18E-11 | 2.36E-10 | 1.20E-08 | 3.10E-14 | 5.08E-14 | 7.72E-15 | |
| LandMarking measures | 2.64E-14 | 2.70E-13 | 7.78E-05 | 2.88E-19 | 4.56E-19 | 2.22E-07 | |
| Model-based measures | 2.36E-10 | 4.47E-12 | 1.80E-12 | 7.57E-13 | 5.72E-18 | 7.72E-15 | |
| Structure measures | 4.07E-19 | 1.16E-15 | 2.26E-15 | 4.65E-27 | 4.52E-22 | 6.67E-24 | |
| Precision | Statistics measures | 1.16E-13 | 3.13E-07 | 9.60E-10 | 2.37E-19 | 2.04E-12 | 4.53E-12 |
| Complexity measures | 1.18E-11 | 2.36E-10 | 1.20E-08 | 3.10E-14 | 5.08E-14 | 7.72E-15 | |
| LandMarking measures | 2.64E-14 | 2.70E-13 | 7.78E-05 | 2.88E-19 | 4.56E-19 | 2.22E-07 | |
| Model-based measures | 2.36E-10 | 4.47E-12 | 1.80E-12 | 7.57E-13 | 5.72E-18 | 7.72E-15 | |
| Structure measures | 4.07E-19 | 1.16E-15 | 2.26E-15 | 4.65E-27 | 4.52E-22 | 6.67E-24 | |
| ARR | Statistics measures | 2.63E-05 | 1.22E-01 | 5.75E-02 | 3.83E-08 | 1.74E-03 | 1.05E-02 |
| Complexity measures | 1.46E-03 | 7.32E-03 | 6.53E-02 | 1.18E-04 | 2.14E-04 | 2.79E-03 | |
| LandMarking measures | 2.66E-04 | 5.24E-05 | 9.18E-01 | 1.57E-06 | 5.77E-07 | 6.66E-01 | |
| Model-based measures | 5.87E-03 | 4.02E-03 | 6.39E-03 | 5.05E-04 | 2.95E-05 | 9.61E-04 | |
| Structure measures | 3.41E-08 | 3.08E-06 | 1.14E-05 | 1.48E-12 | 1.74E-10 | 5.95E-10 | |
| Performance metrics | Data Characteristics | Multi-label vs. MKL-Poly | Multi-label vs MKL-RBF | ||||
|
|
|
|
|
|
| ||
| ARR | Statistics measures | 5.14E-01 | 5.00E-01 | 2.38E-01 | 1.10E-05 | 1.65E-02 | 3.99E-05 |
| Complexity measures | 5.82E-01 | 8.61E-01 | 9.03E-01 | 1.98E-04 | 2.19E-03 | 5.43E-03 | |
| LandMarking measures | 1.47E-01 | 4.03E-01 | 7.60E-02 | 2.99E-06 | 5.84E-05 | 4.13E-01 | |
| Model-based measures | 9.60E-01 | 6.55E-01 | 9.60E-01 | 4.58E-03 | 1.52E-02 | 2.90E-03 | |
| Structure measures | 2.30E-02 | 1.88E-02 | 7.07E-04 | 4.52E-09 | 3.78E-09 | 4.13E-11 | |
* There is no significant difference between the performance of both methods.
Fig 6Comparison of different multi-label classification methods in terms of HR, Precision and ARR.
Fig 7Comparison of different feature selection methods in terms of HR, Precision and ARR.