| Literature DB >> 26167527 |
Zhi-Xia Yang1, Yuan-Hai Shao2, Yao-Lin Jiang3.
Abstract
A novel learning framework of nonparallel hyperplanes support vector machines (NPSVMs) is proposed for binary classification and multiclass classification. This framework not only includes twin SVM (TWSVM) and its many deformation versions but also extends them into multiclass classification problem when different parameters or loss functions are chosen. Concretely, we discuss the linear and nonlinear cases of the framework, in which we select the hinge loss function as example. Moreover, we also give the primal problems of several extension versions of TWSVM's deformation versions. It is worth mentioning that, in the decision function, the Euclidean distance is replaced by the absolute value |w (T) x + b|, which keeps the consistency between the decision function and the optimization problem and reduces the computational cost particularly when the kernel function is introduced. The numerical experiments on several artificial and benchmark datasets indicate that our framework is not only fast but also shows good generalization.Entities:
Year: 2015 PMID: 26167527 PMCID: PMC4488010 DOI: 10.1155/2015/497617
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Results of linear SVM, GEPSVM, TWSVM, and BNPSVM on Example 1 dataset.
Tenfold testing percentage test set accuracy (%) on example data sets.
| Data set | SVM | GEPSVM | TWSVM | BHNPSVM |
|---|---|---|---|---|
| Accuracy % | Accuracy % | Accuracy % | Accuracy % | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
| Example 1 | 70.90 | 95.45 | 98.18 |
|
| (202 × 2) | 0.122 | 0.0005 | 0.0064 | 0.0052 |
|
| ||||
| Data set | “1-v-1” | “1-v-r” | MBSVM | MHNPSVM |
| Accuracy % | Accuracy % | Accuracy % | Accuracy % | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
|
| ||||
| Example 2 | 87.33 | 86.67 | 89.33 |
|
| (330 × 2) | 0.098 | 0.0006 | 0.0079 | 0.0095 |
Figure 2Results of linear “1-v-1,” “1-v-r,” MBSVM, and MHNPSVM on Example 2 dataset.
The detailed characteristics of the datasets.
| Data | #Ins | #Fea | #class | Data | #Ins | #Fea | #class |
|---|---|---|---|---|---|---|---|
| Hepatitis | 155 | 19 | 2 | Votes | 435 | 16 | 2 |
| WBPC | 198 | 34 | 2 | Sonar | 208 | 60 | 2 |
| Heart-statlog | 270 | 13 | 2 | BUPA | 345 | 6 | 2 |
| Pima-Indian | 768 | 8 | 2 | CMC | 1473 | 9 | 2 |
| Australian | 690 | 14 | 2 | Iris | 150 | 3 | 4 |
| Wine | 178 | 3 | 13 | Ecoli | 336 | 8 | 8 |
| Vowel | 528 | 11 | 10 | Glass | 214 | 6 | 13 |
| Vehicle | 846 | 4 | 18 | Car | 1728 | 6 | 4 |
| Segment | 2310 | 7 | 19 | Satimage | 4435 | 6 | 36 |
#Ins is the number of the training points; #attributes is the number of attributes; #class is the number of class.
Tenfold testing percentage test set accuracy (%) on binary-class UCI data sets for linear classifiers.
| Data sets | TWSVM | SVM | GEPSVM | BHNPSVM |
|---|---|---|---|---|
| Accuracy % | Accuracy % | Accuracy % | Accuracy % | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
| Hepatitis | 82.89 ± 6.30* | 84.13 ± 5.58 | 80.07 ± 5.43 |
|
| 0.012 | 0.012 | 0.0006 | 0.0304 | |
| BUPA liver | 66.40 ± 7.74* | 67.78 ± 5.51 | 61.33 ± 6.26 |
|
| 0.840 | 0.0549 | 0.0012 | 0.2143 | |
| Heart-statlog |
| 83.12 ± 5.41 | 75.37 ± 7.02 |
|
| 0.023 | 0.0281 | 0.0022 | 0.1092 | |
| Votes | 95.85 ± 2.75 | 95.80 ± 2.65 | 91.93 ± 3.18 |
|
| 0.797 | 1.1446 | 0.0039 | 0.1027 | |
| WPBC |
| 83.30 ± 4.53 | 76.76 ± 6.67 | 81.32 ± 1.36* |
| 0.012 | 0.0432 | 0.0002 | 0.0465 | |
| Sonar | 77.00 ± 6.10 |
| 73.16 ± 8.33 | 74.15 ± 1.73 |
| 0.007 | 0.0946 | 0.0225 | 0.007 | |
| Australian | 85.94 ± 5.84 |
| 80.00 ± 3.99 | 85.27 ± 3.26 |
| 0.3460 | 0.2350 | 0.0029 | 0.4250 | |
| Pima-Indian | 73.80 ± 4.97* |
| 75.47 ± 4.64 | 77.05 ± 0.48* |
| 0.121 | 0.261 | 0.0016 | 0.4793 | |
| CMC | 68.28 ± 2.21* | 67.82 ± 2.63 | 66.76 ± 2.98 |
|
| 1.247 | 0.597 | 0.0050 | 1.197 | |
|
| ||||
| Mean accuracy | 79.81 | 80.88 | 75.65 |
|
| Mean time | 0.38 | 0.27 | 0.004 | 0.29 |
A greater difference between BHNPSVM and TWSVM.
Tenfold testing percentage test set accuracy (%) on binary-class UCI datasets for nonlinear classifiers.
| Datasets | TWSVM | SVM | GEPSVM | BHNPSVM |
|---|---|---|---|---|
| Accuracy % | Accuracy % | Accuracy % | Accuracy % | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
| Hepatitis | 83.39 ± 7.31 |
| 80.00 ± 5.2 | 83.40 ± 3.58 |
| 0.016 | 0.0142 | 0.0035 | 0.0697 | |
| BUPA liver | 67.83 ± 6.49* | 68.32 ± 7.20 | 63.01 ± 7.46 |
|
| 0.033 | 0.0129 | 1.305 | 0.1522 | |
| Heart-statlog | 82.96 ± 4.67* | 83.33 ± 9.11 |
| 84.04 ± 4.56* |
| 0.029 | 0.0250 | 0.438 | 0.1120 | |
| Votes | 94.91 ± 4.37 |
| 94.5 ± 3.37 | 95.21 ± 5.18 |
| 0.072 | 0.0495 | 0.087 | 0.0152 | |
| WPBC |
| 80.18 ± 6.90 | 80.07 ± 5.97 | 80.89 ± 1.17 |
| 0.029 | 0.0148 | 0.0043 | 0.0468 | |
| Sonar |
| 88.93 ± 10.43 | 81.93 ± 4.41 | 88.05 ± 1.79 |
| 0.014 | 0.0781 | 0.020 | 0.2896 | |
| Australian | 75.8 ± 4.91* |
| 69.55 ± 5.37 | 77.58 ± 2.53* |
| 0.420 | 0.0425 | 0.334 | 0.497 | |
| Pima-Indian | 73.74 ± 5.2* | 76.09 ± 3.58 | 74.66 ± 5.00 |
|
| 0.427 | 0.442 | 15.892 | 0.381 | |
| CMC | 73.95 ± 3.48* | 68.98 ± 3.44 | 68.67 ± 3.84 |
|
| 1.708 | 1.755 | 1.042 | 1.920 | |
|
| ||||
| Mean accuracy | 80.39 | 81.23 | 77.66 |
|
| Mean time | 0.3053 | 0.27 | 2.1251 | 0.3871 |
A greater difference between BHNPSVM and TWSVM.
Tenfold testing percentage test set accuracy (%) on multiclass UCI datasets for linear classifiers.
| Dataset | 1-v-1 | 1-v-r | MBSVM | MHNPSVM |
|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
| Iris | 96.83 ± 1.75 | 95.73 ± 3.78 | 95.00 ± 4.95 |
|
| 0.025 | 0.014 | 0.009 | 0.010 | |
| Wine | 96.59 ± 1.48 |
| 94.77 ± 4.07 | 95.88 ± 2.21 |
| 0.058 | 0.021 | 0.028 | 0.023 | |
| Ecoli |
| 86.77 ± 0.87 | 85.72 ± 1.02 | 86.78 ± 0.75 |
| 0.863 | 0.522 | 0.097 | 0.089 | |
| Vowel | 54.21 ± 2.24 | 57.44 ± 3.26 | 59.42 ± 4.96* |
|
| 1.459 | 0.580 | 0.160 | 0.172 | |
| Glass | 94.16 ± 1.84 | 94.42 ± 4.06 | 92.80 ± 9.80* |
|
| 1.037 | 0.405 | 0.183 | 0.105 | |
| Vehicle | 77.79 ± 2.21 |
| 77.59 ± 2.16 | 77.13 ± 1.87 |
| 28.11 | 10.05 | 2.96 | 2.58 | |
| Car | 86.78 ± 0.50 | 86.72 ± 0.31 | 84.09 ± 0.33* |
|
| 16.042 | 13.79 | 5.92 | 6.05 | |
| Segment | 91.60 ± 2.428 |
| 92.68 ± 1.87 | 93.04 ± 2.01 |
| 28.078 | 15.26 | 17.04 | 17.55 | |
| Satimage | 91.80 ± 0.81 | 90.20 ± 1.13 |
| 91.40 ± 1.49 |
| 60.50 | 32.29 | 47.45 | 45.27 | |
|
| ||||
| Mean accuracy | 86.38 | 86.64 | 86.05 |
|
| Mean time | 15.13 | 8.10 | 8.21 | 7.98 |
A greater difference between MHNPSVM and MBSVM.
Tenfold testing percentage test set accuracy (%) on multiclass UCI datasets for nonlinear classifiers.
| Dataset | 1-v-1 | 1-v-r | MBSVM | MHNPSVM |
|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| Time (s) | Time (s) | Time (s) | Time (s) | |
| Iris |
| 97.63 ± 5.46 | 98.12 ± 2.08 | 98.74 ± 1.92 |
| 0.0054 | 0.0264 | 0.037 | 0.030 | |
| Wine | 97.08 ± 3.32 |
| 96.45 ± 1.29 | 97.28 ± 0.96 |
| 7.294 | 4.6504 | 0.592 | 0.523 | |
| Ecoli | 92.27 ± 1.03 | 90.35 ± 0.47 | 91.06 ± 1.45* |
|
| 0.382 | 0.0843 | 0.154 | 0.182 | |
| Glass | 98.09 ± 1.04 | 99.14 ± 0.97 | 98.76 ± 1.22 |
|
| 0.692 | 0.1085 | 0.089 | 0.092 | |
| Vowel | 91.37 ± 0.86 |
| 80.42 ± 4.37* | 85.86 ± 4.72* |
| 1.482 | 0.3844 | 0.623 | 0.593 | |
| Vehicle | 81.03 ± 5.73 | 82.49 ± 4.26 | 82.01 ± 1.33 |
|
| 19.562 | 11.456 | 2.81 | 2.50 | |
| Car |
| 87.36 ± 0.68 | 85.74 ± 0.33 | 86.57 ± 0.46 |
| 3.6571 | 0.9405 | 1.832 | 1.944 | |
| Segment | 95.15 ± 6.02 | 94.65 ± 4.38 |
| 95.90 ± 3.29 |
| 128.42 | 91.69 | 53.27 | 49.58 | |
| Satimage | 93.80 ± 1.46 | 93.05 ± 1.46 | 94.03 ± 1.93 |
|
| 190.27 | 132.47 | 89.05 | 88.36 | |
|
| ||||
| Mean accuracy | 92.90 |
| 91.39 | 92.73 |
| Mean time | 39.08 | 26.87 | 16.50 | 15.98 |
A greater difference between MHNPSVM and MBSVM.