| Literature DB >> 30768635 |
Maoxiang Chu1,2, Xiaoping Liu2, Rongfen Gong1, Jie Zhao3.
Abstract
This paper formulates a support vector machine with quantile hyper-spheres (QHSVM) for pattern classification. The idea of QHSVM is to build two quantile hyper-spheres with the same center for positive or negative training samples. Every quantile hyper-sphere is constructed by using pinball loss instead of hinge loss, which makes the new classification model be insensitive to noise, especially the feature noise around the decision boundary. Moreover, the robustness and generalization of QHSVM are strengthened through maximizing the margin between two quantile hyper-spheres, maximizing the inner-class clustering of samples and optimizing the independent quadratic programming for a target class. Besides that, this paper proposes a novel local center-based density estimation method. Based on it, ρ-QHSVM with surrounding and clustering samples is given. Under the premise of high accuracy, the execution speed of ρ-QHSVM can be adjusted. The experimental results in artificial, benchmark and strip steel surface defects datasets show that the QHSVM model has distinct advantages in accuracy and the ρ-QHSVM model is fit for large-scale datasets.Entities:
Mesh:
Year: 2019 PMID: 30768635 PMCID: PMC6377146 DOI: 10.1371/journal.pone.0212361
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The testing accuracies for linear classifiers on datasets with noise.
| SVM | Pin-SVM | TWSVM | THSVM | QHSVM | |
|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| 0% | 97.41 | 97.31 | 97.41 | ||
| 5% | 96.87 | 97.17 | 96.92 | 97.02 | |
| 10% | 96.18 | 96.57 | 96.18 | 96.13 | |
| 20% | 94.5 | 95.49 | 94.89 | 95.04 |
The testing accuracies for nonlinear classifiers on datasets with noise.
| SVM | Pin-SVM | TWSVM | THSVM | QHSVM | |
|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| 0% | 97.96 | 97.91 | 97.86 | 97.86 | |
| 5% | 97.46 | 97.71 | 97.46 | 97.56 | |
| 10% | 96.82 | 97.12 | 96.92 | 96.97 | |
| 20% | 95.29 | 96.03 | 95.49 | 95.73 |
The testing accuracies and execution time for six classifiers on different-scale datasets with noise.
| Classifiers | Performance | |||||
|---|---|---|---|---|---|---|
| SVM | — | Accuracy (%) | 96.87 | 96.52 | 97.13 | 97.03 |
| Time (s) | 0.729 | 36.41 | 174.4 | 344.6 | ||
| Pin-SVM | — | Accuracy (%) | 97.17 | 96.92 | 97.6 | 97.36 |
| Time (s) | 0.666 | 39.07 | 211 | 367.3 | ||
| TWSVM | — | Accuracy (%) | 96.67 | 96.6 | 97.06 | 97.07 |
| Time (s) | 0.235 | 13.08 | 57.23 | 113.6 | ||
| THSWM | — | Accuracy (%) | 96.87 | 96.74 | 97.26 | 97.19 |
| Time (s) | 0.292 | 14.1 | 56.08 | 127.7 | ||
| QHSVM | — | Accuracy (%) | 97.31 | 97.79 | 97.61 | |
| Time (s) | 1.226 | 62.74 | 334 | 719.8 | ||
| 0.7 | Accuracy (%) | 97.26 | 97.71 | 97.61 | ||
| Time (s) | 0.877 | 42.67 | 111.8 | 345.3 | ||
| 0.6 | Accuracy (%) | 96.97 | 97.31 | |||
| Time (s) | 0.548 | 33.27 | 93.06 | 212.1 | ||
| 0.5 | Accuracy (%) | 96.74 | 97.07 | 97.71 | 97.56 | |
| Time (s) | 0.25 | 14.47 | 60.57 | 122.9 | ||
| 0.4 | Accuracy (%) | 94.3 | 96.87 | 97.61 | 97.61 | |
| Time (s) | 0.2 | 11.35 | 42.59 | 92.9 | ||
| 0.3 | Accuracy (%) | 94.3 | 96.72 | 97.06 | 97.56 | |
| Time (s) | 0.102 | 5.837 | 17.42 | 52.72 |
The details of fifteen benchmark datasets.
| Datasets | Number of samples | Dimension | Datasets | Number of samples | Dimension |
|---|---|---|---|---|---|
| Heart | 270 | 13 | Wifi | 2000 | 6 |
| Ionosphere | 351 | 34 | Splice | 3190 | 60 |
| Breast | 569 | 30 | Wilt | 4889 | 5 |
| Thyroid | 215 | 5 | Musk | 6598 | 166 |
| Australian | 690 | 14 | Spambase | 4601 | 57 |
| WPBC | 198 | 34 | |||
| Pima | 768 | 8 | |||
| German | 1000 | 20 | |||
| Sonar | 208 | 60 | |||
| ILPD | 583 | 10 |
The testing accuracies of five classifiers on ten benchmark datasets.
| Datasets | SVM | Pin-SVM | TWSVM | THSVM | QHSVM |
|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| Heart ( | 81.58±3.91 | 81.30±4.65 | 80.85±5.59 | 81.30±4.14 | |
| 79.85±3.90 | 79.92±3.88 | 79.72±5.24 | 79.91±4.00 | ||
| 77.84±3.81 | 77.90±3.06 | 78.04±2.76 | |||
| Ionosphere ( | 91.30±4.10 | 91.44±3.26 | 89.94±3.97 | 91.68±4.02 | |
| 90.27±4.59 | 90.97±3.59 | 89.46±3.82 | 91.52±3.78 | ||
| 89.33±4.65 | 90.03±3.51 | 88.19±4.38 | 89.33±3.55 | ||
| Breast ( | 93.67±1.53 | 93.82±2.02 | 93.46±2.12 | 93.55±1.38 | |
| 93.06±2.11 | 93.05±1.72 | 92.96±2.76 | 92.73±2.80 | ||
| 92.61±2.20 | 92.61±2.71 | 92.47±2.88 | 93.00±2.35 | ||
| Thyroid ( | 93.43±3.79 | 93.58±2.43 | 93.68±2.79 | 94.45±2.89 | |
| 93.28±3.28 | 93.42±2.95 | 93.17±3.83 | 93.50±2.91 | ||
| 91.76±3.50 | 92.18±3.20 | 91.72±3.79 | 92.04±4.14 | ||
| Australian ( | 74.28±3.48 | 74.42±6.18 | 79.91±4.23 | 79.91±2.24 | |
| 79.63±4.75 | 79.63±3.64 | 79.68±2.47 | 79.36±4.27 | ||
| 79.56±3.06 | 78.08±3.77 | 78.25±3.36 | |||
| WPBC ( | 79.92±7.52 | 79.66±4.78 | 80.12±6.23 | 80.04±5.04 | |
| 78.86±7.21 | 79.52±4.43 | 79.37±5.36 | 79.74±5.36 | ||
| 77.70±6.69 | 77.96±4.29 | 77.80±5.49 | 77.77±5.13 | ||
| Pima ( | 73.55±4.56 | 72.80±4.59 | 71.41±4.98 | 72.80±4.47 | |
| 71.43±5.29 | 73.02±4.15 | 71.25±5.08 | 71.44±5.11 | ||
| 70.92±4.47 | 70.95±4.08 | 70.84±4.89 | 72.47±4.27 | ||
| German ( | 72.56±2.63 | 71.17±2.56 | 71.59±3.20 | 71.59±2.76 | |
| 71.08±3.69 | 71.08±2.83 | 71.02±3.53 | 71.20±2.75 | ||
| 70.89±3.27 | 71.02±2.37 | 70.65±3.85 | 70.90±3.08 | ||
| Sonar ( | 84.63±4.27 | 85.62±4.39 | 85.10±4.69 | 85.26±4.05 | |
| 81.66±4.02 | 81.85±4.47 | 82.12±3.96 | 83.27±4.52 | ||
| 80.24±3.78 | 80.69±5.15 | 79.95±3.67 | 80.73±4.09 | ||
| ILPD ( | 86.46±3.81 | 86.81±3.17 | 86.47±3.02 | 87.05±1.96 | |
| 86.20±3.30 | 86.91±2.75 | 86.47±3.67 | 85.94±2.83 | ||
| 86.12±3.16 | 85.99±3.35 | 85.94±3.18 | 86.12±2.94 |
The testing accuracies and execute time of five classifiers on five large-scale datasets.
| Datasets | SVM | Pin-SVM | TWSVM | THSVM | |
|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| Wifi ( | 74.47±4.30 | 74.60±3.32 | 74.72±4.44 | 74.93±4.24 | |
| 377.5 | 450.1 | 121.3 | 127 | 97.02 | |
| 73.59±4.97 | 74.38±4.55 | 73.73±5.08 | 74.04±5.25 | ||
| 372.6 | 465.5 | 130 | 128 | 94.96 | |
| 72.38±5.12 | 73.94±4.10 | 73.16±3.79 | 73.61±4.64 | ||
| 364.3 | 474.3 | 127.2 | 117.2 | 92.94 | |
| Splice ( | 86.92±2.09 | 87.34±2.09 | 87.29±2.48 | 87.29±2.28 | |
| 787.9 | 728.2 | 238.7 | 263.2 | 178.3 | |
| 86.21±2.99 | 86.44±2.51 | 86.48±3.26 | 86.08±3.65 | ||
| 803 | 749.2 | 257 | 267.4 | 186 | |
| 84.26±2.97 | 84.79±3.14 | 85.09±2.94 | 85.42±2.77 | ||
| 811.7 | 761.6 | 269 | 274.3 | 195.3 | |
| Wilt ( | 83.82±1.03 | 84.10±1.08 | 84.16±0.90 | 84.66±0.84 | |
| 818.2 | 904 | 267.3 | 250.7 | 206.2 | |
| 83.19±2.18 | 84.28±0.99 | 83.44±1.09 | 84.32±0.77 | ||
| 846 | 900.5 | 259.7 | 258.2 | 211.1 | |
| 83.60±1.41 | 84.32±0.95 | 83.09±1.00 | 83.58±1.00 | ||
| 787.4 | 887.1 | 264.6 | 255.3 | 207 | |
| Musk ( | 95.72±2.81 | 95.11±1.81 | 94.67±2.43 | 95.51±1.89 | |
| 2029 | 2194 | 751.7 | 755.7 | 520.9 | |
| 94.51±2.61 | 94.20±2.51 | 94.67±2.02 | 94.89±2.23 | ||
| 2081 | 2030 | 746.1 | 756.7 | 534.5 | |
| 93.94±1.78 | 93.82±2.38 | 94.06±1.95 | 94.03±1.95 | ||
| 2114 | 2111 | 700.6 | 761.6 | 548.4 | |
| Spambase ( | 88.76±3.03 | 87.03±3.09 | 88.24±5.11 | 88.78±3.51 | |
| 1082 | 1069 | 345.8 | 367.6 | 261.2 | |
| 87.30±3.65 | 87.77±3.04 | 86.21±3.59 | 86.47±4.57 | ||
| 1094 | 1080 | 354.3 | 373.6 | 255.5 | |
| 86.56±3.49 | 85.57±4.10 | 85.39±3.65 | 86.56±3.63 | ||
| 1062 | 1091 | 354.3 | 364.6 | 264.5 |
The accuracies of five classifiers for all types of defects.
| Defects | SVM | Pin-SVM | TWSVM | THSVM | |
|---|---|---|---|---|---|
| Patches | 91.88±2.93 | 92.87±2.33 | 91.94±2.26 | 92.30±2.05 | |
| Inclusion | 92.17±2.01 | 95.21±2.07 | 93.15±2.40 | 94.78±3.82 | |
| Scratches | 88.13±3.42 | 88.51±3.06 | 87.85±2.76 | 87.83±4.13 | |
| Scale | 84.23±2.53 | 88.46±2.20 | 85.97±2.93 | 86.54±3.11 |