| Literature DB >> 27722068 |
Qiuling Hou1, Meng Lv1, Ling Zhen1, Ling Jing1.
Abstract
Although support vector machine (SVM) has become a powerful tool for pattern classification and regression, a major disadvantage is it fails to exploit the underlying correlation between any pair of data points as much as possible. Inspired by the modified pairwise constraints trick, in this paper, we propose a novel classifier termed as support vector machine with hypergraph-based pairwise constraints to improve the performance of the classical SVM by introducing a new regularization term with hypergraph-based pairwise constraints (HPC). The new classifier is expected to not only learn the structural information of each point itself, but also acquire the prior distribution knowledge about each constrained pair by combining the discrimination metric and hypergraph learning together. Three major contributions of this paper can be summarized as follows: (1) acquiring the high-order relationships between different samples by hypergraph learning; (2) presenting a more reasonable discriminative regularization term by combining the discrimination metric and hypergraph learning; (3) improving the performance of the existing SVM classifier by introducing HPC regularization term. And the comprehensive experimental results on twenty-five datasets demonstrate the validity and advantage of our approach.Entities:
Keywords: Discrimination metric; Hypergraph learning; Modified pairwise constraints; Support vector machine
Year: 2016 PMID: 27722068 PMCID: PMC5035294 DOI: 10.1186/s40064-016-3315-x
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1Simple graph versus hypergraph. Left: an article set and an author set . The element is set to 1 if is an author of article , and 0 otherwise. Middle: an undirected simple graph in which two articles are joined together by an edge if there is at least one author in common. While, this graph cannot tell us whether the same person is the author of three or more articles or not. Right: a hypergraph which could completely illustrate the high-order relationships among authors and articles
The objects and motivations of the comparison
| Situation | Objects | Motivations |
|---|---|---|
| 1 | SRSVM versus SVM | To demonstrate the prior structural information within classes in the data is effective for classification |
| 2 | MPCSVM versus SVM | To demonstrate the discriminative information about each constrained pair in data is effective for classification |
| 3 | MPCSVM versus SRSVM | To display the discriminative information is more effective than the structures in data within classes for classification |
| 4 | HPCSVM versus MPCSVM | To demonstrate our newly-designed HPC regularization term is more reasonable than the MPC regularization term |
| 5 | HPCSVM versus LSSVM | To display our proposed HPCSVM is also better than SVM’s variant |
Test accuracy on UCI datasets for linear classifiers
| Datasets | SVM | LSSVM | SRSVM | MPCSVM | HPCSVM | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) | |
| Hepatitis (155 × 19) | 83.30 ± 6.64 | 0.06 | 0.6983 | 83.99 ± 8.98 | 0.02 | 0.8315 | 84.99 ± 8.25 | 0.07 | 0.5145 | 84.61 ± 7.11 | 0.09 | 0.9033 |
| 0.06 |
| Heartstatlog (270 × 13) | 80.00 ± 5.16 | 0.16 | 0.2433 | 84.07 ± 6.04 | 0.03 | 0.8072 | 84.44 ± 5.05 | 0.21 | 0.8825 |
| 0.23 |
| 0.17 | |
| Teaching (151 × 5) | 68.85 ± 2.80 | 0.06 | 0.2216 | 72.54 ± 5.36 | 0.03 | 0.8757 | 72.94 ± 5.36 | 0.07 | 0.8238 | 72.85 ± 5.31 | 0.07 | 0.8596 |
| 0.05 |
| Haberman (306 × 3) | 73.53 ± 0.49 | 0.25 | 0.4298 | 74.50 ± 2.57 | 0.04 | 0.7847 |
| 0.28 | 0.8765 | 75.47 ± 3.55 | 0.26 | 0.9067 | 75.15 ± 3.86 | 0.17 |
| Breast (683 × 9) | 96.04 ± 3.68 | 1.87 | 0.5185 | 96.05 ± 1.95 | 0.22 | 0.2999 | 97.19 ± 1.42 | 2.18 | 0.9124 | 97.21 ± 1.88 | 2.29 | 0.9009 |
| 1.59 |
| BUPA (345 × 6) | 67.53 ± 3.73 | 0.30 | 0.2073 | 68.70 ± 6.19 | 0.05 | 0.5306 | 69.99 ± 4.55 | 0.32 | 0.7536 |
| 0.34 | 1 |
| 0.24 |
| Diabetes (768 × 8) | 71.48 ± 2.65 | 2.54 | 0.0435 | 77.61 ± 4.65 | 0.34 | 0.8534 | 75.61 ± 4.65 | 2.28 | 0.5216 | 78.01 ± 3.87 | 2.40 | 0.9365 |
| 1.86 |
| Seeds (210 × 7) | 95.24 ± 6.10 | 0.10 | 0.0037 | 97.14 ± 2.78 | 0.03 | 0.5796 |
| 0.12 | 0.8215 | 98.10 ± 1.78 | 0.14 | 1 | 98.10 ± 1.78 | 0.10 |
| Sonar (208 × 60) | 72.16 ± 9.52 | 0.10 | 0.3209 | 77.34 ± 8.47 | 0.02 | 0.8122 | 78.34 ± 7.73 | 0.08 | 0.9354 |
| 0.16 | 0.8664 | 78.78 ± 8.12 | 0.10 |
| Parkinsons (195 × 22) | 84.09 ± 1.92 | 0.08 | 0.0191 | 88.23 ± 1.03 | 0.02 | 0.5307 | 88.53 ± 1.03 | 0.14 | 0.6346 | 88.27 ± 3.69 | 0.13 | 0.6874 |
| 0.09 |
| Spect (267 × 44) | 79.78 ± 0.70 | 0.25 | 0.0687 | 79.40 ± 0.19 | 0.04 | 0.0438 | 82.40 ± 0.19 | 0.28 | 0.8765 |
| 0.27 | 0.7601 | 83.14 ± 3.13 | 0.18 |
| Ionosphere (351 × 33) | 87.75 ± 2.33 | 0.45 | 0.4902 | 86.61 ± 1.16 | 0.07 | 0.0959 | 88.86 ± 1.41 | 0.50 | 0.9645 |
| 0.40 | 0.4913 | 88.89 ± 2.12 | 0.28 |
| Heartcancer (303 × 14) | 96.33 ± 7.33 | 0.16 | 0.3466 | 92.09 ± 1.89 | 0.03 | 0.0000 | 98.09 ± 1.78 | 0.25 | 0.5024 |
| 0.29 | NaN |
| 0.21 |
| Heart_diseas e(294 × 13) | 72.79 ± 6.84 | 0.27 | 0.0326 | 82.33 ± 4.19 | 0.04 | 0.6854 | 83.00 ± 3.82 | 0.30 | 0.8456 | 83.35 ± 5.04 | 0.26 | 0.9258 |
| 0.19 |
| Fertility (100 × 9) | 86.29 ± 4.93 | 0.02 | 0.3105 | 88.08 ± 1.94 | 0.01 | 0.6001 | 90.08 ± 1.98 | 0.04 | 0.6570 |
| 0.04 | 0.6982 | 89.13 ± 3.34 | 0.03 |
| Ech_diogram (131 × 10) | 87.00 ± 7.04 | 0.05 | 0.5075 | 88.39 ± 6.69 | 0.03 | 0.7137 | 89.39 ± 5.96 | 0.07 | 0.9035 |
| 0.06 | 1 |
| 0.05 |
| Balancescale (576 × 4) | 95.49 ± 1.77 | 0.89 | 0.4746 | 95.32 ± 2.08 | 0.17 | 0.4419 |
| 1.15 | 0.9654 | 96.36 ± 0.83 | 1.20 | 0.9986 | 96.36 ± 1.49 | 0.83 |
| WPBC (198 × 34) | 78.27 ± 5.00 | 0.10 | 0.2938 | 80.36 ± 4.09 | 0.03 | 0.5125 | 82.36 ± 4.67 | 0.11 | 0.9941 | 82.37 ± 4.34 | 0.14 | 0.9971 |
| 0.10 |
| WDBC (569 × 30) | 93.51 ± 5.13 | 0.87 | 0.0000 | 96.13 ± 1.89 | 0.18 | 0.0994 |
| 1.13 | 0.9235 | 98.07 ± 0.86 | 1.32 | 1 | 98.07 ± 0.86 | 0.93 |
| Vertebral (310 × 6) | 85.48 ± 2.89 | 0.25 | 0.4520 | 85.16 ± 4.26 | 0.04 | 0.4731 | 86.12 ± 4.52 | 0.30 | 0.7538 |
| 0.28 | 1 |
| 0.21 |
| Australian (690 × 14) | 85.51 ± 1.62 | 2.49 | 0.2854 | 86.09 ± 1.12 | 0.20 | 0.4450 | 86.87 ± 2.15 | 2.25 | 0.8856 | 86.96 ± 1.87 | 1.89 | 0.9218 |
| 1.55 |
| BTSC (748 × 4) | 75.55 ± 2.52 | 2.51 | 0.0000 | 77.27 ± 0.68 | 0.32 | 0.1793 | 77.54 ± 1.03 | 2.28 | 0.2058 | 79.28 ± 2.92 | 2.08 | 0.8968 |
| 1.67 |
| Tic_tac_toe (958 × 27) |
| 2.20 | 1 |
| 0.26 | 1 |
| 3.25 | 1 |
| 5.79 | 1 |
| 3.00 |
| German (1000 × 24) | 74.90 ± 2.22 | 4.27 | 0.1839 | 76.80 ± 2.46 | 0.28 | 0.6755 | 77.10 ± 2.62 | 6.26 | 0.7640 | 77.50 ± 1.38 | 5.21 | 0.8997 |
| 4.08 |
| CMC (1473 × 9) | 77.39 ± 0.13 | 11.80 | 0.0936 | 77.46 ± 0.52 | 1.64 | 0.1467 |
| 14.03 | 0.9023 | 78.07 ± 0.96 | 12.36 | 0.6478 | 78.41 ± 1.06 | 9.06 |
Test accuracy on UCI datasets for nonlinear classifiers
| Datasets | SVM | LSSVM | SRSVM | MPCSVM | HPCSVM | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) |
| Accuracy | Time (s) | |
| Hepatitis (155 × 19) | 82.01 ± 9.43 | 0.10 | 0.7550 | 83.34 ± 8.36 | 0.08 | 0.9095 | 83.78 ± 8.12 | 0.12 | 0.9643 | 84.01 ± 7.98 | 0.16 | 0.9971 |
| 0.14 |
| Heartstatlog (270 × 13) | 82.96 ± 2.46 | 0.27 | 0.5264 | 83.70 ± 6.13 | 0.21 | 0.7863 | 84.70 ± 5.24 | 0.33 | 0.9654 | 84.44 ± 5.32 | 0.41 | 0.9218 |
| 0.36 |
| Teaching (151 × 5) | 82.14 ± 3.19 | 0.10 | 0.7812 | 82.14 ± 3.19 | 0.07 | 0.7812 |
| 0.11 | 0.8574 | 82.83 ± 3.59 | 0.13 | 82.83 ± 3.59 | 0.12 | |
| Haberman (306 × 3) | 73.53 ± 0.48 | 0.35 | 0.1056 | 74.51 ± 3.03 | 0.26 | 0.5231 | 75.32 ± 2.65 | 0.46 | 0.8032 | 75.48 ± 2.98 | 0.50 | 0.8693 |
| 0.44 |
| Breast (683 × 9) | 95.60 ± 4.54 | 2.20 | 0.4155 | 97.22 ± 1.50 | 1.34 | 0.6948 | 97.62 ± 1.24 | 3.26 | 0.9544 | 97.51 ± 1.36 | 3.51 | 0.8913 |
| 3.01 |
| BUPA (345 × 6) | 71.01 ± 3.30 | 0.46 | 0.0602 | 73.33 ± 2.67 | 0.42 | 0.2547 | 74.43 ± 2.49 | 0.48 | 0.5642 | 75.36 ± 2.75 | 0.68 | 0.8832 |
| 0.65 |
| Diabetes (768 × 8) | 72.39 ± 1.86 | 2.57 | 0.0384 | 77.48 ± 4.65 | 1.68 | 0.7826 | 76.48 ± 4.26 | 2.86 | 0.5145 | 78.01 ± 5.01 | 3.89 | 0.9104 |
| 3.44 |
| Seeds (210 × 7) | 95.95 ± 3.10 | 0.17 | 0.1614 | 96.19 ± 2.43 | 0.13 | 0.7599 |
| 0.20 | 0.5462 | 96.19 ± 1.17 | 0.25 | 0.6666 | 96.01 ± 1.78 | 0.23 |
| Sonar (208 × 60) | 88.47 ± 6.96 | 0.17 | 1 | 88.47 ± 6.96 | 0.13 | 1 |
| 0.21 | 0.7459 | 88.47 ± 6.96 | 0.28 | 88.47 ± 6.96 | 0.23 | |
| Parkinsons (195 × 22) | 87.21 ± 2.68 | 0.14 | 0.0016 | 93.87 ± 2.01 | 0.12 | 0.2447 | 94.28 ± 2.26 | 0.18 | 0.5568 |
| 0.24 | 0.9888 | 95.92 ± 2.58 | 0.19 |
| Spect (267 × 44) | 79.79 ± 4.06 | 0.27 | 0.2639 | 82.40 ± 4.94 | 0.22 | 0.7521 | 83.40 ± 4.72 | 0.35 | 0.9325 |
| 0.45 | 0.9181 | 83.51 ± 4.67 | 0.42 |
| Ionosphere (351 × 33) | 94.31 ± 2.54 | 0.50 | 0.2920 | 95.72 ± 1.29 | 0.36 | 0.7860 | 95.45 ± 2.08 | 0.58 | 0.6734 |
| 0.78 | 0.8215 | 96.02 ± 1.65 | 0.67 |
| Heartcancer (303 × 14) | 96.38 ± 2.62 | 0.35 | 0.0247 | 95.05 ± 3.12 | 0.27 | 0.0131 | 98.05 ± 2.96 | 0.46 | 0.5672 |
| 0.54 | NaN |
| 0.46 |
| Heart_disease (294 × 13) | 75.85 ± 2.71 | 0.31 | 0.0178 | 79.31 ± 4.76 | 0.24 | 0.7579 | 82.31 ± 4.26 | 0.38 | 0.8973 | 83.00 ± 3.53 | 0.46 | 0.9085 |
| 0.40 |
| Fertility (100 × 9) | 87.12 ± 4.45 | 0.05 | 0.2239 | 88.18 ± 4.61 | 0.03 | 0.5031 | 88.08 ± 1.94 | 0.06 | 0.4589 |
| 0.07 |
| 0.06 | |
| Ech-diogram (131 × 10) | 87.68 ± 5.25 | 0.07 | 0.6181 | 86.16 ± 5.95 | 0.06 | 0.2633 | 89.24 ± 5.64 | 0.09 | 0.8575 | 89.96 ± 6.41 | 0.10 | 0.9951 |
| 0.09 |
| Balancescale (576 × 4) | 97.92 ± 1.61 | 1.34 | 0.0709 | 98.79 ± 0.88 | 0.98 | 0.1146 | 99.19 ± 0.82 | 1.58 | 0.3678 | 99.48 ± 0.69 | 1.89 | 0.6840 |
| 1.65 |
| WPBC (198 × 34) | 79.85 ± 5.15 | 0.15 | 0.4901 | 80.35 ± 6.11 | 0.13 | 0.6146 |
| 0.20 | 0.7236 | 82.87 ± 3.52 | 0.24 | 0.8650 | 82.36 ± 4.67 | 0.20 |
| WDBC (569 × 30) | 96.14 ± 1.03 | 1.27 | 0.0075 | 98.41 ± 0.87 | 0.91 | 0.6163 | 98.62 ± 0.95 | 1.32 | 0.7148 | 98.59 ± 0.89 | 2.04 | 0.8044 |
| 1.71 |
| Vertebral (310 × 6) | 85.48 ± 2.28 | 0.36 | 0.4774 | 85.81 ± 3.44 | 0.27 | 0.6224 | 86.41 ± 2.54 | 0.42 | 0.8012 | 86.77 ± 2.58 | 0.53 | 0.8894 |
| 0.51 |
| Australian (690 × 14) | 85.51 ± 1.62 | 2.05 | 0.4526 | 86.23 ± 2.00 | 1.37 | 0.6902 | 86.78 ± 1.69 | 2.39 | 0.8536 | 86.96 ± 1.28 | 3.11 | 0.9451 |
| 2.73 |
| BTSC (748 × 4) | 77.01 ± 2.86 | 2.41 | 0.2378 | 78.88 ± 1.91 | 1.56 | 0.6675 | 79.46 ± 2.51 | 2.76 | 0.8245 |
| 3.71 | 0.9504 | 79.68 ± 3.07 | 3.33 |
| Tic_tac_toe (958 × 27) |
| 3.99 | 1 |
| 2.69 | 1 |
| 4.73 | 1 |
| 9.37 | 1 |
| 5.45 |
| German (1000 × 24) | 75.40 ± 2.85 | 4.56 | 0.4943 | 74.40 ± 1.66 | 2.93 | 0.2232 | 76.32 ± 1.87 | 5.21 | 0.7435 | 77.00 ± 1.03 | 9.35 | 0.9288 |
| 8.27 |
| CMC (1473 × 9) | 77.39 ± 0.13 | 9.99 | 0.1728 | 77.66 ± 0.55 | 6.17 | 0.3959 | 78.46 ± 0.55 | 2.30 | 0.9765 | 78.28 ± 0.88 | 16.78 | 0.9246 |
| 15.31 |
Fig. 2Changes of accuracy as the growth of