| Literature DB >> 25802511 |
Chih-Feng Chao1, Ming-Huwi Horng1.
Abstract
The setting of parameters in the support vector machines (SVMs) is very important with regard to its accuracy and efficiency. In this paper, we employ the firefly algorithm to train all parameters of the SVM simultaneously, including the penalty parameter, smoothness parameter, and Lagrangian multiplier. The proposed method is called the firefly-based SVM (firefly-SVM). This tool is not considered the feature selection, because the SVM, together with feature selection, is not suitable for the application in a multiclass classification, especially for the one-against-all multiclass SVM. In experiments, binary and multiclass classifications are explored. In the experiments on binary classification, ten of the benchmark data sets of the University of California, Irvine (UCI), machine learning repository are used; additionally the firefly-SVM is applied to the multiclass diagnosis of ultrasonic supraspinatus images. The classification performance of firefly-SVM is also compared to the original LIBSVM method associated with the grid search method and the particle swarm optimization based SVM (PSO-SVM). The experimental results advocate the use of firefly-SVM to classify pattern classifications for maximum accuracy.Entities:
Mesh:
Year: 2015 PMID: 25802511 PMCID: PMC4352751 DOI: 10.1155/2015/212719
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1The plots of correct classification rate versus iteration numbers of SPECTF heart and Sonar data sets.
The used data set extracted from the UCI machine learning data repository.
| Data set collected from UCI the machine learning repository | ||
|---|---|---|
| Data set | Number of instances | Number of features |
| SPECTF heart | 267 | 44 |
| Breast Cancer Wisconsin (diagnosis) | 569 | 32 |
| Statlog (heart) | 270 | 13 |
| German credit data | 1000 | 20 |
| Sonar | 208 | 60 |
| Pima-Indians-diabetes | 768 | 8 |
| Australian credit approval | 690 | 14 |
| Live disorders | 345 | 7 |
| Ionosphere | 351 | 34 |
| Breast Cancer Wisconsin (original) | 699 | 10 |
The CCR of three different algorithms without feature selection (mean ± SD).
| Data set | Firefly-SVM | PSO-SVM | LIBSVM |
|---|---|---|---|
| SPECTF heart | 84.27 ± 0.95 | 80.19 ± 2.44 | 80.19 ± 2.73 |
| Breast Cancer Wisconsin (diagnosis) | 98.44 ± 0.39 | 97.54 ± 0.90 | 97.54 ± 1.54 |
| Statlog (heart) | 86.75 ± 2.19 | 86.75 ± 2.0 | 85.25 ± 2.99 |
| German credit data | 77.60 ± 0.89 | 75.33 ± 2.61 | 76.34 ± 1.24 |
| Sonar | 93.46 ± 3.31 | 92.69 ± 4.18 | 89.23 ± 4.63 |
| Pima-Indians-diabetes | 77.87 ± 1.33 | 76.34 ± 2.59 | 72.25 ± 3.22 |
| Australian credit approval | 88.63 ± 1.40 | 84.34 ± 3.77 | 83.76 ± 3.36 |
| Live disorders | 75.36 ± 2.99 | 73.22 ± 3.28 | 69.86 ± 2.67 |
| Ionosphere | 96.87 ± 1.91 | 95.21 ± 2.84 | 94.35 ± 2.25 |
| Breast Cancer Wisconsin (original) | 97.60 ± 0.37 | 96.62 ± 0.96 | 95.52 ± 0.92 |
The Matthews correlation coefficient for Table 2.
| Data set | Firefly-SVM | PSO-SVM | LIBSVM |
|---|---|---|---|
| SPECTF heart | 0.5104 | 0.4453 | 0.4592 |
| Breast Cancer Wisconsin (diagnosis) | 0.9663 | 0.9475 | 0.9598 |
| Statlog (heart) | 0.9962 | 0.9242 | 0.9019 |
| German credit data | 0.4981 | 0.3977 | 0.4721 |
| Sonar | 0.8488 | 0.8358 | 0.7865 |
| Pima-Indians-diabetes | 0.5580 | 0.5258 | 0.4206 |
| Australian credit approval | 0.7707 | 0.6417 | 0.6734 |
| Liver disorders | 0.4967 | 0.4529 | 0.3869 |
| Ionosphere | 0.9374 | 0.9032 | 0.8863 |
| Breast Cancer Wisconsin (original) | 0.9462 | 0.9242 | 0.9019 |
Classification results of the firefly-SVM and PSO-SVM with feature selection.
| Data sets |
| Firefly-SVM | PSO-SVM | ||
|---|---|---|---|---|---|
| CCR (mean ± SD) | Number of features | CCR (mean ± SD) | Average number of features | ||
| SPECTF heart | 44 | 85.89 ± 1.43 | 18 | 82.34 ± 2.174 | 22.6 |
| Breast Cancer Wisconsin (diagnosis) | 32 | 98.44 ± 0.39 | 14 | 98.44 ± 0.90 | 13.4 |
| Statlog (heart) | 13 | 86.75 ± 2.19 | 8 | 87.51 ± 2.21 | 8.6 |
| German credit data | 20 | 79.34 ± 1.14 | 12 | 75.33 ± 2.23 | 14.3 |
| Sonar | 60 | 95.23 ± 3.41 | 20 | 91.31 ± 2.42 | 32.7 |
| Pima-Indians-diabetes | 8 | 78.83 ± 1.62 | 5 | 77.87 ± 1.49 | 5.4 |
| Australian credit approval | 14 | 89.43 ± 1.65 | 8 | 84.34 ± 4.77 | 8.6 |
| Liver disorders | 7 | 75.36 ± 2.29 | 4 | 74.76 ± 2.12 | 5.2 |
| Ionosphere | 34 | 97.98 ± 3.15 | 12 | 95.21 ± 2.14 | 17.6 |
| Breast Cancer Wisconsin (original) | 10 | 98.67 ± 0.53 | 6 | 98.67 ± 1.39 | 6.7 |
Performance evaluation for each disease group.
| Method used | Sensitivity (%) | Specificity (%) | False negative rate (%) | Accuracy (%) |
|---|---|---|---|---|
| Method 1: firefly-SVM based OAA-FSVM | 2.5 | 92.50 ± 1.37 | ||
| (1) Normal | 93.10 | 90.00 | ||
| (2) Inflammation tendon | 93.10 | 96.42 | ||
| (3) Calcific tendon | 96.67 | 93.10 | ||
| (4) Supraspinatus tear | 100.00 | 100.00 | ||
|
| ||||
| Method 2: original OAA-FSVM trained by LIBSVM | 3.33 | 89.10 ± 2.49 | ||
| (1) Normal | 83.33 | 95.56 | ||
| (2) Inflammation tendon | 86.67 | 95.56 | ||
| (3) Calcific tendon | 90.00 | 96.67 | ||
| (4) Supraspinatus tear | 100.00 | 100.00 | ||
Performance indices for the firefly-SVM based and LIBSVM based OAA-FSVM.
| Measures | Firefly-SVM based OAA-FSVM | LIBSVM based OAA-FSVM |
|---|---|---|
| Accuracy (%) | 92.5 | 89.1 |
| Sensitivity (%) | 96.6 | 90.0 |
| Specificity (%) | 87.1 | 86.0 |
| Youden's index* (%) | 83.7 | 76.0 |
|
| 95.9 | 92.6 |
*Youden's index = Sensitivity + Specificity – 1.