| Literature DB >> 35309595 |
Abstract
Classification is a technique in data mining that is used to predict the value of a categorical variable and to produce input data and datasets of varying values. The classification algorithm makes use of the training datasets to build a model which can be used for allocating unclassified records to a defined class. In this paper, the coronavirus herd immunity optimizer (CHIO) algorithm is used to boost the efficiency of the probabilistic neural network (PNN) when solving classification problems. First, the PNN produces a random initial solution and submits it to the CHIO, which then attempts to refine the PNN weights. This is accomplished by the management of random phases and the effective identification of a search space that can probably decide the optimal value. The proposed CHIO-PNN approach was applied to 11 benchmark datasets to assess its classification accuracy, and its results were compared with those of the PNN and three methods in the literature, the firefly algorithm, African buffalo algorithm, and β-hill climbing. The results showed that the CHIO-PNN achieved an overall classification rate of 90.3% on all datasets, at a faster convergence speed as compared outperforming all the methods in the literature. Supplementary Information: The online version contains supplementary material available at 10.1007/s00500-022-06917-z.Entities:
Keywords: Classification problem; Coronavirus herd immunity optimizer; Data mining; Metaheuristics; Probabilistic neural network
Year: 2022 PMID: 35309595 PMCID: PMC8922087 DOI: 10.1007/s00500-022-06917-z
Source DB: PubMed Journal: Soft comput ISSN: 1432-7643 Impact factor: 3.643
Fig. 1Population hierarchy in herd immunity scenario (Al-Betar et al. 2020)
Fig. 2Flowchart of CHIO model
Fig. 3Representation of obtaining initial and final weights by CHIO-PNN
Fig. 4Proposed CHIO-PNN approach
Characteristics of the datasets
| Dataset | No. of attributes | Training set | Test set | |
|---|---|---|---|---|
| 1 | Haberman surgery survival (HSS) | 3 | 206 | 77 |
| 2 | PIMA Indian diabetes (PID) | 8 | 518 | 192 |
| 3 | Appendicitis (AP) | 7 | 71 | 27 |
| 4 | Breast Cancer (BC) | 10 | 193 | 72 |
| 5 | BUPA Liver Disorders (LD) | 6 | 233 | 86 |
| 6 | Statlog (Heart) | 13 | 182 | 68 |
| 7 | German Credit Data (GCD) | 20 | 675 | 250 |
| 8 | Parkinson’s | 23 | 131 | 49 |
| 9 | SPECTF | 45 | 180 | 67 |
| 10 | Australian Credit Approval (ACA) | 14 | 465 | 173 |
| 11 | Fourclass | 2 | 581 | 216 |
Parameter settings
| Parameter | Value |
|---|---|
| HIS | 30 |
| Max_Age | 100 |
| BRr | 0.01 |
| Max_Itr | 100 |
| LB (lower bound) | 0 |
| UB (upper bound) | 1 |
Results obtained by PNN and CHIO-PNN
| Dataset | Method | TP | FP | TN | FN | Accuracy | Sensitivity | Specificity | Error rate |
|---|---|---|---|---|---|---|---|---|---|
| PID | PNN | 35 | 28 | 90 | 39 | 65.104 | 0.473 | 0.763 | 0.349 |
| CHIO-PNN | 40 | 23 | 120 | 9 | 83.850 | 0.816 | 0.839 | 0.165 | |
| HSS | PNN | 44 | 12 | 6 | 15 | 64.935 | 0.746 | 0.333 | 0.351 |
| CHIO-PNN | 51 | 5 | 14 | 7 | 85.410 | 0.854 | 0.800 | 0.145 | |
| AP | PNN | 23 | 1 | 1 | 2 | 88.889 | 0.920 | 0.500 | 0.111 |
| CHIO-PNN | 24 | 0 | 2 | 1 | 96.296 | 0.960 | 1.000 | 0.038 | |
| BC | PNN | 14 | 9 | 36 | 13 | 69.444 | 0.519 | 0.800 | 0.306 |
| CHIO-PNN | 18 | 5 | 47 | 2 | 90.020 | 0.761 | 0.862 | 0.168 | |
| LD | PNN | 18 | 15 | 34 | 19 | 60.465 | 0.486 | 0.694 | 0.395 |
| CHIO-PNN | 32 | 3 | 47 | 4 | 91.860 | 0.766 | 0.897 | 0.080 | |
| Heart | PNN | 27 | 5 | 23 | 13 | 73.529 | 0.675 | 0.821 | 0.265 |
| CHIO-PNN | 30 | 0 | 24 | 12 | 82.350 | 0.680 | 1.000 | 0.171 | |
| GCD | PNN | 133 | 46 | 39 | 32 | 68.800 | 0.806 | 0.459 | 0.312 |
| CHIO-PNN | 165 | 14 | 44 | 27 | 83.600 | 0.850 | 0.750 | 0.168 | |
| Parkinson’s | PNN | 38 | 1 | 6 | 4 | 89.796 | 0.905 | 0.857 | 0.102 |
| CHIO-PNN | 39 | 0 | 6 | 4 | 91.830 | 0.906 | 1.000 | 0.821 | |
| SPECTF | PNN | 49 | 4 | 5 | 9 | 80.597 | 0.845 | 0.556 | 0.194 |
| CHIO-PNN | 49 | 2 | 14 | 2 | 94.020 | 0.960 | 0.750 | 0.060 | |
| ACA | PNN | 60 | 14 | 84 | 15 | 83.237 | 0.800 | 0.857 | 0.168 |
| CHIO-PNN | 70 | 4 | 95 | 4 | 95.790 | 0.933 | 0.959 | 0.043 | |
| Fourclass | PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 |
| CHIO-PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 |
Comparison of CHIO-PNN with previous methods
| Datasets | Methods | TP | FP | TN | FN | Accuracy | Sensitivity | Specificity | Error rate |
|---|---|---|---|---|---|---|---|---|---|
| PID | PNN | 35 | 28 | 90 | 39 | 65.104 | 0.473 | 0.763 | 0.349 |
| CHIO-PNN | 40 | 23 | 120 | 9 | 0.816 | 0.839 | 0.165 | ||
| FA-PNN | 33 | 30 | 113 | 16 | 76.040 | 0.673 | 0.790 | 0.140 | |
| ABO-PNN | 45 | 18 | 115 | 14 | 83.330 | 0.760 | 0.860 | 0.170 | |
| β-HC-PNN | 142 | 37 | 319 | 20 | 81.250 | 0.880 | 0.900 | 0.110 | |
| WEA-PNN | 39 | 24 | 122 | 7 | 83.854 | 0.847 | 0.857 | 0.161 | |
| HSS | PNN | 44 | 12 | 6 | 15 | 64.935 | 0.746 | 0.333 | 0.351 |
| CHIO-PNN | 51 | 5 | 14 | 7 | 85.410 | 0.854 | 0.800 | 0.145 | |
| FA-PNN | 54 | 2 | 10 | 11 | 83.120 | 0.830 | 0.833 | 0.168 | |
| ABO-PNN | 54 | 2 | 11 | 10 | 84.420 | 0.840 | 0.850 | 0.160 | |
| β-HC-PNN | 139 | 12 | 34 | 21 | 0.870 | 0.740 | 0.160 | ||
| WEA-PNN | 53 | 3 | 12 | 9 | 84.420 | 0.854 | 0.800 | 0.155 | |
| AP | PNN | 23 | 1 | 1 | 2 | 88.889 | 0.920 | 0.500 | 0.111 |
| CHIO-PNN | 24 | 0 | 2 | 1 | 96.296 | 0.960 | 1.000 | 0.038 | |
| FA-PNN | 24 | 0 | 1 | 2 | 92.590 | 0.923 | 1.000 | 0.075 | |
| ABO-PNN | 24 | 0 | 2 | 1 | 0.960 | 1.00 | 0.040 | ||
| β-HC-PNN | 53 | 2 | 15 | 1 | 0.980 | 0.880 | 0.040 | ||
| WEA-PNN | 24 | 0 | 1 | 2 | 92.59 | 0.923 | 1.000 | 0.074 | |
| BC | PNN | 14 | 9 | 36 | 13 | 69.444 | 0.519 | 0.800 | 0.306 |
| CHIO-PNN | 18 | 5 | 47 | 2 | 0.761 | 0.862 | 0.168 | ||
| FA-PNN | 31 | 1 | 24 | 12 | 80.880 | 0.720 | 0.960 | 0.19 | |
| ABO-PNN | 18 | 5 | 43 | 6 | 84.720 | 0.750 | 0.900 | 0.150 | |
| β-HC-PNN | 49 | 7 | 125 | 12 | 84.720 | 0.800 | 0.950 | 0.100 | |
| WEA-PNN | 17 | 6 | 44 | 5 | 84.72 | 0.772 | 0.880 | 0.152 | |
| LD | PNN | 18 | 15 | 34 | 19 | 60.465 | 0.486 | 0.694 | 0.395 |
| CHIO-PNN | 32 | 3 | 47 | 4 | 91.860 | 0.766 | 0.897 | 0.080 | |
| FA-PNN | 31 | 1 | 24 | 12 | 79.07 | 0.720 | 0.960 | 0.210 | |
| ABO-PNN | 28 | 5 | 45 | 8 | 84.880 | 0.780 | 0.900 | 0.150 | |
| β-HC-PNN | 77 | 22 | 112 | 22 | 0.780 | 0.840 | 0.190 | ||
| WEA-PNN | 24 | 9 | 49 | 4 | 84.88 | 0.857 | 0.844 | 0.151 | |
| Heart | PNN | 27 | 5 | 23 | 13 | 73.529 | 0.675 | 0.821 | 0.265 |
| CHIO-PNN | 30 | 0 | 24 | 12 | 82.350 | 0.680 | 1.000 | 0.171 | |
| FA-PNN | 31 | 1 | 24 | 12 | 80.880 | 0.720 | 0.960 | 0.190 | |
| ABO-PNN | 32 | 0 | 24 | 12 | 82.350 | 0.730 | 1.000 | 0.180 | |
| β-HC-PNN | 79 | 3 | 90 | 10 | 0.890 | 0.970 | 0.070 | ||
| WEA-PNN | 32 | 0 | 25 | 11 | 83.82 | 0.744 | 1.000 | 0.161 | |
| GCD | PNN | 133 | 46 | 39 | 32 | 68.800 | 0.806 | 0.459 | 0.312 |
| CHIO-PNN | 165 | 14 | 44 | 27 | 0.850 | 0.750 | 0.168 | ||
| FA-PNN | 166 | 13 | 30 | 41 | 78.400 | 0.801 | 0.697 | 0.216 | |
| ABO-PNN | 157 | 22 | 50 | 21 | 82.800 | 0.880 | 0.690 | 0.170 | |
| β-HC-PNN | 439 | 27 | 182 | 27 | 80.800 | 0.940 | 0.870 | 0.080 | |
| WEA-PNN | 165 | 14 | 42 | 29 | 82.800 | 0.850 | 0.750 | 0.172 | |
| Parkinson’s | PNN | 38 | 1 | 6 | 4 | 89.796 | 0.905 | 0.857 | 0.102 |
| CHIO-PNN | 39 | 0 | 6 | 4 | 91.830 | 0.906 | 1.000 | 0.821 | |
| FA-PNN | 38 | 1 | 6 | 4 | 89.800 | 0.904 | 0.857 | 0.120 | |
| ABO-PNN | 39 | 0 | 8 | 2 | 0.950 | 1.000 | 0.040 | ||
| β-HC-PNN | 95 | 0 | 35 | 1 | 91.840 | 0.990 | 1.000 | 0.010 | |
| WEA-PNN | 93 | 0 | 7 | 3 | 93.88 | 0.928 | 1.000 | 0.062 | |
| SPECTF | PNN | 49 | 4 | 5 | 9 | 80.597 | 0.845 | 0.556 | 0.194 |
| CHIO-PNN | 49 | 2 | 14 | 2 | 0.960 | 0.750 | 0.060 | ||
| FA-PNN | 25 | 1 | 10 | 4 | 92.540 | 0.926 | 0.909 | 0.075 | |
| ABO-PNN | 51 | 2 | 9 | 5 | 89.550 | 0.910 | 0.820 | 0.100 | |
| β-HC-PNN | 138 | 7 | 30 | 5 | 93.040 | 0.990 | 0.940 | 0.020 | |
| WEA-PNN | 49 | 4 | 12 | 2 | 91.04 | 0.960 | 0.750 | 0.086 | |
| ACA | PNN | 60 | 14 | 84 | 15 | 83.237 | 0.800 | 0.857 | 0.168 |
| CHIO-PNN | 70 | 4 | 95 | 4 | 0.933 | 0.959 | 0.043 | ||
| FA-PNN | 65 | 9 | 94 | 5 | 91.910 | 0.928 | 0.912 | 0.080 | |
| ABO-PNN | 69 | 5 | 95 | 4 | 94.800 | 0.950 | 0.950 | 0.050 | |
| β-HC-PNN | 197 | 11 | 245 | 12 | 93.060 | 0.940 | 0.960 | 0.050 | |
| WEA-PNN | 71 | 3 | 94 | 5 | 95.38 | 0.934 | 0.969 | 0.046 | |
| Fourclass | PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 |
| CHIO-PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 | |
| FA-PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 | |
| ABO-PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 | |
| β-HC-PNN | 78 | 0 | 138 | 0 | 100.000 | 1.000 | 1.000 | 0.000 | |
| WEA-PNN | 78 | 0 | 138 | 0 | 100.00 | 1.000 | 1.000 | 0.000 |
Fig. 5Average of the best accuracy of tested methods
Fig. 6Convergence speed of tested methods
The statistics and P values of the T test for the accuracy of CHIO and FA
| Dataset | Model | Mean | Std. deviation | Std. error mean | |
|---|---|---|---|---|---|
| PID | CHIO | 82.6900 | 0.54000 | 0.54000 | 0.00015 |
| FA | 73.4895 | 1.28560 | 0.23472 | ||
| HSS | CHIO | 83.3600 | 0.81000 | 0.81000 | 0.00011 |
| FA | 81.8179 | 1.02322 | 0.18681 | ||
| AP | CHIO | 96.2900 | 0.00000 | 0.00000 | 0.00012 |
| FA | 92.5926 | 0.00012 | 0.00002 | ||
| BC | CHIO | 81.8900 | 1.87000 | 1.80000 | 0.00017 |
| FA | 77.3935 | 1.74347 | 0.31831 | ||
| LD | CHIO | 81.7400 | 3.08000 | 3.08000 | 0.00000 |
| FA | 75.5810 | 1.49604 | 0.27310 | ||
| Heart | CHIO | 80.6300 | 2.01000 | 2.0000 | 0.00000 |
| FA | 78.6819 | 2.23781 | 0.40857 | ||
| GCD | CHIO | 82.8000 | 0.00000 | 0.00000 | 0.00014 |
| FA | 82.8000 | 0.00000 | 0.28854 | ||
| Parkinson’s | CHIO | 91.2200 | 1.09000 | 1.09000 | 0.00000 |
| FA | 89.7950 | 0.00000 | 0.00000 | ||
| SPECTF | CHIO | 89.3500 | 1.06000 | 1.60000 | 0.00000 |
| FA | 88.8057 | 1.82787 | 0.33372 | ||
| ACA | CHIO | 94.5100 | 0.57000 | 0.57000 | 0.00000 |
| FA | 89.8840 | 1.05983 | 0.19350 | ||
| Fourclass | CHIO | 100.000 | 0.00000 | 0.00000 | 0.00000 |
| FA | 100.000 | 0.00000 | 0.00000 |
Fig. 7Boxplots for CHIO and FA