| Literature DB >> 35106027 |
Yuxian Duan1,2, Changyun Liu1, Song Li1, Xiangke Guo1, Chunlin Yang2,3.
Abstract
Clustering analysis is essential for obtaining valuable information from a predetermined dataset. However, traditional clustering methods suffer from falling into local optima and an overdependence on the quality of the initial solution. Given these defects, a novel clustering method called gradient-based elephant herding optimization for cluster analysis (GBEHO) is proposed. A well-defined set of heuristics is introduced to select the initial centroids instead of selecting random initial points. Specifically, the elephant optimization algorithm (EHO) is combined with the gradient-based algorithm GBO for assigning initial cluster centers across the search space. Second, to overcome the imbalance between the original EHO exploration and exploitation, the initialized population is improved by introducing Gaussian chaos mapping. In addition, two operators, i.e., random wandering and variation operators, are set to adjust the location update strategy of the agents. Nine datasets from synthetic and real-world datasets are adopted to evaluate the effectiveness of the proposed algorithm and the other metaheuristic algorithms. The results show that the proposed algorithm ranks first among the 10 algorithms. It is also extensively compared with state-of-the-art techniques, and four evaluation criteria of accuracy rate, specificity, detection rate, and F-measure are used. The obtained results clearly indicate the excellent performance of GBEHO, while the stability is also more prominent.Entities:
Keywords: Cluster analysis; Elephant herding optimization; Metaheuristic algorithm; Real-world datasets
Year: 2022 PMID: 35106027 PMCID: PMC8795968 DOI: 10.1007/s10489-021-03020-y
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.086
Fig. 1Flowchart of GBEHO
Details of nine benchmark functions
| No. | Function | Dimension | Range | |
|---|---|---|---|---|
| F1 |
| 30 | [-100,100] | 0 |
| F2 | 30 | [-10,10] | 0 | |
| F3 | 30 | [-100,100] | 0 | |
| F4 | 30 | [-100,100] | 0 | |
| F5 | 30 | [-30,30] | 0 | |
| F6 | 30 | [-100,100] | 0 | |
| F7 | 30 | [-1.28,1.28] | 0 | |
| F8 | 30 | [-500,500] | ||
| F9 | 30 | [-5.12,5.12] | 0 | |
| F10 | 30 | [-32,32] | 0 | |
| F11 | 30 | [-600,600] | 0 | |
| F12 | 30 | [-50,50] | 0 | |
| F13 | 30 | [-50,50] | 0 | |
| F14 | 2 | [-65,65] | 1 | |
| F15 | 4 | [-5,5] | 0.00030 | |
| F16 | 2 | [-5,5] | -1.0316 | |
| F17 | 2 | [-5,5] | 0.398 | |
| F18 | 2 | [-2,2] | 3 | |
| F19 | 3 | [1,3] | -3.86 | |
| F20 | 6 | [0,1] | -3.32 | |
| F21 | 4 | [0,10] | -10.1532 | |
| F22 | 4 | [0,10] | -10.4028 | |
| F23 | 4 | [0,10] | -10.5363 |
Comparison results on 23 benchmark functions of EHO and GBEHO with PSR varying from 0.2 to 0.8
| Datasets | EHO | GB2 | GB3 | GB4 | GB5 | GB6 | GB7 | GB8 | |
|---|---|---|---|---|---|---|---|---|---|
| F1 | Mean | 1.51E-03 | 1.25E-265 | 4.64E-234 | 1.25E-212 | 5.16E-190 | 4.80E-169 | 2.19E-138 | |
| Std | 3.27E-03 | 7.12E-264 | 8.95E-233 | 1.75E-211 | 3.96E-190 | 8.49E-168 | 1.00E-137 | ||
| p-value | 2.95E-11 | 1.51E-05 | 6.41E-09 | 3.61E-11 | 2.95E-11 | 2.95E-11 | 2.95E-11 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F2 | Mean | 9.46E-03 | 1.38E-224 | 5.33E-202 | 4.70E-188 | 9.46E-162 | 4.66E-146 | 7.61E-126 | |
| Std | 1.15E-02 | 7.23E-223 | 2.95E-202 | 5.66E-187 | 5.18E-161 | 1.79E-145 | 3.49E-125 | ||
| p-value | 3.02E-11 | 5.09E-08 | 3.34E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F3 | Mean | 1.35E + 00 | 1.69E-191 | 2.52E-179 | 1.98E-163 | 3.79E-145 | 9.17E-119 | 4.25E-103 | |
| Std | 2.91E + 00 | 7.69E-190 | 4.12E-178 | 8.95E-162 | 2.07E-144 | 5.03E-118 | 2.31E-102 | ||
| p-value | 3.02E-11 | 3.03E-03 | 1.78E-10 | 3.34E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F4 | Mean | 4.90E-03 | 9.25E-218 | 1.19E-200 | 1.47E-180 | 1.13E-163 | 4.59E-144 | 6.00E-123 | |
| Std | 7.23E-03 | 1.92E-217 | 5.46E-198 | 6.74E-179 | 3.87E-162 | 2.48E-143 | 3.29E-122 | ||
| p-value | 3.02E-11 | 3.35E-08 | 6.70E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F5 | Mean | 4.23E-01 | 6.02E-01 | 7.21E-01 | 5.02E-01 | 6.13E-01 | 4.49E-01 | 4.22E-01 | |
| Std | 4.45E-01 | 1.27E + 00 | 6.84E-01 | 4.69E-01 | 1.09E + 00 | 9.84E-01 | 8.43E-01 | ||
| p-value | 1.32E-04 | 3.15E-05 | 2.50E-05 | 2.38E-05 | 3.50E-05 | 9.47E-05 | 1.68E-05 | ||
| rank | 1 | 3 | 6 | 8 | 5 | 7 | 4 | 2 | |
| F6 | Mean | 4.80E-02 | 6.71E-02 | 1.13E-01 | 9.93E-02 | 1.02E-01 | 9.42E-02 | 1.12E-01 | |
| Std | 5.32E-02 | 5.82E-02 | 8.24E-02 | 9.56E-02 | 7.94E-02 | 7.56E-02 | 8.11E-02 | ||
| p-value | 1.70E-08 | 1.33E-10 | 6.12E-10 | 8.15E-11 | 2.61E-10 | 4.08E-11 | 4.98E-11 | ||
| rank | 1 | 2 | 3 | 8 | 5 | 6 | 4 | 7 | |
| F7 | Mean | 2.11E-03 | 3.83E-04 | 4.83E-04 | 5.70E-04 | 3.54E-04 | 4.51E-04 | 5.20E-04 | |
| Std | 1.75E-03 | 3.25E-04 | 3.85E-04 | 4.48E-04 | 2.70E-04 | 3.42E-04 | 3.94E-04 | ||
| p-value | 1.19E-06 | 6.10E-03 | 2.42E-02 | 4.84E-02 | |||||
| rank | 8 | 3 | 5 | 7 | 2 | 1 | 4 | 6 | |
| F8 | Mean | -8.17E + 03 | -7.80E + 03 | -8.15E + 03 | -8.49E + 03 | -8.19E + 03 | -8.52E + 03 | -8.68E + 03 | |
| Std | 1.25E + 02 | 6.25E + 02 | 6.48E + 02 | 9.94E + 01 | 1.04E + 02 | 5.92E + 01 | 5.91E + 01 | ||
| p-value | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.34E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | ||
| rank | 1 | 6 | 8 | 7 | 4 | 5 | 3 | 2 | |
| F9 | Mean | 3.22E-02 | 5.35E-28 | 2.68E-22 | 1.89E-20 | 4.21E-18 | 6.41E-15 | 5.47E-13 | |
| Std | 5.88E-02 | 3.98E-27 | 5.91E-21 | 6.19E-19 | 9.13E-17 | 3.49E-14 | 6.77E-12 | ||
| p-value | 1.21E-12 | 8.23E-04 | 5.31E-05 | 5.31E-05 | 5.31E-05 | 5.31E-05 | 5.31E-05 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F10 | Mean | 8.34E-04 | 2.13E-13 | 7.64E-11 | 3.86E-09 | 5.56E-08 | 1.69E-06 | ||
| Std | 1.12E-04 | 5.31E-12 | 2.19E-10 | 1.51E-08 | 8.82E-07 | 4.42E-06 | |||
| p-value | 1.21E-12 | 5.36E-07 | 8.91E-08 | 1.21E-12 | 1.21E-12 | 1.21E-12 | |||
| rank | 8 | 1 | 1 | 3 | 4 | 5 | 6 | 7 | |
| F11 | Mean | 4.77E-04 | 4.52E-20 | 8.52E-17 | 2.39E-15 | 1.13E-11 | 6.39E-09 | 5.61E-07 | |
| Std | 4.91E-04 | 5.39E-19 | 2.98E-16 | 4.33E-14 | 2.56E-10 | 2.99E-08 | 4.11E-07 | ||
| p-value | 1.21E-12 | 9.14E-08 | 1.21E-12 | 1.21E-12 | 1.21E-12 | 1.21E-12 | 1.21E-12 | ||
| rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| F12 | Mean | 1.78E-03 | 2.66E-03 | 2.65E-03 | 2.91E-03 | 3.86E-03 | 4.03E-03 | 4.25E-03 | |
| Std | 1.21E-03 | 1.54E-03 | 1.93E-03 | 1.89E-03 | 2.51E-03 | 2.51E-03 | 2.44E-03 | ||
| p-value | 2.87E-10 | 4.08E-11 | 3.16E-10 | 9.92E-11 | 3.47E-10 | 3.69E-11 | 4.50E-11 | ||
| rank | 1 | 2 | 4 | 3 | 5 | 6 | 7 | 8 | |
| F13 | Mean | 8.61E-04 | 1.24E-04 | 9.09E-04 | 2.88E-03 | 6.42E-03 | 1.44E-02 | 2.29E-02 | |
| Std | 1.91E-03 | 1.86E-04 | 1.73E-03 | 3.94E-03 | 8.54E-03 | 2.83E-02 | 3.56E-02 | ||
| p-value | 2.77E-05 | 2.60E-05 | 7.62E-03 | 1.87E-05 | 3.37E-05 | 3.03E-03 | 3.16E-05 | ||
| rank | 3 | 1 | 2 | 4 | 5 | 6 | 7 | 8 | |
| F14 | Mean | 9.92E-01 | 9.98E-01 | 9.98E-01 | 9.94E-01 | 9.92E-01 | 9.93E-01 | 9.93E-01 | |
| Std | 1.44E-04 | 5.24E-06 | 2.43E-05 | 8.74E-04 | 2.43E + 00 | 9.65E-02 | 1.21E + 00 | ||
| p-value | 5.36E-09 | 3.67E-08 | 2.23E-09 | 3.26E-07 | 3.26E-07 | 6.77E-07 | 7.70E-07 | ||
| rank | 7 | 1 | 2 | 3 | 4 | 8 | 5 | 6 | |
| F15 | Mean | 1.65E-03 | 1.16E-03 | 4.83E-04 | 1.18E-03 | 1.01E-03 | 1.35E-03 | 1.95E-03 | |
| Std | 9.23E-04 | 3.66E-03 | 2.43E-04 | 3.64E-03 | 1.10E-03 | 3.67E-03 | 5.14E-03 | ||
| p-value | 4.50E-11 | 1.71E-02 | 2.71E-03 | 3.37E-04 | 6.97E-03 | 3.03E-02 | |||
| rank | 7 | 1 | 4 | 2 | 5 | 3 | 6 | 8 | |
| F16 | Mean | -3.35E-01 | -1.03E + 00 | -1.03E + 00 | -1.03E + 00 | -1.03E + 00 | -1.03E + 00 | -1.03E + 00 | |
| Std | 3.98E-01 | 8.15E-08 | 2.36E-08 | 1.19E-07 | 2.40E-07 | 8.48E-07 | 1.19E-07 | ||
| p-value | 3.02E-11 | 4.94E-05 | 3.51E-02 | 2.89E-03 | 1.03E-06 | 1.86E-06 | |||
| rank | 8 | 2 | 1 | 3 | 4 | 5 | 7 | 6 | |
| F17 | Mean | 4.81E-01 | 3.98E-01 | 3.98E-01 | 3.98E-01 | 3.98E-01 | 3.98E-01 | 3.98E-01 | |
| Std | 9.93E-02 | 3.80E-08 | 8.81E-08 | 5.20E-08 | 2.28E-07 | 1.60E-07 | 4.78E-08 | ||
| p-value | 3.02E-11 | 3.83E-06 | 3.16E-10 | 6.35E-04 | 2.32E-02 | ||||
| rank | 8 | 1 | 2 | 5 | 4 | 7 | 6 | 3 | |
| F18 | Mean | 2.89E + 01 | 3.00E + 00 | 3.00E + 00 | 3.00E + 00 | 3.00E + 00 | 3.00E + 00 | 3.00E + 00 | |
| Std | 7.40E + 00 | 2.11E-06 | 1.53E-06 | 1.50E-06 | 7.29E-05 | 4.43E-05 | 1.48E-05 | ||
| p-value | 3.02E-11 | 2.12E-05 | 9.06E-08 | 2.17E-05 | 7.62E-05 | 1.41E-04 | |||
| rank | 8 | 1 | 4 | 3 | 2 | 7 | 6 | 5 | |
| F19 | Mean | -2.89E + 00 | -3.86E + 00 | -3.86E + 00 | -3.86E + 00 | -3.86E + 00 | -3.86E + 00 | -3.86E + 00 | |
| Std | 5.82E-01 | 1.94E-03 | 9.56E-04 | 1.14E-03 | 5.48E-04 | 8.71E-04 | 5.01E-04 | ||
| p-value | 3.02E-11 | 4.20E-03 | 1.67E-04 | 2.84E-02 | |||||
| rank | 8 | 1 | 7 | 5 | 6 | 3 | 4 | 2 | |
| F20 | Mean | -1.57E + 00 | -3.27E + 00 | -3.27E + 00 | -3.27E + 00 | -3.29E + 00 | -3.27E + 00 | -3.28E + 00 | |
| Std | 5.30E-01 | 7.45E-02 | 7.90E-02 | 7.62E-02 | 7.08E-02 | 7.85E-02 | 7.04E-02 | ||
| p-value | 3.02E-11 | 5.19E-04 | 7.06E-03 | 3.40E-02 | 1.15E-02 | 2.52E-03 | |||
| rank | 8 | 5 | 7 | 4 | 1 | 2 | 6 | 3 | |
| F21 | Mean | -1.01E + 01 | -9.97E + 00 | -9.64E + 00 | -9.81E + 00 | -9.87E + 00 | -9.64E + 00 | -9.81E + 00 | |
| Std | 1.76E-02 | 1.22E-01 | 1.55E + 00 | 1.29E + 00 | 1.29E + 00 | 1.56E + 00 | 1.45E + 00 | ||
| p-value | 2.38E-03 | 7.60E-07 | 1.62E-02 | 7.06E-04 | 9.12E-03 | ||||
| rank | 2 | 3 | 7 | 6 | 4 | 8 | 5 | 1 | |
| F22 | Mean | -1.02E + 01 | -9.34E + 00 | -9.87E + 00 | -9.65E + 00 | -9.51E + 00 | -9.52E + 00 | -9.26E + 00 | |
| Std | 1.83E-01 | 2.16E + 00 | 1.62E + 00 | 1.97E + 00 | 2.01E + 00 | 2.01E + 00 | 2.35E + 00 | ||
| p-value | 3.51E-02 | 7.96E-04 | 3.79E-05 | 1.19E-05 | 1.05E-05 | 1.02E-04 | 2.12E-03 | ||
| rank | 1 | 2 | 7 | 3 | 6 | 5 | 8 | ||
| F23 | Mean | -1.04E + 01 | -9.81E + 00 | -1.04E + 01 | -9.63E + 00 | -9.81E + 00 | -9.86E + 00 | -9.99E + 00 | |
| Std | 1.65E-01 | 3.27E + 00 | 9.87E-01 | 2.05E + 00 | 1.82E + 00 | 2.14E + 00 | 2.13E + 00 | ||
| p-value | 2.24E-02 | 2.05E-03 | 3.92E-02 | 3.15E-02 | |||||
| rank | 1 | 2 | 7 | 3 | 8 | 6 | 5 | 4 | |
| Average Rank | 5.6087 | 1.8696 | 3.8696 | 4.1304 | 4.1818 | 5.2609 | 5.4783 | 5.5652 | |
| All Rank | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
Fig. 2Comparison of the box plots for different algorithms
Fig. 3Comparison of box plots for different algorithms
Comparison results on 6 benchmark functions of different variants
| Datasets | EHO | GEHO | MEHO | RWEHO | GBEHO | |
|---|---|---|---|---|---|---|
| F2 | Mean | 3.42E-03 | 1.53E-24 | |||
| Std | 4.83E-03 | 8.37E-24 | ||||
| Best | 2.12E-05 | 4.15E-45 | ||||
| Worst | 2.46E-02 | 4.58E-23 | ||||
| F9 | Mean | 6.44E-05 | ||||
| Std | 1.59E-04 | |||||
| Best | 3.28E-08 | |||||
| Worst | 6.33E-04 | |||||
| F11 | Mean | 1.20E-04 | ||||
| Std | 1.47E-04 | |||||
| Best | 5.39E-06 | |||||
| Worst | 3.88E-04 | |||||
| F14 | Mean | 9.98E-01 | 9.98E-01 | 9.98E-01 | 9.98E-01 | |
| Std | 2.21E-04 | 3.44E-04 | 4.31E-07 | 2.83E-09 | ||
| Best | 9.98E-01 | 9.98E-01 | 9.98E-01 | 9.98E-01 | ||
| Worst | 9.99E-01 | 9.99E-01 | 9.98E-01 | 9.98E-01 | ||
| F15 | Mean | 7.59E-02 | 2.95E-02 | 6.34E-03 | 5.95E-04 | |
| Std | 2.76E-03 | 2.34E-04 | 3.62E-03 | 4.19E-04 | ||
| Best | 5.35E-03 | 9.09E-03 | 2.08E-03 | 3.12E-04 | ||
| Worst | 1.70E-02 | 1.60E-02 | 9.04E-03 | 7.05E-04 | ||
| F20 | Mean | − 2.19E + 00 | − 3.24E + 00 | − 3.32E + 00 | − 3.32E + 00 | − 3.32E + 00 |
| Std | 7.85E-01 | 5.48E-02 | 5.51E-02 | 2.86E-02 | ||
| Best | − 2.87E + 00 | − 3.32E + 00 | − 3.32E + 00 | − 3.32E + 00 | − 3.32E + 00 | |
| Worst | − 2.04E + 00 | − 3.15E + 00 | − 3.27E + 00 | − 3.28E + 00 | − 3.29E + 00 |
Parameter settings of the different algorithms
| Algorithm | Parameter | Range |
|---|---|---|
| GBEHO | 5 | |
| 0.5 | ||
| 0.2 | ||
| PSO | 2 | |
| 2 | ||
|
| 0.9 | |
|
| 0.2 | |
| DE | Differential weight | 0.8 |
| Crossover probability | 0.2 | |
| GA | Crossover rate | 0.7 |
| Mutation rate | 0.2 | |
| Selection rate | 0.8 | |
| CS | 0.25 | |
| GSA | 100 | |
| 20 | ||
| BA | 0.9 | |
| 0.9 | ||
|
| 0 | |
|
| 2 | |
| QALO-K | 0.2 | |
| 0.5 | ||
| GWOTS | [2,0] | |
| 0.01 | ||
| 5 |
Basic information of the datasets
| Title | Number | Attribute | Cluster |
|---|---|---|---|
| Two-moon | 1502 | 2 | 2 |
| Aggregation | 788 | 2 | 7 |
| Iris | 150 | 3 | 4 |
| Wine | 178 | 3 | 13 |
| Seeds | 210 | 3 | 7 |
| breast | 277 | 9 | 2 |
| heart | 270 | 13 | 2 |
| CMC | 1473 | 9 | 3 |
| Vowel | 874 | 3 | 6 |
Comparison results on different datasets
| Datasets | GBEHO | k-means | PSO | DE | GA | CS | GSA | BA | QALO-K | GWOTS | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Iris | Mean | 96.6555 | 101.65 | 96.6555 | 100.5246 | 97.0601 | 99.6906 | 102.4521 | 98.4628 | 99.9655 | |
| Std | 9.8172 | 2.14E-09 | 1.4261 | 0.3509 | 0.2451 | 2.9389 | 3.5164 | 1.2551 | 2.1593 | ||
| Best | 96.6555 | 97.3259 | 96.6555 | 97.0982 | 96.6611 | 95.8766 | 96.6088 | 96.6555 | 96.9264 | ||
| Worst | 123.8497 | 103.9348 | 97.2282 | 96.9519 | 107.5079 | 108.9816 | 102.0196 | 104.5636 | |||
| Wine | Mean | 16922.5807 | 16292.4966 | 16313.0385 | 16295.4036 | 16652.5692 | 17035.6734 | 16551.9985 | 16368.5625 | 16413.1972 | |
| Std | 1383.6805 | 0.5118 | 5.6243 | 1.8804 | 169.3483 | 27.3914 | 15.1421 | 3.5621 | 9.3362 | ||
| Best | 16555.6794 | 16292.1876 | 16304.4056 | 16292.672 | 16432.0892 | 16989.6022 | 16522.4516 | 16357.4776 | 16399.2559 | ||
| Worst | 23800.1739 | 16293.9858 | 16324.7719 | 16302.228 | 17155.5986 | 17090.6633 | 16582.9983 | 16374.3042 | 16429.2935 | ||
| Seeds | Mean | 313.5273 | 311.7979 | 328.3513 | 311.8186 | 425.4047 | 327.9767 | 353.1714 | 316.3263 | 315.4146 | |
| Std | 8.24E-07 | 0.2578 | 4.8276 | 0.0196 | 13.5094 | 4.5986 | 14.5277 | 2.0355 | 1.8022 | ||
| Best | 313.2168 | 321.3633 | 311.8018 | 401.2814 | 318.404 | 331.5712 | 312.4252 | 312.1504 | |||
| Worst | 313.7343 | 311.7981 | 338.3638 | 311.8956 | 459.032 | 338.4472 | 379.0951 | 319.8667 | 318.4113 | ||
| breast | Mean | 710.8969 | 709.2409 | 705.9913 | 708.3464 | 820.0942 | 725.4121 | 707.2217 | 711.8086 | 721.7778 | |
| Std | 6.6782 | 1.3015 | 1.7923 | 2.8458 | 16.253 | 8.1136 | 5.8082 | 2.5663 | 6.4781 | ||
| Best | 706.7196 | 708.8748 | 703.1575 | 702.8753 | 792.9774 | 713.8192 | 702.2515 | 706.0635 | 710.2501 | ||
| Worst | 730.1364 | 709.4936 | 709.5189 | 711.4823 | 855.0989 | 740.9746 | 717.5345 | 716.4412 | 735.0085 | ||
| heart | Mean | 11218.3322 | 11041.2554 | 11044.8615 | 11045.75 | 13057.0111 | 11076.9611 | 11058.7853 | 11050.7003 | 11054.8329 | |
| Std | 582.7805 | 0.3517 | 1.917 | 3.8177 | 456.0667 | 20.5362 | 8.7843 | 2.5672 | 4.1267 | ||
| Best | 11040.586 | 11108.7834 | 11041.0695 | 11041.4541 | 12044.4999 | 11043.4089 | 11046.6782 | 11044.2447 | 11048.511 | ||
| Worst | 14303.8768 | 11043.2214 | 11048.5204 | 11051.4777 | 14226.2367 | 11109.0933 | 11068.5732 | 11056.5577 | 11060.7088 | ||
| CMC | Mean | 5543.9922 | 5532.0356 | 5582.5772 | 5535.1876 | 6544.9993 | 5567.4964 | 7513.5361 | 5671.2282 | 5642.1387 | |
| Std | 1.522 | 0.0018 | 10.2814 | 2.2088 | 182.9644 | 3.2124 | 602.5733 | 4.7362 | 13.4821 | ||
| Best | 5539.8891 | 5532.1851 | 5565.5292 | 5532.4447 | 6078.6424 | 5562.3148 | 6205.1573 | 5658.0608 | 5619.5534 | ||
| Worst | 5547.2265 | 5532.1921 | 5601.7021 | 5540.069 | 6918.2407 | 5575.149 | 9133.2466 | 5678.3702 | 5672.4519 | ||
| Vowel | Mean | 718.0558 | 749.5712 | 1074.9043 | 759.6974 | 1135.8144 | 858.994 | 1333.5731 | 827.0123 | 776.8145 | |
| Std | 12.6938 | 15.2169 | 12.7467 | 12.8204 | 15.229 | 35.7316 | 45.6782 | 17.3256 | 7.9426 | ||
| Best | 706.8515 | 718.1412 | 1048.3263 | 739.9638 | 1107.727 | 791.5659 | 1182.7827 | 804.0304 | 761.5453 | ||
| Worst | 748.2787 | 779.1062 | 1095.4346 | 791.1734 | 1160.4399 | 944.5888 | 1418.2864 | 869.9818 | 794.1206 | ||
| Two-moon | Mean | 861.5887 | 874.4292 | 861.0024 | 861.0027 | 864.9261 | 861.0024 | 935.3676 | 861.0048 | 861.0025 | |
| Std | 3.47E-13 | 1.63E-11 | 3.05E-04 | 1.24E-03 | 4.7591 | 2.49E-04 | 35.7313 | 7.83E-05 | 2.85E-07 | ||
| Best | 861.5887 | 874.4292 | 861.0596 | 871.0532 | 861.0046 | 861.0025 | |||||
| Worst | 861.5887 | 874.4292 | 862.0022 | 861.0033 | 861.0088 | 881.0101 | 861.003 | 1005.6893 | 861.0049 | ||
| Aggregation | Mean | 2801.593 | 2741.3152 | 2912.3635 | 2741.3672 | 2883.8272 | 2783.4515 | 3482.1218 | 2737.9821 | 2719.3046 | |
| Std | 88.7827 | 56.7838 | 46.2044 | 44.5043 | 48.0931 | 35.6736 | 176.3351 | 32.1578 | 21.2672 | ||
| Best | 2713.3486 | 2713.7931 | 2827.733 | 2712.2452 | 2794.5799 | 2710.9933 | 3142.4736 | 2695.249 | 2691.1079 | ||
| Worst | 3126.1613 | 2967.2318 | 3017.472 | 2906.9183 | 2981.8651 | 2883.6941 | 3928.2248 | 2791.453 | 2758.5424 |
Fig. 4Comparison of box plots for different algorithms
Fig. 5Convergence curves of different algorithms
Results of p-values obtained by the Wilcoxon signed-rank test
| Datasets | PSO | DE | GA | CS | GSA | BA | QALO-K | GWOTS | k-means |
|---|---|---|---|---|---|---|---|---|---|
| Iris | 3.25E-02 | 9.17E-12 | 9.17E-12 | 1.97E-10 | 4.34E-05 | 9.17E-12 | 9.17E-12 | 5.77E-12 | |
| Wine | 1.76E-02 | 3.02E-11 | 6.07E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.16E-12 |
| Seeds | 1.01E-11 | 1.01E-11 | 1.01E-11 | 1.01E-11 | 1.01E-11 | 1.01E-11 | 1.01E-11 | 1.01E-11 | 4.14E-12 |
| breast | 3.02E-11 | 1.17E-09 | 2.78E-07 | 3.02E-11 | 3.02E-11 | 2.67E-09 | 3.02E-11 | 3.02E-11 | 2.57E-11 |
| heart | 2.68E-11 | 2.68E-11 | 2.68E-11 | 2.68E-11 | 2.68E-11 | 2.68E-11 | 2.68E-11 | 1.41E-11 | |
| CMC | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 | 2.88E-11 |
| Vowel | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.02E-11 | 3.56E-04 |
| Two-moon | 1.92E-09 | 1.95E-09 | 1.93E-10 | 1.21E-12 | 1.21E-12 | 1.21E-12 | 1.21E-12 | 1.15E-12 | 1.69E-14 |
| Aggregation | 3.71E-05 | 3.02E-11 | 1.49E-06 | 3.02E-11 | 9.92E-11 | 3.02E-11 | 1.49E-04 | 6.38E-03 | 1.40E-09 |
Results of ranks obtained by the Friedman test
| Datasets | GBEHO | k-means | PSO | DE | GA | CS | GSA | BA | QALO-K | GWOTS |
|---|---|---|---|---|---|---|---|---|---|---|
| Iris | 1.00 | 10.00 | 2.00 | 5.00 | 4.00 | 3.00 | 8.00 | 9.00 | 6.00 | 7.00 |
| Wine | 1.00 | 10.00 | 2.00 | 4.00 | 3.00 | 9.00 | 8.00 | 7.00 | 5.00 | 6.00 |
| Seeds | 1.00 | 3.00 | 6.00 | 7.00 | 2.00 | 10.00 | 8.00 | 9.00 | 5.00 | 4.00 |
| breast | 1.00 | 7.00 | 2.00 | 3.00 | 4.00 | 10.00 | 9.00 | 6.00 | 5.00 | 8.00 |
| heart | 1.00 | 10.00 | 2.00 | 3.00 | 4.00 | 9.00 | 8.00 | 7.00 | 5.00 | 6.00 |
| CMC | 1.00 | 4.00 | 2.00 | 6.00 | 3.00 | 9.00 | 5.00 | 10.00 | 8.00 | 7.00 |
| Vowel | 1.00 | 2.00 | 3.00 | 8.00 | 4.00 | 9.00 | 7.00 | 10.00 | 6.00 | 5.00 |
| Two-moon | 2.00 | 8.00 | 1.00 | 5.00 | 7.00 | 9.00 | 4.00 | 10.00 | 6.00 | 3.00 |
| Aggregation | 1.00 | 9.00 | 6.00 | 8.00 | 5.00 | 7.00 | 4.00 | 10.00 | 3.00 | 2.00 |
| Avg.rank | 1.11 | 7.00 | 2.89 | 5.44 | 4.00 | 8.33 | 6.78 | 8.67 | 5.44 | 5.33 |
| Overall rank | 1.00 | 8.00 | 2.00 | 5.00 | 3.00 | 9.00 | 7.00 | 10.00 | 5.00 | 4.00 |
Fig. 6The original distribution
Fig. 7Comparison of the clustering results on the Iris dataset
Fig. 8Comparison of clustering results on the Seeds dataset
Fig. 9Comparison of clustering results on the Aggregation dataset
Parameter settings of the different algorithms
| Algorithm | Parameter | Range |
|---|---|---|
| CSOS | 30 | |
| 500 | ||
| Hybrid FCM-PSO | 2 | |
| 1 | ||
| 2 | ||
| 2 | ||
| 30 | ||
| 500 | ||
| ACLSHMS | 5 | |
| 1 | ||
| 30 | ||
| 500 | ||
| KIGSA-C | 100 | |
| 30 | ||
| 500 |
Legend:
N: The size of population, Maxit: Maximum number of iterations,
ω: Inertia weight, c1: Cognitive coefficient,
c2: Social coefficient, m: Fuzzifier constant,
k: Number of clusters in bid grouping, C: Moving coefficient, G0: Gconstant
Results of GBEHO with PSR = 0.2 versus others state-of-the-art techniques
| Algorithm | ||||||
|---|---|---|---|---|---|---|
| Dataset | Measure | GBEHO | CSOS | Hybrid FCM-PSO | ACLSHMS | KIGSA-C |
| Wine | AR | 0.8263 | 0.7191 | 0.6707 | 0.8041 | 0.7866 |
| SP | 0.7528 | 0.6895 | 0.6512 | 0.7352 | 0.7194 | |
| DR | 0.8536 | 0.7618 | 0.8098 | 0.8412 | 0.7985 | |
| F1 | 0.8254 | 0.6575 | 0.7418 | 0.7882 | 0.8583 | |
| breast | AR | 0.3588 | 0.3276 | 0.3012 | 0.3367 | 0.3693 |
| SP | 0.2353 | 0.2112 | 0.2454 | 0.2336 | 0.3518 | |
| DR | 0.5637 | 0.5098 | 0.4839 | 0.5567 | 0.6147 | |
| F1 | 0.4412 | 0.2786 | 0.3108 | 0.4148 | 0.4685 | |
| CMC | AR | 0.5688 | 0.5174 | 0.5236 | 0.5472 | 0.5334 |
| SP | 0.5712 | 0.5568 | 0.5384 | 0.5611 | 0.5598 | |
| DR | 0.4396 | 0.3748 | 0.3982 | 0.4157 | 0.4049 | |
| F1 | 0.4936 | 0.3415 | 0.4165 | 0.4577 | 0.5103 | |
| heart | AR | 0.6111 | 0.6457 | 0.6329 | 0.584 | 0.6382 |
| SP | 0.5823 | 0.6212 | 0.6035 | 0.5426 | 0.6096 | |
| DR | 0.6533 | 0.7083 | 0.6726 | 0.6121 | 0.6231 | |
| F1 | 0.6512 | 0.6788 | 0.6812 | 0.5987 | 0.6989 | |
| Vowel | AR | 0.1875 | 0.1126 | 0.1667 | 0.2012 | 0.2379 |
| SP | 0.1812 | 0.1297 | 0.1482 | 0.2139 | 0.2854 | |
| DR | 0.625 | 0.4631 | 0.5052 | 0.4667 | 0.6898 | |
| F1 | 0.1664 | 0.1258 | 0.1895 | 0.1978 | 0.2512 | |
Fig. 10Comparison results of 5 algorithms on different datasets