| Literature DB >> 25477953 |
Jian Zhang1, Ling Shen2.
Abstract
To organize the wide variety of data sets automatically and acquire accurate classification, this paper presents a modified fuzzy c-means algorithm (SP-FCM) based on particle swarm optimization (PSO) and shadowed sets to perform feature clustering. SP-FCM introduces the global search property of PSO to deal with the problem of premature convergence of conventional fuzzy clustering, utilizes vagueness balance property of shadowed sets to handle overlapping among clusters, and models uncertainty in class boundaries. This new method uses Xie-Beni index as cluster validity and automatically finds the optimal cluster number within a specific range with cluster partitions that provide compact and well-separated clusters. Experiments show that the proposed approach significantly improves the clustering effect.Entities:
Mesh:
Year: 2014 PMID: 25477953 PMCID: PMC4244935 DOI: 10.1155/2014/368628
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Shadowed sets induced by fuzzy function f(x).
Algorithm 1SP-FCM.
Performance of FCM, RCM, SCM, SRCM, and SP-FCM on four UCI data sets.
| Different indices | Algorithm | Data sets | |||
|---|---|---|---|---|---|
| Iris | Wine | Glass | Ionosphere | ||
| DB index | FCM | 0.7642 | 0.8803 | 2.2971 | 2.0587 |
| RCM | 0.6875 | 0.5692 | 1.9635 | 1.5434 | |
| SCM | 0.6862 | 0.5327 | 1.8495 | 1.4763 | |
| SRCM | 0.6613 | 0.4436 | 1.5804 | 1.3971 | |
| SP-FCM | 0.6574 | 0.4328 | 1.5237 | 1.4066 | |
|
| |||||
| Dunn index | FCM | 2.3106 | 2.5834 | 0.1142 | 0.8381 |
| RCM | 2.7119 | 2.8157 | 0.2637 | 1.0233 | |
| SCM | 2.4801 | 2.7992 | 0.3150 | 1.0319 | |
| SRCM | 3.0874 | 3.1342 | 0.5108 | 1.1924 | |
| SP-FCM | 3.3254 | 3.1764 | 0.4921 | 1.2605 | |
Figure 2XB validity index of four UCI data sets with cluster number C.
Performance of FCM, RCM, SCM, SRCM, and SP-FCM on four yeast expression data sets.
| Different indices | Algorithm | Data sets | |||
|---|---|---|---|---|---|
| GDS608 | GDS2003 | GDS2267 | GDS2712 | ||
| DB index | FCM | 2.0861 | 2.4671 | 1.5916 | 1.9526 |
| RCM | 1.6109 | 2.2104 | 1.0274 | 1.2058 | |
| SCM | 1.5938 | 2.1346 | 0.8946 | 1.0965 | |
| SRCM | 1.3274 | 1.9523 | 0.7438 | 0.7326 | |
| SP-FCM | 1.2958 | 1.8946 | 0.7962 | 0.6843 | |
|
| |||||
| Dunn index | FCM | 0.2647 | 0.2976 | 0.4208 | 0.3519 |
| RCM | 0.3789 | 0.3981 | 0.7164 | 0.6074 | |
| SCM | 0.3865 | 0.3775 | 0.8439 | 0.6207 | |
| SRCM | 0.5126 | 0.4953 | 0.9759 | 0.8113 | |
| SP-FCM | 0.5407 | 0.5026 | 0.9182 | 0.8049 | |
Figure 3XB validity index of four yeast gene expression data sets with cluster number C.
Figure 4Ten package images with SIFT features.
Figure 5XB validity index of bag data set with cluster number C.
Performance of FCM, RCM, SCM, SRCM, and SP-FCM on package datasets.
| Different indices | Algorithms | ||||
|---|---|---|---|---|---|
| FCM | RCM | SCM | SRCM | SP-FCM | |
| DB index | 184.569 | 159.671 | 143.194 | 124.038 | 107.826 |
| Dunn index | 92.647 | 116.298 | 125.376 | 169.422 | 167.313 |