| Literature DB >> 35237309 |
Cong Zhang1, Jing Xue2, Xiaoqing Gu3.
Abstract
With the rapid development of artificial intelligence, various medical devices and wearable devices have emerged, enabling people to collect various health data of themselves in hospitals or other places. This has led to a substantial increase in the scale of medical data, and it is impossible to import these data into memory at one time. As a result, the hardware requirements of the computer become higher and the time consumption increases. This paper introduces an online clustering framework, divides the large data set into several small data blocks, processes each data block by weighting clustering, and obtains the cluster center and corresponding weight of each data block. Finally, the final cluster center is obtained by processing these cluster centers and corresponding weights, so as to accelerate clustering processing and reduce memory consumption. Extensive experiments are performed on UCI standard database, real cancer data set, and brain CT image data set. The experimental results show that the proposed method is superior to previous methods in less time consumption and good clustering performance.Entities:
Mesh:
Year: 2022 PMID: 35237309 PMCID: PMC8885256 DOI: 10.1155/2022/6168785
Source DB: PubMed Journal: Comput Intell Neurosci
Clustering performance on Armstrong-2002-v2, Bhattacharjee-2001, and Heart Disease data sets.
| Data sets | Algorithms | Accuracy | Entropy |
| Purity |
|---|---|---|---|---|---|
|
| OFCM | 0.7235 | 0.4728 | 0.7948 | 0.7331 |
| SPFCM | 0.7237 | 0.4697 |
| 0.7372 | |
| OWBFC |
|
| 0.7964 |
| |
|
| |||||
|
| OFCM | 0.8213 | 0.2879 | 0.8637 | 0.8235 |
| SPFCM | 0.8635 | 0.2455 | 0.9294 | 0.8769 | |
| OWBFC |
|
|
|
| |
|
| |||||
|
| OFCM | 0.7643 | 0.4675 | 0.7921 | 0.7039 |
| SPFCM | 0.7659 | 0.4678 | 0.7914 | 0.7054 | |
| OWBFC |
|
|
|
| |
The best average performances are shown in bold type in Tables 1–7.
Accuracy (mean/max/min) on the DRD and HCV data sets (%).
| DRD data set | |||
|---|---|---|---|
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 73.16/73.32/72.95 | 73.26/73.37/73.18 |
|
| 10 | 73.34/74.44/73.12 | 73.37/73.54/73.15 |
|
| 50 | 73.53/73.64/73.31 | 73.51/73.62/73.43 |
|
| HCV data set | |||
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 74.16/74.41/74.11 | 74.17/74.31/74.09 |
|
| 10 | 74.29/74.44/74.16 | 74.24/74.42/74.13 |
|
| 50 | 74.37/74.48/74.31 | 74.33/74.47/74.25 |
|
Entropy (mean/max/ min) on the DRD and HCV data sets (%).
| DRD data set | |||
|---|---|---|---|
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 47.72/47.88/47.65 | 47.77/47.92/47.61 |
|
| 10 | 47.75/47.89/47.67 | 47.77/47.91/47.63 |
|
| 50 | 47.81/47.95/47.76 | 47.82/47.97/47.74 |
|
| HCV data set | |||
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 46.65/46.78/46.52 | 46.79/46.87/46.68 |
|
| 10 | 46.69/46.78/46.53 | 46.81/46.89/46.72 |
|
| 50 | 46.71/46.82/46.66 | 46.88/46.96/46.75 |
|
F-measure (mean/max/min) on the DRD and HCV data sets (%).
| DRD data set | |||
|---|---|---|---|
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 77.96/78.13/77.81 | 77.93/78.15/77.82 |
|
| 10 | 77.96/78.15/77.81 | 77.96/78.17/77.84 |
|
| 50 | 78.04/78.21/77.85 | 77.98/78.22/77.88 |
|
| HCV data set | |||
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 75.02/75.14/74.93 | 75.02/75.16/74.89 |
|
| 10 | 75.13/75.25/75.02 | 75.10/75.24/74.90 |
|
| 50 | 75.17/75.31/75.05 | 75.16/75.36/75.03 |
|
Purity (mean/max/min) on the DRD and HCV data sets (%).
| DRD data set | |||
|---|---|---|---|
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 74.58/74.63/74.34 | 74.55/74.67/74.43 |
|
| 10 | 74.64/74.72/74.49 | 74.59/74.75/74.48 |
|
| 50 | 74.72/74.88/74.62 | 74.66/74.81/74.57 |
|
| HCV data set | |||
| Block size | OFCM | SPFCM | OWBFC |
| 5 | 74.61/74.79/74.48 | 74.55/74.67/74.38 |
|
| 10 | 74.67/74.83/74.54 | 74.60/74.77/74.45 |
|
| 50 | 74.71/74.88/74.62 | 74.66/74.79/74.52 |
|
OWBFC running time on different block ratios (s).
| Data sets | Block size |
| |||||
|---|---|---|---|---|---|---|---|
| 100% | 50% | 10% | 5% | 100%/50% | 100%/10% | 100%/5% | |
| DRD | 85.31 | 67.54 | 15.67 | 12.51 | 1.26 | 5.44 | 6.82 |
| HCV | 97.25 | 70.26 | 17.48 | 13.16 | 1.38 | 5.56 | 7.38 |
Figure 1Three brain images used in the experiment. (a) cta, (b) ctb, and (c) ctc.
Figure 2Clustering results of BFC on three brain images. (a) cta, (b) ctb, and (c) ctc.
Figure 3Clustering results of OWBFC on three brain images. (a) cta, (b) ctb, and (c) ctc.
Clustering results of BFC and OWBFC on three brain images (%).
| Data sets | Methods | Accuracy | Entropy |
| Purity |
|---|---|---|---|---|---|
|
| BFC | 87.54 | 23.64 | 91.26 | 87.43 |
| OWBFC |
|
|
|
| |
|
| |||||
|
| BFC | 88.65 | 21.38 | 91.97 | 88.76 |
| OWBFC |
|
|
|
| |
|
| |||||
|
| BFC | 86.91 | 24.33 | 90.65 | 87.15 |
| OWBFC |
|
|
|
| |
Running time on three brain images (s).
| Image | BFC | OWBFC |
|---|---|---|
| Cta | 1224.23 |
|
| Ctb | 1231.14 |
|
| Ctc | 1219.56 |
|