| Literature DB >> 34527212 |
Honglei Zhu1, Yingying Zhao1, Xueyun Wang1, Yulong Xu1.
Abstract
Medical data analysis is an important part of intelligent medicine, and clustering analysis is a commonly used method for data analysis of Traditional Chinese Medicine (TCM); however, the classical K-Means algorithm is greatly affected by the selection of initial clustering center, which is easy to fall into the local optimal solution. To avoid this problem, an improved differential evolution clustering algorithm is proposed in this paper. The proposed algorithm selects the initial clustering center randomly, optimizes and locates the clustering center in the process of evolution iteration, and improves the mutation mode of differential evolution to enhance the overall optimization ability, so that the clustering effect can reach the global optimization as far as possible. Three University of California, Irvine (UCI), data sets are selected to compare the clustering effect of the classical K-Means algorithm, the standard DE-K-Means algorithm, the K-Means++ algorithm, and the proposed algorithm. The experimental results show that, in terms of global optimization, the proposed algorithm is obviously superior to the other three algorithms, and in terms of convergence speed, the proposed algorithm is better than DE-K-Means algorithm. Finally, the proposed algorithm is applied to analyze the drug data of Traditional Chinese Medicine in the treatment of pulmonary diseases, and the analysis results are consistent with the theory of Traditional Chinese Medicine.Entities:
Mesh:
Year: 2021 PMID: 34527212 PMCID: PMC8437652 DOI: 10.1155/2021/4468741
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1The central individual μ(g).
Algorithm 1Improved differential evolution clustering algorithm.
The base information of datasets.
| Datasets | Number of data | Number of attributes | Number of classes |
|---|---|---|---|
| Iris | 150 | 4 | 3 |
| Wine | 178 | 13 | 3 |
| Zoo | 101 | 16 | 7 |
The clustering results of Iris.
| Algorithm | Minimum inner-class distance | Maximum inner-class distance | Mean inner-class distance | Mean number of iterations |
|---|---|---|---|---|
| 97.3259 | 123.8497 | 103.042985 | 7.1 | |
| 97.3259 | 122.4787 | 100.461185 | 6.6 | |
| DE- | 96.6555 | 97.3365 | 96.6725675 | 1109.3 |
| Proposed | 96.6555 | 96.6555 | 96.6555 | 549.8 |
The clustering results of Wine.
| Algorithm | Minimum inner-class distance | Maximum inner-class distance | Mean inner-class distance | Mean number of iterations |
|---|---|---|---|---|
| 16555.6794 | 18436.9521 | 16953.75104 | 7.9 | |
| 16555.6794 | 18436.9521 | 17384.2979 | 8.0 | |
| DE- | 16292.1846 | 16295.1591 | 16292.43106 | 1500.0 |
| Proposed | 16292.1846 | 16292.6672 | 16292.19667 | 1319.2 |
The clustering results of Zoo.
| Algorithm | Minimum inner-class distance | Maximum inner-class distance | Mean inner-class distance | Mean number of iterations |
|---|---|---|---|---|
| 101.9719 | 133.4409 | 110.77463 | 5.0 | |
| 101.9719 | 118.4956 | 109.392745 | 3.9 | |
| DE- | 101.3131 | 126.2266 | 106.9885275 | 1500.0 |
| Proposed | 101.1552 | 107.9804 | 104.4135725 | 1500.0 |
Figure 2The standard DE-K-Means Iris convergence curve (a) and the improved DE-K-Means Iris convergence curve (b).
Figure 3The standard DE-K-Means Wine convergence curve (a) and the improved DE-K-Means Wine convergence curve (b).
Figure 4The standard DE-K-Means Zoo convergence curve (a) and the improved DE-K-Means Zoo convergence curve (b).
The clustering results of TCM data.
| Algorithm | Minimum inner-class distance | Maximum inner-class distance | Mean inner-class distance |
|---|---|---|---|
| DE- | 493.9222 | 524.8227 | 501.462525 |
| Proposed | 489.2295 | 507.5878 | 496.5335875 |
Figure 5The convergence curve of DE-K-Means algorithm.
Figure 6The convergence curve of the proposed algorithm.