| Literature DB >> 29225667 |
Yang Lei1, Dai Yu2, Zhang Bin1, Yang Yang1.
Abstract
Clustering algorithm as a basis of data analysis is widely used in analysis systems. However, as for the high dimensions of the data, the clustering algorithm may overlook the business relation between these dimensions especially in the medical fields. As a result, usually the clustering result may not meet the business goals of the users. Then, in the clustering process, if it can combine the knowledge of the users, that is, the doctor's knowledge or the analysis intent, the clustering result can be more satisfied. In this paper, we propose an interactive K-means clustering method to improve the user's satisfactions towards the result. The core of this method is to get the user's feedback of the clustering result, to optimize the clustering result. Then, a particle swarm optimization algorithm is used in the method to optimize the parameters, especially the weight settings in the clustering algorithm to make it reflect the user's business preference as possible. After that, based on the parameter optimization and adjustment, the clustering result can be closer to the user's requirement. Finally, we take an example in the breast cancer, to testify our method. The experiments show the better performance of our algorithm.Entities:
Mesh:
Year: 2017 PMID: 29225667 PMCID: PMC5684610 DOI: 10.1155/2017/4915828
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Figure 1Framework of the interactive clustering method.
Figure 2Example of interactive clustering.
Algorithm 1PSO algorithm to find the optimal combination of the weights.
Figure 3The prototype system of interactive processing.
Figure 4Clustering results with analysis purpose of worst_radius and worst_concavity.
Figure 5Interactively clustering results with clustering number of 3.
Figure 6Clustering results with analysis purpose of mean_radius and mean_compactness.
Figure 7Interactively clustering results with clustering number of 4.