| Literature DB >> 35707794 |
D Scaldelai1, L C Matioli2, S R Santos1, M Kleina3.
Abstract
In this paper, we propose the MulticlusterKDE algorithm applied to classify elements of a database into categories based on their similarity. MulticlusterKDE is centered on the multiple optimization of the kernel density estimator function with multivariate Gaussian kernel. One of the main features of the proposed algorithm is that the number of clusters is an optional input parameter. Furthermore, it is very simple, easy to implement, well defined and stops at a finite number of steps and it always converges regardless of the data set. We illustrate our findings by implementing the algorithm in R software. The results indicate that the MulticlusterKDE algorithm is competitive when compared to K-means, K-medoids, CLARA, DBSCAN and PdfCluster algorithms. Features such as simplicity and efficiency make the proposed algorithm an attractive and promising research field that can be used as basis for its improvement and also for the development of new density-based clustering algorithms.Entities:
Keywords: Gaussian kernel; Kernel density estimation; clustering data; optimization method; multiclusterKDE
Year: 2020 PMID: 35707794 PMCID: PMC9041763 DOI: 10.1080/02664763.2020.1799958
Source DB: PubMed Journal: J Appl Stat ISSN: 0266-4763 Impact factor: 1.416