Merging K-means with hierarchical clustering for identifying general-shaped groups.

Anna D Peterson, Arka P Ghosh, Ranjan Maitra.

Abstract

Clustering partitions a dataset so that observations placed together in a group are similar to one another but different from those in other groups. Hierarchical and K-means clustering are two common approaches with different strengths and weaknesses. For instance, hierarchical clustering identifies groups in a tree-like structure but becomes computationally expensive on large datasets, while K-means clustering is efficient but designed to identify homogeneous, spherically shaped clusters. We present a hybrid non-parametric clustering approach that combines the two methods to identify general-shaped clusters and that can be applied to larger datasets. Specifically, we first partition the dataset into spherical groups using K-means. We then merge these groups using hierarchical methods, with a data-driven distance measure as a stopping criterion. Our proposal has the potential to reveal groups with general shapes and structure in a dataset. We demonstrate good performance on several simulated and real datasets.
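
The two-stage idea in the abstract (over-partition with K-means, then merge the pieces hierarchically) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. Several choices here are assumptions made only for illustration: the function names `kmeans` and `merge_groups`, the deterministic every-n-th-point initialization, single linkage as the merging rule, and a user-supplied `threshold` standing in for the paper's data-driven stopping criterion, which the abstract does not specify.

```python
import math


def kmeans(points, k, iters=100):
    """Plain Lloyd's algorithm on a list of coordinate tuples.

    Assumes k <= len(points). Initial centers are every n-th point,
    an illustrative choice that keeps the sketch deterministic.
    """
    step = max(1, len(points) // k)
    centers = points[::step][:k]
    labels = None
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points
        ]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty group keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels


def single_link(a, b):
    """Smallest point-to-point distance between two groups."""
    return min(math.dist(p, q) for p in a for q in b)


def merge_groups(points, labels, threshold):
    """Agglomerate K-means groups by single linkage.

    Merging stops once the closest pair of groups is farther apart than
    `threshold` (a hypothetical stand-in for a data-driven criterion).
    """
    by_label = {}
    for p, lab in zip(points, labels):
        by_label.setdefault(lab, []).append(p)
    groups = list(by_label.values())
    while len(groups) > 1:
        d, i, j = min(
            (single_link(groups[i], groups[j]), i, j)
            for i in range(len(groups))
            for j in range(i + 1, len(groups))
        )
        if d > threshold:
            break
        merged = groups.pop(j)  # j > i, so index i is unaffected
        groups[i].extend(merged)
    return groups


# Two long, thin parallel lines: elongated rather than spherical groups.
line_a = [(float(x), 0.0) for x in range(20)]
line_b = [(float(x), 3.0) for x in range(20)]
pts = line_a + line_b

labels = kmeans(pts, k=8)                         # stage 1: many small pieces
groups = merge_groups(pts, labels, threshold=1.5)  # stage 2: stitch pieces together
print(len(groups))
```

In this toy example, neighbouring points along a line are 1 unit apart and the lines are 3 units apart, so a threshold of 1.5 lets pieces of the same line merge while keeping the two lines separate. The expensive pairwise linkage is computed between a handful of K-means groups rather than between all individual observations, which reflects the scalability argument made in the abstract.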

Keywords:  K-means algorithm; complete linkage; distance measure; hierarchical clustering; single linkage

Year:  2018        PMID: 29736237      PMCID: PMC5935272          DOI: 10.1002/sta4.172

Source DB:  PubMed          Journal:  Stat (Int Stat Inst)        ISSN: 2049-1573


