Literature DB >> 21869081

The effect of cluster size, dimensionality, and the number of clusters on recovery of true cluster structure.

G W Milligan1, S C Soon, L M Sokol.   

Abstract

An evaluation of four clustering methods and four external criterion measures was conducted with respect to the effect of the number of clusters, dimensionality, and relative cluster sizes on the recovery of true cluster structure. The four methods were the single link, complete link, group average (UPGMA), and Ward's minimum variance algorithms. The results indicated that the four criterion measures were generally consistent with each other, of which two highly similar pairs were identified. The tirst pair consisted of the Rand and corrected Rand statistics, and the second pair was the Jaccard and the Fowlkes and Mallows indexes. With respect to the methods, recovery was found to improve as the number of clusters increased and as the number of dimensions increased. The relative cluster size factor produced differential performance effects, with Ward's procedure providing the best recovery when the clusters were of equal size. The group average method gave equivalent or better recovery when the clusters were of unequal size.

Entities:  

Year:  1983        PMID: 21869081     DOI: 10.1109/tpami.1983.4767342

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  7 in total

1.  RSQRT: AN HEURISTIC FOR ESTIMATING THE NUMBER OF CLUSTERS TO REPORT.

Authors:  John Carlis; Kelsey Bruso
Journal:  Electron Commer Res Appl       Date:  2012-03       Impact factor: 6.014

2.  Modeling differences in the dimensionality of multiblock data by means of clusterwise simultaneous component analysis.

Authors:  Kim De Roover; Eva Ceulemans; Marieke E Timmerman; John B Nezlek; Patrick Onghena
Journal:  Psychometrika       Date:  2013-01-25       Impact factor: 2.500

3.  A Note on Using the Adjusted Rand Index for Link Prediction in Networks.

Authors:  Michaela Hoffman; Douglas Steinley; Michael J Brusco
Journal:  Soc Networks       Date:  2015-04-06

4.  KSC-N: Clustering of Hierarchical Time Profile Data.

Authors:  Joke Heylen; Iven Van Mechelen; Philippe Verduyn; Eva Ceulemans
Journal:  Psychometrika       Date:  2014-12-10       Impact factor: 2.500

5.  Clustering Vector Autoregressive Models: Capturing Qualitative Differences in Within-Person Dynamics.

Authors:  Kirsten Bulteel; Francis Tuerlinckx; Annette Brose; Eva Ceulemans
Journal:  Front Psychol       Date:  2016-10-07

6.  Common and cluster-specific simultaneous component analysis.

Authors:  Kim De Roover; Marieke E Timmerman; Batja Mesquita; Eva Ceulemans
Journal:  PLoS One       Date:  2013-05-08       Impact factor: 3.240

Review 7.  Clustering Algorithms: Their Application to Gene Expression Data.

Authors:  Jelili Oyelade; Itunuoluwa Isewon; Funke Oladipupo; Olufemi Aromolaran; Efosa Uwoghiren; Faridah Ameh; Moses Achas; Ezekiel Adebiyi
Journal:  Bioinform Biol Insights       Date:  2016-11-30
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.