Literature DB >> 14596492

Local optima in K-means clustering: what you don't know may hurt you.

Douglas Steinley1.   

Abstract

The popular K-means clustering method, as implemented in 3 commercial software packages (SPSS, SYSTAT, and SAS), generally provides solutions that are only locally optimal for a given set of data. Because none of these commercial implementations offer a reasonable mechanism to begin the K-means method at alternative starting points, separate routines were written within the MATLAB (Math-Works, 1999) environment that can be initialized randomly (these routines are provided at the end of the online version of this article in the PsycARTICLES database). Through the analysis of 2 empirical data sets and 810 simulated data sets, it is shown that the results provided by commercial packages are most likely locally optimal. These results suggest the need for some strategy to study the local optima problem for a specific data set or to identify methods for finding "good" starting values that might lead to the best solutions possible.

Entities:  

Mesh:

Year:  2003        PMID: 14596492     DOI: 10.1037/1082-989X.8.3.294

Source DB:  PubMed          Journal:  Psychol Methods        ISSN: 1082-989X


  31 in total

1.  A model-based cluster analysis approach to adolescent problem behaviors and young adult outcomes.

Authors:  Eun Young Mun; Michael Windle; Lisa M Schainker
Journal:  Dev Psychopathol       Date:  2008

2.  Modeling differences in the dimensionality of multiblock data by means of clusterwise simultaneous component analysis.

Authors:  Kim De Roover; Eva Ceulemans; Marieke E Timmerman; John B Nezlek; Patrick Onghena
Journal:  Psychometrika       Date:  2013-01-25       Impact factor: 2.500

3.  Taxicab Correspondence Analysis.

Authors:  V Choulakian
Journal:  Psychometrika       Date:  2017-02-11       Impact factor: 2.500

4.  A Note on Using the Adjusted Rand Index for Link Prediction in Networks.

Authors:  Michaela Hoffman; Douglas Steinley; Michael J Brusco
Journal:  Soc Networks       Date:  2015-04-06

5.  Local Optima in Mixture Modeling.

Authors:  Emilie M Shireman; Douglas Steinley; Michael J Brusco
Journal:  Multivariate Behav Res       Date:  2016 Jul-Aug       Impact factor: 5.923

6.  A comparison of latent class, K-means, and K-median methods for clustering dichotomous data.

Authors:  Michael J Brusco; Emilie Shireman; Douglas Steinley
Journal:  Psychol Methods       Date:  2016-09-08

7.  Psychosocial costs of racism to Whites: Understanding patterns among university students.

Authors:  Lisa B Spanierman; Nathan R Todd; Carolyn J Anderson
Journal:  J Couns Psychol       Date:  2009-04

8.  KSC-N: Clustering of Hierarchical Time Profile Data.

Authors:  Joke Heylen; Iven Van Mechelen; Philippe Verduyn; Eva Ceulemans
Journal:  Psychometrika       Date:  2014-12-10       Impact factor: 2.500

9.  Identifying subtypes of criminal psychopaths: A replication and extension.

Authors:  Marc T Swogger; David S Kosson
Journal:  Crim Justice Behav       Date:  2007

10.  Psychopathy Subtypes among African American County Jail Inmates.

Authors:  Marc T Swogger; Zach Walsh; David S Kosson
Journal:  Crim Justice Behav       Date:  2008
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.