Literature DB >> 26721666

Examining the effect of initialization strategies on the performance of Gaussian mixture modeling.

Emilie Shireman1, Douglas Steinley2, Michael J Brusco3.   

Abstract

Mixture modeling is a popular technique for identifying unobserved subpopulations (e.g., components) within a data set, with Gaussian (normal) mixture modeling being the form most widely used. Generally, the parameters of these Gaussian mixtures cannot be estimated in closed form, so estimates are typically obtained via an iterative process. The most common estimation procedure is maximum likelihood via the expectation-maximization (EM) algorithm. Like many approaches for identifying subpopulations, finite mixture modeling can suffer from locally optimal solutions, and the final parameter estimates are dependent on the initial starting values of the EM algorithm. Initial values have been shown to significantly impact the quality of the solution, and researchers have proposed several approaches for selecting the set of starting values. Five techniques for obtaining starting values that are implemented in popular software packages are compared. Their performances are assessed in terms of the following four measures: (1) the ability to find the best observed solution, (2) settling on a solution that classifies observations correctly, (3) the number of local solutions found by each technique, and (4) the speed at which the start values are obtained. On the basis of these results, a set of recommendations is provided to the user.

Entities:  

Keywords:  Algorithm initialization; EM algorithm; Mixture modeling

Mesh:

Year:  2017        PMID: 26721666      PMCID: PMC4930421          DOI: 10.3758/s13428-015-0697-6

Source DB:  PubMed          Journal:  Behav Res Methods        ISSN: 1554-351X


  14 in total

1.  Properties of the Hubert-Arabie adjusted Rand index.

Authors:  Douglas Steinley
Journal:  Psychol Methods       Date:  2004-09

2.  Mixture Model Tests Of Hierarchical Clustering Algorithms: The Problem Of Classifying Everybody.

Authors:  C Edelbrock
Journal:  Multivariate Behav Res       Date:  1979-07-01       Impact factor: 5.923

3.  A latent class analysis of DSM-IV alcohol use disorder criteria and binge drinking in undergraduates.

Authors:  Cheryl L Beseler; Laura A Taylor; Deborah T Kraemer; Robert F Leeman
Journal:  Alcohol Clin Exp Res       Date:  2011-10-17       Impact factor: 3.455

4.  Investigating population heterogeneity with factor mixture models.

Authors:  Gitta H Lubke; Bengt Muthén
Journal:  Psychol Methods       Date:  2005-03

5.  Profiling local optima in K-means clustering: developing a diagnostic technique.

Authors:  Douglas Steinley
Journal:  Psychol Methods       Date:  2006-06

6.  Local solutions in the estimation of growth mixture models.

Authors:  John R Hipp; Daniel J Bauer
Journal:  Psychol Methods       Date:  2006-03

7.  K-means clustering: a half-century synthesis.

Authors:  Douglas Steinley
Journal:  Br J Math Stat Psychol       Date:  2006-05       Impact factor: 3.380

8.  Evaluating mixture modeling for clustering: recommendations and cautions.

Authors:  Douglas Steinley; Michael J Brusco
Journal:  Psychol Methods       Date:  2011-03

9.  Local Optima in Mixture Modeling.

Authors:  Emilie M Shireman; Douglas Steinley; Michael J Brusco
Journal:  Multivariate Behav Res       Date:  2016 Jul-Aug       Impact factor: 5.923

10.  Toward an alcohol use disorder continuum using item response theory: results from the National Epidemiologic Survey on Alcohol and Related Conditions.

Authors:  Tulshi D Saha; S Patricia Chou; Bridget F Grant
Journal:  Psychol Med       Date:  2006-03-27       Impact factor: 7.723

View more
  5 in total

Review 1.  Evaluation of functional MRI-based human brain parcellation: a review.

Authors:  Pantea Moghimi; Anh The Dang; Quan Do; Theoden I Netoff; Kelvin O Lim; Gowtham Atluri
Journal:  J Neurophysiol       Date:  2022-06-08       Impact factor: 2.974

2.  Estimating a panel MSK dataset for comparative analyses of national absorptive capacity systems, economic growth, and development in low and middle income countries.

Authors:  Muhammad Salar Khan
Journal:  PLoS One       Date:  2022-10-20       Impact factor: 3.752

3.  The world cancer patient population (WCPP): An updated standard for international comparisons of population-based survival.

Authors:  Adalberto Miranda-Filho; Freddie Bray; Hadrien Charvat; Swaminathan Rajaraman; Isabelle Soerjomataram
Journal:  Cancer Epidemiol       Date:  2020-09-14       Impact factor: 2.984

4.  Neuronal classification from network connectivity via adjacency spectral embedding.

Authors:  Ketan Mehta; Rebecca F Goldin; David Marchette; Joshua T Vogelstein; Carey E Priebe; Giorgio A Ascoli
Journal:  Netw Neurosci       Date:  2021-08-30

5.  Development of a mixture model allowing for smoothing functions of longitudinal trajectories.

Authors:  Ming Ding; Jorge E Chavarro; Garrett M Fitzmaurice
Journal:  Stat Methods Med Res       Date:  2020-10-27       Impact factor: 3.021

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.