Literature DB >> 8068845

Bias in misspecified mixtures.

G Gray1.   

Abstract

A finite mixture is a distribution where a given observation can come from any of a finite set of components. That is, the density of the random variable X is of the form f(x) = pi 1f1(x) + pi 2f2(x) + ... + pi kfk(x), where the pi i are the mixing proportions and the fi are the component densities. Mixture models are common in many areas of biology; the most commonly applied is a mixture of normal densities. Many of the problems with inference in the mixture setting are well known. Not so well documented, however, are the extreme biases that can occur in the maximum likelihood estimators (MLEs) when there is model misspecification. This paper shows that even the seemingly innocuous assumption of equal variances for the components of the mixture can lead to surprisingly large asymptotic biases in the MLEs of the parameters. Assuming normality when the underlying distributions are skewed can also lead to strong biases. We explicitly calculate the asymptotic biases when maximum likelihood is carried out assuming normality for several types of true underlying distribution. If the true distribution is a mixture of skewed components, then an application of the Box-Cox power transformation can reduce the asymptotic bias substantially. The power lambda in the Box-Cox transformation is in this case treated as an additional parameter to be estimated. In many cases the bias can be reduced to acceptable levels, thus leading to meaningful inference. A modest Monte Carlo study gives an indication of the small-sample performance of inference procedures (including the power and level of likelihood ratio tests) based on a likelihood that incorporates estimation of lambda. A real data example illustrates the method.

Entities:  

Mesh:

Year:  1994        PMID: 8068845

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  3 in total

1.  Genetic algorithms for finite mixture model based voxel classification in neuroimaging.

Authors:  Jussi Tohka; Evgeny Krestyannikov; Ivo D Dinov; Allan MacKenzie Graham; David W Shattuck; Ulla Ruotsalainen; Arthur W Toga
Journal:  IEEE Trans Med Imaging       Date:  2007-05       Impact factor: 10.048

2.  The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference: an application to longitudinal modeling.

Authors:  Brianna C Heggeseth; Nicholas P Jewell
Journal:  Stat Med       Date:  2013-01-07       Impact factor: 2.373

3.  Re-interpreting conventional interval estimates taking into account bias and extra-variation.

Authors:  Michael Höfler; Shaun R Seaman
Journal:  BMC Med Res Methodol       Date:  2006-10-16       Impact factor: 4.615

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.