Literature DB >> 23997380

EM vs MM: A Case Study.

Hua Zhou1, Yiwen Zhang.   

Abstract

The celebrated expectation-maximization (EM) algorithm is one of the most widely used optimization methods in statistics. In recent years it has been realized that EM algorithm is a special case of the more general minorization-maximization (MM) principle. Both algorithms creates a surrogate function in the first (E or M) step that is maximized in the second M step. This two step process always drives the objective function uphill and is iterated until the parameters converge. The two algorithms differ in the way the surrogate function is constructed. The expectation step of the EM algorithm relies on calculating conditional expectations, while the minorization step of the MM algorithm builds on crafty use of inequalities. For many problems, EM and MM derivations yield the same algorithm. This expository note walks through the construction of both algorithms for estimating the parameters of the Dirichlet-Multinomial distribution. This particular case is of interest because EM and MM derivations lead to two different algorithms with completely distinct operating characteristics. The EM algorithm converges fast but involves solving a nontrivial maximization problem in the M step. In contrast the MM updates are extremely simple but converge slowly. An EM-MM hybrid algorithm is derived which shows faster convergence than the MM algorithm in certain parameter regimes. The local convergence rates of the three algorithms are studied theoretically from the unifying MM point of view and also compared on numerical examples.

Entities:  

Keywords:  Convergence rate; Dirichlet-Multinomial distribution; EM algorithm; MM algorithm

Year:  2012        PMID: 23997380      PMCID: PMC3755471          DOI: 10.1016/j.csda.2012.05.018

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  8 in total

1.  On the optimal design of genetic variant discovery studies.

Authors:  Iuliana Ionita-Laza; Nan M Laird
Journal:  Stat Appl Genet Mol Biol       Date:  2010-08-27

2.  Overdispersion in allelic counts and θ-correction in forensic genetics.

Authors:  Torben Tvedebrink
Journal:  Theor Popul Biol       Date:  2010-07-13       Impact factor: 1.570

3.  Fisher information matrix of the Dirichlet-multinomial distribution.

Authors:  Sudhir R Paul; Uditha Balasooriya; Tathagata Banerjee
Journal:  Biom J       Date:  2005-04       Impact factor: 2.207

4.  MM Algorithms for Some Discrete Multivariate Distributions.

Authors:  Hua Zhou; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2010-09-01       Impact factor: 2.302

5.  Dirichlet mixtures: a method for improved detection of weak but significant protein sequence homology.

Authors:  K Sjölander; K Karplus; M Brown; R Hughey; A Krogh; I S Mian; D Haussler
Journal:  Comput Appl Biosci       Date:  1996-08

6.  Modelling overdispersion in toxicological mortality data grouped over time.

Authors:  R J Hines; J F Lawless
Journal:  Biometrics       Date:  1993-03       Impact factor: 2.571

7.  A quasi-Newton acceleration for high-dimensional optimization algorithms.

Authors:  Hua Zhou; David Alexander; Kenneth Lange
Journal:  Stat Comput       Date:  2011-01-04       Impact factor: 2.559

8.  The distribution of fetal death in control mice and its implications on statistical tests for dominant lethal effects.

Authors:  J K Haseman; E R Soares
Journal:  Mutat Res       Date:  1976-12       Impact factor: 2.433

  8 in total
  2 in total

1.  Regression Models For Multivariate Count Data.

Authors:  Yiwen Zhang; Hua Zhou; Jin Zhou; Wei Sun
Journal:  J Comput Graph Stat       Date:  2017-02-16       Impact factor: 2.302

2.  A Brief Survey of Modern Optimization for Statisticians.

Authors:  Kenneth Lange; Eric C Chi; Hua Zhou
Journal:  Int Stat Rev       Date:  2014-04-01       Impact factor: 2.217

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.