Literature DB >> 19693282

Model Selection Criteria for Missing-Data Problems Using the EM Algorithm.

Joseph G Ibrahim1, Hongtu Zhu, Niansheng Tang.   

Abstract

We consider novel methods for the computation of model selection criteria in missing-data problems based on the output of the EM algorithm. The methodology is very general and can be applied to numerous situations involving incomplete data within an EM framework, from covariates missing at random in arbitrary regression models to nonignorably missing longitudinal responses and/or covariates. Toward this goal, we develop a class of information criteria for missing-data problems, called IC(H) (,) (Q), which yields the Akaike information criterion and the Bayesian information criterion as special cases. The computation of IC(H) (,) (Q) requires an analytic approximation to a complicated function, called the H-function, along with output from the EM algorithm used in obtaining maximum likelihood estimates. The approximation to the H-function leads to a large class of information criteria, called IC(H̃) (() (k) (),) (Q). Theoretical properties of IC(H̃) (() (k) (),) (Q), including consistency, are investigated in detail. To eliminate the analytic approximation to the H-function, a computationally simpler approximation to IC(H) (,) (Q), called IC(Q), is proposed, the computation of which depends solely on the Q-function of the EM algorithm. Advantages and disadvantages of IC(H̃) (() (k) (),) (Q) and IC(Q) are discussed and examined in detail in the context of missing-data problems. Extensive simulations are given to demonstrate the methodology and examine the small-sample and large-sample performance of IC(H̃) (() (k) (),) (Q) and IC(Q) in missing-data problems. An AIDS data set also is presented to illustrate the proposed methodology.

Entities:  

Year:  2008        PMID: 19693282      PMCID: PMC2728244          DOI: 10.1198/016214508000001057

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  6 in total

1.  Sensitivity analysis for nonrandom dropout: a local influence approach.

Authors:  G Verbeke; G Molenberghs; H Thijs; E Lesaffre; M G Kenward
Journal:  Biometrics       Date:  2001-03       Impact factor: 2.571

2.  Monte Carlo EM for missing covariates in parametric regression models.

Authors:  J G Ibrahim; M H Chen; S R Lipsitz
Journal:  Biometrics       Date:  1999-06       Impact factor: 2.571

3.  A local influence approach applied to binary data from a psychiatric study.

Authors:  Ivy Jansen; Geert Molenberghs; Marc Aerts; Herbert Thijs; Kristel Van Steen
Journal:  Biometrics       Date:  2003-06       Impact factor: 2.571

4.  Bayesian analysis for generalized linear models with nonignorably missing covariates.

Authors:  Lan Huang; Ming-Hui Chen; Joseph G Ibrahim
Journal:  Biometrics       Date:  2005-09       Impact factor: 2.571

5.  The effects of establishment practices, knowledge and attitudes on condom use among Filipina sex workers.

Authors:  D E Morisky; T V Tiglao; C D Sneed; S B Tempongko; J C Baltazar; R Detels; J A Stein
Journal:  AIDS Care       Date:  1998-04

6.  Parameter estimation from incomplete data in binomial regression when the missing data mechanism is nonignorable.

Authors:  J G Ibrahim; S R Lipsitz
Journal:  Biometrics       Date:  1996-09       Impact factor: 2.571

  6 in total
  16 in total

1.  Positing, fitting, and selecting regression models for pooled biomarker data.

Authors:  Emily M Mitchell; Robert H Lyles; Enrique F Schisterman
Journal:  Stat Med       Date:  2015-04-06       Impact factor: 2.373

2.  Asymptotics of AIC, BIC, and RMSEA for Model Selection in Structural Equation Modeling.

Authors:  Po-Hsien Huang
Journal:  Psychometrika       Date:  2017-04-26       Impact factor: 2.500

3.  Efficient Robust Estimation for Linear Models with Missing Response at Random.

Authors:  Man-Lai Tang; Niansheng Tang; Puying Zhao; Hongtu Zhu
Journal:  Scand Stat Theory Appl       Date:  2017-08-30       Impact factor: 1.396

4.  The E-MS Algorithm: Model Selection with Incomplete Data.

Authors:  Jiming Jiang; Thuan Nguyen; J Sunil Rao
Journal:  J Am Stat Assoc       Date:  2014-08-15       Impact factor: 5.033

5.  Bayesian model selection for incomplete data using the posterior predictive distribution.

Authors:  Michael J Daniels; Arkendu S Chatterjee; Chenguang Wang
Journal:  Biometrics       Date:  2012-05-02       Impact factor: 2.571

6.  Latent class models for joint analysis of disease prevalence and high-dimensional semicontinuous biomarker data.

Authors:  Bo Zhang; Zhen Chen; Paul S Albert
Journal:  Biostatistics       Date:  2011-09-10       Impact factor: 5.899

7.  A Bayesian proportional hazards regression model with non-ignorably missing time-varying covariates.

Authors:  Patrick T Bradshaw; Joseph G Ibrahim; Marilie D Gammon
Journal:  Stat Med       Date:  2010-10-20       Impact factor: 2.373

8.  Missing data methods in longitudinal studies: a review.

Authors:  Joseph G Ibrahim; Geert Molenberghs
Journal:  Test (Madr)       Date:  2009-05-01       Impact factor: 2.345

9.  Variable selection in the cox regression model with covariates missing at random.

Authors:  Ramon I Garcia; Joseph G Ibrahim; Hongtu Zhu
Journal:  Biometrics       Date:  2009-05-18       Impact factor: 2.571

10.  Estimating diagnostic accuracy of raters without a gold standard by exploiting a group of experts.

Authors:  Bo Zhang; Zhen Chen; Paul S Albert
Journal:  Biometrics       Date:  2012-09-24       Impact factor: 2.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.