Literature DB >> 25939490

Posterior predictive checking of multiple imputation models.

Cattram D Nguyen1,2, Katherine J Lee1,2, John B Carlin1,2.   

Abstract

Multiple imputation is gaining popularity as a strategy for handling missing data, but there is a scarcity of tools for checking imputation models, a critical step in model fitting. Posterior predictive checking (PPC) has been recommended as an imputation diagnostic. PPC involves simulating "replicated" data from the posterior predictive distribution of the model under scrutiny. Model fit is assessed by examining whether the analysis from the observed data appears typical of results obtained from the replicates produced by the model. A proposed diagnostic measure is the posterior predictive "p-value", an extreme value of which (i.e., a value close to 0 or 1) suggests a misfit between the model and the data. The aim of this study was to evaluate the performance of the posterior predictive p-value as an imputation diagnostic. Using simulation methods, we deliberately misspecified imputation models to determine whether posterior predictive p-values were effective in identifying these problems. When estimating the regression parameter of interest, we found that more extreme p-values were associated with poorer imputation model performance, although the results highlighted that traditional thresholds for classical p-values do not apply in this context. A shortcoming of the PPC method was its reduced ability to detect misspecified models with increasing amounts of missing data. Despite the limitations of posterior predictive p-values, they appear to have a valuable place in the imputer's toolkit. In addition to automated checking using p-values, we recommend imputers perform graphical checks and examine other summaries of the test quantity distribution.
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Keywords:  Missing data; Model checking; Multiple imputation; Posterior predictive checking; Simulations

Mesh:

Year:  2015        PMID: 25939490     DOI: 10.1002/bimj.201400034

Source DB:  PubMed          Journal:  Biom J        ISSN: 0323-3847            Impact factor:   2.207


  4 in total

1.  The evolution and genetics of sexually dimorphic 'dual' mimicry in the butterfly Elymnias hypermnestra.

Authors:  Dee M Ruttenberg; Nicholas W VanKuren; Sumitha Nallu; Shen-Horn Yen; Djunijanti Peggie; David J Lohman; Marcus R Kronforst
Journal:  Proc Biol Sci       Date:  2021-01-13       Impact factor: 5.349

2.  Accounting for missing data in statistical analyses: multiple imputation is not always the answer.

Authors:  Rachael A Hughes; Jon Heron; Jonathan A C Sterne; Kate Tilling
Journal:  Int J Epidemiol       Date:  2019-08-01       Impact factor: 7.196

3.  Sensitivity analysis in multiple imputation in effectiveness studies of psychotherapy.

Authors:  Aureliano Crameri; Agnes von Wyl; Margit Koemeda; Peter Schulthess; Volker Tschuschke
Journal:  Front Psychol       Date:  2015-07-27

4.  Model checking in multiple imputation: an overview and case study.

Authors:  Cattram D Nguyen; John B Carlin; Katherine J Lee
Journal:  Emerg Themes Epidemiol       Date:  2017-08-23
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.