Literature DB >> 26393409

A two-step semiparametric method to accommodate sampling weights in multiple imputation.

Hanzhi Zhou1, Michael R Elliott2,3, Trviellore E Raghunathan2,3.   

Abstract

Multiple imputation (MI) is a well-established method to handle item-nonresponse in sample surveys. Survey data obtained from complex sampling designs often involve features that include unequal probability of selection. MI requires imputation to be congenial, that is, for the imputations to come from a Bayesian predictive distribution and for the observed and complete data estimator to equal the posterior mean given the observed or complete data, and similarly for the observed and complete variance estimator to equal the posterior variance given the observed or complete data; more colloquially, the analyst and imputer make similar modeling assumptions. Yet multiply imputed data sets from complex sample designs with unequal sampling weights are typically imputed under simple random sampling assumptions and then analyzed using methods that account for the sampling weights. This is a setting in which the analyst assumes more than the imputer, which can led to biased estimates and anti-conservative inference. Less commonly used alternatives such as including case weights as predictors in the imputation model typically require interaction terms for more complex estimators such as regression coefficients, and can be vulnerable to model misspecification and difficult to implement. We develop a simple two-step MI framework that accounts for sampling weights using a weighted finite population Bayesian bootstrap method to validly impute the whole population (including item nonresponse) from the observed data. In the second step, having generated posterior predictive distributions of the entire population, we use standard IID imputation to handle the item nonresponse. Simulation results show that the proposed method has good frequentist properties and is robust to model misspecification compared to alternative approaches. We apply the proposed method to accommodate missing data in the Behavioral Risk Factor Surveillance System when estimating means and parameters of regression models.
© 2015, The International Biometric Society.

Entities:  

Keywords:  Bayesian bootstrap; Behavioral Risk Factor Surveillance System (BRFSS); Missing data; Polya posterior; Sampling design

Mesh:

Year:  2015        PMID: 26393409      PMCID: PMC6058975          DOI: 10.1111/biom.12413

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  3 in total

1.  A nonparametric method to generate synthetic populations to adjust for complex sampling design features.

Authors:  Qi Dong; Michael R Elliott; Trivellore E Raghunathan
Journal:  Surv Methodol       Date:  2014-06-27       Impact factor: 0.378

2.  Bayesian penalized spline model-based inference for finite population proportion in unequal probability sampling.

Authors:  Qixuan Chen; Michael R Elliott; Roderick J A Little
Journal:  Surv Methodol       Date:  2010-06-29       Impact factor: 0.378

3.  Combining multiple imputation and inverse-probability weighting.

Authors:  Shaun R Seaman; Ian R White; Andrew J Copas; Leah Li
Journal:  Biometrics       Date:  2011-11-03       Impact factor: 2.571

  3 in total
  1 in total

1.  Multiple imputation with missing data indicators.

Authors:  Lauren J Beesley; Irina Bondarenko; Michael R Elliot; Allison W Kurian; Steven J Katz; Jeremy Mg Taylor
Journal:  Stat Methods Med Res       Date:  2021-10-13       Impact factor: 2.494

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.