Literature DB >> 34585424

Combining multiple imputation with raking of weights: An efficient and robust approach in the setting of nearly true models.

Kyunghee Han1, Pamela A Shaw1, Thomas Lumley2.   

Abstract

Multiple imputation (MI) provides us with efficient estimators in model-based methods for handling missing data under the true model. It is also well-understood that design-based estimators are robust methods that do not require accurately modeling the missing data; however, they can be inefficient. In any applied setting, it is difficult to know whether a missing data model may be good enough to win the bias-efficiency trade-off. Raking of weights is one approach that relies on constructing an auxiliary variable from data observed on the full cohort, which is then used to adjust the weights for the usual Horvitz-Thompson estimator. Computing the optimally efficient raking estimator requires evaluating the expectation of the efficient score given the full cohort data, which is generally infeasible. We demonstrate MI as a practical method to compute a raking estimator that will be optimal. We compare this estimator to common parametric and semi-parametric estimators, including standard MI. We show that while estimators, such as the semi-parametric maximum likelihood and MI estimator, obtain optimal performance under the true model, the proposed raking estimator utilizing MI maintains a better robustness-efficiency trade-off even under mild model misspecification. We also show that the standard raking estimator, without MI, is often competitive with the optimal raking estimator. We demonstrate these properties through several numerical examples and provide a theoretical discussion of conditions for asymptotically superior relative efficiency of the proposed raking estimator.
© 2021 John Wiley & Sons Ltd.

Entities:  

Keywords:  auxiliary variable; design-based estimation; model misspecifiation; multiple imputation; nearly true model; raking

Mesh:

Year:  2021        PMID: 34585424      PMCID: PMC8963275          DOI: 10.1002/sim.9210

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  11 in total

1.  Using the whole cohort in the analysis of countermatched samples.

Authors:  C Rivera; T Lumley
Journal:  Biometrics       Date:  2015-09-22       Impact factor: 2.571

2.  Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors:  Thomas Lumley; Pamela A Shaw; James Y Dai
Journal:  Int Stat Rev       Date:  2011-08       Impact factor: 2.217

3.  Improved double-robust estimation in missing data and causal inference models.

Authors:  Andrea Rotnitzky; Quanhong Lei; Mariela Sued; James M Robins
Journal:  Biometrika       Date:  2012-04-29       Impact factor: 2.445

4.  Multiple imputation analysis of case-cohort studies.

Authors:  Helena Marti; Michel Chavance
Journal:  Stat Med       Date:  2011-02-24       Impact factor: 2.373

5.  Fitting additive hazards models for case-cohort studies: a multiple imputation approach.

Authors:  Jinhyouk Jung; Ofer Harel; Sangwook Kang
Journal:  Stat Med       Date:  2015-07-20       Impact factor: 2.373

6.  Using full-cohort data in nested case-control and case-cohort studies by multiple imputation.

Authors:  Ruth H Keogh; Ian R White
Journal:  Stat Med       Date:  2013-04-23       Impact factor: 2.373

7.  Improved Horvitz-Thompson Estimation of Model Parameters from Two-phase Stratified Samples: Applications in Epidemiology.

Authors:  Norman E Breslow; Thomas Lumley; Christie M Ballantyne; Lloyd E Chambless; Michal Kulich
Journal:  Stat Biosci       Date:  2009-05-01

8.  Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data.

Authors:  Weihua Cao; Anastasios A Tsiatis; Marie Davidian
Journal:  Biometrika       Date:  2009-08-07       Impact factor: 2.445

9.  Combining multiple imputation and inverse-probability weighting.

Authors:  Shaun R Seaman; Ian R White; Andrew J Copas; Leah Li
Journal:  Biometrics       Date:  2011-11-03       Impact factor: 2.571

10.  Tuning multiple imputation by predictive mean matching and local residual draws.

Authors:  Tim P Morris; Ian R White; Patrick Royston
Journal:  BMC Med Res Methodol       Date:  2014-06-05       Impact factor: 4.615

View more
  2 in total

1.  Optimal sampling for design-based estimators of regression models.

Authors:  Tong Chen; Thomas Lumley
Journal:  Stat Med       Date:  2022-01-06       Impact factor: 2.373

2.  Two-Phase Sampling Designs for Data Validation in Settings with Covariate Measurement Error and Continuous Outcome.

Authors:  Gustavo Amorim; Ran Tao; Sarah Lotspeich; Pamela A Shaw; Thomas Lumley; Bryan E Shepherd
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2021-04-15       Impact factor: 2.175

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.