| Literature DB >> 23703895 |
Stephen Burgess1, Ian R White, Matthieu Resche-Rigon, Angela M Wood.
Abstract
Multiple imputation is a strategy for the analysis of incomplete data such that the impact of the missingness on the power and bias of estimates is mitigated. When data from multiple studies are collated, we can propose both within-study and multilevel imputation models to impute missing data on covariates. It is not clear how to choose between imputation models or how to combine imputation and inverse-variance weighted meta-analysis methods. This is especially important as often different studies measure data on different variables, meaning that we may need to impute data on a variable which is systematically missing in a particular study. In this paper, we consider a simulation analysis of sporadically missing data in a single covariate with a linear analysis model and discuss how the results would be applicable to the case of systematically missing data. We find in this context that ensuring the congeniality of the imputation and analysis models is important to give correct standard errors and confidence intervals. For example, if the analysis model allows between-study heterogeneity of a parameter, then we should incorporate this heterogeneity into the imputation model to maintain the congeniality of the two models. In an inverse-variance weighted meta-analysis, we should impute missing data and apply Rubin's rules at the study level prior to meta-analysis, rather than meta-analyzing each of the multiple imputations and then combining the meta-analysis estimates using Rubin's rules. We illustrate the results using data from the Emerging Risk Factors Collaboration.Entities:
Keywords: Rubin's rules; individual participant data; meta-analysis; missing data; multiple imputation
Mesh:
Substances:
Year: 2013 PMID: 23703895 PMCID: PMC3963448 DOI: 10.1002/sim.5844
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Schematic diagram illustrating the two approaches for combining Rubin's rules and inverse-variance weighted meta-analysis. Curved braces indicate application of Rubin's rules, square braces indicate application of meta-analysis.
Simulation study comparing complete-data, complete-case, and multiple imputation analyses with two imputation models to estimate β1 = 0.3 with thirty (30) equal sized studies using three analysis models in five scenarios with increasing heterogeneity: mean and standard deviation (SD) of estimates, mean standard error (SE), and coverage (Cov %) of the 95% confidence interval. In inverse-variance weighted analyses, it is indicated whether Rubin's rules were applied within each study prior to meta-analysis (RR then MA) or meta-analysis of imputed datasets was performed prior to combining estimates using Rubin's rules (MA then RR).
Results from scenarios where the analysis model is misspecified are shown with a shaded background, whereas results from scenarios where the imputation model is misspecified are shown in italics.
Simulation study comparing complete-data, complete-case and multiple imputation analyses with two imputation models to estimate β2 = − 0.6 with thirty (30) equal sized studies using three analysis models in five scenarios with increasing heterogeneity: mean and standard deviation (SD) of estimates, mean standard error (SE), and coverage (Cov %) of the 95% confidence interval. In inverse-variance weighted analyses, it is indicated whether Rubin's rules were applied within each study prior to meta-analysis (RR then MA) or meta-analysis of imputed datasets was performed prior to combining estimates using Rubin's rules (MA then RR).
Results from scenarios where the analysis model is misspecified are shown with a shaded background, whereas results from scenarios where the imputation model is misspecified are shown in italics.
Regression coefficients for the association of low-density lipoprotein cholesterol (mmol/L) with systolic blood pressure (mmHg) adjusting for body mass index (kg/m2) from complete-data, complete-case, and multiple imputation analyses with stratified and within-study imputation models using stratified, fixed-effects, and random-effects meta-analysis models: estimates with standard error (in brackets). In inverse-variance weighted analyses, it is indicated whether Rubin's rules were applied within each study prior to meta-analysis (RR then MA) or meta-analysis of imputed datasets was performed prior to combining estimates using Rubin's rules (MA then RR).
| Analysis model: | Stratified | Fixed-effect | Random-effects | |
|---|---|---|---|---|
| Complete-data | 1.219 (0.078) | 1.084 (0.069) | 1.189 (0.225) | |
| Complete-case | 1.230 (0.088) | 1.105 (0.078) | 1.166 (0.231) | |
| Stratified imputation | (MA then RR) | 1.248 (0.093) | 1.093 (0.081) | 1.278 (0.220) |
| (RR then MA) | 1.099 (0.077) | 1.278 (0.211) | ||
| Within-study imputation | (MA then RR) | 1.236 (0.089) | 1.112 (0.079) | 1.165 (0.239) |
| (RR then MA) | 1.110 (0.078) | 1.177 (0.226) | ||
| Scenario 1 | Scenario 2 | Scenario 3 | Scenario 4 | Scenario 5 | |
|---|---|---|---|---|---|
| 1 | |||||
| 0.2 | 0.2 | ||||
| − 0.6 | − 0.6 | − 0.6 | |||
| 0.3 | 0.3 | 0.3 | 0.3 |