| Literature DB >> 30341708 |
Jungyeon Choi1, Olaf M Dekkers2,3, Saskia le Cessie2,4.
Abstract
Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity methods is missing values in confounders. Several strategies for handling missing values exist, but guidance in choosing the best method is needed. In this simulation study, we compared four strategies of handling missing covariate values in propensity matching and propensity weighting. These methods include: complete case analysis, missing indicator method, multiple imputation and combining multiple imputation and missing indicator method. Concurrently, we aimed to provide guidance in choosing the optimal strategy. Simulated scenarios varied regarding missing mechanism, presence of effect modification or unmeasured confounding. Additionally, we demonstrated how missingness graphs help clarifying the missing structure. When no effect modification existed, complete case analysis yielded valid causal treatment effects even when data were missing not at random. In some situations, complete case analysis was also able to partially correct for unmeasured confounding. Multiple imputation worked well if the data were missing (completely) at random, and if the imputation model was correctly specified. In the presence of effect modification, more complex imputation models than default options of commonly used statistical software were required. Multiple imputation may fail when data are missing not at random. Here, combining multiple imputation and the missing indicator method reduced the bias as the missing indicator variable can be a proxy for unobserved confounding. The optimal way to handle missing values in covariates of propensity score models depends on the missing data structure and the presence of effect modification. When effect modification is present, default settings of imputation methods may yield biased results even if data are missing at random.Entities:
Keywords: Effect modification; Missing data; Missing indicator; Missingness graph; Multiple imputation; Propensity score analysis
Mesh:
Year: 2018 PMID: 30341708 PMCID: PMC6325992 DOI: 10.1007/s10654-018-0447-z
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 8.082
Fig. 1M-graphs for Simulation setting 1: MCAR scenario (a), MAR scenario (b), and MANR scenario (c)
Fig. 2M-graphs for Simulation setting 2: MCAR scenario (a), MAR scenario (b), and MANR scenario (c)
Fig. 3M-graphs for Simulation setting 3: MCAR scenario (a), MNAR scenario (b)
Fig. 4Mean treatment effects and their 5th and 95th percentile ranges estimated by propensity weighting in Simulation setting 1 (left), 2 (middle) and 3 (right). For each missing scenario, missing data are handled with complete case analysis, missing indicator method, multiple imputation, and the combination of multiple imputation and missing indicator method (Combined method). The vertical lines represent the true treatment effect
Results of treatment effect estimates from propensity matching and propensity weighting when assuming there is a homogeneous treatment effect and no unmeasured confounding. For each missing scenario, missing data are handled with complete case analysis, missing indicator method, multiple imputation, and the combination of multiple imputation and missing indicator (Combined method)
| Homogeneous treatment effect | |||||||
|---|---|---|---|---|---|---|---|
| Propensity matching | Propensity weighting | ||||||
| Coefficient | MSE | Coefficient | MSE | ||||
| Mean | SD | Mean | SD | ||||
| No missing | No adjustment | 1.298 | 0.123 | 1.700 | 1.298 | 0.123 | 1.700 |
| After adjustment | 0.044 | 0.085 | 0.009 | 0.006 | 0.109 | 0.012 | |
| MCAR | Complete case analysis | 0.043 | 0.121 | 0.016 | 0.014 | 0.152 | 0.023 |
| Missing indicator | 0.238 | 0.095 | 0.066 | 0.189 | 0.111 | 0.048 | |
| Multiple imputation | |||||||
| With Y | 0.047 | 0.086 | 0.010 | 0.011 | 0.113 | 0.013 | |
| Without Y | 0.219 | 0.087 | 0.056 | 0.186 | 0.110 | 0.047 | |
| Combined method | |||||||
| With Y | 0.048 | 0.087 | 0.010 | 0.011 | 0.112 | 0.013 | |
| Without Y | 0.218 | 0.087 | 0.055 | 0.187 | 0.110 | 0.047 | |
| MAR | Complete case analysis | 0.024 | 0.128 | 0.017 | 0.007 | 0.165 | 0.027 |
| Missing indicator | 0.259 | 0.099 | 0.077 | 0.172 | 0.123 | 0.044 | |
| Multiple imputation | |||||||
| With Y | 0.052 | 0.092 | 0.011 | 0.010 | 0.122 | 0.015 | |
| Without Y | 0.244 | 0.090 | 0.068 | 0.185 | 0.120 | 0.049 | |
| Combined method | |||||||
| With Y | 0.050 | 0.092 | 0.011 | 0.010 | 0.122 | 0.015 | |
| Without Y | 0.243 | 0.090 | 0.067 | 0.185 | 0.120 | 0.048 | |
| MNAR | Complete case analysis | 0.025 | 0.129 | 0.017 | 0.012 | 0.166 | 0.028 |
| Missing indicator | 0.231 | 0.098 | 0.063 | 0.149 | 0.122 | 0.037 | |
| Multiple imputation | |||||||
| With Y | 0.069 | 0.095 | 0.014 | 0.029 | 0.123 | 0.016 | |
| Without Y | 0.248 | 0.091 | 0.070 | 0.215 | 0.118 | 0.060 | |
| Combined method | |||||||
| With Y | 0.052 | 0.093 | 0.011 | 0.011 | 0.122 | 0.015 | |
| Without Y | 0.211 | 0.088 | 0.053 | 0.160 | 0.119 | 0.040 | |
Results of treatment effect estimates from propensity matching and propensity weighting when assuming X2 is an effect modifier and no unmeasured confounder exists. Here, multiple imputation is done in two ways; commonly used method (no interaction term) and elaborated method (interaction terms included)
| Heterogeneous treatment effect | ||||||||
|---|---|---|---|---|---|---|---|---|
| Propensity matching | Propensity weighting | |||||||
| Coefficient | Bias | MSE | Coefficient | MSE | ||||
| Mean | SD | Mean | SD | |||||
| No missing | No adjustment | 1.736 | 0.156 | 1.409 | 2.011 | 1.736 | 0.156 | 3.040 |
| After adjustment | 0.327 | 0.093 | 0.000 | 0.009 | − 0.003 | 0.152 | 0.023 | |
| MCAR | Complete case analysis | 0.300 | 0.133 | − 0.027 | 0.018 | − 0.003 | 0.219 | 0.048 |
| Missing indicator | 0.574 | 0.120 | 0.247 | 0.075 | 0.305 | 0.162 | 0.119 | |
|
| ||||||||
| Multiple imputation | ||||||||
| With Y | 0.315 | 0.103 | − 0.012 | 0.011 | − 0.021 | 0.168 | 0.029 | |
| Without Y | 0.542 | 0.108 | 0.215 | 0.058 | 0.297 | 0.158 | 0.113 | |
| Combined method | ||||||||
| With Y | 0.315 | 0.102 | − 0.012 | 0.011 | − 0.021 | 0.169 | 0.029 | |
| Without Y | 0.541 | 0.110 | 0.214 | 0.058 | 0.297 | 0.158 | 0.113 | |
| Interaction terms | ||||||||
| Multiple imputation | 0.316 | 0.103 | − 0.011 | 0.011 | − 0.002 | 0.166 | 0.028 | |
| Combined method | 0.316 | 0.104 | − 0.011 | 0.011 | − 0.003 | 0.166 | 0.028 | |
| MAR | Complete case analysis | 0.129 | 0.147 | − 0.198 | 0.061 | − 0.200 | 0.241 | 0.098 |
| Missing indicator | 0.620 | 0.122 | 0.293 | 0.101 | 0.272 | 0.179 | 0.106 | |
|
| ||||||||
| Multiple imputation | ||||||||
| With Y | 0.251 | 0.107 | − 0.076 | 0.017 | − 0.093 | 0.181 | 0.042 | |
| Without Y | 0.579 | 0.112 | 0.252 | 0.076 | 0.286 | 0.173 | 0.111 | |
| Combined method | ||||||||
| With Y | 0.250 | 0.108 | − 0.077 | 0.017 | − 0.092 | 0.182 | 0.042 | |
| Without Y | 0.580 | 0.113 | 0.253 | 0.077 | 0.285 | 0.173 | 0.111 | |
| Interaction terms | ||||||||
| Multiple imputation | 0.330 | 0.116 | 0.003 | 0.013 | 0.010 | 0.185 | 0.034 | |
| Combined method | 0.330 | 0.116 | 0.003 | 0.013 | 0.010 | 0.185 | 0.034 | |
| MNAR | Complete case analysis | − 0.111 | 0.141 | − 0.438 | 0.211 | − 0.411 | 0.224 | 0.219 |
| Missing indicator | 0.588 | 0.121 | 0.261 | 0.082 | 0.230 | 0.171 | 0.082 | |
|
| ||||||||
| Multiple imputation | ||||||||
| With Y | 0.151 | 0.114 | − 0.176 | 0.044 | − 0.238 | 0.207 | 0.100 | |
| Without Y | 0.586 | 0.112 | 0.259 | 0.080 | 0.350 | 0.165 | 0.150 | |
| Combined method | ||||||||
| With Y | 0.140 | 0.111 | − 0.187 | 0.047 | − 0.248 | 0.206 | 0.104 | |
| Without Y | 0.546 | 0.108 | 0.219 | 0.060 | 0.248 | 0.165 | 0.089 | |
| Interaction terms | ||||||||
| Multiple imputation | 0.182 | 0.117 | − 0.145 | 0.035 | − 0.192 | 0.208 | 0.080 | |
| Combined method | 0.170 | 0.114 | − 0.157 | 0.038 | − 0.205 | 0.264 | 0.112 | |
Results of treatment effect estimates from propensity matching and inverse probability weighting, when an unmeasured confounding exists
| Homogeneous treatment effect/unmeasured confounding | |||||||
|---|---|---|---|---|---|---|---|
| Propensity matching | Propensity weighting | ||||||
| Coefficient | MSE | Coefficient | MSE | ||||
| Mean | SD | Mean | SD | ||||
| No missing | No adjustment | 2.011 | 0.154 | 4.068 | 2.011 | 0.154 | 4.068 |
| After adjustment | 0.377 | 0.111 | 0.154 | 0.328 | 0.168 | 0.136 | |
| MCAR | Complete case analysis | 0.362 | 0.152 | 0.154 | 0.336 | 0.233 | 0.167 |
| Missing indicator | 0.870 | 0.138 | 0.776 | 0.774 | 0.171 | 0.628 | |
| Multiple imputation | |||||||
| With Y | 0.376 | 0.119 | 0.155 | 0.330 | 0.171 | 0.138 | |
| Without Y | 0.807 | 0.119 | 0.665 | 0.771 | 0.165 | 0.621 | |
| Combined method | |||||||
| With Y | 0.375 | 0.119 | 0.155 | 0.330 | 0.171 | 0.138 | |
| Without Y | 0.808 | 0.119 | 0.667 | 0.770 | 0.165 | 0.620 | |
| MNAR | Complete case analysis | 0.145 | 0.163 | 0.048 | 0.157 | 0.255 | 0.089 |
| Missing indicator | 0.514 | 0.117 | 0.277 | 0.345 | 0.197 | 0.158 | |
| Multiple imputation | |||||||
| With Y | 0.422 | 0.141 | 0.197 | 0.354 | 0.200 | 0.165 | |
| Without Y | 1.003 | 0.129 | 1.023 | 1.028 | 0.154 | 1.079 | |
| Combined method | |||||||
| With Y | 0.240 | 0.114 | 0.071 | 0.169 | 0.191 | 0.065 | |
| Without Y | 0.469 | 0.105 | 0.231 | 0.386 | 0.175 | 0.180 | |
Results of Simulation setting 2 where the multiple imputation by chained equations (MICE) with Bayesian linear regression is used for a sensitivity analysis
| Heterogeneous treatment effect | |||||||
|---|---|---|---|---|---|---|---|
| Propensity matching | Propensity weighting | ||||||
| Coefficient | Bias | MSE | Coefficient | MSE | |||
| Mean | SD | Mean | SD | ||||
| No adjustment | 1.730 | 0.158 | 1.410 | 2.012 | 1.730 | 0.158 | 3.019 |
| After adjustment | 0.321 | 0.096 | 0.000 | 0.009 | − 0.017 | 0.156 | 0.025 |
|
| |||||||
| Multiple imputation | |||||||
| With Y | 0.304 | 0.095 | − 0.017 | 0.009 | − 0.041 | 0.170 | 0.031 |
| Without Y | 0.536 | 0.101 | 0.215 | 0.056 | 0.292 | 0.142 | 0.105 |
| Combined method | |||||||
| With Y | 0.303 | 0.095 | − 0.018 | 0.009 | − 0.042 | 0.172 | 0.031 |
| Without Y | 0.537 | 0.104 | 0.216 | 0.058 | 0.294 | 0.143 | 0.107 |
| Interaction terms | |||||||
| Multiple imputation | 0.315 | 0.094 | − 0.006 | 0.009 | − 0.014 | 0.169 | 0.029 |
| Combined method | 0.315 | 0.096 | − 0.006 | 0.009 | − 0.015 | 0.171 | 0.029 |
|
| |||||||
| Multiple imputation | |||||||
| With Y | 0.220 | 0.103 | − 0.101 | 0.021 | − 0.116 | 0.192 | 0.050 |
| Without Y | 0.568 | 0.110 | 0.247 | 0.073 | 0.264 | 0.158 | 0.095 |
| Combined method | |||||||
| With Y | 0.220 | 0.101 | 0.010 | − 0.116 | 0.190 | 0.049 | |
| Without Y | 0.568 | 0.111 | 0.248 | 0.074 | 0.264 | 0.157 | 0.094 |
| Interaction terms | |||||||
| Multiple imputation | 0.330 | 0.101 | 0.009 | 0.010 | 0.002 | 0.199 | 0.040 |
| Combined method | 0.331 | 0.103 | 0.010 | 0.011 | 0.001 | 0.198 | 0.039 |
|
| |||||||
| Multiple imputation | |||||||
| With Y | 0.102 | 0.110 | − 0.219 | 0.060 | − 0.269 | 0.213 | 0.118 |
| Without Y | 0.570 | 0.110 | 0.249 | 0.074 | 0.325 | 0.153 | 0.129 |
| Combined method | |||||||
| With Y | 0.095 | 0.103 | − 0.225 | 0.061 | − 0.275 | 0.211 | 0.120 |
| Without Y | 0.537 | 0.105 | 0.216 | 0.058 | 0.233 | 0.149 | 0.076 |
| Interaction terms | |||||||
| Multiple imputation | 0.173 | 0.101 | − 0.147 | 0.032 | − 0.197 | 0.220 | 0.087 |
| Combined method | 0.169 | 0.103 | − 0.151 | 0.034 | − 0.206 | 0.215 | 0.089 |