| Literature DB >> 30170561 |
Jacques-Emmanuel Galimard1,2, Sylvie Chevret3,4,5, Emmanuel Curis6,7, Matthieu Resche-Rigon3,4,5.
Abstract
BACKGROUND: Multiple imputation by chained equations (MICE) requires specifying a suitable conditional imputation model for each incomplete variable and then iteratively imputes the missing values. In the presence of missing not at random (MNAR) outcomes, valid statistical inference often requires joint models for missing observations and their indicators of missingness. In this study, we derived an imputation model for missing binary data with MNAR mechanism from Heckman's model using a one-step maximum likelihood estimator. We applied this approach to improve a previously developed approach for MNAR continuous outcomes using Heckman's model and a two-step estimator. These models allow us to use a MICE process and can thus also handle missing at random (MAR) predictors in the same MICE process.Entities:
Keywords: Heckman’s model; Missing data; Missing not at random (MNAR); Multiple imputation by chained equation (MICE); Sample selection method
Mesh:
Year: 2018 PMID: 30170561 PMCID: PMC6119269 DOI: 10.1186/s12874-018-0547-1
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Binary Y: Simulation results for β1=1 with ρ=0, representing a MAR mechanism, and ρ=0.3 and 0.6, representing an MNAR mechanism
| Methods |
|
|
|
| RMSE | Cover |
|---|---|---|---|---|---|---|
| Before | 0 | 0.7 | 0.108 | 0.109 | 0.109 | 94.9 |
| deletion | 0.3 | 1.1 | 0.109 | 0.109 | 0.110 | 95.9 |
| 0.6 | 0.9 | 0.109 | 0.109 | 0.109 | 95.2 | |
| CCA | 0 | 1.2 | 0.137 | 0.137 | 0.137 | 95.4 |
| 0.3 | -6.1 | 0.135 | 0.135 | 0.148 | 92.0 | |
| 0.6 | -11.9 | 0.135 | 0.134 | 0.179 | 83.5 | |
| HEml | 0 | -0.3 | 0.161 | 0.163 | 0.163 | 95.0 |
| 0.3 | -0.1 | 0.148 | 0.151 | 0.150 | 94.8 | |
| 0.6 | -0.1 | 0.134 | 0.132 | 0.132 | 96.1 | |
| MIHEml | 0 | -1.0 | 0.159 | 0.161 | 0.162 | 94.2 |
| 0.3 | -1.0 | 0.148 | 0.150 | 0.150 | 95.5 | |
| 0.6 | -0.9 | 0.135 | 0.132 | 0.133 | 95.4 |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman’s one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation
Fig. 1Binary outcome Boxplot of β1 estimates on the 1000 simulations associated to Table 1 (plot a), Table 3 (plot b), Table 5 left (plot c) and Table 5 right (plot d)
Binary Y and logit selection model: Simulation results for β1=1 estimates
| Methods |
|
|
|
| RMSE | Cover |
|---|---|---|---|---|---|---|
| Before | 0 | 0.9 | 0.109 | 0.110 | 0.110 | 95.3 |
| deletion | 1 | 1.0 | 0.109 | 0.103 | 0.103 | 96.4 |
| 2 | 1.2 | 0.109 | 0.115 | 0.115 | 94.4 | |
| CCA | 0 | 1.5 | 0.135 | 0.137 | 0.138 | 95.8 |
| 1 | -7.2 | 0.133 | 0.131 | 0.149 | 90.6 | |
| 2 | -15.4 | 0.134 | 0.146 | 0.212 | 74.0 | |
| Heml | 0 | -2.4 | 0.167 | 0.170 | 0.171 | 93.5 |
| 1 | -2.5 | 0.153 | 0.152 | 0.154 | 96.0 | |
| 2 | -3.5 | 0.144 | 0.152 | 0.156 | 94.9 | |
| MIHEml | 0 | -4.0 | 0.163 | 0.163 | 0.168 | 93.9 |
| 1 | -3.7 | 0.152 | 0.152 | 0.156 | 95.2 | |
| 1 | -4.2 | 0.145 | 0.155 | 0.160 | 94.9 |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman’s one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation
Binary Y: Simulation results for β1=1 with ρ=0, representing a MAR mechanism, and ρ=0.3 and 0.6, representing an MNAR mechanism, in the presence of missing data on X2
| Methods |
|
|
| RMSE | Cover |
|
| RMSE | Cover |
|---|---|---|---|---|---|---|---|---|---|
| Before deletion | 0 | 0.7 | 0.109 | 0.113 | 95.0 | 0.8 | 0.109 | 0.111 | 94.9 |
| 0.3 | 0.5 | 0.108 | 0.108 | 95.7 | 0.9 | 0.109 | 0.109 | 95.0 | |
| 0.6 | 0.5 | 0.108 | 0.106 | 95.8 | 1.1 | 0.109 | 0.106 | 95.5 | |
| CCA | 0 | 1.0 | 0.158 | 0.159 | 95.3 | -20.3 | 0.165 | 0.262 | 73.9 |
| 0.3 | -3.8 | 0.158 | 0.166 | 93.4 | -26.9 | 0.164 | 0.313 | 60.9 | |
| 0.6 | -8.4 | 0.158 | 0.178 | 90.9 | -33.2 | 0.165 | 0.369 | 47.6 | |
| HEml | 0 | -1.3 | 0.182 | 0.182 | 94.8 | -21.1 | 0.190 | 0.287 | 78.5 |
| 0.3 | -0.5 | 0.169 | 0.175 | 94.3 | -21.2 | 0.178 | 0.274 | 76.3 | |
| 0.6 | -1.1 | 0.158 | 0.158 | 95.0 | -22.5 | 0.165 | 0.277 | 73.8 | |
| MIHEml | 0 | -2.1 | 0.167 | 0.167 | 95.2 | -1.4 | 0.166 | 0.168 | 94.5 |
| 0.3 | -1.8 | 0.155 | 0.153 | 95.6 | -1.7 | 0.155 | 0.151 | 95.7 | |
| 0.6 | -2.5 | 0.146 | 0.140 | 96.6 | -2.5 | 0.145 | 0.140 | 96.0 | |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman’s one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation
Continuous Y: Simulation results for β1=1 with ρ=0, representing a MAR mechanism, and ρ=0.3 and 0.6, representing an MNAR mechanism
| Methods |
|
|
|
| RMSE | Cover |
|---|---|---|---|---|---|---|
| Before | 0 | 0.0 | 0.064 | 0.064 | 0.064 | 95.1 |
| deletion | 0.3 | 0.0 | 0.063 | 0.065 | 0.065 | 95.0 |
| 0.6 | -0.2 | 0.064 | 0.064 | 0.064 | 94.3 | |
| CCA | 0 | 0.1 | 0.083 | 0.084 | 0.084 | 95.1 |
| 0.3 | -9.1 | 0.082 | 0.081 | 0.122 | 80.3 | |
| 0.6 | -17.8 | 0.078 | 0.079 | 0.194 | 38.2 | |
| HEml | 0 | 0.0 | 0.103 | 0.103 | 0.103 | 95.2 |
| 0.3 | -0.4 | 0.101 | 0.101 | 0.101 | 94.6 | |
| 0.6 | -0.4 | 0.092 | 0.092 | 0.092 | 94.2 | |
| MIHEml | 0 | 0.0 | 0.105 | 0.103 | 0.103 | 94.7 |
| 0.3 | -0.3 | 0.103 | 0.102 | 0.102 | 95.3 | |
| 0.6 | -0.3 | 0.096 | 0.094 | 0.094 | 94.8 | |
| HE2steps | 0 | 0.0 | 0.103 | 0.102 | 0.102 | 95.4 |
| 0.3 | -0.4 | 0.103 | 0.103 | 0.103 | 94.6 | |
| 0.6 | -0.2 | 0.100 | 0.099 | 0.099 | 95.4 | |
| MIHE2steps | 0 | 0.0 | 0.105 | 0.103 | 0.103 | 95.2 |
| 0.3 | -0.4 | 0.104 | 0.104 | 0.104 | 94.0 | |
| 0.6 | -0.2 | 0.103 | 0.100 | 0.099 | 95.2 |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation; HE2steps: Heckman’s two-step estimation; MIHE2steps: Multiple imputation using Heckman’s two-step estimation
Fig. 2Continuous outcome Boxplot of β1 estimates on the 1000 simulations associated to Table 2 (plot a), Table 4 (plot b), Table 6 left (plot c) and Table 6 right (plot d)
Continuous Y and logit selection model: Simulation results for β1=1 estimates
| Methods |
|
|
|
| RMSE | Cover |
|---|---|---|---|---|---|---|
| Before | 0 | 0.2 | 0.063 | 0.064 | 0.064 | 95.4 |
| deletion | 1 | 0.0 | 0.064 | 0.066 | 0.066 | 94.2 |
| 2 | -0.2 | 0.063 | 0.061 | 0.061 | 96.2 | |
| CCA | 0 | 0.3 | 0.079 | 0.078 | 0.078 | 95.8 |
| 1 | -18.9 | 0.079 | 0.080 | 0.205 | 33.5 | |
| 2 | -30.1 | 0.075 | 0.076 | 0.310 | 2.1 | |
| HEml | 0 | 0.3 | 0.105 | 0.108 | 0.108 | 93.6 |
| 1 | -1.3 | 0.117 | 0.131 | 0.131 | 92.7 | |
| 2 | -1.5 | 0.098 | 0.111 | 0.112 | 94.3 | |
| MIHEml | 0 | 0.3 | 0.107 | 0.110 | 0.110 | 94.0 |
| 1 | -1.2 | 0.121 | 0.133 | 0.134 | 92.6 | |
| 2 | -1.3 | 0.105 | 0.113 | 0.114 | 94.7 | |
| HE2steps | 0 | 0.3 | 0.107 | 0.105 | 0.105 | 95.4 |
| 1 | 0.9 | 0.149 | 0.158 | 0.158 | 95.0 | |
| 2 | 0.0 | 0.162 | 0.165 | 0.165 | 95.6 | |
| MIHE2steps | 0 | 0.3 | 0.110 | 0.106 | 0.106 | 95.5 |
| 1 | 0.9 | 0.151 | 0.159 | 0.159 | 94.6 | |
| 2 | 0.0 | 0.163 | 0.166 | 0.166 | 94.8 |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation; HE2steps: Heckman’s two-step estimation; MIHE2steps: Multiple imputation using Heckman’s two-step estimation
Continuous Y: Simulation results for β1=1 with ρ=0, representing a MAR mechanism, and ρ=0.3 and 0.6, representing an MNAR mechanism, in the presence of missing data on X2
| Methods |
|
|
| RMSE | Cover |
|
| RMSE | Cover |
|---|---|---|---|---|---|---|---|---|---|
| Before deletion | 0 | 0.2 | 0.063 | 0.063 | 94.6 | -0.1 | 0.063 | 0.064 | 95.8 |
| 0.3 | 0.1 | 0.064 | 0.065 | 94.8 | 0.0 | 0.063 | 0.064 | 94.7 | |
| 0.6 | -0.3 | 0.064 | 0.065 | 94.0 | 0.2 | 0.063 | 0.063 | 95.2 | |
| CCA | 0 | 0.1 | 0.095 | 0.092 | 94.8 | -28.9 | 0.097 | 0.307 | 15.9 |
| 0.3 | -6.3 | 0.093 | 0.118 | 87.4 | -33.8 | 0.094 | 0.351 | 4.7 | |
| 0.6 | -13.3 | 0.090 | 0.162 | 69.7 | -37.7 | 0.090 | 0.388 | 1.6 | |
| HEml | 0 | 0.1 | 0.113 | 0.112 | 94.3 | -27.7 | 0.118 | 0.306 | 34.6 |
| 0.3 | -0.3 | 0.110 | 0.121 | 92.7 | -26.9 | 0.110 | 0.294 | 34.4 | |
| 0.6 | -0.6 | 0.103 | 0.106 | 93.3 | -27.5 | 0.099 | 0.293 | 20.9 | |
| MIHEml | 0 | -0.1 | 0.107 | 0.103 | 95.0 | 1.2 | 0.111 | 0.107 | 95.8 |
| 0.3 | -0.9 | 0.105 | 0.109 | 94.1 | -0.1 | 0.108 | 0.107 | 93.7 | |
| 0.6 | -2.2 | 0.101 | 0.099 | 94.9 | -1.7 | 0.103 | 0.097 | 94.2 | |
| HE2steps | 0 | 0.2 | 0.114 | 0.111 | 94.8 | -28.1 | 0.117 | 0.306 | 30.8 |
| 0.3 | 0.0 | 0.114 | 0.123 | 92.7 | -27.7 | 0.111 | 0.300 | 29.2 | |
| 0.6 | -0.3 | 0.113 | 0.114 | 93.9 | -28.0 | 0.104 | 0.299 | 23.7 | |
| MIHE2steps | 0 | -0.1 | 0.107 | 0.101 | 95.5 | 1.1 | 0.110 | 0.107 | 96.1 |
| 0.3 | -0.6 | 0.106 | 0.112 | 93.1 | 0.0 | 0.108 | 0.110 | 94.7 | |
| 0.6 | -1.5 | 0.105 | 0.105 | 94.8 | -1.3 | 0.105 | 0.102 | 94.5 | |
%Rbias: % relative bias; SE: Root mean square of the estimated standard error; SE: Empirical Monte Carlo standard error; RMSE: Root mean square error; Cover: % coverage of the nominal 95% confidence interval; CCA: Complete case analysis; HEml: Heckman one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation; HE2steps: Heckman’s two-step estimation; MIHE2steps: Multiple imputation using Heckman’s two-step estimation
Estimation of the predictive value of the randomisation group and severity score
| Methods (% used) | Assumed mechanisms |
|
|
| ||||
|---|---|---|---|---|---|---|---|---|
|
|
| Coeff | SE | Coeff | SE | Coeff | SE | |
| CCA (66%) | MCAR | MCAR | 0.243 | 0.217 | 0.061 | 0.206 | 0.021 | 0.163 |
| MI (100%) | MAR | MAR | 0.380 | 0.205 | 0.055 | 0.183 | 0.035 | 0.163 |
| HEml (79%) | MNAR | MCAR | 0.272 | 0.268 | 0.077 | 0.223 | 0.048 | 0.223 |
| MIHEml (100%) | MNAR | MAR | 0.396 | 0.188 | 0.105 | 0.182 | 0.123 | 0.181 |
Adh.: Adherence; Sev.: Severity score; Coeff: Coefficient; SE: Standard error; CCA: Complete case analysis; MI: Multiple imputation using classic imputation models; HEml: Heckman’s one-step ML estimation; MIHEml: Multiple imputation using Heckman’s one-step ML estimation