| Literature DB >> 21169167 |
Matthijs Blankers1, Maarten W J Koeter, Gerard M Schippers.
Abstract
BACKGROUND: Missing data is a common nuisance in eHealth research: it is hard to prevent and may invalidate research findings.Entities:
Mesh:
Year: 2010 PMID: 21169167 PMCID: PMC3057309 DOI: 10.2196/jmir.1448
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Missing data approaches in this study
| Approach | Description | Missingness Pattern | Type |
| Complete cases | Only cases without missing observations in analysis | MCARa | Basic, single |
| Mean imputation | Imputes missing observations with listwise mean for each variable | MCARb | Basic, single |
| LOCF | Imputes the last available observation in the current data collection wave | - | Basic, single |
| Regression imputation | Imputes missing observations by prediction based on other variables in a regression model | MAR, MCAR | Advanced, single |
| EM imputation | Imputes missing observations using expectation maximization algorithm | MAR, MCAR | Advanced, single |
| NORM | Multiple imputes missing observations under a normal model | MAR, MCAR | Advanced, multiple |
| MICE | Multiple imputes missing observations using chained equations | MAR, MCAR | Advanced, multiple |
| SPSS MI | Multiple imputes missing observations under a normal model in SPSS | MAR, MCAR | Advanced, multiple |
| Amelia II | Multiple imputes missing observations using a bootstrapping-based algorithm | MAR, MCAR | Advanced, multiple |
a This approach will lead to unbiased point estimators (eg, means) under MCAR, but will result in lowered power and sample size.
b This approach will lead to unbiased point estimators (eg, means) under MCAR, but will result in biased, smaller confidence intervals.
Figure 2Repeated application of nine missing data approaches
Figure 1Strip chart for 9 missing data approaches and the reference value
Independent samples t tests for missing data approaches against reference value
| Method | Mean | SD | Degrees of Freedom | Cohen’s | ||
| Reference | 2.62 | 5.22 | 0 | 246 | 1 | 0 |
| Complete cases | 1.39 | 2.63 | -2.09 | 176 | 0.04 | -0.31 |
| Mean imputation | 1.39 | 1.73 | -2.50 | 246 | 0.01 | -0.35 |
| LOCF | 4.85 | 5.43 | 3.29 | 246 | 0.001 | 0.42 |
| Regression imputation | 1.39 | 2.37 | -2.38 | 246 | 0.01 | -0.32 |
| EM imputation | 3.09 | 3.85 | 0.809 | 246 | 0.42 | 0.10 |
| NORM | 3.14 | 9.55 | 0.534 | 246 | 0.53 | 0.07 |
| MICE | 3.06 | 4.30 | 0.730 | 246 | 0.47 | 0.09 |
| SPSS 17 MI | 1.49 | 2.03 | -2.26 | 246 | 0.03 | -0.31 |
| Amelia II | 2.88 | 3.33 | 0.468 | 246 | 0.64 | 0.06 |
a Independent samples t tests
Coverage of the reference confidence interval for imputed means
| Missing Data Approach | Coverage Proportion | Variance of Bootstrapped Sample |
| Complete cases | 0.15 | 0.088 |
| Mean imputation | 0.15 | 0.088 |
| LOCF | 0 | 0.381 |
| EM imputation | 0.83 | 0.206 |
| Regression imputation | 0.17 | 0.105 |
| NORM | 0.43 | 3.027 |
| MICE | 0.71 | 0.622 |
| SPSS MI | 0.23 | 0.093 |
| Amelia II | 0.96 | 0.205 |