| Literature DB >> 30207407 |
Jaap Brand1, Stef van Buuren2, Saskia le Cessie3,4, Wilbert van den Hout1.
Abstract
In healthcare cost-effectiveness analysis, probability distributions are typically skewed and missing data are frequent. Bootstrap and multiple imputation are well-established resampling methods for handling skewed and missing data. However, it is not clear how these techniques should be combined. This paper addresses combining multiple imputation and bootstrap to obtain confidence intervals of the mean difference in outcome for two independent treatment groups. We assessed statistical validity and efficiency of 10 candidate methods and applied these methods to a clinical data set. Single imputation nested in the bootstrap percentile method (with added noise to reflect the uncertainty of the imputation) emerged as the method with the best statistical properties. However, this method can require extensive computation times and the lack of standard software makes this method not accessible for a larger group of researchers. Using a standard unpaired t-test with standard multiple imputation without bootstrap appears to be a robust alternative with acceptable statistical performance for which standard multiple imputation software is available.Entities:
Keywords: bootstrap; confidence interval; cost-effectiveness analysis; mean difference; multiple imputation
Mesh:
Year: 2018 PMID: 30207407 PMCID: PMC6585698 DOI: 10.1002/sim.7956
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Overview of the 10 candidate methods
|
|
|
|---|---|
|
| |
| • List‐wise deletion | BENCH_LWD |
| • Single imputation using the predicted mean value | BENCH_prd |
|
| |
| • Standard multiple imputation using predictive mean matching and | MW_S |
| Rubin's rules for the computation of the confidence interval based | |
| on the normality assumption, without bootstrap | |
| • Multiple imputation using predictive mean matching with reduction | MW_EDW |
| of the effect of skewness by means of Edgeworth Expansion, | |
| without bootstrap | |
|
| |
| • Multiple imputation using predictive mean matching in the outer | MB_p |
| and the bootstrap percentile method in the inner loop | |
| • Multiple imputation using predictive mean matching in the outer | MB_t |
| loop and the bootstrap‐t method in the inner loop | |
|
| |
| • The bootstrap percentile method in the outer loop and multiple | BM_p |
| imputation by means of predictive mean matching in the inner loop | |
| • The bootstrap‐t method in the outer loop and multiple imputation by | BM_t |
| means of predictive mean matching in the inner loop | |
|
| |
| • The bootstrap percentile method in the outer loop, encompassing | BS_p |
| imputation by means of predictive mean matching | |
| • The bootstrap‐t method in the outer loop, encompassing single | BS_t |
| imputation by means of predictive mean matching | |
Assumptions in the data simulation models used to compare the candidate methods. Unspecified parameters follow the reference case assumptions (see text). The specified 15 assumptions were combined with both a missing completely at random mechanism (MCAR, open plot symbols) and a missing at random mechanism (MAR, filled plot symbols)
|
|
|
|---|---|
| % missing in the costs data (reference case 40% and 40%) | |
| 10% and 10% | Blue triangle point‐down |
| 10% and 50% | Blue circle |
| 50% and 50% | Blue triangle point‐up |
| Sample size (reference case | |
|
| Green triangle point‐down |
|
| Green circle |
|
| Green triangle point‐up |
| % zeroes in cost data (reference case 30% and 30%) | |
| 5% and 5% | Orange triangle point‐down |
| 5% and 40% | Orange circle |
| 40% and 40% | Orange triangle point‐up |
| Skewness parameter | |
| 0.5 and 0.5 | Red triangle point‐down |
| 0.5 and 3 | Red circle |
| 3 and 3 | Red triangle point‐up |
| Rank correlation (reference case ‐0.8 and ‐0.8) | |
| ‐0.3 and ‐0.3 | Purple triangle point‐down |
| ‐0.3 and ‐0.9 | Purple circle |
| ‐0.9 and ‐0.9 | Purple triangle point‐up |
Figure 1Results of the simulation study in which the performance of the 10 candidate methods for 30 different data simulation models was assessed for the actual confidence interval coverage (Panel A), bias (Panel B), and average confidence interval (Panel C). The top row of Panel A indicates the number of data simulation models (out of 30) for which each method is considered valid (ie, unbiased and with coverage at least 93.6%). For legend of the symbols, see Table 2 [Colour figure can be viewed at wileyonlinelibrary.com]
Figure 2Estimated four quality‐adjusted life year (QALY) outcomes and five cost outcomes for the Sciatica trial. Top panels display the point estimates with upper and lower bound of the confidence intervals. Bottom panels show the lengths of those confidence intervals [Colour figure can be viewed at wileyonlinelibrary.com]
Computation time to analyze data from the Sciatica study (total and for MICE calls). Time indicated by “xh ym zs” denotes x hours and y minutes and z seconds
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
|
|
|
| ||
| BENCH_LWD | 0.2 s | 0 | 0% | ||
| BENCH_prd | 23 s | 1 | 23 s | 23 s | 100% |
| MW_S | 1 m 52 s | 5 | 1 m 52 s | 22 s | 100% |
| MW_EDW | 1 m 52 s | 5 | 1 m 52 s | 22 s | 100% |
| MB_p and MB_t | 1 m 54 s | 5 | 1 m 53 s | 23 s | 99.1% |
| BM_p and BM_t | 29 h 25 m 21 s | 5000 | 28 h 57 m 43 s | 21 s | 98.4% |
| BS_p and BS_t | 5 h 53 m 34 s | 1000 | 5 h 48 m 03 s | 21 s | 98.4% |