| Literature DB >> 19638200 |
Andrea Marshall1, Douglas G Altman, Roger L Holder, Patrick Royston.
Abstract
BACKGROUND: Multiple imputation (MI) provides an effective approach to handle missing covariate data within prognostic modelling studies, as it can properly account for the missing data uncertainty. The multiply imputed datasets are each analysed using standard prognostic modelling techniques to obtain the estimates of interest. The estimates from each imputed dataset are then combined into one overall estimate and variance, incorporating both the within and between imputation variability. Rubin's rules for combining these multiply imputed estimates are based on asymptotic theory. The resulting combined estimates may be more accurate if the posterior distribution of the population parameter of interest is better approximated by the normal distribution. However, the normality assumption may not be appropriate for all the parameters of interest when analysing prognostic modelling studies, such as predicted survival probabilities and model performance measures.Entities:
Mesh:
Year: 2009 PMID: 19638200 PMCID: PMC2727536 DOI: 10.1186/1471-2288-9-57
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Parameter of interest in prognostic modelling studies and ways to combine estimates after MI
| Parameters | Possible methods for combining estimates of parameters after MI* |
|---|---|
| Mean Value | Rubin's rules |
| Standard Deviation | Rubin's rules |
| Correlation | Rubin's rules after Fisher's Z transformation |
| Regression coefficient | Rubin's rules |
| Hazard ratio | Rubin's rules after logarithmic transformation |
| Prognostic Index/linear predictor per patient | Rubin's rules |
| Testing significance of individual covariate in model | Rubin's rules using a Wald test for a single estimates (Table 2(A)) |
| Testing significance of all fitted covariates in model | Rubin's rules using a Wald test for multivariate estimates (Table 2(B)) |
| Likelihood ratio | Rules for combining likelihood ratio statistics if parametric model (Table 2(D)) or |
| Proportion of variance explained (e.g. R2 statistics) | Robust methods |
| Discrimination (c-index) | Robust methods |
| Prognostic Separation D statistic | Rubin's rules |
| Calibration (Shrinkage estimate) | Robust methods |
| Survival probabilities | Rubin's rules after complementary log-log transformation |
| Percentiles of a survival distribution | Rubin's rules after logarithmic transformation |
* Reflect the authors' experiences and current evidence.
Summary of significance tests for combining different estimates from m imputed datasets after MI
| Estimate | Test statistic | Degrees of freedom (df) | Relative increase in variance ( | |
|---|---|---|---|---|
KEY: F = value from the F-distribution, which the test statistic is compared to.
= average of the m imputed data estimates.
= within imputation variance.
B = between imputation variance.
T = total variance for the combined MI estimate.
w, j = 1,..., m = χ2 statistics associated with testing the null hypothesis H: Q = Qon each imputed dataset, such that the significance level for the jimputed dataset is P{ > w}, where is the χ2 value with k degrees of freedom (Rubin 1987).
= average of the repeated χ2 statistics.
= average of the m likelihood ratio statistics, w,..., w, evaluated using the average MI parameter estimates and the average of the estimates from a model fitted subject to the null hypothesis.