| Literature DB >> 29514597 |
Christopher J Weir1, Isabella Butcher2, Valentina Assi2, Stephanie C Lewis2, Gordon D Murray2, Peter Langhorne3, Marian C Brady4.
Abstract
BACKGROUND: Rigorous, informative meta-analyses rely on availability of appropriate summary statistics or individual participant data. For continuous outcomes, especially those with naturally skewed distributions, summary information on the mean or variability often goes unreported. While full reporting of original trial data is the ideal, we sought to identify methods for handling unreported mean or variability summary statistics in meta-analysis.Entities:
Keywords: Continuous outcomes; Meta-analysis; Missing mean; Missing standard deviation; Systematic review
Mesh:
Year: 2018 PMID: 29514597 PMCID: PMC5842611 DOI: 10.1186/s12874-018-0483-0
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Systematic review of methods to derive missing variance/SD: PRISMA Flow Diagram.
Flow diagram based on: Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group (2009). Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med 6(6): e1000097. doi:10.1371/journal.pmed1000097
Summary of methods identified for replacing missing variance/SD/SE
| Method | Category | Description | Statistics required | Assumptions | Software implementation |
|---|---|---|---|---|---|
| Abrams et al. (2005) | 3 | Bayesian meta-analysis estimates within-patient correlation between baseline and follow-up; enables imputation of mean change from baseline and its SD when only baseline and follow-up means and SDs reported | • baseline mean/SD | SD at baseline same as SD at follow-up; within subject correlation comes from same distribution for all studies and treatment arms; careful choice of prior distribution for variance parameters | Example WinBUGS [ |
| Hozo et al. (2005) | 1 | Missing variance estimated for example as: | • minimum | Data normally distributed | Excel spreadsheet provided by Wan et al. (2014) |
|
| |||||
| Sung et al. (2006) | 3 | Imputation of missing variances within Bayesian meta-analysis assuming distributed as | • variances reported for other included studies | Assume missing variances come from same lognormal distribution as reported variances | Implemented in WinBUGS; code supplied in online supplement to article |
| Walter and Yao (2007) | 1 | Improved version of “range” method which calculates SD = (b-a)/4 | • sample size | Approximate normality | Lookup table in paper could readily be implemented in standard software; RevMan [ |
| Ma et al. (2008) | 2 | Impute weighted average of variances observed in other studies; or calculate a range of pooled estimates for efficacy based on the smallest and largest variances observed | • sample size | Unobserved and observed variances come from the same underlying distribution | Could readily be implemented in any statistical software |
| Nixon et al. (2009) | 3 | Impute missing change from baseline SD values in Bayesian random effects meta-regression | • baseline SD | Log transform of baseline SD, follow-up SD and change from baseline SD follow trivariate normal distribution. Where follow-up SD is based on complete cases, imputation assumes non-informative drop-out | Applied in WinBUGS |
| Dakin et al. (2010) | 3 | Bayesian hierarchical modelling estimating SD values in context of network meta-analysis. SD assumed to follow gamma distribution; parameters estimated from studies reporting SDs | • observed SDs | Observed and missing SD values come from the same gamma distribution | WinBUGS code provided in publication |
| MacNeil et al. (2010) | 3 | Impute missing SDs in hierarchical Bayesian meta-analysis based on posterior predictive distribution | • observed SDs | Observed, missing SDs arise from same gamma distribution | Implemented in PyMC Markov chain Monte Carlo (MCMC) toolkit [ |
| Stevens (2011), Stevens et al. (2012) | 3 | Bayesian network meta-analysis that enables imputation of missing SDs via posterior predictive distribution (variances assumed to follow gamma distribution) | • observed variances | Variances follow gamma distribution; log(SD) given weak uniform prior distribution | WinBUGS code provided |
| Boucher (2012) | 3 | Emax model of SDs; implemented using either maximum likelihood or hierarchical Bayesian model | • observed SDs over time in longitudinal study | longitudinal modelling of SDs using Emax mixed effects model; differences by treatment group permitted in SDs; weak uniform prior for SD used in Bayesian approach | SAS (SAS Institute Inc., Cary, NC) PROC NLMIXED and WinBUGS code provided for maximum likelihood and Bayesian approaches respectively |
| Wan et al. (2014)* | 1 |
| • lower quartile | Data normally distributed | Excel spreadsheet provided by Wan et al. (2014) |
| Bland (2015) | 1 | Missing variance estimated as: | • minimum | Data normally distributed | Excel spreadsheet provided by Wan et al. (2014) |
|
| |||||
| Kwon and Reis (2015) | 1 | Approximate Bayesian computation to estimate SD | • available summary statistics | Underlying distribution of data | R code provided |
| Chowdhry et al. (2016) | 2 | Meta-regression assuming sample variances follow gamma distribution | • observed variances from other studies in meta-analysis | Variances missing at random (MAR) and follow a gamma distribution | Can be fitted in SAS PROC NLMIXED |
a minimum value, q1 lower quartile, m median, q3 upper quartile, b maximum, n sample size, , sample mean
*Also provide formulae for scenarios where only a, b and n are available; or where a, b, q1, q3, and n are available; see Results section for details
Key to category numbers:
1 Methods to derive the variance/SD/SE algebraically from parametric test statistics, p-values, etc
2 Summary statistic level imputation of variance/SD/SE, for example substituting SD data from other studies, using coefficient of variation, non-parametric summaries, or correlation data
3 Meta-analysis level strategies, for example multiple imputation or bootstrapping
4 Methods to meta-analyse effects on continuous outcomes without using individual study variance/SD/SE
5 Methods to impute effect size, from which variance/SD/SE could be derived
Fig. 2Systematic review of methods to derive missing mean: PRISMA Flow Diagram.
Flow diagram based on: Moher D, Liberati A, Tetzlaff J, Altman DG, The PRISMA Group (2009). Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med 6(6): e1000097. 10.1371/journal.pmed1000097
GALA results: missing SD
| Method for missing SD replacement | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Complete data result | Ma [ | Walter [ | Cochrane Handbook [ | None, omit study | ||||||
| Mean difference (days) | 95% confidence interval | Bias | Imprecision | Bias | Imprecision | Bias | Imprecision | Bias | Imprecision | |
|
| ||||||||||
|
| − 0.01 | (− 0.87, 0.85) | ||||||||
| 2 missing SD | 0.21 | 1.18 | 0.05 | 1.05 | −0.01 | 0.73 | 0.27 | 1.28 | ||
|
| −0.01 | (− 0.37, 0.35) | ||||||||
| 2 missing SD | 0.04 | 1.04 | −0.01 | 1.01 | −0.02 | 1.97 | 0.02 | 1.06 | ||
| 5 missing SD | −0.03 | 1.56 | 0.01 | 1.10 | −0.12 | 2.40 | 0.00 | 1.64 | ||
|
| 0.00 | (−0.31, 0.30) | ||||||||
| 5 missing SD | 0.07 | 1.26 | 0.02 | 1.02 | 0.06 | 2.74 | 0.02 | 1.25 | ||
| 10 missing SD | 0.26 | 2.20 | 0.02 | 1.07 | 0.17 | 3.93 | 0.05 | 1.41 | ||
|
| −0.01 | (−0.28, 0.25) | ||||||||
| 5 missing SD | 0.06 | 1.11 | 0.03 | 1.06 | −0.12 | 2.23 | 0.09 | 1.15 | ||
| 10 missing SD | −0.09 | 1.49 | −0.01 | 1.13 | −0.21 | 2.45 | −0.02 | 1.62 | ||
| 15 missing SD | −0.03 | 1.87 | 0.02 | 1.17 | −0.23 | 2.28 | 0.12 | 2.19 | ||
Results are given for mixed sample size scenario (average of 60 patients per trial) and random allocation of trials to missing SD values. Imprecision is the ratio of the widths of the confidence intervals for the intervention effect [width when estimating missing SDs: width when all SDs available]. Results for other scenarios (small trials, large trials; missing SD in small trials, large trials) show similar patterns and are available in online Additional file 3 Tables S1-S8
GALA results: missing mean
| Method for missing mean replacement | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Complete data result | Hozo [ | Bland [ | Wan [ | None, omit study | ||||||
| Mean difference (days) | 95% confidence interval | Bias | Imprecision | Bias | Imprecision | Bias | Imprecision | Bias | Imprecision | |
|
| ||||||||||
|
| −0.01 | (−0.87, 0.85) | ||||||||
| 2 missing means | −0.20 | 1.00 | −0.12 | 1.00 | 0.05 | 1.00 | 0.27 | 1.28 | ||
|
| −0.01 | (− 0.37, 0.35) | ||||||||
| 2 missing means | 0.64 | 3.47 | 0.15 | 2.15 | −0.03 | 1.00 | 0.02 | 1.06 | ||
| 5 missing means | 0.12 | 4.35 | −0.06 | 1.58 | 0.04 | 1.00 | 0.00 | 1.64 | ||
|
| 0.00 | (−0.31, 0.30) | ||||||||
| 5 missing means | 1.16 | 3.67 | 0.38 | 2.30 | −0.03 | 1.00 | 0.02 | 1.25 | ||
| 10 missing means | 1.02 | 4.34 | 0.34 | 2.66 | 0.01 | 1.00 | 0.05 | 1.41 | ||
|
| −0.01 | (−0.28, 0.25) | ||||||||
| 5 missing means | 0.02 | 1.43 | 0.01 | 1.15 | 0.01 | 1.00 | 0.09 | 1.15 | ||
| 10 missing means | 0.01 | 2.92 | 0.06 | 1.89 | 0.04 | 1.00 | −0.02 | 1.62 | ||
| 15 missing means | −0.19 | 3.26 | −0.05 | 2.13 | 0.03 | 1.00 | 0.12 | 2.19 | ||
Results are given for mixed sample size scenario (average of 60 patients per trial) and random allocation of trials to missing mean values. Imprecision is the ratio of widths of confidence intervals for the intervention effect [width when estimating missing means: width when all means available] Results for other scenarios (small trials, large trials; missing mean in small trials, large trials) show similar patterns (online Additional file 3 Tables S9-S16)