| Literature DB >> 27577523 |
Kirsty M Rhodes1, Rebecca M Turner1, Ian R White1, Dan Jackson1, David J Spiegelhalter2, Julian P T Higgins3.
Abstract
Many meta-analyses combine results from only a small number of studies, a situation in which the between-study variance is imprecisely estimated when standard methods are applied. Bayesian meta-analysis allows incorporation of external evidence on heterogeneity, providing the potential for more robust inference on the effect size of interest. We present a method for performing Bayesian meta-analysis using data augmentation, in which we represent an informative conjugate prior for between-study variance by pseudo data and use meta-regression for estimation. To assist in this, we derive predictive inverse-gamma distributions for the between-study variance expected in future meta-analyses. These may serve as priors for heterogeneity in new meta-analyses. In a simulation study, we compare approximate Bayesian methods using meta-regression and pseudo data against fully Bayesian approaches based on importance sampling techniques and Markov chain Monte Carlo (MCMC). We compare the frequentist properties of these Bayesian methods with those of the commonly used frequentist DerSimonian and Laird procedure. The method is implemented in standard statistical software and provides a less complex alternative to standard MCMC approaches. An importance sampling approach produces almost identical results to standard MCMC approaches, and results obtained through meta-regression and pseudo data are very similar. On average, data augmentation provides closer results to MCMC, if implemented using restricted maximum likelihood estimation rather than DerSimonian and Laird or maximum likelihood estimation. The methods are applied to real datasets, and an extension to network meta-analysis is described. The proposed method facilitates Bayesian meta-analysis in a way that is accessible to applied researchers. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.Entities:
Keywords: data augmentation; heterogeneity; informative priors; meta-analysis; meta-regression
Mesh:
Year: 2016 PMID: 27577523 PMCID: PMC5111594 DOI: 10.1002/sim.7090
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Predictive inverse‐gamma (IG) distributions for between‐study variance τ 2 expected in future binary outcome meta‐analyses of log odds ratios. To implement a predictive distribution as an informative prior for τ 2 in an approximate Bayesian meta‐analysis by data augmentation, we augment the observed study data using the pseudo data reported.
|
|
|
| |
|---|---|---|---|
|
|
|
| |
|
|
|
| |
|
|
|
|
|
| 2 studies with | 6 studies with | 2 studies with | |
| effects | effects | effects | |
|
|
|
| |
|
|
|
|
|
| 3 studies with | 2 studies with | 2 studies with | |
| effects | effects | effects | |
|
|
|
| |
|
|
|
|
|
| 3 studies with | 2 studies with | 3 studies with | |
| effects | effects | effects |
Results from re‐analysing data from published meta‐analyses to explore discrepancies between data augmentation and MCMC.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
|
| ||||||||
| Conventional (DL estimation) | Observed | 1.42( − 0.08,2.91) | 1.78(0.39,52.2)1 | 80 | ||||
| Data augmentation by DL | Augmented4 | IG(1.45, 0.18)3 | IG(1.5, 0.18) | 1.15(0.48,1.83) | 0.12(0.17,8.48)1 | |||
| Data augmentation by ML | Augmented4 | IG(1.45, 0.18)3 | IG(1.5, 0.18) | 1.21(0.35,2.07) | 0.36(0.17,8.48)1 | |||
| Data augmentation by REML | Augmented4 | IG(1.45, 0.18)3 | IG(1.5, 0.18) | 1.24(0.30,2.19) | 0.51(0.17,8.48)1 | |||
| MCMC | Observed | IG(1.45, 0.18)3 | IG(1.45, 0.18) | 1.24(0.18,2.56)2 | 0.0009 | 0.52(0.07,4.21)2 | 0.003 | |
|
| ||||||||
| MCMC | Augmented4 |
| IG(1.5, 0.18) | 1.24(0.20,2.53)2 | 0.0006 | 0.50(0.07,3.97)2 | 0.002 | |
| MCMC | Observed | IG(1.5, 0.18)5 | IG(1.5, 0.18) | 1.24(0.20,2.53)2 | 0.0006 | 0.50(0.07,3.98)2 | 0.002 | |
|
| ||||||||
|
| ||||||||
| Conventional (DL estimation) | Observed | − 0.28( − 1.28,0.72) | 0(0,7.61)1 | 0 | ||||
| Data augmentation by DL | Augmented4 | IG(1.06, 0.01)6 | IG(1, 0.01) | − 0.28( − 1.29,0.73) | 0.01(0.002,0.52)1 | |||
| Data augmentation by ML | Augmented4 | IG(1.06, 0.01)6 | IG(1, 0.01) | − 0.28( − 1.29,0.73) | 0.01(0.002,0.52)1 | |||
| Data augmentation by REML | Augmented4 | IG(1.06, 0.01)6 | IG(1, 0.01) | − 0.28( − 1.29,0.73) | 0.01(0.002,0.52)1 | |||
| MCMC | Observed | IG(1.06, 0.01)6 | IG(1.06, 0.01) | − 0.28( − 1.31,0.74)2 | 0.003 | 0.01 (0.003,0.21)2 | 0.0002 | |
|
| ||||||||
| MCMC | Augmented4 |
| IG(1, 0.01) | − 0.28( − 1.31,0.74)2 | 0.003 | 0.01(0.003,0.24)2 | 0.0002 | |
| MCMC | Observed | IG(1, 0.01)5 | IG(1, 0.01) | − 0.28( − 1.31,0.74)2 | 0.003 | 0.01 (0.003,0.24)2 | 0.0002 | |
The confidence interval for τ 2 is obtained iteratively via the Q‐profile method 23.
Posterior medians and 95% credible intervals are reported.
Empirically based inverse‐gamma(1.45,0.18) prior for a pharmacological vs placebo/control meta‐analysis with a subjective outcome.
Observed data augmented using pseudo data to represent the achieved inverse‐gamma prior for τ 2.
The prior we achieve in data augmentation, due to rounding the number of artificial studies to be an integer.
Empirically based inverse‐gamma(1.06,0.01) prior for a pharmacological vs placebo/control meta‐analysis with an all‐cause mortality outcome.
MCMC, Markov chain Monte Carlo; DL, DerSimonian and Laird; ML, maximum likelihood; REML, restricted maximum likelihood.
Figure 1Average estimates [top row] and root mean squared error (RMSE) [bottom row] of between‐study variance τ 2 from the simulation study, using 20 000 simulations for each combination of τ 2 and K, plotted on the logarithm scale. In each case, DL denotes conventional estimation by the DerSimonian and Laird procedure. DA and IS denote Bayesian methods by data augmentation and importance sampling, respectively.
Figure 2Average estimates with corresponding 95% intervals [top row] and coverage probabilities of estimated 95% intervals for the summary effect μ [bottom row] from the simulation study, using 20 000 simulations for each combination of τ 2 and K, plotted on the logarithm scale. In each case, DL denotes conventional estimation by the DerSimonian and Laird procedure. DA and IS denote Bayesian methods by data augmentation and importance sampling, respectively.
Properties of estimates for μ and τ 2 from the simulation study with K = 5 studies, using 1000 simulations for each τ 2 value. In each case, IS denotes values using Bayesian methods by importance sampling. ‘Z length’ denotes the average length of the nominal 95% interval for μ using the standard normal quantile, and ‘Z coverage’ denotes the proportion of nominal 95% intervals that cover the true value, using the standard normal quantile.
|
| ||||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||
| IS | MCMC | IS | MCMC | IS | MCMC | IS | MCMC | |
|
| − 0.001 | − 0.001 | 0.106 | 0.106 | 0.771 | 0.771 | 0.9993 | 1 |
|
| 0.003 | 0.003 | 0.143 | 0.144 | 0.800 | 0.805 | 0.995 | 0.995 |
|
| 0.001 | 0.001 | 0.170 | 0.169 | 0.840 | 0.842 | 0.985 | 0.985 |
|
| 0.005 | 0.005 | 0.260 | 0.260 | 0.971 | 0.984 | 0.996 | 0.996 |
|
| 0.013 | 0.012 | 0.556 | 0.554 | 1.654 | 1.839 | 0.991 | 0.992 |
|
| ||||||||
|
|
| |||||||
| IS | MCMC | IS | MCMC | |||||
|
| 0.064 | 0.064 | 0.018 | 0.018 | ||||
|
| 0.073 | 0.073 | 0.027 | 0.027 | ||||
|
| 0.084 | 0.084 | 0.040 | 0.040 | ||||
|
| 0.141 | 0.142 | 0.112 | 0.111 | ||||
|
| 0.767 | 0.787 | 0.646 | 0.682 | ||||
MCMC, Markov chain Monte Carlo.
Results from re‐analysing data from the network meta‐analysis to compare interventions for smoking cessation. Bayesian approaches apply an empirically based inverse‐gamma(1.39, 0.13) prior for τ 2 in a non‐pharmacological meta‐analysis with a subjective outcome.
|
|
| (95 |
| (95 |
| (95 |
| (95 |
|---|---|---|---|---|---|---|---|---|
| Conventional (DL estimation) | 0.46 | ( − 0.24, 1.16) | 0.70 | (0.27, 1.12) | 0.98 | (0.13, 1.84) | 0.60 | (0.21, 1.32)1 |
| Data augmentation by DL | 0.35 | ( − 0.03, 0.72) | 0.59 | (0.38, 0.80) | 0.85 | (0.34, 1.37) | 0.09 | (0.19, 1.07)1 |
| Data augmentation by ML | 0.42 | ( − 0.13, 0.96) | 0.64 | (0.32, 0.97) | 0.92 | (0.23, 1.61) | 0.31 | (0.19, 1.07)1 |
| Data augmentation by REML | 0.42 | ( − 0.14, 0.99) | 0.65 | (0.31, 1.00) | 0.93 | (0.22, 1.64) | 0.35 | (0.19, 1.07)1 |
| MCMC | 0.43 | ( − 0.16, 1.05)2 | 0.66 | (0.31, 1.04)2 | 0.94 | (0.20, 1.70)2 | 0.37 | (0.18, 0.83)2 |
The confidence interval for τ 2 is obtained iteratively via the Q‐profile method 23.
Posterior medians and 95% credible intervals are reported for the log odds ratios μ ,μ and μ and for the common heterogeneity variance τ 2.
DL, DerSimonian and Laird; ML, maximum likelihood; REML, restricted maximum likelihood; MCMC, Markov chain Monte Carlo;