| Literature DB >> 24992266 |
Georgia Salanti1, Cinzia Del Giovane2, Anna Chaimani1, Deborah M Caldwell3, Julian P T Higgins4.
Abstract
Systematic reviews that collate data about the relative effects of multiple interventions via network meta-analysis are highly informative for decision-making purposes. A network meta-analysis provides two types of findings for a specific outcome: the relative treatment effect for all pairwise comparisons, and a ranking of the treatments. It is important to consider the confidence with which these two types of results can enable clinicians, policy makers and patients to make informed decisions. We propose an approach to determining confidence in the output of a network meta-analysis. Our proposed approach is based on methodology developed by the Grading of Recommendations Assessment, Development and Evaluation (GRADE) Working Group for pairwise meta-analyses. The suggested framework for evaluating a network meta-analysis acknowledges (i) the key role of indirect comparisons (ii) the contributions of each piece of direct evidence to the network meta-analysis estimates of effect size; (iii) the importance of the transitivity assumption to the validity of network meta-analysis; and (iv) the possibility of disagreement between direct evidence and indirect evidence. We apply our proposed strategy to a systematic review comparing topical antibiotics without steroids for chronically discharging ears with underlying eardrum perforations. The proposed framework can be used to determine confidence in the results from a network meta-analysis. Judgements about evidence from a network meta-analysis can be different from those made about evidence from pairwise meta-analyses.Entities:
Mesh:
Year: 2014 PMID: 24992266 PMCID: PMC4084629 DOI: 10.1371/journal.pone.0099682
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Network of topical antibiotics without steroids for chronically discharging ears.
Edges are weighted according to the inverse of the variance of the direct summary ln(OR) (presented along the edges) and nodes are weighted according to the number of studies.
Summary information from direct comparisons of topical antibiotics without steroids for chronically discharging ears.
| Comparison | No. studies | Direct evidenceOR (95% CI) | Variance of ln(OR) | I2 (p-value) | τ2 |
| AB: Quinolone antibiotic vs no treatment | 2 | 0.09 (0.01, 0.51) | 0.83 | 69% (0.07) | 1.22 |
| AD: Antiseptic vs no treatment | 1 | 1.42 (0.65, 3.09) | 0.16 | NE | NE |
| BC: Non-quinolone antibiotic vs quinolone antibiotic | 7 | 1.46 (0.80, 2.67) | 0.10 | 48% (0.07) | 0.31 |
| BD: Antiseptic vs quinolone antibiotic | 5 | 3.47 (1.71, 7.07) | 0.13 | 66% (0.02) | 0.39 |
| CD: Antiseptic vs non-quinolone antibiotic | 4 | 1.69 (0.59, 4.83) | 0.28 | 67% (0.03) | 0.75 |
Number of studies, the direct evidence from pairwise meta-analysis (OR, 95% confidence interval and variance) and information about heterogeneity (I2 and heterogeneity variance τ2).
NE: Not estimable.
Figure 2Contributions matrix: percentage contribution of each direct estimate to the network meta-analysis estimates.
Rows correspond to network meta-analysis ORs (separated for mixed and indirect evidence) and columns correspond to direct meta-analysis ORs. The contribution of each direct comparison to the total network evidence that provides the ranking of the treatments is presented separately (row named Entire network). The sizes of the boxes are proportional to the percentage contribution of each direct estimate to the network meta-analysis estimates (rows 1–6) and to the entire network (row 7). The last row shows the number of included direct comparisons. The names of the treatments are given in Figure 1.
Figure 3Study limitations for each network estimate for pairwise comparisons of topical antibiotics.
Calculations are based on the contributions of direct evidence. The colours represent the risk of bias (green: low, yellow: moderate, red: high). The initial judgements about the risk of bias in the direct estimates are shown on the right side of the figure (there is no direct evidence for AC). The names of the treatments are given in Figure 1.
Figure 4Network estimates of mean ORs, their 95% confidence intervals and 95% predictive intervals (red extensions).
The names of the treatments are given in Figure 1.
Figure 5Study limitations weighted by contribution of direct estimates to the network of topical antibiotics.
The colours represent the risk of bias (green: low, yellow: moderate, red: high). The names of the treatments are given in Figure 1.
Hypothetical (extreme) examples for ranking treatments with maximum (left) and minimal (right) imprecision.
| Network ranking with high imprecision | Network ranking with high precision | |||||||
|
|
|
|
|
|
|
|
| |
| Best | 28 | 24 | 24 | 24 | 97 | 1 | 1 | 1 |
| Second | 24 | 28 | 24 | 24 | 1 | 97 | 1 | 1 |
| Third | 24 | 24 | 28 | 24 | 1 | 1 | 97 | 1 |
| Last | 24 | 24 | 24 | 28 | 1 | 1 | 1 | 97 |
The entries in the table are the probabilities (as a percentage) of each treatment achieving each possible rank.
Figure 6Rankograms for topical antibiotics without steroids for chronically discharging ears.
On the horizontal axes are the possible ranks and on the vertical axis the probability that each treatment achieves each rank.
Figure 7Comparison-adjusted funnel plot for the network of topical antibiotics without steroids for chronically discharging ears.
Each observation is the difference between a study estimate and its direct meta-analysis mean effect. Studies on the right hand side ‘overestimate’ the effect of newer treatments.
Summary of domain assessment for evaluating the quality of evidence from a network meta-analysis: Procedures for a pairwise effect estimate and overall ranking.
| Evaluate the confidence in a specific pairwise effect estimated in network meta-analysis | |||
| GRADE domain | Domain assessment in NMA | Description of procedure | Instructions for downgrading |
|
| Study limitations | Determine which directcomparisons contribute toestimation of the NMAtreatment effect | Use standard GRADE considerations to inform judgment. |
|
| Joint considerationof indirectnessandintransitivity | Evaluate indirectness ofpopulations, interventions andoutcomes as in standardGRADE. Evaluate transitivityby comparing the distributionof known effect modifiersacross comparisons thatcontribute evidence toestimation of the NMAtreatment effect | If |
|
| Joint considerationof statisticalheterogeneityand statisticalinconsistency | (a) Judge the extent ofheterogeneity, considering thecomparison-specificheterogeneity variance, theNMA estimate of variance, aprediction interval and/or otherrelevant metrics such as I2 . (b)Evaluate the extent to which thecomparison under evaluationis involved in inconsistentloops of evidence. | (a) If important heterogeneity is found, downgrade. If heterogeneity is low do not downgrade. (b) Power to detect inconsistency may be low; Downgrade in absence of statistical evidence for inconsistency when direct and indirect estimates imply different clinical decisions. |
|
| Imprecision | Focus on width of theconfidence interval. | Assess uncertainty around the pairwise estimate. Downgrade if confidence interval crosses null value or includes values favoring either treatment). |
|
| Publicationbias | Non-statistical considerationof likelihood of non-publication ofevidence that would inform thepairwise comparison. Plot pairwiseestimates on contour-enhancedfunnel plot. | Use standard GRADE to inform judgment. |
|
| |||
|
|
|
|
|
|
| Studylimitations | Integrate risk of bias assessmentsfrom each direct comparison toformulate a | Use standard GRADEconsiderations to informjudgment. |
|
| Joint considerationof indirectnessand intransitivity | Evaluate indirectness of populations,interventions and outcomes as instandard GRADE. Evaluatetransitivity across network bycomparing the distribution of knowneffect modifiers acrosscomparisons. | If |
|
| Jointconsiderationof statisticalheterogeneityand statisticalinconsistency | (a) Judge the extent of heterogeneityconsidering primarily the NMAvariance estimate(s) used and othernetwork-wise metrics such as Q forheterogeneity in a network (b)Evaluate inconsistency in networkusing statistical methods (such as global tests of inconsistency, orglobal inconsistency parameter). | (a) If important heterogeneity is found, downgrade. If heterogeneity is low do not downgrade. (b) For overall treatment rankings, inconsistency should be given greater emphasis, since ranks are based on mean effects and the uncertainty they are estimated with. Downgrade in absence of statistical evidence for inconsistency when several direct and indirect estimates imply different clinical decisions. |
|
| Imprecision | Visually examine rankingprobabilities (e.g. rankograms) foroverlap to assess precision oftreatment rankings | If probabilities are similarly distributed across the ranks, downgrade for imprecision. |
|
| Publicationbias | Non-statistical consideration oflikelihood of non-publication foreach pairwise comparison. Ifappropriate, plot NMA estimateson a comparison adjusted funnelplot and assess asymmetry. | As asymmetry does not provideconcrete evidence of publication bias, downgrading should only be considered jointly with the non |
When integrating assessments about direct comparisons into a judgement about an NMA treatment effect or the ranking, more weight should be given to assessments from direct comparisons that contribute more information. We recommend use of the contributions matrix to quantify how much information each direct comparison contributes to the estimation of the NMA treatment effect under evaluation or the ranking.
Summary of our confidence in effect estimates and ranking of treatments.
| Comparison | Nature of the evidence | Confidence | Downgrading due to |
| AB: Quinolone antibiotic vs no treatment | Mixed | Low | Study limitations |
| AC: Non-quinolone antibiotic vs no treatment | Indirect | Low | Study limitations |
| AD: Antiseptic vs no treatment | Mixed | Very low | Study limitations |
| BC: Non-quinolone antibiotic vs quinolone antibiotic | Mixed | Very low | Study limitations |
| BD: Antiseptic vs quinolone antibiotic | Mixed | Moderate | Inconsistency |
| CD: Antiseptic vs non-quinolone antibiotic | Mixed | Very low | Study limitations |
|
|
|
|
Dominated by evidence at high or moderate risk of bias.
No convincing evidence for the plausibility of the transitivity assumption.
Predictive intervals for treatment effect include effects that would have different interpretations (there is additionally no convincing evidence for the plausibility of the transitivity assumption).
Confidence intervals include values favouring either treatment.
60% of the information is from studies at moderate risk of bias.
Moderate level of heterogeneity, and some evidence of inconsistency in the network.