| Literature DB >> 32380948 |
Abstract
BACKGROUND: We aimed to assess the magnitude of interactions in costs, quality-adjusted life-years (QALYs) and net benefits within a sample of published economic evaluations of factorial randomised controlled trials (RCTs), evaluate the impact that different analytical methods would have had on the results and compare the performance of different criteria for identifying which interactions should be taken into account.Entities:
Keywords: Cost-effectiveness analysis; Economic evaluation; Factorial randomised controlled trial; Interactions; Simulation; Systematic review
Mesh:
Year: 2020 PMID: 32380948 PMCID: PMC7203889 DOI: 10.1186/s12874-020-00978-0
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Flow diagram showing study identification
Characteristics of the studies meeting inclusion criteria
| Completed studies | Protocols and terminated studies | Overall | |||
|---|---|---|---|---|---|
| Publication yeara | 2009 | 18% (7/40) | 50% (7/14) | 26% (14/54) | |
| 2008 | 10% (4/40) | 14% (2/14) | 11% (6/54) | ||
| 2007 | 5% (2/40) | 29% (4/14) | 11% (6/54) | ||
| 2006 | 10% (4/40) | 0% (0/14) | 7% (4/54) | ||
| 2005 | 15% (6/40) | 0% (0/14) | 11% (6/54) | ||
| 2000–2004 | 28% (11/40) | 7% (1/14) | 6% (3/54) | ||
| Pre-2000 | 15% (6/40) | 0% (0/14) | 9% (5/54) | ||
| Mainly/entirely publicly funded | 75% (30/40) | 100% (14/14) | 81% (44/54) | ||
| Country where research done | United Kingdom | 28% (11/40) | 43% (6/14) | 31% (17/54) | |
| United States | 38% (15/40) | 7% (1/14) | 30% (16/54) | ||
| Other individual country | 23% (9/40) | 29% (4/14) | 24% (13/54) | ||
| Multinational | 13% (5/40) | 21% (3/14) | 15% (8/54) | ||
| Disease area | Cardiovascular disease | 45% (18/40) | 0% (0/14) | 33% (18/54) | |
| Musculoskeletal | 13% (5/40) | 14% (2/14) | 13% (7/54) | ||
| Cancer | 15% (6/40) | 0% (0/40) | 11% (6/54) | ||
| Smoking, drugs or alcohol | 13% (5/40) | 29% (4/14) | 17% (9/54) | ||
| Other | 25% (10/40) | 57% (8/14) | 33% (18/54) | ||
| Factors target different diseases | 3% (1/40) | 14% (2/14) | 6% (3/54) | ||
| Type of interventionb | Pharmaceutical | 45% (18/40) | 36% (5/14) | 43% (23/54) | |
| Physiotherapy, exercise and related | 15% (6/40) | 29% (4/14) | 19% (10/54) | ||
| Nutrition | 8% (3/40) | 14% (2/14) | 9% (5/54) | ||
| Surgery | 13% (5/40) | 7% (1/14) | 11% (6/54) | ||
| Training, counselling, incentives and logistical interventions | 50% (20/40) | 43% (6/14) | 48% (26/54) | ||
| Sample sizec | Mean (range) | 3420 (126, 20,536) | 1371 (132, 4733) | 3002 (126, 20,536) | |
| < 200 patients | 13% (5/39) | 7% (1/14) | 11% (6/53) | ||
| 200–999 patients | 38% (15/39) | 36% (5/14) | 38% (20/53) | ||
| > 1000 patients | 49% (19/39) | 29% (4/14) | 43% (23/53) | ||
| Type of economic evaluation | Cost-effectiveness analysis | 68% (27/40) | 0% (0/14) | 50% (27/54) | |
| Cost-effectiveness and cost-utility analysis | 18% (7/40) | 43% (6/14) | 24% (13/54) | ||
| Cost-utility analysis | 8% (3/40) | 29% (4/14) | 13% (7/54) | ||
| Cost-consequence analysis | 8% (3/40) | 0% (0/14) | 6% (3/54) | ||
| Cost-utility and cost-benefit analysis | 0% (0/40) | 7% (1/14) | 2% (1/54) | ||
| Depends on results | 0% (0/40) | 7% (1/14) | 2% (1/54) | ||
| Not stated | 0% (0/40) | 14% (2/14) | 4% (2/54) | ||
| Cluster-randomised | 13% (5/40) | 43% (6/14) | 20% (11/54) | ||
| Type of design | Full factorial | 88% (35/40) | 86% (12/14) | 87% (47/54) | |
| Partial factorial | 10% (4/40) | 7% (1/14) | 9% (5/54) | ||
| Incomplete factorial | 3% (1/40) | 7% (1/14) | 4% (2/54) | ||
| Factorial matrix size | 2 factors & 2 levels2 (e.g. 2 × 2, 2 × 2 + 1, or double 2 × 2) | 78% (31/40) | 71% (10/14) | 76% (41/54) | |
| > 2 factors each with 2 levels (e.g. 2x2x2) | 10% (4/40) | 0% (0/14) | 7% (4/54) | ||
| 2 factors with > 2 levels on ≥1 factor (e.g. 3 × 2) | 10% (4/40) | 21% (3/14) | 13% (7/54) | ||
| > 2 factors with > 2 levels on ≥1 factor (e.g. 3x2x2) | 3% (1/40) | 7% (1/14) | 4% (2/54) | ||
| Primary clinical endpoint | Assumed no interaction | 53% (21/40) | 29% (4/14) | 46% (25/54) | |
| Included interaction term | 23% (9/40) | 7% (1/14) | 19% (10/54) | ||
| Unclear | 25% (10/40) | 64% (9/14) | 35% (19/54) | ||
| Base case economic evaluation | Assumed no interaction | 40% (16/40) | 7% (1/14) | 31% (17/54) | |
| Included interaction term | 53% (21/40) | 0% (0/14) | 39% (21/54) | ||
| Unclear | 8% (3/40) | 93% (13/14) | 30% (16/54) | ||
| Presentation of uncertainty | None | 48% (19/40) | – | 48% (19/40) | |
| Present using confidence intervals and/or scatter graphs | 15% (6/40) | – | 15% (6/40) | ||
| Cost-effectiveness acceptability curves | Pair-wise comparison(s) | 23% (9/40) | – | 23% (9/40) | |
| Multiple comparison(s) | 8% (3/40) | – | 8% (3/40) | ||
| Pair-wise and multiple comparison(s) | 8% (3/40) | – | 8% (3/40) | ||
a Completed studies are categorised by the year when the first paper describing the results of the economic evaluation was published, while protocols are categorised by the date the protocol was published
b Numbers add up to > 100% since 16 studies/protocols evaluated two or more different types of intervention
c Excludes one cluster-randomised study for which the number of patients was not reported
Magnitude of interactions for the 16 studies reporting mean costs and mean health benefits for each cell within the factorial design
| Interaction: effect ratio | Proportion of interactions in different categories | |||
|---|---|---|---|---|
| Cost | Health benefit | Net monetary benefit | ||
| Qualitative | <−1 | 16.7% (4/24) | 33.3% (8/24) | 29.2% (7/24) |
| Sub-additive | Between −1 and 0 | 41.7% (10/24) | 25.0% (6/24) | 25.0% (6/24) |
| Mixed non-qualitativeb | 4.2% (1/24) | 12.5% (3/24) | 12.5% (3/24) | |
| Additive | 0 | 0.0% (0/24) | 8.3% (2/24) | 0.0% (0/24) |
| Super-additive | > 0 | 37.5% (9/24) | 20.8% (5/24) | 33.3% (8/24) |
| Interaction:effect ratio > 1 or < −1 | 29% (7/24) | 38% (9/24) | 38% (9/24) | |
| Range of interaction:effect ratios | -2, 232 | −23, 2 | −44, 13 | |
a A ceiling ratio of £20,000 (or $20,000 or €20,000) was used for quality-adjusted life-years (QALYs) [14] and life-years gained, while a ceiling ratio of £5000 (or $5000 or €5000) per unit of benefit was arbitrarily used for all other health benefits
b Mixed interaction: one factor increases the outcome of interest, while the other decreases it. The interaction therefore cannot be classified as either sub-additive or super-additive
List of the criteria for determining which interactions are taken into account that were evaluated in the study
| Name of criterion | Rationale | Details of how it was applied | |
|---|---|---|---|
| 1 | Always include all interactions | Sometimes referred to as “never pool” [ | Interactions were included in analyses on all trial samples. |
| 2 | Never include any interactions | Sometimes referred to as “always pool” [ | No interactions were included analyses on any trial samples. |
| 3 | Include interactions where p < 0.05 | Reflects standard practice for clinical endpoints, where only interactions that are statistically significant in an initial test are included in the main analysis [ | Interaction for cost [or benefits] was included if it was statistically significantly different from zero in the model that included interactions for both costs and benefits. |
| 4 | Include interactions where | ||
| 5 | Include interactions where | ||
| 6 | Include interactions decreasing AIC | Information criteria trade efficiency against bias, taking account of sample size [ | Results are based on the mixed model with lowest AIC/BIC. |
| 7 | Include interactions decreasing BIC | ||
| 8 | Include qualitative interactions in cost or benefits | Interactions that change the ranking of treatments for cost or benefits may also have a high chance of changing the ranking of treatments for net benefits and therefore could also change the conclusions. This approach is simpler to implement than the criteria based on interactions for net benefit as it does not depend on the ceiling ratio. However, at ceiling ratios other than zero and infinity, the conclusions of economic evaluation could be sensitive to interactions even if this criterion does not pick up qualitative interactions for either costs or benefits. | Includes interactions for cost [benefits] that change rankings of treatments for cost [benefits]: i.e. those that are larger than and have the opposite sign from one or both of the simple effects (which will have interaction:effect ratios <− 1). |
| 9 | Include interactions for cost or benefits if >simple effect | This criterion includes super-additive interactions for cost or benefits that are larger than as the smaller of the two simple effects, as well as the qualitative interactions included in criterion 8. However, like 8, it may not identify all qualitative interactions for net benefit. | All interactions with an absolute magnitude larger than the smaller of the two simple effects (i.e. all those with interaction:effect ratios <− 1 or > 1) are included. |
| 10 | Include interactions for cost or benefits if p < 0.05 or > simple effect | This approach takes account of statistical significance and interactions that are larger than main effects. | As for 9, but also including smaller interactions that are statistically significantly different from zero. |
| 11 | Include qualitative interactions for cost, benefits or NMB | Allowing for interactions will have no effect on the conclusions about which treatment is adopted unless the interactions are qualitative on a NMB scale (i.e. change the ranking of treatments) at the ceiling ratio(s) of interest. However, since the true shadow price of a QALY is unknown, this approach requires arbitrary judgements about the ceiling ratio(s) at which the interactions are assessed. Including all interactions that are qualitative at any ceiling ratio would generally result in inclusion of all interactions, since any quantitative interaction in either costs or QALYs will produce a qualitative interaction in NMB at some ceiling ratio whenever the treatment lies in the north-east or south-west quadrants [ | Interactions were calculated at a series of 8 evenly-spaced ceiling ratios between £5000 and £40,000 per unit benefit using the coefficients for all 4 mixed models. Interactions for costs [or benefits] were included if the interaction for cost [or benefits] was qualitative, or if the coefficients from the mixed model that included an interaction for cost [or benefits] but not benefits [or cost] produced a qualitative interaction for NMB at any of the ceiling ratios. |
| 12 | Include interactions for cost, benefit or NMB if >simple effect | Includes all qualitative interactions in cost, benefits or NMB and any super-additive interactions that are larger than the smaller of the two simple effects. Calculated as for 11, but also including large super-additive interactions. | |
| 13 | Include |interactions| ≥0.25 or ≥ £250 | An absolute limit for the size of interaction that can safely be ignored could be pre-specified. However, there is no general rule for how large this limit should be and it may vary between applications. The size thresholds used were chosen arbitrarily. | Only interactions above the designated size threshold were taken into account. For example, criterion 13 includes interactions in benefits that are ≥0.25 (or ≤ −0.25) units in size and interactions in cost that are ≥£250 (or ≤ −£250) in size. |
| 14 | Include |interactions| ≥0.5 or ≥ £500 | ||
| 15 | Include |interactions| ≥1 or ≥ £1000 |
Abbreviations: AIC Akaike information criterion, BIC Bayesian information criterion, NMB net monetary benefit, QALY quality-adjusted life-year
The measures used to assess performance of the criteria for deciding which interactions are considered
| Measure | Rationale | Details of how it was calculated |
|---|---|---|
| Sensitivity for including non-zero interactions | Sensitivity and specificity evaluate the extent to which criteria identify non-zero interactions, but do not reflect the consequences of ignoring them. | The proportion of samples in which interactions in cost [or benefit] were taken into account in the analysis when the true interaction was not zero |
| Specificity for excluding interactions equal to zero | The proportion of samples in which interactions in cost [or benefit] were excluded from the analysis when the true interaction equalled zero | |
| Probability of adopting treatment with highest NMB | This focuses on the purpose of economic evaluation: namely to inform a treatment adoption decision regarding which treatment has highest expected NMB and to thereby maximise health gain from the budget. It assumes that inference is irrelevant to decision-making [ | The treatment arm with highest expected NMB was identified at the ceiling ratio of interest for (a) the “true” parameters used to generate the data and (b) based on the mixed model coefficients estimated on each sample. The proportion of samples in which the treatment predicted to have highest NMB (b) was the same as the “true” best treatment (a) was calculated for each scenario. |
| Opportunity cost associated with adopting a suboptimal treatment | This measure takes account of the opportunity cost of adopting the wrong treatment, as well as the probability of adopting the wrong treatment [ | For each sample, the opportunity cost was defined as the NMB for the “true” best treatment (a) minus the NMB for the treatment predicted to have highest NMB in that analysis of that sample (b). In both cases, NMB for each treatment was calculated using the “true” parameters used for data generation. Opportunity cost was therefore zero for all samples in which the “true” best treatment was adopted and positive in all other cases. Opportunity cost was then averaged across samples and scenarios. |
Abbreviations: NMB net monetary benefit
Comparison of performance of the different criteria with regards the probability and the opportunity cost associated with adopting a treatment that does not have highest true NMB. The values shown in bold represent the most favourable of all criteria for this measure
| Criterion | Proportion of samples in which interactions are included: % (n) | Sensitivity: proportion of any non-zero interactions taken into account: % (n) | Specificity: proportion of interactions equal to 0 that are excluded: % (n) | Mean opportunity cost of adopting a suboptimal treatment | Probability of adopting treatment with highest NMB |
|---|---|---|---|---|---|
| 1: Always include all interactions | 100.00% (21,600) | 0.00% (0) | £472 | 81.70% | |
| 2: Never include any interactions | 0.00% (0) | 0.00% (0) | £793 | 76.60% | |
| 3: Include interactions where p < 0.05 | 34.44% (7439) | 46.59% (6709) | 89.86% (6470) | £556 | 80.61% |
| 4: Include interactions where p < 0.10 | 44.86% (9689) | 56.87% (8189) | 79.17% (5700) | £515 | 81.32% |
| 5: Include interactions where p < 0.25 | 68.45% (14,785) | 77.18% (11,114) | 49.01% (3529) | £491 | 81.58% |
| 6: Include interactions decreasing AIC | 40.74% (8800) | 52.91% (7619) | 83.60% (6019) | £529 | 81.11% |
| 7: Include interactions decreasing BIC | 19.79% (4275) | 28.88% (4158) | 98.38% (7083) | £646 | 78.32% |
| 8: Include qualitative interactions in cost or benefits | 42.02% (9076) | 47.54% (6846) | 69.03% (4970) | £503 | 81.71% |
| 9: Include interactions for cost or benefits if >simple effect | 56.04% (12,104) | 63.77% (9183) | 59.43% (4279) | £474 | |
| 10: Include interactions for cost or benefits if p < 0.05 or > simple effect | 61.78% (13,344) | 71.42% (10,285) | 57.51% (4141) | £475 | 83.14% |
| 11: Include qualitative interactions for cost, benefits or NMB | 57.01% (12,314) | 60.76% (8749) | 50.49% (3635) | £480 | 81.21% |
| 12: Include interactions for cost, benefit or NMB if >simple effect | 67.85% (14,656) | 73.40% (10,569) | 43.24% (3113) | £475 | 81.55% |
| 13: Include interactions ≥0.25 or ≥ £250 | 31.92% (6894) | 37.02% (5331) | 78.29% (5637) | 81.22% | |
| 14: Include interactions ≥0.5 or ≥ £500 | 21.73% (4694) | 24.69% (3555) | 84.18% (6061) | £476 | 82.33% |
| 15: Include interactions ≥1 or ≥ £1000 | 16.65% (3596) | 18.51% (2666) | 87.08% (6270) | £488 | 82.01% |
Abbreviations: AIC Akaike information criterion, BIC Bayesian information criterion, NMB net monetary benefit, QALY quality-adjusted life-year
a Percentages are out of 21,600 (300 samples of 6 trials, each with six scenarios across two endpoints [costs and QALYs])
b Percentages are out of 14,400 (300 samples of 6 trials, each with four scenarios with non-zero interactions across two endpoints [costs and QALYs])
c Percentages are out of 7200 (300 samples of 6 trials, each with two scenarios with zero interactions across two endpoints [costs and QALYs])