| Literature DB >> 34580839 |
Daniel Gallacher1, Peter Kimani2, Nigel Stallard2.
Abstract
INTRODUCTION: Time-to-event data from clinical trials are routinely extrapolated using parametric models to estimate the cost effectiveness of novel therapies, but how this approach performs in the presence of heterogeneous populations remains unknown.Entities:
Mesh:
Year: 2021 PMID: 34580839 PMCID: PMC8738626 DOI: 10.1007/s40273-021-01082-x
Source DB: PubMed Journal: Pharmacoeconomics ISSN: 1170-7690 Impact factor: 4.981
Simulation summary presented according to ADEMP guidelines
| ADEMP category | Response |
|---|---|
| Aims | To investigate the performance of commonly used methods of estimating treatment efficacy in life-years in heterogeneous populations across a range of scenarios varying combinations of treatment efficacy and patient prognosis using follow-up typical of a clinical trial |
| Data-generating mechanism | Data were repeatedly sampled from exponential distributions representing the survival of patients belonging to either a subgroup or a complement population and being part of either a treatment or a control group. Censoring times were sampled from a Gompertz distribution |
| Methods | Patients were censored if their censoring time occurred before their event time. Eight parametric curves were fitted to each set of simulated data, and their life-year estimates and goodness-of-fit statistics were captured across seven scenarios. Different methods of obtaining a preferred estimate of life-years were compared. 10,000 simulations of each scenario were run, calculated assuming a variance of 0.25 and desired Monte Carlo standard error of 0.005 for bias, with each scenario taking approximately 10 h. A seed was used for reproducibility |
| Estimand | Predicted life-years (or restricted mean survival time), hazard ratio |
| Performance measures | Bias, empirical standard error, mean-squared error, Monte Carlo standard error |
ADEMP aims, data-generating mechanisms, estimands, methods, and performance measures
Description of scenarios and summary of parameters
| Scenario number and description | HR in subgroup | HR in complement | LYs in intervention subgroup (HR) | LYs in intervention complement (HR) | LYs in control subgroup (HR) | LYs in control complement (HR) |
|---|---|---|---|---|---|---|
| Scenario 0: treatment is effective in whole population, no differences between subgroup and complement | 0.70 | 0.70 | 5.68 ( | 5.68 ( | 4.00 ( | 4.00 ( |
| Scenario 1: subgroup and complement have same prognosis, effective only in subgroup | 0.70 | 1.00 | 5.68 ( | 4.00 ( | 4.00 ( | 4.00 ( |
| Scenario 2: subgroup has worse prognosis, effective only in subgroup | 0.70 | 1.00 | 2.86 ( | 4.00 ( | 2.00 ( | 4.00 ( |
| Scenario 3: subgroup has worse prognosis, same HR of effect in both subgroup and complement | 0.70 | 0.70 | 2.86 ( | 5.68 ( | 2.00 ( | 4.00 ( |
| Scenario 4: subgroup has worse prognosis, effective only in complement | 1.00 | 0.70 | 2.00 ( | 5.68 ( | 2.00 ( | 4.00 ( |
| Scenario 5: subgroup has worse prognosis, stronger effect in subgroup than in complement | 0.70 | 0.80 | 2.86 ( | 5.00 ( | 2.00 ( | 4.00 ( |
| Scenario 6: subgroup has worse prognosis, effective in subgroup, slight negative effect in complement | 0.70 | 1.10 | 2.86 ( | 3.63 ( | 2.00 ( | 4.00 ( |
HR hazard ratio, LYs life-years
Fig. 1Example Kaplan–Meier plot of subgroup, complement and combined data for where the treatment is only effective in the subgroup and subgroup has a worse prognosis (scenario 2)
Fig. 2Example of parametric curves failing to predict true survival for a heterogeneous population. Here λ1 = 0.5 and λ2 = 0.25
Incremental life-year estimates and cost difference for each method across the scenarios
| Scenario description | AIC—independent | BIC—independent | AIC—combined | BIC—combined | Average of all models | Complete KM follow-up | Percentage of simulations with significant treatment effecta or interactionb |
|---|---|---|---|---|---|---|---|
| Scenario 0: treatment is effective in whole population, no differences between subgroup and complement (true ILY 1.69) | 1.67 (− 0.02, − 1%) 27% | 1.65 (− 0.03, − 2%) 27% | 1.64 (− 0.04, − 3%) 39% | 95.0%a 4.8%b | |||
| Scenario 1: Subgroup and complement have same prognosis, effective only in subgroup (true ILY 0.84) | 0.74 (− 0.10, − 12%) 15% | 0.74 (− 0.11, − 13%) 15% | 0.83 (− 0.02, − 2%) 23% | 42.6%a 46.1%b | |||
| Scenario 2: subgroup has worse prognosis, effective only in subgroup (true ILY 0.43) | 0.46 (+ 0.04, + 8%) 9% | 0.55 (+ 0.12, + 29%) 11% | 0.52 (+ 0.09, + 22%) 11% | 0.56 (+ 0.13, + 30%) 12% | 0.43 (+ 0.00, + 0%) 16% | 56.5%a 54.7%b | |
| Scenario 3: subgroup has worse prognosis, same hazard ratio of effect in both subgroup and complement (true ILY 1.27) | 1.15 (− 0.13, − 10%) 24% | 1.16 (− 0.11, − 9%) 30% | 1.16 (− 0.11, − 9%) 30% | 1.14 (− 0.14, − 11%) 24% | 1.26 (− 0.01, − 1%) 38% | 97.8%a 5.2%b | |
| Scenario 4: subgroup has worse prognosis, effective only in complement (true ILY 0.84) | 0.57 (− 0.28, − 33%) 11% | 0.65 (− 0.19, − 23%) 11% | 0.54 (− 0.30, − 36%) 10% | 0.61 (− 0.23, − 28%) 15% | 0.83 (− 0.02, − 2%) 27% | 43.7%a 52.9%b | |
| Scenario 5: subgroup has worse prognosis, stronger effect in subgroup than in complement (true ILY 0.92) | 0.88 (− 0.04, − 5%) 21% | 0.93 (+ 0.01, + 1%) 26% | 0.87 (− 0.05, − 6%) 20% | 0.92 (− 0.01, − 1%) 29% | 91.4%a 11.8%b | ||
| Scenario 6: subgroup has worse prognosis, effective in subgroup, slight negative effect in complement (true ILY 0.25) | 0.40 (+ 0.15, + 62%) 7% | 0.36 (+ 0.11, + 45%) 6% | 0.41 (+ 0.16, + 65%) 7% | 0.32 (+ 0.07, + 27%) 7% | 0.25 (+ 0.00, + 1%) 9% | 35.6%a 75.8%b |
Data are presented as mean incremental life-years (difference, % difference) % within 10% unless otherwise indicated
Bold indicates model with the least biased average estimate of ILY
AIC Akaike information criterion, BIC Bayesian information criterion, ILY incremental life-years
aSignificant if p < 0.05 for treatment effect hazard ratio in a Cox proportional hazards model
bSignificant if p < 0.05 for subgroup-treatment effect interaction hazard ratio in a Cox proportional hazards model
Fig. 3Violin plot of difference in life-year estimation for each method, by arm where estimates for the intervention are on the left of each violin, and controls are on the right. Dashed line indicates the mean, occasionally distinguishable from the solid median line. AIC Akaike information criterion, BIC Bayesian information criterion, LY life-year, sig. significant
Life-year estimates for variations of scenario 2
| Scenario description | AIC—independent | BIC—independent | AIC—combined | BIC—combined | Average of all models | Complete KM follow-up | Percentage of simulations with significant treatment effecta or interactionb |
|---|---|---|---|---|---|---|---|
| Scenario 2: original (true ILY 0.43) | 0.46 (+ 0.04, + 8%) 9% | 0.55 (+ 0.12, + 29%) 11% | 0.52 (+ 0.09, + 22%) 11% | 0.56 (+ 0.13, + 30%) 12% | 0.43 (+ 0.00, + 0%) 16% | 56.5%a 54.7%b | |
| Scenario 2: increased sample size (true ILY 0.43) | 0.55 (+ 0.12, + 28%) 14% | 0.47 (+ 0.05, + 11%) 13% | 0.56 (+ 0.13, + 31%) 14% | 0.43 (0.00, 0%) 22% | 86.1%a 83.8%b | ||
| Scenario 2: separate models for subgroup and complement if significant interaction is detected (true ILY 0.43) | 0.43 (+ 0.00, + 0%) 10% | 0.46 (+ 0.04, + 8%) 12% | 0.45 (+ 0.02, + 5%) 11% | 0.46 (+ 0.04, + 8%) 12% | 0.42 ( 11% | 0.43 (+ 0.00, + 0%) 16% | 56.5%a 54.7%b |
| Scenario 2: increased sample size and separate models for subgroup and complement if significant interaction is detected (true ILY 0.43) | 0.42 ( 14% | 0.43 (+ 0.01, + 1%) 16% | 0.42 ( 15% | 0.44 (+ 0.01, + 2%) 16% | 0.39 ( 14% | 0.43 (+ 0.00, + 0%) 23% | 85.9%a 84.1%b |
Data are presented as mean incremental life-years (difference, % difference) % within 10% unless otherwise indicated
Bold indicates model with the least biased average estimate of ILY
AIC Akaike information criterion, BIC Bayesian information criterion, ILY incremental life-years
a Significant if p < 0.05 for treatment effect hazard ratio in a Cox proportional hazards model
bSignificant if p < 0.05 for subgroup-treatment effect interaction hazard ratio in a Cox proportional hazards model
Fig. 4Violin plot of difference in life-year estimation for each method for variations of scenario 2. Estimates for the intervention are on the left of each violin, and controls are on the right. Dashed line indicates the mean, occasionally distinguishable from the solid median line. AIC Akaike information criterion, BIC Bayesian information criterion, LY life-year, sig. significant
| Heterogeneity in time-to-event data may not be identified in current health technology appraisals and may result in biased estimates of treatment benefit that would affect prices and patient access to therapy. |
| Heterogeneity should be considered. Methods such as averaging across plausible models, encouraging larger trial populations, and accounting for detectable heterogeneity may reduce the bias associated with heterogeneity compared with current methods used in health technology appraisals undertaken by the UK National Institute for Care and Excellence. |