| Literature DB >> 33314001 |
Anthony J Hatswell1,2, Ash Bullement3, Michael Schlichting4, Murtuza Bharmal5.
Abstract
BACKGROUND: Health state utility values ('utilities') are an integral part of health technology assessment. Though traditionally categorised by disease status in oncology (i.e. progression), several recent assessments have adopted values calculated according to the time that measures were recorded before death. We conducted a simulation study to understand the limitations of each approach, with a focus on mismatches between the way utilities are generated, and analysed.Entities:
Year: 2020 PMID: 33314001 PMCID: PMC8060240 DOI: 10.1007/s40258-020-00620-6
Source DB: PubMed Journal: Appl Health Econ Health Policy ISSN: 1175-5652 Impact factor: 2.561
Setup of the simulation study base case and scenarios
| Characteristic | Base-case value | Rationale | Scenario analysis value(s) |
|---|---|---|---|
| Study design settings | |||
| Number of patients in study | 300 | 300 patients is approximately what has been seen in immune-oncology studies to date (though this does vary) | 150 (scenario 1) 500 (scenario 2) |
| Cohort age, years | 65 | The approximate age of patients enrolled in to contemporary immunotherapy studies | 55 (scenario 3) 75 (scenario 4) |
| Male: female ratio | 1:1 | Although the gender ratio in studies is driven by the prevalence of conditions. In the simulation study, however, this only affects background mortality so is not varied | |
| Utility measurement interval | 120 days | Utilities are usually measured at increasing intervals over time, for simplicity a uniform pattern has been imposed | 90 days (scenario 5) 180 days (scenario 6) |
| Administrative censoring for utility values | 48 months for all patients | Utilities are generally only collected until the end of the study period. A ‘typical’ data collection period has been used, which is varied in sensitivity analysis to include other observation periods seen in trials | 18 months for all patients (scenario 7) 60 months for all patients (scenario 8) Until progression or maximum 60 months (scenario 9) Until 30 days after progression or maximum 60 months (scenario 10) |
| Missing data | 0% | Missing data can be an issue in clinical studies. In the base case this is assumed to be zero, with different mechanisms for missingness explored in sensitivity analysis | 10% of observations MCAR (scenario 11) 10% of patients lost to follow up at a random timepoint (all subsequent data censored; permanent MCAR) (scenario 12) Increasing likelihood of censored values as utility decreases (MNAR) (scenario 13) Censoring probability linked to time to death (scenario 14) |
| Survival simulation | |||
| Ratio of patients exhibiting poor/intermediate/background survival | 13:7:4 | In immune-oncology studies a number of patients have experienced durable survival, this proportion however varies between studies | 13:7:0 (scenario 15)—no long-term survivors 13:7:2 (scenario 16)—a lower rate of long-term survivors 13:7:6 (scenario 17)—a higher rate of long-term survivors |
| Time to progression for patients with poor outcomes (months) | Gamma (shape = 3, scale = 1) | Immuno-oncology studies exhibit a changing hazard over time with a short period on enrollment before many progression and survival events are observed, which decrease in frequency over time, with few being observed beyond 18 months [ | |
| Time to progression for patients with intermediate outcomes (months) | Weibull (shape = 1.3, scale = 8) | ||
| Post-progression survival | Weibull (shape = 1.5, scale = 14) | ||
| Percentage of deaths pre-progression | 20% | ||
| Pseudo-progression | 0% | A known issue with immuno-oncology is that the immune response can lead to swelling, which may be (incorrectly) categorized as disease progression. Whilst new measures have been developed to account for this, the impact is explored in sensitivity analysis where a portion of patients are miscategorized for regressions as having PFS as per the intermediate group | 10% of long-term survivors incorrectly assumed to have progressed in line with the patterns seen for other groups (scenario 18) |
| Link between pre- and post-progression survival | Independent distributions | The assumption is made that response to treatment, and post-progression survival are uncorrelated i.e. patient characteristics are not both predictive and prognostic | A scenario analysis (scenario 19) is presented where simulated post-progression survival is multiplied by 1.25 for long-term survivors, and 0.75 for short-term survivors. This implicitly assumes responders to treatment are healthier patients |
| Utility simulation | |||
| Patient utility distribution before progression or being close to death | Beta ( | In line with the literature on utilities which show reasonably high levels pre-progression, falling on disease progression [ | |
| Progressed utility (in progression scenarios) distribution | Beta ( | ||
| Time at which utility fell before death (in time-to-death scenarios) | Uniform (minimum, 90 days; maximum, 270 days) | Various observations have been reported in the literature, and thus a range is used which varies by scenario | |
| Utility fall before death (distribution) | Normal (mean, 0.5; SD, 0.2) | The absolute fall seen in studies have differed, but all have been substantial | |
MCAR missing completely at random, MNAR missing not at random, N number of patients, OS overall survival, PFS progression-free survival
Fig. 1Example of simulated time to event data compared to published immuno-oncology trials
Fig. 2Visual representation of the generation and analysis of each scenario
Setup of ‘real’ scenarios, mimicking previous immunotherapy studies
| # | Scenario | Scenario analysis value(s) |
|---|---|---|
| A | Ipilimumab in melanoma [ | Age = 57; 59% male; survival plateau = 17% |
| B | Nivolumab in renal cell carcinoma [ | Age = 62; 77% male; survival plateau = 20% |
| C | Pembrolizumab in non-small-cell lung cancer [ | Age = 64; 51% male; survival plateau = 30% |
| D | Atezolizumab in urothelial carcinoma [ | Age = 67; 76% male; survival plateau = 22% |
Scenario analysis results
| Scenario | Utility generation mechanism (UGM) | True QALYs | Utility analysis method (UAM) | |||||
|---|---|---|---|---|---|---|---|---|
| % Mean error (ME) | % Mean absolute error (MAE) | |||||||
| Prog | TTD | Combo | Prog | TTD | Combo | |||
| Base case | Prog-derived | 3.4 | 0 | 0.2 | − 0.7 | 0.4 | 0.5 | 0.7 |
| TTD-derived | 3.3 | − 3.5 | − 0.4 | 5.8 | 3.5 | 0.6 | 5.9 | |
| Combo-derived | 3.1 | − 3.4 | − 0.2 | 5.5 | 3.4 | 0.6 | 5.5 | |
| Scenario 1 | Prog-derived | 3.4 | 0 | 0.2 | − 0.7 | 0.4 | 0.5 | 0.8 |
| TTD-derived | 3.3 | − 3.4 | − 0.4 | 6 | 3.5 | 0.7 | 6 | |
| Combo-derived | 3.1 | − 3.4 | − 0.3 | 5.6 | 3.5 | 0.7 | 5.6 | |
| Scenario 2 | Prog-derived | 3.4 | 0 | 0.2 | − 0.7 | 0.2 | 0.3 | 0.7 |
| TTD-derived | 3.3 | − 3.4 | − 0.4 | 5.8 | 3.4 | 0.6 | 5.9 | |
| Combo-derived | 3.1 | − 3.4 | − 0.3 | 5.4 | 3.5 | 0.5 | 5.5 | |
| Scenario 3 | Prog-derived | 4.5 | 0 | 0.3 | − 0.5 | 0.4 | 0.5 | 0.6 |
| Age = 55 years | TTD-derived | 4.5 | − 3.1 | − 0.4 | 4.3 | 3.1 | 0.7 | 4.3 |
| Combo-derived | 4.2 | − 3.1 | − 0.3 | 4 | 3.1 | 0.7 | 4 | |
| Scenario 4 | Prog-derived | 2.4 | 0 | 0 | − 1 | 0.4 | 0.4 | 1 |
| Age = 75 years | TTD-derived | 2.3 | − 4.6 | − 0.4 | 8.6 | 4.7 | 0.6 | 8.6 |
| Combo-derived | 2.2 | − 4.6 | − 0.3 | 8 | 4.7 | 0.6 | 8.1 | |
| Scenario 5 | Prog-derived | 3.4 | 0 | 0.4 | − 0.7 | 0.3 | 0.5 | 0.7 |
| Utility interval = 90 | TTD-derived | 3.4 | − 5.6 | − 0.4 | 5.3 | 5.6 | 1.1 | 5.5 |
| Combo-derived | 3.2 | − 5.6 | − 0.6 | 4.9 | 5.6 | 0.8 | 5.1 | |
| Scenario 6 | Prog-derived | 3.4 | 0 | 0 | − 0.7 | 0.5 | 0.5 | 0.8 |
| Utility interval = 180 | TTD-derived | 3.3 | − 1.2 | − 0.4 | 6.1 | 1.4 | 0.5 | 6.1 |
| Combo-derived | 3.1 | − 1.2 | − 0.1 | 5.7 | 1.4 | 0.6 | 5.7 | |
| Scenario 7 | Prog-derived | 3.4 | 0 | 0.7 | − 0.7 | 0.5 | 0.8 | 0.8 |
| Length = 18 months | TTD-derived | 3.3 | − 7.4 | − 0.4 | 5 | 7.4 | 1.8 | 5.4 |
| Combo-derived | 3.1 | − 7.7 | − 1 | 4.6 | 7.7 | 1.5 | 5 | |
| Scenario 8 | Prog-derived | 3.4 | 0 | 0.1 | − 0.7 | 0.4 | 0.4 | 0.8 |
| Length = 60 months | TTD-derived | 3.3 | − 2.5 | − 0.4 | 5.9 | 2.5 | 0.5 | 5.9 |
| Combo-derived | 3.2 | − 2.4 | − 0.2 | 5.5 | 2.5 | 0.6 | 5.5 | |
| Scenario 9 | Prog-derived | 3.3 | 0 | 0.1 | − 0.7 | 0.4 | 0.4 | 0.8 |
| Length = 60 months or progression | TTD-derived | 3.3 | − 2.5 | − 0.4 | 5.9 | 2.6 | 0.5 | 5.9 |
| Combo-derived | 3.1 | − 2.5 | − 0.2 | 5.5 | 2.6 | 0.6 | 5.6 | |
| Scenario 10 | Prog-derived | 3.4 | 0 | 0.1 | − 0.7 | 0.4 | 0.4 | 0.8 |
| Length = 60 months or progression + 30 days | TTD-derived | 3.3 | − 2.7 | − 0.4 | 5.9 | 2.7 | 0.5 | 5.9 |
| Combo-derived | 3.1 | − 2.6 | − 0.2 | 5.6 | 2.7 | 0.6 | 5.6 | |
| Scenario 11 | Prog-derived | 3.4 | 0 | 0.1 | − 0.7 | 0.4 | 0.5 | 0.8 |
| Missing data = 10% randomly MCAR | TTD-derived | 3.3 | − 1.8 | − 0.4 | 6 | 1.9 | 0.5 | 6 |
| Combo-derived | 3.1 | − 1.8 | − 0.1 | 5.6 | 1.9 | 0.6 | 5.6 | |
| Scenario 12 | Prog-derived | 3.4 | 0 | 0.2 | − 0.7 | 0.4 | 0.5 | 0.8 |
| Missing data = 10% of patients MCAR | TTD-derived | 3.3 | − 3.7 | − 0.4 | 5.8 | 3.7 | 0.7 | 5.8 |
| Combo-derived | 3.1 | − 3.6 | − 0.3 | 5.4 | 3.7 | 0.7 | 5.4 | |
| Scenario 13 | Prog-derived | 3.4 | 0.2 | 0.4 | − 0.5 | 0.5 | 0.5 | 0.7 |
| Missing data = proportional to utility (MNAR) | TTD-derived | 3.3 | − 2.7 | − 0.4 | 5.9 | 2.7 | 0.6 | 5.9 |
| Combo-derived | 3.2 | − 2.6 | − 0.1 | 5.7 | 2.7 | 0.6 | 5.7 | |
| Scenario 14 | Prog-derived | 3.4 | 0 | 0.2 | − 0.7 | 0.4 | 0.5 | 0.8 |
| Missingness increases closer to death | TTD-derived | 3.4 | − 3 | − 0.4 | 5.8 | 3 | 0.6 | 5.9 |
| Combo-derived | 3.2 | − 2.9 | − 0.2 | 5.4 | 2.9 | 0.6 | 5.5 | |
| Scenario 15 | Prog-derived | 1 | 0 | − 0.5 | − 2.5 | 0.4 | 0.6 | 2.5 |
| No long-term survivors | TTD-derived | 0.8 | − 4.5 | − 0.4 | 27.5 | 4.6 | 1.1 | 27.5 |
| Combo-derived | 0.7 | − 4.6 | − 0.5 | 25 | 4.8 | 1.3 | 25 | |
| Scenario 16 | Prog-derived | 2.2 | 0 | 0.1 | − 1.1 | 0.4 | 0.5 | 1.1 |
| Lower rate of long-term survivors | TTD-derived | 2.1 | − 6.5 | − 0.4 | 8.8 | 6.5 | 1 | 9.1 |
| Combo-derived | 2 | − 6.7 | − 0.7 | 8.1 | 6.7 | 1 | 8.5 | |
| Scenario 17 | Prog-derived | 4.3 | 0 | 0.2 | − 0.6 | 0.5 | 0.5 | 0.7 |
| Higher rate of long-term survivors | TTD-derived | 4.3 | − 1.4 | − 0.4 | 4.6 | 1.5 | 0.5 | 4.6 |
| Combo-derived | 4.1 | − 1.4 | − 0.1 | 4.3 | 1.5 | 0.5 | 4.3 | |
| Scenario 18 | Prog-derived | 3.4 | 0.1 | 0.2 | − 0.5 | 0.5 | 0.5 | 0.6 |
| 10% Pseudo-progression included | TTD-derived | 3.3 | − 10 | − 0.4 | 5.5 | 10 | 0.6 | 5.7 |
| Combo-derived | 3.2 | − 8.1 | − 0.2 | 5.4 | 8.1 | 0.6 | 5.5 | |
| Scenario 19 | Prog-derived | 3.3 | 0 | 0.2 | − 0.7 | 0.4 | 0.5 | 0.8 |
| Link between pre- and post-progression survival | TTD-derived | 3.2 | − 4.5 | − 0.4 | 6 | 4.5 | 0.7 | 6.1 |
| Combo-derived | 3 | − 4.4 | − 0.3 | 5.7 | 4.5 | 0.7 | 5.7 | |
| Scenario A | Prog-derived | 4 | 0.1 | 0.2 | − 0.4 | 0.2 | 0.3 | 0.4 |
| Ipilimumab melanoma | TTD-derived | 4 | − 14.2 | − 0.4 | 4.3 | 14.2 | 0.7 | 4.5 |
| Combo-derived | 3.7 | − 12 | − 0.4 | 4.3 | 12 | 0.6 | 4.4 | |
| Scenario B | Prog-derived | 3.7 | 0 | 0.4 | − 0.6 | 0.2 | 0.5 | 0.6 |
| Nivolumab RCC | TTD-derived | 3.7 | − 4.4 | − 0.4 | 5 | 4.4 | 0.9 | 5.1 |
| Combo-derived | 3.5 | − 4.5 | − 0.5 | 4.6 | 4.5 | 0.7 | 4.7 | |
| Scenario C | Prog-derived | 4.6 | 0 | 0.5 | − 0.5 | 0.2 | 0.5 | 0.5 |
| Pembrolizumab NSCLC | TTD-derived | 4.7 | − 3 | − 0.4 | 4.1 | 3 | 0.8 | 4.1 |
| Combo-derived | 4.4 | − 2.9 | − 0.3 | 3.8 | 2.9 | 0.7 | 3.8 | |
| Scenario D | Prog-derived | 3.5 | 0 | 0.5 | − 0.7 | 0.3 | 0.6 | 0.7 |
| Atezolizumab UCC | TTD-derived | 3.4 | − 5.3 | − 0.4 | 5.3 | 5.3 | 1.1 | 5.4 |
| Combo-derived | 3.2 | − 5.5 | − 0.5 | 4.9 | 5.5 | 0.9 | 5 | |
Fig. 3Recommendations for selection of analysis framework for health-related quality of life (HRQL) data
| A mismatch between the data structure and analysis method results in biased and inaccurate estimates of utility values. |
| Unexpectedly, analysing utilities as a combination of progression- and TTD-based values performed poorly, even if utilities were generated within a corresponding framework. Over-specification of analyses should therefore be avoided. |
| The volume of data available has a marked impact on the accuracy of estimates; this especially means the duration of follow-up and number of long-term survivors. |