| Literature DB >> 35819121 |
Christopher J Weir1, Rod S Taylor2.
Abstract
The desire, by patients and society, for faster access to therapies has driven a long tradition of the use of surrogate endpoints in the evaluation of pharmaceuticals and, more recently, biologics and other innovative medical technologies. The consequent need for statistical validation of potential surrogate outcome measures is a prime example on the theme of statistical support for decision-making in health technology assessment (HTA). Following the pioneering methodology based on hypothesis testing that Prentice presented in 1989, a host of further methods, both frequentist and Bayesian, have been developed to enable the value of a putative surrogate outcome to be determined. This rich methodological seam has generated practical methods for surrogate evaluation, the most recent of which are based on the principles of information theory and bring together ideas from the causal effects and causal association paradigms. Following our synopsis of statistical methods, we then consider how regulatory authorities (on licensing) and payer and HTA agencies (on reimbursement) use clinical trial evidence based on surrogate outcomes. We review existing HTA surrogate outcome evaluative frameworks. We conclude with recommendations for further steps: (1) prioritisation by regulators and payers of the application of formal surrogate outcome evaluative frameworks, (2) application of formal Bayesian decision-analytic methods to support reimbursement decisions, and (3) greater utilization of conditional surrogate-based licensing and reimbursement approvals, with subsequent reassessment of treatments in confirmatory trials based on final patient-relevant outcomes.Entities:
Keywords: licensing; reimbursement; surrogate outcome
Mesh:
Substances:
Year: 2022 PMID: 35819121 PMCID: PMC9546435 DOI: 10.1002/pst.2219
Source DB: PubMed Journal: Pharm Stat ISSN: 1539-1604 Impact factor: 1.234
Statistical approaches to surrogate outcome evaluation.
| References | Method | Type of surrogate | Type of clinical outcome |
|---|---|---|---|
|
| Prentice criteria (Prentice, 1989) | Time to event | Time to event |
|
| Proportion of treatment effect explained (Freedman et al, 1992) | Binary | Binary |
|
| Proportion of treatment effect explained (Lin et al, 1997) | Continuous | Time to event |
|
| Single trial: relative effect and adjusted association (Buyse and Molenberghs, 1998) | Continuous | Continuous |
|
| Multi‐trial: joint model for S and T (Buyse et al, 2000) | Continuous | Continuous |
|
| Multi‐trial: two stage model (Tibaldi et al, 2003) | Continuous | Continuous |
|
| Multi‐trial: Bayesian joint model (Daniels and Hughes, 1997) | All | All |
|
| Multi‐trial: extension to time to event (Burzykowski et al, 2001; Renfro et al, 2012; Ghosh et al, 2012; Tibaldi et al, 2004) | Time to event | Time to event |
|
| Multi‐trial: extension to categorical outcomes (Molenberghs et al, 2001; Renard et al, 2002; Alonso et al, 2002) | Binary/ordinal/continuous | Binary/ordinal/continuous |
|
| Multi‐trial: repeated measures (Alonso et al, 2003; Alonso et al, 2004; Alonso et al, 2006) | Continuous | Continuous |
|
| Multi‐trial: repeated measures (Pryseley et al, 2010) | Continuous | Discrete |
|
| Multi‐trial: repeated measures (Renard et al, 2003) | Continuous | Time to event |
|
| Surrogate threshold effect (Burzykowski and Buyse, 2006) | All | All |
|
| Likelihood reduction factor (individual level surrogacy) (Alonso et al, 2004) | All | All |
|
| Proportion of information gain (Qu and Case, 2007; Miao et al, 2012) | All | All |
|
| Multi‐trial: information theoretic approach (Alonso and Molenberghs, 2007) | Binary | Binary |
|
| Multi‐trial: information theoretic approach extension (Alonso and Molenberghs, 2008) | Time to event | Time to event |
|
| Multi‐trial: information theoretic approach extension (Pryseley et al, 2007) | Binary | Continuous |
|
| Multi‐trial: information theoretic approach extension (Ensor and Weir, 2020; Ensor and Weir, 2021) | Binary/ordinal | Binary/ordinal |
|
| Principal stratification, principal surrogate (Frangakis and Rubin, 2002; Li et al, 2011) | Binary | Binary |
|
| Principal stratification extension (Wolfson and Gilbert, 2010) | Continuous | Binary |
|
| Principal stratification extension (Conlon et al, 2014) | Continuous | Continuous |
|
| Principal stratification extension – Bayesian(Li et al, 2010; Li et al, 2011) | Binary | Binary |
|
| Principal stratification extension – Bayesian (Zigler and Belin, 2012) | Continuous | Binary |
|
| Causal effect predictiveness (CEP) (Gilbert and Hudgens, 2008) | Discrete or continuous | Binary |
|
| CEP for time to event (Qin et al, 2008) | Continuous | Time to event |
|
| Average causal effect (Chen et al, 2007) | Discrete or continuous | Discrete or continuous |
|
| Distributional causal effect (Ju and Geng, 2010) | All | All |
|
| Direct and indirect effects via path analysis (Qu and Case, 2006) | All | All |
|
| Proportion of treatment effect explained – extension to joint modelling (Deslandes and Chevret, 2007) |
Repeated measures Time to event (multistate) |
Time to event Binary |
|
| Structural equation modelling (Ghosh et al, 2010; Emsley et al, 2010) | All | All |
|
| Missing data perspective (Chen et al, 2003) | Continuous | Continuous |
|
| Missing data perspective (Chen et al, 2008) | All | All |
|
| Missing data perspective (Benda and Gerlinger, 2007) | Continuous | Binary |
|
| 95% prediction model for true outcome (Baker et al, 2012) | Binary/time to event | Binary/time to event |
|
| Joint modelling permitting multiple surrogates (Xu and Zeger, 2001; Lin et al, 2002) | Continuous | Time to event |
|
| Joint modelling incorporating repeated measures (Taylor and Wang, 2002) | Continuous | Time to event |
International payers/HTA agencies & handling of surrogate endpoints: recommendations for specific statistical methods/considerations.
| Country | Payer/agency | Statistical methods | Citation/web link(s) |
|---|---|---|---|
| United Kingdom | National Institute of Health & Care Excellence (NICE), 2013/9 |
“When the use of “final” clinical endpoints is not possible and “surrogate” data on other outcomes are used to infer the effect of treatment on mortality and health‐related quality of life, evidence in support of the surrogate‐to‐final endpoint outcome relationship must be provided together with an explanation of how the relationship is quantified for use in modelling. The usefulness of the surrogate endpoint for estimating QALYs will be greatest when there is strong evidence that it predicts health‐related quality of life and/or survival. In all cases, the uncertainty associated with the relationship between the endpoint and health‐related quality of life or survival should be explored and quantified. Multivariate meta‐analysis of summary data for combining treatment effects on correlated outcomes and evaluating surrogate endpoints: “When data on the final clinical outcome are not available or limited at the licensing stage, and therefore also for the HTA decision‐making process, a modelling framework is required to establish the strength of the surrogate relationship between the treatment effects on the surrogate and the final outcome and to predict the likely treatment effect on the final outcome for the new health technology. Multivariate meta‐analytic methods provide such a framework as they, by definition, take into account the correlation between the treatment effects on the surrogate and final outcomes as well as the uncertainty related to all parameters describing the surrogate relationship.” “Relying solely on patient‐level association is not sufficient when evaluating surrogate endpoints, in particular when individual‐level association has been evaluated based on data from a single trial (Fleming and DeMets 1996). A meta‐analytic approach based on data from more than one trial to establish the association between the treatment effects on the candidate surrogate endpoint and on the final outcome is more appropriate for evaluation of surrogate endpoints.” |
|
| Germany | IQWiG, 2011 |
“There is no “best” method defined, however, correlation‐based validation is the ‘preferred’ method, in the sense it has been most widely used in evaluations. Another option discussed is the surrogate threshold effect (STE)” “A correlation of R ≥ 0.85; R2 ≥ 0.72 measured at the lower bound of the 95% percentage interval allows to conclude that the validation study represents a high reliable result. This interval R < 0.85; R2 < 0.72 to R > 0.7; R2 > 0.49 represents a medium reliable result between surrogate and patient relevant endpoint. If a validation study shows high reliable results with statistically low correlation (R ≤ 0.7; R2 ≤ 0.49) measured at the lower bound of the confidence interval then the surrogate is not considered as a valid endpoint.” |
|
| Canada | CADTH, 2017 | “Researchers should evaluate and justify the validity of any surrogate endpoints used for parameter estimation. Uncertainty in the association of the surrogate to the final clinical outcome should be reflected in the reference case probabilistic analysis. This uncertainty can also be explored through appropriate scenario analyses. The existence of multiple potential surrogates should be reflected in the analysis of uncertainty.” |
|
| Australia | PBAC 2008/2016 |
Transformation of a surrogate to a final outcome – suggested uncertainty analysis – use range of alternative plausible values and present as scenario analysis for economic evaluation Surrogate to Final Outcomes Working Group (STFOWG) report: Information requirements “For a meta‐regression of multiple randomised trials: the results for (1) the intercept and coefficient (and their 95% CI), (2) the coefficient of determination (R2 trial), (3) R2 individual (only if individual patient data are available for the trials), and (4) the surrogate threshold effect (STE) as determined by prediction bands.” “An assessment of the implications of the identified uncertainties relating to the PSM to TCO transformation for the structure, the input variables, the results and the sensitivity analyses of the modelled economic evaluation and for the presentation of the stepped economic evaluation.” |
|
Abbreviations: PSM, proposed surrogate measure; TCO, target clinical outcome.
FIGURE 1Comparison of role of surrogate endpoints in licensing and reimbursement.
Hierarchy of evidence for surrogate endpoints.
| Definition | Source of evidence | Statistical metrics | |
|---|---|---|---|
| Level 3 | Biological plausibility | Clinical data and understanding of disease (surrogate endpoints on final patient relevant outcome disease pathway) | Not applicable |
| Level 2 | Observational association | Epidemiological studies of relationship between surrogate endpoint & final patient relevant outcome | Correlation between surrogate endpoint and final patient relevant outcome |
| Level 1 | Interventional/treatment effect association | Randomised controlled trial(s) |
Trial level R2/Spearman's correlation Surrogate threshold effect (STE) |
Note: Adapted from Ciani et al, 2017 ; Taylor & Elston, 2009 ; Bucher et al, 1999.
Individual participant or trial level meta‐analysis of multiple randomised controlled trial or single large randomised controlled trial with surrogate‐final outcome association assessed by trial site.
Evidence should be randomised controlled trials from same disease indication, line of therapy, class of treatment/intervention and comparator therapy that surrogate endpoint is applied. If extrapolating from different disease indication, line of therapy, class of treatment/intervention and comparator therapy then evidence may not qualify as level 1.