| Literature DB >> 36186544 |
W Katherine Tan1, Brian D Segal1, Melissa D Curtis1, Shrujal S Baxi1, William B Capra2, Elizabeth Garrett-Mayer3, Brian P Hobbs4, David S Hong5, Rebecca A Hubbard6, Jiawen Zhu2, Somnath Sarkar1, Meghna Samant1.
Abstract
Background: Hybrid controlled trials with real-world data (RWD), where the control arm is composed of both trial and real-world patients, could facilitate research when the feasibility of randomized controlled trials (RCTs) is challenging and single-arm trials would provide insufficient information.Entities:
Keywords: External comparator cohorts; Hybrid control arms; Real-world data
Year: 2022 PMID: 36186544 PMCID: PMC9519429 DOI: 10.1016/j.conctc.2022.101000
Source DB: PubMed Journal: Contemp Clin Trials Commun ISSN: 2451-8654
Fig. 1Example schema for a hybrid controlled trial using external RWD.
Example disease settings and trials for which a hybrid controlled trial may be appropriate to consider. In addition to the considerations outlined in this table, it is critical to weigh the considerations in Section 3.2 to determine whether a hybrid controlled trial is appropriate and whether the external data are fit for purpose.
| Disease setting and representative trials | Low prevalence disease | Long time to events | SOC with low clinical benefit and/or toxic | Comments |
|---|---|---|---|---|
| Metastatic triple negative breast cancer (mTNBC) IMpassion130 (phase III for atezolizumab) | ✔ | Median OS < 18 months Lack of targeted therapies SOC can be difficult to tolerate (e.g. anthracycline- and taxane-based chemotherapy) | ||
| Chronic myeloid leukemia (CML) Phase II for imatinib mesylate (Kantarjian et al., 2002) | ✔ | Five-year survival for patients diagnosed in 1996–2002, 44.7% (Ries et al., 2006) SOC at the time (interferon alfa) had limited efficacy and serious side effects | ||
| Progressive Medullary Thyroid Cancer EXAM (phase III for cabozantinib) | ✔ | 10 year survival percentage of 95.6% for local cancers and 40% for metastatic cancers (Roman et al., 2006) SOC is ineffective, so placebo was used for control therapy in EXAM. This raises issues as to whether randomization was ethical. | ||
| Notch activating Adenoid Cystic Carcinoma (ACC) ACURRACY ( A future phase III trial | ✔ | ✔ | Median OS of ∼14 months in general population for ACC (Sharma et al., 2008) (not subset to patients with an activating notch mutation) Lack of targeted therapy No established SOC, and common treatments are ineffective and have serious side effects (chemotherapy, surgery, radiation) | |
| Adjuvant therapy for early breast cancer NATALEE ( APHINITY (phase III for Perjeta + Herceptin in HER2+) (Von Minckwitz et al., 2017) | ✔ | NATALEE is expected to take 7 years to complete. APHINITY enrolled 4800 patients to observe 381 invasive disease-free survival events. | ||
| Pan-tumor NTRK gene fusions NAVIGATE ( STARTRK-2 ( A future phase III basket study | ✔ | May depend on tumor type | Cohort selection in EHR-derived data may be challenging for basket trials, but might be possible after first gaining experience with each individual tumor type. | |
| First line Diffuse Large B-Cell Lymphoma (DLBCL) GOYA (phase III for Obinutuzumab + CHOP vs Rituximab-CHOP) (Vitolo et al., 2017) | ✔ | 5 year survival percentage of 62% (Crump et al., 2017) Rituximab-CHOP has been an established SOC for many years Approximately one third of patients relapse or are refractory to 1 L treatment (Friedberg 2011) | ||
| Relapsed/Refractory DLBCL ARGO ( Potential future studies comparing CAR-NK to CAR-T therapies. This may also be relevant in other disease areas (Liu et al., 2020). | ✔ | Median OS of 6.3 months for patients whose disease is refractory (best response of progression or stable disease during chemotherapy) or relapses (within 12 months of autologous stem cell transplantation) (Crump et al., 2017) | ||
Common classes of borrowing methods.
| Statistical method | Description | Tuning parameter | Pros/cons |
|---|---|---|---|
| Power prior with fixed power parameter (Chen et al., 2000; Ibrahim et al., 2000) | The contribution of each external patient to the likelihood is weighted by a common “power parameter” between 0 and 1. Typically implemented as a Bayesian model. | Power parameter: Setting it to 1 is equivalent to pooling, and setting it to 0 is equivalent to ignoring external data | |
| Test-then-pool (Viele et al., 2014) | A hypothesis test is done to compare the outcomes of external and trial controls after steps 1–3. For point null hypotheses, the data are pooled For non-equivalency null hypotheses, the external data are pooled if the null is rejected, and is ignored otherwise | For point null hypotheses: The significance level of the test (smaller alpha makes it more difficult to reject the null, and thus more likely to pool) The significance level of the test (smaller alpha results in wider confidence intervals, making it harder to reject the null and thus less likely to pool) The equivalency bounds (larger bounds are more likely to contain the confidence interval, thus making it more likely to reject the null and pool) | Simple Does not require outcome data for experimental group to determine downweighting factor |
| Adaptive/modified power prior model (Duan et al., 2006; Neuenschwander et al., 2009) | Similar to the (static) power prior, but the power parameter is given a prior distribution and allowed to be selected based on the data. The power parameter is estimated simultaneously with all other parameters in the model, including the treatment effect. | Hyperpriors on the power parameter | Can be difficult to implement in standard software and can be computationally intensive Requires outcome data on experimental group to estimate the downweighting factor |
| Frequentist version of modified power prior (See two-step approach in Web | Step 1: A regression model is fit to the external and trial controls to estimate the HR between these two arms. The estimated HR is mapped to a downweighting factor, such that HRs near 1 give a downweighting factor close to 1 and HRs far from 1 give a downweighting factor close to 0. | The rate at which the common weights decay to 0 as the HR moves away from 1. For example, the downweighting factor could be defined by the function | Simple and interpretable downweighting factor that is chosen dynamically Does not require outcome data from experimental group to determine downweighting factor, as the downweighting factor is determined in step 1 and outcome data for the experimental group is not required until step 2 |
| Commensurate prior model (Hobbs et al., 2011, 2012) | The outcomes in the randomized controls are centered around the outcomes in the external controls. For example, the log hazard rate of the trial controls might be given a normal prior, centered around the log hazard in the external controls and with hyperprior on the precision of the normal prior. | The hyperpriors on the precision of the normal distribution that shrinks the hazard rate in the randomized controls toward the hazard rate in the external controls. The more this precision is pushed toward zero, the less the hazard in the trial controls is shrunk toward the hazard in the external controls and the more the external controls are effectively downweighted. | Downweighting is implicit, so can be more difficult to interpret the amount of borrowed information. Requires outcome data on experimental group to estimate the downweighting factor |
In this context, pooling refers to combining RWD and trial control data into a single dataset that is then analyzed as though the data were collected together.
Fig. 2aSimulation results. X-axis values smaller than 1 indicate that external controls have longer median time-to-event than randomized controls after steps 1–3, and x-axis values larger than 1 indicate that external controls have shorter median time-to-event than randomized controls after steps 1–3. In practice, the full range of residual bias shown on the x-axis may not be relevant (see Section 4.4).
Simulation setup based on IMpassion13033.
| Parameter | Values |
|---|---|
| Experimental treatment effect: Hazard ratio between experimental and control arms of trial (HRExp) | 0.70 (More effective than expected) |
| Residual bias: Hazard ratio between real-world controls and randomized controls after careful alignment on I/E criteria, covariate balancing, and alignment of endpoints, index dates, and follow-up time (HRRWD) (composite bias) | Range from 0.5 to 2 by 0.1 (i.e. 0.5, 0.6, …, 1.9, 2.0): |
| Expected downweighting factor for external controls | 0.6 |
| Total number of patients in RCT (control + experimental) | 675 (out of 900 planned in IMpassion130) |
| Number of external patients potentially available to borrow | 375 (resulting in an expected 375 * 0.6 = 225 effectively borrowed external patients) |
| Randomization ratio in trial | 2:1 (experimental:control) |
| Target number of events (control + experimental + downweighted external control) | 655 |
| Percent lost to follow-up in both the trial and external data source | 5% |
| Accrual rate in trial | 34 patients per month |
| Significance level for hypothesis test of experimental treatment effect | 0.025 one-sided |
At the time of study design, the downweighting factor is known with certainty if using a power prior model with fixed power parameter, and is predicted if using a dynamic borrowing method.
Fig. 2bSame results for type I error, excluding power prior model and with a different y-axis scale. In practice, the full range of residual bias shown on the x-axis may not be relevant (see Section 4.4).
Fig. 3Two-step procedure with different risk/benefit profiles. In practice, the full range of residual bias shown on the x-axis may not be relevant.