| Literature DB >> 31218346 |
Michael J Grayling, Munyaradzi Dimairo, Adrian P Mander, Thomas F Jaki.
Abstract
Historically, phase II oncology trials assessed a treatment's efficacy by examining its tumor response rate in a single-arm trial. Then, approximately 25 years ago, certain statistical and pharmacological considerations ignited a debate around whether randomized designs should be used instead. Here, based on an extensive literature review, we review the arguments on either side of this debate. In particular, we describe the numerous factors that relate to the reliance of single-arm trials on historical control data and detail the trial scenarios in which there was general agreement on preferential utilization of single-arm or randomized design frameworks, such as the use of single-arm designs when investigating treatments for rare cancers. We then summarize the latest figures on phase II oncology trial design, contrasting current design choices against historical recommendations on best practice. Ultimately, we find several ways in which the design of recently completed phase II trials does not appear to align with said recommendations. For example, despite advice to the contrary, only 66.2% of the assessed trials that employed progression-free survival as a primary or coprimary outcome used a randomized comparative design. In addition, we identify that just 28.2% of the considered randomized comparative trials came to a positive conclusion as opposed to 72.7% of the single-arm trials. We conclude by describing a selection of important issues influencing contemporary design, framing this discourse in light of current trends in phase II, such as the increased use of biomarkers and recent interest in novel adaptive designs.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31218346 PMCID: PMC6910171 DOI: 10.1093/jnci/djz126
Source DB: PubMed Journal: J Natl Cancer Inst ISSN: 0027-8874 Impact factor: 13.506
A summary of the key arguments or counterpoints that have historically been put forward for and against the use of randomization in phase II oncology trials
| Consideration | For randomization | Against randomization |
|---|---|---|
| Molecularly targeted agents may often be cytostatic and not in general lead to tumor shrinkage. | PFS* should be the preferred primary outcome for cytostatic agents, which in turn makes randomization preferable. | The use of tumor response is still appropriate because several cytostatic agents have led to statistically significant tumor regression. |
| Success in phase II should reliably predict that clinical benefit will be observed in phase III. | The classical single-arm paradigm has performed poorly in predicting clinical benefit in phase III; randomized designs would perform better. | No evidence is available to suggest the use of randomization in phase II has improved success in phase III. |
| Randomized phase II designs appear more similar to phase III designs than conventional phase II designs. | A highly statistically significant | Investigators may incorrectly interpret the results of randomized phase II trials as though they are from a large phase III study. |
| Single-arm trials use historical control data to specify their design. | Reliance on historical control data for setting target response makes results of single-arm trials unreliable. | For several diseases, well-established resources allow for determination of historical response to treatment. |
| Prognostic factors can differ substantially between patients, and these are often strong predictors of response to treatment. | Randomization should balance prognostic factors between arms, allowing for more reliable assessment of efficacy. Attempts to account for such covariates in single-arm data using modeling are unreliable. | Randomization cannot guarantee prognostic factors will be balanced, especially in smaller trials, and modeling can be used to account for such variables in single-arm studies. |
| There is a trade-off between trial complexity and quality. | In conducting a randomized trial, you “get what you pay for,” with better quality data accrued. | Single-arm trials are simpler and easier to conduct. |
| Clinical trials should be conducted with as few participants as required to control type-I/II error rates to specified levels. | One-sided testing with increased type-I/II error rates allows randomized trials to be conducted with achievable sample sizes. Only in a randomized trial are these error rates ever known. | Single-arm trials require a much smaller sample size, and modifying error rates to reduce that required by a randomized trial would only increase failure in phase III. |
| Single-arm designs should be preferred from an ethical standpoint. | In general, there is equipoise, making randomized trials entirely appropriate. | Randomized trials are not ethical when large responses have been observed previously because not all participants have access to a potentially better treatment. |
PFS = progression-free survival.
A summary of the characteristics of phase II oncology trials by type of design used, based on a reanalysis of the data from Langrand-Escure et al. (80)*
| Characteristic | Single-arm, % (IQR) | Multi-arm nonrandomized, % (IQR) | Randomized noncomparative, % (IQR) | Randomized comparative, % (IQR) |
|
|---|---|---|---|---|---|
| Type of therapy | |||||
| Cytotoxic | 84 (26.7) | 13 (26.5) | 24 (31.6) | 27 (23.1) | .02 |
| Combination therapy | 102 (32.4) | 11 (22.4) | 27 (35.5) | 53 (45.3) | |
| Targeted therapy | 62 (19.7) | 17 (34.7) | 8 (10.5) | 16 (13.7) | |
| Other | 67 (21.3) | 8 (16.3) | 17 (22.4) | 21 (17.9) | |
| (Co-)Primary endpoint‡ | |||||
| Tumor response | 129 (41.0) | 18 (36.7) | 26 (34.2) | 18 (15.4) | — |
| Dichotomized PFS | 23 (7.3) | 6 (12.2) | 9 (11.8) | 5 (4.3) | |
| PFS | 18 (5.7) | 1 (2.0) | 5 (6.6) | 47 (40.2) | |
| Other | 159 (50.5) | 26 (53.1) | 38 (50.0) | 54 (46.2) | |
| Positive result | 152 (72.7) | 25 (83.3) | 39 (70.9) | 29 (28.2) | <.001 |
| Median type-I error rate (IQR) | 5 (5–10) | 5 (5–10) | 7.1 (5–10) | 5 (5–10) | .04 |
| Median type-II error rate (IQR) | 14 (10–20) | 10 (9.8–12.5) | 10 (10–20) | 20 (15–20) | <.001 |
| Median patients analyzed (IQR) | 43 (34–58) | 72.5 (49.8–111) | 92 (72.8–135) | 131 (89–182) | <.001 |
Note that the dataset from Langrand-Escure et al. (80) comprises 557 trial reports. IQR = interquartile range; PFS = progression-free survival.
In addition, P values for either a chi-squared test of row-column independence or a Kruskal-Wallis test of stochastic dominance, as appropriate, are reported for each of the characteristics. All statistical tests were two-sided.
The percentages for the (co-)primary characteristic section do not add up to 100% because the categories are not mutually exclusive given the allowance for co-primary endpoints, and consequently a P value is omitted for this section.
A summary of the choices of primary/coprimary endpoints in phase II by the type of treatment under investigation, based on a reanalysis of the data from Langrand-Escure et al. (80)*
| (Co-)Primary endpoint | Cytotoxic No. (%) | Combination therapy No. (%) | Targeted therapy No. (%) |
|---|---|---|---|
| Tumor response | 69 (46.6) | 53 (27.5) | 35 (34.0) |
| Dichotomized PFS | 5 (3.4) | 18 (9.3) | 16 (15.5) |
| PFS | 10 (6.8) | 38 (19.7) | 16 (15.5) |
| Other | 67 (45.3) | 92 (47.7) | 44 (42.7) |
The percentages do not add up to 100% because the categories are not mutually exclusive given the allowance for co-primary endpoints. PFS = progression-free survival.