| Literature DB >> 24229079 |
Daniel J Bratton1, Patrick P J Phillips, Mahesh K B Parmar.
Abstract
BACKGROUND: Randomised controlled trials are becoming increasingly costly and time-consuming. In 2011, Royston and colleagues proposed a particular class of multi-arm multi-stage (MAMS) designs intended to speed up the evaluation of new treatments in phase II and III clinical trials. Their design, which controls the type I error rate and power for each pairwise comparison, discontinues randomisation to poorly performing arms at interim analyses if they fail to show a pre-specified level of benefit over the control arm. Arms in which randomisation is continued to the final stage of the trial are compared against the control on a definitive time-to-event outcome measure. To increase efficiency, interim comparisons can be made on an intermediate time-to-event outcome which is on the causal pathway to the definitive outcome.Entities:
Mesh:
Year: 2013 PMID: 24229079 PMCID: PMC3840569 DOI: 10.1186/1471-2288-13-139
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Design parameters for phase II and III TB trials
| Primary outcome | Negative | Non- | |
| | culture status | failure/relapse | |
| Follow-up length | 8 weeks** | 18 months** | |
| Significance level (1-sided) | 2.5% | 2.5% | 0.1% |
| Power | 80% | 85% | 68% |
| Control arm event rate | 75% | 90% | |
| Treatment effect under | 0% | -6% (NI margin) | |
| Target treatment effect ( | 13% | 0% | |
| Allocation ratio ( | 1:1 | 1:1 | |
| Attrition rate | 15% | 20% | |
| Required sample size*** | 320 | 1122 | 1442 |
Design parameters for a phase II (based on Dorman et al. [10]) and a phase III (based on REMox [20]) TB trial. *Calculated assuming independence between trials. **An additional 6 week delay is typically required to determine culture status. ***Sample sizes estimated using equations (1) and (2). NI = non-inferiority.
Examples of two-arm two-stage phase II TB trials
| (i) | 1 | 0.5 | 0.90 | 56 | 96 | 0.48 | 262 | 56 | 134 | 0.67 | 281 | 0.39 | 0.500 | 0.900 | 360 |
| | 2 | 0.025 | 0.90 | 364 | 428 | 2.30 | | 364 | 428 | 2.49 | | | 0.021 | 0.826 | |
| (ii) | 1 | 0.5 | 0.95 | 94 | 140 | 0.70 | 284 | 94 | 178 | 0.89 | 303 | 0.51 | 0.500 | 0.950 | 398 |
| | 2 | 0.025 | 0.90 | 364 | 428 | 2.30 | | 364 | 428 | 2.49 | | | 0.023 | 0.870 | |
| (iii) | 1 | 0.2 | 0.90 | 156 | 214 | 1.07 | 257 | 156 | 252 | 1.26 | 287 | 0.65 | 0.200 | 0.900 | 381 |
| | 2 | 0.025 | 0.90 | 364 | 428 | 2.30 | | 364 | 428 | 2.49 | | | 0.020 | 0.843 | |
| (iv) | 1 | 0.2 | 0.95 | 214 | 282 | 1.41 | 311 | 214 | 320 | 1.60 | 342 | 0.77 | 0.200 | 0.950 | 414 |
| 2 | 0.025 | 0.90 | 364 | 428 | 2.30 | 364 | 428 | 2.49 | 0.023 | 0.883 | |||||
Characteristics of two-arm two-stage TB trials where I = D = culture status observed 4 or 14 weeks after randomisation. The fixed sample sizes correspond to fixed sample designs with pairwise alpha A2 and power Ω2. Key: for stage i, αi = significance level, ωi = nominal power, ni = total sample size required for analysis i, Ni = cumulative number of patients recruited by the end of stage i, ti = predicted timing (in years) of the end of stage i assuming a recruitment rate of 200 patients/year, ESS |H0 = expected sample size under the null hypothesis, ρ = correlation between stages, Ai = probability of passing stage i under H0, Ωi = probability of passing stage i under H1. *Plus an additional 6 week delay to determine culture status.
Examples of two-arm two-stage phase II/III TB trials
| (v) | 1 | 0.5 | 0.90 | 56 | 134 | 0.67 | 723 | 0.10 | 0.08 | 0.500 | 0.900 |
| | 2 | 0.025 | 0.90 | 1050 | 1312 | 3.84 | | | | 0.015 | 0.813 |
| (vi) | 1 | 0.5 | 0.95 | 94 | 178 | 0.89 | 745 | 0.12 | 0.11 | 0.500 | 0.950 |
| | 2 | 0.025 | 0.90 | 1050 | 1312 | 4.00 | | | | 0.015 | 0.857 |
| (vii) | 1 | 0.2 | 0.90 | 156 | 252 | 1.26 | 464 | 0.16 | 0.14 | 0.200 | 0.900 |
| | 2 | 0.025 | 0.90 | 1050 | 1312 | 4.28 | | | | 0.008 | 0.815 |
| (viii) | 1 | 0.2 | 0.95 | 214 | 320 | 1.60 | 518 | 0.19 | 0.16 | 0.200 | 0.950 |
| 2 | 0.025 | 0.90 | 1050 | 1312 | 4.54 | 0.009 | 0.858 |
Characteristics of two-arm two-stage TB trials where I = culture status observed 14 weeks after randomisation and D = relapse status at 18 months. Key: for stage i, αi = significance level, ωi = nominal power, ni = total sample size required for analysis i, Ni = cumulative number of patients recruited by the end of stage i, ti = predicted timing (in years) of the end of stage i, ESS |H0 = expected sample size under the null hypothesis, ρ|Hh = correlation between stages under hypothesis Hh, Ai = probability of passing stage i under H0, Ωi = probability of passing stage i under H1.
Correlations, type I error rates and powers obtained from simulation compared to calculated values
| (i) | 0.50 | 0.90 | 0.39 | 0.39 | 0.021 | 0.826 | 0.38 | 0.38 | 0.021 | 0.828 |
| (ii) | 0.50 | 0.95 | 0.50 | 0.50 | 0.023 | 0.870 | 0.50 | 0.50 | 0.024 | 0.872 |
| (iii) | 0.20 | 0.90 | 0.65 | 0.65 | 0.020 | 0.843 | 0.64 | 0.65 | 0.019 | 0.847 |
| (iv) | 0.20 | 0.95 | 0.76 | 0.76 | 0.023 | 0.883 | 0.76 | 0.76 | 0.023 | 0.885 |
| (v) | 0.50 | 0.90 | 0.10 | 0.08 | 0.015 | 0.813 | 0.07 | 0.06 | 0.014 | 0.809 |
| (vi) | 0.50 | 0.95 | 0.12 | 0.11 | 0.015 | 0.857 | 0.10 | 0.09 | 0.015 | 0.854 |
| (vii) | 0.20 | 0.90 | 0.16 | 0.14 | 0.008 | 0.815 | 0.12 | 0.11 | 0.008 | 0.811 |
| (viii) | 0.20 | 0.95 | 0.19 | 0.16 | 0.009 | 0.858 | 0.15 | 0.12 | 0.008 | 0.858 |
Overall type I error rates, powers and correlations between stages obtained from simulations of designs (i)-(viii) in Tables 2 and 3. Key: α1 = stage 1 significance level, ω1 = stage 1 power, ρ|Hh = correlation between stages under hypothesis Hh, α = overall type I error rate, ω = overall power. Hats indicate values estimated from simulations.
Absolute bias in arms dropped at the first interim analysis
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| | | | |||||||
| 1 power | |||||||||
| 0.5 | A | 65 | -5% | -6.9% | -4.7% | -3.2% | -10% | -1.4% | -1.0% |
| | B | 49 | 0% | -9.5% | -6.5% | -4.6% | -6% | -1.7% | -1.3% |
| | C | 23 | 8% | -14.6% | -9.8% | -7.0% | -3% | -2.7% | -1.9% |
| | D | 10 | 13% | -18.2% | -12.3% | -8.8% | 0% | -3.0% | -2.1% |
| 0.2 | A | 94 | -5% | -0.9% | -0.8% | -0.7% | -10% | -0.2% | -0.2% |
| | B | 80 | 0% | -2.6% | -2.1% | -1.9% | -6% | -0.5% | -0.5% |
| | C | 35 | 8% | -6.9% | -5.9% | -5.0% | -3% | -1.6% | -1.4% |
| | D | 10 | 13% | -10.7% | -9.2% | -7.7% | 0% | -2.2% | -2.0% |
| Stage 1 power | |||||||||
| 0.5 | A | 70 | -5% | -4.6% | -3.6% | -2.8% | -10% | -1.0% | -0.9% |
| | B | 49 | 0% | -7.3% | -5.6% | -4.5% | -6% | -1.5% | -1.2% |
| | C | 17 | 8% | -12.6% | -9.8% | -7.6% | -3% | -2.7% | -2.1% |
| | D | 5 | 13% | -16.3% | -12.9% | -9.9% | 0% | -3.4% | -2.7% |
| 0.2 | A | 95 | -5% | -0.6% | -0.6% | -0.5% | -10% | -0.2% | -0.2% |
| | B | 80 | 0% | -2.1% | -1.8% | -1.6% | -6% | -0.5% | -0.4% |
| | C | 27 | 8% | -6.7% | -6.0% | -5.3% | -3% | -1.7% | -1.5% |
| D | 5 | 13% | -10.8% | -9.6% | -8.4% | 0% | -2.3% | -2.1% | |
Simulation results showing the proportion of trials stopped at the first interim analysis and the absolute bias for such arms in the estimated treatment effect on D at the interim analysis and after all remaining patients have been followed up. Key: α1 = significance level in stage 1, θD = underlying treatment effect on the definitive outcome. *Plus an additional 6 week delay to determine culture status.
Absolute bias in arms reaching the final analysis
| 0.5 | A | -5% | 35 | -3.1% | 1.9% | 29 | -2.2% | 2.8% |
| | B | 0% | 51 | 1.4% | 1.4% | 50 | 1.8% | 1.8% |
| | C | 8% | 78 | 8.6% | 0.6% | 83 | 8.7% | 0.7% |
| | D | 13% | 90 | 13.3% | 0.3% | 95 | 13.2% | 0.2% |
| 0.2 | A | -5% | 6 | 0.9% | 5.9% | 5 | 2.4% | 7.4% |
| | B | 0% | 20 | 4.2% | 4.2% | 20 | 4.8% | 4.8% |
| | C | 8% | 65 | 9.5% | 1.5% | 73 | 9.5% | 1.5% |
| | D | 13% | 90 | 13.5% | 0.5% | 95 | 13.4% | 0.4% |
| 0.5 | A | -10% | 35 | -9.8% | 0.2% | 30 | -9.7% | 0.3% |
| | B | -6% | 51 | -5.9% | 0.1% | 51 | -5.8% | 0.2% |
| | C | -3% | 77 | -3.0% | 0.0% | 83 | -2.9% | 0.1% |
| | D | 0% | 90 | 0.0% | 0.0% | 95 | 0.0% | 0.0% |
| 0.2 | A | -10% | 6 | -9.4% | 0.6% | 5 | -9.3% | 0.7% |
| | B | -6% | 20 | -5.6% | 0.4% | 20 | -5.5% | 0.5% |
| | C | -3% | 65 | -2.9% | 0.1% | 73 | -2.9% | 0.1% |
| D | 0% | 90 | 0.0% | 0.0% | 95 | 0.0% | 0.0% | |
Simulation results showing the proportion of trials which continue to the final stage of the trial (% pass) and the absolute bias in the estimated treatment effect on D at the final analysis. Key: θ = underlying treatment effect on the definitive outcome, α1 = significance level in stage 1, ω1 = nominal power in stage 1, = average treatment effect on the definitive outcome in the final stage, = bias in the average treatment effect estimate on the definitive outcome in the final stage.