| Literature DB >> 31937341 |
Elizabeth G Ryan1, Sarah E Lamb2,3, Esther Williamson2, Simon Gates4.
Abstract
BACKGROUND: Bayesian adaptive designs can be more efficient than traditional methods for multi-arm randomised controlled trials. The aim of this work was to demonstrate how Bayesian adaptive designs can be constructed for multi-arm phase III clinical trials and assess potential benefits that these designs offer.Entities:
Keywords: Arm dropping; Bayesian adaptive design; Emergency medicine; Interim analysis; Monitoring; Multi-arm trial; Orthopaedic; Phase III; Randomised controlled trials; Response adaptive randomisation
Mesh:
Year: 2020 PMID: 31937341 PMCID: PMC6961269 DOI: 10.1186/s13063-019-4021-0
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.728
Bayesian adaptive designs explored for the Collaborative Ankle Support Trial
| Design | Interim analysis schedulea | Arm allocation | Control allocation | Early stopping |
|---|---|---|---|---|
| 1 | None | 1:1:1:1 | Equal to other arms | None |
| 2 | Every 200 patients | 1:1:1:1 | Equal to other arms | Efficacy or futility every 200 patients |
| 3 | Every 50 patients | Arm dropping assessed at each interim analysis | Equal to other arms | Efficacy or futility every 200 patients |
| 4 | Every 50 patients | RAR at each analysis | Matched to best intervention arm | Efficacy or futility every 200 patients |
| 5 | Every 50 patients | RAR at each analysis | Fixed at 40% | Efficacy or futility every 200 patients |
| 6 | Every 50 patients | RAR at each analysis | No designated control; tubular bandage is treated as an intervention arm | Efficacy or futility every 200 patients |
RAR Response adaptive randomisation
aAt number of patients due for primary outcome follow up (at 12 weeks post-randomisation)
Scenarios explored for Bayesian designs
| Scenario | Mean control/tubular bandage FAOS QoL (SD) | Mean boot FAOS QoL (SD) | Mean brace FAOS QoL (SD) | Mean below-knee cast FAOS QoL (SD) |
|---|---|---|---|---|
| Null | 50 (20) | 50 (20) | 50 (20) | 50 (20) |
| One works, 10 more | 50 (20) | 50 (20) | 50 (20) | 60 (20) |
| One works, 5 more | 50 (20) | 50 (20) | 50 (20) | 55 (20) |
| Better, Best | 50 (20) | 55 (20) | 60 (20) | 65 (20) |
| One worse, others work | 50 (20) | 45 (20) | 55 (20) | 60 (20) |
| All work, two similar | 50 (20) | 55 (20) | 60 (20) | 60 (20) |
FAOS Foot and Ankle Outcome Score, QoL quality of life, SD standard deviation
Fig. 1Flow diagram showing the process for the virtual re-execution of designs 3–6. Response adaptive randomisation or arm dropping was performed every 50 patients until the final analysis (at N = 584). Early stopping for efficacy or futility was assessed every 200 patients. The process depicted in this figure was repeated 1000 times. CAST Collaborative Ankle Support Trial
Operating characteristics for Bayesian designs for the Collaborative Ankle Support Trial
| Scenario | Proportion stopping early for efficacy | Proportion stopping early for futility | MSE | Mean proportion allocated to control | Mean proportion allocated to boot | Mean proportion allocated to brace | Mean proportion allocated to below-knee cast |
|---|---|---|---|---|---|---|---|
| Null (50, 50, 50, 50) | |||||||
| Design 1 | NA | NA | NA | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.0063 | 0.013 | NA | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.0025 | 0.0248 | NA | 0.36 | 0.21 | 0.21 | 0.21 |
| Design 4 | 0.0022 | 0.0125 | NA | 0.33 | 0.22 | 0.22 | 0.22 |
| Design 5 | 0.0015 | 0.0134 | NA | 0.37 | 0.21 | 0.21 | 0.21 |
| Design 6 | 0.0117 | 0 | NA | 0.25 | 0.25 | 0.25 | 0.25 |
| One arm works, 10 more (50, 50, 50, 60) | |||||||
| Design 1 | NA | NA | 2.77 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.732 | 0 | 5.03 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.6919 | 0.0022 | 3.68 | 0.40 | 0.11 | 0.11 | 0.39 |
| Design 4 | 0.796 | 0 | 3.56 | 0.39 | 0.11 | 0.11 | 0.39 |
| Design 5 | 0.7909 | 0 | 3.29 | 0.36 | 0.10 | 0.10 | 0.44 |
| Design 6 | 0.9972 | 0 | 2.34 | 0.13 | 0.13 | 0.13 | 0.61 |
| One arm works, 5 more (50, 50, 50, 55) | |||||||
| Design 1 | NA | NA | NA | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.1091 | 0.0015 | NA | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.0624 | 0.0052 | NA | 0.39 | 0.13 | 0.13 | 0.35 |
| Design 4 | 0.0733 | 0.0008 | NA | 0.37 | 0.14 | 0.14 | 0.35 |
| Design 5 | 0.0677 | 0.001 | NA | 0.37 | 0.13 | 0.13 | 0.37 |
| Design 6 | 0.5654 | 0 | NA | 0.15 | 0.15 | 0.15 | 0.54 |
| Better best (50, 55, 60, 65) | |||||||
| Design 1 | NA | NA | 3.29 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.7953 | 0 | 5.11 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.6843 | 0.0001 | 4.16 | 0.37 | 0.10 | 0.19 | 0.34 |
| Design 4 | 0.8177 | 0 | 4.05 | 0.36 | 0.11 | 0.19 | 0.35 |
| Design 5 | 0.8069 | 0 | 3.86 | 0.36 | 0.10 | 0.18 | 0.37 |
| Design 6 | 0.8982 | 0 | 1.95 | 0.07 | 0.10 | 0.22 | 0.61 |
| One worse, others work (50, 45, 55, 60) | |||||||
| Design 1 | NA | NA | 2.96 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.6341 | 0 | 5.10 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.6123 | 0.0005 | 3.86 | 0.38 | 0.07 | 0.18 | 0.36 |
| Design 4 | 0.6872 | 0 | 3.67 | 0.38 | 0.07 | 0.18 | 0.37 |
| Design 5 | 0.6856 | 0 | 3.43 | 0.36 | 0.07 | 0.17 | 0.40 |
| Design 6 | 0.8972 | 0 | 1.95 | 0.10 | 0.07 | 0.22 | 0.61 |
| All work, two similar (50, 55, 60, 60) | |||||||
| Design 1 | NA | NA | 3.39 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 2 | 0.2701 | 0 | 5.24 | 0.25 | 0.25 | 0.25 | 0.25 |
| Design 3 | 0.2692 | 0.0004 | 3.87 | 0.36 | 0.11 | 0.27 | 0.26 |
| Design 4 | 0.2744 | 0.2744 | 3.73 | 0.35 | 0.12 | 0.27 | 0.27 |
| Design 5 | 0.2744 | 0 | 3.54 | 0.37 | 0.10 | 0.26 | 0.27 |
| Design 6 | 0.5493 | 0 | 2.93 | 0.06 | 0.12 | 0.41 | 0.41 |
MSE mean square error, NA not applicable
Fig. 2Average sample sizes (a, c, e, g, i, k) and probability of trial success (Pr(Success); b, d, f, h, j) for each design. Each row represents a different scenario: a, b “Null” scenario; c, d “One works, 10 more”; e, f “One works, 5 more”; g, h “Better, Best”; i, j “One worse, others work”; k, l “All work, two similar”. The type I error is represented in b; The power is given in d, f, h, j, l
Fig. 3Allocations (Prop Alloc) across 10,000 simulated trials for the tubular bandage arm and true best arm. Each design is represented on the x axis. a “One works, 10 more” tubular bandage allocation; b “One works, 10 more” true best arm allocation; c “One works, 5 more” tubular bandage allocation; d “One works, 5 more” true best arm allocation; e “Better, Best” tubular bandage allocation; f “Better, Best” true best arm allocation; g “One worse, others work” tubular bandage allocation; h “One worse, others work” true best arm allocation; i “All work, two similar” tubular bandage allocation; j “All work, two similar” true best arm allocation
Summary of re-executions of the Collaborative Ankle Support Trial using each Bayesian design
| Design 1 | Design 2 | Design 3 | Design 4 | Design 5 | Design 6 | |
|---|---|---|---|---|---|---|
| Proportion stopping for efficacy at 200 patients | NA | 0.216 | 0.148 | 0.166 | 0.147 | 0.072 |
| Proportion stopping for efficacy at 400 patients | NA | 0.043 | 0.011 | 0.017 | 0.011 | 0.004 |
| Proportion stopping for futility at 200 patients | NA | 0 | 0 | 0 | 0 | 0 |
| Proportion stopping for futility at 400 patients | NA | 0 | 0 | 0 | 0 | 0 |
| Proportion re-executions declared successful at final analysis | 0.855 | 0.894 | 0.835 | 0.865 | 0.877 | 0.23 |
| Proportion re-executions tubular bandage (control) declared best at final analysis | 0 | 0 | 0.001 | 0 | 0 | 0 |
| Proportion re-executions boot declared best at final analysis | 0.054 | 0.057 | 0.085 | 0.036 | 0.021 | 0.007 |
| Proportion re-executions brace declared best at final analysis | 0.437 | 0.402 | 0.43 | 0.451 | 0.481 | 0.432 |
| Proportion re-executions below-knee cast declared best at final analysis | 0.509 | 0.541 | 0.484 | 0.513 | 0.498 | 0.561 |
| Median (IQR) of the posterior mean estimates for tubular bandage | 54.25 (52.70–55.68) | 53.72 (51.90–55.46) | 54.40 (52.99–55.74) | 53.91 (52.52–55.30) | 53.97 (52.64–55.33) | 52.49 (51.68–52.96) |
| Median (IQR) of the posterior estimates of the difference in means between boot and tubular bandage | 5.60 (3.65–7.48) | 6.00 (4.02–8.25) | 5.65 (3.75–7.56) | 4.77 (2.42–6.84) | 4.85 (2.58–7.05) | 6.42 (3.98–8.15) |
| Median (IQR) of the posterior estimates of the difference in means between brace and tubular bandage | 8.60 (6.52–10.63) | 8.66 (6.67–10.89) | 7.62 (4.81–10.22) | 8.48 (5.65–10.71) | 8.67 (5.99–10.73) | 9.64 (6.01–11.66) |
| Median (IQR) of the posterior estimates of the difference in means between below-knee cast and tubular bandage | 8.70 (6.86–10.91) | 9.69 (7.22–13.29) | 8.06 (5.44–10.53) | 8.79 (6.57–11.39)– | 8.68 (6.58–11.27) | 10.57 (8.69–11.78) |
IQR interquartile range, NA not applicable