| Literature DB >> 27587590 |
P Dutton1,2, S B Love1,2, L Billingham3, A B Hassan2,4.
Abstract
Trials run in either rare diseases, such as rare cancers, or rare sub-populations of common diseases are challenging in terms of identifying, recruiting and treating sufficient patients in a sensible period. Treatments for rare diseases are often designed for other disease areas and then later proposed as possible treatments for the rare disease after initial phase I testing is complete. To ensure the trial is in the best interests of the patient participants, frequent interim analyses are needed to force the trial to stop promptly if the treatment is futile or toxic. These non-definitive phase II trials should also be stopped for efficacy to accelerate research progress if the treatment proves to be particularly promising. In this paper, we review frequentist and Bayesian methods that have been adapted to incorporate two binary endpoints and frequent interim analyses. The Eurosarc Trial of Linsitinib in advanced Ewing Sarcoma (LINES) is used as a motivating example and provides a suitable platform to compare these approaches. The Bayesian approach provides greater design flexibility, but does not provide additional value over the frequentist approaches in a single trial setting when the prior is non-informative. However, Bayesian designs are able to borrow from any previous experience, using prior information to improve efficiency.Entities:
Keywords: Bayesian clinical trial; early stopping; multiple endpoints; phase II; multi-stage design
Mesh:
Substances:
Year: 2016 PMID: 27587590 PMCID: PMC5863794 DOI: 10.1177/0962280216662070
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Decision rules for the formal Bayesian decision rule approach. Red: stop the trial for futility. Orange: continue the trial. Green: stop the trial for efficacy.
Comparison of approaches for calculating sample size for the LINES trial using non-informative priors.
| Design | Sample size at analysis | Frequentist properties | Bayesian posterior probabilities | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Type I error | Type II error | Sample size H0 | Sample size H1 | Futility | Efficacy | Toxicity | Toxicity acceptable | ||
| Frequentist single stage | 44 | 0.0895 | 0.1888 | 44.00 | 44.00 | 0.845 | 0.948 | 0.996 | 0.907 |
| Bryant and Day (optimal) | 20,50 | 0.0965 | 0.1944 | 31.01 | 46.16 | 0.901 | 0.929 | 0.997 | 0.879 |
| Bryant and Day (minmax) | 24,41 | 0.0977 | 0.1982 | 33.19 | 40.26 | 0.850 | 0.938 | 0.993 | 0.920 |
| Bayesian posterior probability (single stage) | 44 | 0.1526 | 0.1161 | 44.00 | 44.00 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior probability (two stage) | 22, 44 | 0.1491 | 0.1689 | 35.02 | 34.95 | 0.910 | 0.901 | 0.977 | 0.907 |
| Bayesian posterior probability (four stage) | 11, 22, 33, 44 | 0.1261 | 0.2795 | 26.02 | 28.14 | 0.910 | 0.901 | 0.952 | 0.907 |
| Bayesian posterior probability (six stage) | 10, 17, 24, 30, 37, 44 | 0.1364 | 0.2680 | 24.62 | 26.46 | 0.910 | 0.901 | 0.967 | 0.907 |
| Bayesian posterior predictive probability (two stage) | 22, 44 | 0.1477 | 0.1286 | 36.11 | 40.60 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior predictive probability (four stage) | 11, 22, 33, 44 | 0.1426 | 0.1538 | 30.92 | 34.71 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior predictive probability (six stage) | 11, 17, 24, 30, 37, 44 | 0.1455 | 0.1606 | 29.53 | 32.00 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior predictive probability (continuous evaluation) | 11 to 44, continuous | 0.1471 | 0.1959 | 24.34 | 25.12 | 0.910 | 0.901 | 0.990 | 0.907 |
| Bayesian decision theory (four stage, complete) | 11, 22, 33, 44 | 0.1090 | 0.2657 | 24.14 | 29.16 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian decision theory (six stage, complete) | 11, 17, 24, 30, 37, 44 | 0.1149 | 0.2617 | 24.08 | 28.20 | 0.910 | 0.901 | 0.996 | 0.907 |
| Bayesian decision theory (four stage, censored[ | 11, 22, 33, 44 | 0.2585[ | 0.3700[ | 27.88 | 32.30 | 0.978 | 0.973 | 0.996 | 0.979 |
| Bayesian decision theory (four stage with censored portion) | 11, 22, 33, 44, 55, 66, 77, 88 | 0.0804 | 0.1741 | 32.96 | 37.22 | 0.958 | 0.958 | 0.996 | 0.927 |
| Bayesian decision theory (six stage, censored[ | 11, 17, 24, 30, 37, 44 | 0.2445[ | 0.3633[ | 26.07 | 30.34 | 0.976 | 0.975 | 0.999 | 0.976 |
| Bayesian decision theory (six stage with censored portion) | 11, 17, 24, 30, 37, 44, 55, 66, 77, 88 | 0.0819 | 0.1896 | 30.75 | 34.70 | 0.958 | 0.958 | 0.999 | 0.927 |
Trial with 88 patients, censored after 44 patients, with censored interim analysis at 55, 66, 77 and 88 patients. The larger trial is directly below the censored trial.
The type I error includes inconclusive trials with probability 0.2075 and the type II error includes inconclusive trials with probability 0.2276.
The type I error includes inconclusive trials with probability 0.1895 and the type II error includes inconclusive trials with probability 0.2006.
Comparison of approaches for calculating sample size for the LINES trial using informative priors. We use a Beta(3,7) prior for response and a Beta(2,8) prior for toxicity.
| Design | Sample size at analysis | Frequentist properties | Bayesian posterior probabilities | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Type I error | Type II error | Sample size H0 | Sample size H1 | Futility | Efficacy | Toxicity | Toxicity acceptable | ||
| Bayesian posterior probability (single stage) | 36 | 0.1755 | 0.1451 | 36 | 36 | 0.91 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior probability (two stage) | 18,36 | 0.1559 | 0.2257 | 29.15 | 27.89 | 0.91 | 0.901 | 0.953 | 0.907 |
| Bayesian posterior probability (four stage) | 9,18,27,36 | 0.1497 | 0.2567 | 24.29 | 24.29 | 0.91 | 0.901 | 0.953 | 0.907 |
| Bayesian posterior probability (six stage) | 9,15,20,25,30,36 | 0.1508 | 0.2659 | 21.32 | 22.71 | 0.91 | 0.901 | 0.952 | 0.907 |
| Bayesian posterior likelihood (two stage) | 18,36 | 0.1662 | 0.1539 | 30.81 | 33.25 | 0.91 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior likelihood (four stage) | 9,15,20,25,30,36 | 0.1595 | 0.1684 | 25.79 | 27.02 | 0.91 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior likelihood (six stage) | 9,15,20,25,30,36 | 0.1595 | 0.1684 | 25.79 | 27.02 | 0.91 | 0.901 | 0.996 | 0.907 |
| Bayesian posterior likelihood (continuous evaluation) | 28 analyses between 9 and 36 | 0.1604 | 0.1921 | 22.1 | 22.56 | 0.91 | 0.901 | 0.99 | 0.907 |
| Bayesian loss function (four stage, complete) | 9,18,27,36 | 0.1301 | 0.5727 | 17.57 | 25.64 | 0.922 | 0.948 | 0.972 | 0.992 |
| Bayesian loss function (six stage, complete) | 9,15,20,25,30,36 | 0.0747 | 0.4695 | 16.17 | 23.54 | 0.91 | 0.901 | 0.972 | 0.953 |
| Bayesian loss function (four stage, censored[ | 9,18,27,36 | 0.2149[ | 0.6203[ | 20.26 | 28.07 | 0.967 | 0.975 | 0.994 | 0.997 |
| Bayesian loss function (four stage with censored portion) | 9,18,27,36,45, 54,63,72 | 0.0564 | 0.3192 | 24.25 | 34.44 | 0.947 | 0.956 | 0.994 | 0.954 |
| Bayesian loss function (six stage, censored[ | 9,15,20,25,30,36 | 0.2066[ | 0.6201[ | 19.36 | 27.16 | 0.97 | 0.975 | 0.993 | 0.997 |
| Bayesian loss function (six stage with censored portion) | 9,15,20,25,30, 36,45,54,63,72 | 0.0559 | 0.341 | 23.18 | 33.09 | 0.947 | 0.956 | 0.993 | 0.954 |
Trial with 72 patients, censored after 36 patients, with censored interim analysis at 45, 54, 63 and 72 patients. The larger trial is directly below the censored trial.
The type I error includes inconclusive trials with probability 0.2075 and the type II error includes inconclusive trials with probability 0.2026.
The type I error includes inconclusive trials with probability 0.1823 and the type II error includes inconclusive trials with probability 0.3209.
Summary of conclusions.
| Methodology | Advantages | Disadvantages | |
|---|---|---|---|
| Frequentist | Formal control of type I and type II error | Increased sample size | |
| Bayesian | Inclusion of informative prior information. Particularly useful when a previous trial has data that can be used to inform the prior Smaller sample size Continual assessment is possible | Informative priors reduce sample size, which will increase the trial type I and type II error | |
| Design | |||
| Single stage | Trial is simple to design, run and analyse | ||
| Multistage – stop early for futility | If treatment is futile, the expected number of patients exposed is smaller | Small changes in design affect the trial properties | |
| Multistage – stop early for efficacy | If treatment if efficacious, the expected number of patients required is smaller accelerating research | Small changes in design affect the trial properties | |
| Single-stage | Frequentist | Easy to run and analyse | |
| Bryant and Day two-stage design | Frequentist | Inclusion of a second endpoint over Simon’s two-stage design | Type I and type II errors are higher for the same sample size |
| Posterior probability | Bayesian | Smallest expected sample sizes | Highest type I and type II errors tend to be a non-significant chance of stopping prematurely at the first few interim analysis |
| Posterior predictive probability | Bayesian | Good balance between expected sample size and certainty of results | |
| Bayes Decision theory | Bayesian | Easy to extend a trial if the results are uncertain | Complicated to design |