| Literature DB >> 27417407 |
Amy V Spencer1, Chris Harbron2, Adrian Mander3, James Wason3, Ian Peers4.
Abstract
Potential predictive biomarkers are often measured on a continuous scale, but in practice, a threshold value to divide the patient population into biomarker 'positive' and 'negative' is desirable. Early phase clinical trials are increasingly using biomarkers for patient selection, but at this stage, it is likely that little will be known about the relationship between the biomarker and the treatment outcome. We describe a single-arm trial design with adaptive enrichment, which can increase power to demonstrate efficacy within a patient subpopulation, the parameters of which are also estimated. Our design enables us to learn about the biomarker and optimally adjust the threshold during the study, using a combination of generalised linear modelling and Bayesian prediction. At the final analysis, a binomial exact test is carried out, allowing the hypothesis that 'no population subset exists in which the novel treatment has a desirable response rate' to be tested. Through extensive simulations, we are able to show increased power over fixed threshold methods in many situations without increasing the type-I error rate. We also show that estimates of the threshold, which defines the population subset, are unbiased and often more precise than those from fixed threshold studies. We provide an example of the method applied (retrospectively) to publically available data from a study of the use of tamoxifen after mastectomy by the German Breast Study Group, where progesterone receptor is the biomarker of interest.Entities:
Keywords: Bayesian prediction; adaptive design; biomarker-guided trial; enrichment; threshold determination
Mesh:
Substances:
Year: 2016 PMID: 27417407 PMCID: PMC5378309 DOI: 10.1002/sim.7042
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.373
Figure 1Flow diagram outlining the key steps in the continuous biomarker-adaptive threshold trial design, which are described in detail in Sections 2.1 to 2.3 of the main text.
Results from 5000 simulation iterations using adaptive design 1 and fixed design 1 with S1 = S2 = 50, t1 = 0.5, ρ = 0.4 and α = 0.05 in scenarios where H0 is true, but the maximum Π only just fails to reach ρ. Here, the proportion of significant simulations is a type-I error rate.
| Π(0) | Proportion significant/type-I error rate (overall) | Proportion significant/type-I error rate (completed studies) | Stopping rate | Ratio of subjects screened (AD1 to FD1) | ||||
|---|---|---|---|---|---|---|---|---|
| Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Overall | Completed studies | |
| 0.39 | 0.018 | 0.029 | 0.198 | 0.131 | 0.908 | 0.781 | 0.94 | 1.27 |
| 0.38 | 0.015 | 0.026 | 0.186 | 0.124 | 0.919 | 0.791 | 0.94 | 1.33 |
| 0.36 | 0.006 | 0.012 | 0.108 | 0.078 | 0.941 | 0.843 | 0.97 | 1.54 |
| 0.34 | 0.004 | 0.005 | 0.068 | 0.044 | 0.947 | 0.877 | 0.99 | 1.61 |
| 0.29 | 0.001 | 0.001 | 0.032 | 0.024 | 0.975 | 0.950 | 1.04 | 2.33 |
| 0.16 | 0 | 0 | 0 | 0 | 0.994 | 0.999 | 1.04 | 4.02 |
Within the subset of studies not stopped at the interim, the proportion which resulted in a significant hypothesis test result.
Results from 5000 simulation iterations using adaptive design 1 and fixed design 1 with S1 = S2 = 50, t1 = 0.5, ρ = 0.4, α = 0.05 and RH = XH / S = 0.49 in scenarios with a biomarker effect and where HA is true. Here, the proportion of significant simulations is power.
| Π(0) | Π(0.95) | Proportion significant/power (overall) | Proportion significant/power (completed
studies) | Stopping rate | Ratio of subjects screened (AD1 to FD1) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Overall | Completed studies | |||||
| 3 | 0.8 | n/a | 0.20 | 0.46 | 0.002 | <0.001 | 0.067 | 0.024 | 0.967 | 0.984 | 1.16 | 3.21 |
| 6 | 0.8 | 0.93 | 0.13 | 0.51 | 0.002 | 0 | 0.069 | 0 | 0.974 | >0.999 | 1.18 | 3.99 |
| 9 | 0.8 | 0.89 | 0.11 | 0.56 | 0.002 | 0 | 0.044 | n/a | 0.964 | 1 | 1.28 | 4.38 |
| 3 | 0.6 | 0.85 | 0.24 | 0.53 | 0.036 | 0.008 | 0.273 | 0.069 | 0.866 | 0.890 | 1.36 | 2.42 |
| 6 | 0.6 | 0.74 | 0.19 | 0.65 | 0.148 | 0.001 | 0.503 | 0.020 | 0.706 | 0.941 | 2.24 | 2.83 |
| 9 | 0.6 | 0.71 | 0.17 | 0.74 | 0.350 | 0.001 | 0.717 | 0.014 | 0.512 | 0.958 | 3.14 | 2.83 |
| 3 | 0.5 | 0.76 | 0.27 | 0.56 | 0.110 | 0.037 | 0.448 | 0.153 | 0.755 | 0.760 | 1.49 | 2.22 |
| 6 | 0.5 | 0.65 | 0.22 | 0.71 | 0.384 | 0.033 | 0.729 | 0.144 | 0.474 | 0.771 | 2.28 | 2.21 |
| 9 | 0.5 | 0.62 | 0.21 | 0.81 | 0.669 | 0.039 | 0.831 | 0.161 | 0.195 | 0.755 | 2.80 | 2.04 |
| 3 | 0.4 | 0.66 | 0.29 | 0.60 | 0.228 | 0.145 | 0.570 | 0.340 | 0.600 | 0.573 | 1.41 | 1.76 |
| 6 | 0.4 | 0.56 | 0.26 | 0.76 | 0.623 | 0.237 | 0.795 | 0.447 | 0.217 | 0.469 | 1.85 | 1.67 |
| 9 | 0.4 | 0.53 | 0.24 | 0.86 | 0.827 | 0.336 | 0.866 | 0.562 | 0.046 | 0.402 | 1.74 | 1.43 |
| 3 | 0.23 | 0.5 | 0.34 | 0.66 | 0.477 | 0.510 | 0.734 | 0.699 | 0.350 | 0.271 | 1.20 | 1.33 |
| 6 | 0.33 | 0.5 | 0.28 | 0.79 | 0.732 | 0.522 | 0.824 | 0.704 | 0.111 | 0.259 | 1.45 | 1.35 |
| 9 | 0.35 | 0.5 | 0.26 | 0.89 | 0.844 | 0.599 | 0.856 | 0.766 | 0.014 | 0.218 | 1.33 | 1.20 |
| 3 | 0.2 | 0.48 | 0.35 | 0.66 | 0.515 | 0.586 | 0.750 | 0.758 | 0.314 | 0.227 | 1.17 | 1.28 |
| 6 | 0.2 | 0.38 | 0.33 | 0.84 | 0.830 | 0.913 | 0.847 | 0.968 | 0.020 | 0.057 | 1.03 | 1.01 |
| 9 | 0.2 | 0.36 | 0.32 | 0.93 | 0.859 | 0.983 | 0.860 | 0.996 | 0.001 | 0.013 | 0.88 | 0.88 |
| 3 | 0.02 | 0.3 | 0.397 | 0.72 | 0.872 | 0.599 | 0.880 | 0.766 | 0.112 | 0.218 | 1.04 | 1.20 |
| 6 | 0.10 | 0.3 | 0.36 | 0.87 | 0.895 | 0.990 | 0.899 | 0.997 | 0.005 | 0.007 | 0.85 | 0.85 |
| 9 | 0.13 | 0.3 | 0.35 | 0.95 | 0.901 | 0.997 | 0.901 | >0.999 | 0 | 0.002 | 0.81 | 0.81 |
Within the subset of studies not stopped at the interim, the proportion which resulted in a significant hypothesis test result.
In this set of simulations, all studies with the fixed threshold design were stopped at the interim so there are no completed studies; however, the ratio of screened subjects is calculated based on any theoretical fixed design study that was not stopped having screened 200 subjects.
Results from 5000 simulation iterations using adaptive design 1 and fixed design 1 with S1 = S2 = 50, t1 = 0.5, ρ = 0.4 and α = 0.05 in scenarios with no biomarker effect and therefore a constant response rate.
| Π | Proportion significant (overall) | Proportion significant (completed studies) | Stopping rate | Ratio of subjects screened (AD1 to FD1) | ||||
|---|---|---|---|---|---|---|---|---|
| Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Adaptive design 1 | Fixed design 1 | Overall | Completed studies | |
| 0.35 | <0.001 | 0.001 | 0.016 | 0.018 | 0.975 | 0.934 | 0.99 | 1.56 |
| 0.40 | 0.022 | 0.033 | 0.210 | 0.135 | 0.897 | 0.758 | 0.93 | 1.23 |
| 0.42 | 0.049 | 0.078 | 0.343 | 0.226 | 0.857 | 0.668 | 0.89 | 1.16 |
| 0.50 | 0.409 | 0.569 | 0.843 | 0.760 | 0.515 | 0.251 | 0.81 | 0.94 |
| 0.55 | 0.729 | 0.872 | 0.971 | 0.938 | 0.249 | 0.071 | 0.79 | 0.84 |
| 0.65 | 0.982 | 0.998 | 1 | >0.999 | 0.018 | 0.001 | 0.76 | 0.76 |
Within the subset of studies not stopped at the interim, the proportion which resulted in a significant hypothesis test result.
Figure 2The bias in 5000 iterations of a simulated study dependent on the stage 1 threshold (t1) and the study design where AD1 is adaptive design 1 and FD1 is fixed design 1. The data are generated from a logistic model with T = 0.6 and δ1 = 6. Other parameters are fixed at α = 0.05, 1-β = 0.8, ρ = 0.4 and S1 = S2 = 50.
Figure 3The power of different study designs using t1 = 0.5, α = 0.05 and 1-β = 0.8 or 1-βFD = 0.2, ρ = 0.4 and S1 = S2 = 50, with δ1 = 6, but a variety of true thresholds. We give (a) the overall power (proportion of 5000 simulation iterations which have significant hypothesis test results) and (b) the power within completed studies (those iterations which are not stopped at the interim).
Results of hypothesis tests (HA: RR > 0.65) on subsets (S = 70) of subjects on hormonal therapy from the German Breast Study Group.
| Recruitment | Responders/total sample size | Hypothesis test | Threshold estimate (75% CI) fmol/mg |
|---|---|---|---|
| Adaptive threshold (CBATT with | 53 / 70 | 0.037 | 4 (0, 15) |
| Fixed threshold ( | 48 / 70 | 0.312 | 8 (1, 20) |
| Full available data | 96 / 176 | n/a | 11 (3, 32) |
The subsets were ‘recruited’ with either the CBATT or fixed threshold design. Final threshold estimates and 75% confidence intervals from these subsets are also included, as well as for the full 176 subjects in the dataset who were on hormonal therapy and whose survival data were not censored before 1500 days.
Figure 4Estimated average recurrence free survival rate at 1500 days after mastectomy for the subpopulation, with progesterone receptor levels above those given on the x-axis. This was calculated with the full available data (S = 176) as well as with two ‘retrospective recruitment’ studies with S = 70, ρ = 0.65, α = 0.05, 1-β = 0.8 and t1 or t = 0.35.