| Literature DB >> 29871495 |
Matteo Quartagno1,2, A Sarah Walker1, James R Carpenter1,2, Patrick Pj Phillips1, Mahesh Kb Parmar1.
Abstract
Background Trials to identify the minimal effective treatment duration are needed in different therapeutic areas, including bacterial infections, tuberculosis and hepatitis C. However, standard non-inferiority designs have several limitations, including arbitrariness of non-inferiority margins, choice of research arms and very large sample sizes. Methods We recast the problem of finding an appropriate non-inferior treatment duration in terms of modelling the entire duration-response curve within a pre-specified range. We propose a multi-arm randomised trial design, allocating patients to different treatment durations. We use fractional polynomials and spline-based methods to flexibly model the duration-response curve. We call this a 'Durations design'. We compare different methods in terms of a scaled version of the area between true and estimated prediction curves. We evaluate sensitivity to key design parameters, including sample size, number and position of arms. Results A total sample size of ~ 500 patients divided into a moderate number of equidistant arms (5-7) is sufficient to estimate the duration-response curve within a 5% error margin in 95% of the simulations. Fractional polynomials provide similar or better results than spline-based methods in most scenarios. Conclusion Our proposed practical randomised trial 'Durations design' shows promising performance in the estimation of the duration-response curve; subject to a pending careful investigation of its inferential properties, it provides a potential alternative to standard non-inferiority designs, avoiding many of their limitations, and yet being fairly robust to different possible duration-response curves. The trial outcome is the whole duration-response curve, which may be used by clinicians and policymakers to make informed decisions, facilitating a move away from a forced binary hypothesis testing paradigm.Entities:
Keywords: Antimicrobial resistance; design; duration of therapy; flexible modelling; non-inferiority; randomised trial
Mesh:
Year: 2018 PMID: 29871495 PMCID: PMC6136078 DOI: 10.1177/1740774518778027
Source DB: PubMed Journal: Clin Trials ISSN: 1740-7745 Impact factor: 2.486
Simulation scenarios: eight different data-generating mechanisms were investigated.
| Type | Equation | Characteristics | Plot |
|---|---|---|---|
| 1. Logistic growth curve |
| Increases and asymptotes early |
|
| 2. Gompertz curve A |
| Small curvature |
|
| 3. Gompertz curve B |
| Larger curvature, asymptotes more clearly |
|
| 4. Gompertz curve C |
| Asymptotes extremely early |
|
| 5. Linearduration–responsecurve on log-odds scale |
| Situation where simple logisticregression is appropriate |
|
| 6. Quadraticduration–responsecurve, curvature > 0 |
| First derivative increasing |
|
| 7. Quadraticduration–responsecurve, curvature < 0 |
| First derivative decreasing |
|
| 8. Piece-wise linearduration–response curve |
| Different from linear spline logisticregression, here it is linear inthe success rate, notin the log-odds |
|
In plots, x-axis is treatment duration, and y-axis is probability of cure.
Scaled Area Between Curves (sABC), and coverage (%) across the eight different scenarios in the base-case design (1000 simulations of 504 patients randomised across seven arms, using FP).
| sABC |
| Coverage (%) | ||||||
|---|---|---|---|---|---|---|---|---|
| Min | Med. | Max | Med. | Mean | ||||
| Scenario 1 | 0.019 | 0.022 | 0.032 |
| 0.077 | 0.105 | 0.164 | 61.0 |
| Scenario 2 | 0.005 | 0.006 | 0.024 |
| 0.082 | 0.047 | 0.128 | 83.4 |
| Scenario 3 | 0.003 | 0.007 | 0.022 |
| 0.079 | 0.055 | 0.123 | 86.8 |
| Scenario 4 | 0.007 | 0.010 | 0.022 |
| 0.050 | 0.066 | 0.105 | 79.6 |
| Scenario 5 | 0.000 | 0.003 | 0.015 |
| 0.061 | 0.030 | 0.078 | 94.7 |
| Scenario 6 | 0.011 | 0.012 | 0.022 |
| 0.066 | 0.051 | 0.100 | 89.5 |
| Scenario 7 | 0.002 | 0.004 | 0.015 |
| 0.056 | 0.033 | 0.082 | 92.9 |
| Scenario 8 | 0.009 | 0.010 | 0.025 |
| 0.061 | 0.070 | 0.138 | 72.7 |
| Overall | 0.000 | 0.006 | 0.022 |
| 0.082 | 0.055 | 0.129 | 82.6 |
Column for the 95th percentile of scaled Area Between Curves is in bold, to show how scaled Area Between Curves is smaller, or close to, 5% in all scenarios and overall across all 8000 simulations. Asterisks next to Scenario 5 results indicate that this is the only scenario where the data-generating mechanism is actually a particular case of fractional polynomial on the log-odds scale and therefore performs optimally. sABC is the scaled Area Between Curves as defined in the proposals section, while indicates the maximum absolute error for a single duration and coverage (%) is defined as the percentage of the true underlying curve included within the point-wise 95% confidence region around the estimated curve.
Figure 1.Prediction curves (red) of a random selection of 100 simulations against the true data-generating curve (black) for all the eight scenarios under the base-case configuration. The base-case scenario assumes a sample size of 504 patients, randomised to seven equidistant arms, and fits a fractional polynomial model to estimate the duration–response curve.
Figure 2.Comparison of results of trial simulations from the eight scenarios varying either (1) the flexible regression method used (LS3, LS5, LSNE, MARS, FP), with total sample size of 504 patients (panels (a) and (b)), or (2) the total sample size between 250 and 1000 patients, using FP (panels (c) and (d)). Patients are divided into seven equidistant duration arms. The red horizontal line indicates 5% scaled Area Between Curves (sABC). In the left panels, we show the box plots of the whole simulation results, while in the right panels we compare percentiles from the eight scenarios. LS3-5: Linear Spline with 3–5 knots; LSNE: linear spline with non-equidistant knots; MARS: multivariable adaptive regression splines; FP: fractional polynomials. (a) Comparison of flexible regression methods: 8000 simulations. (b) Comparison of flexible regression methods: 95th percentiles. (c) Sensitivity to sample size: 8000 simulations. (d) Sensitivity to sample size: 95th percentiles.
Figure 3.Prediction curves leading to the largest scaled Area Between Curves for each of the eight scenarios with the base-case design, analysing data either with three-knot linear spline (blue) or fractional polynomials (red).
Figure 4.Comparison of results of trial simulations from the eight scenarios either varying the number of equidistant arms (panels (a) and (b)) between 3 and 20, using fractional polynomials (FP), or using different designs, equidistant (ED) or not equidistant (NED), comparing four different regression methods (panels (c) and (d)). The total sample size is always 504 patients. The red horizontal line indicates 5% scaled Area Between Curves. In the left panels, we show the box plots of the whole simulation results, while in the right panels, we compare percentiles from the eight scenarios. In panel (d), there is only one point for NED-LS3, since only in one scenario the percentile for scaled Area Between Curves was smaller than 0.25. LS3: linear spline with three knots; LSNE: linear spline with non-equidistant knots; MARS: multivariable adaptive regression splines; FP: fractional polynomials. (a) Sensitivity to number of arms: 8000 simulations. (b) Sensitivity to number of arms: 95th percentile. (c) Sensitivity to placement of arms: 8000 simulations. (d) Sensitivity to placement of arms: 95th percentile.