| Literature DB >> 34747043 |
Kentaro Matsuura1,2, Junya Honda3,4, Imad El Hanafi5,6, Takashi Sozu7, Kentaro Sakamaki8.
Abstract
Estimation of the dose-response curve for efficacy and subsequent selection of an appropriate dose in phase II trials are important processes in drug development. Various methods have been investigated to estimate dose-response curves. Generally, these methods are used with equal allocation of subjects for simplicity; nevertheless, they may not fully optimize performance metrics because of nonoptimal allocation. Optimal allocation methods, which include adaptive allocation methods, have been proposed to overcome the limitations of equal allocation. However, they rely on asymptotics, and thus sometimes cannot efficiently optimize the performance metric with the sample size in an actual clinical trial. The purpose of this study is to construct an adaptive allocation rule that directly optimizes a performance metric, such as power, accuracy of model selection, accuracy of the estimated target dose, or mean absolute error over the estimated dose-response curve. We demonstrate that deep reinforcement learning with an appropriately defined state and reward can be used to construct such an adaptive allocation rule. The simulation study shows that the proposed method can successfully improve the performance metric to be optimized when compared with the equal allocation, D-optimal, and TD-optimal methods. In particular, when the mean absolute error was set to the metric to be optimized, it is possible to construct a rule that is superior for many metrics.Entities:
Keywords: adaptive design; clinical trial; dose-finding; dose-ranging; optimal design; response-adaptive
Mesh:
Year: 2021 PMID: 34747043 PMCID: PMC9298337 DOI: 10.1002/sim.9247
Source DB: PubMed Journal: Stat Med ISSN: 0277-6715 Impact factor: 2.497
Dose‐response scenarios
| Scenario no. | Model | Max effect | Formula |
|
|
|---|---|---|---|---|---|
| 1 | linear | 1.65 |
| 6.30 | (5.67, 6.93) |
| 2 | linear | 1.65 |
| 7.88 | (7.09, 8.00) |
| 3 | linear | 1.65 |
| 5.25 | (4.73, 5.78) |
| 4 | Emax | 1.65 |
| 2.00 | (1.44, 2.95) |
| 5 | Emax | 1.65 |
| 6.83 | (3.30, 8.00) |
| 6 | Emax | 1.65 |
| 1.17 | (0.92, 1.52) |
| 7 | sigEmax | 1.65 |
| 5.06 | (4.68, 5.58) |
| 8 | sigEmax | 1.65 |
| 7.37 | (5.75, 8.00) |
| 9 | sigEmax | 1.65 |
| 4.47 | (4.24, 4.74) |
| 10 | quadratic | 1.65 |
| 3.24 | (2.76, 3.81) |
| 11 | quadratic | 1.65 |
| 5.26 | (3.98, 8.00) |
| 12 | quadratic | 1.65 |
| 2.48 | (2.16, 2.84) |
| 13 | exponential | 1.65 |
| 7.76 | (7.66, 7.86) |
| 14 | exponential | 1.65 |
| 7.98 | (7.88, 8.00) |
| 15 | exponential | 1.65 |
| 7.58 | (7.47, 7.67) |
| 16 | flat | 0 |
| ‐ | ‐ |
Note: If the upper of did not exist or was greater than 8 (maximum dose), the upper was set to 8
FIGURE 1Dose‐response scenarios
FIGURE 2The results for the type I error rate before adjustment
FIGURE 3The results for power. The vertical dotted line represents the value of the equal allocation
FIGURE 4Probability of selecting the true model
FIGURE 6The results for MAE. Smaller MAE implies better accuracys
FIGURE 5Probability that the estimated target dose is within the interval
FIGURE 7The results for the average number of subjects allocated