Literature DB >> 33080100

Comparison of Various Phase I Combination Therapy Designs in Oncology for Evaluation of Early Tumor Shrinkage Using Simulations.

Jérémy Seurat¹, Pascal Girard², Kosalaram Goteti³, France Mentré¹.

Abstract

There is still a lack of efficient designs for identifying the dose response in oncology combination therapies in early clinical trials. The concentration response relationship can be identified using the early tumor shrinkage time course, which has been shown to be a good early response marker of clinical efficacy. The performance of various designs using an exposure-tumor growth inhibition model was explored using simulations. Different combination effects of new drug M and cetuximab (reference therapy) were explored first assuming no effect of M on cetuximab (to investigate the type I error (α)), and subsequently assuming additivity or synergy between cetuximab and M. One-arm, two-arm, and four-arm designs were evaluated. In the one-arm design, 60 patients received cetuximab + M. In the two-arm design, 30 patients received cetuximab and 30 received cetuximab + M. In the four-arm design, in addition to cetuximab and cetuximab + M as standard doses, combination arms with lower doses of cetuximab were evaluated (15 patients/arm). Model-based predictions or "simulated observations" of early tumor shrinkage at week 8 (ETS8) were compared between the different arms. With the same number of individuals, the one-arm design showed better statistical power than other designs but led to strong inflation of α in case of misestimated reference for ETS8 value. The two-arm design protected against this misestimation and, with the same total number of subjects, would provide higher statistical power than a four-arm design. However, a four-arm design would be helpful for exploring more doses of cetuximab in combination with M to better understand the interaction.

Entities: Chemical Disease Gene Mutation Species

Year: 2020 PMID： 33080100 PMCID： PMC7762808 DOI： 10.1002/psp4.12564

Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN： 2163-8306

WHAT IS THE CURRENT KNOWLEDGE ON THE TOPIC? Currently, there is no standard for design choice in early clinical studies of combinations treatments in immuno‐oncology. WHAT QUESTION DID THIS STUDY ADDRESS? These clinical trial simulations compared designs for the first time in the context of a fixed dose of one agent assumed to be optimal while testing different doses of another based on early tumor shrinkage. WHAT DOES THIS STUDY ADD TO OUR KNOWLEDGE? Statistical tests performed on individual model‐based early tumor shrinkage at week 8 values and the one‐arm design indicate a better power than two‐arm or four‐arm designs, but imply strong assumptions about the historical reference value, leading to strong inflation of type I error in the case of underestimating the reference. Choosing a two‐arm or a four‐arm design depends on the objective of the study: a two‐arm design is preferable to a four‐arm design to achieve a good statistical power, but a four‐arm design allows better exploration of the combination and better dose selection. HOW MIGHT THIS CHANGE DRUG DISCOVERY, DEVELOPMENT, AND/OR THERAPEUTICS? These simulations show that, in early‐phase oncology combination studies (with early tumor shrinkage as the outcome), the choice of a one‐arm design (i.e., without a comparator arm) is not appropriate unless the conditions of the clinical trial are similar to those of the reference treatment or because of ethical reasons. This implies strong assumptions about the historical value, and the trial could lead to a wrong conclusion if the historical value is no longer relevant. Early‐phase studies are critical in oncology drug development. These trials are frequently designed to evaluate the overall response as well as toxicity and pharmacokinetics. Go/no go decisions and dose recommendations for further studies are based on the results of phase I studies. Therefore, the design choice is crucial in the early phase. There is growing interest in the use of combination therapies in oncology, especially immuno‐oncology. The combination of several agents could have a synergistic effect and enhance the antitumor activity. However, toxicities are frequently observed in these studies, especially when chemotherapies are involved. In the past two decades, several new targeted therapies, such as monoclonal antibodies, with efficacy in cancer subpopulations and limited adverse effects compared with traditional therapies have reached the market. These drugs have recently been tested in many combination trials in addition of chemotherapy/radiotherapy. , However, the designs of clinical trials investigating combinations of immune‐oncology agents are often based on empiricism without clear assumptions about the drug interactions. Therefore, the design choice in combination trials could be challenging, especially when clinical information is only available for single agent given as monotherapy. Recommendations have been made concerning the design of early studies with a combination of targeted anticancer therapies. The importance of the assumed pharmacodynamic drug interaction and the usefulness of biomarkers are highlighted in these recommendations. More recent recommendations insist on the importance of including drug concentrations in the analysis and understanding the pharmacokinetic/pharmacodynamic relationships. The importance of including biomarker‐driven objectives in phase I trials has also been shown. Among clinical trial end points widely used in oncology, overall survival is universally accepted but involves large studies with long follow‐up, which is not compatible with early‐phase studies. End points based on tumor assessment can be used. The Response Evaluation Criteria in Solid Tumors (RECIST) criteria, widely adopted in oncology clinical trials including on targeted therapies, consist of four categories that depend on the evolution of tumor size (TS). , However, in the field of modeling and simulation it seems preferable to use the continuous TS data when available rather than RECIST as this would result in a loss of information. , Furthermore, the link between the early tumor shrinkage (ETS) and overall survival or progression free‐survival has been proposed as an early efficacy marker based on several studies. , , Nonlinear mixed effect models (NLMEM) are increasingly used to support drug decision making, especially in early‐phase trials. In oncology, these models can be used to capture the pharmacokinetics of antitumoral agents or biomarker kinetics. NLMEM can describe tumor dynamics, and several tumor growth models are found in the literature such as Gompertz or Simeoni. Tumor growth inhibition (TGI) models including a drug induced decrease in TS are used for various types of cancer and antitumoral agents, including targeted therapies. , TS (or other biomarkers) can also be linked to survival by a joint model. However, before conducting a trial and modeling the data, it is crucial to choose an appropriate design to obtain good precision of the parameter estimates and thus precise results. For this purpose, model‐based clinical trial simulation (CTS) can be performed to evaluate and compare designs. In simulations, we used a combination of candidate drug M given on top of cetuximab, a monoclonal antibody targeting the epidermal growth factor receptor of tumor cells that has been approved for metastatic colorectal cancer with wild‐type RAS and usually given in combination with chemotherapy. Several ongoing or completed studies indicate that new combinations including cetuximab could be promising for different cancers. , The aim of this study was to compare by CTS several combination designs in the treatment of refractory colorectal cancer with different assumptions about the interaction between cetuximab and M. The performances, type I error (α), and power of several designs were compared to test the superiority of the combination treatment to cetuximab alone based on ETS at week 8 (ETS8). Another objective was to compare, by modeling and simulation of exposure–TGI, the test performances on predicted ETS8 obtained from the model to the observed ETS8.

METHODS

The different models used to perform the simulations were the model of pharmacokinetic exposure during the first 8 weeks of therapy for each drug and the TGI model. We studied three different designs (one arm, two arms, and four arms) in three different scenarios: no effect of M, additivity, and synergy. Individual ETS8 values were computed as predictions after global model fitting, observations, or true values.

Models

Cetuximab and drug M pharmacokinetic exposures

Cetuximab has been described by various pharmacokinetic models. , A significant influence of body surface area (BSA) on clearance of cetuximab has been shown. Therefore, it is recommended to adjust the dose of cetuximab (DoseC) on BSA. We assumed a simple model with independent clearance with dose, as found in a previous study in patients with metastatic colorectal cancer. Therefore, cetuximab exposure expressed as area under the curve (AUCC) integrated over 2 weeks (between two administrations), in mg × week/L, was simulated as in Eq. 1.where CLC is the cetuximab clearance value (CLC) presented in Table , including BSA effect standardized by 1.85 (i.e., the median BSA that was expected before the clinical trial), and BSA was simulated according to a lognormal distribution with a median of 1.75 and m2 and an interindividual variability of 10% according to derived BSA distribution in previous cetuximab studies. , ,

Table 1

Parameter values for area under the curve models of cetuximab and drug M and for the tumor growth inhibition model

Parameter (unit)	μ	ω ²
Pharmacokinetics models
CL_C (L.week⁻¹)	3.9	0.0025
CL_M (L.week⁻¹)	5.0	0.01
Tumor growth inhibition model
TS0 (mm)	100a	0.5a
KS (week⁻¹)	0.001	—
KD_SoC (week⁻¹)	0.015	1.5
KD_C (L × mg⁻¹.week⁻²)	0.00025a	1.5a
KD_M (L × mg⁻¹.week⁻²)	0.00025a	1a
KR (week⁻¹)	0.2	1

μ are the fixed effects, ω2 the variance of random effects η, and σ2 the variance of residuals.

CLC, cetuximab clearance value; CLM, clearance of drug M; Cov, covariance; INT, interaction parameter; INT50, exposure needed for 50% of interaction maximal effect; KDC, effect of drug cetuximab; KDM, effect of drug M; KDSoC, standard of care effect; KR, constant rate of resistance development; KS, natural growth of TS; L.week‐1, Litre.week‐1; η, random effects; TS0, tumor size at baseline.

Estimated parameter.

Parameter values for area under the curve models of cetuximab and drug M and for the tumor growth inhibition model μ are the fixed effects, ω2 the variance of random effects η, and σ2 the variance of residuals. CLC, cetuximab clearance value; CLM, clearance of drug M; Cov, covariance; INT, interaction parameter; INT50, exposure needed for 50% of interaction maximal effect; KDC, effect of drug cetuximab; KDM, effect of drug M; KDSoC, standard of care effect; KR, constant rate of resistance development; KS, natural growth of TS; L.week‐1, Litre.week‐1; η, random effects; TS0, tumor size at baseline. Estimated parameter. The current maximal dosing regimen for cetuximab given in combination with another treatment is 500 mg per m2 every 2 weeks. This dosing regimen has a comparable activity and safety to the standard dosing regimen. A dosing regimen of 400 and 200 mg per m2 every 2 weeks were also considered in case of combination with drug M. Concerning the drug M, its area under the curve (AUCM) was integrated over 2 weeks (between two administrations) and simulated as in Eq. 2:where CLM is the clearance of drug M (CLM) presented in Table .

Tumor size

Tumor size is defined as the sum of the longest diameters of target lesions that are simulated through a TGI model. The concentration of anticancer antibodies such as cetuximab is associated with the response rate. A previous TGI model was developed based on clinical data from patients affected by metastatic colorectal cancer. In this analysis, the TS dynamics was described by the ordinary differential equation (Eq. 3) linking TS and both drug exposures ( and ). In this model (see Supplementary file), KS is the natural growth of TS, KD includes the different pooled drug effects on tumor shrinkage, KR represents the constant rate of resistance development, and TS0 is the TS at baseline. KD is the sum of KDSoC (for standard of care, chemotherapy), KDC × AUCC (for cetuximab, with KDCthe effect of cetuximab), and (for drug M, with KDM the effect of M). According to Eq. 3, the effect of drug M is the sum of its proper effect and its interaction with cetuximab. This interaction model is inspired by the global pharmacodynamics interaction model. Depending on the sign of the interaction parameter (INT), this model is able to capture different types of interactions. A positive INT represents synergism, a negative INT represents antagonism, and if this parameter is null, the effects of both drugs are additive. Parameter values of this model are given in Table based on previous reported analysis on cetuximab (excepted KDM and interaction parameters) in the treatment of colorectal cancer. The standard of care effect (KDSoC) includes the placebo response. KDM is selected from a sensitivity analysis to obtain plausible tumor shrinkage. Two interaction scenarios are investigated: additivity in which the two compounds act independently and synergy in which the cetuximab exposure increases the drug M effect. The difference between these two scenarios in terms of median TS evolution from 0 to 8 weeks is illustrated in Figure . Let i denote the i th individual (i = 1,…, N) and j the j th TS measurement of an individual. The statistical model for the TS observation TS in individual i at time t is given by Eq. 4.where f is the TGI model described in Eq. 3, θ is the vector of individual parameters, and ε is the residual errors of mean 0 and of variance proportional to the TS: . The random effect model is exponential as , where µ is the fixed effect and ηi the random effect, except for INT in which the random effect model is additive: . The AUCs of both cetuximab and M are simulated assuming a constant clearance and used as regressors in the TGI model.

Simulation settings

Designs

We studied three types of design—one arm, two arms, or four arms—considering the same total number of individuals (N = 60). Simulated TS are performed at predose (baseline) and weeks 2, 4, 6, and 8. The inclusion criterion was a baseline TS > 20 mm. Dropout, incomplete adherence, missing TS measurements, and protocol deviations were not considered. The therapy for all patients included a standard of care, which is omitted next for simplification. Simulations were performed using R software (R Foundation for Statistical Computing, Vienna, Austria) version 3.4.3., specifically, packages mvtnorm to generate individual parameters and deSolve to simulate longitudinal data. Simulated TS < 10 mm were censored and considered to be below the limit of quantification. The dosing regimens of cetuximab and drug M for each possible design are described in the following paragraphs. The dose of M was assumed to be fixed. For each design and each interaction scenario, 500 data sets were simulated.

One‐arm: M + C500

In the one‐arm design, the N = 60 patients were allocated to a combination M + C500: drug M and cetuximab. The dosing regimen was fixed for each drug: cetuximab at 500 mg/m2 every 2 weeks (C500) and drug M at its standard dose.

Two‐arm: C500/M + C500

In the two‐arm design, the N = 60 patients were randomized among the two possible arms: cetuximab “alone” (C500) or M + C500, resulting in 30 patients in each arm. Typical simulated data for patients in the two arms for the additivity and synergy scenarios are presented in Figure .

Four‐arm: C500/M ± C200/M ± C400/M + C500

In the four‐arm design, two combination arms with lower doses of cetuximab were considered (M + C400 and M + C200) in addition to C500 and M + C500. In the lower dose arms, the dose of C was 400 mg/m2 (C400) or 200 mg/m2 every 2 weeks. Each arm included 15 patients.

Scenarios of the effect of M + C

Three scenarios concerning the effect of M and the interaction between M and C were considered. To investigate the type I error of tests (superiority of M + cetuximab vs. cetuximab alone), a first scenario without an effect of drug M (i.e., with KDM = 0) was studied. In the other two scenarios, an effect of M (μKDM = 0.00025) was implemented. These two scenarios differ in the interaction between M and cetuximab: additivity (INT = 0) or synergy (μINT = 2).

Analysis

Predictions, observations, and true individual ETS8 values

For each individual i, ETS8 is expressed as a percentage and calculated as in Eq. 5.where and are individual TS at baseline and after 8 weeks of treatment, respectively. Predicted (IPRED), observed (OBS), and true (TRUE) ETS8 are calculated using predictions, observations, and true individual TS at both week 0 and week 8, respectively, as described in the next paragraphs. Individual predictions (IPRED) were obtained from model‐based parameters. Parameters with a superscript letter in Table are estimated based on the TGI model (Eq. 1) by fitting the simulated data using the SAEM (Stochastic Approximation Expectation‐Maximization) algorithm in MONOLIX 2018R2 software (Lixoft, Antony, France). MONOLIX runs were performed with five Markov chains, a minimum of 500 iterations during the exploratory phase and a minimum of 200 iterations during the smoothing phase. An adequate method is used to account for below the limit of quantification data. TS individual predictions at baseline and week 8 were obtained using the mode of empirical Bayes estimates of the individual parameters. In the synergy model, the μINT and its interindividual variability ωINT were estimated, whereas in the additivity model they were fixed to 0. Simulated data without an effect of M were refitted for both the additivity and synergy models to evaluate the type I error for each assumed interaction. Individual simulated TS observations (OBS) were directly calculated from the simulated TS values plus residual error while true individual TSs (TRUE) were simply derived from individual parameters without residual error. It is important to realize that true values are not observed in a real clinical trial. An example of OBS, IPRED, and TRUE TSs in one individual is illustrated in Figure .

Design evaluation

The performances of each design were evaluated and compared in terms of type I error and power to detect the superiority of the combination (drug M + cetuximab) to cetuximab based on the ETS8 values (OBS, IPRED, and TRUE). The statistical tests described in the next paragraphs were two‐sided using α = 5%. For the one‐arm design, as all the individuals were allocated to the combination arm, individual ETS8 values were compared with a reference ETS8 for cetuximab alone from historical data using a one‐sample Wilcoxon test. Different scenarios were investigated concerning the reference in which the historical value is either not valid (lower or higher than true, due, e.g., to standard of care changes or differences in patient cohorts) or true for this reference ETS8. In this study, the reference value was 29%, (i.e., the median of all individual simulated ETS8 values with C500). For the randomized trial with two arms, C500 and cetuximab + M, a two‐sample Wilcoxon test was performed to compare the ETS8 values of individuals from each arm. In the four‐arm design, a global Kruskal–Wallis test was performed first to compare the ETS8 values between the four arms. If a difference between at least two arms was detected by this global test, a Dunnett test was performed to compare each combination arm to the reference C500 alone using a Benjamini–Hochberg procedure to account for multiplicity of tests. From the data sets simulated without an effect of drug M (KDM = 0, i.e., under H0), the type I error was evaluated as the proportion of trials for which H0 is rejected and compared to the nominal value of 5%. The 95% prediction interval for proportion π of H0 rejection (type I error or power) is given by binomial law ((3.3%, 7.3%) with 500 data sets). From the data sets simulated with an effect of drug M (additivity or synergy scenario, i.e., under H1), the power of the tests was evaluated as the proportion of trials for which H0 is rejected.

RESULTS

First, we checked whether model‐based ETS8 values (IPRED) were overpredicting or underpredicting the true values. Overall, ETS8 are distributed accurately around the identity line for each design and under both additivity and synergy scenarios. However, a trend of overpredicted values for ETS8 < 25% is shown, although it seems to not be influenced by the scenario, design, or treatment arm (Figure ).

One‐arm: M + C500

For the one‐arm design, the ETS8 values for patients were compared to an ETS8 value for current reference therapy obtained from historical data. Distributions of individual ETS8 (OBS, IPRED, and TRUE) are presented in Figure in the cases of additivity and synergy. The median of OBS, IPRED, and TRUE ETS8 values were close to each other in each scenario. The observed median ETS8 was 43% for additivity and 55% for synergy, whereas the reference ETS8 for cetuximab alone was 29%. According to our model and simulation settings, assuming additivity, almost three of four patients had greater tumor shrinkage at week 8 than those of reference treatment. These boxplots enable us to see the interindividual variability in ETS8. As predictions account for residual error, the 5th to 95th percentile of ETS8 is narrower for IPRED (from 23% to 78% for additivity and from 28% to 92% for synergy) than OBS (from 3% to 78% for additivity and from 13% to 89% for synergy). In both scenarios, the distribution of predictions seems to be more representative of true values than observations.

Figure 1

Table 2

Type I error and power of tests in one‐arm design

Theoretical median of ETS8 from historical data (%)		Additivity							Synergy
		Type I error (%)			Power (%)				Type I error (%)			Power (%)
		OBS	IPRED	TRUE	OBS	IPRED	TRUE	OBS	IPRED	TRUE	OBS	IPRED	TRUE
Lower than true	20	56.6	79.4	68.6	100.0	100.0	100.0	56.6	60.2	68.6	100.0	100.0	100.0
Lower than true	25	13.2	34.2	18.0	98.6	99.6	99.8	13.2	34.2	18.0	100.0	100.0	100.0
True reference	29	5.6	12.6	6.0	89.0	97.6	93.6	5.6	13.2	6.0	100.0	100.0	100.0
Higher than true	35	27.4	31.6	37.4	45.2	70.6	51.0	27.4	32.2	37.4	99.4	100.0	99.8
Higher than true	40	74.8	75.0	82.6	9.8	25.0	10.6	74.8	74.8	82.6	90.2	98.0	93.0

The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. Under H0, the 95% prediction interval from binomial distribution is 3.3%–7.3% for 500 simulations. For the one‐arm design, the number of individuals by data set is N = 60, and all individuals are allocated to the combination arm. The ETS8 distribution (of observations (OBS), predictions (IPRED), or true values (TRUE)) is compared with a theoretical median of ETS8 under cetuximab alone, performing a one‐sample Wilcoxon test (two‐sided, α = 5%). Different possible theoretical median of ETS8 values (true or not) are investigated. Two possible drug interactions are considered: additivity and synergy (see Models section).

Type I error and power of tests in one‐arm design The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. Under H0, the 95% prediction interval from binomial distribution is 3.3%–7.3% for 500 simulations. For the one‐arm design, the number of individuals by data set is N = 60, and all individuals are allocated to the combination arm. The ETS8 distribution (of observations (OBS), predictions (IPRED), or true values (TRUE)) is compared with a theoretical median of ETS8 under cetuximab alone, performing a one‐sample Wilcoxon test (two‐sided, α = 5%). Different possible theoretical median of ETS8 values (true or not) are investigated. Two possible drug interactions are considered: additivity and synergy (see Models section). In this kind of design, for multiple possible reasons, the assumed reference value from historical data could be wrong. Therefore, we investigated the type I error and power with lower (25% and 20%) and higher (35% and 40%) ETS8 reference values. Performing a two‐sided Wilcoxon test, the type I error was inflated in cases using either OBS or TRUE. For example, if we compare the ETS8 distribution to 20% instead of 29%, we wrongly conclude to superiority of the combination in more than the half of the CTS using observations or predictions. Moreover, assuming a higher reference value than true leads to a lack of power. For example, in the additivity scenario, assuming a reference value of 35% leads to a power of 45.2% using observations (instead of 89.0% with the real reference value).

Two‐arms: C500/M + C500

For the randomized two‐arm design, the distribution of individual ETS8 (OBS, IPRED, and TRUE) values in each arm is shown in Figure for additivity and synergy. As with the one‐arm design, the median OBS, IPRED, and TRUE ETS8 values are close to each other in each arm and each scenario. For C500, the median ETS8 is ~ 29% (i.e., the reference value in the one‐arm design) and for M + C500, the median ETS8 is ~ 43% with additivity and ~ 55% with synergy. Notably, with C500, more than 1 of 10 patients presented a negative observed ETS8 (i.e., increasing TS between weeks 0 and 8), whereas none of the individuals had a true negative ETS8 under this therapy with our simulation settings, which is captured well using predicted ETS8. The IPRED distribution of C500 seems to be similar between the two scenarios. This indicates that the estimated cetuximab specific effect is not influenced by the model or the scenario, as in the simulated data, and allows for comparison between the scenarios. As with the individuals given M + C500 in the one‐arm design, the variability in ETS8 is considerable with C500 in the two‐arm designs, especially in OBS (the 5th to 95th percentile extends from −12% to 65%). Therefore, the difference in ETS8 distribution between the two arms is difficult to distinguish in the case of additivity. The IPRED distribution is narrower and closer to TRUE distribution than OBS.

Figure 2

Boxplots of early tumor shrinkage at week 8 (ETS8) from observations (OBS; solid boxes, left subgroup for each treatment), predictions (IPRED; dotted boxes, middle subgroup for each treatment), and true values (TRUE; dashed boxes, right subgroup for each treatment) in the two‐arm design. Whiskers indicate 5th to 95th percentiles. C500 is cetuximab alone vs. the combination treatment (M + C500). With this design, comparing ETS8 distributions between the two arms by performing two‐sample Wilcoxon tests, no assumption on ETS8 reference value is required (Table ). Therefore, choosing a two‐arm design instead of a one‐arm design avoids a potential inflation of the type I error. In both additivity and synergy scenarios and using either observed or predicted ETS8, the type I error is well controlled as it lies in the 95% prediction interval (3.3%–7.3%). Compared with the previous design, there is a decrease in power in the additivity scenario (using OBS: 66.6% vs. 89.0% in one‐arm design). This result is explained by having only 30 patients per arm (vs. 60 in the one‐arm design). Nevertheless, this design remains powerful in the case of synergy (using OBS: 97.4% vs. 100% in the one‐arm design). Moreover, in each scenario, using the model‐based ETS8 value leads to a slight increase in power compared with using the raw data. For example, the power in the additivity scenario was 69.2% using IPRED.

Table 3

Type I error and power of tests in two‐arm design

	Additivity		Synergy
	Type I error (%)	Power (%)	Type I error (%)	Power (%)
OBS	6.2	66.6	6.2	97.4
IPRED	6.2	69.2	7.0	98.6
TRUE	6.2	89.0	6.2	99.8

The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. Under H0, the 95% prediction interval from binomial distribution is 3.3%–7.3% for 500 simulations. For the two‐arms design, 30 patients are allocated to the combination arm, and 30 are allocated to cetuximab alone. The ETS8 distributions (of observations (OBS), predictions (IPRED), or true values (TRUE)) are compared between the two arms, performing a two‐sample Wilcoxon test (two‐sided, α = 5%). Two possible drug interactions are considered: additivity and synergy (see Models section).

Type I error and power of tests in two‐arm design The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. Under H0, the 95% prediction interval from binomial distribution is 3.3%–7.3% for 500 simulations. For the two‐arms design, 30 patients are allocated to the combination arm, and 30 are allocated to cetuximab alone. The ETS8 distributions (of observations (OBS), predictions (IPRED), or true values (TRUE)) are compared between the two arms, performing a two‐sample Wilcoxon test (two‐sided, α = 5%). Two possible drug interactions are considered: additivity and synergy (see Models section).

Four‐arms: C500/M + C500/M + C400/M + C200

For the four‐arm design, the distributions of individual ETS8 values (OBS, IPRED, and TRUE) in each of the arms are presented in Figure for additivity and synergy. The IPRED distribution seems to be closer to TRUE than OBS, especially for extreme percentiles, except in the C500 arm. As expected, the distributions of C500 and M + C200 are close to each other for additivity. Moreover, in synergy, the ETS8 distribution is close between the two combination arms with the higher doses of cetuximab.

Figure 3

Boxplots of early tumor shrinkage at week 8 (ETS8) from observations (OBS; solid boxes, left subgroup for each treatment), predictions (IPRED; dotted boxes, middle subgroup for each treatment), and true values (TRUE; dashed boxes, right subgroup for each treatment) in the four‐arm design. Whiskers indicate 5th to 95th percentiles. C500 is cetuximab alone, and M + C200, M + C400, and M + C500 are combination treatments. Type I error and power are presented in Table . Performing a global Kruskal–Wallis to detect a difference between the ETS8 distributions of the four arms, more power was obtained in the synergy scenario than the additivity scenario. For example, using observed ETS8, the power was 69.4% in synergy vs. 30.2% in additivity. If the global test was significant, a Dunnett test was performed, comparing the ETS8 distribution in each of the three combination arms to those of the reference arm C500. Under the null hypothesis, if we compare the distribution of ETS8 for C500 and M + C500, the Dunnett test was conservative compared with a single Wilcoxon test, the type I error using observations was 2% with this procedure in the four‐arm design vs. 6.2% for the Wilcoxon test in the two‐arm design. Therefore, in the additivity scenario, the power to detect the superiority of M + C500 vs. C500 decreased to 19.8% using observations and 25.6% using predictions (vs. 66.6% and 69.2% in two‐arm design, respectively), which is partly explained by the lesser number of patients per arm (15 vs. 30). Nevertheless, in synergy and using model‐based ETS8, we achieved 65.4% power to detect the superiority for M + C500 and 52% for M + C400. Thus, under certain conditions, this design is interesting for exploring other doses in the combination of two drugs.

Table 4

Type I error and power of tests in four‐arm design

	Additivity	Synergy
Power of Kruskal–Wallis test (%)
OBS	30.2	69.4
IPRED	52.0	76.6
TRUE	44.0	87.8
Type I error of Dunnett test (%) C500 vs. M + C500
OBS	1.6	1.6
IPRED	2.0	2.0
TRUE	2.4	2.4

The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. For the four‐arms design, 15 patients are allocated to each arm. The ETS8 distributions (of observations (OBS), predictions (IPRED), or true values (TRUE)) are compared between the arms, performing first a global Kruskal–Wallis test (two‐sided, α = 5%), then, if the global test is significant, a Dunnett test where distribution of each combination arm (M + C200, M + C400, and M + C500) is compared with those of cetuximab alone (C500). Two possible drug interactions are considered: additivity and synergy (see Models section).

Type I error and power of tests in four‐arm design Type I error of Dunnett test (%) C500 vs. M + C500 The type I error and power from clinical trial simulations are given as the observed proportion of significant tests performed on early tumor shrinkage at week 8 (ETS8) from 500 simulated clinical trials. For the four‐arms design, 15 patients are allocated to each arm. The ETS8 distributions (of observations (OBS), predictions (IPRED), or true values (TRUE)) are compared between the arms, performing first a global Kruskal–Wallis test (two‐sided, α = 5%), then, if the global test is significant, a Dunnett test where distribution of each combination arm (M + C200, M + C400, and M + C500) is compared with those of cetuximab alone (C500). Two possible drug interactions are considered: additivity and synergy (see Models section).

DISCUSSION

This work compares designs for early clinical combination trials in oncology in the context of a fixed dose of one agent and different doses of another with the same number of subjects and based on ETS. The one‐arm design leads to inflation of type I error in the case of a misestimated reference value, as reported previously. Therefore, before the analysis of data from a one‐arm design, we should ensure that the conditions of the study are similar to the reference study. With the same number of subjects and an accurate reference value, the power of a one‐arm design is higher than that of a two‐arm design. However, to avoid the risk of an inaccurate reference value, a two‐arm randomized design including an arm with the reference therapy is preferable. Such a design conducts to control the type I error. Another possibility is to choose a four‐arm design with different possible dosing regimens. Although with the same total number of patients a two‐arm design leads to a greater power of tests, a four‐arm design offers more accurate dose selection. Our work aimed to show simulations in an ideal setting in order to facilitate the interpretation of the results. In these simulations, we did not account for possible incomplete adherence, missing data, protocol deviations (e.g., timing of TS measurements), or dropout. It would be interesting to incorporate these design departures in future simulations and analyses to mimic a real clinical study. Nevertheless, most of these departures are not preponderant in the first 8 weeks of a trial and therefore should not change drastically our simulation results. In this study, exposure to cetuximab was simulated from a previous pharmacokinetics model that included a BSA effect on clearance but other covariates, such as normal fat mass, would be more physiological. Moreover, the allometric coefficient of 0.75 (used for the BSA effect in the model) would be more appropriate with the normal fat mass. In our study, the clearance of M was not influenced by any covariate, but the effect of body weight or the normal fat mass could have been considered. In the model that we used, we assumed linear effects in the exposure ranges of cetuximab and M based on previous models in which the maximal effect (Emax) could not be estimated. However, it would be relevant to explore more mechanistic models and assume nonlinear effects, as is the case for the interaction between cetuximab and M in the model. The TGI model involves a global resistance parameter KR. It would be relevant to associate different resistance parameters with each drug and its exposure. Using NLMEM to analyze longitudinal data is a promising tool, especially when the outcome is based on TS. Tumor measurements are notably influenced by scan variability and the individual reviewing the image, which may result in noise in the data and a loss of power. The NLMEM accounts for the information from the longitudinal data, which were five TS measurements in our study, and allows estimation of the residual error, including the noise in the data. Therefore, performing tests based on model‐based predictions can lead to better power than observations. For example, in our two‐arm design with the additivity scenario, the power using model‐based ETS8 was 69.2% vs. 66.6% using observations. In this study, ETS was chosen as outcome. We could also dichotomize ETS as responder or nonresponder outcome. Different thresholds can be chosen to separate the patients depending on their ETS. We can dichotomize the subjects as responders if ETS8 ≥ 30% or nonresponder if ETS8 < 30%, which corresponds roughly to a complete or partial response vs. stable or progressive disease according to the RECIST criteria. With this binary outcome, in our two‐arm design and additivity scenario, the power of Fisher’s exact test was 46.4% instead of 69.2% with the Wilcoxon test of continuous model‐based ETS8 values. Guidelines highlight the importance of design choice for an early‐phase combination trial in oncology and that assumptions and knowledge about pharmacokinetic/pharmacodynamic interactions should be taken into account to select the most effective trial design. There is no standard design for combination studies, and designs that include randomization helps to define the effect of the combination vs. one or both of the individual components. We plan to extend this work to perform model‐based adaptive two‐stage designs , using the Fisher information matrix to optimize the second stage of the study in which different dose arms could also be added or dropped at the end of the first stage.

Funding

This work was supported by Merck KGaA, Darmstadt, Germany.

Conflict of Interest

Pascal Girard and Kosalaram Goteti are employed by Merck Institute for Pharmacometrics and EMD Serono R&D Institute, Inc., respectively, an affiliate of Merck KGaA, Darmstadt, Germany. All other authors declared no competing interests for this work.

Author Contributions

J.S., P.G., K.G., and F.M. wrote the manuscript. J.S., P.G., K.G., and F.M. designed the research. J.S., P.G., K.G., and F.M. performed the research. J.S., P.G., K.G., and F.M. analyzed the data.

Disclaimer

As Editor‐in‐Chief of CPT: Pharmacometrics and Systems Pharmacology, France Mentré was not involved in the review or decision process for this article. Figure S1 Click here for additional data file. Figure S2 Click here for additional data file. Figure S3 Click here for additional data file. Figure S4 Click here for additional data file. Supplementary Material Click here for additional data file.

42 in total

Review 1. Is intent-to-treat analysis always (ever) enough?

Authors: Lewis B Sheiner
Journal: Br J Clin Pharmacol Date: 2002-08 Impact factor: 4.335

2. Adaptive optimal design for bridging studies with an application to population pharmacokinetic studies.

Authors: Lee Kien Foo; Stephen Duffull
Journal: Pharm Res Date: 2012-02-14 Impact factor: 4.200

3. Modeling tumor growth kinetics after treatment with pazopanib or placebo in patients with renal cell carcinoma.

Authors: Peter L Bonate; A Benjamin Suttle
Journal: Cancer Chemother Pharmacol Date: 2013-05-29 Impact factor: 3.333

4. Cetuximab for the treatment of colorectal cancer.

Authors: Derek J Jonker; Chris J O'Callaghan; Christos S Karapetis; John R Zalcberg; Dongsheng Tu; Heather-Jane Au; Scott R Berry; Marianne Krahn; Timothy Price; R John Simes; Niall C Tebbutt; Guy van Hazel; Rafal Wierzbicki; Christiane Langer; Malcolm J Moore
Journal: N Engl J Med Date: 2007-11-15 Impact factor: 91.245

5. Predictive pharmacokinetic-pharmacodynamic modeling of tumor growth kinetics in xenograft models after administration of anticancer agents.

Authors: Monica Simeoni; Paolo Magni; Cristiano Cammia; Giuseppe De Nicolao; Valter Croci; Enrico Pesenti; Massimiliano Germani; Italo Poggesi; Maurizio Rocchetti
Journal: Cancer Res Date: 2004-02-01 Impact factor: 12.701

6. Fluorouracil, leucovorin, and oxaliplatin with and without cetuximab in the first-line treatment of metastatic colorectal cancer.

Authors: Carsten Bokemeyer; Igor Bondarenko; Anatoly Makhson; Joerg T Hartmann; Jorge Aparicio; Filippo de Braud; Serban Donea; Heinz Ludwig; Gunter Schuch; Christopher Stroh; Anja H Loos; Angela Zubel; Piotr Koralewski
Journal: J Clin Oncol Date: 2008-12-29 Impact factor: 44.544

7. Cetuximab and chemotherapy as initial treatment for metastatic colorectal cancer.

Authors: Eric Van Cutsem; Claus-Henning Köhne; Erika Hitre; Jerzy Zaluski; Chung-Rong Chang Chien; Anatoly Makhson; Geert D'Haens; Tamás Pintér; Robert Lim; György Bodoky; Jae Kyung Roh; Gunnar Folprecht; Paul Ruff; Christopher Stroh; Sabine Tejpar; Michael Schlichting; Johannes Nippgen; Philippe Rougier
Journal: N Engl J Med Date: 2009-04-02 Impact factor: 91.245

8. Binimetinib, Encorafenib, and Cetuximab Triplet Therapy for Patients With BRAF V600E-Mutant Metastatic Colorectal Cancer: Safety Lead-In Results From the Phase III BEACON Colorectal Cancer Study.

Authors: Eric Van Cutsem; Sanne Huijberts; Axel Grothey; Rona Yaeger; Pieter-Jan Cuyle; Elena Elez; Marwan Fakih; Clara Montagut; Marc Peeters; Takayuki Yoshino; Harpreet Wasan; Jayesh Desai; Fortunato Ciardiello; Ashwin Gollerkeri; Janna Christy-Bittel; Kati Maharry; Victor Sandor; Jan H M Schellens; Scott Kopetz; Josep Tabernero
Journal: J Clin Oncol Date: 2019-03-20 Impact factor: 44.544

9. Factors Affecting Combination Trial Success (FACTS): Investigator Survey Results on Early-Phase Combination Trials.

Authors: Channing J Paller; Erich P Huang; Thomas Luechtefeld; Holly A Massett; Christopher C Williams; Jinxiu Zhao; Amy E Gravell; Tami Tamashiro; Steven A Reeves; Gary L Rosner; Michael A Carducci; Lawrence Rubinstein; S Percy Ivy
Journal: Front Med (Lausanne) Date: 2019-06-04

10. Machine Learning Analysis of Individual Tumor Lesions in Four Metastatic Colorectal Cancer Clinical Studies: Linking Tumor Heterogeneity to Overall Survival.

Authors: Diego Vera-Yunca; Pascal Girard; Zinnia P Parra-Guillen; Alain Munafo; Iñaki F Trocóniz; Nadia Terranova
Journal: AAPS J Date: 2020-03-16 Impact factor: 4.009