Two-dimensional calibration-free odds design for phase I drug-combination trials

Wang, Wenliang; Jin, Huaqing; Zhang, Yan Dora; Yin, Guosheng

doi:10.3389/fonc.2023.1294258

METHODS article

Front. Oncol. , 29 November 2023

Sec. Cancer Epidemiology and Prevention

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1294258

This article is part of the Research Topic Early Phase Adaptive Clinical Trial Design in Oncology View all 6 articles

Two-dimensional calibration-free odds design for phase I drug-combination trials

Wenliang Wang¹

Huaqing Jin²

Yan Dora Zhang¹

Guosheng Yin^3*

¹Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
²Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, United States
³Department of Mathematics, Imperial College London, London, United Kingdom

In oncology, it is commonplace to treat patients with a combination of drugs that deliver different effects from different disease-curing or cancer-elimination perspectives. Such drug combinations can often achieve higher efficacy in comparison with single-drug treatment due to synergy or non-overlapping toxicity. Due to the small sample size, there is a growing need for efficient designs for phase I clinical trials, especially for drug-combination trials. In the existing experimental design for phase I drug-combination trials, most of the proposed methods are parametric and model-based, either requiring tuning parameters or prior knowledge of the drug toxicity probabilities. We propose a two-dimensional calibration-free odds (2dCFO) design for drug-combination trials, which utilizes not only the current dose information but also that from all the neighborhood doses (i.e., along the left, right, up and down directions). In contrast to interval-based designs which only use the current dose information, the 2dCFO is more efficient and makes more accurate decisions because of its additional leverage over richer resources of neighborhood data. Because our design makes decisions completely based on odds ratios, it does not rely upon any dose–toxicity curve assumption. The simulations show that the 2dCFO delivers satisfactory performances in terms of accuracy and efficiency as well as demonstrating great robustness due to its non-parametric or model-free nature. More importantly, the 2dCFO only requires the minimal specification of the target toxicity probability, which greatly eases the design process from the clinicians’ aspects.

1 Introduction

The main objective of a phase I clinical trial is to find the maximum tolerated dose (MTD), which is defined as the highest dose with an acceptable probability of dose-limiting toxicity (DLT). For safety reasons, a phase I clinical trial typically assigns one cohort at a time to the most suitable dose level according to some criterion, and the choice of which dose to treat the next cohort is made based on the observed toxicity outcomes of the previous cohorts. Traditional phase I clinical trial designs are mainly for single-drug treatments, such as the 3 + 3 design (1), the continual reassessment method (CRM) (2), the dose escalation with overdose control design (3), the Bayesian optimal interval (BOIN) design (4), and the non-parametric overdose control (NOC) design (5).

In the past decades, combination therapies have demonstrated advantages of higher efficacy, lower toxicity, and fewer side effects compared to monotherapy (6), and thus an efficient design for drug-combination trials is more desirable. A common assumption for phase I clinical trials is the monotonic relationship between dose levels and toxicity probabilities. For single-drug trials, this simply restricts the sequence of toxicity probabilities with ascending dose levels are completely ordered, i.e., higher dose levels induce more severe toxicities. However, for two-drug combinations, this becomes a partial ordering constraint on each row and each column in the two-dimensional toxicity probability space, while the toxicity relationship on the diagonal lines is often unknown, thus imposing new challenges for dose-finding and toxicity probability estimation.

Numerous methods have been proposed for identifying the MTD in drug-combination trials, and most of the existing methods rely on parametric models. Thall et al. (7) proposed a two-stage Bayesian design with a six-parameter model, requiring informative priors based on historical dose–toxicity data from previous single-agent studies on each of the two drugs. Wang and Ivanova (8) proposed a three-parameter Bayesian design using the parsimonious working model for the dose–toxicity relationship. Yin and Yuan (9, 10) developed a joint toxicity probability model for the binary outcomes through a copula-type regression. Wages et al. (11) proposed a partial ordering continual reassessment method (POCRM) for two-dimensional drug-combination trials. Riviere et al. (12, 13) considered a Bayesian design using the standard logistic regression, which still requires priors of toxicity probabilities of the two drugs respectively.

All the aforementioned methods are parametric, requiring either specification of tuning parameters or prior knowledge of the drug toxicities. Due to their parametric nature, these methods often achieve high accuracy in selecting the MTD when precise information is given, at the cost of robustness when such information is missing or inadequate. On the other hand, non-parametric methods tend to be more robust compared to parametric ones. Mander and Sweeting (14) proposed a product of independent beta probabilities design based on conjugate Bayesian inference. The escalation scheme is constructed by estimating a maximum tolerated contour using the posterior probabilities. Lin and Yin (15) developed a two-dimensional Bayesian optimal interval (2dBOIN) design for drug-combination trials by extending the one-dimensional BOIN design to a two-dimensional space. Similarly, the 2dBOIN design makes decisions based on the posterior probability that a given dose’s toxicity probability falls in the pre-specified optimal interval. Razaee et al. (16) introduced a non-parametric Bayesian method for dual agents with truncated beta priors, in which the joint posterior probability of DLT is estimated using a weighted Gibbs sampler.

In this work, we extend the calibration-free odds (CFO) design (17) to two-dimensional drug-combination trials, named the 2dCFO design. Similar to the CFO design, our 2dCFO design is also a non-parametric or curve-free method using purely the odds based on the estimated posterior probabilities. The main advantages of the CFO design are that it requires minimal parameter tuning due to its model-free and calibration-free nature, and furthermore it does not require prior information. The CFO design has shown great advantages in single-drug trials with satisfactory accuracy and robustness properties, and so does the 2dCFO design.

The rest of the paper is organized as follows. In Section 2, we present the methodology and decision rules in the 2dCFO design while reviewing the idea of the one-dimensional CFO design. In Section 3, both fixed and random scenarios are experimented in the simulations to study the operating characteristics of our 2dCFO design in comparison with other state-of-the-art designs. In Section 4, we provide a real trial application to illustrate the operating characteristic of our design by redesigning a drug-combination trial using real data. Section 5 concludes with some remarks.

2 Methodology

2.1 The 2dCFO design

In a drug-combination trial, a two-dimensional joint toxicity probability space is considered. Suppose that we study the combined toxicity of two drugs, drug A and drug B, with J and K dose levels respectively. Let p_jkdenote the joint DLT rate for dose combination (j, k), j = 1,…,J, k = 1,…,K. Let ϕ be the pre-specified target DLT rate. Our aim is to find the MTD, which is the dose level with the DLT rate closest to the target ϕ,

\begin{matrix} {(j^{*}, k^{*})}_{MTD} = \underset{j, k}{\arg \min} ∣ p_{j k} - ϕ ∣, & 1 \leq j \leq J, & 1 \leq k \leq K . \end{matrix}

In our design, we enroll cohorts one by one with a fixed cohort size, e.g., the cohort size is 3 by default. After enrollment of n cohorts of patients, we obtain the observed toxicity outcomes at all the dose levels as $D_{n} = {(x_{j k}, m_{j k})}_{j, k = 1}^{J, K}$ , where x_jk and m_jk are the observed number of DLTs and the total number of patients treated at dose combination (j, k). Assume that dose level (j, k) is the current dose combination, which is denoted as C. We further denote the four adjacent dose combinations (j − 1, k), (j + 1, k), (j, k − 1), (j, k + 1) as L, R, D, U respectively, representing the left, right, down, and up positions relative to the central or current dose C. There are five possible decisions to assign the next cohort, either escalate/de-escalate to the dose level at one of the four positions (Left, Right, Up, Down), or treat at the current dose level, as illustrated in Figure 1A. For each dose combination d ∈ {L, D, C, U, R}, we denote the true DLT rate as p_dand the observed toxicity outcomes as (x_d, m_d).

FIGURE 1

Figure 1 Illustrations of odds and odds ratios in a two-dimensional dose combination space and its one-dimensional interpretations. (A) Illustration of a two-dimensional dose combination space, where C. is the current dose combination and L,R,D,U are the four adjacent dose combinations, where the directions of odds ratios presented in the figure indicate the tendency of dose movement. (B) Illustration of one-dimensional interpretations of the two-dimensional dose combination space. The upper figure shows the relative positions of D,L and U,R with respect to C. The lower figure shows the directions of odds and odds ratio between two adjacent doses when the corresponding value is large.

The 2dCFO design is constructed based on joint decisions of multiple one-dimensional CFO analyses. Following the partial ordering constraints, it is known that dose combinations L and D have a lower DLT rate than C and dose combinations U and R have a higher DLT rate than C. Reformulating in a one-dimensional space, D and L are located on the left of C, while U and R are located on the right. However, the relative positions of D and L, as well as U and R are not known or specified, as illustrated in Figure 1B. From the information above, we can identify four one-dimensional dose sequences with ascending DLT rates: {L, C, R}, {L, C, U}, {D, C, R}, {D, C, U}. Thus, the decision for escalating/de-escalating/staying in a two-dimensional space can be made based on the joint decisions of one-dimensional CFO analyses on the four dose sequences. We use the dose sequence {L, C, R} as an example to illustrate how we make decisions in a one-dimensional CFO design, and the same method can be applied to the other three dose sequences respectively.

The CFO design uses odds to measure the tendency for escalating/de-escalating the dose level, which is defined as

O_{d} = \frac{\Pr (p_{d} > ϕ ∣ x_{d}, m_{d})}{\Pr (p_{d} \leq ϕ ∣ x_{d}, m_{d})}, d \in {L, D, C, U, R} .

Let $\bar{O}$ _d = 1/O_d denote the reciprocal of O_d. Taking the odds of L and C as an example, a large value of O_Cindicates the current dose combination is too toxic, which suggests de-escalating to the next lower dose combination. Similarly, a large value of $\bar{O}$ _L indicates that the DLT rate of the left dose combination is too low, and thus suggests dose escalation. This is analogous to a battle between the left and current doses: the former tries to push the dose up while the latter tries to push it down. Hence, the ratio O_C/ $\bar{O}$ _L measures the tendency or strength for de-escalating to the left adjacent dose level L, as shown in Figure 1B. That is, a large value of O_C/ $\bar{O}$ _L tends to push the dose down. Similarly, O_C/ $\bar{O}$ _D measures the tendency or strength to de-escalate to dose combination D, and $\bar{O}$ _C/O_R, $\bar{O}$ _C/O_U measures the tendency or strength to escalate to dose combinations R and U respectively, as illustrated in Figure 1A.

A non-informative prior Beta(ϕ, 1 − ϕ) is adopted for each DLT rate p_d. Using the monotonic relationships that p_L< p_C< p_R, the marginal posterior density functions for p_L and p_C from the left side can be derived as

\begin{array}{l} f_{L} (p_{L} ∣ x_{L}, x_{C}) \propto f_{β} (p_{L}; a_{L}, b_{L}) \int_{p_{L}}^{1} f_{β} (p_{C}; a_{C}, b_{C}) d p_{C}, \\ f_{C} (p_{C} ∣ x_{L}, x_{C}) \propto f_{β} (p_{C}; a_{C}, b_{C}) \int_{0}^{p_{C}} f_{β} (p_{L}; a_{L}, b_{L}) d p_{L} . \end{array}

where f_β(·;a_d,b_d) is the density function of Beta(a_d,b_d), with a_d= ϕ + x_d and b_d= 1 − ϕ + m_d− x_d. Similarly, for computing the odds ratio between p_C and p_R from the right side, we have

\begin{array}{l} f_{C} (p_{C} ∣ x_{C}, x_{R}) \propto f_{β} (p_{C}; a_{C}, b_{C}) \int_{p_{C}}^{1} f_{β} (p_{R}; a_{R}, b_{R}) d p_{R}, \\ f_{R} (p_{R} ∣ x_{C}, x_{R}) \propto f_{β} (p_{R}; a_{R}, b_{R}) \int_{0}^{p_{R}} f_{β} (p_{C}; a_{C}, b_{C}) d p_{C} . \end{array}

Both O_C/ $\bar{O}$ _Land $\bar{O}$ _C/O_R can be computed using the Gaussian quadrature or the Monte Carlo method. The two odds ratios are used for making decisions. Escalation/de-escalation is more favorable when the odds ratio is relatively large, and staying at the current dose should be considered when the odds ratio is relatively small. The next step is to find appropriate thresholds for the two odds ratios, by exceeding which we will escalate/de-escalate the dose. Denote the true DLT rate of p_Land p_C as p₀_L and p₀_C respectively. Consider the probability of incorrect votes or indications V_L(γ_L), i.e., either the computed odds ratio is large (i.e., O_C/ $\bar{O}$ _L > γ_L), suggesting de-escalation, but in fact we should stay at the current dose under the condition of (p₀_C= ϕ,p₀L < ϕ), or the odds ratio is small (i.e., O_C/ $\bar{O}$ _L ≤ γ_L) suggesting escalation but the current dose is in fact overly toxic (p₀_L = ϕ,p₀_C > ϕ). As a result, the left threshold $γ_{L}^{*}$ can be defined as the one that minimizes V_L(γ_L):

\begin{array}{l} \begin{matrix} γ_{L}^{*} & = \arg \min_{γ_{L}} & V_{L} (γ_{L}) \end{matrix} \\ \begin{matrix} = \arg \min_{γ_{L}} & Pr (O_{C} / {\bar{O}}_{L} > γ_{L} ∣ p_{0 C} = ϕ, p_{0 L} < ϕ) + Pr (O_{C} / {\bar{O}}_{L} \leq γ_{L} ∣ p_{0 L} = ϕ, p_{0 C} > ϕ) \end{matrix} \\ \begin{matrix} = \arg \min_{γ_{L}} & \sum_{i = 0}^{m_{C}} \sum_{j = 0}^{m_{L}} I (O_{C} / {\bar{O}}_{L} > γ_{L}) \end{matrix} Pr (x_{C} = i ∣ p_{0 C} = ϕ) Pr (x_{L} = j ∣ p_{0 L} < ϕ) \\ \begin{matrix} + & \sum_{i = 0}^{m_{C}} \sum_{j = 0}^{m_{L}} I (O_{C} / {\bar{O}}_{L} \leq γ_{L}) \end{matrix} Pr (x_{C} = i ∣ p_{0 C} > ϕ) Pr (x_{L} = j ∣ p_{0 L} = ϕ), \end{array}

where I (·) denotes the indicator function. Similarly, denote the true DLT rate of pR as p0R, and then the right threshold is defined as

\begin{array}{l} \begin{matrix} γ_{R}^{*} & = \arg \min_{γ_{R}} & V_{R} (γ_{R}) \end{matrix} \\ \begin{matrix} = \arg \min_{γ_{R}} & Pr ({\bar{O}}_{C} / O_{R} > γ_{R} ∣ p_{0 C} = ϕ, p_{0 R} > ϕ) + Pr ({\bar{O}}_{C} / O_{R} \leq γ_{R} ∣ p_{0 R} = ϕ, p_{0 C} < ϕ) \end{matrix} \\ \begin{matrix} = \arg \min_{γ_{R}} & \sum_{i = 0}^{m_{C}} \sum_{j = 0}^{m_{R}} I ({\bar{O}}_{C} / O_{R} > γ_{R}) \end{matrix} Pr (x_{C} = i ∣ p_{0 C} = ϕ) Pr (x_{R} = j ∣ p_{0 R} > ϕ) \\ \begin{matrix} + & \sum_{i = 0}^{m_{C}} \sum_{j = 0}^{m_{R}} I ({\bar{O}}_{C} / O_{L} \leq γ_{R}) \end{matrix} Pr (x_{C} = i ∣ p_{0 C} < ϕ) Pr (x_{R} = j ∣ p_{0 R} = ϕ) . \end{array}

Since the trial for each patient is independent, the number of patients with DLTs follows a binomial distribution, thus we have

\begin{array}{l} Pr (x_{C} = i ∣ p_{0 C} = ϕ) = (\begin{matrix} m_{C} \\ i \end{matrix}) ϕ^{i} (1 - ϕ)^{m_{C} - i}, \\ Pr (x_{L} = j ∣ p_{0 L} = ϕ) = (\begin{matrix} m_{L} \\ j \end{matrix}) ϕ^{j} (1 - ϕ)^{m_{L} - j} . \end{array}

We further adopt a Uniform(0, ϕ) prior for p0L when p0L< ϕ and Uniform(ϕ,2ϕ) prior for p0C when p0C > ϕ, and then the following probabilities can be computed using the Gaussian quadrature,

\begin{array}{l} Pr (x_{L} = j ∣ p_{0 L} < ϕ) = \int_{0}^{ϕ} \frac{1}{ϕ} (\begin{matrix} m_{L} \\ j \end{matrix}) p_{0 L}^{j} (1 - p_{0 L})^{m_{L} - j} d p_{0 L}, \\ Pr (x_{C} = i ∣ p_{0 C} > ϕ) = \int_{ϕ}^{2 ϕ} \frac{1}{ϕ} (\begin{matrix} m_{C} \\ i \end{matrix}) p_{0 C}^{i} (1 - p_{0 C})^{m_{C} - i} d p_{0 C} . \end{array}

As a result, V_L(γ_L) is computed, and so is V_R(γ_R). Therefore, the two thresholds $γ_{L}^{*}$ and $γ_{R}^{*}$ can be obtained, and the decision follows the decision rules in Table 1.

TABLE 1

Table 1 Decision rules for one-dimensional CFO analysis.

Similar decisions can be obtained from the other three directions {L,C,U}, {D,C,R} and {D,C,U}. Finally, we can proceed to formulate joint decisions.

2.2 Decision rules and the algorithm

1. Treat the first cohort at the lowest dose level (1,1) or a pre-specified dose level.

2. Suppose the current cohort is treated at dose level C. We apply one-dimensional CFO analysis on both the horizontal direction {L,C,R} and the vertical direction {D,C,U}. The joint decisions are formulated as follows.

● If analyses along both directions suggest staying, i.e., the joint decision is to stay at the current (central) dose level, the next cohort will be treated at dose level C.

● If one direction suggests escalation (or de-escalation) while the other suggests staying, the joint decision is to escalate (or de-escalate) to the corresponding dose. For example, if the decision for {L,C,R} is escalation while the decision for {D,C,U} is staying, then the next cohort will be treated at dose level R.

● If one direction suggests escalation while the other suggests de-escalation, we need to further compare the contradictory direction using one-dimensional CFO analysis. For example, if the decision for {L,C,R} is escalation, while the decision for {D,C,U} is de-escalation, we further apply one-dimensional CFO analysis on {D,C,R}, because the true toxicity order of {D,C,R} is also known. We then escalate the dose level to R if the decision is escalation, and the dose level will stay at C if the decision is staying, and de-escalate to D if the decision is de-escalation.

● If both directions suggest escalation, we will escalate to either U or R, while the most appropriate direction can be chosen by comparing O_U and O_R. Since $\bar{O}$ _C/O_R measures the tendency of escalating towards R, while $\bar{O}$ _C/O_Umeasures the tendency of escalating towards U, if $\bar{O}$ _C/O_R > $\bar{O}$ _C/O_U, i.e., O_R< O_U, we treat the next cohort at dose level R, otherwise we treat the next cohort at U. If the two odds O_R and O_U are the same, we randomly select one dose from the two to treat the next cohort.’

● If both suggest de-escalation, we will de-escalate to either L or D. Similarly, if O_C/ $\bar{O}$ _L > O_C/ $\bar{O}$ _D, i.e., O_L > O_D, we treat the next cohort at dose level L, otherwise we treat the next cohort at D. If the two odds O_L and O_D are the same, we randomly select one dose from the two to treat the next cohort.

3. Repeat step 2 until the sample size is reached or the early stopping criteria are met.

The 2dCFO decision rules in Step 2 are summarized in Table 2.

TABLE 2

Table 2 Decision rules for the two-dimensional CFO analysis.

2.3 Early stopping and final selection of the MTD

In practice, it is preferable to impose some early stopping and overdose control strategies to alleviate safety concerns. During the implementation of the 2dCFO, we will terminate the trial if the lowest dose level (1,1) is overly toxic, as determined by Pr(p₁₁ > ϕ | x₁₁,m₁₁ ≥ 3) > 0.95. To further prevent assigning too many cohorts to the overly toxic doses, we exclude dose level (j₀,k₀) and all the higher dose levels, i.e., (j,k), where j ≥ j₀ and k ≥ k₀ if Pr(p _j₀k₀ > ϕ | x _j₀k₀,m _j₀k₀ ≥ 3) > 0.95.

Upon finishing treating all cohorts, we obtain the cumulative data for all dose combinations as $D_{n} = {(x_{j k}, m_{j k})}_{j, k = 1}^{J, K}$ . For each dose combination (j,k), the estimated DLT rate $\hat{p}$ _jk can be computed as x_jk/m_jk. To conform with the partial ordering constraints, we further perform a bivariate isotonic regression (18) on the estimated DLT rates $\hat{p}$ _jk using the pool-adjacent-violators algorithm (PAVA). Let $\tilde{p}$ _jk denote the estimator corresponding to $\hat{p}$ _jk in the isotonic regression, and then the MTD (j^∗,k^∗) is selected as

\begin{matrix} {(j^{*}, k^{*})}_{MTD} = \underset{j, k}{\arg \min} ∣ {\tilde{p}}_{j k} - ϕ ∣, & where 1 \leq j \leq J, & 1 \leq k \leq K . \end{matrix}

3 Simulation study

3.1 Fixed-scenario simulation

To assess the performance of the proposed method, we compare our 2dCFO design with three competitive methods: the two-dimensional Bayesian optimal interval design (2dBOIN), the partial ordering continue reassessment method (POCRM), and the adaptive logistic model design. We conduct simulations on 14 fixed scenarios, in which the number of MTDs varies from 1 to 3 and the target DLT rate is set to be 0.3, as shown in Table 3. The 14 fixed scenarios are carefully constructed to encompass all common instances of joint toxicity probability distribution. These scenarios include situations where the MTDs are positioned diagonally and doses are initiated from the minimum dosage and progressively advanced to the maximum dosage. They also incorporate situations where MTDs are sporadically spread across multiple diagonal lines. We set the total number of patients to be 60 with a cohort size of 3. Under each scenario, we run 5000 independent simulations. For the 2dBOIN method, we apply the R package “BOIN” (19) and adopt default values for all parameters, i.e., ϕ_{1 =} 0.6, ϕ_{2 =} 1.4 and λ = 0.95 as suggested in the original paper (11). For the POCRM, the provided R package “pocrm” (20) restricts the cohort size to be 1, while we slightly modified the source code to allow the cohort size to be 3. We adopt six orderings with equal prior probabilities, as suggested by Wages (21), while other parameters are set as default values. For the logistic method, we use the R package “dfcomb” (22), with the prior toxicity probabilities set as (0.2, 0.3, 0.4) and (0.1, 0.2, 0.3, 0.4, 0.5) for the two drugs respectively, and all other parameters are set as default values. The logistic method includes a start-up phase by default, in which the dose level will be increased until the first DLT is observed. Such a start-up phase is considered part of the design and is kept in our simulations. In the simulations, we do not adopt any early-stopping rules for each of the methods, and all patients should be treated before the final selection of the MTD.

TABLE 3

Table 3 Fourteen fixed scenarios of joint toxicity probabilities for simulations, with the target DLT rate ϕ = 30% in boldface.

We use four statistics to assess the performance of the methods. The percentage of the correct MTD selection, the percentage of patients treated at the MTD, the percentage of patients treated above the MTD, and the percentage of patients with DLT. The first two criteria measure the accuracy and efficiency of the designs respectively, for which the higher the better. The latter two criteria reflect the risk of the trials, and thus are expected to be as low as possible. All four statistics are computed as the ratio of simulations that meet specific conditions to the total number of simulations. For instance, the percentage of the correct MTD selection is determined by dividing the number of simulated trials that accurately select the MTD by the total number of simulations, which is 5000 in our case.

The simulation results for the 14 fixed scenarios are shown in Figure 2. According to the results, our 2dCFO design has the highest MTD selection rate (2dCFO: 62.21%, 2dBOIN: 60.48%, POCRM: 61.76%, logistic: 61.88%), and comparable percentages of patients treated at the MTD (2dCFO: 41.78%, 2dBOIN: 40.30%, POCRM: 42.10%, logistic: 36.91%). The safety measures are similar between 2dCFO and 2dBOIN; both have comparable percentages of patients treated above the MTD and percentages of patients with DLT. Although the logistic model has a comparable MTD selection rate, it has a significantly lower percentage of patients allocated to the MTD, indicating a large sacrifice in efficiency. In addition, the logistic model is less robust compared to 2dCFO, as it has a much higher MTD selection rate than other methods in scenarios 2 and 9, but has a much lower selection rate in scenarios 5 and 13. On the other hand, 2dCFO outperforms 2dBOIN with respect to accuracy and efficiency, while keeping similar low risks of toxicity, as indicated by the latter two statistics. Compared with POCRM, 2dCFO has a higher MTD selection rate, a comparable percentage of the MTD allocation, and a significantly lower percentage of patients treated above the MTD and a lower percentage of patients with DLT. Furthermore, we observe that apart from 2dCFO, the other three methods tend to have higher accuracy of MTD selection when MTDs are located at higher dose levels. However, when MTDs are located at lower dose levels, their MTD selection rates fluctuate and are not satisfactory. On the contrary, the 2dCFO design has a consistently high MTD selection rate no matter where the MTDs are located. Scenarios 1–5 and 9–10 have incremental MTDs or MTD contours, and thus the accuracy of the MTD selection across this set of scenarios can reflect the robustness of a method. The results show that our proposed design has consistently high accuracy across all these 7 scenarios. In contrast, the other methods all have significant drops in accuracy in some of the scenarios: The 2dBOIN design has significant drops in accuracy in scenarios 4 and 9. The accuracy of the adaptive logistic design falls in scenarios 4 and 5. The POCRM also has accuracy dropping in scenarios 2, 9, and 10. In conclusion, the 2dCFO design does have satisfactory performance in terms of accuracy, efficiency, and safety, as well as demonstrating high robustness across various patterns of MTD locations.

FIGURE 2

Figure 2 Simulation results under fixed scenarios. The performance of two non-parametric methods (2dCFO and 2dBOIN) and two parametric methods (POCRM and logistic) were compared in terms of the percentage of the correct MTD selection, the percentage of patients treated at the MTD, the percentage of patients treated above the MTD, and the percentage of patients with DLT.

3.2 Sensitivity analysis

To study the effect of changes in the sample size and cohort size on the accuracy of the design, we first fix the sample size at 60 while reducing the cohort size from 3 to 2 and 1, with 20, 30, and 60 cohorts respectively. Under each of the settings, we repeat the 14 fixed-scenario simulations and replicate 5000 simulations for each scenario. The results are shown in Figure 3A, from which we observe that there are no significant differences in the percentage of the correct MTD selection when the cohort size varies between 1 and 3. However, for most of the scenarios, a cohort size of 3 leads to the highest MTD selection rate, and thus 3 is still the best choice for the cohort size. Furthermore, we study the relationship between the correct MTD selection rate and the sample size. With the cohort size fixed at 3, the number of cohorts increases from 5 to 40, i.e., the sample size extends from 15 to 120. Under each of the configurations, we still repeat the 14 fixed-scenario simulations. The results were shown in Figures 3B, C. We can see that there are no significant fluctuations along the curves, indicating the 2dCFO is not sensitive to changes in the number of patients. For each of the scenarios, the percentage of the correct MTD selection keeps increasing steadily as the sample size increases. In some of the scenarios (scenarios 5 and 10), the MTD selection rate starts at a very low percentage due to the limited sample size but increases very quickly as the sample size increases. For most of the scenarios, the MTD selection rate can reach 70 percent at a sample size of 120.

FIGURE 3

Figure 3 The effect of changes in the cohort size and sample size on the MTD selection rate of the 2dCFO design. (A) The MTD selection rate with different cohort sizes. (B) The MTD selection rate with increasing sample size(scenarios 1–7). (C) The MTD selection rate with increasing sample size(scenarios 8–14).

3.3 Random-scenario simulation

To further assess the performance and robustness of the proposed method without cherry-picking scenarios, we conduct random-scenario simulations by randomly generating the toxicity probabilities under partial ordering constraints. To generate the random toxicity probabilities, we need to specify the dimension of the matrix J × K, the number of MTDs with the target toxicity probability n_MTD, and the minimum spacing ϵ between adjacent toxicity probabilities. In our simulations, we set the target toxicity probability ϕ = 0.3 and ϵ = 0.01. We first randomly draw J × K samples from Uniform(0,1), among which we randomly choose n_MTD number of elements and set them to be the target ϕ. We then check whether the samples have a minimum spacing of ϵ; if not, repeat the previous step to re-draw the samples until the minimum spacing constraint is satisfied. The next step is to reshape the sequence of toxicity probabilities into a J × K matrix and sort the matrix by first sorting each row and then sorting each column of the matrix. Consequently, the resulting matrix satisfies the partial ordering constraints. Lastly, we check whether there are multiple MTDs on the same row or the same column; if so, we go back to the initial stage to redraw the samples and repeat the above steps until at most one MTD is located on each row and each column.

For a better coverage of different dimensions of the toxicity probability matrix, we conduct random simulations under 3 × 5, 3 × 4, 3 × 3 and 2 × 3 cases with a varying number of MTDs, n_MTD. We use the same design parameters as in the fixed-scenario simulations. Under the logistic design, for the case with J = 3 and K = 4, we set the prior toxicity probabilities of the two drugs as (0.1,0.2,0.3) and (0.1,0.2,0.3,0.4) respectively; for J = 3 and K = 3, we set the prior toxicity probabilities as (0.1,0.2,0.3) for both drugs; for J = 2 and K = 3, the prior toxicity probabilities are set as (0.1,0.2) and (0.1,0.2,0.3) respectively. For POCRM, we still adopt the six ordering patterns as in the fixed-scenario simulations. Under each random-scenario setting, we conduct 5000 independent simulations, i.e., randomly generate 5000 probability matrices.

Inspired by Zhou et al. (23), we provide supplementary statistics to better illustrate the characteristics of the randomly generated probability matrices. We use the setting with J = 3,K = 5,n_MTD = 1 for illustrative purposes. Figure 4 shows a collection of boxplots which elucidate the distribution of the 5000 randomly generated DLT rates at each specific dose combination. Furthermore, Figure 5 displays a heatmap illustrating the mean with standard deviation of the generated DLT rates. As shown by the two figures, the randomly generated DLT rates are well spread out across the matrices, strictly adhering to the partial ordering constraints.

FIGURE 4

Figure 4 Boxplots of the randomly generated DLT rates in the 5000 simulations under the setting with J = 3,K = 5,n_MTD = 1.

FIGURE 5

Figure 5 Heatmap of the mean with standard deviation of the randomly generated DLT rates in the 5000 simulations under the setting with J = 3, K = 5, n_MTD = 1.

The simulation results for all these random scenarios are shown in Figure 6. In terms of accuracy, 2dCFO has on average a competitively high percentage of correct MTD selection, which is slightly lower than POCRM but higher than the competing non-parametric method 2dBOIN and the logistic method (2dCFO: 52.7%, 2dBOIN: 51.8%, POCRM: 53.9%, logistic: 50.1%). In terms of efficiency, both 2dCFO and POCRM yield significantly higher average percentages of patients treated at the MTD than the other two methods (2dCFO: 38.3%, 2dBOIN: 36.2%, POCRM: 39.3%, logistic: 31.1%). Although POCRM has the best accuracy and efficiency in our random simulations, it has a significantly higher risk in terms of the percentage of patients treated above the MTD (2dCFO: 23.3%, 2dBOIN: 24.2%, POCRM: 29.8%, logistic: 15.2%) and the percentage of patients with DLT (2dCFO: 29.2%, 2dBOIN: 28.4%, POCRM: 31.7%, logistic: 25.7%), when comparing to the other three methods. Furthermore, the risk is particularly higher when there are more dose levels, i.e., in the 3 × 5 and 3 × 4 cases.

FIGURE 6

Figure 6 Simulation results under random scenarios. The performances of the four competitive methods were compared under randomly generated toxicity probabilities with ϕ = 0.30 and n_MTD varies.

When comparing 2dCFO with POCRM, both methods exhibit satisfactory accuracy and efficiency. However, POCRM presents a significantly higher risk than 2dCFO, which is inherited from the CRM itself. Therefore, 2dCFO is more recommended especially when there are few doses under investigation (e.g., the 2×3 case), because the regression model in POCRM may not fit such sparse data well. On the other hand, 2dBOIN has similar performances to 2dCFO due to their non-parametric nature. However, 2dCFO has significantly higher efficiency in assigning the cohorts to the MTD across almost all scenarios. Furthermore, 2dCFO has slightly higher accuracy in selecting the MTD than 2dBOIN, especially in scenarios with few doses (e.g., the 2 × 3 case). The logistic method, on the contrary, has the worst performance in terms of both accuracy and efficiency. Even though it presents significantly lower risk compared to the other three methods, this advantage does not compensate for its shortcomings in accuracy and efficiency.

4 Real trial application

To gain more insights into the detailed implementation and application of our proposed method, we consider a real phase I dose-escalation study of Neratinib in combination with Temsirolimus, in patients with advanced solid tumors (24). The study aimed to find the MTD combination of Neratinib and Temsirolimus, which is defined as the highest tolerable dose combination achieving a target DLT rate of less than 0.33. A total of 60 patients were enrolled in the study and each patient received one of 16 combinations of Neratinib {120,160,200,240 mg} and Temsirolimus {15,25,50,75 mg}. The patients were treated in cohorts of size 2 following a bidirectional four-by-four dosing plan, with two initial cohorts treated at the dose combinations (160 mg of Neratinib + 15 mg of Temsirolimus) and (120 mg of Neratinib + 25 mg of Temsirolimus) respectively. The subsequent doses were determined by a non-parametric up-and-down design (25). The toxicity results were shown in Table 4. Based on the observed data, we fit a logistic regression model with the doses of Neratinib and Temsirolimus and their interaction term as covariates (15), which gives the estimated DLT rates of all 16 dose combinations in Table 5.

TABLE 4

Table 4 The observed toxicity outcomes and the number of patients treated at each dose combination in the trial of Neratinib and Temsirolimus.

TABLE 5

Table 5 The DLT rates estimated based on the observed toxicity outcomes in the trial of Neratinib and Temsirolimus, where the MTDs are assumed to have a DLT rate of 0.33.

To redesign the trial with 2dCFO, we set ϕ = 0.33, with a sample size of 60 and a cohort size of 3. Early stopping and overdose control are incorporated in the trial design. The first cohort is treated at the lowest dose level (1, 1), and the subsequent doses are determined according to our proposed decision rules. The dose escalation path and toxicity outcomes are shown in Figure 7 and the implementation and computation details are given in Table 6. Based on the simulated data, we obtain the estimated DTL rates as shown in Table 7. The MTD can be selected as either dose level (4, 2) or (3, 3), as both have DLT rates closest to the target 0.33. From the results, we can see that 2dCFO adopts an efficient as well as safe escalation strategy, with 60% (12 out of 20) of the cohorts treated at the two MTDs, only one cohort treated above the MTD, and 23% (14 out of 60) of the patients experienced DLTs. In the simulation of the trial, there is no overdose identified, as Pr(p_jk > ϕ | x_jk,m_jk ≥ 3)< 0.95 is satisfied for all dose combinations. However, the cut-off probability of 0.95 can be adjusted in practice for a more strict or moderate safety rule.

FIGURE 7

Figure 7 Dose allocations and toxicity outcomes of the redesigned Neratinib and Temsirolimus trial.

TABLE 6

Table 6 The implementation details of the 2dCFO design in simulating the Neratinib and Temsirolimus trial, where Pr (overdose)=Pr (p_jk > ϕ ∣ x_jk, m_jk ≥ 3) and overdose is indicated by Pr (overdose) > 0.95.

TABLE 7

Table 7 The estimated DLT rate $\hat{p}$ _jk( $\tilde{p}$ _jk) for each dose combination, where $\hat{p}$ _jk= x_jk/m_jk and $\tilde{p}$ _jk is the DLT rate estimated by the bivariate isotonic regression.

5 Concluding remarks

We propose a two-dimensional calibration-free odds (2dCFO) design for drug-combination trials, which is a major expansion of the one-dimensional CFO design for a single agent. The method relies solely on data-driven approaches, and due to its non-parametric nature, no prior probabilities and specifications of tuning parameters are required. We also do not include any start-up phase or preliminary stage in our simulations. Extensive simulations demonstrate that our proposed method has comparable accuracy, efficiency, and low risk compared to the state-of-the-art non-parametric method 2dBOIN and other parametric methods. The 2dCFO design is demonstrated to be robust in minimizing risk and maximizing efficiency. The real trial application shows that our proposed method is readily applicable to a real phase I escalation study of drug combinations. In addition, a preliminary stage and other overdose control strategies can be incorporated according to practical needs, because neither of them is an internal part of the design.

This study primarily showcases the effectiveness of the 2dCFO design for phase I trials involving a combination of two drugs. However, this research can also be expanded to incorporate seamless phase I/II trials and trials involving more than two drugs. Following the CFO design for a seamless phase I/II trial (17), the only difference in extending to a drug combination trial is the selection of the admissible set, which can be readily determined by adhering to the partial ordering constraints. Furthermore, one can modify the 2dCFO design to accommodate trials involving more than two drugs by simply altering the decision rules. For example, a trial involving three drugs would require 1dCFO analysis on three axes within the 3D toxicity probability space, spanning the horizontal (X-axis), vertical (Y-axis), and depth (Z-axis) dimensions. The final decision can then be derived from a majority vote across these three separate CFO analyses.

No overdose control strategies were adopted in both our fixed and random simulations. There are two types of overdose control strategies, early stopping and dose elimination. Early stopping will only be adopted when the lowest dose (1,1) is overly toxic. However, in our fixed scenarios, the highest DLT rate at (1,1) is 0.3. For random scenarios, according to Figures 4, 5, the DLT rate at (1,1) is much lower than 0.3. Hence both scenarios are not applicable for early stopping rules. Dose elimination, on the other hand, will eliminate the dose and all the higher doses if that dose is overly toxic. This was intentionally omitted from our simulations with the aim of focusing on the escalation performance of the methods, particularly their accuracy and efficiency. Implementing dose elimination rules could potentially skew results, as different methods adopt disparate rules, making it challenging to ensure fairness in comparisons. On the other hand, dose elimination is flexible, with the threshold being tunable for each method. Therefore, we proposed that these should not be considered intrinsic to the method itself, but rather, they should be seen as a variable to bear in mind when executing actual clinical trials.

When planning a real trial, the selection of methods plays a crucial role in achieving desired outcomes. According to our simulation results, in terms of balanced performance characteristics, 2dCFO stands out with competitive accuracy, efficiency, and an acceptable level of risk for safety control. This makes it a particularly suitable candidate, especially for trial designs with fewer dose levels. The performances of the non-parametric methods, namely 2dCFO and 2dBOIN, are largely similar. However, while 2dCFO is optimal for scenarios with fewer dose levels, 2dBOIN may offer a viable alternative in situations where computational complexity is a significant consideration, as it is based on simple patients count in the proposed interval. If safety is not the primary concern, POCRM might be a desirable choice due to its superior accuracy and efficiency. However, should safety be a paramount concern, the logistic method could present a more suitable option. These insights should guide the selection of methods in real trial design, always considering the specific requirements and priorities of each case.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Author contributions

WW: Conceptualization, Data curation, Formal Analysis, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing. HJ: Conceptualization, Methodology, Supervision, Writing – review & editing. YZ: Supervision, Writing – review & editing. GY: Conceptualization, Methodology, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

We would like to express our gratitude to the editor and the two reviewers for their insightful comments and invaluable suggestions. Their meticulous review and constructive feedback greatly enhanced the quality of our work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Storer BE. Design and analysis of phase I clinical trials. Biometrics (1989) 45, 925–37. doi: 10.2307/2531693

PubMed Abstract | CrossRef Full Text | Google Scholar

2. O’Quigley J, Pepe M, Fisher L. Continual reassessment method: a practical design for phase 1 clinical trials in cancer. Biometrics (1990) 46, 33–48. doi: 10.2307/2531628

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Babb J, Rogatko André, Zacks S. Cancer phase I clinical trials: efficient dose escalation with overdose control. Stat Med (1998) 17(10):1103–20. doi: 10.1002/(SICI)1097-0258(19980530)17:10<1103::AID-SIM793>3.0.CO;2-9

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Liu S, Yuan Y. Bayesian optimal interval designs for phase I clinical trials. J R Stat Society: Ser C (Applied Statistics) (2015) 64(3):507–23. doi: 10.1111/rssc.12089

CrossRef Full Text | Google Scholar

5. Lin R, Yin G. Nonparametric overdose control with late-onset toxicity in phase I clinical trials. Biostatistics (2017) 18(1), 180–94. doi: 10.1093/biostatistics/kxw038

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Sun X, Vilar S, Tatonetti NP. High-throughput methods for combinatorial drug discovery. Sci Trans Med (2013) 5(205):205rv1–1. doi: 10.1126/scitranslmed.3006667

CrossRef Full Text | Google Scholar

7. Thall PF, Millikan RE, Mueller P, Lee S-J. Dose-finding with two agents in phase I oncology trials. Biometrics (2003) 59(3):487–96. doi: 10.1111/1541-0420.00058

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Wang K, Ivanova A. Two-dimensional dose finding in discrete dose space. Biometrics (2005) 61(1):217–22. doi: 10.1111/j.0006-341X.2005.030540.x

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Yin G, Yuan Y. A latent contingency table approach to dose finding for combinations of two agents. Biometrics (2009) 65(3):866–75. doi: 10.1111/j.1541-0420.2008.01119.x

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Yin G, Yuan Y. Bayesian dose finding in oncology for drug combinations by copula regression. J R Stat Society: Ser C (Applied Statistics) (2009) 58(2):211–24. doi: 10.1111/j.1467-9876.2009.00649.x

CrossRef Full Text | Google Scholar

11. Wages NA, Conaway MR, O’Quigley J. Continual reassessment method for partial ordering. Biometrics (2011) 67(4):1555–63. doi: 10.1111/j.1541-0420.2011.01560.x

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Riviere M-K, Yuan Y, Dubois Frédéric, Zohar S. A bayesian dosefinding design for drug combination clinical trials based on the logistic model. Pharm Stat (2014) 13(4):247–57. doi: 10.1002/pst.1621

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Riviere M-K, Yuan Y, Dubois F, Zohar S. A bayesian dose finding design for clinical trials combining a cytotoxic agent with a molecularly targeted agent. J R Stat Society: Ser C (Applied Statistics) (2015) 64(1):215–29. doi: 10.1111/rssc.12072

CrossRef Full Text | Google Scholar

14. Mander AP, Sweeting MJ. A product of independent beta probabilities dose escalation design for dual-agent phase I trials. Stat Med (2015) 34(8):1261–76. doi: 10.1002/sim.6434

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Lin R, Yin G. Bayesian optimal interval design for dose finding in drugcombination trials. Stat Methods Med Res (2017) 26(5):2155–67. doi: 10.1177/0962280215594494

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Razaee ZS, Cook-Wiens G, Tighiouart M. A nonparametric bayesian method for dose finding in drug combinations cancer trials. Stat Med (2022) 41(6):1059–80. doi: 10.1002/sim.9316

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Jin H, Yin G. CFO: Calibration-free odds design for phase I/II clinical trials. Stat Methods Med Res (2022) 31(6):1051–66. doi: 10.1177/09622802221079353

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Bril G, Dykstra R, Pillers C, Robertson T. Algorithm as 206: isotonic regression in two independent variables. J R Stat Society Ser C (Applied Statistics) (1984) 33(3):352–7. doi: 10.2307/2347723

CrossRef Full Text | Google Scholar

19. Yan F, Zhang L, Zhou Y, Pan H, Liu S, Yuan Y. BOIN: an R package for designing single-agent and drug-combination dose-finding trials using bayesian optimal interval designs. J Stat Software (2020) 94:1–32. doi: 10.18637/jss.v094.i13

CrossRef Full Text | Google Scholar

20. Wages NA, Varhegyi N. pocrm: an r-package for phase I trials of combinations of agents. Comput Methods Programs Biomedicine (2013) 112(1):211–8. doi: 10.1016/j.cmpb.2013.05.020

CrossRef Full Text | Google Scholar

21. Wages NA, Conaway MR. Specifications of a continual reassessment method design for phase I trials of combined drugs. Pharm Stat (2013) 12(4):217–24. doi: 10.1002/pst.1575

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Riviere M-K, Jourdan J-H, Zohar S. Dfcomb: An r-package for phase I/II trials of drug combinations. Comput Methods Programs Biomedicine (2016) 125:117–33. doi: 10.1016/j.cmpb.2015.10.018

CrossRef Full Text | Google Scholar

23. Zhou H, Murray TA, Pan H, Yuan Y. Comparative review of novel model-assisted designs for phase i clinical trials. Stat Med (2018) 37(14):2208–22. doi: 10.1002/sim.7674

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Gandhi L, Bahleda R, Tolaney SM, Kwak EL, Cleary JM, Pandya SS, et al. Phase I study of neratinib in combination with temsirolimus in patients with human epidermal growth factor receptor 2–dependent and other solid tumors. J Clin Oncol (2014) 32(2):68–75. doi: 10.1200/JCO.2012.47.2787

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Ivanova A, Wang K. A non-parametric approach to the design and analysis of two-dimensional dose-finding trials. Stat Med (2004) 23(12):1861–70. doi: 10.1002/sim.1796

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: dose finding, drug combination, maximum tolerated dose, non-parametric method, phase I trial design

Citation: Wang W, Jin H, Zhang YD and Yin G (2023) Two-dimensional calibration-free odds design for phase I drug-combination trials. Front. Oncol. 13:1294258. doi: 10.3389/fonc.2023.1294258

Received: 14 September 2023; Accepted: 06 November 2023;
Published: 29 November 2023.

Edited by:

Jie Yang, Stony Brook University, United States

Reviewed by:

Rongji Mu, Shanghai Jiao Tong University, China
Fangrong Yan, China Pharmaceutical University, China
Jin Xu, East China Normal University, China

Copyright © 2023 Wang, Jin, Zhang and Yin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Guosheng Yin, Z3Vvc2hlbmcueWluQGltcGVyaWFsLmFjLnVr

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Two-dimensional calibration-free odds design for phase I drug-combination trials

1 Introduction

2 Methodology

2.1 The 2dCFO design

2.2 Decision rules and the algorithm

2.3 Early stopping and final selection of the MTD

3 Simulation study

3.1 Fixed-scenario simulation

3.2 Sensitivity analysis

3.3 Random-scenario simulation

4 Real trial application

5 Concluding remarks

Data availability statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good