| Literature DB >> 32380893 |
Yuejia Xu1, Angela M Wood2,3, Michael J Sweeting2,4, David J Roberts5,6, Brian Dm Tom1.
Abstract
There is a growing interest in precision medicine where individual heterogeneity is incorporated into decision-making and treatments are tailored to individuals to provide better healthcare. One important aspect of precision medicine is the estimation of the optimal individualized treatment rule (ITR) that optimizes the expected outcome. Most methods developed for this purpose are restricted to the setting with two treatments, while clinical studies with more than two treatments are common in practice. In this work, we summarize methods to estimate the optimal ITR in the multi-arm setting and compare their performance in large-scale clinical trials via simulation studies. We then illustrate their utilities with a case study using the data from the INTERVAL trial, which randomly assigned over 20,000 male blood donors from England to one of the three inter-donation intervals (12-week, 10-week, and eight-week) over two years. We estimate the optimal individualized donation strategies under three different objectives. Our findings are fairly consistent across five different approaches that are applied: when we target the maximization of the total units of blood collected, almost all donors are assigned to the eight-week inter-donation interval, whereas if we aim at minimizing the low hemoglobin deferral rates, almost all donors are assigned to donate every 12 weeks. However, when the goal is to maximize the utility score that "discounts" the total units of blood collected by the incidences of low hemoglobin deferrals, we observe some heterogeneity in the optimal inter-donation interval across donors and the optimal donor assignment strategy is highly dependent on the trade-off parameter in the utility function.Entities:
Keywords: Precision medicine; blood donation; individualized treatment rule; multi-arm trial; utility function
Mesh:
Year: 2020 PMID: 32380893 PMCID: PMC7682530 DOI: 10.1177/0962280220920669
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Description of simulation settings 1–6 and simulation results for n = 20,000 based on 100 replicates: mean (SD) of misclassification rates and value functions.
| Setting | Functional form of interaction | True optimal treatment | Covariates type | Method | Misclassification | Value |
|---|---|---|---|---|---|---|
| 1 | Tree | 1 or 2 or 3 | Continuous | 0.101 (0.016) | 1.226 (0.010) | |
| 0.094 (0.015) | 1.229 (0.010) | |||||
| ACWL | 0.028 (0.040) | 1.276 (0.023) | ||||
| D-learning | 0.100 (0.020) | 1.226 (0.013) | ||||
| BART |
|
| ||||
| 2 | Linear | 1 or 2 or 3 | Continuous | 0.015 (0.004) | 1.736 (0.003) | |
|
|
| |||||
| ACWL | 0.171 (0.020) | 1.662 (0.016) | ||||
| D-learning | 0.018 (0.005) |
| ||||
| BART | 0.056 (0.004) | 1.730 (0.006) | ||||
| 3 | Nonlinear | 1 or 2 or 3 | Continuous | 0.566 (0.012) | 1.089 (0.008) | |
| 0.565 (0.014) | 1.088 (0.011) | |||||
| ACWL | 0.561 (0.016) | 1.089 (0.009) | ||||
| D-learning | 0.572 (0.013) | 1.087 (0.008) | ||||
| BART |
|
| ||||
| 4 | Nonlinear | 1 or 2 or 3 | Continuous | 0.350 (0.011) | 1.129 (0.004) | |
| 0.352 (0.010) | 1.128 (0.004) | |||||
| ACWL | 0.362 (0.011) | 1.118 (0.004) | ||||
| D-learning | 0.359 (0.016) | 1.129 (0.004) | ||||
| BART |
|
| ||||
| 5 | Nonlinear | 1 or 2 or 3 | Continuous + binary+ categorical | 0.077 (0.019) | 1.101 (0.012) | |
| 0.078 (0.018) | 1.101 (0.011) | |||||
| ACWL | 0.029 (0.028) | 1.129 (0.016) | ||||
| D-learning | 0.090 (0.032) | 1.094 (0.019) | ||||
| BART |
|
| ||||
| 6 | Tree | 1 | Continuous |
|
| |
|
|
| |||||
| ACWL |
|
| ||||
| D-learning |
|
| ||||
| BART |
|
|
Note: Methods under comparison include the l1-penalized least squares with hierarchical group LASSO variable selection (l1-PLS-HGL), l1-penalized least squares with group LASSO variable selection (l1-PLS-GL), adaptive contrast weighted learning (ACWL), direct learning (D-learning), and Bayesian additive regression trees (BART). The smallest misclassification rates and the largest value functions for each setting are in bold.
Applications of the l1-penalized least squares with hierarchical group LASSO variable selection (l1-PLS-HGL), l1-penalized least squares with group LASSO variable selection (l1-PLS-GL), adaptive contrast weighted learning (ACWL), and direct learning (D-learning) to data from male donors in the INTERVAL trial.
Assignment Percentages | ITR Effects | |||||
|---|---|---|---|---|---|---|
| Target Outcome | Method | 12 weeks | 10 weeks | 8 weeks | Donation | Deferral |
| Donation | 0.1 (0.0) | 0.3 (0.0) | 99.6 (0.0) | 1.308 (0.004) | 0.026 (0.000) | |
| 0.0 (0.0) | 0.3 (0.3) | 99.7 (0.3) | 1.311 (0.005) | 0.027 (0.000) | ||
| ACWL | 0.0 (0.0) | 0.0 (0.0) | 100.0 (0.0) | 1.315 (0.000) | 0.027 (0.000) | |
| D-learning | 0.3 (0.1) | 0.3 (0.2) | 99.4 (0.2) | 1.307 (0.006) | 0.027 (0.000) | |
| Deferral | 94.4 (0.6) | 5.5 (0.6) | 0.0 (0.0) | −1.188 (0.010) | −0.024 (0.000) | |
| 99.7 (0.6) | 0.2 (0.5) | 0.0 (0.1) | −1.246 (0.006) | −0.024 (0.000) | ||
| ACWL | 99.7 (0.6) | 0.3 (0.6) | 0.0 (0.0) | −1.244 (0.007) | −0.025 (0.000) | |
| D-learning | 95.7 (0.5) | 4.1 (0.5) | 0.2 (0.1) | −1.200 (0.011) | −0.024 (0.000) | |
Note: Means and standard deviations (in parenthesis) of assignment proportions in % and empirical ITR effects on donation and deferral outcomes across 100 repetitions of 5-fold cross-validation are reported. ITR effects measure the difference in the average outcome between donors whose assigned inter-donation intervals in the trial are optimal (with respect to the method used to estimate the ITR) and those whose assigned inter-donation intervals are non-optimal. A larger ITR effect on donation and a smaller ITR effect on deferral are more desirable. The first four and last four rows correspond to the target being maximizing total units of blood collected by the blood service, and minimizing the low Hb deferral rates, respectively.
Applications of the l1-penalized least squares with hierarchical group LASSO variable selection (l1-PLS-HGL), l1-penalized least squares with group LASSO variable selection (l1-PLS-GL), adaptive contrast weighted learning (ACWL), and direct learning (D-learning) to data from male donors in the INTERVAL trial assuming the target is to maximize the utility. The trade-off parameter b in the utility function varies from 1 to 5 at an increment of 1.
Assignment Percentages | ITR Effects | ||||||
|---|---|---|---|---|---|---|---|
| Trade-off Parameter | Method | 12 weeks | 10 weeks | 8 weeks | Donation | Deferral | Utility |
| 0.9 (0.1) | 1.2 (0.1) | 97.9 (0.1) | 1.309 (0.008) | 0.024 (0.001) | 1.064 (0.009) | ||
| 0.3 (0.1) | 2.4 (0.8) | 97.2 (1.0) | 1.289 (0.014) | 0.025 (0.001) | 1.040 (0.014) | ||
| ACWL | 0.0 (0.0) | 0.0 (0.1) | 100.0 (0.1) | 1.314 (0.003) | 0.027 (0.000) | 1.055 (0.002) | |
| D-learning | 0.7 (0.2) | 1.2 (0.5) | 98.1 (0.5) | 1.309 (0.012) | 0.025 (0.001) | 1.058 (0.013) | |
| 3.4 (0.1) | 4.2 (0.3) | 92.4 (0.4) | 1.242 (0.016) | 0.021 (0.001) | 0.809 (0.020) | ||
| 1.7 (0.4) | 7.2 (1.6) | 91.1 (2.0) | 1.217 (0.027) | 0.022 (0.002) | 0.774 (0.024) | ||
| ACWL | 2.7 (0.8) | 3.4 (1.7) | 93.9 (1.3) | 1.266 (0.019) | 0.022 (0.001) | 0.814 (0.016) | |
| D-learning | 1.5 (0.4) | 5.8 (1.1) | 92.7 (1.0) | 1.260 (0.022) | 0.022 (0.001) | 0.816 (0.023) | |
| 8.6 (0.3) | 11.9 (0.5) | 79.5 (0.6) | 1.091 (0.022) | 0.011 (0.001) | 0.689 (0.028) | ||
| 4.8 (1.1) | 15.0 (3.3) | 80.2 (4.4) | 1.069 (0.056) | 0.017 (0.003) | 0.569 (0.041) | ||
| ACWL | 9.3 (1.4) | 8.2 (2.7) | 82.5 (2.2) | 1.100 (0.034) | 0.014 (0.001) | 0.627 (0.032) | |
| D-learning | 3.8 (0.8) | 17.5 (1.6) | 78.6 (1.3) | 1.067 (0.027) | 0.016 (0.001) | 0.607 (0.027) | |
| 17.0 (0.4) | 23.3 (0.5) | 59.7 (0.5) | 0.745 (0.023) | 0.001 (0.001) | 0.623 (0.030) | ||
| 10.5 (2.3) | 27.9 (3.2) | 61.6 (5.2) | 0.782 (0.070) | 0.008 (0.004) | 0.468 (0.081) | ||
| ACWL | 16.8 (1.9) | 16.2 (3.5) | 67.0 (3.1) | 0.793 (0.055) | 0.007 (0.002) | 0.475 (0.033) | |
| D-learning | 9.9 (1.6) | 30.0 (1.7) | 60.2 (0.7) | 0.783 (0.027) | 0.006 (0.001) | 0.543 (0.039) | |
| 26.4 (0.4) | 33.4 (0.5) | 40.3 (0.3) | 0.410 (0.022) | −0.007 (0.001) | 0.648 (0.031) | ||
| 18.2 (3.2) | 48.4 (6.2) | 33.4 (3.4) | 0.324 (0.059) | −0.004 (0.002) | 0.485 (0.084) | ||
| ACWL | 30.1 (2.5) | 22.6 (4.2) | 47.3 (3.2) | 0.422 (0.053) | −0.005 (0.002) | 0.541 (0.046) | |
| D-learning | 19.3 (1.6) | 37.4 (1.3) | 43.3 (0.7) | 0.505 (0.031) | −0.004 (0.001) | 0.622 (0.045) | |
Note: Means and standard deviations (in parenthesis) of assignment proportions in % and empirical ITR effects on donation, deferral, and utility across 100 repetitions of 5-fold cross-validation are reported. ITR effects measure the difference in the average outcome between donors whose assigned inter-donation intervals in the trial are optimal (with respect to the method used to estimate the ITR) and those whose assigned inter-donation intervals are non-optimal. A larger ITR effect on donation/utility and a smaller ITR effect on deferral are more desirable.
ITR effects of three non-personalized rules on the utility outcome. The trade-off parameter b in the utility function varies from 1 to 5 at an increment of 1.
ITR Effects on Utility | |||||
|---|---|---|---|---|---|
| Non-personalized Rule | |||||
| Recommend all male donors to donate every 12 weeks | −1.308 | −0.828 | −0.618 | −0.408 | −0.199 |
| Recommend all male donors to donate every 10 weeks | −0.025 | 0.027 | 0.079 | 0.131 | 0.183 |
| Recommend all male donors to donate every 8 weeks | 1.055 | 0.795 | 0.535 | 0.275 | 0.015 |
Note: ITR effects measure the difference in the average outcome between donors whose assigned inter-donation intervals in the trial are the same as the one specified in the non-personalized rule and those whose assigned inter-donation intervals are different from that specified in the non-personalized rule. A larger ITR effect on utility is more desirable.
Selection percentages of treatment-covariate interactions in the prediction model for the donation outcome across 100 repetitions of 5-fold cross-validation when l1-PLS is used to estimate the optimal ITR.
Variable Selection | |||
|---|---|---|---|
| Baseline Variables | Variable Type | HGL | GL |
| Age | Continuous | 63 | 21 |
| Body mass index | Continuous | 33 | 38 |
| SF-36v2 physical component score | Continuous | 47 | 11 |
| SF-36v2 mental component score | Continuous | 100 | 20 |
| Blood donations in the two years before trial enrollment | Continuous | 100 | 100 |
| Hemoglobin level | Continuous | 100 | 100 |
| White blood cell count | Continuous | 100 | 46 |
| Red blood cell count | Continuous | 0 | 11 |
| Mean corpuscular hemoglobin | Continuous | 100 | 96 |
| Mean corpuscular volume | Continuous | 62 | 11 |
| Platelet count | Continuous | 100 | 12 |
| Ethnicity | Categorical | 47 | 97 |
| Blood group | Categorical | 99 | 99 |
| Iron prescription | Categorical | 1 | 74 |
| Smoke ever | Categorical | 13 | 39 |
| Smoke currently | Categorical | 98 | 99 |
| Alcohol ever | Categorical | 85 | 32 |
| Alcohol currently | Categorical | 100 | 100 |
| New or returning donor status | Categorical | 100 | 32 |
Note: Hierarchical group LASSO (HGL) enforces strong hierarchy between main effects and interactions, and group LASSO (GL) does not impose strong hierarchy between main effects and interactions.
Figure 2.Density plots of ITR effects on utility when (a) b = 1 (b) b = 2 (c) b = 3 (d) b = 4 (e) b = 5 for five donor assignment rules: recommend all male donors to (i) donate every eight weeks, (ii) donate every 10 weeks, (iii) donate every 12 weeks, (iv) donate according to the BART ITR, and (v) donate according to the optimized ITR (non-achievable in practice). The trade-off parameter b in the utility function varies from 1 to 5 at an increment of 1. A larger ITR effect on utility is more desirable.
Applications of Bayesian additive regression trees (BART) to data from male donors in the INTERVAL trial assuming the target is to maximize the utility score. The trade-off parameter b in the utility function varies from 1 to 5 at an increment of 1.
Trade-off Parameter | ||||||
|---|---|---|---|---|---|---|
| Criteria | Assignment Rule | |||||
| BART ITRAssignment Percentages | 12 weeks | 0.0 | 3.6 | 10.7 | 17.5 | 26.7 |
| 10 weeks | 3.2 | 9.4 | 18.8 | 25.5 | 28.4 | |
| 8 weeks | 96.8 | 87.1 | 70.5 | 57.0 | 44.9 | |
| ITR Effects on UtilityPosterior Mean [95% Credible Interval] | All 12 weeks | −1.037[−1.118,−0.956] | −0.827 [−0.915,−0.739] | −0.618 [−0.712,−0.524] | −0.408 [−0.511,−0.305] | −0.199 [−0.317,−0.082] |
| All 10 weeks | −0.025 [−0.106, 0.055] | 0.027 [−0.060, 0.114] | 0.079 [−0.017, 0.174] | 0.131 [0.027, 0.232] | 0.183 [0.068, 0.297] | |
| All 8 weeks | 1.054 [0.973, 1.134] | 0.794 [0.708, 0.883] | 0.535 [0.437, 0.628] | 0.275 [0.169, 0.379] | 0.016 [−0.101, 0.135] | |
| BART ITR | 1.064 [0.983, 1.144] | 0.876 [0.793, 0.959] | 0.750 [0.643, 0.843] | 0.671 [0.553, 0.780] | 0.732 [0.548, 0.852] | |
| Optimized ITR | 1.082 [1.000, 1.159] | 0.898 [0.814, 0.981] | 0.802 [0.701, 0.893] | 0.770 [0.671, 0.870] | 0.823 [0.710, 0.928] | |
Note: Assignment proportions of the BART ITR in percentage and the posterior mean [95% equal tail credible interval] of the ITR effect on the utility outcome for five donor assignment rules are reported. Assignment rules include the following: recommend all male donors to (i) donate every eight weeks, (ii) donate every 10 weeks, (iii) donate every 12 weeks, (iv) donate according to the BART ITR, and (v) donate according to the optimized ITR (non-achievable in practice). A larger ITR effect on utility is more desirable.