| Literature DB >> 33465293 |
Ignacio González-García1, Vadryn Pierre2,3, Vincent F S Dubois1, Nassim Morsli4, Stuart Spencer5, Paul G Baverel1,6, Helen Moore7.
Abstract
We developed and evaluated a method for making early predictions of best overall response (BOR) and overall survival at 6 months (OS6) in patients with cancer treated with immunotherapy. This method combines machine learning with modeling of longitudinal tumor size data. We applied our method to data from durvalumab-exposed patients with recurrent/metastatic head and neck cancer. A fivefold cross-validation was used for model selection. Independent trial data, with various degrees of data truncation, were used for model validation. Mean classification error rates (90% confidence intervals [CIs]) from cross-validation were 5.99% (90% CI 2.98%-7.50%) for BOR and 19.8% (90% CI 15.8%-39.3%) for OS6. During model validation, the area under the receiver operating characteristic curves was preserved for BOR (0.97, 0.97, and 0.94) and OS6 (0.85, 0.84, and 0.82) at 24, 18, and 12 weeks, respectively. These results suggest our method predicts trial outcomes accurately from early data and could be used to aid drug development.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33465293 PMCID: PMC7965835 DOI: 10.1002/psp4.12594
Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN: 2163-8306
Covariate distribution at baseline, split by study, and data set (training and validation)
| Variable, mean (SD) | Training | Validation EAGLE | |||
|---|---|---|---|---|---|
| HAWK | CONDOR | Study_11 | Study_1108 | ||
|
| 111 | 195 | 41 | 54 | 482 |
| Age, years | 57.5 (12.2) | 60.3 (9.53) | 60.7 (11.5) | 58.9 (11.2) | 59.5 (9.59) |
| Albumin, [g/L | 38.1 (4.97) | 38.6 (4.91) | 39.2 (6.24) | 37.6 (4.67) | 39.1 (4.95) |
| ALP, U/L | 96.0 (52.3) | 99.9 (101) | 102 (78.8) | 105 (62.2) | 123 (100) |
| ALT, U/L | 19.5 (10.8) | 18.9 (11.3) | 19.1 (9.11) | 22.5 (12.0) | 19.1 (15.5) |
| GGT, U/L | 49.8 (55.7) | 60.4 (114) | 54.9 (64.6) | 61.5 (92.1) | 65 (108) |
| HB, g/L | 117 (14.0) | 117 (17.5) | 121 (18.6) | 117 (18.4) | 119 (16.6) |
| IC, % | 15.2 (18.2) | 15.8 (19.3) | 25.6 (21.3) | 21.1 (20.3) | 17.4 (21.6) |
| NEUT, 109/L | 6.28 (4.05) | 6.08 (3.36) | 6.43 (3.9) | 5.56 (3.81) | 6.24 (3.88) |
| NLR | 8.33 (7.97) | 7.32 (5.52) | 9.28 (6.79) | 8.33 (7.35) | 7.38 (8.49) |
| TC, % | 62.6 (26.6) | 3.98 (6.24) | 34.2 (34.4) | 22.5 (29.9) | 19.6 (28.2) |
| SLD, mm | 68.8 (38.9) | 73.3 (40.2) | 75.2 (51.7) | 76.3 (45.0) | 64.8 (42.2) |
Abbreviations: ALP, alkaline phosphatase; ALT, alanine transferase; ECOG, Eastern Cooperative Oncology Group; GGT, gamma glutamyl transferase; HB, hemoglobin; HPV, human papillomavirus; IC, immune cell PD‐L1 expression, expressed in percentage staining; NEUT, neutrophil count; NLR, neutrophil to lymphocyte ratio; SLD, sum of longest diameters of the tumor size at baseline; TC, tumor cell PD‐L1 expression, expressed in percentage staining.
FIGURE 1Schematic of the classification method
FIGURE 2Top panels. Longitudinal tumor data of durvalumab‐treated patients with recurrent and metastatic head and neck squamous cell carcinoma used for training the model (panel a, 4 studies) and for validation (panel b, one confirmatory study). Best overall response outcome classifications used are as follows: responders are patients with complete response (CR) or partial response (PR); nonresponders are patients with stable disease (SD), progressive disease (PD), or not evaluable (NE). Bottom panels. Kaplan‐Meier plots of overall survival (OS) for all four clinical trials used in the training dataset stratified by study (panel c), and validation dataset OS time profile (panel d). A 6‐month landmark time was used in the analysis to dichotomize OS (responder = alive and still in the trial at 6 months; nonresponder = not in the trial at 6 months), denoted as overall survival at 6 months (OS6)
FIGURE 3(a) Scatterplot of sum of longest diameter of target lesions time‐course data (plain circles) and nonlinear mixed‐effect tumor model fit (solid line) for eight representative individuals selected from the training data. Individual parameter estimates k_d and k_g, expressed in week−1, are provided. (b) Scatterplot of the individual parameter estimates (k_d vs. k_g) and categorization by best overall response (BOR) in the training dataset. (c) Scatterplot of (k_d vs. k_g) and categorization by overall survival at 6 months (OS6) response in the training dataset. In (b) and (c), the solid black demarcation curve represents the machine learning output separating responders (above the curve) from nonresponders (below the curve). Observed clinical outcomes of BOR and OS6 are color‐coded as indicated in the legends
FIGURE 4Area under the curve (AUC) estimates of receiver‐operator curve (ROC) and error rate estimates of the classification algorithm in various settings. Primary analysis: Fivefold cross‐validation (mean and 90% prediction interval), and external validation (12‐week truncation) for best overall response (BOR) and overall survival at 6 months (OS6) trial outcomes. Sensitivity analysis 1: comparison of results for monotherapy compared with combination therapy. Sensitivity analysis 2: analysis of the effect of sample size (see the [Link], [Link], [Link], [Link] for details on these analyses)
FIGURE 5(a) Spider plots of observed (dots) and individually predicted (lines) tumor size change from baseline over time at various early cutoff times of the test data set: left (24 weeks), middle (18 weeks), and right (12 weeks). (b) Receiver operating characteristic curve for prediction of best overall response (BOR; red) and overall survival at 6 months (OS6; green) corresponding to each data truncation. Classification error rates and areas under the curve (AUC) of the ROC curve for BOR (red) and OS6 (green) corresponding to each data truncation
Performance metrics of the classification based on external validation results from the analysis of EAGLE trial outcomes (BOR and OS6) with the entire dataset or 24‐, 18‐, or 12‐week data truncation
| BOR | OS6 | |||||||
|---|---|---|---|---|---|---|---|---|
| Data used | All data | 24 wk | 18 wk | 12 wk | All data | 24 wk | 18 wk | 12 wk |
| AUC ROC | 0.957 | 0.968 | 0.971 | 0.937 | 0.838 | 0.847 | 0.843 | 0.821 |
| Sensitivity | 0.773 | 0.795 | 0.761 | 0.659 | 0.574 | 0.574 | 0.566 | 0.481 |
| Specificity | 0.964 | 0.964 | 0.967 | 0.957 | 0.912 | 0.909 | 0.909 | 0.909 |
| Error rate (%) | 7.05 | 6.64 | 7.05 | 9.75 | 17.8 | 18.0 | 18.3 | 20.5 |
| PPV | 0.829 | 0.833 | 0.838 | 0.773 | 0.705 | 0.698 | 0.695 | 0.66 |
| NPV | 0.950 | 0.955 | 0.948 | 0.926 | 0.854 | 0.854 | 0.851 | 0.827 |
| AUC PRC | 0.836 | 0.866 | 0.861 | 0.793 | 0.611 | 0.641 | 0.626 | 0.591 |
| Youden Index | 0.737 | 0.759 | 0.728 | 0.616 | 0.486 | 0.483 | 0.475 | 0.39 |
| MCC | 0.757 | 0.774 | 0.756 | 0.656 | 0.521 | 0.516 | 0.510 | 0.436 |
Abbreviations: AUC, area under the curve; BOR, best overall response; MCC, Matthews correlation coefficient; NPV, negative predictive value; PPV, positive predictive value; PRC, precision and recall curve; 0S6, overall survival at 6 months; ROC, receiver‐operator curve.