| Literature DB >> 33280255 |
Phyllis Chan1, Xiaofei Zhou1,2, Nina Wang1, Qi Liu1, René Bruno3, Jin Y Jin1.
Abstract
Machine learning (ML) was used to leverage tumor growth inhibition (TGI) metrics to characterize the relationship with overall survival (OS) as a novel approach and to compare with traditional TGI-OS modeling methods. Historical dataset from a phase III non-small cell lung cancer study (OAK, atezolizumab vs. docetaxel, N = 668) was used. ML methods support the validity of TGI metrics in predicting OS. With lasso, the best model with TGI metrics outperforms the best model without TGI metrics. Boosting was the best linear ML method for this dataset with reduced estimation bias and lowest Brier score, suggesting better prediction accuracy. Random forest did not outperform linear ML methods despite hyperparameter optimization. Kernel machine was marginally the best nonlinear ML method for this dataset and uncovered nonlinear and interaction effects. Nonlinear ML may improve prediction by capturing nonlinear effects and covariate interactions, but its predictive performance and value need further evaluation with larger datasets.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33280255 PMCID: PMC7825187 DOI: 10.1002/psp4.12576
Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN: 2163-8306
Top predictors of overall survival selected by different analysis methods
| Analysis methods | Model selected predictors for OS (in order of significance) |
|---|---|
| TGI‐OS | logKG, Metsites, ALBU |
| Lasso | logKG, BSLD, ECOG, TTG, ALBU, LDH |
| Boosting | logKG, BSLD, ECOG, TTG, ALBU, LDH |
| Random forest | logKG, TTG, BSLD, LDH, ALBU |
| Kernel machine | logKG, BSLD, TTG, logKS |
ALBU, albumin; BSLD, baseline tumor size; ECOG, Eastern Cooperative Oncology Group performance status; LDH, lactate dehydrogenase; logKG, log of tumor growth rate constant; logKS, log of tumor shrinkage rate constant; Metsites, number of metastatic sites; OS, overall survival; TGI, tumor growth inhibition; TTG, time to tumor regrowth.
Figure 1Brier scores of different covariate models based on lasso Cox proportional hazard (Cox PH).
Figure 2Brier scores of different covariate models based on lasso accelerated failure time.
Brier scores of models with top predictor selection based on different methods
| Framework | No covariate | Covariates from TGI‐OS | Lasso | All covariates |
|---|---|---|---|---|
| Cox | 0.19 | 0.142 | 0.141 | 0.145 |
| AFT | 0.19 | 0.14 | 0.139 | 0.142 |
AFT, accelerated failure time; OS, overall survival; TGI, time to tumor regrowth.
Figure 3Marginal effects of log(KG), log(KS), and TTG on the log of survival time using kernel machine. KG = 1/week, KS = 1/week, TTG = week. KG, tumor growth rate constant; KS, tumor shrinkage rate constant; TTG, time to tumor regrowth.
Figure 4Interactive effects of log(KG) and log(KS) on the log of survival time using kernel machine. KG, tumor growth rate constant; KS, tumor shrinkage rate constant.
Figure 5Pairwise effects of the 27 covariates estimated by kernel machine. 1 = log(KG), 2 = log(KS), 3 = AUC1, 4 = TTG, 5 = LDH, 6 = ALBU, 7 = Histology, 8 = AST, 9 = ALP, 10 = Age, 11 = BSLD, 12 = SCr, 13 = Smoking status, 14 = TPRO, 15 = ECOG, 16 = Sex, 17 = BWT, 18 = Metsite2, 19 = Metsite4, 20 = Metsites, 21 = YSM, 22 = eGFR, 23 = TC123IC123, 24 = TC23IC23, 25 = number of prior chemotherapy regimens for advanced disease (second‐line vs. third‐line) = Line3, 26 = TC3, 27 = IC3. Red line is the line of identity. The intensity of the grayscale corresponds to the strength of the interaction effect. ALBU, albumin; ALP, alkaline phosphatase; AST, aspartate aminotransferase; AUC, area under the concentration‐time curve; BSLD, baseline sum of longest diameter; BWT, body weight; ECOG, Eastern Cooperative Oncology Group; eGFR, estimated glomerular filtration rate; KG, tumor growth rate constant; KS, tumor shrinkage rate constant; LDH, lactate dehydrogenase; SCr, serum creatinine; TTG, time to tumor regrowth; TPRO, total protein; YSM, years since metastasis.