| Literature DB >> 35285816 |
Sheng-Chieh Lu1, Cai Xu1, Chandler H Nguyen2, Yimin Geng3, André Pfob4, Chris Sidey-Gibbons1.
Abstract
BACKGROUND: In the United States, national guidelines suggest that aggressive cancer care should be avoided in the final months of life. However, guideline compliance currently requires clinicians to make judgments based on their experience as to when a patient is nearing the end of their life. Machine learning (ML) algorithms may facilitate improved end-of-life care provision for patients with cancer by identifying patients at risk of short-term mortality.Entities:
Keywords: artificial intelligence; cancer mortality; clinical prediction models; end-of-life care; machine learning
Year: 2022 PMID: 35285816 PMCID: PMC8961346 DOI: 10.2196/33182
Source DB: PubMed Journal: JMIR Med Inform
Figure 1PRISMA (Preferred Reporting Item for Systematic Reviews and Meta-Analyses) flowchart diagram for the study selection process. ML: machine learning.
Characteristics of the included studies (N=15).
| Type of cancer and study | Country | Study type | Treatment | Sample size | Algorithms | Input features (total number of features) | Outcome | |||||||||||||||
|
|
|
|
| Training | Testing | Validating |
|
|
| |||||||||||||
|
| ||||||||||||||||||||||
|
| Sena et al [ | Brazil | 1b | All | 543 | N/Aa | N/A | DTb, ANNc, and NBd | Comorbidity and PROe for physical and mental status assessments (9) | 180-day death | ||||||||||||
|
| Parikh et al [ | United States | 2a | All | 18,567 | 7958 | N/A | GBTf and RFg | Demographic, clinicopathologic, laboratory, comorbidity, and electrocardiogram data (599) | 180-day death | ||||||||||||
|
| Manz et al [ | United States | 4 | All | N/A | N/A | 24,582 | GBT | Same as Parikh et al [ | 180-day death | ||||||||||||
|
| Bertsimas et al [ | United States | 2a | All | 14,427 | 9556 | N/A | DT, regularized LRh, and GBT | Demographic, clinicopathologic, gene mutations, prior treatment, comorbidity, use of health care resources, vital signs, and laboratory data (401) | 180-day death | ||||||||||||
|
| Elfiky et al [ | United States | 2b | All | 17,832 | 9114 | N/A | GBT | Demographic, clinicopathologic, prescription, comorbidity, laboratory, vital sign, and use of health care resources data and physician notes (5390) | 180-day death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Hanai et al [ | Japan | 2b | Curative resection | 125 | 48 | N/A | ANN | Demographic, clinicopathologic, and tumor entity data (17) | 1-year death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Nilsaz-Dezfouli et al [ | Iran | 1b | Surgery | 452 | N/A | N/A | ANN | Demographic, clinicopathologic, tumor entity, and prior treatment (20) | 1-year death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Arostegui et al [ | Spain | 2a | Curative or palliative surgery | 981 | 964 | N/A | DT and regularized LR | Demographic, clinicopathologic, tumor entity, comorbidity, ASAi prior treatment, laboratory, operational data, postoperational complication, and use of health care resources data (32) | 1-year death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Biglarian et al [ | Iran | 2a | Surgery | 300 | 136 | N/A | ANN | Demographic, clinicopathologic, and symptom data (NRj) | 1-year death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Klén et al [ | Turkey | 2a | Radical cystectomy | 733 | 366 | N/A | Regularized LR | Demographic, clinicopathologic, ASA, comorbidity, laboratory, prior treatment, tomography, and operational data (NR) | 90-day death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Chiu et al [ | Taiwan | 2a | Liver resection | 347 | 87 | N/A | ANN | Demographic, clinicopathologic, tumor entity, comorbidity, ASA, laboratory, operational, and postoperational data (21) | 1-year death | ||||||||||||
|
| Zhang et al [ | China | 2a | Liver transplant | 230 | 60 | N/A | ANN | Donor demographic data and recipient laboratory, clinicopathologic, and image data (14) | 1-year death | ||||||||||||
|
| ||||||||||||||||||||||
|
| Karhade et al [ | United States | 2a | Surgery | 1432 | 358 | N/A | ANN, SVMk, DT, and BPMl | Demographic, clinicopathologic, tumor entity, ASA, laboratory, and operational data (23) | 30-day death | ||||||||||||
|
| Karhade et al [ | United States | 2a | Surgery | 587 | 145 | N/A | SGBm, RF, ANN, SVM, and regularized LR | Demographic, clinicopathologic, tumor entity, laboratory, operational, ECOGn, ASIAo, and prior treatment data (29) | 90-day death | ||||||||||||
|
| Karhade et al [ | United States | 4 | Curative surgery | N/A | N/A | 176 | SGB | ECOG, demographic, clinicopathologic, tumor entity, laboratory, prior treatment, and ASIA data (23) | 1-year death | ||||||||||||
aN/A: not applicable.
bDT: decision tree.
cANN: artificial neural network.
dNB: naive Bayes.
ePRO: patient-reported outcome.
fGBT: gradient-boosted tree.
gRF: random forest.
hLR: logistic regression.
iASA: American Sociological Association.
jNR: not reported.
kSVM: support vector machine.
lBPM: Bayes point machine.
mSGB: stochastic gradient boosting.
nECOG: Eastern Cooperative Oncology Group.
oASIA: American Spinal Injury Association.
Figure 2Risk of bias assessment for the included studies. Risk of bias assessment result for each included study using prediction model risk of bias assessment tool [15,35-49].
Predicting performance for the best model for each study in a holdout internal or external validation data set (N=12).
| Type of cancer and study | Outcome | Training sample | Validation sample | Mortality rate (%) | Algorithm | AUROCa | Accuracy | Sensitivity | Specificity | PPVb | NPVc | Calibration | Benchmark, model (Δ AUROC) | ||
|
| |||||||||||||||
|
| Manz et al [ | 180-day death | N/Ad | 24,582 | 4.2 | GBTe | 0.89 | —f | 0.27 | 0.99 | 0.45 | 0.97 | Well-fit | — | |
|
| Parikh et al [ | 180-day death | 18,567 | 7958 | 4.0 | RFg | 0.87 | 0.96 | — | 0.99 | 0.51 | — | Well-fit at the low-risk group | LRh (0.01) | |
|
| Bertsimas et al [ | 180-day death | 14,427 | 9556 | 5.6 | GBT | 0.87 | 0.87 | .60 | — | 0.53 | — | — | LR (0.11) | |
|
| Elfiky et al [ | 180-day death | 17,832 | 9114 | 18.4 | GBT | 0.83 | — | — | — | — | — | Well-fit | — | |
|
|
| ||||||||||||||
|
| Arostegui et al [ | 1-year death | 981 | 964 | 5.1 | DTi | 0.84 | — | — | — | — | — | Well-fit | — | |
|
| Biglarian et al [ | 1-year death | 300 | 136 | 37.5 | ANNj | 0.92 | — | 0.80 | 0.85 | — | — | — | CPHk (0.04)l | |
|
| |||||||||||||||
|
| Klén et al [ | 90-day death | 733 | 366 | 4.4 | Regularized LR | 0.72 | — | — | — | — | — | — | ACCIm univariate model (0.05) | |
|
| |||||||||||||||
|
| Chiu et al [ | 1-year death | 347 | 87 | 17 | ANN | 0.88 | — | 0.89 | 0.50 | — | — | — | LR (0.08) | |
|
| Zhang et al [ | 1-year death | 230 | 60 | 23.9 | ANN | 0.91 | — | 0.91 | 0.90 | 0.83 | 0.86 | — | — | |
|
| |||||||||||||||
|
| Karhade et al [ | 30-day death | 1432 | 358 | 8.5 | BPMn | 0.78 | — | — | — | — | — | Well-fit | — | |
|
| Karhade et al [ | 1-year death | 586 | 145 | 54.3 | SGBo | 0.89 | — | — | — | — | — | Well-fit | — | |
|
| Karhade et al [ | 1-year death | N/A | 176 | 56.2 | SGB | 0.77 | — | — | — | — | — | Fairly well-fit | — | |
aAUROC: area under the receiver operating characteristic curve.
bPPV: positive predictive value.
cNPV: negative predictive value.
dN/A: not applicable.
eGBT: gradient-boosted tree.
fNo data available
gRF: random forest.
hLR: logistic regression.
iDT: decision tree.
jANN: artificial neural network.
kCPH: Cox proportional hazard.
lSignificant at the α level defined by the study.
mACCI: adjusted Charlson comorbidity index.
nBPM: Bayes point machine
oSGB: stochastic gradient boosting.
Figure 3Pooled AUROC by machine learning (ML) algorithm. ANN: artificial neural network; AUROC: area under the receiver operating characteristic curve; BPM: Bayes point machine; DT: decision tree; GBT: gradient-boosted tree; LR: logistic regression; RF: random forest; SGB: stochastic gradient boosting; SVM: support vector machine.
The Model development processes and evaluations used in the included studies.
| Type and study | Data preprocessing | Model optimization | Interpretation | |||||||
|
| Numeric variables | Categorical | Missing data | Feature | Hyperparameter | Generalizability |
| |||
|
| ||||||||||
|
| Sena et al [ | Normalization | N/Aa | NRb | None | Software default | 10-fold CVc | VId | ||
|
| Nilsaz-Dezfouli et al [ | NR | NR | NR | VI | Grid search | 5×5-fold CV | VI | ||
|
| ||||||||||
|
| Parikh et al [ | NR | NR | Constant value imputation | Zero variance and between-variable correlation | Grid search | 5-fold CV | VI and coefficient | ||
|
| Klén et al [ | NR | NR | Complete cases only | LASSOe LRf | NR | NR | VI | ||
|
| Karhade et al [ | NR | NR | missForest multiple imputation | RFg | NR | 3×10-fold CV | VI, PDPh, and LIMEi | ||
|
| Karhade et al [ | NR | NR | Multiple imputation | Recursive feature selection | NR | 10-fold CV | NR | ||
|
| Arostegui et al [ | Discretization | One-hot encoding | Constant value imputation | RF variable importance | Software default | Bootstrapping | VI and decision tree rules | ||
|
| Bertsimas et al [ | NR | NR | Optimal impute algorithm | None | NR | NR | VI and decision tree rules | ||
|
| Chiu et al [ | NR | NR | Complete cases only | Univariate Cox proportional hazard model | NR | NR | VI | ||
|
| Zhang et al [ | Normalization | One-hot encoding | NR | Forward stepwise selection algorithm | NR | 10-fold CV | VI | ||
|
| Biglarian et al [ | NR | NR | NR | None | NR | NR | NR | ||
|
| ||||||||||
|
| Elfiky et al [ | NR | NR | Probabilistic imputation | None | Grid search | 4-fold CV | VI | ||
|
| Hanai et al [ | Standardization | NR | NR | Between-variable correlation and PIMj | NR | 5-fold CV | VI | ||
|
| ||||||||||
|
| Manz et al [ | NR | NR | Constant value imputation | N/A | N/A | N/A | VI and coefficient | ||
|
| Karhade et al [ | NR | NR | missForest multiple imputation | N/A | N/A | N/A | NR | ||
aN/A: not applicable.
bNR: not reported.
cCV: cross-validation.
dVI: variable importance.
eLASSO: least absolute shrinkage and selection operator.
fLR: logistic regression.
gRF: random forest.
hPDP: partial dependence plot.
iLIME: local interpretable model-agnostic explanation.
jPIM: parameter-increasing method.