Literature DB >> 34591602

Two-Stage Approaches to Accounting for Patient Heterogeneity in Machine Learning Risk Prediction Models in Oncology.

Eun Jeong Oh1, Ravi B Parikh2, Corey Chivers3, Jinbo Chen1.   

Abstract

PURPOSE: Machine learning models developed from electronic health records data have been increasingly used to predict risk of mortality for general oncology patients. But these models may have suboptimal performance because of patient heterogeneity. The objective of this work is to develop a new modeling approach to predicting short-term mortality that accounts for heterogeneity across multiple subgroups in the presence of a large number of electronic health record predictors.
METHODS: We proposed a two-stage approach to addressing heterogeneity among oncology patients of different cancer types for predicting their risk of mortality. Structured data were extracted from the University of Pennsylvania Health System for 20,723 patients of 11 cancer types, where 1,340 (6.5%) patients were deceased. We first modeled the overall risk for all patients without differentiating cancer types, as is done in the current practice. We then developed cancer type-specific models using the overall risk score as a predictor along with preselected type-specific predictors. The overall and type-specific models were compared with respect to discrimination using the area under the precision-recall curve (AUPRC) and calibration using the calibration slope. We also proposed metrics that characterize the degree of risk heterogeneity by comparing risk predictors in the overall and type-specific models.
RESULTS: The two-stage modeling resulted in improved calibration and discrimination across all 11 cancer types. The improvement in AUPRC was significant for hematologic malignancies including leukemia, lymphoma, and myeloma. For instance, the AUPRC increased from 0.358 to 0.519 (∆ = 0.161; 95% CI, 0.102 to 0.224) and from 0.299 to 0.354 (∆ = 0.055; 95% CI, 0.009 to 0.107) for leukemia and lymphoma, respectively. For all 11 cancer types, the two-stage approach generated well-calibrated risks. A high degree of heterogeneity between type-specific and overall risk predictors was observed for most cancer types.
CONCLUSION: Our two-stage modeling approach that accounts for cancer type-specific risk heterogeneity has improved calibration and discrimination than a model agnostic to cancer types.

Entities:  

Mesh:

Year:  2021        PMID: 34591602      PMCID: PMC8812620          DOI: 10.1200/CCI.21.00077

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  28 in total

1.  Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.

Authors:  E W Steyerberg; F E Harrell; G J Borsboom; M J Eijkemans; Y Vergouwe; J D Habbema
Journal:  J Clin Epidemiol       Date:  2001-08       Impact factor: 6.437

Review 2.  Estimation methods for heterogeneous cell population models in systems biology.

Authors:  Steffen Waldherr
Journal:  J R Soc Interface       Date:  2018-10-31       Impact factor: 4.118

3.  Accurate Prognostic Awareness Facilitates, Whereas Better Quality of Life and More Anxiety Symptoms Hinder End-of-Life Care Discussions: A Longitudinal Survey Study in Terminally Ill Cancer Patients' Last Six Months of Life.

Authors:  Siew Tzuh Tang; Chen Hsiu Chen; Fur-Hsing Wen; Jen-Shi Chen; Wen-Cheng Chang; Chia-Hsun Hsieh; Wen-Chi Chou; Ming-Mo Hou
Journal:  J Pain Symptom Manage       Date:  2017-12-29       Impact factor: 3.612

4.  Coping and Prognostic Awareness in Patients With Advanced Cancer.

Authors:  Ryan D Nipp; Joseph A Greer; Areej El-Jawahri; Samantha M Moran; Lara Traeger; Jamie M Jacobs; Juliet C Jacobsen; Emily R Gallagher; Elyse R Park; David P Ryan; Vicki A Jackson; William F Pirl; Jennifer S Temel
Journal:  J Clin Oncol       Date:  2017-06-02       Impact factor: 44.544

5.  The legal and ethical concerns that arise from using complex predictive analytics in health care.

Authors:  I Glenn Cohen; Ruben Amarasingham; Anand Shah; Bin Xie; Bernard Lo
Journal:  Health Aff (Millwood)       Date:  2014-07       Impact factor: 6.301

6.  The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.

Authors:  Takaya Saito; Marc Rehmsmeier
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

7.  Risk prediction for chronic kidney disease progression using heterogeneous electronic health record data and time series analysis.

Authors:  Adler Perotte; Rajesh Ranganath; Jamie S Hirsch; David Blei; Noémie Elhadad
Journal:  J Am Med Inform Assoc       Date:  2015-04-20       Impact factor: 4.497

8.  Machine learning models in electronic health records can outperform conventional survival models for predicting patient mortality in coronary artery disease.

Authors:  Andrew J Steele; Spiros C Denaxas; Anoop D Shah; Harry Hemingway; Nicholas M Luscombe
Journal:  PLoS One       Date:  2018-08-31       Impact factor: 3.240

9.  Survival prediction models since liver transplantation - comparisons between Cox models and machine learning techniques.

Authors:  Georgios Kantidakis; Hein Putter; Carlo Lancia; Jacob de Boer; Andries E Braat; Marta Fiocco
Journal:  BMC Med Res Methodol       Date:  2020-11-16       Impact factor: 4.615

10.  Effect of Integrating Machine Learning Mortality Estimates With Behavioral Nudges to Clinicians on Serious Illness Conversations Among Patients With Cancer: A Stepped-Wedge Cluster Randomized Clinical Trial.

Authors:  Christopher R Manz; Ravi B Parikh; Dylan S Small; Chalanda N Evans; Corey Chivers; Susan H Regli; C William Hanson; Justin E Bekelman; Charles A L Rareshide; Nina O'Connor; Lynn M Schuchter; Lawrence N Shulman; Mitesh S Patel
Journal:  JAMA Oncol       Date:  2020-12-10       Impact factor: 31.777

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.