Literature DB >> 34308443

CaliForest: Calibrated Random Forest for Health Data.

Yubin Park1, Joyce C Ho2.   

Abstract

Real-world predictive models in healthcare should be evaluated in terms of discrimination, the ability to differentiate between high and low risk events, and calibration, or the accuracy of the risk estimates. Unfortunately, calibration is often neglected and only discrimination is analyzed. Calibration is crucial for personalized medicine as they play an increasing role in the decision making process. Since random forest is a popular model for many healthcare applications, we propose CaliForest, a new calibrated random forest. Unlike existing calibration methodologies, CaliForest utilizes the out-of-bag samples to avoid the explicit construction of a calibration set. We evaluated CaliForest on two risk prediction tasks obtained from the publicly-available MIMIC-III database. Evaluation on these binary prediction tasks demonstrates that CaliForest can achieve the same discriminative power as random forest while obtaining a better-calibrated model evaluated across six different metrics. CaliForest is published on the standard Python software repository and the code is openly available on Github.

Keywords:  Applied computing→Health informatics; Bagging; Computing methodologies→Classification and regression trees; General and reference→Empirical studies; calibration; healthcare; python; random forest

Year:  2020        PMID: 34308443      PMCID: PMC8299436          DOI: 10.1145/3368555.3384461

Source DB:  PubMed          Journal:  Proc ACM Conf Health Inference Learn (2020)


  23 in total

1.  Benchmarking deep learning models on large healthcare datasets.

Authors:  Sanjay Purushotham; Chuizheng Meng; Zhengping Che; Yan Liu
Journal:  J Biomed Inform       Date:  2018-06-05       Impact factor: 6.317

2.  Machine learning for phenotyping opioid overdose events.

Authors:  Jonathan Badger; Eric LaRose; John Mayer; Fereshteh Bashiri; David Page; Peggy Peissig
Journal:  J Biomed Inform       Date:  2019-04-25       Impact factor: 6.317

3.  Discrimination and Calibration of Clinical Prediction Models: Users' Guides to the Medical Literature.

Authors:  Ana Carolina Alba; Thomas Agoritsas; Michael Walsh; Steven Hanna; Alfonso Iorio; P J Devereaux; Thomas McGinn; Gordon Guyatt
Journal:  JAMA       Date:  2017-10-10       Impact factor: 56.272

Review 4.  Breast cancer: risk assessment and prevention.

Authors:  Mary A Hooks
Journal:  South Med J       Date:  2010-04       Impact factor: 0.954

5.  Probabilistic prediction in patient management and clinical trials.

Authors:  D J Spiegelhalter
Journal:  Stat Med       Date:  1986 Sep-Oct       Impact factor: 2.373

6.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.

Authors:  Ziad Obermeyer; Ezekiel J Emanuel
Journal:  N Engl J Med       Date:  2016-09-29       Impact factor: 91.245

7.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients.

Authors:  David W Bates; Suchi Saria; Lucila Ohno-Machado; Anand Shah; Gabriel Escobar
Journal:  Health Aff (Millwood)       Date:  2014-07       Impact factor: 6.301

8.  Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.

Authors:  Ariana E Anderson; Wesley T Kerr; April Thames; Tong Li; Jiayang Xiao; Mark S Cohen
Journal:  J Biomed Inform       Date:  2015-12-17       Impact factor: 6.317

9.  Doubly Optimized Calibrated Support Vector Machine (DOC-SVM): an algorithm for joint optimization of discrimination and calibration.

Authors:  Xiaoqian Jiang; Aditya Menon; Shuang Wang; Jihoon Kim; Lucila Ohno-Machado
Journal:  PLoS One       Date:  2012-11-06       Impact factor: 3.240

10.  Learning from Longitudinal Data in Electronic Health Record and Genetic Data to Improve Cardiovascular Event Prediction.

Authors:  Juan Zhao; QiPing Feng; Patrick Wu; Roxana A Lupu; Russell A Wilke; Quinn S Wells; Joshua C Denny; Wei-Qi Wei
Journal:  Sci Rep       Date:  2019-01-24       Impact factor: 4.379

View more
  1 in total

Review 1.  Machine learning for the life-time risk prediction of Alzheimer's disease: a systematic review.

Authors:  Thomas W Rowe; Ioanna K Katzourou; Joshua O Stevenson-Hoare; Matthew R Bracher-Smith; Dobril K Ivanov; Valentina Escott-Price
Journal:  Brain Commun       Date:  2021-10-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.