Bobak J Mortazavi1, Nicholas S Downing1, Emily M Bucholz1, Kumar Dharmarajan1, Ajay Manhapra1, Shu-Xia Li1, Sahand N Negahban1, Harlan M Krumholz2. 1. From the Section of Cardiovascular Medicine, Department of Internal Medicine (B.J.M., N.S.D., E.M.B., K.D., H.M.K.), Department of Psychiatry and the Section of General Medicine, Department of Internal Medicine (A.M.), and Robert Wood Johnson Foundation Clinical Scholars Program, Department of Internal Medicine, Yale School of Medicine, and Department of Health Policy and Management (H.M.K.), Yale School of Public Health, New Haven, CT; Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, New Haven, CT (B.J.M., N.S.D., E.M.B., K.D., S.-X.L., H.M.K.); and Department of Statistics, Yale University, New Haven, CT (B.J.M., S.N.N.). 2. From the Section of Cardiovascular Medicine, Department of Internal Medicine (B.J.M., N.S.D., E.M.B., K.D., H.M.K.), Department of Psychiatry and the Section of General Medicine, Department of Internal Medicine (A.M.), and Robert Wood Johnson Foundation Clinical Scholars Program, Department of Internal Medicine, Yale School of Medicine, and Department of Health Policy and Management (H.M.K.), Yale School of Public Health, New Haven, CT; Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, New Haven, CT (B.J.M., N.S.D., E.M.B., K.D., S.-X.L., H.M.K.); and Department of Statistics, Yale University, New Haven, CT (B.J.M., S.N.N.). harlan.krumholz@yale.edu.
Abstract
BACKGROUND: The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine learning algorithms for predicting readmissions. METHODS AND RESULTS: Using data from the Telemonitoring to Improve Heart Failure Outcomes trial, we compared the effectiveness of random forests, boosting, random forests combined hierarchically with support vector machines or logistic regression (LR), and Poisson regression against traditional LR to predict 30- and 180-day all-cause readmissions and readmissions because of heart failure. We randomly selected 50% of patients for a derivation set, and a validation set comprised the remaining patients, validated using 100 bootstrapped iterations. We compared C statistics for discrimination and distributions of observed outcomes in risk deciles for predictive range. In 30-day all-cause readmission prediction, the best performing machine learning model, random forests, provided a 17.8% improvement over LR (mean C statistics, 0.628 and 0.533, respectively). For readmissions because of heart failure, boosting improved the C statistic by 24.9% over LR (mean C statistic 0.678 and 0.543, respectively). For 30-day all-cause readmission, the observed readmission rates in the lowest and highest deciles of predicted risk with random forests (7.8% and 26.2%, respectively) showed a much wider separation than LR (14.2% and 16.4%, respectively). CONCLUSIONS: Machine learning methods improved the prediction of readmission after hospitalization for heart failure compared with LR and provided the greatest predictive range in observed readmission rates.
BACKGROUND: The current ability to predict readmissions in patients with heart failure is modest at best. It is unclear whether machine learning techniques that address higher dimensional, nonlinear relationships among variables would enhance prediction. We sought to compare the effectiveness of several machine learning algorithms for predicting readmissions. METHODS AND RESULTS: Using data from the Telemonitoring to Improve Heart Failure Outcomes trial, we compared the effectiveness of random forests, boosting, random forests combined hierarchically with support vector machines or logistic regression (LR), and Poisson regression against traditional LR to predict 30- and 180-day all-cause readmissions and readmissions because of heart failure. We randomly selected 50% of patients for a derivation set, and a validation set comprised the remaining patients, validated using 100 bootstrapped iterations. We compared C statistics for discrimination and distributions of observed outcomes in risk deciles for predictive range. In 30-day all-cause readmission prediction, the best performing machine learning model, random forests, provided a 17.8% improvement over LR (mean C statistics, 0.628 and 0.533, respectively). For readmissions because of heart failure, boosting improved the C statistic by 24.9% over LR (mean C statistic 0.678 and 0.543, respectively). For 30-day all-cause readmission, the observed readmission rates in the lowest and highest deciles of predicted risk with random forests (7.8% and 26.2%, respectively) showed a much wider separation than LR (14.2% and 16.4%, respectively). CONCLUSIONS: Machine learning methods improved the prediction of readmission after hospitalization for heart failure compared with LR and provided the greatest predictive range in observed readmission rates.
Authors: Ruben Amarasingham; Billy J Moore; Ying P Tabak; Mark H Drazner; Christopher A Clark; Song Zhang; W Gary Reed; Timothy S Swanson; Ying Ma; Ethan A Halm Journal: Med Care Date: 2010-11 Impact factor: 2.983
Authors: Li Wang; Brian Porter; Charles Maynard; Christopher Bryson; Haili Sun; Elliott Lowy; Mary McDonell; Kathleen Frisbee; Christopher Nielson; Stephan D Fihn Journal: Am J Cardiol Date: 2012-07-21 Impact factor: 2.778
Authors: Bjoern H Menze; B Michael Kelm; Ralf Masuch; Uwe Himmelreich; Peter Bachert; Wolfgang Petrich; Fred A Hamprecht Journal: BMC Bioinformatics Date: 2009-07-10 Impact factor: 3.169
Authors: Joseph S Ross; Gregory K Mulvey; Brett Stauffer; Vishnu Patlolla; Susannah M Bernheim; Patricia S Keenan; Harlan M Krumholz Journal: Arch Intern Med Date: 2008-07-14
Authors: Kathleen F Kerr; Zheyu Wang; Holly Janes; Robyn L McClelland; Bruce M Psaty; Margaret S Pepe Journal: Epidemiology Date: 2014-01 Impact factor: 4.822
Authors: Alexander C Razavi; Dominique J Monlezun; Alexander Sapin; Leah Sarris; Emily Schlag; Amber Dyer; Timothy Harlan Journal: Am J Lifestyle Med Date: 2019-07-14
Authors: John P Corradi; Stephen Thompson; Jeffrey F Mather; Christine M Waszynski; Robert S Dicks Journal: J Med Syst Date: 2018-11-14 Impact factor: 4.460