Michael E Matheny1,2,3,4, Iben Ricket5, Christine A Goodrich5, Rashmee U Shah6, Meagan E Stabler5, Amy M Perkins2,4, Chad Dorn1, Jason Denton1, Bruce E Bray6,7, Ram Gouripeddi7, John Higgins5, Wendy W Chapman7,8, Todd A MacKenzie5, Jeremiah R Brown5. 1. Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee. 2. Deparment of Biostatistics, Vanderbilt University Medical Center, Nashville, Tennessee. 3. Division of General Internal Medicine, Vanderbilt University Medical Center, Nashville, Tennessee. 4. Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, Nashville. 5. Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, New Hampshire. 6. Division of Cardiovascular Medicine, University of Utah School of Medicine, Salt Lake City. 7. Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City. 8. Centre for Clinical and Public Health Informatics, University of Melbourne, Melbourne, Victoria, Australia.
Abstract
Importance: In the US, more than 600 000 adults will experience an acute myocardial infarction (AMI) each year, and up to 20% of the patients will be rehospitalized within 30 days. This study highlights the need for consideration of calibration in these risk models. Objective: To compare multiple machine learning risk prediction models using an electronic health record (EHR)-derived data set standardized to a common data model. Design, Setting, and Participants: This was a retrospective cohort study that developed risk prediction models for 30-day readmission among all inpatients discharged from Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary diagnosis of AMI who were not transferred from another facility. The model was externally validated at Dartmouth-Hitchcock Medical Center from April 2, 2011, to December 31, 2016. Data analysis occurred between January 4, 2019, and November 15, 2020. Exposures: Acute myocardial infarction that required hospital admission. Main Outcomes and Measures: The main outcome was thirty-day hospital readmission. A total of 141 candidate variables were considered from administrative codes, medication orders, and laboratory tests. Multiple risk prediction models were developed using parametric models (elastic net, least absolute shrinkage and selection operator, and ridge regression) and nonparametric models (random forest and gradient boosting). The models were assessed using holdout data with area under the receiver operating characteristic curve (AUROC), percentage of calibration, and calibration curve belts. Results: The final Vanderbilt University Medical Center cohort included 6163 unique patients, among whom the mean (SD) age was 67 (13) years, 4137 were male (67.1%), 1019 (16.5%) were Black or other race, and 933 (15.1%) were rehospitalized within 30 days. The final Dartmouth-Hitchcock Medical Center cohort included 4024 unique patients, with mean (SD) age of 68 (12) years; 2584 (64.2%) were male, 412 (10.2%) were rehospitalized within 30 days, and most of the cohort were non-Hispanic and White. The final test set AUROC performance was between 0.686 to 0.695 for the parametric models and 0.686 to 0.704 for the nonparametric models. In the validation cohort, AUROC performance was between 0.558 to 0.655 for parametric models and 0.606 to 0.608 for nonparametric models. Conclusions and Relevance: In this study, 5 machine learning models were developed and externally validated to predict 30-day readmission AMI hospitalization. These models can be deployed within an EHR using routinely collected data.
Importance: In the US, more than 600 000 adults will experience an acute myocardial infarction (AMI) each year, and up to 20% of the patients will be rehospitalized within 30 days. This study highlights the need for consideration of calibration in these risk models. Objective: To compare multiple machine learning risk prediction models using an electronic health record (EHR)-derived data set standardized to a common data model. Design, Setting, and Participants: This was a retrospective cohort study that developed risk prediction models for 30-day readmission among all inpatients discharged from Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary diagnosis of AMI who were not transferred from another facility. The model was externally validated at Dartmouth-Hitchcock Medical Center from April 2, 2011, to December 31, 2016. Data analysis occurred between January 4, 2019, and November 15, 2020. Exposures: Acute myocardial infarction that required hospital admission. Main Outcomes and Measures: The main outcome was thirty-day hospital readmission. A total of 141 candidate variables were considered from administrative codes, medication orders, and laboratory tests. Multiple risk prediction models were developed using parametric models (elastic net, least absolute shrinkage and selection operator, and ridge regression) and nonparametric models (random forest and gradient boosting). The models were assessed using holdout data with area under the receiver operating characteristic curve (AUROC), percentage of calibration, and calibration curve belts. Results: The final Vanderbilt University Medical Center cohort included 6163 unique patients, among whom the mean (SD) age was 67 (13) years, 4137 were male (67.1%), 1019 (16.5%) were Black or other race, and 933 (15.1%) were rehospitalized within 30 days. The final Dartmouth-Hitchcock Medical Center cohort included 4024 unique patients, with mean (SD) age of 68 (12) years; 2584 (64.2%) were male, 412 (10.2%) were rehospitalized within 30 days, and most of the cohort were non-Hispanic and White. The final test set AUROC performance was between 0.686 to 0.695 for the parametric models and 0.686 to 0.704 for the nonparametric models. In the validation cohort, AUROC performance was between 0.558 to 0.655 for parametric models and 0.606 to 0.608 for nonparametric models. Conclusions and Relevance: In this study, 5 machine learning models were developed and externally validated to predict 30-day readmission AMI hospitalization. These models can be deployed within an EHR using routinely collected data.
Authors: Patricia S Keenan; Sharon-Lise T Normand; Zhenqiu Lin; Elizabeth E Drye; Kanchana R Bhat; Joseph S Ross; Jeremiah D Schuur; Brett D Stauffer; Susannah M Bernheim; Andrew J Epstein; Yongfei Wang; Jeph Herrin; Jersey Chen; Jessica J Federer; Jennifer A Mattera; Yun Wang; Harlan M Krumholz Journal: Circ Cardiovasc Qual Outcomes Date: 2008-09
Authors: Jennifer Meddings; Heidi Reichert; Shawna N Smith; Theodore J Iwashyna; Kenneth M Langa; Timothy P Hofer; Laurence F McMahon Journal: J Gen Intern Med Date: 2016-11-15 Impact factor: 5.128
Authors: Lauren N Smith; Anil N Makam; Douglas Darden; Helen Mayo; Sandeep R Das; Ethan A Halm; Oanh Kieu Nguyen Journal: Circ Cardiovasc Qual Outcomes Date: 2018-01
Authors: Farhan J Khawaja; Nilay D Shah; Ryan J Lennon; Joshua P Slusser; Aziz A Alkatib; Charanjit S Rihal; Bernard J Gersh; Victor M Montori; David R Holmes; Malcolm R Bell; Jeptha P Curtis; Harlan M Krumholz; Henry H Ting Journal: Arch Intern Med Date: 2011-11-28
Authors: Harlan M Krumholz; Zhenqiu Lin; Elizabeth E Drye; Mayur M Desai; Lein F Han; Michael T Rapp; Jennifer A Mattera; Sharon-Lise T Normand Journal: Circ Cardiovasc Qual Outcomes Date: 2011-03
Authors: Ruth M Reeves; Lee Christensen; Jeremiah R Brown; Michael Conway; Maxwell Levis; Glenn T Gobbel; Rashmee U Shah; Christine Goodrich; Iben Ricket; Freneka Minter; Andrew Bohm; Bruce E Bray; Michael E Matheny; Wendy Chapman Journal: J Biomed Inform Date: 2021-06-24 Impact factor: 8.000
Authors: Jeremiah R Brown; Iben M Ricket; Ruth M Reeves; Rashmee U Shah; Christine A Goodrich; Glen Gobbel; Meagan E Stabler; Amy M Perkins; Freneka Minter; Kevin C Cox; Chad Dorn; Jason Denton; Bruce E Bray; Ramkiran Gouripeddi; John Higgins; Wendy W Chapman; Todd MacKenzie; Michael E Matheny Journal: J Am Heart Assoc Date: 2022-03-24 Impact factor: 6.106