Saul Blecker1, Stuart D Katz2, Leora I Horwitz1, Gilad Kuperman3, Hannah Park4, Alex Gold2, David Sontag5. 1. Department of Population Health, New York University School of Medicine, New York2Department of Medicine, New York University School of Medicine, New York. 2. Department of Medicine, New York University School of Medicine, New York. 3. Department of Information Systems, NewYork-Presbyterian Hospital, New York. 4. Department of Population Health, New York University School of Medicine, New York. 5. Department of Computer Science, New York University, New York.
Abstract
IMPORTANCE: Accurate, real-time case identification is needed to target interventions to improve quality and outcomes for hospitalized patients with heart failure. Problem lists may be useful for case identification but are often inaccurate or incomplete. Machine-learning approaches may improve accuracy of identification but can be limited by complexity of implementation. OBJECTIVE: To develop algorithms that use readily available clinical data to identify patients with heart failure while in the hospital. DESIGN, SETTING, AND PARTICIPANTS: We performed a retrospective study of hospitalizations at an academic medical center. Hospitalizations for patients 18 years or older who were admitted after January 1, 2013, and discharged before February 28, 2015, were included. From a random 75% sample of hospitalizations, we developed 5 algorithms for heart failure identification using electronic health record data: (1) heart failure on problem list; (2) presence of at least 1 of 3 characteristics: heart failure on problem list, inpatient loop diuretic, or brain natriuretic peptide level of 500 pg/mL or higher; (3) logistic regression of 30 clinically relevant structured data elements; (4) machine-learning approach using unstructured notes; and (5) machine-learning approach using structured and unstructured data. MAIN OUTCOMES AND MEASURES: Heart failure diagnosis based on discharge diagnosis and physician review of sampled medical records. RESULTS: A total of 47 119 hospitalizations were included in this study (mean [SD] age, 60.9 [18.15] years; 23 952 female [50.8%], 5258 black/African American [11.2%], and 3667 Hispanic/Latino [7.8%] patients). Of these hospitalizations, 6549 (13.9%) had a discharge diagnosis of heart failure. Inclusion of heart failure on the problem list (algorithm 1) had a sensitivity of 0.40 and a positive predictive value (PPV) of 0.96 for heart failure identification. Algorithm 2 improved sensitivity to 0.77 at the expense of a PPV of 0.64. Algorithms 3, 4, and 5 had areas under the receiver operating characteristic curves of 0.953, 0.969, and 0.974, respectively. With a PPV of 0.9, these algorithms had associated sensitivities of 0.68, 0.77, and 0.83, respectively. CONCLUSIONS AND RELEVANCE: The problem list is insufficient for real-time identification of hospitalized patients with heart failure. The high predictive accuracy of machine learning using free text demonstrates that support of such analytics in future electronic health record systems can improve cohort identification.
IMPORTANCE: Accurate, real-time case identification is needed to target interventions to improve quality and outcomes for hospitalized patients with heart failure. Problem lists may be useful for case identification but are often inaccurate or incomplete. Machine-learning approaches may improve accuracy of identification but can be limited by complexity of implementation. OBJECTIVE: To develop algorithms that use readily available clinical data to identify patients with heart failure while in the hospital. DESIGN, SETTING, AND PARTICIPANTS: We performed a retrospective study of hospitalizations at an academic medical center. Hospitalizations for patients 18 years or older who were admitted after January 1, 2013, and discharged before February 28, 2015, were included. From a random 75% sample of hospitalizations, we developed 5 algorithms for heart failure identification using electronic health record data: (1) heart failure on problem list; (2) presence of at least 1 of 3 characteristics: heart failure on problem list, inpatient loop diuretic, or brain natriuretic peptide level of 500 pg/mL or higher; (3) logistic regression of 30 clinically relevant structured data elements; (4) machine-learning approach using unstructured notes; and (5) machine-learning approach using structured and unstructured data. MAIN OUTCOMES AND MEASURES: Heart failure diagnosis based on discharge diagnosis and physician review of sampled medical records. RESULTS: A total of 47 119 hospitalizations were included in this study (mean [SD] age, 60.9 [18.15] years; 23 952 female [50.8%], 5258 black/African American [11.2%], and 3667 Hispanic/Latino [7.8%] patients). Of these hospitalizations, 6549 (13.9%) had a discharge diagnosis of heart failure. Inclusion of heart failure on the problem list (algorithm 1) had a sensitivity of 0.40 and a positive predictive value (PPV) of 0.96 for heart failure identification. Algorithm 2 improved sensitivity to 0.77 at the expense of a PPV of 0.64. Algorithms 3, 4, and 5 had areas under the receiver operating characteristic curves of 0.953, 0.969, and 0.974, respectively. With a PPV of 0.9, these algorithms had associated sensitivities of 0.68, 0.77, and 0.83, respectively. CONCLUSIONS AND RELEVANCE: The problem list is insufficient for real-time identification of hospitalized patients with heart failure. The high predictive accuracy of machine learning using free text demonstrates that support of such analytics in future electronic health record systems can improve cohort identification.
Authors: R Scott Evans; Jose Benuzillo; Benjamin D Horne; James F Lloyd; Alejandra Bradshaw; Deborah Budge; Kismet D Rasmusson; Colleen Roberts; Jason Buckway; Norma Geer; Teresa Garrett; Donald L Lappé Journal: J Am Med Inform Assoc Date: 2016-02-17 Impact factor: 4.497
Authors: Serguei Pakhomov; Susan A Weston; Steven J Jacobsen; Christopher G Chute; Ryan Meverden; Véronique L Roger Journal: Am J Manag Care Date: 2007-06 Impact factor: 2.229
Authors: Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2013-03-26 Impact factor: 4.497
Authors: Daniel M Hartung; Jacquelyn Hunt; Joseph Siemienczuk; Heather Miller; Daniel R Touchette Journal: J Gen Intern Med Date: 2005-02 Impact factor: 5.128
Authors: Saul Blecker; Sunil K Agarwal; Patricia P Chang; Wayne D Rosamond; Donald E Casey; Anna Kucharska-Newton; Martha J Radford; Josef Coresh; Stuart Katz Journal: J Am Coll Cardiol Date: 2013-09-25 Impact factor: 24.094
Authors: Herbert C Szeto; Robert K Coleman; Parisa Gholami; Brian B Hoffman; Mary K Goldstein Journal: Am J Manag Care Date: 2002-01 Impact factor: 2.229
Authors: Geoffrey H Tison; Alanna M Chamberlain; Mark J Pletcher; Shannon M Dunlay; Susan A Weston; Jill M Killian; Jeffrey E Olgin; Véronique L Roger Journal: Int J Med Inform Date: 2018-09-19 Impact factor: 4.046
Authors: Michael R Mathis; Milo C Engoren; Hyeon Joo; Michael D Maile; Keith D Aaronson; Michael L Burns; Michael W Sjoding; Nicholas J Douville; Allison M Janda; Yaokun Hu; Kayvan Najarian; Sachin Kheterpal Journal: Anesth Analg Date: 2020-05 Impact factor: 5.108
Authors: Lina M Brinker; Matthew C Konerman; Pedram Navid; Michael P Dorsch; Jennifer McNamara; Cristen J Willer; Mary E Tinetti; Scott L Hummel; Parag Goyal Journal: Am J Med Date: 2020-08-18 Impact factor: 4.965