Jeffrey L Greenwald1, Patrick R Cronin, Victoria Carballo, Goodarz Danaei, Garry Choy. 1. *Department of Medicine, Core Educator Faculty †Laboratory of Computer Science, Massachusetts General Hospital, Boston ‡Partners HealthCare, Needham §Departments of Global Health and Population and Epidemiology, Harvard TH Chan School of Public Health ∥QPID Informatics and Massachusetts General Physicians Organization, Massachusetts General Hospital, Boston, MA.
Abstract
BACKGROUND: With the increasing focus on reducing hospital readmissions in the United States, numerous readmissions risk prediction models have been proposed, mostly developed through analyses of structured data fields in electronic medical records and administrative databases. Three areas that may have an impact on readmission but are poorly captured using structured data sources are patients' physical function, cognitive status, and psychosocial environment and support. OBJECTIVE OF THE STUDY: The objective of the study was to build a discriminative model using information germane to these 3 areas to identify hospitalized patients' risk for 30-day all cause readmissions. RESEARCH DESIGN: We conducted clinician focus groups to identify language used in the clinical record regarding these 3 areas. We then created a dataset including 30,000 inpatients, 10,000 from each of 3 hospitals, and searched those records for the focus group-derived language using natural language processing. A 30-day readmission prediction model was developed on 75% of the dataset and validated on the other 25% and also on hospital specific subsets. RESULTS: Focus group language was aggregated into 35 variables. The final model had 16 variables, a validated C-statistic of 0.74, and was well calibrated. Subset validation of the model by hospital yielded C-statistics of 0.70-0.75. CONCLUSIONS: Deriving a 30-day readmission risk prediction model through identification of physical, cognitive, and psychosocial issues using natural language processing yielded a model that performs similarly to the better performing models previously published with the added advantage of being based on clinically relevant factors and also automated and scalable. Because of the clinical relevance of the variables in the model, future research may be able to test if targeting interventions to identified risks results in reductions in readmissions.
BACKGROUND: With the increasing focus on reducing hospital readmissions in the United States, numerous readmissions risk prediction models have been proposed, mostly developed through analyses of structured data fields in electronic medical records and administrative databases. Three areas that may have an impact on readmission but are poorly captured using structured data sources are patients' physical function, cognitive status, and psychosocial environment and support. OBJECTIVE OF THE STUDY: The objective of the study was to build a discriminative model using information germane to these 3 areas to identify hospitalized patients' risk for 30-day all cause readmissions. RESEARCH DESIGN: We conducted clinician focus groups to identify language used in the clinical record regarding these 3 areas. We then created a dataset including 30,000 inpatients, 10,000 from each of 3 hospitals, and searched those records for the focus group-derived language using natural language processing. A 30-day readmission prediction model was developed on 75% of the dataset and validated on the other 25% and also on hospital specific subsets. RESULTS: Focus group language was aggregated into 35 variables. The final model had 16 variables, a validated C-statistic of 0.74, and was well calibrated. Subset validation of the model by hospital yielded C-statistics of 0.70-0.75. CONCLUSIONS: Deriving a 30-day readmission risk prediction model through identification of physical, cognitive, and psychosocial issues using natural language processing yielded a model that performs similarly to the better performing models previously published with the added advantage of being based on clinically relevant factors and also automated and scalable. Because of the clinical relevance of the variables in the model, future research may be able to test if targeting interventions to identified risks results in reductions in readmissions.
Authors: Ank E Nijhawan; Lisa R Metsch; Song Zhang; Daniel J Feaster; Lauren Gooden; Mamta K Jain; Robrina Walker; Shannon Huffaker; Michael J Mugavero; Petra Jacobs; Wendy S Armstrong; Eric S Daar; Meg Sullivan; Carlos Del Rio; Ethan A Halm Journal: J Acquir Immune Defic Syndr Date: 2019-03-01 Impact factor: 3.731
Authors: Tomasz Oliwa; Brian Furner; Jessica Schmitt; John Schneider; Jessica P Ridgway Journal: J Am Med Inform Assoc Date: 2021-01-15 Impact factor: 4.497
Authors: David A Dorr; Ana R Quiñones; Taylor King; Melissa Y Wei; Kellee White; Cosmin A Bejan Journal: Med Care Date: 2022-06-04 Impact factor: 3.178
Authors: Matthew E Dupre; Alicia Nelson; Scott M Lynch; Bradi B Granger; Hanzhang Xu; Janese M Willis; Lesley H Curtis; Eric D Peterson Journal: JMIR Res Protoc Date: 2017-06-15
Authors: Ruth M Reeves; Lee Christensen; Jeremiah R Brown; Michael Conway; Maxwell Levis; Glenn T Gobbel; Rashmee U Shah; Christine Goodrich; Iben Ricket; Freneka Minter; Andrew Bohm; Bruce E Bray; Michael E Matheny; Wendy Chapman Journal: J Biomed Inform Date: 2021-06-24 Impact factor: 8.000