Daniel To1, Brihat Sharma2, Niranjan Karnik3, Cara Joyce4, Dmitriy Dligach5, Majid Afshar6. 1. Stritch School of Medicine, Loyola University Chicago, Maywood, IL, USA. 2. Department of Computer Science, Loyola University Chicago, Chicago, IL, USA. 3. Department of Psychiatry & Behavioral Sciences, Rush Medical College, Rush University, Chicago, IL, USA. 4. Stritch School of Medicine, Loyola University Chicago, Maywood, IL, USA; Department of Public Health Sciences, Loyola University Chicago, Maywood, IL, USA; Center for Health Outcomes and Informatics Research, Loyola University Chicago, Maywood, IL, USA. 5. Department of Computer Science, Loyola University Chicago, Chicago, IL, USA; Department of Public Health Sciences, Loyola University Chicago, Maywood, IL, USA; Center for Health Outcomes and Informatics Research, Loyola University Chicago, Maywood, IL, USA. 6. Stritch School of Medicine, Loyola University Chicago, Maywood, IL, USA; Department of Public Health Sciences, Loyola University Chicago, Maywood, IL, USA; Center for Health Outcomes and Informatics Research, Loyola University Chicago, Maywood, IL, USA. Electronic address: majid.afshar@lumc.edu.
Abstract
BACKGROUND: Current modes of identifying alcohol misuse in hospitalized patients rely on self-report questionnaires and diagnostic codes that have limitations, including low sensitivity. Information in the clinical notes of the electronic health record (EHR) may further augment the identification of alcohol misuse. Natural language processing (NLP) with supervised machine learning has been successful at analyzing clinical notes and identifying cases of alcohol misuse in trauma patients. METHODS: An alcohol misuse NLP classifier, previously developed on trauma patients who completed the Alcohol Use Disorders Identification Test, was validated in a cohort of 1000 hospitalized patients at a large, tertiary health system between January 1, 2007 and September 1, 2017. The clinical notes were processed using the clinical Text Analysis and Knowledge Extraction System. The National Institute on Alcohol Abuse and Alcoholism (NIAAA) guidelines for alcohol misuse were used during annotation of the medical records in our validation dataset. RESULTS: The alcohol misuse classifier had an area under the receiver operating characteristic curve of 0.91 (95% CI 0.90-0.93) in the cohort of hospitalized patients. The sensitivity, specificity, positive predictive value, and negative predictive value were 0.88 (95% CI 0.85-0.90), 0.78 (95% CI 0.74-0.82), 0.85 (95% CI 0.82-0.87), and 0.82 (95% CI 0.78-0.86), respectively. The Hosmer-Lemeshow Test (p = 0.13) demonstrates good model fit. Additionally, there was a dose-dependent response in alcohol consumption behaviors across increasing strata of predicted probabilities for alcohol misuse. CONCLUSION: The alcohol misuse NLP classifier had good discrimination and test characteristics in hospitalized patients. An approach using the clinical notes with NLP and supervised machine learning may better identify alcohol misuse cases than conventional methods solely relying on billing diagnostic codes.
BACKGROUND: Current modes of identifying alcohol misuse in hospitalizedpatients rely on self-report questionnaires and diagnostic codes that have limitations, including low sensitivity. Information in the clinical notes of the electronic health record (EHR) may further augment the identification of alcohol misuse. Natural language processing (NLP) with supervised machine learning has been successful at analyzing clinical notes and identifying cases of alcohol misuse in traumapatients. METHODS: An alcohol misuse NLP classifier, previously developed on traumapatients who completed the Alcohol Use Disorders Identification Test, was validated in a cohort of 1000 hospitalized patients at a large, tertiary health system between January 1, 2007 and September 1, 2017. The clinical notes were processed using the clinical Text Analysis and Knowledge Extraction System. The National Institute on Alcohol Abuse and Alcoholism (NIAAA) guidelines for alcohol misuse were used during annotation of the medical records in our validation dataset. RESULTS: The alcohol misuse classifier had an area under the receiver operating characteristic curve of 0.91 (95% CI 0.90-0.93) in the cohort of hospitalized patients. The sensitivity, specificity, positive predictive value, and negative predictive value were 0.88 (95% CI 0.85-0.90), 0.78 (95% CI 0.74-0.82), 0.85 (95% CI 0.82-0.87), and 0.82 (95% CI 0.78-0.86), respectively. The Hosmer-Lemeshow Test (p = 0.13) demonstrates good model fit. Additionally, there was a dose-dependent response in alcohol consumption behaviors across increasing strata of predicted probabilities for alcohol misuse. CONCLUSION: The alcohol misuse NLP classifier had good discrimination and test characteristics in hospitalizedpatients. An approach using the clinical notes with NLP and supervised machine learning may better identify alcohol misuse cases than conventional methods solely relying on billing diagnostic codes.
Authors: Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute Journal: J Am Med Inform Assoc Date: 2010 Sep-Oct Impact factor: 4.497
Authors: Yan Wang; Elizabeth S Chen; Serguei Pakhomov; Elliot Arsoniadis; Elizabeth W Carter; Elizabeth Lindemann; Indra Neil Sarkar; Genevieve B Melton Journal: AMIA Annu Symp Proc Date: 2015-11-05
Authors: Beatriz Rosón; Rafael Monte; Rocío Gamallo; Ruben Puerta; Antonio Zapatero; Joaquín Fernández-Solá; Isabel Pastor; José A Girón; Javier Laso Journal: Eur J Intern Med Date: 2010-05-20 Impact factor: 4.487
Authors: Majid Afshar; Andrew Phillips; Niranjan Karnik; Jeanne Mueller; Daniel To; Richard Gonzalez; Ron Price; Richard Cooper; Cara Joyce; Dmitriy Dligach Journal: J Am Med Inform Assoc Date: 2019-03-01 Impact factor: 4.497
Authors: Edward K Chang; Christine Y Yu; Robin Clarke; Andrew Hackbarth; Timothy Sanders; Eric Esrailian; Daniel W Hommes; Bruce A Runyon Journal: J Clin Gastroenterol Date: 2016 Nov/Dec Impact factor: 3.062
Authors: Chaitanya Shivade; Preethi Raghavan; Eric Fosler-Lussier; Peter J Embi; Noemie Elhadad; Stephen B Johnson; Albert M Lai Journal: J Am Med Inform Assoc Date: 2013-11-07 Impact factor: 4.497
Authors: Christophe Lemey; Aziliz Le Glaz; Yannis Haralambous; Deok-Hee Kim-Dufor; Philippe Lenca; Romain Billot; Taylor C Ryan; Jonathan Marsh; Jordan DeVylder; Michel Walter; Sofian Berrouiguet Journal: J Med Internet Res Date: 2021-05-04 Impact factor: 5.428
Authors: Braja G Patra; Mohit M Sharma; Veer Vekaria; Prakash Adekkanattu; Olga V Patterson; Benjamin Glicksberg; Lauren A Lepow; Euijung Ryu; Joanna M Biernacka; Al'ona Furmanchuk; Thomas J George; William Hogan; Yonghui Wu; Xi Yang; Jiang Bian; Myrna Weissman; Priya Wickramaratne; J John Mann; Mark Olfson; Thomas R Campion; Mark Weiner; Jyotishman Pathak Journal: J Am Med Inform Assoc Date: 2021-11-25 Impact factor: 7.942