Literature DB >> 36168546

Natural Language Processing and Machine Learning to Identify People Who Inject Drugs in Electronic Health Records.

David Goodman-Meza1,2, Amber Tang3, Babak Aryanfar2, Sergio Vazquez4, Adam J Gordon5,6, Michihiko Goto7,8, Matthew Bidwell Goetz2,3, Steven Shoptaw9, Alex A T Bui10.   

Abstract

Background: Improving the identification of people who inject drugs (PWID) in electronic medical records can improve clinical decision making, risk assessment and mitigation, and health service research. Identification of PWID currently consists of heterogeneous, nonspecific International Classification of Diseases (ICD) codes as proxies. Natural language processing (NLP) and machine learning (ML) methods may have better diagnostic metrics than nonspecific ICD codes for identifying PWID.
Methods: We manually reviewed 1000 records of patients diagnosed with Staphylococcus aureus bacteremia admitted to Veterans Health Administration hospitals from 2003 through 2014. The manual review was the reference standard. We developed and trained NLP/ML algorithms with and without regular expression filters for negation (NegEx) and compared these with 11 proxy combinations of ICD codes to identify PWID. Data were split 70% for training and 30% for testing. We calculated diagnostic metrics and estimated 95% confidence intervals (CIs) by bootstrapping the hold-out test set. Best models were determined by best F-score, a summary of sensitivity and positive predictive value.
Results: Random forest with and without NegEx were the best-performing NLP/ML algorithms in the training set. Random forest with NegEx outperformed all ICD-based algorithms. F-score for the best NLP/ML algorithm was 0.905 (95% CI, .786-.967) and 0.592 (95% CI, .550-.632) for the best ICD-based algorithm. The NLP/ML algorithm had a sensitivity of 92.6% and specificity of 95.4%. Conclusions: NLP/ML outperformed ICD-based coding algorithms at identifying PWID in electronic health records. NLP/ML models should be considered in identifying cohorts of PWID to improve clinical decision making, health services research, and administrative surveillance. Published by Oxford University Press on behalf of Infectious Diseases Society of America 2022.

Entities:  

Keywords:  EHR; NLP; PWID; machine learning

Year:  2022        PMID: 36168546      PMCID: PMC9511274          DOI: 10.1093/ofid/ofac471

Source DB:  PubMed          Journal:  Open Forum Infect Dis        ISSN: 2328-8957            Impact factor:   4.423


  40 in total

1.  Detecting Opioid-Related Aberrant Behavior using Natural Language Processing.

Authors:  Jesse M Lingeman; Priscilla Wang; William Becker; Hong Yu
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.

Authors:  Saeed Mehrabi; Anand Krishnan; Sunghwan Sohn; Alexandra M Roch; Heidi Schmidt; Joe Kesterson; Chris Beesley; Paul Dexter; C Max Schmidt; Hongfang Liu; Mathew Palakal
Journal:  J Biomed Inform       Date:  2015-03-16       Impact factor: 6.317

3.  Validation of an Algorithm to Identify Infective Endocarditis in People Who Inject Drugs.

Authors:  Laura J Ball; Adeel Sherazi; Dora Laczko; Kaveri Gupta; Sharon Koivu; Matthew A Weir; Tina Mele; Rommel Tirona; John K McCormick; Michael Silverman
Journal:  Med Care       Date:  2018-10       Impact factor: 2.983

4.  Association of Evidence-Based Care Processes With Mortality in Staphylococcus aureus Bacteremia at Veterans Health Administration Hospitals, 2003-2014.

Authors:  Michihiko Goto; Marin L Schweizer; Mary S Vaughan-Sarrazin; Eli N Perencevich; Daniel J Livorsi; Daniel J Diekema; Kelly K Richardson; Brice F Beck; Bruce Alexander; Michael E Ohl
Journal:  JAMA Intern Med       Date:  2017-10-01       Impact factor: 21.873

5.  Estimated number of injection-involved drug overdose deaths, United States, 2000 - 2018.

Authors:  Eric W Hall; Eli S Rosenberg; Christopher M Jones; Alice Asher; Eduardo Valverde; Heather Bradley
Journal:  Drug Alcohol Depend       Date:  2022-03-26       Impact factor: 4.852

6.  Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement.

Authors:  Gary S Collins; Johannes B Reitsma; Douglas G Altman; Karel G M Moons
Journal:  Ann Intern Med       Date:  2015-01-06       Impact factor: 25.391

7.  Validity of International Classification of Diseases codes in identifying illicit drug use target conditions using medical record data as a reference standard: A systematic review.

Authors:  Kaitlin M McGrew; Juell B Homco; Tabitha Garwe; Hanh Dung Dao; Mary B Williams; Douglas A Drevets; S Reza Jafarzadeh; Yan Daniel Zhao; Hélène Carabin
Journal:  Drug Alcohol Depend       Date:  2019-12-23       Impact factor: 4.852

8.  Increasing Infectious Endocarditis Admissions Among Young People Who Inject Drugs.

Authors:  Alysse G Wurcel; Jordan E Anderson; Kenneth K H Chui; Sally Skinner; Tamsin A Knox; David R Snydman; Thomas J Stopka
Journal:  Open Forum Infect Dis       Date:  2016-07-26       Impact factor: 3.835

9.  Effect of a Predictive Analytics-Targeted Program in Patients on Opioids: a Stepped-Wedge Cluster Randomized Controlled Trial.

Authors:  Kiersten L Strombotne; Aaron Legler; Taeko Minegishi; Jodie A Trafton; Elizabeth M Oliva; Eleanor T Lewis; Pooja Sohoni; Melissa M Garrido; Steven D Pizer; Austin B Frakt
Journal:  J Gen Intern Med       Date:  2022-05-02       Impact factor: 6.473

10.  Validity of ICD-based algorithms to estimate the prevalence of injection drug use among infective endocarditis hospitalizations in the absence of a reference standard.

Authors:  Kaitlin M McGrew; Hélène Carabin; Tabitha Garwe; S Reza Jafarzadeh; Mary B Williams; Yan Daniel Zhao; Douglas A Drevets
Journal:  Drug Alcohol Depend       Date:  2020-03-04       Impact factor: 4.852

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.