Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Natural Language Processing and Machine Learning to Identify People Who Inject Drugs in Electronic Health Records.

Literature DB >> 36168546

Natural Language Processing and Machine Learning to Identify People Who Inject Drugs in Electronic Health Records.

David Goodman-Meza^1,2, Amber Tang³, Babak Aryanfar², Sergio Vazquez⁴, Adam J Gordon^5,6, Michihiko Goto^7,8, Matthew Bidwell Goetz^2,3, Steven Shoptaw⁹, Alex A T Bui¹⁰.

Abstract

Background: Improving the identification of people who inject drugs (PWID) in electronic medical records can improve clinical decision making, risk assessment and mitigation, and health service research. Identification of PWID currently consists of heterogeneous, nonspecific International Classification of Diseases (ICD) codes as proxies. Natural language processing (NLP) and machine learning (ML) methods may have better diagnostic metrics than nonspecific ICD codes for identifying PWID.
Methods: We manually reviewed 1000 records of patients diagnosed with Staphylococcus aureus bacteremia admitted to Veterans Health Administration hospitals from 2003 through 2014. The manual review was the reference standard. We developed and trained NLP/ML algorithms with and without regular expression filters for negation (NegEx) and compared these with 11 proxy combinations of ICD codes to identify PWID. Data were split 70% for training and 30% for testing. We calculated diagnostic metrics and estimated 95% confidence intervals (CIs) by bootstrapping the hold-out test set. Best models were determined by best F-score, a summary of sensitivity and positive predictive value.
Results: Random forest with and without NegEx were the best-performing NLP/ML algorithms in the training set. Random forest with NegEx outperformed all ICD-based algorithms. F-score for the best NLP/ML algorithm was 0.905 (95% CI, .786-.967) and 0.592 (95% CI, .550-.632) for the best ICD-based algorithm. The NLP/ML algorithm had a sensitivity of 92.6% and specificity of 95.4%. Conclusions: NLP/ML outperformed ICD-based coding algorithms at identifying PWID in electronic health records. NLP/ML models should be considered in identifying cohorts of PWID to improve clinical decision making, health services research, and administrative surveillance. Published by Oxford University Press on behalf of Infectious Diseases Society of America 2022.

Entities: Chemical

Keywords: EHR; NLP; PWID; machine learning

Year: 2022 PMID： 36168546 PMCID： PMC9511274 DOI： 10.1093/ofid/ofac471

Source DB: PubMed Journal: Open Forum Infect Dis ISSN： 2328-8957 Impact factor: 4.423

Keyword Cloud
References

40 in total

1. Detecting Opioid-Related Aberrant Behavior using Natural Language Processing.

Authors: Jesse M Lingeman; Priscilla Wang; William Becker; Hong Yu
Journal: AMIA Annu Symp Proc Date: 2018-04-16

2. DEEPEN: A negation detection system for clinical text incorporating dependency relation into NegEx.

Authors: Saeed Mehrabi; Anand Krishnan; Sunghwan Sohn; Alexandra M Roch; Heidi Schmidt; Joe Kesterson; Chris Beesley; Paul Dexter; C Max Schmidt; Hongfang Liu; Mathew Palakal
Journal: J Biomed Inform Date: 2015-03-16 Impact factor: 6.317

3. Validation of an Algorithm to Identify Infective Endocarditis in People Who Inject Drugs.

Authors: Laura J Ball; Adeel Sherazi; Dora Laczko; Kaveri Gupta; Sharon Koivu; Matthew A Weir; Tina Mele; Rommel Tirona; John K McCormick; Michael Silverman
Journal: Med Care Date: 2018-10 Impact factor: 2.983

4. Association of Evidence-Based Care Processes With Mortality in Staphylococcus aureus Bacteremia at Veterans Health Administration Hospitals, 2003-2014.

Authors: Michihiko Goto; Marin L Schweizer; Mary S Vaughan-Sarrazin; Eli N Perencevich; Daniel J Livorsi; Daniel J Diekema; Kelly K Richardson; Brice F Beck; Bruce Alexander; Michael E Ohl
Journal: JAMA Intern Med Date: 2017-10-01 Impact factor: 21.873

5. Estimated number of injection-involved drug overdose deaths, United States, 2000 - 2018.

Authors: Eric W Hall; Eli S Rosenberg; Christopher M Jones; Alice Asher; Eduardo Valverde; Heather Bradley
Journal: Drug Alcohol Depend Date: 2022-03-26 Impact factor: 4.852

6. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): the TRIPOD statement.

Authors: Gary S Collins; Johannes B Reitsma; Douglas G Altman; Karel G M Moons
Journal: Ann Intern Med Date: 2015-01-06 Impact factor: 25.391

7. Validity of International Classification of Diseases codes in identifying illicit drug use target conditions using medical record data as a reference standard: A systematic review.

Authors: Kaitlin M McGrew; Juell B Homco; Tabitha Garwe; Hanh Dung Dao; Mary B Williams; Douglas A Drevets; S Reza Jafarzadeh; Yan Daniel Zhao; Hélène Carabin
Journal: Drug Alcohol Depend Date: 2019-12-23 Impact factor: 4.852

8. Increasing Infectious Endocarditis Admissions Among Young People Who Inject Drugs.

Authors: Alysse G Wurcel; Jordan E Anderson; Kenneth K H Chui; Sally Skinner; Tamsin A Knox; David R Snydman; Thomas J Stopka
Journal: Open Forum Infect Dis Date: 2016-07-26 Impact factor: 3.835

9. Effect of a Predictive Analytics-Targeted Program in Patients on Opioids: a Stepped-Wedge Cluster Randomized Controlled Trial.

Authors: Kiersten L Strombotne; Aaron Legler; Taeko Minegishi; Jodie A Trafton; Elizabeth M Oliva; Eleanor T Lewis; Pooja Sohoni; Melissa M Garrido; Steven D Pizer; Austin B Frakt
Journal: J Gen Intern Med Date: 2022-05-02 Impact factor: 6.473

10. Validity of ICD-based algorithms to estimate the prevalence of injection drug use among infective endocarditis hospitalizations in the absence of a reference standard.

Authors: Kaitlin M McGrew; Hélène Carabin; Tabitha Garwe; S Reza Jafarzadeh; Mary B Williams; Yan Daniel Zhao; Douglas A Drevets
Journal: Drug Alcohol Depend Date: 2020-03-04 Impact factor: 4.852