Literature DB >> 35925971

Multi-label classification of symptom terms from free-text bilingual adverse drug reaction reports using natural language processing.

Sitthichok Chaichulee1,2, Chissanupong Promchai3, Tanyamai Kaewkomon3, Chanon Kongkamol4,2, Thammasin Ingviya4,2, Pasuree Sangsupawanich5.   

Abstract

Allergic reactions to medication range from mild to severe or even life-threatening. Proper documentation of patient allergy information is critical for safe prescription, avoiding drug interactions, and reducing healthcare costs. Allergy information is regularly obtained during the medical interview, but is often poorly documented in electronic health records (EHRs). While many EHRs allow for structured adverse drug reaction (ADR) reporting, a free-text entry is still common. The resulting information is neither interoperable nor easily reusable for other applications, such as clinical decision support systems and prescription alerts. Current approaches require pharmacists to review and code ADRs documented by healthcare professionals. Recently, the effectiveness of machine algorithms in natural language processing (NLP) has been widely demonstrated. Our study aims to develop and evaluate different NLP algorithms that can encode unstructured ADRs stored in EHRs into institutional symptom terms. Our dataset consists of 79,712 pharmacist-reviewed drug allergy records. We evaluated three NLP techniques: Naive Bayes-Support Vector Machine (NB-SVM), Universal Language Model Fine-tuning (ULMFiT), and Bidirectional Encoder Representations from Transformers (BERT). We tested different general-domain pre-trained BERT models, including mBERT, XLM-RoBERTa, and WanchanBERTa, as well as our domain-specific AllergyRoBERTa, which was pre-trained from scratch on our corpus. Overall, BERT models had the highest performance. NB-SVM outperformed ULMFiT and BERT for several symptom terms that are not frequently coded. The ensemble model achieved an exact match ratio of 95.33%, a F1 score of 98.88%, and a mean average precision of 97.07% for the 36 most frequently coded symptom terms. The model was then further developed into a symptom term suggestion system and achieved a Krippendorff's alpha agreement coefficient of 0.7081 in prospective testing with pharmacists. Some degree of automation could both accelerate the availability of allergy information and reduce the efforts for human coding.

Entities:  

Mesh:

Year:  2022        PMID: 35925971      PMCID: PMC9352066          DOI: 10.1371/journal.pone.0270595

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


  17 in total

1.  Focal Loss for Dense Object Detection.

Authors:  Tsung-Yi Lin; Priya Goyal; Ross Girshick; Kaiming He; Piotr Dollar
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-07-23       Impact factor: 6.226

Review 2.  Epidemiology and risk factors for drug allergy.

Authors:  Bernard Y-H Thong; Teck-Choon Tan
Journal:  Br J Clin Pharmacol       Date:  2011-05       Impact factor: 4.335

Review 3.  Drug allergy.

Authors:  Paul A Greenberger
Journal:  Allergy Asthma Proc       Date:  2019-11-01       Impact factor: 2.587

Review 4.  Deep learning in clinical natural language processing: a methodical review.

Authors:  Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

5.  Identifying symptom groups from Emergency Department presenting complaint free text using SNOMED CT.

Authors:  Amol S Wagholikar; Michael J Lawley; David P Hansen; Kevin Chu
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

6.  Automated identification of drug and food allergies entered using non-standard terminology.

Authors:  Richard H Epstein; Paul St Jacques; Michael Stockin; Brian Rothman; Jesse M Ehrenfeld; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-06-07       Impact factor: 4.497

7.  Deep Natural Language Processing to Identify Symptom Documentation in Clinical Notes for Patients With Heart Failure Undergoing Cardiac Resynchronization Therapy.

Authors:  Richard E Leiter; Enrico Santus; Zhijing Jin; Katherine C Lee; Miryam Yusufov; Isabel Chien; Ashwin Ramaswamy; Edward T Moseley; Yujie Qian; Deborah Schrag; Charlotta Lindvall
Journal:  J Pain Symptom Manage       Date:  2020-06-22       Impact factor: 3.612

8.  Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review.

Authors:  Vera Sorin; Yiftach Barash; Eli Konen; Eyal Klang
Journal:  J Am Coll Radiol       Date:  2020-01-28       Impact factor: 5.532

9.  Benchmarking for biomedical natural language processing tasks with a domain specific ALBERT.

Authors:  Usman Naseem; Adam G Dunn; Matloob Khushi; Jinman Kim
Journal:  BMC Bioinformatics       Date:  2022-04-21       Impact factor: 3.307

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.