| Literature DB >> 33936461 |
Avijit Mitra1, Bhanu Pratap Singh Rawat1, David McManus2, Alok Kapoor2, Hong Yu1,3,2,4.
Abstract
A bleeding event is a common adverse drug reaction amongst patients on anticoagulation and factors critically into a clinician's decision to prescribe or continue anticoagulation for atrial fibrillation. However, bleeding events are not uniformly captured in the administrative data of electronic health records (EHR). As manual review is prohibitively expensive, we investigate the effectiveness of various natural language processing (NLP) methods for automatic extraction of bleeding events. Using our expert-annotated 1,079 de-identified EHR notes, we evaluated state-of-the-art NLP models such as biLSTM-CRF with language modeling, and different BERT variants for six entity types. On our dataset, the biLSTM-CRF surpassed other models resulting in a macro F1-score of 0.75 whereas the performance difference is negligible for sentence and document-level predictions with the best macro F1-scores of 0.84 and 0.96, respectively. Our error analyses suggest that the models' incorrect predictions can be attributed to variability in entity spans, memorization, and missing negation signals. ©2020 AMIA - All rights reserved.Entities:
Mesh:
Year: 2021 PMID: 33936461 PMCID: PMC8075442
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076