Literature DB >> 30118854

A convolutional route to abbreviation disambiguation in clinical text.

Venkata Joopudi1, Bharath Dandala1, Murthy Devarakonda2.   

Abstract

OBJECTIVE: Abbreviations sense disambiguation is a special case of word sense disambiguation. Machine learning methods based on neural networks showed promising results for word sense disambiguation (Festag and Spreckelsen, 2017) [1] and, here we assess their effectiveness for abbreviation sense disambiguation.
METHODS: Convolutional Neural Network (CNN) models were trained, one for each abbreviation, to disambiguate abbreviation senses. A reverse substitution (of long forms with short forms) method from a previous study was used on clinical narratives from Cleveland Clinic, USA, to auto-generate training data. Accuracy of the CNN and traditional Support Vector Machine (SVM) models were studied using: (a) 5-fold cross validation on the auto-generated training data; (b) a manually created, set-aside gold standard; and (c) 10-fold cross validation on a publicly available dataset from a previous study.
RESULTS: CNN improved accuracy by 1-4 percentage points on all the three datasets compared to SVM, and the improvement was the most for the set-aside dataset. The improvement was statistically significant at p < 0.05 on the auto-generated dataset. We found that for some common abbreviations, sense distributions mismatch between the test and auto generated training data, and mitigating the mismatch significantly improved the model accuracy.
CONCLUSION: The neural network models work well in disambiguating abbreviations in clinical narratives, and they are robust across datasets. This avoids feature-engineering for each dataset. Coupled with an enhanced auto-training data generation, neural networks can simplify development of a practical abbreviation disambiguation system.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2018        PMID: 30118854     DOI: 10.1016/j.jbi.2018.07.025

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  6 in total

Review 1.  Deep learning in clinical natural language processing: a methodical review.

Authors:  Stephen Wu; Kirk Roberts; Surabhi Datta; Jingcheng Du; Zongcheng Ji; Yuqi Si; Sarvesh Soni; Qiong Wang; Qiang Wei; Yang Xiang; Bo Zhao; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2020-03-01       Impact factor: 4.497

2.  DI++: A deep learning system for patient condition identification in clinical notes.

Authors:  Jinhe Shi; Xiangyu Gao; William C Kinsman; Chenyu Ha; Guodong Gordon Gao; Yi Chen
Journal:  Artif Intell Med       Date:  2021-12-02       Impact factor: 5.326

3.  Reducing Physicians' Cognitive Load During Chart Review: A Problem-Oriented Summary of the Patient Electronic Record.

Authors:  Jennifer J Liang; Ching-Huei Tsou; Bharath Dandala; Ananya Poddar; Venkata Joopudi; Diwakar Mahajan; John Prager; Preethi Raghavan; Michele Payne
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

4.  Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.

Authors:  Denis Newman-Griffis; Guy Divita; Bart Desmet; Ayah Zirikly; Carolyn P Rosé; Eric Fosler-Lussier
Journal:  J Am Med Inform Assoc       Date:  2021-03-01       Impact factor: 4.497

5.  The Revival of the Notes Field: Leveraging the Unstructured Content in Electronic Health Records.

Authors:  Michela Assale; Linda Greta Dui; Andrea Cina; Andrea Seveso; Federico Cabitza
Journal:  Front Med (Lausanne)       Date:  2019-04-17

6.  Disambiguating Clinical Abbreviations Using a One-Fits-All Classifier Based on Deep Learning Techniques.

Authors:  Areej Jaber; Paloma Martínez
Journal:  Methods Inf Med       Date:  2022-02-01       Impact factor: 1.800

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.