Literature DB >> 29295153

Unsupervised Abbreviation Expansion in Clinical Narratives.

Michel Oleynik1, Markus Kreuzthaler1, Stefan Schulz1.   

Abstract

Clinical narratives are typically produced under time pressure, which incites the use of abbreviations and acronyms. To expand such short forms in a correct way eases text comprehension and further semantic processing. We propose a completely unsupervised and data-driven algorithm for the resolution of non-lexicalised and potentially ambiguous abbreviations. Based on the lookup of word bigrams and unigrams extracted from a corpus of 30,000 pseudonymised cardiology reports in German, our method achieved an F<inf>1</inf> score of 0.91, evaluated with a test set of 200 text excerpts. The results are statistically significantly better (p &lt; 0.001) than a baseline approach and show that a simple and domain-independent strategy may be enough to resolve abbreviations when a large corpus of similar texts is available. Further work is needed to combine this strategy with sentence and abbreviation detection modules, to adapt it to acronym resolution and to evaluate it with different datasets.

Keywords:  Electronic Health Records; Natural Language Processing

Mesh:

Year:  2017        PMID: 29295153

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  2 in total

1.  Improving the Path from Diagnoses to Documentation: A Cognitive Review Tool for Clinical Notes and Administrative Records.

Authors:  Yufan Guo; Joy Wu; Tyler Baldwin; David Beymer; Vandana V Mukherjee; Tanveer F Syeda-Mahmood
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

2.  Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.

Authors:  Denis Newman-Griffis; Guy Divita; Bart Desmet; Ayah Zirikly; Carolyn P Rosé; Eric Fosler-Lussier
Journal:  J Am Med Inform Assoc       Date:  2021-03-01       Impact factor: 4.497

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.