Literature DB >> 25705556

Challenges and practical approaches with word sense disambiguation of acronyms and abbreviations in the clinical domain.

Sungrim Moon1, Bridget McInnes2, Genevieve B Melton3.   

Abstract

OBJECTIVES: Although acronyms and abbreviations in clinical text are used widely on a daily basis, relatively little research has focused upon word sense disambiguation (WSD) of acronyms and abbreviations in the healthcare domain. Since clinical notes have distinctive characteristics, it is unclear whether techniques effective for acronym and abbreviation WSD from biomedical literature are sufficient.
METHODS: The authors discuss feature selection for automated techniques and challenges with WSD of acronyms and abbreviations in the clinical domain.
RESULTS: There are significant challenges associated with the informal nature of clinical text, such as typographical errors and incomplete sentences; difficulty with insufficient clinical resources, such as clinical sense inventories; and obstacles with privacy and security for conducting research with clinical text. Although we anticipated that using sophisticated techniques, such as biomedical terminologies, semantic types, part-of-speech, and language modeling, would be needed for feature selection with automated machine learning approaches, we found instead that simple techniques, such as bag-of-words, were quite effective in many cases. Factors, such as majority sense prevalence and the degree of separateness between sense meanings, were also important considerations.
CONCLUSIONS: The first lesson is that a comprehensive understanding of the unique characteristics of clinical text is important for automatic acronym and abbreviation WSD. The second lesson learned is that investigators may find that using simple approaches is an effective starting point for these tasks. Finally, similar to other WSD tasks, an understanding of baseline majority sense rates and separateness between senses is important. Further studies and practical solutions are needed to better address these issues.

Entities:  

Keywords:  Abbreviations as Topic; Artificial Intelligence; Automated Pattern Recognition; Medical Records; Natural Language Processing

Year:  2015        PMID: 25705556      PMCID: PMC4330198          DOI: 10.4258/hir.2015.21.1.35

Source DB:  PubMed          Journal:  Healthc Inform Res        ISSN: 2093-3681


  19 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  Evaluating the UMLS as a source of lexical knowledge for medical language processing.

Authors:  C Friedman; H Liu; L Shagina; S Johnson; G Hripcsak
Journal:  Proc AMIA Symp       Date:  2001

3.  Aggregating UMLS semantic types for reducing conceptual complexity.

Authors:  A T McCray; A Burgun; O Bodenreider
Journal:  Stud Health Technol Inform       Date:  2001

4.  The sublanguage of cross-coverage.

Authors:  Peter D Stetson; Stephen B Johnson; Matthew Scotch; George Hripcsak
Journal:  Proc AMIA Symp       Date:  2002

5.  Abbreviation and acronym disambiguation in clinical discourse.

Authors:  Sergeui Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2005

6.  A comparative study of supervised learning as applied to acronym expansion in clinical reports.

Authors:  Mahesh Joshi; Serguei Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2006

7.  ADAM: another database of abbreviations in MEDLINE.

Authors:  Wei Zhou; Vetle I Torvik; Neil R Smalheiser
Journal:  Bioinformatics       Date:  2006-09-18       Impact factor: 6.937

8.  A study of abbreviations in clinical notes.

Authors:  Hua Xu; Peter D Stetson; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

Review 9.  Medical abbreviations: writing little and communicating less.

Authors:  Kathleen E Walsh; Jerry H Gurwitz
Journal:  Arch Dis Child       Date:  2008-10       Impact factor: 3.791

10.  Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

Authors:  Hua Xu; Marianthi Markatou; Rositsa Dimova; Hongfang Liu; Carol Friedman
Journal:  BMC Bioinformatics       Date:  2006-07-05       Impact factor: 3.169

View more
  8 in total

1.  Towards Comprehensive Clinical Abbreviation Disambiguation Using Machine-Labeled Training Data.

Authors:  Gregory P Finley; Serguei V S Pakhomov; Reed McEwan; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Differentiating Sense through Semantic Interaction Data.

Authors:  T Elizabeth Workman; Charlene Weir; Thomas C Rindflesch
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

3.  Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2016-11-10

4.  Augmented intelligence with natural language processing applied to electronic health records for identifying patients with non-alcoholic fatty liver disease at risk for disease progression.

Authors:  Tielman T Van Vleck; Lili Chan; Steven G Coca; Catherine K Craven; Ron Do; Stephen B Ellis; Joseph L Kannry; Ruth J F Loos; Peter A Bonis; Judy Cho; Girish N Nadkarni
Journal:  Int J Med Inform       Date:  2019-07-06       Impact factor: 4.046

5.  A method for harmonization of clinical abbreviation and acronym sense inventories.

Authors:  Lisa V Grossman; Elliot G Mitchell; George Hripcsak; Chunhua Weng; David K Vawdrey
Journal:  J Biomed Inform       Date:  2018-11-07       Impact factor: 6.317

6.  Automated Mapping of Real-world Oncology Laboratory Data to LOINC.

Authors:  Jonathan Kelly; Chen Wang; Jianyi Zhang; Spandan Das; Anna Ren; Pradnya Warnekar
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

7.  Automated extraction of Biomarker information from pathology reports.

Authors:  Jeongeun Lee; Hyun-Je Song; Eunsil Yoon; Seong-Bae Park; Sung-Hye Park; Jeong-Wook Seo; Peom Park; Jinwook Choi
Journal:  BMC Med Inform Decis Mak       Date:  2018-05-21       Impact factor: 2.796

8.  A deep database of medical abbreviations and acronyms for natural language processing.

Authors:  Lisa Grossman Liu; Raymond H Grossman; Elliot G Mitchell; Chunhua Weng; Karthik Natarajan; George Hripcsak; David K Vawdrey
Journal:  Sci Data       Date:  2021-06-02       Impact factor: 6.444

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.