Literature DB >> 22742938

A new clustering method for detecting rare senses of abbreviations in clinical notes.

Hua Xu1, Yonghui Wu, Noémie Elhadad, Peter D Stetson, Carol Friedman.   

Abstract

Abbreviations are widely used in clinical documents and they are often ambiguous. Building a list of possible senses (also called sense inventory) for each ambiguous abbreviation is the first step to automatically identify correct meanings of abbreviations in given contexts. Clustering based methods have been used to detect senses of abbreviations from a clinical corpus [1]. However, rare senses remain challenging and existing algorithms are not good enough to detect them. In this study, we developed a new two-phase clustering algorithm called Tight Clustering for Rare Senses (TCRS) and applied it to sense generation of abbreviations in clinical text. Using manually annotated sense inventories from a set of 13 ambiguous clinical abbreviations, we evaluated and compared TCRS with the existing Expectation Maximization (EM) clustering algorithm for sense generation, at two different levels of annotation cost (10 vs. 20 instances for each abbreviation). Our results showed that the TCRS-based method could detect 85% senses on average; while the EM-based method found only 75% senses, when similar annotation effort (about 20 instances) was used. Further analysis demonstrated that the improvement by the TCRS method was mainly from additionally detected rare senses, thus indicating its usefulness for building more complete sense inventories of clinical abbreviations.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22742938      PMCID: PMC3729222          DOI: 10.1016/j.jbi.2012.06.003

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  17 in total

1.  A broad-coverage natural language processing system.

Authors:  C Friedman
Journal:  Proc AMIA Symp       Date:  2000

2.  A study of abbreviations in the UMLS.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  Proc AMIA Symp       Date:  2001

3.  Evaluating the UMLS as a source of lexical knowledge for medical language processing.

Authors:  C Friedman; H Liu; L Shagina; S Johnson; G Hripcsak
Journal:  Proc AMIA Symp       Date:  2001

4.  "Understanding" medical school curriculum content using KnowledgeMap.

Authors:  Joshua C Denny; Jeffrey D Smithers; Randolph A Miller; Anderson Spickard
Journal:  J Am Med Inform Assoc       Date:  2003-03-28       Impact factor: 4.497

5.  Creating an online dictionary of abbreviations from MEDLINE.

Authors:  Jeffrey T Chang; Hinrich Schütze; Russ B Altman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

6.  The sublanguage of cross-coverage.

Authors:  Peter D Stetson; Stephen B Johnson; Matthew Scotch; George Hripcsak
Journal:  Proc AMIA Symp       Date:  2002

7.  A simple algorithm for identifying negated findings and diseases in discharge summaries.

Authors:  W W Chapman; W Bridewell; P Hanbury; G F Cooper; B G Buchanan
Journal:  J Biomed Inform       Date:  2001-10       Impact factor: 6.317

8.  SaRAD: a Simple and Robust Abbreviation Dictionary.

Authors:  Eytan Adar
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

9.  Pathology abbreviated: a long review of short terms.

Authors:  Jules J Berman
Journal:  Arch Pathol Lab Med       Date:  2004-03       Impact factor: 5.534

10.  Detecting abbreviations in discharge summaries using machine learning methods.

Authors:  Yonghui Wu; S Trent Rosenbloom; Joshua C Denny; Randolph A Miller; Subramani Mani; Dario A Giuse; Hua Xu
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22
View more
  5 in total

1.  A Preliminary Study of Clinical Abbreviation Disambiguation in Real Time.

Authors:  Y Wu; J C Denny; S T Rosenbloom; R A Miller; D A Giuse; M Song; H Xu
Journal:  Appl Clin Inform       Date:  2015-06-03       Impact factor: 2.342

2.  Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.

Authors:  Hua Xu; Peter D Stetson; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

3.  A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).

Authors:  Yonghui Wu; Joshua C Denny; S Trent Rosenbloom; Randolph A Miller; Dario A Giuse; Lulu Wang; Carmelo Blanquicett; Ergin Soysal; Jun Xu; Hua Xu
Journal:  J Am Med Inform Assoc       Date:  2017-04-01       Impact factor: 4.497

4.  A New Biomedical Passage Retrieval Framework for Laboratory Medicine: Leveraging Domain-specific Ontology, Multilevel PRF, and Negation Differential Weighting.

Authors:  Keejun Han; Hyoeun Shim; Mun Y Yi
Journal:  J Healthc Eng       Date:  2018-12-24       Impact factor: 2.682

5.  Disambiguating Clinical Abbreviations Using a One-Fits-All Classifier Based on Deep Learning Techniques.

Authors:  Areej Jaber; Paloma Martínez
Journal:  Methods Inf Med       Date:  2022-02-01       Impact factor: 1.800

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.