Literature DB >> 11977807

Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method.

H Liu1, Y A Lussier, C Friedman.   

Abstract

With the growing use of Natural Language Processing (NLP) techniques for information extraction and concept indexing in the biomedical domain, a method that quickly and efficiently assigns the correct sense of an ambiguous biomedical term in a given context is needed concurrently. The current status of word sense disambiguation (WSD) in the biomedical domain is that handcrafted rules are used based on contextual material. The disadvantages of this approach are (i) generating WSD rules manually is a time-consuming and tedious task, (ii) maintenance of rule sets becomes increasingly difficult over time, and (iii) handcrafted rules are often incomplete and perform poorly in new domains comprised of specialized vocabularies and different genres of text. This paper presents a two-phase unsupervised method to build a WSD classifier for an ambiguous biomedical term W. The first phase automatically creates a sense-tagged corpus for W, and the second phase derives a classifier for W using the derived sense-tagged corpus as a training set. A formative experiment was performed, which demonstrated that classifiers trained on the derived sense-tagged corpora achieved an overall accuracy of about 97%, with greater than 90% accuracy for each individual ambiguous term.

Mesh:

Year:  2001        PMID: 11977807     DOI: 10.1006/jbin.2001.1023

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  23 in total

1.  Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Authors:  Hongfang Liu; Stephen B Johnson; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

2.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

3.  A multi-aspect comparison study of supervised word sense disambiguation.

Authors:  Hongfang Liu; Virginia Teller; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

4.  Tailoring vocabularies for NLP in sub-domains: a method to detect unused word sense.

Authors:  Rosa L Figueroa; Qing Zeng-Treitler; Sergey Goryachev; Eduardo P Wiechmann
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

5.  Generating quality word sense disambiguation test sets based on MeSH indexing.

Authors:  Jung-Wei Fan; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

6.  Quantitative assessment of dictionary-based protein named entity tagging.

Authors:  Hongfang Liu; Zhang-Zhi Hu; Manabu Torii; Cathy Wu; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

7.  Enhancing acronym/abbreviation knowledge bases with semantic information.

Authors:  Manabu Torii; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

8.  A fast document classification algorithm for gene symbol disambiguation in the BITOLA literature-based discovery support system.

Authors:  Andrej Kastrin; Dimitar Hristovski
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

9.  Towards Comprehensive Clinical Abbreviation Disambiguation Using Machine-Labeled Training Data.

Authors:  Gregory P Finley; Serguei V S Pakhomov; Reed McEwan; Genevieve B Melton
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

10.  Building a high-quality sense inventory for improved abbreviation disambiguation.

Authors:  Naoaki Okazaki; Sophia Ananiadou; Jun'ichi Tsujii
Journal:  Bioinformatics       Date:  2010-03-25       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.