Literature DB >> 12386113

Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Hongfang Liu1, Stephen B Johnson, Carol Friedman.   

Abstract

UNLABELLED: Motivation. The UMLS has been used in natural language processing applications such as information retrieval and information extraction systems. The mapping of free-text to UMLS concepts is important for these applications. To improve the mapping, we need a method to disambiguate terms that possess multiple UMLS concepts. In the general English domain, machine-learning techniques have been applied to sense-tagged corpora, in which senses (or concepts) of ambiguous terms have been annotated (mostly manually). Sense disambiguation classifiers are then derived to determine senses (or concepts) of those ambiguous terms automatically. However, manual annotation of a corpus is an expensive task. We propose an automatic method that constructs sense-tagged corpora for ambiguous terms in the UMLS using MEDLINE abstracts.
METHODS: For a term W that represents multiple UMLS concepts, a collection of MEDLINE abstracts that contain W is extracted. For each abstract in the collection, occurrences of concepts that have relations with W as defined in the UMLS are automatically identified. A sense-tagged corpus, in which senses of W are annotated, is then derived based on those identified concepts. The method was evaluated on a set of 35 frequently occurring ambiguous biomedical abbreviations using a gold standard set that was automatically derived. The quality of the derived sense-tagged corpus was measured using precision and recall.
RESULTS: The derived sense-tagged corpus had an overall precision of 92.9% and an overall recall of 47.4%. After removing rare senses and ignoring abbreviations with closely related senses, the overall precision was 96.8% and the overall recall was 50.6%.
CONCLUSIONS: UMLS conceptual relations and MEDLINE abstracts can be used to automatically acquire knowledge needed for resolving ambiguity when mapping free-text to UMLS concepts.

Mesh:

Year:  2002        PMID: 12386113      PMCID: PMC349379          DOI: 10.1197/jamia.m1101

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  14 in total

1.  Mining molecular binding terminology from biomedical text.

Authors:  T C Rindflesch; L Hunter; A R Aronson
Journal:  Proc AMIA Symp       Date:  1999

2.  UMLS concept indexing for production databases: a feasibility study.

Authors:  P Nadkarni; R Chen; C Brandt
Journal:  J Am Med Inform Assoc       Date:  2001 Jan-Feb       Impact factor: 4.497

3.  Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  J Biomed Inform       Date:  2001-08       Impact factor: 6.317

4.  A study of abbreviations in the UMLS.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  Proc AMIA Symp       Date:  2001

5.  Circular hierarchical relationships in the UMLS: etiology, diagnosis, treatment, complications and prevention.

Authors:  O Bodenreider
Journal:  Proc AMIA Symp       Date:  2001

6.  A semantic lexicon for medical language processing.

Authors:  S B Johnson
Journal:  J Am Med Inform Assoc       Date:  1999 May-Jun       Impact factor: 4.497

7.  Finding the findings: identification of findings in medical literature using restricted natural language processing.

Authors:  C A Sneiderman; T C Rindflesch; A R Aronson
Journal:  Proc AMIA Annu Fall Symp       Date:  1996

Review 8.  Migraine and magnesium: eleven neglected connections.

Authors:  D R Swanson
Journal:  Perspect Biol Med       Date:  1988       Impact factor: 1.416

9.  Developing a test collection for biomedical word sense disambiguation.

Authors:  M Weeber; J G Mork; A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

10.  Ambiguity resolution while mapping free text to the UMLS Metathesaurus.

Authors:  T C Rindflesch; A R Aronson
Journal:  Proc Annu Symp Comput Appl Med Care       Date:  1994
View more
  34 in total

1.  An examination of PubMed's ability to disambiguate subject queries and journal title queries.

Authors:  Aida Marissa Smith
Journal:  J Med Libr Assoc       Date:  2004-01

2.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

3.  A multi-aspect comparison study of supervised word sense disambiguation.

Authors:  Hongfang Liu; Virginia Teller; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

4.  Tailoring vocabularies for NLP in sub-domains: a method to detect unused word sense.

Authors:  Rosa L Figueroa; Qing Zeng-Treitler; Sergey Goryachev; Eduardo P Wiechmann
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

5.  Generating quality word sense disambiguation test sets based on MeSH indexing.

Authors:  Jung-Wei Fan; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

6.  Using a statistical natural language Parser augmented with the UMLS specialist lexicon to assign SNOMED CT codes to anatomic sites and pathologic diagnoses in full text pathology reports.

Authors:  Henry J Lowe; Yang Huang; Donald P Regula
Journal:  AMIA Annu Symp Proc       Date:  2009-11-14

7.  MachineProse: an ontological framework for scientific assertions.

Authors:  Deendayal Dinakarpandian; Yugyung Lee; Kartik Vishwanath; Rohini Lingambhotla
Journal:  J Am Med Inform Assoc       Date:  2005-12-15       Impact factor: 4.497

8.  Comparison of vector space model methodologies to reconcile cross-species neuroanatomical concepts.

Authors:  P R Srinivas; Shang-Heng Wei; Nello Cristianini; E G Jones; F A Gorin
Journal:  Neuroinformatics       Date:  2005

9.  Abbreviation and acronym disambiguation in clinical discourse.

Authors:  Sergeui Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2005

10.  Quantitative assessment of dictionary-based protein named entity tagging.

Authors:  Hongfang Liu; Zhang-Zhi Hu; Manabu Torii; Cathy Wu; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2006-06-23       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.