| Literature DB >> 22195148 |
Bridget T McInnes1, Ted Pedersen, Ying Liu, Genevieve B Melton, Serguei V Pakhomov.
Abstract
In this paper, we introduce a novel knowledge-based word sense disambiguation method that determines the sense of an ambiguous word in biomedical text using semantic similarity or relatedness measures. These measures quantify the degree of similarity between concepts in the Unified Medical Language System (UMLS). The objective of this work was to develop a method that can disambiguate terms in biomedical text by exploiting similarity information extracted from the UMLS and to evaluate the efficacy of information content-based semantic similarity measures, which augment path-based information with probabilities derived from biomedical corpora. We show that information content-based measures obtain a higher disambiguation accuracy than path-based measures because they weight the path based on where it exists in the taxonomy coupled with the probability of the concepts occurring in a corpus of text.Mesh:
Year: 2011 PMID: 22195148 PMCID: PMC3243213
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076