
Generating quality word sense disambiguation test sets based on MeSH indexing.

Jung-Wei Fan, Carol Friedman.

Abstract

Word sense disambiguation (WSD) determines the correct meaning of a word that has more than one meaning. It is a critical step in biomedical natural language processing, because the information in a text can be interpreted correctly only if the meanings of its component terms are identified correctly first. Quality evaluation sets are important to WSD because they can be used as representative samples for developing automatic programs and as referees for comparing different WSD programs. To help create quality test sets for WSD, we developed a MeSH-based automatic sense-tagging method that preferentially annotates terms that are topical to the text. Preliminary results were promising and revealed important issues to be addressed in biomedical WSD research. We also suggest that, by cross-validating with 2 or 3 annotators, the method should be able to efficiently generate quality WSD test sets. An online supplement is available at: http://www.dbmi.columbia.edu/~juf7002/AMIA09.
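
The abstract does not spell out the tagging procedure, so the following is only a minimal sketch of how MeSH indexing could be used to sense-tag an ambiguous term, not the authors' implementation. It assumes a hypothetical sense inventory that maps each candidate sense to one or more MeSH headings, and it tags an occurrence only when exactly one candidate sense is supported by the headings assigned to the citation. The function name `tag_sense` and the inventory `COLD_SENSES` are illustrative assumptions.

```python
from typing import Dict, Optional, Set


def tag_sense(candidate_senses: Dict[str, Set[str]],
              mesh_headings: Set[str]) -> Optional[str]:
    """Return the one sense supported by the citation's MeSH headings.

    Assumption: a sense is "supported" when at least one of its MeSH
    headings appears in the citation's indexing. If no sense or more
    than one sense is supported, the occurrence is left untagged (None).
    """
    matched = [
        sense
        for sense, headings in candidate_senses.items()
        if headings & mesh_headings  # non-empty intersection
    ]
    return matched[0] if len(matched) == 1 else None


# Hypothetical sense inventory for the ambiguous term "cold".
COLD_SENSES = {
    "common_cold": {"Common Cold"},
    "cold_temperature": {"Cold Temperature"},
}

# MeSH headings assigned to an example citation (illustrative data).
citation_mesh = {"Common Cold", "Rhinovirus", "Humans"}
print(tag_sense(COLD_SENSES, citation_mesh))
```

Leaving ambiguous or unsupported occurrences untagged trades coverage for precision, which fits the stated goal of producing quality test sets that human annotators can then cross-validate.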

Year:  2009        PMID: 20351846      PMCID: PMC2815444     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


