Literature DB >> 11825186

Subword segmentation--leveling out morphological variations for medical document retrieval.

U Hahn1, M Honeck, M Piotrowski, S Schulz.   

Abstract

Many lexical items from medical sublanguages exhibit a complex morphological structure that is hard to account for by simple string matching (e.g., truncation). While inflection is usually easy to deal with, productive morphological processes in terms of derivation and (single-word) composition constitute a real challenge. We here propose an approach in which morphologically complex word forms are segmented into medically significant subwords. After segmentation, both query terms and document terms are submitted to the matching procedure. This way, problems arising from morphologically motivated word form alterations can be eliminated from the retrieval procedure. We provide empirical data which reveals that subword-based indexing and retrieval performs significantly better than conventional string matching approaches.

Mesh:

Year:  2001        PMID: 11825186      PMCID: PMC2243631     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  7 in total

1.  Morpheme-based, cross-lingual indexing for medical document retrieval.

Authors:  S Schulz; U Hahn
Journal:  Int J Med Inform       Date:  2000-09       Impact factor: 4.046

2.  Morpho-semantic parsing of medical expressions.

Authors:  R H Baud; C Lovis; A M Rassinoux; J R Scherrer
Journal:  Proc AMIA Symp       Date:  1998

3.  The GRAIL concept modelling language for medical terminology.

Authors:  A L Rector; S Bechhofer; C A Goble; I Horrocks; W A Nowlan; W D Solomon
Journal:  Artif Intell Med       Date:  1997-02       Impact factor: 5.326

4.  Identification and transformation of terminal morphemes in medical English.

Authors:  A W Pratt; M Pacak
Journal:  Methods Inf Med       Date:  1969-04       Impact factor: 2.176

5.  Morphosemantic analysis of compound word forms denoting surgical procedures.

Authors:  L M Norton; M G Pacak
Journal:  Methods Inf Med       Date:  1983-01       Impact factor: 2.176

6.  The use of morphosemantic regularities in the medical vocabulary for automatic lexical coding.

Authors:  S Wolff
Journal:  Methods Inf Med       Date:  1984-10       Impact factor: 2.176

7.  Morphosemantic analysis of -ITIS forms in medical language.

Authors:  M G Pacak; L M Norton; G S Dunham
Journal:  Methods Inf Med       Date:  1980-04       Impact factor: 2.176

  7 in total
  4 in total

1.  Corpus-based associations provide additional morphological variants to medical terminologies.

Authors:  Pierre Zweigenbaum; Natalia Grabar
Journal:  AMIA Annu Symp Proc       Date:  2003

2.  Identification of related gene/protein names based on an HMM of name variations.

Authors:  L Yeganova; L Smith; W J Wilbur
Journal:  Comput Biol Chem       Date:  2004-04       Impact factor: 2.877

3.  A system for automated lexical mapping.

Authors:  Jennifer Y Sun; Yao Sun
Journal:  J Am Med Inform Assoc       Date:  2006-02-24       Impact factor: 4.497

4.  A strategy for assigning new concepts in the MEDLINE database.

Authors:  Won Kim; W John Wilbur
Journal:  AMIA Annu Symp Proc       Date:  2005
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.