Automatic lexeme acquisition for a multilingual medical subword thesaurus.

Kornél Markó, Stefan Schulz, Udo Hahn.

Abstract

PURPOSE: We present a method for the automated acquisition of a multilingual medical lexicon (for Spanish, French and Swedish) to be used within the framework of a medical cross-language text retrieval system.
METHODS: For the lexical acquisition process, we incorporate seed lexicons and lists of trusted term translations derived from the UMLS Metathesaurus. The seed lexicons for Spanish, French and Swedish are automatically generated from (previously manually constructed) Portuguese, German and English sources by simple string transformations. Lexical and semantic hypotheses are then validated by processing pairs of term translations. In a last step, we use the cleaned list of "approved" translations in order to augment, step by step, the target dictionaries by processing the parallel corpora in terms of co-occurrence patterns of hypothesized translation equivalents which cannot be derived by simple character substitutions.
RESULTS: An existing multilingual lexicon for the medical domain with about 60,000 entries for English, German, and Portuguese was automatically augmented by more than 17,000 new lexemes for Spanish, French, and Swedish.
CONCLUSIONS: Our approach constitutes a promising method for the automated creation of new lexicon entries and their linkage to semantic identifiers.
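
The seed-lexicon step described under METHODS can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the substitution rules, the lexicon entries, the semantic identifiers, the term pairs, and the function names `transform` and `acquire` are hypothetical and are not taken from the paper or from the UMLS Metathesaurus. The sketch only shows the general idea of deriving target-language candidates by simple string transformations and keeping those confirmed by a trusted translation pair; the co-occurrence step is not covered.

```python
from typing import Dict, List, Tuple

# Hypothetical Portuguese -> Spanish character substitutions (illustrative only).
RULES: List[Tuple[str, str]] = [("ção", "ción"), ("ô", "ó")]

# Hypothetical source lexicon: lexeme -> semantic identifier.
SEED: Dict[str, str] = {
    "inflamação": "#inflamm",
    "estômago": "#stomach",
    "coração": "#heart",
}

# Hypothetical trusted term translations (source term, target term).
PAIRS: List[Tuple[str, str]] = [
    ("inflamação do estômago", "inflamación del estómago"),
    ("doença do coração", "enfermedad del corazón"),
]


def transform(lexeme: str, rules: List[Tuple[str, str]]) -> str:
    """Apply every substitution rule, in order, to one source lexeme."""
    for source, target in rules:
        lexeme = lexeme.replace(source, target)
    return lexeme


def acquire(
    seed: Dict[str, str],
    rules: List[Tuple[str, str]],
    pairs: List[Tuple[str, str]],
) -> Dict[str, str]:
    """Keep a candidate only if some trusted pair contains the source
    lexeme on its source side and the candidate on its target side."""
    accepted: Dict[str, str] = {}
    for source_lexeme, semantic_id in seed.items():
        candidate = transform(source_lexeme, rules)
        for source_term, target_term in pairs:
            if source_lexeme in source_term and candidate in target_term:
                accepted[candidate] = semantic_id
                break
    return accepted


result = acquire(SEED, RULES, PAIRS)
print(result)
```

In this toy data the candidate derived from "coração" is rejected because the trusted Spanish term does not contain it, which is the kind of hypothesis that, according to the abstract, has to be recovered later from co-occurrence patterns rather than character substitutions.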

Year:  2006        PMID: 16839808     DOI: 10.1016/j.ijmedinf.2006.05.032

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  3 in total

Review 1.  Natural Language Processing methods and systems for biomedical ontology learning.

Authors:  Kaihong Liu; William R Hogan; Rebecca S Crowley
Journal:  J Biomed Inform       Date:  2010-07-18       Impact factor: 6.317

2.  Multilingual chief complaint classification for syndromic surveillance: an experiment with Chinese chief complaints.

Authors:  Hsin-Min Lu; Hsinchun Chen; Daniel Zeng; Chwan-Chuen King; Fuh-Yuan Shih; Tsung-Shu Wu; Jin-Yi Hsiao
Journal:  Int J Med Inform       Date:  2008-10-05       Impact factor: 4.046

Review 3.  The Unified Medical Language System at 30 Years and How It Is Used and Published: Systematic Review and Content Analysis.

Authors:  Xia Jing
Journal:  JMIR Med Inform       Date:  2021-08-27