Literature DB >> 15811780

Automatic processing of multilingual medical terminology: applications to thesaurus enrichment and cross-language information retrieval.

H Déjean1, E Gaussier, J-M Renders, F Sadat.   

Abstract

OBJECTIVES: We present in this article experiments on multi-language information extraction and access in the medical domain. For such applications, multilingual terminology plays a crucial role when working on specialized languages and specific domains.
MATERIAL AND METHODS: We propose firstly a method for enriching multilingual thesauri which extracts new terms from parallel corpora, and secondly, a new approach for bilingual lexicon extraction from comparable corpora, which uses a bilingual thesaurus as a pivot. We illustrate their use in multi-language information retrieval (English/German) in the medical domains.
RESULTS: Our experiments show that these automatically extracted bilingual lexicons are accurate enough (85% precision for term extraction) for semi-automatically enriching mono- or bi-lingual thesauri such as the universal medical language system, and that their use in cross-language information retrieval significantly improves the retrieval performance (from 22 to 40% average precision) and clearly outperforms existing bilingual lexicon resources (both general lexicons and specialized ones).
CONCLUSION: We show in this paper first that bilingual lexicon extraction from parallel corpora in the medical domain could lead to accurate, specialized lexicons, which can be used to help enrich existing thesauri and second that bilingual lexicons extracted from comparable corpora outperform general bilingual resources for cross-language information retrieval.

Mesh:

Year:  2005        PMID: 15811780     DOI: 10.1016/j.artmed.2004.07.015

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  5 in total

1.  Contribution to terminology internationalization by word alignment in parallel corpora.

Authors:  Louise Deléger; Magnus Merkel; Pierre Zweigenbaum
Journal:  AMIA Annu Symp Proc       Date:  2006

2.  A twofold strategy for translating a medical terminology into French.

Authors:  Louise Deléger; Tayeb Merabti; Thierry Lecrocq; Michel Joubert; Pierre Zweigenbaum; Stéfan Darmoni
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

3.  Fostering Multilinguality in the UMLS: A Computational Approach to Terminology Expansion for Multiple Languages.

Authors:  Johannes Hellrich; Udo Hahn
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

4.  Translating the Foundational Model of Anatomy into French using knowledge-based and lexical methods.

Authors:  Tayeb Merabti; Lina F Soualmia; Julien Grosjean; Olivier Palombi; Jean-Michel Müller; Stéfan J Darmoni
Journal:  BMC Med Inform Decis Mak       Date:  2011-10-26       Impact factor: 2.796

5.  Creating a medical English-Swedish dictionary using interactive word alignment.

Authors:  Mikael Nyström; Magnus Merkel; Lars Ahrenberg; Pierre Zweigenbaum; Håkan Petersson; Hans Ahlfeldt
Journal:  BMC Med Inform Decis Mak       Date:  2006-10-12       Impact factor: 2.796

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.