Literature DB >> 25160251

Exploiting parallel corpora to scale up multilingual biomedical terminologies.

Johannes Hellrich1, Udo Hahn1.   

Abstract

Creating and maintaining biomedical terminologies for multiple natural languages is a resource-expensive task, typically carried out by human domain experts. We here report on efforts to computationally support this process by treating term acquisition as a machine translation-guided classification problem capitalizing on parallel corpora. We report on experiments for French, German, Spanish and Dutch parts of a UMLS-derived terminology for which we generated 18 k, 23 k, 19 k and 12 k new terms and synonyms, respectively. Based on expert assessments of a novel German terminology segment about 80% of the newly acquired terms were judged as bio-medically reasonable and terminologically valid.

Entities:  

Mesh:

Year:  2014        PMID: 25160251

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  2 in total

1.  A deep learning approach to bilingual lexicon induction in the biomedical domain.

Authors:  Geert Heyman; Ivan Vulić; Marie-Francine Moens
Journal:  BMC Bioinformatics       Date:  2018-07-09       Impact factor: 3.169

Review 2.  The Unified Medical Language System at 30 Years and How It Is Used and Published: Systematic Review and Content Analysis.

Authors:  Xia Jing
Journal:  JMIR Med Inform       Date:  2021-08-27
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.