Evaluating Biomedical Word Embeddings for Vocabulary Alignment at Scale in the UMLS Metathesaurus Using Siamese Networks.

Goonmeet Bajaj, Vinh Nguyen, Thilini Wijesiriwardene, Hong Yung Yip, Vishesh Javangula, Srinivasan Parthasarathy, Amit Sheth, Olivier Bodenreider.

Abstract

Recent work uses a Siamese Network, initialized with BioWordVec embeddings (distributed word embeddings), to predict synonymy among biomedical terms and thereby automate part of the UMLS (Unified Medical Language System) Metathesaurus construction process. We evaluate contextualized word embeddings extracted from nine different biomedical BERT-based models for synonymy prediction in the UMLS, replacing the BioWordVec embeddings with embeddings extracted from each biomedical BERT model using different feature extraction methods. Surprisingly, we find that Siamese Networks initialized with BioWordVec embeddings still outperform Siamese Networks initialized with embeddings extracted from the biomedical BERT models.

Year:  2022        PMID: 36093038      PMCID: PMC9455661          DOI: 10.18653/v1/2022.insights-1.11

Source DB:  PubMed          Journal:  Proc Conf Assoc Comput Linguist Meet        ISSN: 0736-587X


  5 in total

1.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  BioWordVec, improving biomedical word embeddings with subword information and MeSH.

Authors:  Yijia Zhang; Qingyu Chen; Zhihao Yang; Hongfei Lin; Zhiyong Lu
Journal:  Sci Data       Date:  2019-05-10       Impact factor: 6.444

3.  Adding an Attention Layer Improves the Performance of a Neural Network Architecture for Synonymy Prediction in the UMLS Metathesaurus.

Authors:  Vinh Nguyen; Olivier Bodenreider
Journal:  Stud Health Technol Inform       Date:  2022-06-06

4.  BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

Authors:  Jinhyuk Lee; Wonjin Yoon; Sungdong Kim; Donghyeon Kim; Sunkyu Kim; Chan Ho So; Jaewoo Kang
Journal:  Bioinformatics       Date:  2020-02-15       Impact factor: 6.937

  1 in total

1.  Context-Enriched Learning Models for Aligning Biomedical Vocabularies at Scale in the UMLS Metathesaurus.

Authors:  Vinh Nguyen; Hong Yung Yip; Goonmeet Bajaj; Thilini Wijesiriwardene; Vishesh Javangula; Srinivasan Parthasarathy; Amit Sheth; Olivier Bodenreider
Journal:  Proc Int World Wide Web Conf       Date:  2022-04-25
