| Literature DB >> 36093038 |
Goonmeet Bajaj1, Vinh Nguyen2, Thilini Wijesiriwardene3, Hong Yung Yip3, Vishesh Javangula4, Srinivasan Parthasarathy1, Amit Sheth3, Olivier Bodenreider2.
Abstract
Recent work uses a Siamese Network, initialized with BioWordVec embeddings (distributed word embeddings), for predicting synonymy among biomedical terms to automate a part of the UMLS (Unified Medical Language System) Metathesaurus construction process. We evaluate the use of contextualized word embeddings extracted from nine different biomedical BERT-based models for synonymy prediction in the UMLS by replacing BioWordVec embeddings with embeddings extracted from each biomedical BERT model using different feature extraction methods. Surprisingly, we find that Siamese Networks initialized with BioWordVec embeddings still outperform the Siamese Networks initialized with embedding extracted from biomedical BERT model.Entities:
Year: 2022 PMID: 36093038 PMCID: PMC9455661 DOI: 10.18653/v1/2022.insights-1.11
Source DB: PubMed Journal: Proc Conf Assoc Comput Linguist Meet ISSN: 0736-587X