Literature DB >> 31009761

Concept embedding to measure semantic relatedness for biomedical information ontologies.

Junseok Park1, Kwangmin Kim1, Woochang Hwang2, Doheon Lee3.   

Abstract

There have been many attempts to identify relationships among concepts corresponding to terms from biomedical information ontologies such as the Unified Medical Language System (UMLS). In particular, vector representation of such concepts using information from UMLS definition texts is widely used to measure the relatedness between two biological concepts. However, conventional relatedness measures have a limited range of applicable word coverage, which limits the performance of these models. In this paper, we propose a concept-embedding model of a UMLS semantic relatedness measure to overcome the limitations of earlier models. We obtained context texts of biological concepts that are not defined in UMLS by utilizing Wikipedia as an external knowledgebase. Concept vector representations were then derived from the context texts of the biological concepts. The degree of relatedness between two concepts was defined as the cosine similarity between corresponding concept vectors. As a result, we validated that our method provides higher coverage and better performance than the conventional method.
Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.

Keywords:  Embedding; NLP; Paragraph vector; Similarity; UMLS; Wikipedia

Year:  2019        PMID: 31009761     DOI: 10.1016/j.jbi.2019.103182

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  5 in total

1.  Improved biomedical word embeddings in the transformer era.

Authors:  Jiho Noh; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2021-07-18       Impact factor: 8.000

2.  BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale.

Authors:  Qingyu Chen; Kyubum Lee; Shankai Yan; Sun Kim; Chih-Hsuan Wei; Zhiyong Lu
Journal:  PLoS Comput Biol       Date:  2020-04-23       Impact factor: 4.475

3.  An interactive retrieval system for clinical trial studies with context-dependent protocol elements.

Authors:  Junseok Park; Seongkuk Park; Kwangmin Kim; Woochang Hwang; Sunyong Yoo; Gwan-Su Yi; Doheon Lee
Journal:  PLoS One       Date:  2020-09-18       Impact factor: 3.240

4.  Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

Authors:  Denis Newman-Griffis; Eric Fosler-Lussier
Journal:  Front Digit Health       Date:  2021-03-10

5.  Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.

Authors:  Yuqing Mao; Kin Wah Fung
Journal:  J Am Med Inform Assoc       Date:  2020-10-01       Impact factor: 4.497

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.