Retrofitting Concept Vector Representations of Medical Concepts to Improve Estimates of Semantic Similarity and Relatedness.

Zhiguo Yu, Byron C Wallace, Todd Johnson, Trevor Cohen.

Abstract

Estimation of semantic similarity and relatedness between biomedical concepts has utility for many informatics applications. Automated methods fall into two categories: methods based on distributional statistics drawn from text corpora, and methods using the structure of existing knowledge resources. Methods in the former category disregard taxonomic structure, while those in the latter fail to consider semantically relevant empirical information. In this paper, we present a method that retrofits distributional context vector representations of biomedical concepts using structural information from the UMLS Metathesaurus, such that the similarity between vector representations of linked concepts is augmented. We evaluated it on the UMNSRS benchmark. Our results demonstrate that retrofitting of concept vector representations leads to better correlation with human raters for both similarity and relatedness, surpassing the best results reported to date. They also demonstrate a clear improvement in performance on this reference standard for retrofitted vector representations, as compared to those without retrofitting.
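The abstract does not state the update rule, so the following is only a minimal sketch of the general idea: each concept vector is pulled toward the vectors of the concepts it is linked to, while staying anchored to its original distributional vector. It assumes the neighbour-averaging update commonly used for retrofitting word vectors (equal weight on the original vector and on the mean of the linked vectors). The function names `retrofit` and `cosine`, the concept identifiers, and the toy vectors are hypothetical and are not taken from the paper.

```python
import numpy as np


def cosine(u, v):
    """Cosine similarity between two vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


def retrofit(vectors, links, iterations=10):
    """Pull each concept vector toward the vectors of its linked concepts.

    vectors: dict mapping concept id -> distributional vector
    links:   dict mapping concept id -> list of linked concept ids
             (in the paper these links would come from the UMLS
             Metathesaurus; here they are a hand-written assumption)
    """
    original = {c: np.asarray(v, dtype=float) for c, v in vectors.items()}
    updated = {c: v.copy() for c, v in original.items()}
    for _ in range(iterations):
        for concept, neighbours in links.items():
            if concept not in updated:
                continue
            linked = [n for n in neighbours if n in updated]
            if not linked:
                continue
            # Equal weighting (an assumption): average the original vector
            # with the mean of the current vectors of the linked concepts.
            neighbour_mean = np.mean([updated[n] for n in linked], axis=0)
            updated[concept] = (original[concept] + neighbour_mean) / 2.0
    return updated


# Toy example with made-up concept ids and 2-d vectors.
toy_vectors = {"concept_a": [1.0, 0.0], "concept_b": [0.0, 1.0], "concept_c": [1.0, 1.0]}
toy_links = {"concept_a": ["concept_b"], "concept_b": ["concept_a"]}
retrofitted = retrofit(toy_vectors, toy_links)
print(cosine(toy_vectors["concept_a"], toy_vectors["concept_b"]))
print(cosine(retrofitted["concept_a"], retrofitted["concept_b"]))
```

In this toy run the two linked concepts end up with a higher cosine similarity than they started with, and the unlinked concept is left unchanged. An evaluation like the one described above would then compare such similarities against human ratings, for example with a rank correlation.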

Keywords:  Natural Language Processing; Semantics; Unified Medical Language System

Year: 2017   PMID: 29295178   PMCID: PMC6464117

Source DB: PubMed   Journal: Stud Health Technol Inform   ISSN: 0926-9630


