Literature DB >> 25523466

Evaluating semantic similarity and relatedness over the semantic grouping of clinical term pairs.

Bridget T McInnes1, Ted Pedersen2.   

Abstract

INTRODUCTION: This article explores how measures of semantic similarity and relatedness are impacted by the semantic groups to which the concepts they are measuring belong. Our goal is to determine if there are distinctions between homogeneous comparisons (where both concepts belong to the same group) and heterogeneous ones (where the concepts are in different groups). Our hypothesis is that the similarity measures will be significantly affected since they rely on hierarchical is-a relations, whereas relatedness measures should be less impacted since they utilize a wider range of relations. In addition, we also evaluate the effect of combining different measures of similarity and relatedness. Our hypothesis is that these combined measures will more closely correlate with human judgment, since they better reflect the rich variety of information humans use when assessing similarity and relatedness.
METHOD: We evaluate our method on four reference standards. Three of the reference standards were annotated by human judges for relatedness and one was annotated for similarity.
RESULTS: We found significant differences in the correlation of semantic similarity and relatedness measures with human judgment, depending on which semantic groups were involved. We also found that combining a definition based relatedness measure with an information content similarity measure resulted in significant improvements in correlation over individual measures. AVAILABILITY: The semantic similarity and relatedness package is an open source program available from http://umls-similarity.sourceforge.net/. The reference standards are available at http://www.people.vcu.edu/∼{}btmcinnes/downloads.html.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  NLP; Natural language processing; Semantic relatedness; Semantic similarity

Mesh:

Year:  2014        PMID: 25523466     DOI: 10.1016/j.jbi.2014.11.014

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  6 in total

1.  Feature extraction for phenotyping from semantic and knowledge resources.

Authors:  Wenxin Ning; Stephanie Chan; Andrew Beam; Ming Yu; Alon Geva; Katherine Liao; Mary Mullen; Kenneth D Mandl; Isaac Kohane; Tianxi Cai; Sheng Yu
Journal:  J Biomed Inform       Date:  2019-02-07       Impact factor: 6.317

2.  POETenceph - Automatic identification of clinical notes indicating encephalopathy using a realist ontology.

Authors:  Kristina M Doing-Harris; Charlene R Weir; Sean Igo; Jianlin Shi; Yijun Shao; John F Hurdle
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

3.  Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.

Authors:  Denis Newman-Griffis; Guy Divita; Bart Desmet; Ayah Zirikly; Carolyn P Rosé; Eric Fosler-Lussier
Journal:  J Am Med Inform Assoc       Date:  2021-03-01       Impact factor: 4.497

4.  Indirect association and ranking hypotheses for literature based discovery.

Authors:  Sam Henry; Bridget T McInnes
Journal:  BMC Bioinformatics       Date:  2019-08-15       Impact factor: 3.169

5.  HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey.

Authors:  Juan J Lastra-Díaz; Alicia Lara-Clares; Ana Garcia-Serrano
Journal:  BMC Bioinformatics       Date:  2022-01-06       Impact factor: 3.169

6.  A hierarchical method to automatically encode Chinese diagnoses through semantic similarity estimation.

Authors:  Wenxin Ning; Ming Yu; Runtong Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2016-03-03       Impact factor: 2.796

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.