Literature DB >> 26707454

Computing semantic similarity between biomedical concepts using new information content approach.

Mohamed Ben Aouicha1, Mohamed Ali Hadj Taieb2.   

Abstract

The exploitation of heterogeneous clinical sources and healthcare records is fundamental in clinical and translational research. The determination of semantic similarity between word pairs is an important component of text understanding that enables the processing and structuring of textual resources. Some of these measures have been adapted to the biomedical field by incorporating domain information extracted from clinical data or from medical ontologies such as MeSH. This study focuses on Information Content (IC) based measures that exploit the topological parameters of the taxonomy to express the semantics of a concept. A new intrinsic IC computing method based on the taxonomical parameters of the ancestors' subgraph is then assigned to a biomedical concept into the "is a" hierarchy. Moreover, we present a study of the topological parameters through the MeSH taxonomy. This study treats the semantic interpretation and the different ways of expressing the parameters of depth and the descendants' subgraph. Using MeSH as an input ontology, the accuracy of our proposal is evaluated and compared against other IC-based measures according to several widely-used benchmarks of biomedical terms. The correlation between the results obtained for the evaluated measure using the proposed approach and those from the ratings of human' experts shows that our proposal outperforms the previous measures.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Biomedicine; DAG topological parameters; Information content; MeSH; Semantic similarity

Mesh:

Year:  2015        PMID: 26707454     DOI: 10.1016/j.jbi.2015.12.007

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  4 in total

1.  HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey.

Authors:  Juan J Lastra-Díaz; Alicia Lara-Clares; Ana Garcia-Serrano
Journal:  BMC Bioinformatics       Date:  2022-01-06       Impact factor: 3.169

2.  Using logical constraints to validate statistical information about disease outbreaks in collaborative knowledge graphs: the case of COVID-19 epidemiology in Wikidata.

Authors:  Houcemeddine Turki; Dariusz Jemielniak; Mohamed A Hadj Taieb; Jose E Labra Gayo; Mohamed Ben Aouicha; Mus'ab Banat; Thomas Shafee; Eric Prud'hommeaux; Tiago Lubiana; Diptanshu Das; Daniel Mietchen
Journal:  PeerJ Comput Sci       Date:  2022-09-29

3.  BIOSSES: a semantic sentence similarity estimation system for the biomedical domain.

Authors:  Gizem Sogancioglu; Hakime Öztürk; Arzucan Özgür
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

4.  Use of word and graph embedding to measure semantic relatedness between Unified Medical Language System concepts.

Authors:  Yuqing Mao; Kin Wah Fung
Journal:  J Am Med Inform Assoc       Date:  2020-10-01       Impact factor: 4.497

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.