Literature DB >> 22166490

Finding disease similarity based on implicit semantic similarity.

Sachin Mathur1, Deendayal Dinakarpandian.   

Abstract

Genomics has contributed to a growing collection of gene-function and gene-disease annotations that can be exploited by informatics to study similarity between diseases. This can yield insight into disease etiology, reveal common pathophysiology and/or suggest treatment that can be appropriated from one disease to another. Estimating disease similarity solely on the basis of shared genes can be misleading as variable combinations of genes may be associated with similar diseases, especially for complex diseases. This deficiency can be potentially overcome by looking for common biological processes rather than only explicit gene matches between diseases. The use of semantic similarity between biological processes to estimate disease similarity could enhance the identification and characterization of disease similarity. We present functions to measure similarity between terms in an ontology, and between entities annotated with terms drawn from the ontology, based on both co-occurrence and information content. The similarity measure is shown to outperform other measures used to detect similarity. A manually curated dataset with known disease similarities was used as a benchmark to compare the estimation of disease similarity based on gene-based and Gene Ontology (GO) process-based comparisons. The detection of disease similarity based on semantic similarity between GO Processes (Recall=55%, Precision=60%) performed better than using exact matches between GO Processes (Recall=29%, Precision=58%) or gene overlap (Recall=88% and Precision=16%). The GO-Process based disease similarity scores on an external test set show statistically significant Pearson correlation (0.73) with numeric scores provided by medical residents. GO-Processes associated with similar diseases were found to be significantly regulated in gene expression microarray datasets of related diseases. Copyright Â
© 2011 Elsevier Inc. All rights reserved.

Mesh:

Year:  2011        PMID: 22166490     DOI: 10.1016/j.jbi.2011.11.017

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  34 in total

1.  An ontology for Autism Spectrum Disorder (ASD) to infer ASD phenotypes from Autism Diagnostic Interview-Revised data.

Authors:  Omri Mugzach; Mor Peleg; Steven C Bagley; Stephen J Guter; Edwin H Cook; Russ B Altman
Journal:  J Biomed Inform       Date:  2015-07-04       Impact factor: 6.317

2.  Using SemRep to label semantic relations extracted from clinical text.

Authors:  Ying Liu; Robert Bill; Marcelo Fiszman; Thomas Rindflesch; Ted Pedersen; Genevieve B Melton; Serguei V Pakhomov
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

3.  DDAPRED: a computational method for predicting drug repositioning using regularized logistic matrix factorization.

Authors:  Xiaofeng Wang; Renxiang Yan
Journal:  J Mol Model       Date:  2020-02-15       Impact factor: 1.810

4.  HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.

Authors:  Feichen Shen; Suyuan Peng; Yadan Fan; Andrew Wen; Sijia Liu; Yanshan Wang; Liwei Wang; Hongfang Liu
Journal:  J Biomed Inform       Date:  2019-06-27       Impact factor: 6.317

5.  Deepening the knowledge of rare diseases dependent on angiogenesis through semantic similarity clustering and network analysis.

Authors:  Raquel Pagano-Márquez; José Córdoba-Caballero; Beatriz Martínez-Poveda; Ana R Quesada; Elena Rojano; Pedro Seoane; Juan A G Ranea; Miguel Ángel Medina
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

6.  Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

Authors:  Hui Peng; Chaowang Lan; Yuansheng Liu; Tao Liu; Michael Blumenstein; Jinyan Li
Journal:  Oncotarget       Date:  2017-08-24

7.  Bi-directional semantic similarity for gene ontology to optimize biological and clinical analyses.

Authors:  Sang Jay Bien; Chan Hee Park; Hae Jin Shim; Woongcheol Yang; Jihun Kim; Ju Han Kim
Journal:  J Am Med Inform Assoc       Date:  2012-02-28       Impact factor: 4.497

8.  Evaluating the consistency of gene sets used in the analysis of bacterial gene expression data.

Authors:  Nathan L Tintle; Alexandra Sitarik; Benjamin Boerema; Kylie Young; Aaron A Best; Matthew Dejongh
Journal:  BMC Bioinformatics       Date:  2012-08-08       Impact factor: 3.169

9.  The human disease network in terms of dysfunctional regulatory mechanisms.

Authors:  Jing Yang; Su-Juan Wu; Wen-Tao Dai; Yi-Xue Li; Yuan-Yuan Li
Journal:  Biol Direct       Date:  2015-10-08       Impact factor: 4.540

10.  Calculating semantic relatedness for biomedical use in a knowledge-poor environment.

Authors:  Maciej Rybinski; José Aldana-Montes
Journal:  BMC Bioinformatics       Date:  2014-11-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.