| Literature DB >> 31678735 |
Liang Cheng1, Hengqiang Zhao1, Pingping Wang2, Wenyang Zhou2, Meng Luo2, Tianxin Li2, Junwei Han3, Shulin Liu4, Qinghua Jiang5.
Abstract
Although our knowledge of <span class="Species">human diseases has increased dramatically, the molecular basis, phenotypic traits, and therapeutic targets of most diseases still remain unclear. An increasing number of studies have observed that similar diseases often are caused by similar molecules, can be diagnosed by similar markers or phenotypes, or can be cured by similar drugs. Thus, the identification of diseases similar to known ones has attracted considerable attention worldwide. To this end, the associations between diseases at the molecular, phenotypic, and taxonomic levels were used to measure the pairwise similarity in diseases. The corresponding perfor<span class="Species">mance assessment strategies for these methods involving the terms "category-based," "simulated-patient-based," and "benchmark-data-based" were thus further emphasized. Then, frequently used methods were evaluated using a benchmark-data-based strategy. To facilitate the assessment of disease similarity scores, researchers have designed dozens of tools that implement these methods for calculating disease similarity. Currently, disease similarity has been advantageous in predicting noncoding RNA (ncRNA) function and therapeutic drugs for diseases. In this article, we review disease similarity methods, evaluation strategies, tools, and their applications in the biomedical community. We further evaluate the performance of these methods and discuss the current limitations and future trends for calculating disease similarity.Entities:
Keywords: disease similarity; molecular basis; ncRNA function; phenotypic traits; therapeutic drugs
Year: 2019 PMID: 31678735 PMCID: PMC6838934 DOI: 10.1016/j.omtn.2019.09.019
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1Sub-graph of the DO Hierarchy for Alzheimer’s Disease
Arrows represent an “IS_A” relationship for DO. For example, “Alzheimer’s disease” is linked to “Dementia” by an “IS_A” relationship. All of the terms that can be linked by “IS_A” relationships in the graph from “Alzheimer’s disease” are the ancestors of “Alzheimer’s disease.” All of the terms that can link to “Disease” by “IS_A” relationships are the descendants of “Disease.”
Summary of Data Sources
| Category and Name | Creation Date | Initiator | PMID |
|---|---|---|---|
| OMIM | 1960s | McKusick | |
| MeSH | 1960s | Winifred Sewell | |
| UMLS | 1980s | Olivier Bodenreider | |
| SNOMED CT | 2001 | Wang et al. | |
| DO | 2003 | Schriml et al. | |
| MEDIC | 2012 | Davis et al. | |
| GeneRIF | 2007 | ||
| CTD | 2003 | ||
| GAD | 2004 | Becker et al. | |
| miR2Disease | Jiang et al. | ||
| HPO | 2008 | Robinson et al. | |
| SpliceDisease | 2011 | ||
| lncRNADisease | 2012 | ||
| HMDD v2.0 | 2013 | ||
| SIDD | 2013 | Cheng et al. | |
| OAHG | 2016 | Cheng et al. | |
| GOA | 2003 | Camon et al. | |
| HumanNet | 2011 | Lee et al. | |
Summary of Disease Similarity Methods
| Author(s) | Molecule Based | Phenotype Based | Hierarchy Based | Vocabulary | PMID (or Reference Number) | Year |
|---|---|---|---|---|---|---|
| Freudenberg and Propping | √ | OMIM | 2002 | |||
| van Driel et al. | √ | OMIM | 2006 | |||
| Köhler et al. | √ | OMIM | 2009 | |||
| Zhang et al. | √ | OMIM | 2010 | |||
| Zhou et al. | √ | MeSH | 2014 | |||
| Chen et al. | √ | UMLS | 2015 | |||
| Hoehndorf et al. | √ | DO | 2015 | |||
| Deng et al. | √ | OMIM | 2015 | |||
| Mabotuwana et al. | √ | SNOMED CT | 2013 | |||
| Mathur et al. | √ | DO | 2010 | |||
| Suthram et al. | √ | UMLS | 2010 | |||
| Gottlieb et al. | √ | UMLS | 2011 | |||
| Hamaneh and Yu | √ | OMIM/MeSH | 2014 | |||
| Kim et al. | √ | PharmGKB | 2015 | |||
| Wang et al. | √ | DO/MeSH | 2007 | |||
| Resnik | √ | √ | DO | 1995 | ||
| Lin | √ | √ | DO | 1998 | ||
| Schlicker et al. | √ | √ | 2006 | |||
| Mathur et al. | √ | √ | DO | 2012 | ||
| Cheng et al. | √ | √ | DO | 2014 |
Figure 2Schematic of the Process of Phenotype-Based Methods
Figure 3Schematic of the Process of Performance Evaluation
(A) Performance evaluation of a simulated patient-based method. (B) Performance evaluation of a term-category-based method. (C) Performance evaluation of a benchmark-data-based method.
Figure 4Performance Evaluation Using a Benchmark-Data-Based Strategy
(A) ROC curve for one of the 100 iterations using disease-related genes from GeneRIF. (B) The average AUC from 100 iterations using disease-related genes from GeneRIF. (C) ROC curve for one of the 100 iterations using disease-related genes from SIDD. (D) The average AUC from 100 iterations using disease-related genes from SIDD.
Summary of Disease Similarity Tools
| Author(s) | Name | Type | Web Site | Vocabulary | PMID | Year |
|---|---|---|---|---|---|---|
| van Driel et al. | MimMiner | webpage | OMIM | 2006 | ||
| Robinson et al. | Phenomizer | webpage | OMIM | 2009 | ||
| Wang et al. | MISIM | webpage | MeSH | 2010 | ||
| Li et al. | DOSim | R package | DO | 2011 | ||
| Hoehndorf et al. | NA | webpage | OMIM | 2015 | ||
| Hamaneh and Yu | DeCoaD | webpage | DO | 2015 | ||
| Deng et al. | HPOSim | R package | OMIM | 2015 | ||
| Yu et al. | DOSE | R package | DO | 2015 | ||
| Cheng et al. | DisSim | webpage | DO | 2016 | ||
| Cheng et al. | DisSetSim | webpage | DO | 2017 | ||
| Cheng et al. | DincRNA | webpage | DO | 2018 |
Figure 5Distribution of Disease Terms in DO, MeSH, and OMIM