| Literature DB >> 31722666 |
Yonghyun Nam1, Dong-Gi Lee1, Sunjoo Bang1, Ju Han Kim2, Jae-Hoon Kim3, Hyunjung Shin4.
Abstract
BACKGROUND: The recent advances in human disease network have provided insights into establishing the relationships between the genotypes and phenotypes of diseases. In spite of the great progress, it yet remains as only a map of topologies between diseases, but not being able to be a pragmatic diagnostic/prognostic tool in medicine. It can further evolve from a map to a translational tool if it equips with a function of scoring that measures the likelihoods of the association between diseases. Then, a physician, when practicing on a patient, can suggest several diseases that are highly likely to co-occur with a primary disease according to the scores. In this study, we propose a method of implementing 'n-of-1 utility' (n potential diseases of one patient) to human disease network-the translational disease network.Entities:
Keywords: Comorbidity; Disease network; Disease scoring; Protein interaction; Semi-supervised learning
Mesh:
Year: 2019 PMID: 31722666 PMCID: PMC6854734 DOI: 10.1186/s12859-019-3106-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Data for diseases, disease-protein relationships, and protein-protein interactions, and literature for comorbidity analysis: the number in parentheses indicates the amount of data originating from the respective sources. See also Additional file 1: Appendix A and C
| Metabolic diseases | Disease-protein relationship | Protein-protein interaction | Comorbidity analysis | |
|---|---|---|---|---|
| Data Sources | MeSH The Medical Subject Headings | CTD (7624) ver. 2014/07/11 Comparative Toxicogenomics Database, GAD (34,773) ver.2013/07/16 Genetic Association Database, OMIM (4078) ver. 2014/03/10 Online Mendelian Inheritance in Man, PharmGKB (6610) ver. 2015/07/20 The Pharmacogenomics Knowledge Base, TTD (395) ver. 2013/07/04 Therapeutic Target Database, | DIP (773) ver. 2014/10/10 Database of Interacting Proteins, Entrez Gene (58,778) ver. 2014/07/20 MINT (736) ver. 2014/07/08 Molecular Interaction Database, PharmGKB (507) ver. 2014/07/20 The Pharmacogenomics Knowledge Base, | PubMed Literature US National Library of Medicine National Institutes of Health |
| Number of Data | 181 out of 302 metabolic diseases | 53,480 relations between 2411 diseases and 7733 proteins | 60,794 interactions of 15,281 proteins | 62 pairs of 55 diseases |
Fig. 5The proposed model: a a method of constructing a disease network based on a q-step walk on the protein-protein interaction (PPI) network and b a scoring model for calculating the scores of disease co-occurrence when a specific disease is given
Fig. 6Associated diseases of DA by q-step walks on the PPI network in the case of a q=0, b q=1, c q=2 and d q=3
Fig. 1Changes in network density by a q-step walk on the PPI network: The amount of connections for disease-disease association increases as the step size q increases. With rich connections among diseases, the chances of having inferences for most diseases, including rare diseases, are increased because it enables us to use the information propagated from other diseases through the connections
Fig. 2Disease network comparison: A small subset of diseases belonging to the metabolic diseases category in MeSH is selected. The nodes in the graph represent the eight selected diseases, and the width of an edge indicates connection strength proportional to the number of shared proteins between two diseases. In Goh et al.’s approach, there is only a single edge connecting two disease nodes, while the remaining six are disconnected. In contrast, in PPI(0~, all eight disease nodes are connected, and each has a high degree of node connectivity with various connection strengths
Fig. 3AUC comparison from q = 0 to 10 for integrated networks: PPI(0~. The experiment was repeated 55 times, and the average AUC value with the standard deviation is presented as a circle with an error bar. PPI(0), PPI(1), PPI(2), and PPI(3) are individual disease networks constructed as described in Methods section. PPI(0~q) s are integrated networks from Eq. (6). PPI(0) corresponds to the existing disease network suggested by Goh et al. [21]. A randomized network is added to our experiment to obtain a reference performance. The best performance was achieved by PPI(0~3), and p-values of the pairwise t-tests are shown in the bottom of the plot
Fig. 4Probabilities of the diseases associated with T2DM: The solid line represents the probability values of the 180 diseases. Shown on the line with open circles are the locations of 13 diseases comorbid with diabetes mellitus type II. The probability values support the knowledge based on real medical practice and vice versa