| Literature DB >> 27454118 |
Yonghyun Nam1, Myungjun Kim1, Kyungwon Lee2, Hyunjung Shin3.
Abstract
BACKGROUND: The study on disease-disease association has been increasingly viewed and analyzed as a network, in which the connections between diseases are configured using the source information on interactome maps of biomolecules such as genes, proteins, metabolites, etc. Although abundance in source information leads to tighter connections between diseases in the network, for a certain group of diseases, such as metabolic diseases, the connections do not occur much due to insufficient source information; a large proportion of their associated genes are still unknown. One way to circumvent the difficulties in the lack of source information is to integrate available external information by using one of up-to-date integration or fusion methods. However, if one wants a disease network placing huge emphasis on the original source of data but still utilizing external sources only to complement it, integration may not be pertinent. Interpretation on the integrated network would be ambiguous: meanings conferred on edges would be vague due to fused information.Entities:
Mesh:
Year: 2016 PMID: 27454118 PMCID: PMC4959382 DOI: 10.1186/s12911-016-0315-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Proposed Method: a original network with disconnected nodes, and b complemented network that links the disconnected nodes to the connected network through newly found edges using external information
Fig. 2Schematic description of CLASH Algorithm
Fig. 3Pseudo Code of CLASH Algorithm
Data sources for metabolic diseases, proteins, disease-protein associations, comorbidity
| Disease | Original source data | External source data | ||
|---|---|---|---|---|
| Protein | Disease-Protein association | Comorbidity literature | ||
| Number of data | 181 out of 302 | 15,281 out of 30,634 | 53,430 relations | 6518 out of 1,001,254 |
| Sources | Medical subject headings 2014 | Comparative Toxicogenomics Database (CTD) | PubMed (05-01-31 ~ 15-03-31) | |
Fig. 4Results for Complementing Ability of CLASH: a shows that the proportion of edges have been recovered by 18 %, on average. b shows that CLASH improves AUC performance up to 0.79. The p-values for statistical tests for pairwise comparison between %-damaged original network and complemented network are 0.0002, 0.0001, 0.0002 and 0.000, respectively. On the other hand, CLASH is robust to noise: the noisy networks incurred insignificant degradation or no change in performance to %-damaged networks, preserving the original information
Fig. 5Utility of CLASH by demonstrating the process for the malabsorption syndrome: CLASH algorithm complements the network with four recovered edges and four newly found ones. Therefore, malabsorption syndrome extends its associations with more diseases, hyperhomocysteinemia, hypoglycemia, osteomalacia and insulin resistance, apart from the originally connected four diseases. Single solid lines refer to extended edges and double lines refer to original edges. Also notations ‘†’, ‘*’ and ‘**’ denotes associated diseases via PPI, PubMed and multiple paths involving more than one edge, respectively
Fig. 6Top tier ranked up to 10th associated diseases with malabsorption syndrome: Notations ‘†’, ‘*’ and ‘**’ are identical to those in Fig. 5
Top tier ranked up to 10th associated diseases
| Target Disease | Associated via Reference Network | Associated via Complemented Network | ||
|---|---|---|---|---|
| Celiac Disease | Diabetes Mellitus | Calcinosis | Diabetes Mellitus†
| Malabsorption Syndromes* |
| Lactose Intolerance | Diabetes Mellitus | Hyperinsulinism | Mucolipidoses* | Diabetes Mellitus†
|
| Hypophosphatasia | Metabolic Syndrome X | Amyotrophic Lateral Sclerosis | Acidosis, Renal Tubular* | Metabolism, Inborn Errors* |
| Refsum Disease | Zellweger Syndrome | Porphyrias | Neuronal Ceroid-Lipofuscinoses* | Leigh Disease* |
| Fanconi | Hypophosphatemia | Osteomalacia | Celiac Disease* | Diabetes, Gestational* |
| Menkes Kinky Hair Syndrome | N/A | Congenital Disorders of Glycosylation* | Refsum Disease** | |
| Pyruvate Carboxylase Deficiency | N/A | Acidosis, Renal Tubular* | Amino Acid Metabolism, Inborn Errors* | |
| Rothmund-Thomson Syndrome | N/A | DNA Repair-Deficiency Disorders* | Hypercalcemia* | |
| Sphingolipidoses | N/A | Neuronal Ceroid-Lipofuscinoses* | Zellweger Syndrome** | |
| Alkaptonuria | N/A | Carbohydrate Metabolism, Inborn Errors* | Amino Acid Metabolism, Inborn Errors* | |
Notations ‘†’, ‘*’ and ‘**’ are identical to those in Fig. 5