| Literature DB >> 31760947 |
Yongtian Wang1, Liran Juan2, Jiajie Peng3, Tianyi Zang4, Yadong Wang5.
Abstract
BACKGROUND: As the terminal products of cellular regulatory process, functional related metabolites have a close relationship with complex diseases, and are often associated with the same or similar diseases. Therefore, identification of disease related metabolites play a critical role in understanding comprehensively pathogenesis of disease, aiming at improving the clinical medicine. Considering that a large number of metabolic markers of diseases need to be explored, we propose a computational model to identify potential disease-related metabolites based on functional relationships and scores of referred literatures between metabolites. First, obtaining associations between metabolites and diseases from the Human Metabolome database, we calculate the similarities of metabolites based on modified recommendation strategy of collaborative filtering utilizing the similarities between diseases. Next, a disease-associated metabolite network (DMN) is built with similarities between metabolites as weight. To improve the ability of identifying disease-related metabolites, we introduce scores of text mining from the existing database of chemicals and proteins into DMN and build a new disease-associated metabolite network (FLDMN) by fusing functional associations and scores of literatures. Finally, we utilize random walking with restart (RWR) in this network to predict candidate metabolites related to diseases.Entities:
Keywords: Collaborative filtering; Metabolite network; Random walking with restart; Similarity of metabolites
Mesh:
Year: 2019 PMID: 31760947 PMCID: PMC6876110 DOI: 10.1186/s12859-019-3127-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The flow chart of building FLDMN to identify potential disease-related metabolites
Fig. 2The workflow of calculating metabolite similarity
Fig. 3The quantities distribution of metabolite associations with different similarities
Statistics for evaluating disease-related metabolite network
| Disease name | Disease Ontology | Test node | Positive group | Version 2017 | Version 2018 |
|---|---|---|---|---|---|
| L-2-hydroxyglutaric aciduria | DOID:0050574 | 2 | 4 | 2 | 4 |
| medium chain acyl-CoA dehydrogenase deficiency | DOID:0080153 | 1 | 16 | 15 | 16 |
| short chain acyl-CoA dehydrogenase deficiency | DOID:0080154 | 1 | 4 | 3 | 4 |
| Crohn’s colitis | DOID:0060192 | 5 | 8 | 3 | 16 |
| cerebrotendinous xanthomatosis | DOID:4810 | 5 | 7 | 2 | 9 |
| maple syrup urine disease | DOID:9269 | 6 | 23 | 17 | 24 |
| abetalipoproteinemia | DOID:1386 | 3 | 4 | 1 | 4 |
| celiac disease | DOID:10608 | 11 | 22 | 11 | 82 |
| methylmalonic acidemia | DOID:14749 | 1 | 2 | 1 | 2 |
| irritable bowel syndrome | DOID:9778 | 5 | 7 | 2 | 15 |
| Fanconi syndrome | DOID:1062 | 1 | 2 | 1 | 5 |
| citrullinemia | DOID:9273 | 6 | 8 | 2 | 8 |
| inflammatory bowel disease 1 | DOID:0110892 | 5 | 8 | 3 | 16 |
| isovaleric acidemia | DOID:14753 | 2 | 11 | 9 | 12 |
| type 2 diabetes mellitus | DOID:9352 | 1 | 27 | 27 | 27 |
| aromatic L-amino acid decarboxylase deficiency | DOID:0090123 | 2 | 9 | 7 | 12 |
| cholesterol ester storage disease | DOID:14502 | 1 | 2 | 1 | 2 |
| congenital adrenal hyperplasia | DOID:0050811 | 15 | 18 | 3 | 28 |
| Crohn’s disease | DOID:8778 | 5 | 8 | 3 | 16 |
Fig. 4Average AUC of three metabolite networks. The average AUC of FLDMN reaches 76.03%, while the average AUC of ST_SUBNET is 62.3% and DMN has an average AUC value of 76.03%
Fig. 5Performances of three metabolite networks to predict candidate metabolites related with a given disease. ST_SUBNET, DMN and FLDMN are utilized to predict candidate metabolites for each of these 19 diseases, respectively. For a given disease, the three bars with different colours represent average AUC of the three metabolite networks, correspondingly
Fig. 6ROC of L-Threonine based on FLDMN
Fig. 7Average AUC of DMN with different thresholds