| Literature DB >> 25153931 |
Desi Shang1, Chunquan Li2, Qianlan Yao1, Haixiu Yang1, Yanjun Xu1, Junwei Han1, Jing Li1, Fei Su1, Yunpeng Zhang1, Chunlong Zhang1, Dongguo Li3, Xia Li1.
Abstract
Identification of key metabolites for complex diseases is a challenging task in today's medicine and biology. A special disease is usually caused by the alteration of a series of functional related metabolites having a global influence on the metabolic network. Moreover, the metabolites in the same metabolic pathway are often associated with the same or similar disease. Based on these functional relationships between metabolites in the context of metabolic pathways, we here presented a pathway-based random walk method called PROFANCY for prioritization of candidate disease metabolites. Our strategy not only takes advantage of the global functional relationships between metabolites but also sufficiently exploits the functionally modular nature of metabolic networks. Our approach proved successful in prioritizing known metabolites for 71 diseases with an AUC value of 0.895. We also assessed the performance of PROFANCY on 16 disease classes and found that 4 classes achieved an AUC value over 0.95. To investigate the robustness of the PROFANCY, we repeated all the analyses in two metabolic networks and obtained similar results. Then we applied our approach to Alzheimer's disease (AD) and found that a top ranked candidate was potentially related to AD but had not been reported previously. Furthermore, our method was applicable to prioritize the metabolites from metabolomic profiles of prostate cancer. The PROFANCY could identify prostate cancer related-metabolites that are supported by literatures but not considered to be significantly differential by traditional differential analysis. We also developed a freely accessible web-based and R-based tool at http://bioinfo.hrbmu.edu.cn/PROFANCY.Entities:
Mesh:
Year: 2014 PMID: 25153931 PMCID: PMC4143229 DOI: 10.1371/journal.pone.0104934
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic of the PROFANCY. We firstly reconstruct metabolic networks based on the structure data from KEGG or EHMN database and add functional pathway nodes in this metabolic network.
We then map the known disease metabolites (seed nodes) and candidate metabolites into the above network. After that, we extend random walk with restart (RWR) method to this network. Finally, we can rank the candidate metabolites according to the steady probability of RWR.
Figure 2The ROC curve of 7 disease classes and all diseases with or without functional pathway nodes in two metabolic networks.
The AUC value of 16 disease classes.
| Disease Class | AUC value | |||
| KEGG without FPN | PROFANCY in KEGG | EHMN without FPN | PROFANCY in EHMN | |
| Metabolic | 0.935 | 0.957 | 0.92 | 0.936 |
| Neurological | 0.882 | 0.903 | 0.803 | 0.861 |
| Cardiovascular | 0.909 | 0.846 | 0.861 | 0.84 |
| Endocrine | 0.868 | 0.865 | 0.832 | 0.83 |
| Immunological | 0.832 | 0.961 | 0.856 | 0.909 |
| Muscular | 0.988 | 0.973 | 0.293 | 0.797 |
| Psychiatric | 0.831 | 0.873 | 0.706 | 0.85 |
| Cancer | 0.885 | 0.882 | 0.722 | 0.788 |
| Connective tissue | 0.859 | 0.838 | 0.794 | 0.793 |
| Developmental | 0.488 | 0.578 | 0.466 | 0.556 |
| Gastrointestinal | 0.788 | 0.821 | 0.774 | 0.948 |
| Multiple | 0.7 | 0.632 | 0.712 | 0.776 |
| Respiratory | 0.999 | 0.999 | 0.974 | 0.991 |
| Renal | 0.934 | 0.936 | 0.879 | 0.881 |
| Nutritional | 0.681 | 0.639 | 0.721 | 0.684 |
| Hematological | 0.343 | 0.425 | 0.809 | 0.612 |
| All | 0.88 | 0.895 | 0.824 | 0.871 |
FPN = functional pathway nodes.
Figure 3Top ranked candidate metabolites of Alzheimer's disease in the KEGG metabolic network.
The top ranked candidate metabolites of AD are showed in the KEGG metabolic network. The gray, blue and red nodes represent candidate metabolites, known metabolites (seed nodes) and top 1% ranked candidates, respectively. The black boxes represent 6 candidates which ranked in top 10 and their connected functional pathway nodes in both metabolic networks. The right large box shows the pathway of “Valine, leucine and isoleucine degradation” which includes the metabolite (S)-Methylmalonate semialdehyde (arrow pointed).
Top ranked Alzheimer's disease related metabolites by PROFANCY.
| KEGG ID | Metabolites name | KEGG rank | EHMN rank | Reference |
| C06002 | (S)-Methylmalonate semialdehyde | 1 | 5 | |
| C00097 | L-cysteine | 2 | 3 |
|
| C00124 | D-galactose | 3 | 2 |
|
| C00099 | Beta-alanine | 4 | 10 |
|
| C00101 | Tetrahydrofolate | 7 | 9 |
|
| C00259 | L-arabinose | 10 | 6 |
|
Top ranked prostate cancer candidate metabolites by PROFANCY.
| KEGG ID | name | KEGG rank | EHMN rank | P-value (N vs T) | P-value (N vs M) | Reference |
| C00794 | Sorbitol | 1 | 3 | 0.731925 | 0.001756 |
|
| C00137 | myo-Inositol | 2 | 1 | 0.037338 | 1.65E-07 |
|
| C00031 | Glucose | 3 | 6 | 0.02918 | 0.013403 |
|
| C00049 | Aspartate | 4 | 2 | 0.066092 | 0.015197 |
|
| C00095 | Fructose | 7 | 7 | 0.059253 | 0.003321 |
|
| C00064 | Glutamine | 8 | 8 | 0.396531 | 0.333774 |
|
N vs T: normal samples vs localized cancer samples; N vs M: normal samples vs metastatic cancer samples.
Figure 4Cluster analyses of top 30 ranked prostate cancer candidate metabolites of from PROFANCY.
Unsupervised hierarchical clustering of top 30 ranked candidate metabolites (columns) and samples (rows) is performed, and a heat map was generated. “N”, “T” and “M” represented the benign, clinically localized prostate cancer, or metastatic cancer samples, respectively. The top 30 ranked metabolites are from (A) KEGG metabolic network and (B) EHMN metabolic network.