| Literature DB >> 23029334 |
Yu-Fei Gao1, Lei Chen, Yu-Dong Cai, Kai-Yan Feng, Tao Huang, Yang Jiang.
Abstract
Metabolic pathway analysis, one of the most important fields in biochemistry, is pivotal to understanding the maintenance and modulation of the functions of an organism. Good comprehension of metabolic pathways is critical to understanding the mechanisms of some fundamental biological processes. Given a small molecule or an enzyme, how may one identify the metabolic pathways in which it may participate? Answering such a question is a first important step in understanding a metabolic pathway system. By utilizing the information provided by chemical-chemical interactions, chemical-protein interactions, and protein-protein interactions, a novel method was proposed by which to allocate small molecules and enzymes to 11 major classes of metabolic pathways. A benchmark dataset consisting of 3,348 small molecules and 654 enzymes of yeast was constructed to test the method. It was observed that the first order prediction accuracy evaluated by the jackknife test was 79.56% in identifying the small molecules and enzymes in a benchmark dataset. Our method may become a useful vehicle in predicting the metabolic pathways of small molecules and enzymes, providing a basis for some further analysis of the pathway systems.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23029334 PMCID: PMC3448724 DOI: 10.1371/journal.pone.0045944
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Distribution of 3,348 small molecules and 654 enzymes of yeast in the 11 metabolic pathway classes.
| Tag | Metabolic pathway class | Number of small molecules | Number of enzymes | Total |
|
| Carbohydrate Metabolism | 394 | 198 | 592 |
|
| Energy Metabolism | 151 | 146 | 297 |
|
| Lipid Metabolism | 399 | 84 | 483 |
|
| Nucleotide Metabolism | 133 | 107 | 240 |
|
| Amino Acid Metabolism | 489 | 158 | 647 |
|
| Metabolism of Other Amino Acids | 156 | 44 | 200 |
|
| Glycan Biosynthesis and Metabolism | 47 | 18 | 65 |
|
| Metabolism of Cofactors and Vitamins | 350 | 87 | 437 |
|
| Metabolism of Terpenoids and Polyketides | 507 | 18 | 525 |
|
| Biosynthesis of Other Secondary Metabolites | 509 | 17 | 526 |
|
| Xenobiotics Biodegradation and Metabolism | 709 | 21 | 730 |
| Total | – | 3,844 | 898 | 4,742 |
Figure 1The number of small molecules against the number of pathway classes.
Figure 2The number of enzymes against the number of pathway classes.
The interactive compounds and proteins of C07277 and YLL058W.
| Row index | Compound/Enzyme | Compound/Enzyme | Interaction confidence score | Tag of metabolic pathway class |
| 1 | C07277 | C00103 | 409 |
|
| 2 | C07277 | C00363 | 441 |
|
| 3 | C07277 | C00507 | 416 |
|
| 4 | C07277 | C03319 | 446 |
|
| 5 | C07277 | C11912 | 63 |
|
| 6 | C07277 | YDL055C | 298 |
|
| 7 | YLL058W | C00087 | 317 |
|
| 8 | YLL058W | C00109 | 900 |
|
| 9 | YLL058W | C00155 | 900 |
|
| 10 | YLL058W | C00283 | 317 |
|
| 11 | YLL058W | C00542 | 904 |
|
| 12 | YLL058W | C01077 | 900 |
|
| 13 | YLL058W | C02291 | 900 |
|
| 14 | YLL058W | C05688 | 900 |
|
| 15 | YLL058W | C05699 | 900 |
|
| 16 | YLL058W | YAL012W | 463 |
|
| 17 | YLL058W | YGL184C | 241 |
|
The information in this column represents the metabolic pathway classes of the compound or enzyme in column 3.
The likelihood of C07277 and YLL058W belonging to each pathway class.
| Test sample | Likelihood for each pathway class | Remark |
| C07277 |
| Sum of confidence scores in row 1,3,6 of |
|
| – | |
|
| – | |
|
| Sum of confidence scores in row 2 of | |
|
| – | |
|
| – | |
|
| – | |
|
| – | |
|
| Sum of confidence scores in row 1,4,5 of | |
|
| Sum of confidence scores in row 1,4 of | |
|
| – | |
| YLL058W |
| Sum of confidence scores in row 8 of |
|
| Sum of confidence scores in row 7,9,10,11,12,16,17 of | |
|
| – | |
|
| – | |
|
| Sum of confidence scores in row 8,9,10,12,13,16,17 of | |
|
| Sum of confidence scores in row 14,15 of | |
|
| – | |
|
| – | |
|
| – | |
|
| – | |
|
| – |
The information in this column shows the means by which the likelihood in column 2 was calculated by using the data in Table 2.
The prediction accuracies obtained by our method for small molecules, enzymes, and all samples.
| Prediction order | Prediction accuracy forsmall molecules (S | Prediction accuracy forenzymes (S | Prediction accuracy fortotal samples S = S |
| 1 | 77.12% | 92.05% | 79.56% |
| 2 | 19.12% | 22.48% | 19.67% |
| 3 | 7.38% | 10.55% | 7.90% |
| 4 | 3.61% | 4.13% | 3.70% |
| 5 | 2.75% | 4.13% | 2.97% |
| 6 | 1.40% | 1.83% | 1.47% |
| 7 | 0.96% | 0.76% | 0.92% |
| 8 | 0.51% | 0.76% | 0.55% |
| 9 | 0.45% | 0.61% | 0.47% |
| 10 | 0.30% | 0.00% | 0.25% |
| 11 | 0.15% | 0.00% | 0.12% |
Figure 3Three curves showing the changes of proportions of interactions contributing to the prediction when increasing the confidence score, where the chemical-chemical curve addresses chemical-chemical interactions, chemical-protein curve chemical-protein interactions, protein-protein curve protein-protein interactions.
The X-axis is the confidence score. The Y-axis is the proportion of interactions contributing to the prediction. Generally, chemical-protein curve and protein-protein curve are ascending with the increase of confidence score, while chemical-chemical curve remains at a low level for low confidence scores and starts to increase quickly for high confidence scores.
The distribution of samples with incorrect 1-st order predicted pathway class in 11 pathway classes.
| Metabolic pathway class | Number of misclassified samples |
| Carbohydrate Metabolism | 105 |
| Energy Metabolism | 32 |
| Lipid Metabolism | 79 |
| Nucleotide Metabolism | 26 |
| Amino Acid Metabolism | 146 |
| Metabolism of Other Amino Acids | 79 |
| Glycan Biosynthesis and Metabolism | 21 |
| Metabolism of Cofactors and Vitamins | 107 |
| Metabolism of Terpenoids and Polyketides | 107 |
| Biosynthesis of Other Secondary Metabolites | 95 |
| Xenobiotics Biodegradation and Metabolism | 113 |
| Total | 910 |
The value in this cell is larger than the total number of samples with incorrect 1-st order prediction because some samples belong to more than one pathway class.
Interactive compounds and enzymes of C00439 in pathway classes M 5 and M 8.
| Index | Interactive compounds and enzymes in | Interactive compounds and enzymes in | ||
| Compound/Enzyme | Confidence score | Compound/Enzyme | Confidence score | |
| 1 | C01045 | 940 | C00101 | 934 |
| 2 | C00785 | 938 | C00445 | 927 |
| 3 | C00101 | 934 | C00025 | 923 |
| 4 | C00025 | 923 | C00001 | 899 |
| 5 | C00014 | 901 | C00018 | 899 |
| 6 | C03680 | 899 | C00664 | 899 |
| 7 | C01817 | 511 | C03479 | 899 |
| 8 | C05568 | 388 | C14818 | 899 |
| 9 | C00135 | 302 | C14819 | 899 |
| 10 | C00073 | 283 | C00504 | 739 |
| 11 | C02170 | 191 | C00234 | 438 |
| 12 | – | – | C00992 | 378 |
| 13 | – | – | C00440 | 205 |
| 14 | – | – | YGL125W | 177 |
| Likelihood | – | 7,210 | – | 10,115 |