| Literature DB >> 16753070 |
S June Oh1, Je-Gun Joung, Jeong-Ho Chang, Byoung-Tak Zhang.
Abstract
BACKGROUND: To infer the tree of life requires knowledge of the common characteristics of each species descended from a common ancestor as the measuring criteria and a method to calculate the distance between the resulting values of each measure. Conventional phylogenetic analysis based on genomic sequences provides information about the genetic relationships between different organisms. In contrast, comparative analysis of metabolic pathways in different organisms can yield insights into their functional relationships under different physiological conditions. However, evaluating the similarities or differences between metabolic networks is a computationally challenging problem, and systematic methods of doing this are desirable. Here we introduce a graph-kernel method for computing the similarity between metabolic networks in polynomial time, and use it to profile metabolic pathways and to construct phylogenetic trees.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16753070 PMCID: PMC1534063 DOI: 10.1186/1471-2105-7-284
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Statistics for the dataset according to the number of enzymes and their relationships
| enzyme | relation | |
| # of total occurrences | 35,134 | 17,567 |
| # of unique elements | 218 | 1,275 |
| max # per organism | 544 | 123 |
| min # per organism | 46 | 26 |
| avg # per organism | 68 | 217 |
| stdev across organisms | 26 | 133 |
The nine reference pathways used in the analysis
| MAP No. (KEGG) | pathway name |
| 00010 | glycolysis/gluconeogenesis |
| 00020 | citrate cycle (TCA cycle) |
| 00030 | pentose phosphate pathway |
| 00051 | fructose and mannose metabolism |
| 00052 | galactose metabolism |
| 00620 | pyruvate metabolism |
| 00630 | glyoxylate and dicarboxylate metabolism |
| 00640 | propanoate metabolism |
| 00650 | butanoate metabolism |
The 81 organisms included in the phylogenetic analysis. Full scientific names were abbreviated into three character notation (Abbr.) and their domain informations in phylogeny were also represented in single character that are Eubacteria (B), Archaea (A) and Eukaryota (E), respectively.
| Abbr. | Domain | Organism | Abbr. | Domain | Organism |
| Aae | B | Mth | A | ||
| Ana | B | Mtu | B | ||
| Atc | B | Nma | B | ||
| Ath | E | Nme | B | ||
| Atu | B | Oih | B | ||
| Bha | B | Pab | A | ||
| Bme | B | Pae | B | ||
| Bsu | B | Pai | A | ||
| Cac | B | Pfu | A | ||
| Ccr | B | Pho | A | ||
| Cel | E | Pmu | B | ||
| Cje | B | Rno | E | ||
| Cmu | B | Rso | B | ||
| Cpa | B | Sam | B | ||
| Cpe | B | Sau | B | ||
| Cpj | B | Sav | B | ||
| Cpn | B | Sce | E | ||
| Cte | B | Sco | B | ||
| Ctr | B | Sme | B | ||
| Dme | E | Spg | B | ||
| Dra | B | Spm | B | ||
| Ece | B | Spo | E | ||
| Ecj | B | Spy | B | ||
| Eco | B | Sso | A | ||
| Ecs | B | Stm | B | ||
| Fnu | B | Sto | A | ||
| Hal | A | Sty | B | ||
| Hin | B | Syn | B | ||
| Hpj | B | Tac | A | ||
| Hpy | B | Tel | B | ||
| Hsa | E | Tma | B | ||
| Lin | B | Tpa | B | ||
| Lla | B | Tte | B | ||
| Lmo | B | Tvo | A | ||
| Mac | A | Vch | B | ||
| Mja | A | Xax | B | ||
| Mle | B | Xca | B | ||
| Mlo | B | Xfa | B | ||
| Mma | A | Ype | B | ||
| Mmu | E | Ypk | B | ||
| Mtc | B |
Figure 1The distribution of enzymes in nine reference pathways. The x axis is the index of the maximum value of the order sorted by the frequency of enzyme over all pathways. All plots showed similar distributions.
Figure 2Comparison of phylogenetic trees constructed using (a) the kernel-based method and (b) the alignments of the following two enzyme sequences, together: phosphoglycerate kinase (EC 2.7.2.3) and phosphopyruvate hydratase (EC 4.2.1.11). The amino acid sequences of corresponding enzymes were retrieved from the GenBank database and analyzed by CLUSTAL with default parameters. The resultant trees were viewed with the TREEVIEW program [43].
Figure 3Current classification of biological taxonomy. The tree was reconstructed from part of the data in the NCBI (National Center for Biotechnology Information) Taxonomy [44] and viewed with the TREEVIEW program [43].
Comparison of similarity scores with respect to NCBI taxonomy for 65 organisms with the glycolysis pathway (β = 0.8).
| Method | Similarity score |
| Our method | 0.196 |
| [11] | 0.154 |
Figure 4The procedure for data processing for phylogenetic tree construction from the metabolic networks.
Figure 5The simple concept for computing the similarity between two metabolic networks using the kernel method.