| Literature DB >> 27936108 |
Yiran Huang1,2, Cheng Zhong2, Hai Xiang Lin3, Jing Huang4.
Abstract
Metabolic pathway alignment has been widely used to find one-to-one and/or one-to-many reaction mappings to identify the alternative pathways that have similar functions through different sets of reactions, which has important applications in reconstructing phylogeny and understanding metabolic functions. The existing alignment methods exhaustively search reaction sets, which may become infeasible for large pathways. To address this problem, we present an effective alignment method for accurately extracting reaction mappings between two metabolic pathways. We show that connected relation between reactions can be formalized as binary relation of reactions in metabolic pathways, and the multiplications of zero-one matrices for binary relations of reactions can be accomplished in finite steps. By utilizing the multiplications of zero-one matrices for binary relation of reactions, we efficiently obtain reaction sets in a small number of steps without exhaustive search, and accurately uncover biologically relevant reaction mappings. Furthermore, we introduce a measure of topological similarity of nodes (reactions) by comparing the structural similarity of the k-neighborhood subgraphs of the nodes in aligning metabolic pathways. We employ this similarity metric to improve the accuracy of the alignments. The experimental results on the KEGG database show that when compared with other state-of-the-art methods, in most cases, our method obtains better performance in the node correctness and edge correctness, and the number of the edges of the largest common connected subgraph for one-to-one reaction mappings, and the number of correct one-to-many reaction mappings. Our method is scalable in finding more reaction mappings with better biological relevance in large metabolic pathways.Entities:
Mesh:
Year: 2016 PMID: 27936108 PMCID: PMC5148114 DOI: 10.1371/journal.pone.0168044
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A part of lysine biosynthesis pathway.
The square rectangles represent reactions. The compounds are depicted by small circles. Reactions are represented by their KEGG identifiers. Plants use the upper path with a reaction, whereas H. sapiens (human) accomplishes this transformation through the lower path with three reactions.
Fig 2Binary relation of reactions in the metabolic pathway of lysine biosynthesis.
The square rectangles represent reactions. The compounds are depicted by small circles. Reactions are represented by their KEGG identifiers. The directed edge from reactions r to r denotes that at least one output compound of r is an input compound of r. R is the binary relation of reactions in the metabolic pathway of lysine biosynthesis. (a) The metabolic pathway of lysine biosynthesis. (b) Directed graph for R. (c) Zero-one matrix M for R. (d) Zero-one matrix M2.
Fig 3Overview of the MPBR method.
MPBR searches 1-to-3 reaction mappings between G′ and G. M and M′ are zero-one matrices for binary relation of reactions in G and G′ respectively. The size of reaction set RS′ in G′ is m = 1, x = 1,2,3,4. The size of reaction set RS in G is n = 3, y = 1,2,3,4,5. T(r1,r1′), T(r2,r2′), T(r3,r3′) and T(r4,r4′) are the topological similarities between the reactions in G and G′ respectively, the values of B are the similarities between the reactions in G and G′.
EC and NC of one-to-one alignment results for eco-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 0.26 | 0.26 | ||||
| 1.2 | 0.00 | 0.00 | ||||
| 1.3 | 0.58 | 0.00 | 0.64 | 0.00 | ||
| 1.4 | 0.51 | 0.00 | 0.77 | 0.00 | ||
| 1.5 | 0.91 | 0.91 | 0.76 | 0.76 | ||
| 1.6 | 0.67 | 0.00 | 0.76 | 0.00 | ||
| 1.7 | 0.64 | 0.65 | 0.69 | 0.69 | ||
| 1.8 | 0.58 | 0.00 | 0.61 | 0.00 | ||
| 1.9 | 0.70 | 0.00 | 0.66 | 0.00 | ||
| 1.1 | 0.87 | 0.00 | 0.80 | 0.00 | ||
| 1.11 | ||||||
| 1.12 | 0.67 | 0.65 | 0.73 | 0.74 | ||
| 1.13 | 0.76 | 0.76 | 0.84 | 0.84 | ||
| 1.14 | 0.47 | 0.37 | 0.65 | 0.58 | ||
The best performer for the relative item is marked in bold.
C-1tomany and CR-1tomany of one-to-many alignment results for mmu-eco.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | * | * | ||||
| 1.2 | ||||||
| 1.3 | 33 | * | 0.01 | * | ||
| 1.4 | 2 | * | * | |||
| 1.5 | 19 | * | 0.45 | * | ||
| 1.6 | 22 | * | 0.14 | * | ||
| 1.7 | 9 | 7 | 0.20 | 0.16 | ||
| 1.8 | 1 | * | 0.07 | * | ||
| 1.9 | ||||||
| 1.10 | 10 | * | 0.15 | * | ||
| 1.11 | 1 | 0 | 0.50 | 0.00 | ||
| 1.12 | * | * | * | * | ||
| 1.13 | 13 | 0 | 0.13 | 0.00 | ||
| 1.14 | 8 | 1 | 0.57 | 0.13 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
ELCCS of one-to-one alignment results for eco-atc and hsa-mmu.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 5 | 4 | ||||
| 1.2 | 0 | 0 | ||||
| 1.3 | 751 | 0 | 750 | 750 | ||
| 1.4 | 73 | 0 | ||||
| 1.5 | 289 | 289 | ||||
| 1.6 | 364 | 0 | 508 | 508 | ||
| 1.7 | 143 | 141 | 175 | 176 | ||
| 1.8 | 37 | 0 | 50 | 54 | ||
| 1.9 | 0 | |||||
| 1.10 | 149 | 0 | ||||
| 1.11 | ||||||
| 1.12 | 2941 | 2944 | 2752 | 2753 | ||
| 1.13 | 247 | 256 | 292 | 0 | ||
| 1.14 | 92 | 95 | 204 | 199 | ||
The best performer for the relative item is marked in bold.
C-1tomany and CR-1tomany of one-to-many alignment results for eco-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | ||||||
| 1.2 | ||||||
| 1.3 | 34 | 0.34 | * | |||
| 1.4 | 1 | 0 | 0.07 | 0.00 | ||
| 1.5 | 30 | 5 | 0.44 | 0.50 | ||
| 1.6 | 21 | 0.66 | * | |||
| 1.7 | 20 | 14 | 0.07 | 0.42 | ||
| 1.8 | 4 | 1 | 0.33 | 0.20 | ||
| 1.9 | 0 | 0.00 | 0.25 | |||
| 1.10 | 17 | 2 | 0.28 | 0.11 | ||
| 1.11 | 1 | 0 | 0.17 | 0.00 | ||
| 1.12 | 112 | 0.23 | * | |||
| 1.13 | 16 | 6 | 0.62 | 0.35 | ||
| 1.14 | 9 | 6 | 0.82 | 0.50 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
C-1tomany and CR-1tomany of one-to-many alignment results for hsa-mmu.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 0 | 1 | 0.00 | 0.13 | ||
| 1.2 | ||||||
| 1.3 | 43 | * | 0.57 | * | ||
| 1.4 | 2 | 1 | 0.31 | 0.29 | ||
| 1.5 | 33 | * | 0.35 | * | ||
| 1.6 | 19 | * | 0.13 | * | ||
| 1.7 | 24 | 6 | 0.35 | 0.29 | ||
| 1.8 | 4 | 1 | 0.24 | 0.20 | ||
| 1.9 | ||||||
| 1.10 | 17 | * | 0.33 | * | ||
| 1.11 | 0 | 0 | 0.00 | 0.00 | ||
| 1.12 | 117 | * | 0.34 | * | ||
| 1.13 | 11 | * | 0.16 | * | ||
| 1.14 | 9 | 3 | 0.75 | 0.38 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
EC and NC of one-to-one alignment results for hsa-eco.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 0.16 | * | 0.45 | * | ||
| 1.2 | 0.14 | 0.00 | 0.18 | 0.29 | ||
| 1.3 | 0.48 | * | 0.54 | * | ||
| 1.4 | 0.26 | * | 0.34 | * | ||
| 1.5 | 0.31 | * | 0.26 | * | ||
| 1.6 | 0.72 | 0.00 | 0.78 | 0.00 | ||
| 1.7 | 0.27 | 0.25 | 0.35 | 0.37 | ||
| 1.8 | 0.53 | * | 0.57 | * | ||
| 1.9 | 0.06 | |||||
| 1.1 | 0.53 | 0.00 | 0.55 | 0.00 | ||
| 1.11 | 0.33 | 0.00 | 0.00 | |||
| 1.12 | 0.48 | 0.48 | 0.45 | 0.45 | ||
| 1.13 | 0.64 | 0.00 | 0.74 | 0.00 | ||
| 1.14 | 0.68 | 0.72 | 0.81 | |||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
ELCCS of one-to-one alignment results for mmu-atc and mmu-eco.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 4 | * | 7 | * | ||
| 1.2 | 1 | 2 | 2 | 2 | ||
| 1.3 | 725 | * | 711 | * | ||
| 1.4 | 20 | * | 33 | * | ||
| 1.5 | 174 | * | 162 | * | ||
| 1.6 | 376 | 390 | 497 | 0 | ||
| 1.7 | 155 | 131 | 148 | 150 | ||
| 1.8 | 41 | * | * | |||
| 1.9 | 1 | 0 | 2 | 0 | ||
| 1.10 | 143 | 0 | 134 | 0 | ||
| 1.11 | 3 | 0 | 0 | |||
| 1.12 | 2324 | 2330 | 2436 | 2453 | ||
| 1.13 | 258 | 276 | 301 | 0 | ||
| 1.14 | 100 | 103 | 198 | 198 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
C-1tomany and CR-1tomany of one-to-many alignment results for hsa-eco.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | * | * | ||||
| 1.2 | ||||||
| 1.3 | 31 | * | 0.01 | * | ||
| 1.4 | 2 | * | * | |||
| 1.5 | 20 | * | 0.44 | * | ||
| 1.6 | 21 | * | 0.16 | * | ||
| 1.7 | 13 | 8 | 0.30 | 0.19 | ||
| 1.8 | 2 | * | 0.14 | * | ||
| 1.9 | ||||||
| 1.10 | 11 | * | 0.15 | * | ||
| 1.11 | 1 | 0 | 0.50 | 0.00 | ||
| 1.12 | * | * | * | * | ||
| 1.13 | 11 | * | 0.13 | * | ||
| 1.14 | 7 | 2 | 0.54 | 0.22 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
The percentage of the solved instances and the average running time for the solved instances (in seconds).
| Methods | One-to-one alignment | One-to-many alignment | ||||
|---|---|---|---|---|---|---|
| MPBR | 100%(84/84) | 693.03 | 526.21 | 100%(84/84) | 199.65 | 232.45 |
| CAMPways | 100%(84/84) | 1595.59 | 1171.42 | 95%(80/84) | 499.98 | 518.56 |
| SubMAP | 76%(64/84) | 15.12 | - | 49%(41/84) | 299.1 | - |
“-” means that this item is not applicable for SubMAP.
C-manytomany of many-to-many alignment results of MPBR.
| 1.1 | 0 | 2 | 0 | 0 | 0 | 0 |
| 1.2 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1.3 | 23367 | 18420 | 19252 | 16145 | 16076 | 19085 |
| 1.4 | 62 | 59 | 82 | 16 | 16 | 82 |
| 1.5 | 5320 | 6271 | 5247 | 5323 | 5302 | 5221 |
| 1.6 | 5088 | 6434 | 8224 | 4277 | 4222 | 7522 |
| 1.7 | 764 | 1952 | 589 | 800 | 740 | 542 |
| 1.8 | 10 | 59 | 13 | 19 | 18 | 12 |
| 1.9 | 2 | 0 | 0 | 0 | 0 | 0 |
| 1.10 | 6063 | 14523 | 9628 | 9628 | 9628 | 9628 |
| 1.11 | 0 | 5 | 2 | 0 | 2 | 0 |
| 1.12 | 53398 | 60859 | 52767 | 41791 | 41395 | 50243 |
| 1.13 | 4115 | 4348 | 4177 | 3335 | 3335 | 4177 |
| 1.14 | 358 | 3733 | 3573 | 539 | 484 | 1322 |
Fig 4Sample alignments.
The upper reactions are a part of the pathways of atc, whereas the lower reactions are a part of the pathways of eco. Reactions are represented by their KEGG identifiers. Enzymes are shown in EC numbers. The compounds are depicted by small circles. (a) One-to-one reaction mappings extracted from the alignment of the metabolic pathways of lysine biosynthesis in atc and eco. (b) A one-to-many reaction mapping extracted from the alignment of the metabolic pathways of Glyoxylate and dicarboxylate metabolism in atc and eco. (c) A many-to-many reaction mapping extracted from the alignment of the metabolic pathways of Glycolysis in atc and eco.
EC and NC of one-to-one alignment results for hsa-mmu.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | ||||||
| 1.2 | 0.00 | 0.00 | ||||
| 1.3 | 0.97 | 0.97 | 0.97 | 0.98 | ||
| 1.4 | ||||||
| 1.5 | 0.92 | 0.92 | 0.91 | 0.93 | ||
| 1.6 | 0.51 | 0.61 | ||||
| 1.7 | 0.92 | 0.92 | 0.93 | 0.93 | ||
| 1.8 | 0.91 | 0.91 | ||||
| 1.9 | 0.68 | 0.89 | ||||
| 1.1 | 0.97 | 0.97 | 0.97 | |||
| 1.11 | ||||||
| 1.12 | 0.94 | 0.94 | 0.92 | 0.94 | ||
| 1.13 | 0.00 | 0.00 | ||||
| 1.14 | 0.91 | 0.88 | 0.99 | 0.97 | ||
The best performer for the relative item is marked in bold.
EC and NC of one-to-one alignment results for hsa-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 0.14 | * | 0.44 | * | ||
| 1.2 | 0.00 | 0.00 | 0.06 | 0.06 | ||
| 1.3 | 0.61 | * | 0.61 | * | ||
| 1.4 | 0.33 | * | 0.35 | * | ||
| 1.5 | 0.33 | * | 0.30 | * | ||
| 1.6 | 0.54 | 0.44 | 0.68 | 0.63 | ||
| 1.7 | 0.38 | 0.39 | 0.47 | 0.48 | ||
| 1.8 | 0.55 | * | 0.46 | * | ||
| 1.9 | 0.05 | |||||
| 1.1 | 0.51 | 0.00 | 0.56 | 0.00 | ||
| 1.11 | 0.25 | 0.00 | 0.00 | |||
| 1.12 | 0.46 | 0.47 | 0.45 | 0.45 | ||
| 1.13 | 0.69 | 0.67 | 0.76 | 0.75 | ||
| 1.14 | 0.32 | 0.21 | 0.57 | 0.47 | ||
The best performer for the relative item is marked in bold.The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
EC and NC of one-to-one alignment results for mmu-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | * | * | ||||
| 1.2 | 0.14 | 0.00 | 0.00 | 0.06 | ||
| 1.3 | 0.61 | * | 0.59 | * | ||
| 1.4 | 0.33 | * | 0.35 | * | ||
| 1.5 | 0.34 | * | 0.30 | * | ||
| 1.6 | 0.55 | 0.46 | 0.68 | 0.63 | ||
| 1.7 | 0.34 | 0.35 | 0.44 | 0.45 | ||
| 1.8 | 0.59 | * | 0.49 | * | ||
| 1.9 | 0.05 | |||||
| 1.1 | 0.54 | 0.00 | 0.55 | 0.00 | ||
| 1.11 | 0.25 | 0.00 | 0.00 | |||
| 1.12 | 0.47 | 0.47 | 0.45 | 0.45 | ||
| 1.13 | 0.67 | 0.66 | 0.75 | 0.74 | ||
| 1.14 | 0.34 | 0.22 | 0.58 | 0.49 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
EC and NC of one-to-one alignment results for mmu-eco.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 0.16 | * | * | |||
| 1.2 | 0.14 | 0.00 | 0.17 | 0.28 | ||
| 1.3 | 0.47 | * | 0.53 | * | ||
| 1.4 | 0.26 | * | 0.34 | * | ||
| 1.5 | 0.32 | * | 0.28 | * | ||
| 1.6 | 0.68 | 0.00 | 0.78 | 0.00 | ||
| 1.7 | 0.24 | 0.21 | 0.33 | 0.34 | ||
| 1.8 | 0.58 | * | 0.55 | * | ||
| 1.9 | 0.06 | |||||
| 1.1 | 0.52 | 0.00 | 0.55 | 0.00 | ||
| 1.11 | 0.33 | 0.00 | 0.00 | |||
| 1.12 | 0.49 | 0.48 | 0.45 | 0.45 | ||
| 1.13 | 0.67 | 0.00 | 0.75 | 0.00 | ||
| 1.14 | 0.73 | 0.69 | 0.80 | 0.79 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
ELCCS of one-to-one alignment results for hsa-eco and hsa-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | 6 | * | * | |||
| 1.2 | 4 | 2 | 1 | 2 | ||
| 1.3 | 735 | * | 728 | * | ||
| 1.4 | 33 | * | 20 | * | ||
| 1.5 | 57 | * | 174 | * | ||
| 1.6 | 515 | 0 | 378 | 392 | ||
| 1.7 | 210 | 175 | 264 | 282 | ||
| 1.8 | * | 45 | * | |||
| 1.9 | 2 | 0 | 1 | 0 | ||
| 1.10 | 134 | 0 | 129 | 0 | ||
| 1.11 | 0 | 3 | 0 | |||
| 1.12 | 2435 | 2499 | 2304 | 2342 | ||
| 1.13 | 301 | 0 | 263 | 275 | ||
| 1.14 | 195 | 215 | 103 | 107 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
C-1tomany and CR-1tomany of one-to-many alignment results for hsa-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | * | * | ||||
| 1.2 | ||||||
| 1.3 | 30 | * | 0.46 | * | ||
| 1.4 | 1 | * | 0.17 | * | ||
| 1.5 | 23 | * | 0.51 | * | ||
| 1.6 | 22 | * | 0.73 | * | ||
| 1.7 | 22 | 8 | 0.37 | 0.21 | ||
| 1.8 | 2 | * | 0.13 | * | ||
| 1.9 | ||||||
| 1.10 | 14 | * | 0.15 | * | ||
| .11 | 1 | 1 | 0.50 | 0.14 | ||
| 1.12 | * | * | * | * | ||
| 1.13 | 13 | 7 | 0.59 | 0.44 | ||
| 1.14 | 5 | 5 | 0.45 | 0.36 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.
C-1tomany and CR-1tomany of one-to-many alignment results for mmu-atc.
| MPBR | CAMPways | SubMAP | MPBR | CAMPways | SubMAP | |
|---|---|---|---|---|---|---|
| 1.1 | * | * | ||||
| 1.2 | ||||||
| 1.3 | 30 | * | 0.45 | * | ||
| 1.4 | 1 | * | 0.17 | * | ||
| 1.5 | 24 | * | 0.52 | * | ||
| 1.6 | 20 | * | 0.67 | * | ||
| 1.7 | 16 | 5 | 0.28 | 0.12 | ||
| 1.8 | 2 | * | 0.14 | * | ||
| 1.9 | ||||||
| 1.10 | 11 | * | 0.15 | * | ||
| 1.11 | 1 | 1 | 0.50 | 0.14 | ||
| 1.12 | * | * | * | * | ||
| 1.13 | 14 | 7 | 0.64 | 0.44 | ||
| 1.14 | 7 | 4 | 0.64 | 0.29 | ||
The best performer for the relative item is marked in bold. The asterisk “*” denotes that the program is unable to generate a result under our current computing environment.