| Literature DB >> 29086112 |
German A Preciat Gonzalez1, Lemmer R P El Assal1, Alberto Noronha1, Ines Thiele1, Hulda S Haraldsdóttir1, Ronan M T Fleming2.
Abstract
The mechanism of each chemical reaction in a metabolic network can be represented as a set of atom mappings, each of which relates an atom in a substrate metabolite to an atom of the same element in a product metabolite. Genome-scale metabolic network reconstructions typically represent biochemistry at the level of reaction stoichiometry. However, a more detailed representation at the underlying level of atom mappings opens the possibility for a broader range of biological, biomedical and biotechnological applications than with stoichiometry alone. Complete manual acquisition of atom mapping data for a genome-scale metabolic network is a laborious process. However, many algorithms exist to predict atom mappings. How do their predictions compare to each other and to manually curated atom mappings? For more than four thousand metabolic reactions in the latest human metabolic reconstruction, Recon 3D, we compared the atom mappings predicted by six atom mapping algorithms. We also compared these predictions to those obtained by manual curation of atom mappings for over five hundred reactions distributed among all top level Enzyme Commission number classes. Five of the evaluated algorithms had similarly high prediction accuracy of over 91% when compared to manually curated atom mapped reactions. On average, the accuracy of the prediction was highest for reactions catalysed by oxidoreductases and lowest for reactions catalysed by ligases. In addition to prediction accuracy, the algorithms were evaluated on their accessibility, their advanced features, such as the ability to identify equivalent atoms, and their ability to map hydrogen atoms. In addition to prediction accuracy, we found that software accessibility and advanced features were fundamental to the selection of an atom mapping algorithm in practice.Entities:
Keywords: Atom mapping; AutoMapper; Automation; CLCA; DREAM; ICMAP; MWED; Metabolic network reconstruction; RDT; Recon 3D
Year: 2017 PMID: 29086112 PMCID: PMC5471289 DOI: 10.1186/s13321-017-0223-1
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1An atom mapping for the enolase reaction. a Enolase (VMH ID: ENO) catalyses the hydrolysis of 2-phosphoglycerate (VMH ID: 2pg) to produce phosphoenolpyruvate (VMH ID: pep) and water (VMH ID: h2o). The atoms of the substrate are assigned with a mapping number that matches only with one atom of the same element in the product molecules; this representation describes the reaction mechanism. b, c A graphical representation of two possible atom mappings for the enolase reaction. Nodes (circles) represent atoms. Atoms can be matched to metabolite structures in (a) on their metabolite identifiers, colours and numbers. Directed edges (arrows) represent atom transitions. All hydrogen atoms are omitted to simplify the figure. Since oxygen atoms 5, and 6 and 9, 10, and 11 are chemically equivalent twelve accurate atom mappings could be predicted for this reaction
Fig. 2Atom mapping predictions for the enolase reaction. All six compared algorithms returned an accurate atom mapping but included different types of additional information. CLCA and MWED identify equivalent atoms in the reactants (blue). DREAM and AutoMapper map hydrogen atoms (yellow). RDT, CLCA, ICMAP and MWED all identify reaction centres (green). Unlike the other three algorithms, MWED does not identify reaction centres by adding information to the bonds that break and form. Instead, it assigns different colours to the molecular substructures (moieties) that break apart or bind together. The atom mapped reactions are visualised with MarvinView (ChemAxon, Budapest, Hungary), which accepts the RXN and SMILES formats as input
Number of evaluated reactions per algorithm
| Algorithm | Number of reactions compared | Unmapped reactions |
|---|---|---|
| RDT | 512 | 0 |
| DREAM | 512 | 0 |
| AutoMapper | 512 | 0 |
| CLCA | 488 | 24 |
| MWED | 477 | 35 |
| ICMAP | 496 | 16 |
Due to limited access to some algorithms, we could not predict atom mappings for all 512 manually curated reactions
Fig. 3Accuracy by reaction types. Percentage of reactions where predicted atom mappings agreed with the manually curated atom mappings. On each bar is shown the number of reactions compared for each algorithm and top level EC number
Comparison of technical features
| Approach | Interface | Availability | Input file formats | Output file formats | |
|---|---|---|---|---|---|
| RDT | Structure-based | Web and desktop application | Free | RXN, SMILES | RXN, SMILES |
| DREAM | Optimisation-based | Web application | Free | RXN, SMILES | RXN |
| MWED | Optimisation-based | Desktop application | Free for academics | RXN, SMILES | SMILES, MetaCyc |
| CLCA | Structure-based | Algorithm | Free | SMILES | SMILES |
| ICMAP | Structure-based | Desktop application | Commercial | RXN | RXN |
| AutoMapper | Structure-based | Desktop application | Free for academics | RXN, SMILES | RXN, SMILES |
Comparison of advanced features
| Equivalent atoms | Hydrogen atoms | Reaction centres | Maps all atoms | Maps R groups | Stereo-chemistry | Maps unbalanced reactions | |
|---|---|---|---|---|---|---|---|
| RDT |
|
|
|
|
|
|
|
| DREAM |
|
|
|
|
|
|
|
| MWED |
|
|
|
|
|
|
|
| CLCA |
|
|
|
|
|
|
|
| ICMAP |
|
|
|
|
|
|
|
| AutoMapper 5.0.1 |
|
|
|
|
|
|
|
CLCA can only map hydrogen atoms for reactions with fully protonated molecules
Fig. 4Coverage of metabolites and reactions in Recon 3D. a Coverage of unique metabolites structure data. b Coverage of reaction atom mapping data
Fig. 5An incorrect reaction mechanism predicted by five algorithms. Alanine-glyoxylate transaminase (VMH ID: AGTim) reaction catalyses the chemical transformation of L-alanine (VMH ID: ala_L) and glyoxylate (VMH ID: glx) into pyruvate (VMH ID: pyr) and glycine (VMH ID: gly). The known reaction mechanism of the alanine-glyoxylate transaminase reaction is represented by the manual atom mapping (top). Five algorithms predicted the same incorrect atom mapping for this reaction (bottom) [43]. Only the MWED algorithm predicted correctly
Fig. 6Incorrect addition–elimination mechanism predicted by DREAM. Acetylcholinesterase (VMH ID: HMR_0641) reaction catalyses the breakdown of acetylcholine (VMH ID: ach) and water (VMH ID: h2o) to form acetate (VMH ID: ac) and choline (VMH ID: chol). The predicted mechanism (bottom) for the acetylcholinesterase reaction does not correspond to the mechanism described by the EC number (top). The (C3–O5) bond is broken and the (C3–O1) bond is formed. However, DREAM predicts that the (C6–O5) bond is broken, followed by formation of the (C6–O1) bond [43]
Fig. 7Chemically equivalent atoms. Four molecules with chemically equivalent atoms (coloured backgrounds). a Molecular oxygen (VMH ID: o2). b Methyl phosphate where all three highlighted oxygen atoms are chemically equivalent through resonance. MWED, but not CLCA, can identify the highlighted atoms as being chemically equivalent. c 1-Amino-1,1-ethanedio. d 1,3-Diaminopropane (VMH ID: 13dampp), which shows that chemically equivalent atoms are not necessarily connected to a shared atom. CLCA, but not MWED, can identify the highlighted atoms as being chemically equivalent
Fig. 8Unmapped atoms. Occasionally, ICMAP leaves some atoms unmapped. In this ICMAP prediction (VMH ID: MVLACc), the oxygen atoms in Mevalonate (VMH ID: mev_R) that are indicated in blue, may map to the water molecule (VMH ID: h2o) or the 4-hydroxy-4-methyl-2-oxanone molecule (VMH ID: mvlac)
Fig. 9Unbalanced thyroid peroxidase reaction. The thyroid peroxidase reaction (VMH ID: THYPX) catalises the hydrogen peroxide molecules (VMH ID: h2o2) and two hydrogen iodides (VMH ID: i) into two water molecules (VMH ID: h2o) and two molecular iodines (VMH ID: iodine). The reaction can be balanced by adding the unmapped molecular iodine product atoms (blue background) on the left hand side