Literature DB >> 26527720

MetaNetX/MNXref--reconciliation of metabolites and biochemical reactions to bring together genome-scale metabolic networks.

Sébastien Moretti1, Olivier Martin2, T Van Du Tran2, Alan Bridge3, Anne Morgat4, Marco Pagni5.   

Abstract

MetaNetX is a repository of genome-scale metabolic networks (GSMNs) and biochemical pathways from a number of major resources imported into a common namespace of chemical compounds, reactions, cellular compartments--namely MNXref--and proteins. The MetaNetX.org website (http://www.metanetx.org/) provides access to these integrated data as well as a variety of tools that allow users to import their own GSMNs, map them to the MNXref reconciliation, and manipulate, compare, analyze, simulate (using flux balance analysis) and export the resulting GSMNs. MNXref and MetaNetX are regularly updated and freely available.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2015        PMID: 26527720      PMCID: PMC4702813          DOI: 10.1093/nar/gkv1117

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

A genome-scale metabolic network (GSMN), or stoichiometric model, describes the set of biochemical reactions which may occur in a given organism, as well as the requisite enzymes, and may also include information on sub-cellular compartments, transport reactions and transporters. By design GSMNs are focused on the metabolism of small molecular weight compounds when energy and mass conservation law can be applied, and are not suited to represent gene regulation or signaling pathways. In practice, a GSMN has a double purpose, as it is both a repository of knowledge about an organism's metabolism, and a model that can be simulated, using flux balance analysis (FBA). Such simulations can address different questions: (i) establish the essentiality of genes in specific growth conditions; (ii) reveal opportunities for metabolic engineering and optimization; (iii) suggest new drug targets (1). To permit simulations, a GSMN usually includes artificial reactions that describe the growth medium, a growth equation (which implies the composition of the biomass) and possibly hypothetical reactions not (yet) supported by experimental biology but required to make a model functional. A relatively small number of high quality GSMNs have been published to date, essentially for model organisms, and are made available by a few dedicated databases (2–6). The development of such models requires significant human effort and curation, and the fully automated reconstruction of a GSMN from an annotated genome sequence remains a challenge (7,8). Such methods require the integration of high quality curated data covering the known biochemistry of a vast range of organisms, as well as methods that address the specific requirements of a functional GSMN, including the elemental balancing of individual reactions. These considerations form the major motivation for the development of the resource presented here.

MNXREF RECONCILIATION

The metabolite identifiers found in the early-published GSMNs were often specific to the individual groups developing and curating them, and did not generally reference the major databases of chemical compounds. In recent years there have been a few attempts to ‘reconcile’ the different nomenclatures of these compounds (9,10) including our own effort MNXref (11). The principles of the reconciliation algorithm used in MNXref can be summarized as follows: Reconciliation of common metabolites based on chemical structures; Reconciliation of metabolites through shared chemical nomenclature; Reconciliation of reactions through shared metabolites; Identification of candidate reactions for reconciliation through shared cross-references; Iterative reconciliation of metabolites through reaction context. Figure 1 illustrates the reconciliation process using malonyl-CoA as an example; individual steps in the reconciliation are color-coded according to the type of evidence used. Table 1 summarizes the overlaps between the various sources of biochemical data and GSMNs according to the results of the MNXref reconciliation. The MNXref namespace is regularly updated with metabolite and reaction data from new resources; recent additions include the EAWAG-BBD/UMBBD pathway database (12).
Figure 1.

Evidences used to reconcile different chemical compounds for the metabolite malonyl-CoA (MNXM40 in MNXref): magenta, using structure supplied by the source databases; red, using recomputed structure; orange, recomputed structure protonated at pH 7.3; yellow, recomputed structure protonated at pH 7.3 but ignoring the stereo layer of the InChI representation; green, using the cross-references supplied by the source databases; dark blue, based on compound primary names; light blue, based on compound synonym names. Triangle: in a post-processing step, chebi:57384 was chosen to best represent the targeted metabolite.

Table 1.

Numbers of reconciled metabolites (a) and reactions (b) in MNXref 2.0, and mapped proteins (c), found in common between published GSMNs and major biochemical databases in MetaNetX.org

MNXrefBiGG (2) 18 GSMNsBioCyc (3) 19 GSMNsPath2Models (4) 132 GSMNsThe SEED (18) 50 GSMNsYeastNet (6) 1 GSMN
(a) Metabolites
BiGGa (2) (version 2beta)403934142610283618291021
BioPath (19) (2010–05–03)1313649875943567427
ChEBI (20) (version 131)46 477850715 63117 97371084416
HMDB (21) (version 3.6)42 5421292252530441054714
KEGG (22) (version 75.1)28 4291958394553561560908
LIPIDMAPS (23) (2015–06–28)40 71941213821587280252
MetaCyc (3) (version 19.1)15 4721835538056371399826
Reactome (24) (2015–07–13)457617992539277014671521
The SEEDa (18) (2013–06–19)16 2802040312040981551678
UMBBD–EAWAG (12) (2014–06–30)139520634758815067
UniPathway (25) (version 2015_03)1113692874928657393
(b) Reactions
BiGGa (2) (version 2beta)11 45860553380258018761730
BioPath (19) (2010–05–03)1545456684725328285
KEGG (22) (version 75.1)9925133530854309877528
MetaCyc (3) (version 19.1)13 793141950404220828549
Reactome (24) (2015–07–13)23 59241115848514728492604
Rhea (16) (version 64)32 256510110 60310 29331902050
The SEEDa (18) (2013–06–19)13 2603069298033371738932
UniPathway (25) (version 2015_03)1994106514351471836559
(c) Proteins
UniProt (15) (version 2015_08)11 14215 67076 29327 154912

aBiGG and The SEED distribute collections of metabolites and reactions that are not necessarily retrieved in one of their GSMN.

Evidences used to reconcile different chemical compounds for the metabolite malonyl-CoA (MNXM40 in MNXref): magenta, using structure supplied by the source databases; red, using recomputed structure; orange, recomputed structure protonated at pH 7.3; yellow, recomputed structure protonated at pH 7.3 but ignoring the stereo layer of the InChI representation; green, using the cross-references supplied by the source databases; dark blue, based on compound primary names; light blue, based on compound synonym names. Triangle: in a post-processing step, chebi:57384 was chosen to best represent the targeted metabolite. aBiGG and The SEED distribute collections of metabolites and reactions that are not necessarily retrieved in one of their GSMN. In the construction and use of GSMNs, every reaction must be balanced with respect to elemental composition and charge; failure to balance reactions will lead to violations in mass conservation that can have detrimental effects on the downstream simulations. The case of protons is worthy of particular attention in this regard. Protons provide a means to balance chemical equations occurring in aqueous solution, but they are also responsible for creating membrane potentials whose dissipation is a major driving force in cell metabolism. In order to distinguish these two roles we have introduced separate identifiers for those protons transported across a membrane (MNXM01 in MNXref), and those protons introduced for the purposes of balancing a reaction (MNXM1 in MNXref). An artificial spontaneous reaction is then added to every compartment of the GSMN to permit the free exchange between transported and balanced protons (MNXR01 in MNXref). In this way, the original properties of the GSMN are preserved.

METANETX REPOSITORY AND TOOLS

MetaNetX.org (13) is a website that provides free access to the MNXref reconciliation data and a collection of published GSMNs and biochemical pathways mapped onto MNXref. The website also allows users to upload, manipulate, analyze or modify their own GSMNs and export them in SBML or in our own tab-delimited format. MetaNetX.org also offers a selection of tools for analyses including network structure, FBA or nested pattern methods (14). Gene names have been widely used in published GSMNs to describe the protein complexes that act as enzymes or transporters. Gene nomenclature is, however, essentially organism specific, if not dependent on a particular genome assembly. In MetaNetX we use UniProt accession numbers (15) to identify gene products: it greatly facilitates the inter organisms comparison of GSMNs from different sources. Although the MNXref reconciliation algorithm is essentially automated, the compilation of the MetaNetX repository requires some manual intervention and a certain number of editorial choices. This includes definition of an accepted list of species and strains that includes important model organisms. Preference is given to the most comprehensive GSMNs from external sources that use accepted standard formats and have sufficient protein coverage (full acknowledgement is given to these external sources). We are closely collaborating with Rhea (16), which is a database of manually curated biochemical reactions, as part of the ongoing effort to further improve the quality of annotation of our resource.

CONCLUSION

The www.metanetx.org resource provides a comprehensive suite of tools for the analysis of genome-scale metabolic models, based on a single integrated namespace of metabolites and metabolic reactions that integrates the most widely used biochemical databases and model repositories – MNXref. The reconciliation process used in MNXref greatly simplifies the development and analysis of genome-scale metabolic models, allowing users to concentrate on model analysis rather than the time-consuming problem of identifier mapping. Future developments will include the provision of tools and the integration of new resources such as the SwissLipids knowledgebase (17), which provides lipid structures and curated data on enzymatic reactions.
  25 in total

1.  Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology.

Authors:  Peter D Karp; Suzanne M Paley; Markus Krummenacker; Mario Latendresse; Joseph M Dale; Thomas J Lee; Pallavi Kaipa; Fred Gilham; Aaron Spaulding; Liviu Popescu; Tomer Altman; Ian Paulsen; Ingrid M Keseler; Ron Caspi
Journal:  Brief Bioinform       Date:  2009-12-02       Impact factor: 11.622

Review 2.  Constraint-based models predict metabolic and associated cellular functions.

Authors:  Aarash Bordbar; Jonathan M Monk; Zachary A King; Bernhard O Palsson
Journal:  Nat Rev Genet       Date:  2014-01-16       Impact factor: 53.242

3.  HMDB 3.0--The Human Metabolome Database in 2013.

Authors:  David S Wishart; Timothy Jewison; An Chi Guo; Michael Wilson; Craig Knox; Yifeng Liu; Yannick Djoumbou; Rupasri Mandal; Farid Aziat; Edison Dong; Souhaila Bouatra; Igor Sinelnikov; David Arndt; Jianguo Xia; Philip Liu; Faizath Yallou; Trent Bjorndahl; Rolando Perez-Pineiro; Roman Eisner; Felicity Allen; Vanessa Neveu; Russ Greiner; Augustin Scalbert
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

4.  MetRxn: a knowledgebase of metabolites and reactions spanning metabolic models and databases.

Authors:  Akhil Kumar; Patrick F Suthers; Costas D Maranas
Journal:  BMC Bioinformatics       Date:  2012-01-10       Impact factor: 3.169

5.  UniPathway: a resource for the exploration and annotation of metabolic pathways.

Authors:  Anne Morgat; Eric Coissac; Elisabeth Coudert; Kristian B Axelsen; Guillaume Keller; Amos Bairoch; Alan Bridge; Lydie Bougueleret; Ioannis Xenarios; Alain Viari
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

6.  The SwissLipids knowledgebase for lipid biology.

Authors:  Lucila Aimo; Robin Liechti; Nevila Hyka-Nouspikel; Anne Niknejad; Anne Gleizes; Lou Götz; Dmitry Kuznetsov; Fabrice P A David; F Gisou van der Goot; Howard Riezman; Lydie Bougueleret; Ioannis Xenarios; Alan Bridge
Journal:  Bioinformatics       Date:  2015-05-05       Impact factor: 6.937

7.  The RAST Server: rapid annotations using subsystems technology.

Authors:  Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal:  BMC Genomics       Date:  2008-02-08       Impact factor: 3.969

8.  The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).

Authors:  Ross Overbeek; Robert Olson; Gordon D Pusch; Gary J Olsen; James J Davis; Terry Disz; Robert A Edwards; Svetlana Gerdes; Bruce Parrello; Maulik Shukla; Veronika Vonstein; Alice R Wattam; Fangfang Xia; Rick Stevens
Journal:  Nucleic Acids Res       Date:  2013-11-29       Impact factor: 16.971

9.  The Reactome pathway knowledgebase.

Authors:  David Croft; Antonio Fabregat Mundo; Robin Haw; Marija Milacic; Joel Weiser; Guanming Wu; Michael Caudy; Phani Garapati; Marc Gillespie; Maulik R Kamdar; Bijay Jassal; Steven Jupe; Lisa Matthews; Bruce May; Stanislav Palatnik; Karen Rothfels; Veronica Shamovsky; Heeyeon Song; Mark Williams; Ewan Birney; Henning Hermjakob; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2013-11-15       Impact factor: 16.971

Review 10.  Reconciliation of metabolites and biochemical reactions for metabolic networks.

Authors:  Thomas Bernard; Alan Bridge; Anne Morgat; Sébastien Moretti; Ioannis Xenarios; Marco Pagni
Journal:  Brief Bioinform       Date:  2012-11-19       Impact factor: 11.622

View more
  54 in total

1.  biochem4j: Integrated and extensible biochemical knowledge through graph databases.

Authors:  Neil Swainston; Riza Batista-Navarro; Pablo Carbonell; Paul D Dobson; Mark Dunstan; Adrian J Jervis; Maria Vinaixa; Alan R Williams; Sophia Ananiadou; Jean-Loup Faulon; Pedro Mendes; Douglas B Kell; Nigel S Scrutton; Rainer Breitling
Journal:  PLoS One       Date:  2017-07-14       Impact factor: 3.240

2.  Analysis of human metabolism by reducing the complexity of the genome-scale models using redHUMAN.

Authors:  Maria Masid; Meric Ataman; Vassily Hatzimanikatis
Journal:  Nat Commun       Date:  2020-06-04       Impact factor: 14.919

3.  Framework and resource for more than 11,000 gene-transcript-protein-reaction associations in human metabolism.

Authors:  Jae Yong Ryu; Hyun Uk Kim; Sang Yup Lee
Journal:  Proc Natl Acad Sci U S A       Date:  2017-10-24       Impact factor: 11.205

4.  Reconstruction of Genome-Scale Metabolic Model for Hansenula polymorpha Using RAVEN.

Authors:  Francisco Zorrilla; Eduard J Kerkhoven
Journal:  Methods Mol Biol       Date:  2022

Review 5.  Path to improving the life cycle and quality of genome-scale models of metabolism.

Authors:  Yara Seif; Bernhard Ørn Palsson
Journal:  Cell Syst       Date:  2021-09-22       Impact factor: 11.091

6.  Condition-specific series of metabolic sub-networks and its application for gene set enrichment analysis.

Authors:  Van Du T Tran; Sébastien Moretti; Alix T Coste; Sara Amorim-Vaz; Dominique Sanglard; Marco Pagni
Journal:  Bioinformatics       Date:  2019-07-01       Impact factor: 6.937

7.  An atlas of human metabolism.

Authors:  Jonathan L Robinson; Pınar Kocabaş; Hao Wang; Pierre-Etienne Cholley; Daniel Cook; Avlant Nilsson; Mihail Anton; Raphael Ferreira; Iván Domenzain; Virinchi Billa; Angelo Limeta; Alex Hedin; Johan Gustafsson; Eduard J Kerkhoven; L Thomas Svensson; Bernhard O Palsson; Adil Mardinoglu; Lena Hansson; Mathias Uhlén; Jens Nielsen
Journal:  Sci Signal       Date:  2020-03-24       Impact factor: 8.192

8.  COMMIT: Consideration of metabolite leakage and community composition improves microbial community reconstructions.

Authors:  Philipp Wendering; Zoran Nikoloski
Journal:  PLoS Comput Biol       Date:  2022-03-23       Impact factor: 4.475

9.  Compartment and hub definitions tune metabolic networks for metabolomic interpretations.

Authors:  T Cameron Waller; Jordan A Berg; Alexander Lex; Brian E Chapman; Jared Rutter
Journal:  Gigascience       Date:  2020-01-01       Impact factor: 6.524

10.  Genome-scale metabolic network reconstruction of model animals as a platform for translational research.

Authors:  Hao Wang; Jonathan L Robinson; Pinar Kocabas; Johan Gustafsson; Mihail Anton; Pierre-Etienne Cholley; Shan Huang; Johan Gobom; Thomas Svensson; Mattias Uhlen; Henrik Zetterberg; Jens Nielsen
Journal:  Proc Natl Acad Sci U S A       Date:  2021-07-27       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.