| Literature DB >> 26615025 |
Thomas Pfau, Maria Pires Pacheco, Thomas Sauter.
Abstract
Genome-scale metabolic network reconstructions provide a basis for the investigation of the metabolic properties of an organism. There are reconstructions available for multiple organisms, from prokaryotes to higher organisms and methods for the analysis of a reconstruction. One example is the use of flux balance analysis to improve the yields of a target chemical, which has been applied successfully. However, comparison of results between existing reconstructions and models presents a challenge because of the heterogeneity of the available reconstructions, for example, of standards for presenting gene-protein-reaction associations, nomenclature of metabolites and reactions or selection of protonation states. The lack of comparability for gene identifiers or model-specific reactions without annotated evidence often leads to the creation of a new model from scratch, as data cannot be properly matched otherwise. In this contribution, we propose to improve the predictive power of metabolic models by switching from gene-protein-reaction associations to transcript-isoform-reaction associations, thus taking advantage of the improvement of precision in gene expression measurements. To achieve this precision, we discuss available databases that can be used to retrieve this type of information and point at issues that can arise from their neglect. Further, we stress issues that arise from non-standardized building pipelines, like inconsistencies in protonation states. In addition, problems arising from the use of non-specific cofactors, e.g. artificial futile cycles, are discussed, and finally efforts of the metabolic modelling community to unify model reconstructions are highlighted.Entities:
Keywords: gene-protein-reaction association; metabolic network reconstruction; unification
Mesh:
Year: 2015 PMID: 26615025 PMCID: PMC5142010 DOI: 10.1093/bib/bbv100
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
Different formats for the exchange of metabolic models
| Model style | Description | Advantages//Disadvantages | Examples |
|---|---|---|---|
| SBML/COBRA | SBML with additional information in the notes sections of entries [ | Models are usable in any SBML capable tool, but the additional information needs explicit parsers. Tool independent.//There is no clear definition of used fields in the SBML format, and different groups use multiple different data fields. | BiGG models [ |
| SBML/Mod | SBML using ‘ModifierSpecies' to define GPRs | Models are usable in any SBML capable tool. Genes can be linked to multiple sources. Proteins can be encoded and linked explicitly. Tool independent.//Needs parsers that make use of these properties. Lacks a defined standard how ‘ModifierSpecies' have to be defined. | HMR [ |
| SBML/FBC | SBML with FBC extension for FBA-specific information [ | Uses SBML defined fields (from the FBC extension) to provide FBA-specific information. Proteins can be encoded (and identified) explicitly. Tool independent.//FBC extension not yet processed by many tools. | BiGG2 Database ( |
| Toolbox-specific formats | Formats specific to one modelling tool, e.g. COBRA MATLAB files [ | Files can directly be used in the respective toolbox and can contain additional information.//Not easily loaded into other tools. | Recon2 [ |
| Spread sheets | Commonly multiple sheets or files with compounds, reactions and genes | Easily accessible for non-computational users. Tool independent.//Difficult to parse for further analysis, because of the lack of a standard format. | HepatoNet [ |
Note. Annotation of the SBML is either achieved by COBRA notes fields (e.g. for Database links), or using BQ and the annotation class of SBML. Both types have been used in combination with SBML/Mod and SBML/COBRA, even though commonly SBML/COBRA models do not include BQ annotations, as they rely on the COBRA annotations.
Figure 1Alternative splice forms are created by removal and addition of exons during the splicing process. This example shows two the alternate splice forms i1 (depicted in black) and i2 (depicted in red) of a human glucuronosyltransferase (UGT1A). The main isoforme, i1, is implicated in the metabolism and excretion of toxic compounds, e.g. drugs like codeine, while isoform i2 inhibits the activity of the main isoform. A colour version of this figure is available online at BIB online: https://academic.oup.com/bib.
Databases for transcript-specific genome annotations of multiple species
| Name | Species | Method of annotation | Reference | Link |
|---|---|---|---|---|
| GENCODE | Human and mouse | Manual and automated | [ | |
| ASPicDB | Human | Automated | [ | |
| Vega | Human, zebrafish, pig, mouse and rat | Manual annotation | [ | |
| H-DBAS | human, mouse, rat, chimpanzee, macaque and dog | Manual | [ | |
| SASD | Human | Prediction | [ | |
| ASIP | Plants | Automated | [ |
Databases aiming at providing functional metabolic models that are directly comparable
| Resource | Unification | Description |
|---|---|---|
| BiGG [ | SBML/COBRA | Database containing multiple genome-scale metabolic networks in the COBRA format. |
| BiGG2 ( | SBML/COBRA, SBML/FBC | Update to BiGG, currently in a beta version, providing multiple models annotated using FBC. |
| MetaCyc [ | SBML/COBRA, biocyc flat files | Large collection of metabolic reconstructions. Flat File format contains additional details not included in the provided SBMLs. |
| SEED [ | SEED IDs, Partial SBML/COBRA format | System for construction of metabolic reconstructions and analysis. Export of reconstructions is available in SBML format (with minimal annotations) and Excel sheets. |
| MetaNetX [ | MNXRef IDs, SBML/COBRA, bioql information for metabolites | Repository of unified metabolic reconstructions linking to multiple external databases. Offers tools for network analysis and modifications. SBML files contain additional yeast-style annotations for species. |
| MetRxn [ | MetRxn ID, SBML/COBRA | Database matching multiple metabolite and reaction databases aiming at providing a curated basis for network reconstruction. |