Literature DB >> 19561017

libAnnotationSBML: a library for exploiting SBML annotations.

Neil Swainston1, Pedro Mendes.   

Abstract

SUMMARY: The Systems Biology Markup Language (SBML) is an established community XML format for the markup of biochemical models. With the introduction of SBML level 2 version 3, specific model entities, such as species or reactions, can now be annotated using ontological terms. These annotations, which are encoded using the resource description framework (RDF), provide the facility to specify definite terms to individual components, allowing software to unambiguously identify such components and thus link the models to existing data resources. libSBML is an application programming interface library for the manipulation of SBML files. While libSBML provides the facilities for reading and writing such annotations from and to models, it is beyond the scope of libSBML to provide interpretation of these terms. The libAnnotationSBML library introduced here acts as a layer on top of libSBML linking SBML annotations to the web services that describe these ontological terms. Two applications that use this library are described: SbmlSynonymExtractor finds name synonyms of SBML model entities and SbmlReactionBalancer checks SBML files to determine whether specifed reactions are elementally balanced.

Entities:  

Mesh:

Year:  2009        PMID: 19561017      PMCID: PMC2734318          DOI: 10.1093/bioinformatics/btp392

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

The minimum information requested in the annotation of biochemical models (MIRIAM; Le Novère et al., 2005) defines guidelines for annotation of biochemical models. The annotation of models with the MIRIAM standard provides a number of significant advantages in the development of computational tools and applications that can reason over them (Kell and Mendes, 2008). An example is the task of comparing or merging two biochemical models. Before the introduction of MIRIAM, individual components of SBML models (Hucka et al., 2003) were identified solely by free-text, human-readable, name attributes, often resulting in equivalent components being named differently in different models. As naming conventions are non-standard, it is impossible to definitively match these components computationally, and the process of model merging then requires human input to resolve ambiguities. Providing MIRIAM-compliant annotations allows a component to be unambiguously identified by associating it with one or more terms from publicly available databases such as ChEBI (Degtyarenko et al., 2008) or KEGG (Kanehisa et al., 2000) (Fig. 1).
Fig. 1.

Simplified example of MIRIAM-compliant SBML species elements, annotated with ChEBI and KEGG terms, respectively.

Simplified example of MIRIAM-compliant SBML species elements, annotated with ChEBI and KEGG terms, respectively.

2 FEATURES

The species elements in Figure 1 are both annotated with MIRIAM-compliant terms. libSBML (Bornstein et al., 2008) provides the facility for reading a given SBML element's annotation and hence could be used to determine that species1 and species2 are annotated with ChEBI term CHEBI:4167 and KEGG Compound C00031, respectively. From this, it may be concluded that the compounds represented by these species are different. However, manual inspection of the database references in ChEBI and KEGG show that both species are annotated with references that share the same chemical structure, and hence are equivalent. Performing such a comparison computationally is beyond the scope of libSBML. To do so, the annotations must be ‘dereferenced’ by querying the two databases via their web service interfaces. This task is complicated particularly because each of the web services has non-standard interfaces. The libAnnotationSBML library creates a unified framework for supporting MIRIAM-compliant annotations by wrapping these divergent web services into a Java API, allowing each web service to be queried in a consistent manner. The library itself can act as a layer on top of the libSBML API. The library is built dynamically by querying the MIRIAM web service (Laibe and Le Novère, 2007), which provides a collection of data types that are recommended for use in model annotation. The web service provides details of each of these data types including names, URNs and physical URLs to resources. From this, a collection of Ontology objects are instantiated, one for each data type supported specified in MIRIAM. Individual OntologyTerms objects are built up from an Ontology object and a unique identifier. Once instantiated, the OntologyTerm provides a number of methods, specified in Figure 2. The implementation of these methods is performed by mapping the calls to an appropriate call to the data type's web service, where such a web service exists.
Fig. 2.

Class diagram showing public methods of OntologyTerm and specialized subclasses ChebiTerm, UniProtTerm and KeggReactionTerm.

Class diagram showing public methods of OntologyTerm and specialized subclasses ChebiTerm, UniProtTerm and KeggReactionTerm. The OntologyTerm class can be extended to provide methods specific to the SBML element that is being described. For example, a metabolite species element annotated with a ChEBI term will return a ChebiTerm object, providing a method for accessing the chemical formula of the metabolite. Similarly, a KEGG Reaction annotation will return a KeggReactionTerm object, providing methods for accessing reactants and products, each returned as OntologyTerms themselves. Applying libAnnotationSBML to the SBML in Figure 1 will associate an OntologyTerm with each of the species. Calling getName() on these ChEBI and KEGG OntologyTerm objects returns ‘d-glucopyranose’ and ‘d-glucose’, respectively. Considering the initial problem of comparing SBML components, this provides an example of why names cannot be used reliably to perform this task. A more reliable approach is to exploit the fact that many data resources cross-reference one another. For example, entries in the ChEBI database can provide details of the equivalent term in KEGG, and vice versa. The OntologyTerm class supports this by implementing a getXrefs() method which returns cross references themselves as OntologyTerms, along with a predicate, defined in libSBML, that indicates the relationship between them. When an OntologyTerm references an equivalent entity in a different database, the predicate libsbmlConstants.BQB_IS is returned. In the case of a genomic database entry cross referencing an entry in a proteomic database, libsbmlConstants.BQB_ENCODES is used. Utilizing this method, it can be determined computationally that the ChEBI and KEGG terms cross-reference one another, and hence species1 and species2 can be unambiguously determined to represent equivalent entities. The libAnnotationSBML library facilitates the rapid development of tools to manipulate SBML annotation terms. The library can be used to add annotation to unannotated SBML models, using a similar approach to semanticSBML (Schulz et al., 2006). libAnnotationSBML can annotate both metabolites and proteins, exploiting the search facility that exists in both the ChEBI and UniProt web services (The UniProt Consortium, 2008). The focus of libAnnotationSBML is to develop tools to manipulate already annotated models. An example of such a tool is the SbmlSynonymExtractor, which takes annotated SBML as input, and returns a mapping of all species terms to their name synonyms, harvested from ChEBI, KEGG or UniProt. Another tool, the SbmlReactionBalancer, determines whether the reactions specified within an SBML file are elementally balanced by querying the ChEBI web service to retrieve chemical formulae of reaction participants. libAnnotationSBML was used extensively in the development of a genome-scale model of yeast metabolism, the first model of this scale in which all compartments, metabolites, enzymes and complexes are unambiguously defined using MIRIAM-compliant annotations (Herrgård et al., 2008).

3 IMPLEMENTATION AND DISTRIBUTION

The API is written in Java 1.5 and is dependent upon libSBML v3. It is supported in Linux, Windows and MacOS X and is distributed as source code and associated build files under the open source Academic Free Licence v3.0 from http://mcisb.sf.net/ along with other tools described in this manuscript.
  10 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models.

Authors:  M Hucka; A Finney; H M Sauro; H Bolouri; J C Doyle; H Kitano; A P Arkin; B J Bornstein; D Bray; A Cornish-Bowden; A A Cuellar; S Dronov; E D Gilles; M Ginkel; V Gor; I I Goryanin; W J Hedley; T C Hodgman; J-H Hofmeyr; P J Hunter; N S Juty; J L Kasberger; A Kremling; U Kummer; N Le Novère; L M Loew; D Lucio; P Mendes; E Minch; E D Mjolsness; Y Nakayama; M R Nelson; P F Nielsen; T Sakurada; J C Schaff; B E Shapiro; T S Shimizu; H D Spence; J Stelling; K Takahashi; M Tomita; J Wagner; J Wang
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

3.  Minimum information requested in the annotation of biochemical models (MIRIAM).

Authors:  Nicolas Le Novère; Andrew Finney; Michael Hucka; Upinder S Bhalla; Fabien Campagne; Julio Collado-Vides; Edmund J Crampin; Matt Halstead; Edda Klipp; Pedro Mendes; Poul Nielsen; Herbert Sauro; Bruce Shapiro; Jacky L Snoep; Hugh D Spence; Barry L Wanner
Journal:  Nat Biotechnol       Date:  2005-12       Impact factor: 54.908

Review 4.  The markup is the model: reasoning about systems biology models in the Semantic Web era.

Authors:  Douglas B Kell; Pedro Mendes
Journal:  J Theor Biol       Date:  2007-12-03       Impact factor: 2.691

Review 5.  SBMLmerge, a system for combining biochemical network models.

Authors:  Marvin Schulz; Jannis Uhlendorf; Edda Klipp; Wolfram Liebermeister
Journal:  Genome Inform       Date:  2006

6.  LibSBML: an API library for SBML.

Authors:  Benjamin J Bornstein; Sarah M Keating; Akiya Jouraku; Michael Hucka
Journal:  Bioinformatics       Date:  2008-02-05       Impact factor: 6.937

7.  A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.

Authors:  Markus J Herrgård; Neil Swainston; Paul Dobson; Warwick B Dunn; K Yalçin Arga; Mikko Arvas; Nils Blüthgen; Simon Borger; Roeland Costenoble; Matthias Heinemann; Michael Hucka; Nicolas Le Novère; Peter Li; Wolfram Liebermeister; Monica L Mo; Ana Paula Oliveira; Dina Petranovic; Stephen Pettifer; Evangelos Simeonidis; Kieran Smallbone; Irena Spasić; Dieter Weichart; Roger Brent; David S Broomhead; Hans V Westerhoff; Betül Kirdar; Merja Penttilä; Edda Klipp; Bernhard Ø Palsson; Uwe Sauer; Stephen G Oliver; Pedro Mendes; Jens Nielsen; Douglas B Kell
Journal:  Nat Biotechnol       Date:  2008-10       Impact factor: 54.908

8.  ChEBI: a database and ontology for chemical entities of biological interest.

Authors:  Kirill Degtyarenko; Paula de Matos; Marcus Ennis; Janna Hastings; Martin Zbinden; Alan McNaught; Rafael Alcántara; Michael Darsow; Mickaël Guedj; Michael Ashburner
Journal:  Nucleic Acids Res       Date:  2007-10-11       Impact factor: 16.971

9.  The universal protein resource (UniProt).

Authors: 
Journal:  Nucleic Acids Res       Date:  2007-11-27       Impact factor: 16.971

10.  MIRIAM Resources: tools to generate and resolve robust cross-references in Systems Biology.

Authors:  Camille Laibe; Nicolas Le Novère
Journal:  BMC Syst Biol       Date:  2007-12-13
  10 in total
  15 in total

1.  biochem4j: Integrated and extensible biochemical knowledge through graph databases.

Authors:  Neil Swainston; Riza Batista-Navarro; Pablo Carbonell; Paul D Dobson; Mark Dunstan; Adrian J Jervis; Maria Vinaixa; Alan R Williams; Sophia Ananiadou; Jean-Loup Faulon; Pedro Mendes; Douglas B Kell; Nigel S Scrutton; Rainer Breitling
Journal:  PLoS One       Date:  2017-07-14       Impact factor: 3.240

2.  Data Management in Computational Systems Biology: Exploring Standards, Tools, Databases, and Packaging Best Practices.

Authors:  Natalie J Stanford; Martin Scharm; Paul D Dobson; Martin Golebiewski; Michael Hucka; Varun B Kothamachu; David Nickerson; Stuart Owen; Jürgen Pahle; Ulrike Wittig; Dagmar Waltemath; Carole Goble; Pedro Mendes; Jacky Snoep
Journal:  Methods Mol Biol       Date:  2019

3.  A community-driven global reconstruction of human metabolism.

Authors:  Ines Thiele; Neil Swainston; Ronan M T Fleming; Andreas Hoppe; Swagatika Sahoo; Maike K Aurich; Hulda Haraldsdottir; Monica L Mo; Ottar Rolfsson; Miranda D Stobbe; Stefan G Thorleifsson; Rasmus Agren; Christian Bölling; Sergio Bordel; Arvind K Chavali; Paul Dobson; Warwick B Dunn; Lukas Endler; David Hala; Michael Hucka; Duncan Hull; Daniel Jameson; Neema Jamshidi; Jon J Jonsson; Nick Juty; Sarah Keating; Intawat Nookaew; Nicolas Le Novère; Naglis Malys; Alexander Mazein; Jason A Papin; Nathan D Price; Evgeni Selkov; Martin I Sigurdsson; Evangelos Simeonidis; Nikolaus Sonnenschein; Kieran Smallbone; Anatoly Sorokin; Johannes H G M van Beek; Dieter Weichart; Igor Goryanin; Jens Nielsen; Hans V Westerhoff; Douglas B Kell; Pedro Mendes; Bernhard Ø Palsson
Journal:  Nat Biotechnol       Date:  2013-03-03       Impact factor: 54.908

4.  BioModels Database: An enhanced, curated and annotated resource for published quantitative kinetic models.

Authors:  Chen Li; Marco Donizelli; Nicolas Rodriguez; Harish Dharuri; Lukas Endler; Vijayalakshmi Chelliah; Lu Li; Enuo He; Arnaud Henry; Melanie I Stefan; Jacky L Snoep; Michael Hucka; Nicolas Le Novère; Camille Laibe
Journal:  BMC Syst Biol       Date:  2010-06-29

5.  Annotation of rule-based models with formal semantics to enable creation, analysis, reuse and visualization.

Authors:  Goksel Misirli; Matteo Cavaliere; William Waites; Matthew Pocock; Curtis Madsen; Owen Gilfellon; Ricardo Honorato-Zimmer; Paolo Zuliani; Vincent Danos; Anil Wipat
Journal:  Bioinformatics       Date:  2015-11-11       Impact factor: 6.937

6.  A model of yeast glycolysis based on a consistent kinetic characterisation of all its enzymes.

Authors:  Kieran Smallbone; Hanan L Messiha; Kathleen M Carroll; Catherine L Winder; Naglis Malys; Warwick B Dunn; Ettore Murabito; Neil Swainston; Joseph O Dada; Farid Khan; Pınar Pir; Evangelos Simeonidis; Irena Spasić; Jill Wishart; Dieter Weichart; Neil W Hayes; Daniel Jameson; David S Broomhead; Stephen G Oliver; Simon J Gaskell; John E G McCarthy; Norman W Paton; Hans V Westerhoff; Douglas B Kell; Pedro Mendes
Journal:  FEBS Lett       Date:  2013-07-04       Impact factor: 4.124

7.  Systematic integration of experimental data and models in systems biology.

Authors:  Peter Li; Joseph O Dada; Daniel Jameson; Irena Spasic; Neil Swainston; Kathleen Carroll; Warwick Dunn; Farid Khan; Naglis Malys; Hanan L Messiha; Evangelos Simeonidis; Dieter Weichart; Catherine Winder; Jill Wishart; David S Broomhead; Carole A Goble; Simon J Gaskell; Douglas B Kell; Hans V Westerhoff; Pedro Mendes; Norman W Paton
Journal:  BMC Bioinformatics       Date:  2010-11-29       Impact factor: 3.169

8.  Annotation of SBML models through rule-based semantic integration.

Authors:  Allyson L Lister; Phillip Lord; Matthew Pocock; Anil Wipat
Journal:  J Biomed Semantics       Date:  2010-06-22

9.  Integrating systems biology models and biomedical ontologies.

Authors:  Robert Hoehndorf; Michel Dumontier; John H Gennari; Sarala Wimalaratne; Bernard de Bono; Daniel L Cook; Georgios V Gkoutos
Journal:  BMC Syst Biol       Date:  2011-08-11

10.  Controlled vocabularies and semantics in systems biology.

Authors:  Mélanie Courtot; Nick Juty; Christian Knüpfer; Dagmar Waltemath; Anna Zhukova; Andreas Dräger; Michel Dumontier; Andrew Finney; Martin Golebiewski; Janna Hastings; Stefan Hoops; Sarah Keating; Douglas B Kell; Samuel Kerrien; James Lawson; Allyson Lister; James Lu; Rainer Machne; Pedro Mendes; Matthew Pocock; Nicolas Rodriguez; Alice Villeger; Darren J Wilkinson; Sarala Wimalaratne; Camille Laibe; Michael Hucka; Nicolas Le Novère
Journal:  Mol Syst Biol       Date:  2011-10-25       Impact factor: 11.429

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.