Literature DB >> 16381960

BioModels Database: a free, centralized database of curated, published, quantitative kinetic models of biochemical and cellular systems.

Nicolas Le Novère1, Benjamin Bornstein, Alexander Broicher, Mélanie Courtot, Marco Donizelli, Harish Dharuri, Lu Li, Herbert Sauro, Maria Schilstra, Bruce Shapiro, Jacky L Snoep, Michael Hucka.   

Abstract

BioModels Database (http://www.ebi.ac.uk/biomodels/), part of the international initiative BioModels.net, provides access to published, peer-reviewed, quantitative models of biochemical and cellular systems. Each model is carefully curated to verify that it corresponds to the reference publication and gives the proper numerical results. Curators also annotate the components of the models with terms from controlled vocabularies and links to other relevant data resources. This allows the users to search accurately for the models they need. The models can currently be retrieved in the SBML format, and import/export facilities are being developed to extend the spectrum of formats supported by the resource.

Entities:  

Mesh:

Year:  2006        PMID: 16381960      PMCID: PMC1347454          DOI: 10.1093/nar/gkj092

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The number of quantitative models trying to explain various aspects of the cellular machinery is increasing at a steady pace, thanks in part to the rising popularity of systems biology (1). However, as for all types of knowledge, such models will only be as useful as their access and reuse is easy for all scientists. A first step was to define standard descriptions to encode quantitative models in machine-readable formats. Example of such formats are CellML (2) and the Systems Biology Markup Language (SBML) (3,4). The biomedical community now needs public integrated resources, where authors can deposit, in controlled formats, the models they describe in scientific publications. Some general repositories of quantitative models have been made available, such as the CellML repository CellML repository [(5), ] JWS Online (6) and the former SBML repository. In addition specialist repositories include SenseLab ModelDB (7), the Database of Quantitative Cellular Signalling (DOCQS) (8) and SigPath (9). However no general public resource existed that allowed the user to browse, search and retrieve annotated models Here we present BioModels Database, developed as part of the BioModels.net initiative (). BioModels.net is a collaboration between the SBML Team (USA), the EMBL-EBI (UK), the Systems Biology Group of the Keck Graduate Institute (USA), the Systems Biology Institute (Japan) and JWS Online at Stellenbosch University (South Africa). Its aims are as follows: (i) to define agreed-upon standards for model curation, (ii) to define agreed-upon vocabularies for annotating models with connections to biological data resources and (iii) to provide a free, centralized, publicly accessible database of annotated, computational models in SBML and other structured formats. BioModels Database is an annotated resource of quantitative models of biomedical interest. Models are carefully curated to verify their correspondence to their source articles. They are also extensively annotated, with (i) terms from controlled vocabularies, such as disease codes and Gene Ontology terms and (ii) links to other data resources, such as sequence or pathway databases. Researchers in the biomedical and life science communities can then search and retrieve models related to a particular disease, biological process or molecular complex.

SUBMISSION, CURATION AND ANNOTATION

Models can be submitted by anyone to the curation pipeline of the database (Figure 1). At present, BioModels Database aims to store and annotate models that can be encoded with SBML. CellML models are also accepted. These model formats are synonymous with models that can be integrated or iterated forwards in time, such as ordinary differential equation models. Although we are aware that this means we can cover only a restricted part of the modeling field, we make this our initial focus for the following reason: (i) since a crucial part of the curation process is the verification that the models produce numerical results similar to the ones described in the reference article, iterative simulations over ranges of parameter values and perturbation of simulations at equilibrium are mandatory and (ii) a very large number of such models have already been published, and the pace of their publication is increasing steadily. As a consequence, they are sufficient to consume all the curation workforce we have, and we can envision to gather in the near future.
Figure 1

Pipeline describing the structure of BioModels database.

To be accepted in BioModels Database, a model must be compliant with MIRIAM, the Minimal Information Requested in the Annotation of Models (10). One of the requirements of MIRIAM is that a model has to be associated with a reference description that provides directly, or through references, the structure of the model, the necessary quantitative parameters and presents the results of numerical analysis of the model. BioModels Database further refines the notion of reference description, by considering only models described in the peer-reviewed scientific litterature. A series of automated tasks are performed by the pipeline prior to human intervention (see Materials and Methods for details): Verification that the file is well-formed XML. If necessary, conversion to the latest version of SBML. Verification of the syntax of SBML. Series of consistency checks, enforcing the validity of the model. If any of those steps is not completed, a member of the distributed team of curators can reject the model, or instead correct it and resubmit it to the pipeline. The last and most important step, of the curation process, is verifying that when instantiated in a simulation, the model provides results corresponding to the reference scientific article. Curators do not normally challenge the biological relevance of the models, and assume the peer-review process already filtered out unsuitable contributions. However, in specific cases, curators can spot mistakes in an article and, with the agreement of the authors, modify the model accordingly. Once the model is verified to be valid SBML, and to correspond well to the article, it is accepted in the production database for annotation. In order to be confident in reusing an encoded model, one should be able to trace its origin, and the people who were involved in its inception. The following information is therefore added to the model: (i) either a PubMed identifier () or a DOI () or an URL that permits identifying the peer-review article describing the model; (ii) name and contact details of the individuals who actually contributed to the encoding of the model in its present form; (iii) name and contact of the the person who finally entered the model in the production database and who should be contacted if there is a problem with the encoding of the model or the annotation. In addition, model components are annotated with references to relevant resources, such as terms from controled vocabularies (Taxonomy, Gene Ontology, ChEBI, etc.) and links to other databases (UniProt, KEGG, Reactome, etc.). This annotation is a crucial feature of BioModels Database in that it permits the unambiguous identification of molecular species or reactions and enables effective search stategies.

SEARCH AND RETRIEVAL

The thorough annotation of models allows a triple search strategy to be run in order to retrieve models of interest (Figure 2).
Figure 2

Schema representing the cascading search strategy. The result is a list of BioModels entries.

The models converted to SBML are stored directly in an XML native database (Xindice, ), enabling those models and/or their components to be retrieved based on the content of their elements and attributes (using XPath, ). For instance, the user can search for a given string of characters in the id, name and notes elements of each model component. Models can be retrieved by searching the annotation database directly, using SQL. Although this search is quick, it requires knowing the exact identifiers used by curators to annotate a model and relate it to third party resources, such as UniProt accession, Gene Ontology Term ID, etc. We, therefore, implemented a more advanced search system. A user can actually search third party resources directly, such as PubMed, Gene Ontology and UniProt, for instance with literal text matching. The search system retrieves the relevant identifiers and then searches BioModels Database for the models annotated with those identifiers. As a consequence, the user can retrieve all the models dealing with ‘cell cycle’ or ‘MAPK’, without having to type ‘GO:0007049’ or ‘P27361’. Several searches of any of the three types can also be run in parallel, the results being thereafter combined with boolean operators. Once retrieved, the models of interest can be downloaded in SBML Level 2 format. A number of export filters are under development to provide the models in a wider range of formats. BioModels Database is copyrighted by The BioModels Team, i.e. the set of individuals developing the resource. However, the copyright on the database does not imply copyright of the original models in BioModels Database. Each individual model retains the copyright assigned by both the creator(s) of the model and the author(s) of the reference publication. Users may distribute verbatim copies of the entire content of BioModels Database, including the models and their annotations, or a subset of the models. Users may also modify any of the models in any way, provided that at least one of the following condition is fulfilled: The modified model is used only within the user's organization. The modifications are placed in the Public Domain, or otherwise made Freely Available by allowing the Copyright Holders of the model to include the modifications in the standard version of the model. The modified model is renamed, and both BioModels Database identifier and any mention of the Copyright Holders of the model is removed. Other distribution arrangements are made directly with the Copyright Holders of the model(s) in question. This restricted license has been rendered necessary by the specific nature of the data distributed by BioModels Database. If a user of BioModels Database downloads a kinetics model and modifies it, the resulting model could be meaningless, or even worse, exhibits a behaviour completely different of what was initially meant by the authors and the creators. Therefore, we thought that the best compromise was to let complete freedom of reuse and modification, providing that BioModels Database is not associated with any modification.

PERSPECTIVE

Although BioModels Database is a very recent resource, it has already gained momentum thanks to the support of the SBML community, which has started to submit models, and major scientific publishing actors such as Nature Publishing Group, which has publicized the launch of the database. The growth of BioModels Database is currently limited, by the size of the curation workforce, to only a dozen new models a month. We expect that the existence of this public resource will contribute to an improvement in the quality of the models published by establishing an additional process for evaluating those models. The increase in quality and the continuously improved support of SBML by modelling tools should increase the speed of curation. Meanwhile, we will continue to improve the search and retrieval facilities, and support more export formats, so that users can directy use the models contained in the database even in non-SBML compliant tools.
  9 in total

1.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models.

Authors:  M Hucka; A Finney; H M Sauro; H Bolouri; J C Doyle; H Kitano; A P Arkin; B J Bornstein; D Bray; A Cornish-Bowden; A A Cuellar; S Dronov; E D Gilles; M Ginkel; V Gor; I I Goryanin; W J Hedley; T C Hodgman; J-H Hofmeyr; P J Hunter; N S Juty; J L Kasberger; A Kremling; U Kummer; N Le Novère; L M Loew; D Lucio; P Mendes; E Minch; E D Mjolsness; Y Nakayama; M R Nelson; P F Nielsen; T Sakurada; J C Schaff; B E Shapiro; T S Shimizu; H D Spence; J Stelling; K Takahashi; M Tomita; J Wagner; J Wang
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

2.  The Database of Quantitative Cellular Signaling: management and analysis of chemical kinetic models of signaling networks.

Authors:  Sudhir Sivakumaran; Sridhar Hariharaputran; Jyoti Mishra; Upinder S Bhalla
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

3.  Systems biology markup language: Level 2 and beyond.

Authors:  A Finney; M Hucka
Journal:  Biochem Soc Trans       Date:  2003-12       Impact factor: 5.407

Review 4.  CellML: its future, present and past.

Authors:  Catherine M Lloyd; Matt D B Halstead; Poul F Nielsen
Journal:  Prog Biophys Mol Biol       Date:  2004 Jun-Jul       Impact factor: 3.667

5.  Quantitative information management for the biochemical computation of cellular networks.

Authors:  Fabien Campagne; Susana Neves; Chiung-wen Chang; Lucy Skrabanek; Prahlad T Ram; Ravi Iyengar; Harel Weinstein
Journal:  Sci STKE       Date:  2004-08-24

6.  ModelDB: making models publicly accessible to support computational neuroscience.

Authors:  Michele Migliore; Thomas M Morse; Andrew P Davison; Luis Marenco; Gordon M Shepherd; Michael L Hines
Journal:  Neuroinformatics       Date:  2003

7.  Web-based kinetic modelling using JWS Online.

Authors:  Brett G Olivier; Jacky L Snoep
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

8.  Minimum information requested in the annotation of biochemical models (MIRIAM).

Authors:  Nicolas Le Novère; Andrew Finney; Michael Hucka; Upinder S Bhalla; Fabien Campagne; Julio Collado-Vides; Edmund J Crampin; Matt Halstead; Edda Klipp; Pedro Mendes; Poul Nielsen; Herbert Sauro; Bruce Shapiro; Jacky L Snoep; Hugh D Spence; Barry L Wanner
Journal:  Nat Biotechnol       Date:  2005-12       Impact factor: 54.908

9.  International alliances for quantitative modeling in systems biology.

Authors:  Hiroaki Kitano
Journal:  Mol Syst Biol       Date:  2005-03-29       Impact factor: 11.429

  9 in total
  250 in total

1.  Proteus: a web-based, context-specific modelling tool for molecular networks.

Authors:  Florian Gnad; Javier Estrada; Jeremy Gunawardena
Journal:  Bioinformatics       Date:  2012-03-15       Impact factor: 6.937

2.  What it takes to understand and cure a living system: computational systems biology and a systems biology-driven pharmacokinetics-pharmacodynamics platform.

Authors:  Maciej Swat; Szymon M Kiełbasa; Sebastian Polak; Brett Olivier; Frank J Bruggeman; Mark Quinton Tulloch; Jacky L Snoep; Arthur J Verhoeven; Hans V Westerhoff
Journal:  Interface Focus       Date:  2010-12-08       Impact factor: 3.906

Review 3.  Computational tools for metabolic engineering.

Authors:  Wilbert B Copeland; Bryan A Bartley; Deepak Chandran; Michal Galdzicki; Kyung H Kim; Sean C Sleight; Costas D Maranas; Herbert M Sauro
Journal:  Metab Eng       Date:  2012-05       Impact factor: 9.783

Review 4.  Using views of Systems Biology Cloud: application for model building.

Authors:  Oliver Ruebenacker; Michael Blinov
Journal:  Theory Biosci       Date:  2010-08-21       Impact factor: 1.919

5.  Simulating plant metabolic pathways with enzyme-kinetic models.

Authors:  Kai Schallau; Björn H Junker
Journal:  Plant Physiol       Date:  2010-01-29       Impact factor: 8.340

6.  Kinetic Modeling using BioPAX ontology.

Authors:  Oliver Ruebenacker; Ion I Moraru; James C Schaff; Michael L Blinov
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2007-11-02

7.  Exploring the gap between dynamic and constraint-based models of metabolism.

Authors:  Daniel Machado; Rafael S Costa; Eugénio C Ferreira; Isabel Rocha; Bruce Tidor
Journal:  Metab Eng       Date:  2012-01-28       Impact factor: 9.783

Review 8.  Systems Biology for Smart Crops and Agricultural Innovation: Filling the Gaps between Genotype and Phenotype for Complex Traits Linked with Robust Agricultural Productivity and Sustainability.

Authors:  Anil Kumar; Rajesh Kumar Pathak; Sanjay Mohan Gupta; Vikram Singh Gaur; Dinesh Pandey
Journal:  OMICS       Date:  2015-10

9.  Kinetic modeling and the rise of systems pharmacology.

Authors:  Robert D Phair
Journal:  J Lipid Res       Date:  2015-11-20       Impact factor: 5.922

10.  Parameter Estimation and Uncertainty Quantification for Systems Biology Models.

Authors:  Eshan D Mitra; William S Hlavacek
Journal:  Curr Opin Syst Biol       Date:  2019-11-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.