Literature DB >> 34674411

Specifications of standards in systems and synthetic biology: status and developments in 2021.

Falk Schreiber1,2, Padraig Gleeson3, Martin Golebiewski4, Thomas E Gorochowski5, Michael Hucka6, Sarah M Keating3, Matthias König7, Chris J Myers8, David P Nickerson9, Björn Sommer10, Dagmar Waltemath11.   

Abstract

This special issue of the Journal of Integrative Bioinformatics contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.
© 2021 Falk Schreiber et al., published by De Gruyter, Berlin/Boston.

Entities:  

Mesh:

Year:  2021        PMID: 34674411      PMCID: PMC8573232          DOI: 10.1515/jib-2021-0026

Source DB:  PubMed          Journal:  J Integr Bioinform        ISSN: 1613-4516


Introduction

COMBINE (‘COmputational Modeling in BIology’ NEtwork) [1, 2] is the formal entity coordinating the development of standards in systems and synthetic biology. It was founded in 2009, and since then supports and coordinates standard developments across the globe and thereby fosters and moderates discussions; designs and implements dissemination strategies; and offers a central access point to specifications and library support. The COMBINE coordination board organises two annual community meetings – the COMBINE forum and the HARMONY hackathon. Figure 1 shows an overview of COMBINE standards and associated initiatives. This editorial will present the latest specifications of all COMBINE standards, and this special issue highlights updates over the last year, namely the releases of the following specifications: OMEX Metadata Version 1.2, SED-ML Level 1 Version 4, SBOL Visual 2.3 and 3.0. Special issues on COMBINE standards have been published since 2016, and earlier editions [3], [4], [5], [6], [7] provide updates for the years 2015–2020.
Figure 1:

COMBINE standards and associated efforts (updated from [3]).

COMBINE standards and associated efforts (updated from [3]). The community websites of the different standards and initiatives are available from the COMBINE web site at https://co.mbine.org/. This web site also contains links to COMBINE events such as the COMBINE forum and the HARMONY hackathon. Additional information, the history of COMBINE as well as its organisation can be found in publications, for example, by Hucka et al. [2], Myers et al. [8], Waltemath et al. [9] or Dräger and Waltemath [10]. This editorial will in the remaining part provide a brief updated overview of the latest specifications of the COMBINE standards and related initiatives (updated from [7]).

Current versions of COMBINE standards

Please refer to the following (most up-to-date) specifications when using COMBINE standards. New specifications or updates of existing specifications are highlighted with NEW.

Core standards

BioPAX (Biological PAthway eXchange)

BioPAX is a standard language for integration, exchange and analysis of biological pathway data. It is expressed in OWL. The current specification is:

CellML

The CellML language is an XML markup language to store and exchange computer-based mathematical models. The current specifications are:

NeuroML

The neural open markup language (NeuroML) is an XML-based description language that provides a common data format for defining and exchanging descriptions of neuronal cell and network models. The current specification is:

SBGN (Systems Biology Graphical Notation)

The systems biology graphical notation (SBGN), is a set standard graphical languages to describe visually biological knowledge. It consists of three languages describing Process Descriptions, Entity Relationships and Activity Flows. In addition, SBGN-ML is an XML-based file format describing the geometry of SBGN maps, while preserving their underlying biological meaning. The current specifications are:

SBML (Systems Biology Markup Language)

The systems biology markup language (SBML) is a computer-readable XML format for representing models of biological processes. SBML is suitable for, but not limited to, models using a process description approach. SBML development is coordinated by an elected editorial board and central developer team. The current specifications are:

SBOL (Synthetic Biology Open Language)

The synthetic biology open language (SBOL) is a language for the description and the exchange of synthetic biological parts, devices, and systems. SBOL visual (SBOLv) is a complementary standard that provides a standard set of glyphs and rules for drawing genetic circuit diagrams. The current specifications are listed in the following table. NEW Synthetic biology open language visual (SBOL Visual) Version 2.3 [34] updates SBOL visual version 2.2 with several new features. These include higher-level “interactions with interactions”, the representation of the binding with a nucleic acid backbone using overlapping glyphs, and a new “unspecified interaction” glyph. In this version, the “insulator” glyph is deprecated and replaced with a new “inert DNA spacer” glyph, and the polypeptide region glyph is now recommended for showing 2A sequences. NEW SBOL Visual Version 3.0 [35] is major revision of the SBOL Visual standard in which the diagrams and glyphs are now defined with respect to the SBOL 3 data model rather than the SBOL 2 data model. In addition, the use of dashed undirected lines for subsystem mappings has been removed, and deprecated material has been removed from the collection of glyphs. Finally, the deprecated BioPAX alternatives to SBO terms have been removed.

SED-ML (Simulation Experiment Description Markup Language)

The simulation experiment description markup language is an XML-based format for encoding simulation experiments. SED-ML allows to define the models to use, the experimental tasks to run and which results to produce. SED-ML can be used with models encoded in several languages. The current specification is listed in the following table. NEW The simulation experiment description markup language (SED-ML) Level 1 Version 4 [38] clarifies previous versions of SED-ML and extends the language with multiple new features. Specifically, enhancements introduced in SED-ML Level 1 Version 4 include (1) enriched plotting capabilities, (2) dimension reductions and math on multidimensional data, (3) support for simple parameter fitting experiments, and (4) a generic Analysis task. Further refinements in this version of the specification aim to clarify the use of SED-ML with non-XML model description formats.

Associated standards

Associated standards provide an additional layer of semantics to COMBINE representation formats. The current specifications are: A COMBINE archive is a single file bundling the various documents and all relevant information necessary for a modelling and simulation project. The archive is encoded using the Open Modeling EXchange format (OMEX). COMBINE archive metadata provides a harmonised, community-driven approach for annotating a variety of standardised model and data representation formats within a COMBINE archive. BioModels.net qualifiers are standardised relationships (predicates) that specify the relation between an object represented in a description language and the external resource used to annotate it. MIRIAM Unique Resource Identifiers allow one to uniquely and unambiguously identify an entity in a stable and perennial manner. MIRIAM Registry is a set of services and resources that provide support for generating, interpreting and resolving MIRIAM URIs. Through the Identifiers.org technology, MIRIAM URIs can be dereferenced in a flexible and robust way. MIRIAM URIs are used by SBML, SED-ML, CellML and BioPAX controlled annotation schemes. The systems biology ontology (SBO) is a set of controlled, relational vocabularies of terms commonly used in systems biology, and in particular in computational modelling. Each element of an SBML file carries an optional attribute sboTerm which value must be a term from SBO. Each symbol of SBGN is associated with an SBO term. The kinetic simulation algorithm ontology (KiSAO) describes existing algorithms and their inter-relationships through their characteristics and parameters. KiSAO is used in SED-ML, which allows simulation software to automatically choose the best algorithm available to perform a simulation and unambiguously refer to it. The OMEX Metadata Specification is a technical implementation of the community consensus across COMBINE standards to harmonise the way we describe computational models and other resources with metadata [50]. NEW The OMEX Metadata Specification Version 1.2 [42] clarifies the OMEX Metadata Specification 1.0 [51]. Specifically, the main changes introduced in OMEX Metadata Specification Version 1.2 include (1) clarification of authorship and provenance information, (2) clarification of distinction between document- and model-level annotations, and (3) the introduction of an archive-level namespacing convention to ensure reproducible sharing and interpretation of the knowledge graph encoded in the annotations. Click here for additional data file.
StandardSpecificationReference
BioPAX [11]BioPAX[12]
StandardSpecificationReference
CellML [13]CellML 2.0[14]
CellML Metadata Framework 2.0[15]
CellML 1.1[16]
StandardSpecificationReference
NeuroML [17, 18]NeuroML version 2.1[17]
StandardSpecificationReference
SBGN [19]SBGN Process Description Level 1 Version 2[20]
SBGN Entity Relationship Level 1 Version 2.0[21]
SBGN Activity Flow Level 1 Version 1.2[22]
SBGN Markup Language Version 0.3[23]
StandardSpecificationReference
SBML [24]SBML Level 3 Core, Version 2 Release 2[25]
SBML Level 3 Package: Distributions, Version 1, Release 1[26]
SBML Level 3 Package: Flux Balance Constraints Version 2[27]
SBML Level 3 Package: Groups, Version 1[28]
SBML Level 3 Package: Hierarchical Model Composition, Version 1[29]
SBML Level 3 Package: Layout, Version 1[30]
SBML Level 3 Package: Multistate, Multicomponent and Multicompartment[31]
Species, Version 1, Release 2
SBML Level 3 Package: Qualitative Models, Version 1[32]
SBML Level 3 Package: Render, Version 1, Release 1[33]
StandardSpecificationReference
SBOL [36]SBOL Version 3.0.0[37]
SBOL Visual Version 2.3[34]
SBOL Visual Version 3.0[35]
StandardSpecificationReference
SED-ML [39]SED-ML Level 1 Version 4[38]
StandardSpecificationReference
COMBINE Archive [40]COMBINE Archive 1.0[41]
OMEX MetadataOMEX Metadata Version 1.2[42]
BioModels.net qualifiers [43][44]
Identifiers.org URIs [45][46]
Systems Biology Ontology [47][external] Bioportal[48]
Kinetic Simulation Algorithm Ontology [47][external] Bioportal[49]
  41 in total

1.  Systems Biology Graphical Notation: Activity Flow language Level 1 Version 1.2.

Authors:  Huaiyu Mi; Falk Schreiber; Stuart Moodie; Tobias Czauderna; Emek Demir; Robin Haw; Augustin Luna; Nicolas Le Novère; Anatoly Sorokin; Alice Villéger
Journal:  J Integr Bioinform       Date:  2015-09-04

2.  The BioPAX community standard for pathway data sharing.

Authors:  Emek Demir; Michael P Cary; Suzanne Paley; Ken Fukuda; Christian Lemer; Imre Vastrik; Guanming Wu; Peter D'Eustachio; Carl Schaefer; Joanne Luciano; Frank Schacherer; Irma Martinez-Flores; Zhenjun Hu; Veronica Jimenez-Jacinto; Geeta Joshi-Tope; Kumaran Kandasamy; Alejandra C Lopez-Fuentes; Huaiyu Mi; Elgar Pichler; Igor Rodchenkov; Andrea Splendiani; Sasha Tkachev; Jeremy Zucker; Gopal Gopinath; Harsha Rajasimha; Ranjani Ramakrishnan; Imran Shah; Mustafa Syed; Nadia Anwar; Ozgün Babur; Michael Blinov; Erik Brauner; Dan Corwin; Sylva Donaldson; Frank Gibbons; Robert Goldberg; Peter Hornbeck; Augustin Luna; Peter Murray-Rust; Eric Neumann; Oliver Ruebenacker; Oliver Reubenacker; Matthias Samwald; Martijn van Iersel; Sarala Wimalaratne; Keith Allen; Burk Braun; Michelle Whirl-Carrillo; Kei-Hoi Cheung; Kam Dahlquist; Andrew Finney; Marc Gillespie; Elizabeth Glass; Li Gong; Robin Haw; Michael Honig; Olivier Hubaut; David Kane; Shiva Krupa; Martina Kutmon; Julie Leonard; Debbie Marks; David Merberg; Victoria Petri; Alex Pico; Dean Ravenscroft; Liya Ren; Nigam Shah; Margot Sunshine; Rebecca Tang; Ryan Whaley; Stan Letovksy; Kenneth H Buetow; Andrey Rzhetsky; Vincent Schachter; Bruno S Sobral; Ugur Dogrusoz; Shannon McWeeney; Mirit Aladjem; Ewan Birney; Julio Collado-Vides; Susumu Goto; Michael Hucka; Nicolas Le Novère; Natalia Maltsev; Akhilesh Pandey; Paul Thomas; Edgar Wingender; Peter D Karp; Chris Sander; Gary D Bader
Journal:  Nat Biotechnol       Date:  2010-09-09       Impact factor: 54.908

3.  Controlled vocabularies and semantics in systems biology.

Authors:  Mélanie Courtot; Nick Juty; Christian Knüpfer; Dagmar Waltemath; Anna Zhukova; Andreas Dräger; Michel Dumontier; Andrew Finney; Martin Golebiewski; Janna Hastings; Stefan Hoops; Sarah Keating; Douglas B Kell; Samuel Kerrien; James Lawson; Allyson Lister; James Lu; Rainer Machne; Pedro Mendes; Matthew Pocock; Nicolas Rodriguez; Alice Villeger; Darren J Wilkinson; Sarala Wimalaratne; Camille Laibe; Michael Hucka; Nicolas Le Novère
Journal:  Mol Syst Biol       Date:  2011-10-25       Impact factor: 11.429

4.  SBML Level 3 package: Groups, Version 1 Release 1.

Authors:  Michael Hucka; Lucian P Smith
Journal:  J Integr Bioinform       Date:  2016-12-18

5.  SBML Level 3 package: Render, Version 1, Release 1.

Authors:  Frank T Bergmann; Sarah M Keating; Ralph Gauges; Sven Sahle; Katja Wengler
Journal:  J Integr Bioinform       Date:  2018-04-02

Review 6.  Harmonizing semantic annotations for computational models in biology.

Authors:  Maxwell Lewis Neal; Matthias König; David Nickerson; Göksel Mısırlı; Reza Kalbasi; Andreas Dräger; Koray Atalag; Vijayalakshmi Chelliah; Michael T Cooling; Daniel L Cook; Sharon Crook; Miguel de Alba; Samuel H Friedman; Alan Garny; John H Gennari; Padraig Gleeson; Martin Golebiewski; Michael Hucka; Nick Juty; Chris Myers; Brett G Olivier; Herbert M Sauro; Martin Scharm; Jacky L Snoep; Vasundra Touré; Anil Wipat; Olaf Wolkenhauer; Dagmar Waltemath
Journal:  Brief Bioinform       Date:  2019-03-22       Impact factor: 11.622

7.  The Systems Biology Markup Language (SBML): Language Specification for Level 3 Version 2 Core Release 2.

Authors:  Michael Hucka; Frank T Bergmann; Claudine Chaouiya; Andreas Dräger; Stefan Hoops; Sarah M Keating; Matthias König; Nicolas Le Novère; Chris J Myers; Brett G Olivier; Sven Sahle; James C Schaff; Rahuman Sheriff; Lucian P Smith; Dagmar Waltemath; Darren J Wilkinson; Fengkai Zhang
Journal:  J Integr Bioinform       Date:  2019-06-20

8.  Systems biology markup language (SBML) level 3 package: multistate, multicomponent and multicompartment species, version 1, release 2.

Authors:  Fengkai Zhang; Lucian P Smith; Michael L Blinov; James Faeder; William S Hlavacek; Jose Juan Tapia; Sarah M Keating; Nicolas Rodriguez; Andreas Dräger; Leonard A Harris; Andrew Finney; Bin Hu; Michael Hucka; Martin Meier-Schellersheim
Journal:  J Integr Bioinform       Date:  2020-07-06

9.  Open modeling and exchange (OMEX) metadata specification version 1.0.

Authors:  Maxwell L Neal; John H Gennari; Dagmar Waltemath; David P Nickerson; Matthias König
Journal:  J Integr Bioinform       Date:  2020-06-25

10.  Systems biology graphical notation markup language (SBGNML) version 0.3.

Authors:  Frank T Bergmann; Tobias Czauderna; Ugur Dogrusoz; Adrien Rougny; Andreas Dräger; Vasundra Touré; Alexander Mazein; Michael L Blinov; Augustin Luna
Journal:  J Integr Bioinform       Date:  2020-06-22
View more
  1 in total

1.  Design considerations for representing systems biology information with the Systems Biology Graphical Notation.

Authors:  Falk Schreiber; Tobias Czauderna
Journal:  J Integr Bioinform       Date:  2022-07-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.