
SABIO-RK: an updated resource for manually curated biochemical reaction kinetics.

Ulrike Wittig1, Maja Rey1, Andreas Weidemann1, Renate Kania2, Wolfgang Müller1.   

Abstract

SABIO-RK (http://sabiork.h-its.org/) is a manually curated database containing data about biochemical reactions and their reaction kinetics. The data are primarily extracted from scientific literature and stored in a relational database. The content comprises both naturally occurring and alternatively measured biochemical reactions and is not restricted to any organism class. The data are made available to the public by a web-based search interface and by web services for programmatic access. In this update we describe major improvements and extensions of SABIO-RK since our last publication in the database issue of Nucleic Acids Research (2012). (i) The website has been completely revised and (ii) now also allows free-text search for kinetics data. (iii) Additional interlinkages with other databases in our field have been established; these enable users to directly gain comprehensive knowledge about the properties of enzymes and kinetics beyond SABIO-RK. (iv) Conversely, direct access to SABIO-RK data has been implemented in several systems biology tools and workflows. (v) At the request of our experimental users, the data can now also be exported in spreadsheet formats. (vi) The newly established SABIO-RK Curation Service makes it possible to respond to specific data requirements.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Year:  2018        PMID: 29092055      PMCID: PMC5753344          DOI: 10.1093/nar/gkx1065

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The SABIO-RK database (1) was established in 2006 to support modellers of biochemical reactions and complex networks. SABIO-RK is a repository for structured, curated and annotated data about reactions and their kinetics. The data are manually extracted from the scientific literature and stored in a relational database. Compared with automatic data extraction by text mining tools, the manual extraction process ensures a very high degree of accuracy and completeness. This matters in particular for the complex information on reactions and kinetics, because most of the available publications are not sufficiently structured or clearly written. Furthermore, relevant information is distributed over the entire article, and unique identifiers or controlled vocabularies are missing (2,3). Because manual data extraction and curation are time-consuming, SABIO-RK emphasizes quality rather than quantity. SABIO-RK is a database not only for modellers but also for experimentalists in the laboratory who are looking, for example, for more details about the enzymatic activity of a protein or about alternative reactions of an enzyme. For many years SABIO-RK focussed on the kinetics of metabolic reactions, but with increased user interest in kinetics data for signalling events, SABIO-RK now also stores reactions and binding events of signal transduction pathways. The bidirectional cross-references between SABIO-RK and protein-specific databases like UniProtKB (4), pathway databases like KEGG (5) or chemical compound databases like ChEBI (6) help users to find more specific kinetic information in SABIO-RK and vice versa. A comparable database providing kinetic parameters is BRENDA (7). The information in BRENDA is centred on enzymes and their kinetic constants, whereas SABIO-RK focuses on reactions and, besides constants, additionally offers the associated kinetic rate laws, formulas and experimental conditions.
Other databases containing kinetic data focus, for example, on proteins (UniProtKB), plant metabolism (MetaCrop (8)) or protein interactions (KDBI (9)).

NEW DATABASE CONTENT

Most SABIO-RK content is drawn from articles published between the late 1960s and today, currently covering more than 300 different journals. The selection of papers has changed over time. In the first years most publications were selected non-specifically by reaction-kinetics-related keyword searches in the PubMed database (10); nowadays the selection depends on collaboration projects and user requests. The fact that more than one third of the database content refers to mammals (mainly human and rat) and around 15% are liver data is the result of one such former collaboration project. Likewise, the fact that 25% of the data in SABIO-RK relate to the central Glycolysis/Gluconeogenesis pathway is due to user requests and several smaller projects. An increased user interest in plant metabolism is reflected in the roughly 10% of reaction kinetics data that refer to green plants (Embryophyta). Overall, the database content has grown by ∼40% since the last NAR publication in 2012. As of September 2017, SABIO-RK provides data extracted from more than 5,600 publications, stored in ∼57,000 different database entries. The kinetics data relate to 934 different organisms, of which about two-thirds belong to eukaryotes and one-third to bacteria, archaea and viruses. At present the top ten organisms in SABIO-RK are Homo sapiens, Rattus norvegicus, Escherichia coli, Saccharomyces cerevisiae, Mus musculus, Bos taurus, Bacillus subtilis, Arabidopsis thaliana, Sus scrofa and Oryctolagus cuniculus. A more detailed breakdown of the database content is shown in Figure 1.
Figure 1. SABIO-RK statistics (status: September 2017).

SABIO-RK mostly contains metabolic reactions, with only a small fraction of signalling and transport reactions. Currently kinetic data for ∼80 different signalling and 150 different transport reactions are stored in the database. Transport reactions include reactions with and without chemical conversion of substrates. Reactions are usually assigned to pathways, which are based on the classifications from KEGG. However, because alternative or non-biochemical compounds are often used in experiments, SABIO-RK contains many alternative reactions that are not linked to a biochemical pathway. A database entry in SABIO-RK comprises kinetics data for one single reaction in one organism under specific experimental conditions. If a publication provides information for more than one biochemical reaction, organism or enzyme, these data are stored not in one single entry but in several distinct database entries. About 25% of the database entries contain data for specific mutant enzyme variants, which allows kinetics data from mutant and wildtype proteins to be compared. More than 90% of SABIO-RK data have been manually extracted from publications. Biological experts read the paper and enter the relevant information in a web-based curation interface, where the data are semi-automatically checked for correctness and consistency. Annotations and unique identifiers are added for interoperability and interlinkage with ontologies, controlled vocabularies and external databases. Additionally, kinetics data from lab experiments or models can be uploaded directly into the curation interface in SBML format and further processed by the curators. Data in the database entry and in the details pages are highly interlinked with external databases, ontologies and controlled vocabularies. Details pages for the reaction, organism, enzyme, pathway and compound are shown in separate pop-up windows after clicking on the appropriate term.
Links are implemented for reactions to KEGG, for proteins to UniProtKB, for organisms to NCBI taxonomy (10), for tissues to BRENDA Tissue Ontology (BTO) (11), for publications to PubMed, for compounds to ChEBI, KEGG and PubChem (10), for cell locations and signalling events to Gene Ontology (GO) (12), for kinetic laws and parameters to Systems Biology Ontology (SBO) (13), and for enzymes to ExPASy (14), KEGG, BRENDA, IntEnz (15), IUBMB (http://www.chem.qmul.ac.uk/iubmb/enzyme/), Reactome (16) and MetaCrop.
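The rule that one database entry covers one reaction in one organism, with data for different reactions, organisms or enzymes stored in distinct entries, can be illustrated with a minimal sketch. The record layout, the names `Entry` and `split_into_entries`, and all parameter values below are hypothetical and chosen for illustration only; they do not reflect the SABIO-RK schema, and experimental conditions are left out of the grouping key for brevity.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    """Hypothetical record: one reaction, one organism, one enzyme variant."""
    reaction: str
    organism: str
    enzyme: str
    parameters: dict = field(default_factory=dict)


def split_into_entries(measurements):
    """Group one publication's measurements into distinct entries.

    Measurements that share reaction, organism and enzyme end up in the
    same entry; a difference in any of the three starts a new entry.
    """
    entries = {}
    for m in measurements:
        key = (m["reaction"], m["organism"], m["enzyme"])
        if key not in entries:
            entries[key] = Entry(*key)
        entries[key].parameters[m["parameter"]] = m["value"]
    return list(entries.values())


# Invented values for illustration only; they are not taken from SABIO-RK.
publication = [
    {"reaction": "pyruvate kinase", "organism": "Homo sapiens",
     "enzyme": "wildtype", "parameter": "Km", "value": 0.1},
    {"reaction": "pyruvate kinase", "organism": "Homo sapiens",
     "enzyme": "wildtype", "parameter": "Vmax", "value": 2.0},
    {"reaction": "pyruvate kinase", "organism": "Homo sapiens",
     "enzyme": "mutant", "parameter": "Km", "value": 0.4},
]

for entry in split_into_entries(publication):
    print(entry.enzyme, entry.parameters)
```

In this sketch the wildtype and mutant measurements from the same publication end up in two separate entries, which is what makes a later comparison of mutant and wildtype kinetics straightforward.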

NEW DATA ACCESS

Data in SABIO-RK can be retrieved via both the web-based search interface and RESTful web services. The most obvious change affects the website, which has been given a more modern design and also contains new features such as free-text search. A free-text search for ‘liver’ will return all entries containing this search term, independent of the data field (e.g. tissue, publication title or comment). The advanced search feature allows complex queries to be defined by selecting different attributes such as enzyme name, tissue, PubMedID, etc. from a selection list. This selection list includes not only names but also SABIO-RK internal as well as external identifiers (from KEGG, ChEBI, UniProtKB, GO, SBO etc.) and the option to search for signalling events (e.g. protein autophosphorylation) or signalling modifications (e.g. acetylation). The autocomplete function instantly makes suggestions and indicates how many results (database entries) the database holds for the given query. Figure 2 shows an example of the previously introduced ontology-based search for organisms using NCBI taxonomy and tissues using BTO, together with the results for classified groups of organisms and tissues in the new website design.
Figure 2. Screenshots of SABIO-RK newly designed web-based search interface with query results for kinetics data in mammalian blood using ontology-based search (using NCBI taxonomy for organism and BTO for tissue).

Additional options can now be specified by defining filters in the filter options box. Filters can be set, for example, for enzymes/proteins by selecting data for wildtype or mutant proteins. Selecting the rate equation filter displays only data entries with a kinetic rate equation; likewise, only transport reactions are displayed when the transport reaction filter is selected. The environmental conditions pH and temperature can be restricted to a range by moving the slider buttons. Since SABIO-RK contains data from different kinds of sources (publication, direct submission from a laboratory or model upload via SBML), a filter can be defined to search for specific data sources. Search results are displayed in three different views: Entry view, Reaction view and Visual search. By default the Entry view is shown, which lists the resulting database entries in summarized form. Detailed information for each database entry can be viewed by clicking on the blue triangle. The Reaction view groups the database entries by reaction. To give a quick impression of how a certain reaction is connected with enzymes, organisms and tissues, a corresponding visualization is provided in this view. Columns in both the Entry and the Reaction view can be sorted by clicking on the column header. Finally, the Visual search visualizes the search result and offers the opportunity to narrow the query by clicking on parts of the diagrams for organisms, tissues, kinetic parameters or kinetic rate laws. A partial screenshot of the Visual search containing the diagrams for organisms and tissues is shown in Figure 2. Search results of a SABIO-RK query can be selected for export by collecting database entries in an export cart.
Data can be exported in standard exchange formats including SBML (17), BioPAX (18), SBPAX (19) and MATLAB (http://www.mathworks.com), and in spreadsheet format, where the exported table columns can be defined by (de)selecting attributes from the list (see Figure 3).
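The combination of filters (wildtype or mutant, pH and temperature range, presence of a rate equation) and a spreadsheet export with selectable columns can be sketched on a local list of records. This is only an illustration of the logic, not the SABIO-RK implementation: the dictionary keys, the function names `filter_entries` and `to_spreadsheet`, and the sample records are all assumptions made for this example.

```python
import csv
import io


def filter_entries(entries, ph_range=None, temp_range=None, variant=None,
                   require_rate_equation=False):
    """Return the entries that pass every active filter (None = inactive)."""
    def in_range(value, bounds):
        if bounds is None:
            return True
        low, high = bounds
        # Entries without a recorded value cannot match a range filter.
        return value is not None and low <= value <= high

    selected = []
    for e in entries:
        if not in_range(e.get("ph"), ph_range):
            continue
        if not in_range(e.get("temperature"), temp_range):
            continue
        if variant is not None and e.get("variant") != variant:
            continue
        if require_rate_equation and not e.get("rate_equation"):
            continue
        selected.append(e)
    return selected


def to_spreadsheet(entries, columns):
    """Write only the selected columns as tab-separated text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns, delimiter="\t",
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    writer.writerows(entries)
    return buffer.getvalue()


# Hypothetical records; keys and values are invented for illustration.
sample = [
    {"entry_id": 1, "organism": "Homo sapiens", "variant": "wildtype",
     "ph": 7.4, "temperature": 37.0, "rate_equation": "Vmax*S/(Km+S)"},
    {"entry_id": 2, "organism": "Rattus norvegicus", "variant": "mutant",
     "ph": 6.0, "temperature": 25.0, "rate_equation": None},
    {"entry_id": 3, "organism": "Escherichia coli", "variant": "wildtype",
     "ph": None, "temperature": 30.0, "rate_equation": "Vmax*S/(Km+S)"},
]

hits = filter_entries(sample, ph_range=(7.0, 8.0), variant="wildtype")
print(to_spreadsheet(hits, ["entry_id", "organism"]))
```

Treating a missing pH or temperature as a non-match for a range filter is a design choice of this sketch; an alternative would be to keep such entries and flag them.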
Figure 3. Screenshot of SABIO-RK web-based search interface representing spreadsheet export functionality.

Besides the web interface, SABIO-RK web services can be used to access the database programmatically; they are also used by third-party software tools and data workflows to retrieve kinetics data. These tools include CellDesigner (20), VirtualCell (21), Sycamore (22), SBMLsqueezer (23), cy3sabiork (http://apps.cytoscape.org/apps/cy3sabiork), Path2Models (24), LigDig (25) and FAIRDOMHub (26). Currently SABIO-RK is accessed mostly (ca. 90%) via web services, which underlines the importance of its integration in modelling and visualization tools. Standard export formats for the web services are SBML, BioPAX/SBPAX and XML. In addition, a Python script is offered that uses the web services for data export in table format. SABIO-RK is cross-referenced by several other biological databases and online platforms, which allows the users of these external resources to gain further knowledge about the enzymatic activities of enzymes (links from UniProtKB, BRENDA, NextProt (27), ChloroKB (28), MetaCrop), detailed information about the kinetics of biochemical reactions (links from KEGG Reaction, MetaNetX (29), BKMS-react (30)) and the participation and role of compounds in these reactions (links from ChEBI, MetaNetX). Currently about 20% of SABIO-RK users enter the database search interface through cross-references from external databases. External links use the same query structure as the search interface (e.g. http://sabiork.h-its.org/newSearch?q=ecnumber:2.7.1.40), so that detailed or complex queries can be implemented. The results of the query links can be further refined in the search interface.
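The link structure quoted above (`newSearch?q=field:value`) can be generated programmatically. The following is a minimal sketch that only builds the URL string and performs no network request; the helper name `build_search_url` is hypothetical, and `ecnumber` is the only field name confirmed by the example link in the text, so any other field name passed in would be an assumption.

```python
from urllib.parse import quote

# Base address taken from the example link in the text.
BASE_URL = "http://sabiork.h-its.org/newSearch"


def build_search_url(field, value):
    """Build a search-interface link of the form newSearch?q=field:value."""
    query = f"{field}:{value}"
    # Keep ':' readable as in the example link; characters such as spaces
    # are percent-encoded.
    return f"{BASE_URL}?q={quote(query, safe=':')}"


print(build_search_url("ecnumber", "2.7.1.40"))
```

For the EC number used in the text, this reproduces the example link exactly; an external resource could generate one such link per enzyme record in the same way.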

NEW SERVICES

To adapt the content of the SABIO-RK database even more closely to user requirements, we support specific curation requests. If no results are returned for a specific query, the following note is displayed: ‘Sorry, we found no results for your query… — but you may send a request to add the corresponding data’, to encourage users to send their specific questions via the SABIO-RK contact form. For example, SABIO-RK curators will help to find relevant data in the literature and insert the kinetics data extracted from the publications into the database. This service can include searches for kinetics data for specific organisms, pathways or enzymes. SABIO-RK is flexible here and not restricted to any organism class or biochemical reaction type. This service is free of charge, and a list of public curation requests is displayed on a separate website for services (http://sabiork.h-its.org/publicCuration/list). Other types of requests, feedback or bug reports can always be submitted using the contact form. Additionally, to foster an interactive exchange amongst users and between users and the SABIO-RK team, an internet forum has been established via Google Groups. Users can also request the upload of their own experimental data and models, ideally provided in SBML format. These data then run through the curation process and are annotated and linked to controlled vocabularies, ontologies and external databases, to allow the comparison of these private results with published data. Unpublished data or models can be protected from public access in a password-restricted user area.
REFERENCES (10 of 29 shown)

1.  The ENZYME database in 2000.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models.

Authors:  M Hucka; A Finney; H M Sauro; H Bolouri; J C Doyle; H Kitano; A P Arkin; B J Bornstein; D Bray; A Cornish-Bowden; A A Cuellar; S Dronov; E D Gilles; M Ginkel; V Gor; I I Goryanin; W J Hedley; T C Hodgman; J-H Hofmeyr; P J Hunter; N S Juty; J L Kasberger; A Kremling; U Kummer; N Le Novère; L M Loew; D Lucio; P Mendes; E Minch; E D Mjolsness; Y Nakayama; M R Nelson; P F Nielsen; T Sakurada; J C Schaff; B E Shapiro; T S Shimizu; H D Spence; J Stelling; K Takahashi; M Tomita; J Wagner; J Wang
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

3.  IntEnz, the integrated relational enzyme database.

Authors:  Astrid Fleischmann; Michael Darsow; Kirill Degtyarenko; Wolfgang Fleischmann; Sinéad Boyce; Kristian B Axelsen; Amos Bairoch; Dietmar Schomburg; Keith F Tipton; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Integrating BioPAX pathway knowledge with SBML models.

Authors:  O Ruebenacker; I I Moraru; J C Schaff; M L Blinov
Journal:  IET Syst Biol       Date:  2009-09       Impact factor: 1.615

5.  The BioPAX community standard for pathway data sharing.

Authors:  Emek Demir; Michael P Cary; Suzanne Paley; Ken Fukuda; Christian Lemer; Imre Vastrik; Guanming Wu; Peter D'Eustachio; Carl Schaefer; Joanne Luciano; Frank Schacherer; Irma Martinez-Flores; Zhenjun Hu; Veronica Jimenez-Jacinto; Geeta Joshi-Tope; Kumaran Kandasamy; Alejandra C Lopez-Fuentes; Huaiyu Mi; Elgar Pichler; Igor Rodchenkov; Andrea Splendiani; Sasha Tkachev; Jeremy Zucker; Gopal Gopinath; Harsha Rajasimha; Ranjani Ramakrishnan; Imran Shah; Mustafa Syed; Nadia Anwar; Ozgün Babur; Michael Blinov; Erik Brauner; Dan Corwin; Sylva Donaldson; Frank Gibbons; Robert Goldberg; Peter Hornbeck; Augustin Luna; Peter Murray-Rust; Eric Neumann; Oliver Ruebenacker; Oliver Reubenacker; Matthias Samwald; Martijn van Iersel; Sarala Wimalaratne; Keith Allen; Burk Braun; Michelle Whirl-Carrillo; Kei-Hoi Cheung; Kam Dahlquist; Andrew Finney; Marc Gillespie; Elizabeth Glass; Li Gong; Robin Haw; Michael Honig; Olivier Hubaut; David Kane; Shiva Krupa; Martina Kutmon; Julie Leonard; Debbie Marks; David Merberg; Victoria Petri; Alex Pico; Dean Ravenscroft; Liya Ren; Nigam Shah; Margot Sunshine; Rebecca Tang; Ryan Whaley; Stan Letovksy; Kenneth H Buetow; Andrey Rzhetsky; Vincent Schachter; Bruno S Sobral; Ugur Dogrusoz; Shannon McWeeney; Mirit Aladjem; Ewan Birney; Julio Collado-Vides; Susumu Goto; Michael Hucka; Nicolas Le Novère; Natalia Maltsev; Akhilesh Pandey; Paul Thomas; Edgar Wingender; Peter D Karp; Chris Sander; Gary D Bader
Journal:  Nat Biotechnol       Date:  2010-09-09       Impact factor: 54.908

6.  MetaCrop 2.0: managing and exploring information about crop plant metabolism.

Authors:  Falk Schreiber; Christian Colmsee; Tobias Czauderna; Eva Grafahrend-Belau; Anja Hartmann; Astrid Junker; Björn H Junker; Matthias Klapperstück; Uwe Scholz; Stephan Weise
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

7.  SABIO-RK--database for biochemical reaction kinetics.

Authors:  Ulrike Wittig; Renate Kania; Martin Golebiewski; Maja Rey; Lei Shi; Lenneke Jong; Enkhjargal Algaa; Andreas Weidemann; Heidrun Sauer-Danzwith; Saqib Mir; Olga Krebs; Meik Bittkowski; Elina Wetsch; Isabel Rojas; Wolfgang Müller
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

8.  SBMLsqueezer 2: context-sensitive creation of kinetic equations in biochemical networks.

Authors:  Andreas Dräger; Daniel C Zielinski; Roland Keller; Matthias Rall; Johannes Eichner; Bernhard O Palsson; Andreas Zell
Journal:  BMC Syst Biol       Date:  2015-10-09

9.  MetaNetX/MNXref--reconciliation of metabolites and biochemical reactions to bring together genome-scale metabolic networks.

Authors:  Sébastien Moretti; Olivier Martin; T Van Du Tran; Alan Bridge; Anne Morgat; Marco Pagni
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

10.  The Reactome pathway Knowledgebase.

Authors:  Antonio Fabregat; Konstantinos Sidiropoulos; Phani Garapati; Marc Gillespie; Kerstin Hausmann; Robin Haw; Bijay Jassal; Steven Jupe; Florian Korninger; Sheldon McKay; Lisa Matthews; Bruce May; Marija Milacic; Karen Rothfels; Veronica Shamovsky; Marissa Webber; Joel Weiser; Mark Williams; Guanming Wu; Lincoln Stein; Henning Hermjakob; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2015-12-09       Impact factor: 16.971
