Literature DB >> 22086948

MetaCrop 2.0: managing and exploring information about crop plant metabolism.

Falk Schreiber1, Christian Colmsee, Tobias Czauderna, Eva Grafahrend-Belau, Anja Hartmann, Astrid Junker, Björn H Junker, Matthias Klapperstück, Uwe Scholz, Stephan Weise.   

Abstract

MetaCrop is a manually curated repository of high-quality data about plant metabolism, providing different levels of detail from overview maps of primary metabolism to kinetic data of enzymes. It contains information about seven major crop plants with high agronomical importance and two model plants. MetaCrop is intended to support research aimed at the improvement of crops for both nutrition and industrial use. It can be accessed via web, web services and an add-on to the Vanted software. Here, we present several novel developments of the MetaCrop system and the extended database content. MetaCrop is now available in version 2.0 at http://metacrop.ipk-gatersleben.de.

Entities:  

Mesh:

Year:  2011        PMID: 22086948      PMCID: PMC3245004          DOI: 10.1093/nar/gkr1004

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The importance of crop plants goes far beyond their use for nutrition. Plants are also used for renewable resources or in the chemical industry, and thus need to be improved steadily. For a continuous improvement of crop plants, detailed understanding of their metabolism is essential. MetaCrop is a resource to manage and explore manually curated high-quality data about crop plant metabolism. It contains information at different levels of detail from overview maps to pathways, to reactions, to reaction details and contains additional related data such as literature references. MetaCrop allows researchers (i) to explore metabolic information by browsing through various levels of abstraction, (ii) to integrate experimental data into metabolic pathways and (iii) to create metabolic models for simulation purposes. The initial system has been presented in Ref. (1), and its technical basis in Ref. (2). MetaCrop has been continuously developed in both technical aspects as well as database content over the last few years. In the following, we want to present the major improvements, which comprise a substantial extension of the content of the information system, the usage of the novel SBGN standard (3) and new ways of importing data as well as accessing the system. Figure 1 illustrates the architectural overview of the MetaCrop system including novel developments.
Figure 1.

Overview of MetaCrop, data sources, curation steps and applications.

Overview of MetaCrop, data sources, curation steps and applications.

DATABASE DESCRIPTION

Content

The data collection of MetaCrop is based on extensive manual curation. Currently, the system contains information about seven agronomically important crop plants as well as two model plants comprising both monocotyledon and dicotyledon species. MetaCrop manages data about biochemical reactions and translocation processes, catalyzing enzymes, metabolites, macromolecules, stoichiometry, detailed locations (up to compartment level) and references. Parameters comprise, for example, names, synonym names, gene identifiers, EC and CAS numbers, chemical formulas, Gene ID, kinetic parameters and PubMed IDs. Since the previous version was presented in Ref. (1), the database content has been almost doubled now containing information about 62 pathways, 566 reactions, 63 translocation processes and 21 compartments from >1800 scientific publications (Table 1, as of October 2011). Although MetaCrop focusses on the crop plants Hordeum vulgare (barley), Triticum aestivum (wheat), Oryza sativa (rice), Zea mays (maize), Solanum tuberosum (potato), Brassica napus (canola) and Beta vulgaris (sugar beet), and the model plants Arabidopsis thaliana and Medicago truncatula, additional data for other plants (crops and non-crops) is continuously added to the database.
Table 1.

Content of the MetaCrop database

OrganismPathwaysReactionsTranslocationsCompartmentsReferences
Hordeum vulgare54362449454
Triticum aestivum5128567407
Oryza sativa5231398448
Zea mays573302710936
Solanum tuberosum57235145373
Brassica napus4517175247
Beta vulgaris49235-6420
Arabidopsis thalianaa5940519131351
Medicago truncatulaa49247-4386
Totalb6256663211846

aModel plants in life sciences research.

bDatabase object such as pathways, reactions, translocations, etc. are only listed once although they can occur in different organisms.

Content of the MetaCrop database aModel plants in life sciences research. bDatabase object such as pathways, reactions, translocations, etc. are only listed once although they can occur in different organisms. In addition to the extension of the data content, the database schema has been improved, in comparison to the initial MetaCrop version, in order to manage additional high-quality data. On the one hand, this comprises structures for the handling of gene identifiers, which are indispensable for data mapping and the discrimination of enzyme isoforms with different subcellular localization. On the other hand, structures for the storage of more detailed descriptions of different types of translocation processes were developed, which are important with regard to modeling and simulation of metabolic networks.

Web interface

As a point of entry to the MetaCrop database a web interface based on the Oracle Application Express technology was developed. It is intended to enable users to browse through different levels of granularity. Besides classical report tables showing, for example, detailed locations (up to compartment level) or kinetic parameters, the web interface provides clickable pathway maps with the pathways represented in the novel SBGN notation. Furthermore, an SBML exporter for the composition of individual metabolic models for analysis and simulation is available.

SBGN maps and SBGN-ML

SBGN, the Systems Biology Graphical Notation (3), has been developed as a standard for the visual representation of biochemical and cellular processes and networks. SBGN comprises three different views onto the biological system: process description (PD), entity relationship (ER) and activity flow (AF). This graphical representation helps to communicate biological knowledge in an unambiguous and easy way. For the visualization of crop plant metabolic pathways, MetaCrop uses maps with the SBGN PD notation. Furthermore, to support the exchange of such pathway maps, they can be downloaded as SBGN Markup Language (SBGN-ML) files. Figure 2 shows an example SBGN map of a metabolic pathway as well as a corresponding report of details about one biochemical reaction of the pathway.
Figure 2.

Example from the MetaCrop web interface showing (a) an SBGN map of the TCA cycle and (b) details of a reaction chosen from this map, which could be obtained by clicking at the respective map element.

Example from the MetaCrop web interface showing (a) an SBGN map of the TCA cycle and (b) details of a reaction chosen from this map, which could be obtained by clicking at the respective map element.

SBML exporter

In order to analyze metabolic data with stoichiometric or kinetic methods (in silico experiments), it is often necessary to construct user-specific metabolic models. For this reason, MetaCrop provides an export facility enabling the user to create models in the standardized SBML (4) format. While browsing the web interface, the user can put single elements such as reactions or substances, or even whole pathways into a kind of a shopping cart. Thereafter, the individual model can be composed, including the selection of parameter values (compartment, species, kinetic values, etc.), and finally exported as a SBML file.

Web-services

In addition to the SBML-based data exchange, SOAP-based web services were developed for interacting with external software tools, e.g. with the network visualization system Vanted (5). Web services were developed providing several methods for each of the five categories Pathway, Conversion (reaction or translocation), Substance, Publication and Taxonomy (6). The web services allow secure data transport (https) as well as filtering of data. Figure 3 illustrates the MetaCrop web service architecture.
Figure 3.

MetaCrop web services architecture.

MetaCrop web services architecture.

Vanted add-on

MetaCrop can easily be integrated as a data source into analysis tools. This is demonstrated by integrating MetaCrop into Vanted. An add-on for the network visualization system Vanted has been developed, which uses the web services described above. This add-on extends the search and filter capabilities of the web interface. Besides browsing of the database content, it also allows access to the graphical representations (SBGN maps) of the pathways and filtering of pathways for a species of interest. Figure 4 illustrates the user interface of the Vanted add-on.
Figure 4.

The Vanted add-on for MetaCrop which allows access to the database content using the MetaCrop web-services.

The Vanted add-on for MetaCrop which allows access to the database content using the MetaCrop web-services.

CURATION PROCESS AND CONTINUATION

MetaCrop data acquisition is performed by domain experts and is mainly based on research papers. Each record stored in the system is enriched manually by bibliography information. The main focus during the curation process is the extraction of data from scientific primary literature. In parts, meta data is extracted manually from existing databases such as BRENDA (7), ChEBI (8) and KEGG (9). The latter data is stored in MetaCrop only after extensive checks against literature. Controlled vocabulary is used to ensure high quality and to provide comparability of data, for example, by using ontology terms from Gene Ontology (10) and Plant Ontology (11). For curators there are three possibilities for storing data in MetaCrop. First, data can be entered directly into the database using a simple curation web interface. Second, pathway data already available as a SBML file can be imported using a Java-based SBML importer. The third way includes the employment of a set of user-friendly MS-Excel templates, which can be imported by a Java application into the database. MetaCrop is used in several projects and will be extended continuously in the future.

APPLICATION

The MetaCrop database is applicable to a broad variety of scientific questions. Exemplarily, three applications shall be mentioned here. (i) the navigation and exploration of plant metabolic pathways on different levels of detail to obtain overview and detailed knowledge concerning metabolism in plants; (ii) the analysis of–omics data to help in analyzing and understanding experimental metabolism-related–omics data such as metabolomics, transcriptomics, fluxomics and enzyme activity; and (iii) the modeling and simulation of crop plant metabolism to investigate the dynamics of the underlying biological system. The possibility to explore plant metabolism is, for example, important in teaching. MetaCrop already supports this through its web interface, which allows a search for information about metabolites, enzymes, pathways, etc., and a click through pathway maps from overview pathways to detailed information. The Vanted add-on provides additional exploration possibilities such as the derivation of species-specific pathways. To further improve the way pathways can be explored, MetaCrop can be used in other applications using the web services. One example is the method and tool presented in Ref. (12), which introduces a new visualization approach to visualize interconnected pathways. Large amounts of experimental data about metabolomes, proteomes, transcriptomes, etc. are nowadays available. MetaCrop pathways can be used to provide a context for such data and to support analyzes and understanding by mapping the data onto appropriate pathways. Figure 5 illustrates this in an example derived with the Vanted system.
Figure 5.

Metabolite concentrations and enzyme activities that were measured in several accessions of Arabidopsis thaliana (13) were mapped on a MetaCrop biochemical pathway (TCA cycle).

Metabolite concentrations and enzyme activities that were measured in several accessions of Arabidopsis thaliana (13) were mapped on a MetaCrop biochemical pathway (TCA cycle). Another application example comprises the modeling and simulation of crop plant metabolism. Models can be built in MetaCrop and exported as SBML files. This works for stoichiometric models, which can be analyzed using constraint-based methods with tools such as FBASimVis (14) and, to some extent, for kinetic models, which can be analyzed using ODE-based methods with tools such as Copasi (15). It should be noted that the necessary kinetic values are only available for a part of the MetaCrop content as not all reactions have these parameters available in the literature. An example has been presented in Ref. (16), where a metabolic model of the primary metabolism in barley endosperm with 257 biochemical and transport reactions across four different compartments based on information in MetaCrop has been investigated using flux balance analysis.

DISCUSSION

Metabolic pathway databases contain knowledge of biochemical processes involved in the metabolism. There are a number of well-known databases for general and/or plant metabolic networks such as KEGG (9), EGENE (17), MetaCyc (18), PlantCyc (19), Arabidopsis Reactome (20) and Panther Pathways (21); for a complete list of available databases see Ref. (22). The advantage of MetaCrop is 2-fold: none of these databases covers such diverse levels of detail from overview maps to enzyme kinetics, and only some of them guarantee such high quality by manual curation and literature referencing of every database entry. MetaCrop also has its special niche by focusing on crop plants with high agronomical value.

CONCLUSION

MetaCrop is a high-quality database of metabolism in crop plants. It can be accessed in several ways and used in different application scenarios. MetaCrop will be further extended in the future.

FUNDING

German Federal Ministry of Education and Research (in part). Funding for open access charge: Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben. Conflict of interest statement. None declared.
  20 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  COPASI--a COmplex PAthway SImulator.

Authors:  Stefan Hoops; Sven Sahle; Ralph Gauges; Christine Lee; Jürgen Pahle; Natalia Simus; Mudita Singhal; Liang Xu; Pedro Mendes; Ursula Kummer
Journal:  Bioinformatics       Date:  2006-10-10       Impact factor: 6.937

3.  EGENES: transcriptome-based plant database of genes with metabolic pathway information and expressed sequence tag indices in KEGG.

Authors:  Ali Masoudi-Nejad; Susumu Goto; Ruy Jauregui; Masumi Ito; Shuichi Kawashima; Yuki Moriya; Takashi R Endo; Minoru Kanehisa
Journal:  Plant Physiol       Date:  2007-04-27       Impact factor: 8.340

4.  Arabidopsis reactome: a foundation knowledgebase for plant systems biology.

Authors:  Nicolas Tsesmetzis; Matthew Couchman; Janet Higgins; Alison Smith; John H Doonan; Georg J Seifert; Esther E Schmidt; Imre Vastrik; Ewan Birney; Guanming Wu; Peter D'Eustachio; Lincoln D Stein; Richard J Morris; Michael W Bevan; Sean V Walsh
Journal:  Plant Cell       Date:  2008-06-30       Impact factor: 11.277

5.  BRENDA, the enzyme information system in 2011.

Authors:  Maurice Scheer; Andreas Grote; Antje Chang; Ida Schomburg; Cornelia Munaretto; Michael Rother; Carola Söhngen; Michael Stelzer; Juliane Thiele; Dietmar Schomburg
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

6.  Pathguide: a pathway resource list.

Authors:  Gary D Bader; Michael P Cary; Chris Sander
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  Meta-All: a system for managing metabolic pathway information.

Authors:  Stephan Weise; Ivo Grosse; Christian Klukas; Dirk Koschützki; Uwe Scholz; Falk Schreiber; Björn H Junker
Journal:  BMC Bioinformatics       Date:  2006-10-23       Impact factor: 3.169

8.  The Plant Ontology Database: a community resource for plant structure and developmental stages controlled vocabulary and annotations.

Authors:  Shulamit Avraham; Chih-Wei Tung; Katica Ilic; Pankaj Jaiswal; Elizabeth A Kellogg; Susan McCouch; Anuradha Pujar; Leonore Reiser; Seung Y Rhee; Martin M Sachs; Mary Schaeffer; Lincoln Stein; Peter Stevens; Leszek Vincent; Felipe Zapata; Doreen Ware
Journal:  Nucleic Acids Res       Date:  2008-01       Impact factor: 16.971

9.  VANTED: a system for advanced data analysis and visualization in the context of biological networks.

Authors:  Björn H Junker; Christian Klukas; Falk Schreiber
Journal:  BMC Bioinformatics       Date:  2006-03-06       Impact factor: 3.169

10.  MetaCrop: a detailed database of crop plant metabolism.

Authors:  Eva Grafahrend-Belau; Stephan Weise; Dirk Koschützki; Uwe Scholz; Björn H Junker; Falk Schreiber
Journal:  Nucleic Acids Res       Date:  2007-10-11       Impact factor: 16.971

View more
  19 in total

1.  Creating interactive, web-based and data-enriched maps with the Systems Biology Graphical Notation.

Authors:  Astrid Junker; Hendrik Rohn; Tobias Czauderna; Christian Klukas; Anja Hartmann; Falk Schreiber
Journal:  Nat Protoc       Date:  2012-03-01       Impact factor: 13.491

Review 2.  Plant metabolic modeling: achieving new insight into metabolism and metabolic engineering.

Authors:  Kambiz Baghalian; Mohammad-Reza Hajirezaei; Falk Schreiber
Journal:  Plant Cell       Date:  2014-10-24       Impact factor: 11.277

Review 3.  Nutritional metabolomics: progress in addressing complexity in diet and health.

Authors:  Dean P Jones; Youngja Park; Thomas R Ziegler
Journal:  Annu Rev Nutr       Date:  2012-04-23       Impact factor: 11.848

4.  Assessing the metabolic impact of nitrogen availability using a compartmentalized maize leaf genome-scale model.

Authors:  Margaret Simons; Rajib Saha; Nardjis Amiour; Akhil Kumar; Lenaïg Guillard; Gilles Clément; Martine Miquel; Zhenni Li; Gregory Mouille; Peter J Lea; Bertrand Hirel; Costas D Maranas
Journal:  Plant Physiol       Date:  2014-09-23       Impact factor: 8.340

5.  Visual analysis of transcriptome data in the context of anatomical structures and biological networks.

Authors:  Astrid Junker; Hendrik Rohn; Falk Schreiber
Journal:  Front Plant Sci       Date:  2012-11-15       Impact factor: 5.753

6.  OPTIMAS-DW: a comprehensive transcriptomics, metabolomics, ionomics, proteomics and phenomics data resource for maize.

Authors:  Christian Colmsee; Martin Mascher; Tobias Czauderna; Anja Hartmann; Urte Schlüter; Nina Zellerhoff; Jessica Schmitz; Andrea Bräutigam; Thea R Pick; Philipp Alter; Manfred Gahrtz; Sandra Witt; Alisdair R Fernie; Frederik Börnke; Holger Fahnenstich; Marcel Bucher; Thomas Dresselhaus; Andreas Pm Weber; Falk Schreiber; Uwe Scholz; Uwe Sonnewald
Journal:  BMC Plant Biol       Date:  2012-12-29       Impact factor: 4.215

7.  VANTED v2: a framework for systems biology applications.

Authors:  Hendrik Rohn; Astrid Junker; Anja Hartmann; Eva Grafahrend-Belau; Hendrik Treutler; Matthias Klapperstück; Tobias Czauderna; Christian Klukas; Falk Schreiber
Journal:  BMC Syst Biol       Date:  2012-11-10

8.  Reverse engineering: a key component of systems biology to unravel global abiotic stress cross-talk.

Authors:  Swetlana Friedel; Björn Usadel; Nicolaus von Wirén; Nese Sreenivasulu
Journal:  Front Plant Sci       Date:  2012-12-31       Impact factor: 5.753

9.  Conversion of KEGG metabolic pathways to SBGN maps including automatic layout.

Authors:  Tobias Czauderna; Michael Wybrow; Kim Marriott; Falk Schreiber
Journal:  BMC Bioinformatics       Date:  2013-08-16       Impact factor: 3.169

10.  Recent progress in the development of metabolome databases for plant systems biology.

Authors:  Atsushi Fukushima; Miyako Kusano
Journal:  Front Plant Sci       Date:  2013-04-04       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.