| Literature DB >> 23066841 |
Yves Sucaet1, Yi Wang, Jie Li, Eve Syrkin Wurtele.
Abstract
BACKGROUND: Plants are important as foods, pharmaceuticals, biorenewable chemicals, fuel resources, bioremediation tools and general tools for recombinant technology. The study of plant biological pathways is advanced by easy access to integrated data sources. Today, various plant data sources are scattered throughout the web, making it increasingly complicated to build comprehensive datasets.Entities:
Mesh:
Year: 2012 PMID: 23066841 PMCID: PMC3483157 DOI: 10.1186/1471-2105-13-267
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Data sources of MetNet
| AraCyc | Plain text files organized according to frame data model | [ | Pathways, interactions and biomolecules participated in. Name, synonyms, references, comments. Majority metabolic pathways in MetNetDB come from AraCyc |
| AGRIS | Plain text files organized according to simple graph model | [ | Transcription network, references and binding sites of individual transcriptional factors |
| GO | MySQL dump files organized according to acyclic directed graph data model | [ | The whole copy of gene ontology database |
| TAIR | Plain text files (Tabular data) | [ | Affymetrix array elements and their corresponding LocusID mapping, Unitprot ID, TargetP location of polypeptides, loci of each AraCyc pathway |
| MapMan | Excel files (Tabular data) | [ | Gene annotation, MapMan BIN ID, gene function category |
| BioCyc open chemical compound database | Plain text files organized according to frame data model | [ | UNQUE-ID, synonyms |
| ChEBI | MySQL dump organized according to directed graph data model | [ | ChEBI ID, formula, molecular weight, IUPAC, SMILES |
| PubChem | XML files organized according to object data model | [ | PubChem CID, synonyms |
| NCI | Structure data format according to object data model | [ | Synonyms, CAS registry number |
| KEGG | Plain text files (for compounds) organized according to object data model | [ | Synonyms |
| SUBA | Excel file | [ | Protein subcellular location including experiment verified and software predicted |
| PPDB | Tabular data | [ | Curated protein subcellular location, especially those in plastid |
| AMPDB | Tabular data | [ | Mitochondrion proteins, the subcellular location comes from computational prediction |
| AtNoPDB | Tabular data | [ | Nucleolar proteins, subcellular location comes from prediction and experiments |
| AraPerox | Plain text | [ | Putative proteins in peroxisomes. Subcellular location comes from literature and computational prediction |
| plprot | Plain text files organized according to object data model | [ | Subcellular location comes from TargetP prediction |
| BRENDA | Plain text files organized according to object data model | [ | Enzyme’s interaction, substrate, product, activator, inhibitor, synonyms, metal ions, references |
| MetNet curator | Manually curation | | All, with focus on signal transduction information |
| AtPID | Excel spreadsheet | [ | Protein-Protein interaction data |
| EcoCyc | Plain text files organized according to frame data model | [ | Pathways, interactions and biomolecules participated in. Name, synonyms, references, comments. |
| VitisNet | SBML files made with CellDesigner | [ | Manually constructed pathways based on draft genome sequence |
Comparing MetNet Online with other resources
| MetNet | 966 | ||
| KEGG
[ | 422 | Several phylums (>1700) | |
| PlantCyc
[ | 898 | Phylums Clorophyta and Streptophyta (>360) | |
| WikiPathways
[ | 247 | Some phylums (22) | |
| Gramene
[ | 5960 | Phylums Clorophyta / Streptophyta (>360); and |
Figure 1The order of the data integration for MetNetDB, with respect to Arabidopsis: AraCyc is the backbone of the whole database, then AGRIS is used to extend the networks. Other databases are integrated without any special order requirement. External data usually does not need any special transformation because the data can be easily added into to the database as a new field/label of a gene, a protein or a molecule. So the data integration from the external data source to MetNetDB is straightforward in most cases.
Figure 2The MetNet Online portal start page. In order to rapidly familiarize novice users, a “pathway of the day” display and a “gene of day” display encourage self-guided exploration.
Figure 3Different browsing options. Pathways can be browsed by the subcellular location where (part of) the pathway occurs. Hovering over a pathway in the right-side panel bring up a thumbnail of the pathway. The pathway can then be browsed in textual mode (enumerated list of interactions and entities that make up the pathway) and visual mode.
Figure 4A custom ethylene-related network. The network was generated dynamically by selecting all pathways in which ethylene (ethane) was found.