| Literature DB >> 24225315 |
Ron Caspi1, Tomer Altman, Richard Billington, Kate Dreher, Hartmut Foerster, Carol A Fulcher, Timothy A Holland, Ingrid M Keseler, Anamika Kothari, Aya Kubo, Markus Krummenacker, Mario Latendresse, Lukas A Mueller, Quang Ong, Suzanne Paley, Pallavi Subhraveti, Daniel S Weaver, Deepika Weerasinghe, Peifen Zhang, Peter D Karp.
Abstract
The MetaCyc database (MetaCyc.org) is a comprehensive and freely accessible database describing metabolic pathways and enzymes from all domains of life. MetaCyc pathways are experimentally determined, mostly small-molecule metabolic pathways and are curated from the primary scientific literature. MetaCyc contains >2100 pathways derived from >37,000 publications, and is the largest curated collection of metabolic pathways currently available. BioCyc (BioCyc.org) is a collection of >3000 organism-specific Pathway/Genome Databases (PGDBs), each containing the full genome and predicted metabolic network of one organism, including metabolites, enzymes, reactions, metabolic pathways, predicted operons, transport systems and pathway-hole fillers. Additions to BioCyc over the past 2 years include YeastCyc, a PGDB for Saccharomyces cerevisiae, and 891 new genomes from the Human Microbiome Project. The BioCyc Web site offers a variety of tools for querying and analysis of PGDBs, including Omics Viewers and tools for comparative analysis. New developments include atom mappings in reactions, a new representation of glycan degradation pathways, improved compound structure display, better coverage of enzyme kinetic data, enhancements of the Web Groups functionality, improvements to the Omics viewers, a new representation of the Enzyme Commission system and, for the desktop version of the software, the ability to save display states.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24225315 PMCID: PMC3964957 DOI: 10.1093/nar/gkt1103
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of species with >20 experimentally elucidated pathways represented in MetaCyc (meaning that there is experimental evidence for the occurrence of these pathways in the organism)
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
| 312 | 328 | 25 | |||
| 70 | 229 | 21 | |||
| 60 | 172 | 21 | |||
| 50 | 81 | ||||
| 41 | 62 | ||||
| 31 | 55 | ||||
| 31 | 55 | ||||
| 26 | 51 | ||||
| 25 | 46 | ||||
| 23 | 45 | ||||
| 43 | |||||
| 41 | |||||
| 27 | |||||
| 27 | |||||
| 26 | |||||
| 25 | |||||
| 21 | |||||
| 21 |
The species are grouped by taxonomic domain and are ordered within each domain based on the number of pathways (number following species name) to which the given species was assigned.
The distribution of pathways in MetaCyc based on the taxonomic classification of associated species
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
| Proteobacteria | 1042 | Viridiplantae | 916 | Euryarchaeota | 137 |
| Firmicutes | 319 | Fungi | 343 | Crenarchaeota | 42 |
| Actinobacteria | 280 | Metazoa | 312 | Thaumarchaeota | 1 |
| Bacteroidetes/Chlorobi | 62 | Euglenozoa | 28 | ||
| Cyanobacteria | 65 | Alveolata | 16 | ||
| Deinococcus-Thermus | 28 | Amoebozoa | 10 | ||
| Tenericutes | 18 | Stramenopiles | 7 | ||
| Thermotogae | 25 | Fornicata | 4 | ||
| Aquificae | 16 | Rhodophyta | 4 | ||
| Spirochaetes | 12 | Haptophyceae | 4 | ||
| Chlamydiae -Verrucomicrobia | 7 | Parabasalia | 3 | ||
| Planctomycetes | 6 | ||||
| Chloroflexi | 5 | ||||
| Fusobacteria | 4 | ||||
| Nitrospirae | 2 | ||||
| Thermodesulfobacteria | 2 | ||||
| Chrysiogenetes | 1 | ||||
For example, the statement ‘Tenericutes 18’ means that there is experimental evidence for at least 18 MetaCyc pathways for their occurrence in members of this taxonomic group. Major Taxonomic groups are grouped by domain and are ordered within each domain based on the number of pathways (number following taxon name) associated with the taxon. A pathway may be associated with multiple organisms.
Figure 1.The new glycan degradation pathways use symbolic representation to illustrate the structures of complex glycan molecules. Colored arrows show the sites that are cleaved by enzymes and provide hyperlinks to those enzymes. The final products produced by the combined degradation of the polymer by all enzymes are listed on the right side of the diagram.
Figure 2.This figure illustrates some of the different types of enzymatic kinetic data that can be captured and presented by Pathway Tools. The software lets the curator enter the data using the units reported in a paper, and converts them automatically to the standard units. When possible, the catalytic efficiency is computed automatically and included in the table. Temperature and pH optima can be captured differently for the two directions of a reversible reaction.
Figure 3.EC numbers are now database objects that have their own pages. An EC-Number page includes all of the information defined by the EC, and additional information that includes a list of unofficial reactions (see text for details) and a list of enzymes determined by the software to fit the definition of the EC number.
During the past 2 years we added new links from MetaCyc to the following external databases
| Database name | Description | URL |
|---|---|---|
| Direct links | ||
| dictyBase | A | dictybase.org |
| DIP | A database of interacting proteins | dip.doe-mbi.ucla.edu |
| DisProt | A database of protein disorder | disprot.org |
| EuPathDB | A eukaryotic pathogen database | eupathdb.org |
| Expression atlas | A database of analyzed ArrayExpress Archive results | |
| FlyBase | A | flybase.org |
| MINT | A molecular interaction database | mint.bio.uniroma2.it |
| PDB | A database of 3D structures of large biological molecules | rcsb.org/pdb |
| PDBsum | A pictorial database of PDB structures | |
| PhosphoSitePlus | A database for protein post-translational modifications | Phosphosite.org |
| PRIDE | A proteomics identifications database | ebi.ac.uk/pride |
| Protein model portal | A database of protein models computed by comparative modeling methods | proteinmodelportal.org |
| Rhea | A manually annotated database of chemical reactions | ebi.ac.uk/rhea |
| STRING | A database of known and predicted protein-protein interactions | string-db.org |
| Swiss-model repository | A database of annotated three-dimensional comparative protein structure models generated by Swiss-Model | swissmodel.expasy.org/repository |
| ‘In-family’ type links | ||
| CAZy | A carbohydrate-active enzymes database | cazy.org |
| InterPro | A protein sequence functional analysis database | ebi.ac.uk/interpro |
| PANTHER | A database for protein analysis through evolutionary relationships | pantherdb.org |
| Pfam | A protein families database | pfam.sanger.ac.uk |
| PRINTS-S | A database of protein family fingerprints | bioinf.manchester.ac.uk/dbbrowser/sprint |
| ProDom | A database of protein domain families | prodom.prabi.fr |
| PROSITE | A database of protein domains, families and functional sites | prosite.expasy.org |
| SMART | A simple modular architecture research tool | smart.embl.de |
Figure 4.The new right-sidebar on BioCyc web pages contains operations that are specific to the currently displayed page. The operations and links available on the sidebar change depending on the type of object that is currently displayed. In this example, the operations and links are relevant to an Escherichia coli gene/protein page. Operations and links that are not specific to a particular object type are available from the menu bar at the top of the page and do not change.
Figure 5.The Cellular Omics Viewer allows the user to paint omics data over the cellular Overview. New functionality enables the display of per-node omics data in a pop-up window as a column chart, an x-y plot or a heat map. The pop-ups can also be generated to show all the data for a given pathway. This figure also shows the pop-up that appears on right-clicking a reaction.
Figure 6.Pathway diagram customization is available via the web interface, and lets the user control many aspects of the pathway diagram. A new option allows painting user-supplied Omics data directly to the pathway. The modified diagram can be exported to a pdf or postscript format file for incorporation in presentations or manuscripts.