| Literature DB >> 31586394 |
Ron Caspi1, Richard Billington1, Ingrid M Keseler1, Anamika Kothari1, Markus Krummenacker1, Peter E Midford1, Wai Kit Ong1, Suzanne Paley1, Pallavi Subhraveti1, Peter D Karp1.
Abstract
MetaCyc (MetaCyc.org) is a comprehensive reference database of metabolic pathways and enzymes from all domains of life. It contains 2749 pathways derived from more than 60 000 publications, making it the largest curated collection of metabolic pathways. The data in MetaCyc are evidence-based and richly curated, resulting in an encyclopedic reference tool for metabolism. MetaCyc is also used as a knowledge base for generating thousands of organism-specific Pathway/Genome Databases (PGDBs), which are available in BioCyc.org and other genomic portals. This article provides an update on the developments in MetaCyc during September 2017 to August 2019, up to version 23.1. Some of the topics that received intensive curation during this period include cobamides biosynthesis, sterol metabolism, fatty acid biosynthesis, lipid metabolism, carotenoid metabolism, protein glycosylation, antibiotics and cytotoxins biosynthesis, siderophore biosynthesis, bioluminescence, vitamin K metabolism, brominated compound metabolism, plant secondary metabolism and human metabolism. Other additions include modifications to the GlycanBuilder software that enable displaying glycans using symbolic representation, improved graphics and fonts for web displays, improvements in the PathoLogic component of Pathway Tools, and the optional addition of regulatory information to pathway diagrams.Entities:
Year: 2020 PMID: 31586394 PMCID: PMC6943030 DOI: 10.1093/nar/gkz862
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.A typical MetaCyc pathway. A short pathway was selected for this figure; the average number of metabolites in a MetaCyc pathway is 13, with the largest pathway containing 244 metabolites. The green captions are links to the upstream and downstream pathways that produce the inputs and outputs for this pathway. Enzymes are labeled in ochre, genes are labeled in purple and compounds are labeled in red. When available, EC numbers are shown next to reactions. Each item in a pathway diagram is a clickable hyperlink to a dedicated page for that item. Below each pathway is a summary and additional information such as the pathway’s taxonomic range, some taxa known to possess it, the pathway’s position within the pathway ontology, and potential pathway variants (not shown).
List of species with 20 or more experimentally elucidated pathways represented in MetaCyc (meaning experimental evidence exists for the occurrence of these pathways in the organism)
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
|
| 358 |
| 345 |
| 31 |
|
| 80 |
| 318 |
| 25 |
|
| 64 |
| 204 |
| 24 |
|
| 51 |
| 88 |
| 20 |
|
| 49 |
| 67 | ||
|
| 48 |
| 65 | ||
|
| 36 |
| 58 | ||
|
| 33 |
| 56 | ||
|
| 27 |
| 52 | ||
|
| 26 |
| 51 | ||
|
| 25 |
| 47 | ||
|
| 23 |
| 47 | ||
|
| 21 |
| 30 | ||
|
| 21 |
| 29 | ||
|
| 28 | ||||
|
| 26 | ||||
|
| 24 | ||||
|
| 21 | ||||
|
| 20 | ||||
|
| 20 |
The species are grouped by taxonomic domain and are ordered within each domain based on the number of pathways (number following species name) to which the given species was assigned.
The distribution of pathways in MetaCyc based on the taxonomic classification of associated species
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
| Proteobacteria | 1231 | Viridiplantae | 1021 | Thermoprotei | 50 |
| Actinobacteria | 412 | Fungi | 459 | Methanomicrobia | 50 |
| Firmicutes | 393 | Metazoa | 424 | Methanococci | 41 |
| Cyanobacteria | 101 | Euglenozoa | 28 | Halobacteria | 40 |
| Bacteroidetes/Chlorobi | 92 | Alveolata | 23 | Thermococci | 40 |
| Deinococcus–Thermus | 31 | Stramenopiles | 14 | Methanobacteria | 37 |
| Thermotogae | 26 | Amoebozoa | 11 | Archaeoglobi | 16 |
| Tenericutes | 18 | Rhodophyta | 9 | Thermoplasmata | 9 |
| Aquificae | 18 | Haptophyceae | 6 | Methanopyri | 4 |
| Spirochaetes | 16 | Fornicata | 4 | ||
| Chlamydiae–Verrucomicrobia | 9 | Parabasalia | 4 | ||
| Chloroflexi | 8 | ||||
| Planctomycetes | 8 | ||||
| Fusobacteria | 7 | ||||
| Nitrospirae | 2 | ||||
| Thermodesulfobacteria | 2 | ||||
| Chrysiogenetes | 1 | ||||
| Nitrospinae | 1 |
For example, the statement ‘Tenericutes 18’ means that experimental evidence exists for the occurrence of at least 18 MetaCyc pathways in members of this taxonomic group. Major taxonomic groups are grouped by domain and are ordered within each domain based on the number of pathways (number following taxon name) associated with the taxon. A pathway may be associated with multiple organisms.
Figure 2.A section from a pathway that describes the biosynthesis of Lewis-type epitopes by the bacterium Helicobacter pylori. This pathway was drawn using a modified version of the GlycanBuilder software (18). The modifications, introduced by the SRI International, allow the software to display text strings or other, non-glycan, compounds as part of the structure (shown as R in this example). As in non-glycan type pathways, each item in the diagram is a clickable hyperlink to a dedicated page for that item.
Figure 3.A comparison of the reactions content of MetaCyc and the KEGG database. Data for KEGG are available only since 2008. Since that time the number of reactions in MetaCyc has been growing at about twice the rate of the comparable rate in KEGG. As of 20 August 2019 KEGG contains 11,289 biochemical reactions, as opposed to 16,031 reactions in MetaCyc.