| Literature DB >> 16381923 |
Ron Caspi1, Hartmut Foerster, Carol A Fulcher, Rebecca Hopkinson, John Ingraham, Pallavi Kaipa, Markus Krummenacker, Suzanne Paley, John Pick, Seung Y Rhee, Christophe Tissier, Peifen Zhang, Peter D Karp.
Abstract
MetaCyc is a database of metabolic pathways and enzymes located at http://MetaCyc.org/. Its goal is to serve as a metabolic encyclopedia, containing a collection of non-redundant pathways central to small molecule metabolism, which have been reported in the experimental literature. Most of the pathways in MetaCyc occur in microorganisms and plants, although animal pathways are also represented. MetaCyc contains metabolic pathways, enzymatic reactions, enzymes, chemical compounds, genes and review-level comments. Enzyme information includes substrate specificity, kinetic properties, activators, inhibitors, cofactor requirements and links to sequence and structure databases. Data are curated from the primary literature by curators with expertise in biochemistry and molecular biology. MetaCyc serves as a readily accessible comprehensive resource on microbial and plant pathways for genome analysis, basic research, education, metabolic engineering and systems biology. Querying, visualization and curation of the database is supported by SRI's Pathway Tools software. The PathoLogic component of Pathway Tools is used in conjunction with MetaCyc to predict the metabolic network of an organism from its annotated genome. SRI and the European Bioinformatics Institute employed this tool to create pathway/genome databases (PGDBs) for 165 organisms, available at the BioCyc.org website. These PGDBs also include predicted operons and pathway hole fillers.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16381923 PMCID: PMC1347490 DOI: 10.1093/nar/gkj128
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1A representative example of a pathway in MetaCyc. Pathways can be displayed at varying levels of detail. This pathway display depicts an intermediate level of detail including enzymes, EC numbers, genes and chemical structures of the main compounds. Notice the brown arrows that provide hyperlinks to related upstream and downstream pathways.
List of species that have five or more experimentally elucidated pathways represented in MetaCyc
| Bacteria | No. of pathways | Eukarya | No. of pathways | Archaea | No. of pathways |
|---|---|---|---|---|---|
| Escherichia coli | 179 | Arabidopsis thaliana | 116 | Sulfolobus solfataricus | 15 |
| Pseudomonas putida | 35 | Homo sapiens | 42 | Methanocaldococcus jannaschii | 5 |
| Bacillus subtilis | 24 | Glycine max | 36 | ||
| Pseudomonas aeruginosa | 19 | Saccharomyces cerevisiae | 31 | ||
| Mycobacterium tuberculosis | 17 | Pisum sativum | 20 | ||
| Salmonella typhimurium | 14 | Zea mays | 19 | ||
| Haemophilus influenzae | 13 | Solanum tuberosum | 12 | ||
| Deinococcus radiodurans | 12 | Cicer arietinum | 12 | ||
| Mycoplasma pneuomiae | 8 | Oryza sativa | 12 | ||
| Mycobacterium smegmatis | 8 | Spinacia oleraca | 11 | ||
| Sinorhizobium meliloti | 8 | Rattus norvegicus | 10 | ||
| Thauera aromatica | 7 | Hordeum vulgare | 8 | ||
| Klebsiella pneumoniae | 7 | Nicotiana tabacum | 8 | ||
| Lactococcus lactis | 6 | Lycopersicon esculentum | 7 | ||
| Corynebacterium glutamicum | 6 | Medicago sativa | 7 | ||
| Methylobacterium extorquens AM1 | 6 | Brassica napus | 6 | ||
| Pseudomonas fluorescens | 6 | Glycyrrhiza echinata | 6 | ||
| Thermotoga maritima | 6 | Triticum aestivum | 6 | ||
| Paracoccus denitrificans | 5 | Cucumis sativus | 5 | ||
| Arthrobacter globiformis | 5 | Ricinus communis | 5 | ||
| Mycoplasma capricolum | 5 | Pueraria montana | 5 | ||
| Bradyrhizobium japonicum | 5 | ||||
| Bacillus cereus | 5 |
The species are grouped by taxonomic domain and are ordered within each domain based on the number of pathways to which the given species was assigned. Some pathways may be labeled with a higher-level taxon, such as genus, if all the species within that genus are thought to have the given pathway. However, such higher-level taxa are not included in this table.
The distribution of pathways in MetaCyc based on the taxonomic classification of associated species
| Bacteria | No. of pathways | Eukarya | No. of pathways | Archaea | No. of pathways |
|---|---|---|---|---|---|
| Proteobacteria | 598 | Viridiplantae | 396 | Euryarchaeota | 47 |
| Firmicutes | 169 | Metazoa | 61 | Crenarchaeota | 26 |
| Actinobacteria | 94 | Fungi | 58 | ||
| Deinococcus-Thermus | 18 | Euglenozoa | 4 | ||
| Cyanobacteria | 13 | ||||
| Thermotogae | 8 | ||||
| Planctomycetes | 6 | ||||
| Bacteroidetes/Chlorobi | 6 | ||||
| Nitrospirae | 5 | ||||
| Aquificae | 2 | ||||
| Chloroflexi | 2 | ||||
| Spirochaetes | 1 | ||||
| Chlamydiae | 1 | ||||
| Chrysiogenetes | 1 |
The taxonomic groups (phyla for Bacteria and Archaea, kingdoms for Eukarya) are grouped by domain and are ordered within each domain based on the number of pathways associated with the taxon. Euglenozoa are listed separately as this group does not belong to any of the other eukaryotic kingdoms. A pathway may be associated with multiple organisms.
The size of MetaCyc as a function of time from its first release in 1999 to the latest release in 2005 (version 9.5)
| Database objects | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 |
|---|---|---|---|---|---|---|---|
| Metabolic pathways | 296 | 366 | 445 | 460 | 491 | 528 | 621 |
| Metabolic pathways with comments | 39 | 83 | 160 | 180 | 232 | 280 | 412 |
| Enzymatic reactions | 3779 | 4002 | 4218 | 4294 | 4817 | 4955 | 5428 |
| Enzymes | 82 | 344 | 1115 | 1267 | 1543 | 1940 | 2698 |
| Enzymes with comments | 75 | 234 | 1054 | 1123 | 1389 | 1716 | 2376 |
| Genes | 0 | 0 | 0 | 600 | 1554 | 1821 | 2662 |
| Compounds | 1949 | 2180 | 2335 | 2404 | 2951 | 3551 | 4620 |
| Literature citations | 184 | 604 | 2381 | 2718 | 3070 | 5050 | 7368 |
Each row depicts the number of different database objects in MetaCyc during the final release for that year.