| Literature DB >> 17965431 |
Ron Caspi1, Hartmut Foerster, Carol A Fulcher, Pallavi Kaipa, Markus Krummenacker, Mario Latendresse, Suzanne Paley, Seung Y Rhee, Alexander G Shearer, Christophe Tissier, Thomas C Walk, Peifen Zhang, Peter D Karp.
Abstract
MetaCyc (MetaCyc.org) is a universal database of metabolic pathways and enzymes from all domains of life. The pathways in MetaCyc are curated from the primary scientific literature, and are experimentally determined small-molecule metabolic pathways. Each reaction in a MetaCyc pathway is annotated with one or more well-characterized enzymes. Because MetaCyc contains only experimentally elucidated knowledge, it provides a uniquely high-quality resource for metabolic pathways and enzymes. BioCyc (BioCyc.org) is a collection of more than 350 organism-specific Pathway/Genome Databases (PGDBs). Each BioCyc PGDB contains the predicted metabolic network of one organism, including metabolic pathways, enzymes, metabolites and reactions predicted by the Pathway Tools software using MetaCyc as a reference database. BioCyc PGDBs also contain predicted operons and predicted pathway hole fillers-predictions of which enzymes may catalyze pathway reactions that have not been assigned to an enzyme. The BioCyc website offers many tools for computational analysis of PGDBs, including comparative analysis and analysis of omics data in a pathway context. The BioCyc PGDBs generated by SRI are offered for adoption by any interested party for the ongoing integration of metabolic and genome-related information about an organism.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17965431 PMCID: PMC2238876 DOI: 10.1093/nar/gkm900
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.An example of a pathway in MetaCyc. Pathways can be displayed at varying levels of detail. This pathway display depicts an intermediate level of detail including enzymes, EC numbers, genes and chemical structures of the main compounds. Notice the green arrow at the bottom of the pathway, which provides a hyperlink to a related downstream pathway.
The size of MetaCyc as a function of time from its first release in 1999 to the latest release in 2007 (version 11.5)
| Database objects | 1999 | 2000 | 2001 | 2002 | 2003 | 2004 | 2005 | 2006 | 2007 |
|---|---|---|---|---|---|---|---|---|---|
| Metabolic pathways | 296 | 366 | 445 | 460 | 491 | 528 | 692 | 879 | 977 |
| Reactions | 3779 | 4002 | 4218 | 4294 | 4817 | 4955 | 5520 | 6113 | 6483 |
| Enzymes | 82 | 344 | 1115 | 1267 | 1543 | 1940 | 3029 | 3841 | 4332 |
| Genes | 0 | 0 | 0 | 600 | 1554 | 1821 | 2931 | 3630 | 3913 |
| Compounds | 1949 | 2180 | 2335 | 2404 | 2951 | 3551 | 4817 | 5978 | 6375 |
| Literature citations | 184 | 604 | 2381 | 2718 | 3070 | 5050 | 8599 | 11 934 | 15 199 |
Each row depicts the number of different database objects in MetaCyc during the final release for that year.
List of species that have 10 or more experimentally elucidated pathways represented in MetaCyc
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
| 225 | 189 | 18 | |||
| 40 | 79 | 16 | |||
| 37 | 67 | 15 | |||
| 36 | 50 | 13 | |||
| 23 | 43 | 10 | |||
| 19 | 41 | ||||
| 14 | 27 | ||||
| 13 | 22 | ||||
| 13 | 22 | ||||
| 13 | 20 | ||||
| 12 | 19 | ||||
| 12 | 16 | ||||
| 12 | 16 | ||||
| 11 | 15 | ||||
| 11 | 15 | ||||
| 10 | 15 | ||||
| 13 | |||||
| 12 | |||||
| 11 | |||||
| 10 | |||||
| 10 |
The species are grouped by taxonomic domain and are ordered within each domain based on the number of pathways (number following species name) to which the given species was assigned. Some pathways may be labeled with a higher-level taxon, such as genus, if all the species within that genus are thought to have the given pathway. However, such higher-level taxa are not included in this table.
Distribution of pathways in MetaCyc based on the taxonomic classification of associated species
| Bacteria | Eukarya | Archaea | |||
|---|---|---|---|---|---|
| Proteobacteria | 534 | Viridiplantae | 401 | Euryarchaeota | 60 |
| Firmicutes | 140 | Metazoa | 104 | Crenarchaeota | 26 |
| Actinobacteria | 94 | Fungi | 115 | ||
| Cyanobacteria | 22 | Euglenozoa | 11 | ||
| Bacteroidetes/Chlorobi | 21 | ||||
| Deinococcus–Thermus | 11 | ||||
| Thermotogae | 10 | ||||
| Spirochaetes | 8 | ||||
| Aquificae | 6 | ||||
| Chlamydiae–Verrucomicrobia | 5 | ||||
| Fusobacteria | 4 | ||||
| Nitrospirae | 2 | ||||
| Planctomycetes | 2 | ||||
| Thermodesulfobacteria | 2 | ||||
| Chloroflexi | 1 | ||||
| Chrysiogenetes | 1 |
Taxonomic groups (phyla for Bacteria and Archaea, kingdoms for Eukarya) are grouped by domain and are ordered within each domain based on the number of pathways (number following taxon name) associated with the taxon. Euglenozoa are listed separately as this group does not belong to any of the other eukaryotic kingdoms. A pathway may be associated with multiple organisms.
Figure 2.The regulatory overview. The diagram is composed of three nested ellipses. The innermost ellipse comprises genes that are regulating other genes, but that are not regulated by any genes. The middle ellipse comprises genes that are both regulators and regulatees (= regulated by some entity), and the outermost ellipse comprises genes that are regulated, but do not regulate. The triangles that extend outward from the outer ellipse are collections of many genes that share the same set of regulator genes-although genes within a triangle may respond to those regulators in different ways. They are drawn within triangles simply to keep the size of the outer ellipse manageable.
Figure 3.The genome overview. This tool allows the user to view the full chromosome of an organism on a single-page diagram, and to paint expression or other omics data onto it.