| Literature DB >> 19284618 |
Seongwon Seo1, Harris A Lewin.
Abstract
BACKGROUND: Metabolic reconstruction of microbial, plant and animal genomes is a necessary step toward understanding the evolutionary origins of metabolism and species-specific adaptive traits. The aims of this study were to reconstruct conserved metabolic pathways in the cattle genome and to identify metabolic pathways with missing genes and proteins. The MetaCyc database and PathwayTools software suite were chosen for this work because they are widely used and easy to implement.Entities:
Mesh:
Year: 2009 PMID: 19284618 PMCID: PMC2669051 DOI: 10.1186/1752-0509-3-33
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
The number of consensus cattle gene pairs in the NCBI and Ensembl cattle genome databases
| Type of match | Number of pairs† | Unique matches‡ |
| Same gene coordinates | 2,109 | 2,109 |
| Gene symbol | 6,479 | 5,187 |
| Entrez gene ID | 16,163 | 8,800 |
| Gene product name | 7,026 | 71 |
| E.C. number | 1,360 | 6 |
| Manual* | N.A. | 27 |
| Total | 33,137 | 16,200 |
| Not matched | 2,276 | 2,276 |
†The total number of matches using each criterion; one-to-one match was not assumed.
‡Number in each row is sequential for the match criteria given under "number of pairs", in addition to the matches for the criteria immediately above; only one-to-one matches were considered.
*Manually matched because the consensus genes were classified into different 'gene type' (e.g. protein coding, pseudogene, tRNA and miscellaneous RNA) in the NCBI and Ensembl gene models.
Distribution of genes in the amalgamated cattle genome database according to the original data sources
| Source | Number of genes |
| Consensus build 3.1 (this study) | 16,200 |
| NCBI build 3.1 only | 12,287 |
| Ensembl build 3.1 only | 8,932 |
| NCBI other genome scaffolds* | 245 |
| Total | 37,664 |
*Scaffolds not incorporated into NCBI genome build 3.1, including some scaffolds from the reference assembly build 2.1.
Figure 1The metabolism-centered approach used for the identification of metabolic pathways and "missing" enzymes. Solid and dashed arrows represent data and information flows, respectively.
Comparison of selected organism-specific pathway genome databases (PGDB)
| Database statistics | Cattle | New PGDB without manual curation | BioCyc | ||||
| Initial | Curated | Human | Mouse | Dog | E. coli | Human | |
| Metabolic pathways | 243 | 217 | 342 | 324 | 151 | 194 | 178 |
| Enzymatic reactions | 1,500 | 1,419 | 2,020 | 1,941 | 1,102 | 1,245 | 1,253 |
| Enzymes | 1,172 | 1,544 | 2,846 | 2,789 | 753 | 1,323 | 2,594 |
| Compounds | 1,116 | 1,021 | 1,390 | 1,325 | 794 | 1,202 | 975 |
| Pathway holes (missing enzymes) | |||||||
| number of pathway holes | 593 | 113 | 587 | 536 | 509 | 35 | 246 |
| pathway holes as a percentage of total reactions in pathways | 56% | 14% | 45% | 43% | 67% | 5% | 36% |
| pathways with no holes | 59 | 165 | 134 | 128 | 17 | 169 | 67 |
| pathways with 1 hole | 51 | 28 | 64 | 65 | 30 | 15 | 35 |
| pathways with 2 holes | 32 | 11 | 40 | 41 | 22 | 7 | 37 |
| pathways with 3 holes | 21 | 1 | 36 | 25 | 15 | 3 | 15 |
| pathways with 4 holes | 14 | 6 | 18 | 18 | 15 | 0 | 11 |
| pathways with >5 holes | 66 | 6 | 50 | 47 | 52 | 0 | 13 |
| total pathways with holes | 184 | 52 | 208 | 196 | 134 | 25 | 111 |
Figure 2Comparison of constructed metabolic pathways by automation with those after manual curation. Numbers of metabolic pathways, shared by the cattle pathway genome database (PGDB) generated computationally and after manual curation. The numbers in parentheses are the total number of pathways in the corresponding cattle PGDB.
List of enzyme activities reported in cattle and other mammals that do not have a mammalian gene identified
| Pathway | Enzyme name | EC number* | Mammalian species |
| Ascorbate glutathione cycle | L-ascorbate peroxidase† | 1.11.1.11 | cattle [ |
| 4-hydroxyproline degradation | 4-hydroxy-2-oxoglutarate aldolase | 4.1.3.16 | cattle [ |
| Ascorbate biosynthesis | gluconolactonase | 3.1.1.17 | cattle [ |
| Ascorbate biosynthesis | uronolactonase | 3.1.1.19 | cattle [ |
| γ-glutamyl cycle | γ-glutamyl cyclotransferase | 2.3.2.4 | cattle [ |
| Selenocysteine biosynthesis | selenocysteine synthase | 2.9.1.1 | mouse [ |
*Enzyme Commission number
†Partial sequence of the protein is known (UniProt: Q7M3G0)
List of metabolic enzyme activities without an identified mammalian gene
| Pathway | Enzyme name | EC number* | Mammalian species |
| 4-hydroxyproline degradation | 4-oxoproline reductase | 1.1.1.104 | human [ |
| Ascorbate biosynthesis | glucuronolactone reductase | 1.1.1.20 | rat [ |
| Ascorbate biosynthesis | 1,4-lactonase | 3.1.1.25 | human [ |
| β-alanine biosynthesis | 3-hydroxypropionate dehydrogenase | 1.1.1.59 | pig [ |
| β-alanine biosynthesis | β-alanine-pyruvate transaminase | 2.6.1.18 | rat [ |
| Cysteine degradataion | hypotaurine dehydrogenase | 1.8.1.3 | rat [ |
| Degradation of purine deoxyribonucleosides | phosphopentomutase | 5.4.2.7 | human [ |
| Glutathione detoxification | cycteine-S-conjugate N-acetyltransferase | 2.3.1.80 | human [ |
| Histidine degradation | histidine aminotransferase | 2.6.1.38 | mouse [ |
| Lysine degradation | pyrroline-2-carboxylate reductase | 1.5.1.1 | mouse [ |
| Lysine degradation | L-lysine oxidase | 1.4.3.14 | mouse [ |
| Purine degradation | guanosine deaminase | 3.5.4.15 | human [ |
| Pyridine nucleotide cycling | nicotinamidase | 3.5.1.19 | mouse [ |
| Tryptophan degradation | aminomuconate-semialdehyde dehydrogenase | 1.2.1.32 | cat [ |
| UDP-N-acetylgalactosamine biosynthesis | UDP-N-acetylglucosamine 4-epimerase | 5.1.3.7 | human [ |
| Valine degradation | pyruvate decarboxylase | 4.1.1.1 | human [ |
*Enzyme Commission number
List of human genes with no cattle ortholog identified
| Gene symbol | Gene Name | Refseq protein | EC number* | Entrez-Gene ID | ||
| Human | Mouse | Dog | ||||
| endothelial cell growth factor 1 (platelet-derived) | NP_001944 | 2.4.2.4 | 1890 | 72962 | none | |
| ceramide kinase | NP_073603; NP_872602 | 2.7.1.138 | 64781 | 223753 | 474464 | |
| fatty acid amide hydrolase 2 | NP_777572 | 3.5.1.4 | 158584 | none | none | |
| asparagine-linked glycosylation 12 homolog | NP_077010 | 2.4.1.130 | 79087 | 223774 | 481196 | |
| glutamyl-tRNA synthetase 2, mitochondrial (putative) | NP_001077083 | 6.1.1.17 | 124454 | 67417 | 479807 | |
*Enzyme Commission number
Figure 3Comparison of constructed 'Cyc' metabolic pathways among cattle, human and . Numbers of metabolic pathways, shared by CattleCyc 1.0, HumanCyc 11.0 and EcoCyc 11.0. The numbers in parentheses are the total number of pathways in the corresponding databases. (A) Consensus pathways at the enzyme level (all enzymes in the pathway have the same EC numbers). (B) Consensus pathways at the functional level (pathways that share biological function even though individual enzymes may differ and alternative reactions may exist).