| Literature DB >> 26992093 |
Woonsu Kim1, Hyesun Park1, Seongwon Seo1.
Abstract
The sequence of cattle genome provided a valuable opportunity to systematically link genetic and metabolic traits of cattle. The objectives of this study were 1) to reconstruct genome-scale cattle-specific metabolic pathways based on the most recent and updated cattle genome build and 2) to identify duplicated metabolic genes in the cattle genome for better understanding of metabolic adaptations in cattle. A bioinformatic pipeline of an organism for amalgamating genomic annotations from multiple sources was updated. Using this, an amalgamated cattle genome database based on UMD_3.1, was created. The amalgamated cattle genome database is composed of a total of 33,292 genes: 19,123 consensus genes between NCBI and Ensembl databases, 8,410 and 5,493 genes only found in NCBI or Ensembl, respectively, and 266 genes from NCBI scaffolds. A metabolic reconstruction of the cattle genome and cattle pathway genome database (PGDB) was also developed using Pathway Tools, followed by an intensive manual curation. The manual curation filled or revised 68 pathway holes, deleted 36 metabolic pathways, and added 23 metabolic pathways. Consequently, the curated cattle PGDB contains 304 metabolic pathways, 2,460 reactions including 2,371 enzymatic reactions, and 4,012 enzymes. Furthermore, this study identified eight duplicated genes in 12 metabolic pathways in the cattle genome compared to human and mouse. Some of these duplicated genes are related with specific hormone biosynthesis and detoxifications. The updated genome-scale metabolic reconstruction is a useful tool for understanding biology and metabolic characteristics in cattle. There has been significant improvements in the quality of cattle genome annotations and the MetaCyc database. The duplicated metabolic genes in the cattle genome compared to human and mouse implies evolutionary changes in the cattle genome and provides a useful information for further research on understanding metabolic adaptations of cattle.Entities:
Mesh:
Year: 2016 PMID: 26992093 PMCID: PMC4798299 DOI: 10.1371/journal.pone.0150974
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The experimental approach used for the reconstruction of a new cattle PGDB and identification of metabolic gene duplications in the cattle genome.
Solid and dash arrows represent data and information flows, respectively.
The number of consensus cattle gene pairs in the NCBI and Ensembl cattle genome databases.
| Number of matched pairs | |||
|---|---|---|---|
| Sequential unique matches | |||
| Type of match | Non-exclusive matches | UMD 3.1 | Btau 3.1 |
| Gene coordinates | 2,467 | 5,295 | 2,109 |
| Gene name | 5,742 | 5,963 | 5,187 |
| Function name | 10,009 | 4,693 | 71 |
| Unigene ID | 11,874 | 1,398 | - |
| Cross-reference gene ID | 16,317 | 1,511 | 8,800 |
| Protein accessions | 16,423 | 262 | - |
| EC numbers | 1,255 | 1 | 6 |
| Manually matched | 27 | ||
| Total | 64,087 | 19,123 | 16,200 |
*Manually matched after manual evaluation. Those pairs were not matched because the genes were classified with different gene type or strand in the NCBI and Ensembl gene models.
Distribution of genes in the amalgamated cattle genome database according to the original data sources.
| Number of genes | ||
|---|---|---|
| Type of match | Btau_3.1 | UMD_3.1 |
| Consensus | 16,200 | 19,123 |
| NCBI build only | 12,287 | 8,410 |
| Ensembl build only | 8,932 | 5,493 |
| NCBI genome scaffolds | 245 | 266 |
| Total | 37,664 | 33,292 |
*Scaffolds not in the current genome assembly but included in the NCBI reference genome build
Comparison of cattle-specific pathway genome database (PGDB).
| New cattle PGDB | |||||
|---|---|---|---|---|---|
| Database statistics | Initial | Curated | CattleCyc | ||
| Metabolic pathways | 316 | 304 | 218 | ||
| Enzymatic reactions | 2,371 | 2,371 | 1,439 | ||
| Enzymes | 4,012 | 4,012 | 1,544 | ||
| Compounds | 1,489 | 1,489 | 1,006 | ||
| Number of pathways holes | 202 | 134 | 134 | ||
| Percentage | 19% | 13% | 17% | ||
| Pathway with no holes | 219 | 237 | 154 | ||
| Pathway with 1 hole | 42 | 32 | 38 | ||
| Pathway with 2 holes | 22 | 19 | 13 | ||
| Pathway with 3 holes | 17 | 7 | 2 | ||
| Pathway with 4 holes | 3 | 2 | 7 | ||
| Pathway with >4 holes | 9 | 7 | 7 | ||
| Total pathway with holes | 93 | 67 | 67 | ||
†Cattle specific pathway genome database constructed based on UMD_3.1
‡CattleCyc constructed based on Btau_3.1
*Pathway holes as percentage total reactions in pathways
Fig 2Comparison of cattle-specific pathway genome database (PGDB) based on UMD_3.1 with the previous version of cattle PGDB based on Btau_3.1 (CattleCyc 1.2.1; http://biocyc.org)
List of cattle-specific duplicated metabolic genes compared to human and mouse.
| Gene name | Chr | Start | End | Description | Pathway |
|---|---|---|---|---|---|
| 19 | 55904213 | 55905449 | Arylalkylamine N-acetyltransferase | Serotonin and melatonin biosynthesis | |
| 19 | 55917031 | 55918872 | |||
| 10 | 85431609 | 85435220 | Acyl-CoA thioesterase 4 | Acetyl-CoA hydrolysis, oleate biosynthesis II (animals) | |
| 10 | 85375286 | 85380537 | |||
| 15 | 35836626 | 35837711 | Adenosine kinase | Adenine and adenosine salvage VI | |
| 28 | 30215525 | 30732466 | |||
| 4 | 99223160 | 99254542 | 2,3-biphosphoglycerate mutase | Glycolysis/gluconeogenesis, Rapoport-Luebering glycolytic shunt | |
| 10 | 78463269 | 78464048 | |||
| 20 | 6721327 | 6753670 | Hexosaminidase B | chondroitin sulfate degradation (metazoa), dermatan sulfate degradation (metazoa) | |
| 20 | 6760133 | 6794235 | |||
| 26 | 25060114 | 25074160 | Glutathione S- transferase omega 1 | Arsenate detoxification I (glutaredoxin), glutathione-mediated detoxification | |
| 26 | 25088448 | 25097722 | |||
| 1 | 65922482 | 65928620 | NADH hydrogenase (ubiquinone) 1 beta sub complex, 4, 15kda | Aerobic respiration (cytochrome c) | |
| 21 | 13421643 | 13422116 | |||
| 1 | 3113948 | 3122613 | Superoxide dismutase 1, soluble | Superoxide radicals degradation | |
| 13 | 51930067 | 51930888 |
Fig 3Gene duplication of BPGM in the cattle genome.
Up- and down-stream flanking genes of each duplicated BPGM are shown.