| Literature DB >> 19906710 |
Vineet K Sharma1, Naveen Kumar, Tulika Prakash, Todd D Taylor.
Abstract
Microbial enzymes have many known applications as biocatalysts in biotechnology, agriculture, medical and other industries. However, only a few enzymes are currently employed for such commercial applications. In this scenario, the current onslaught of metagenomic data provides a new unexplored treasure trove of genomic wealth that can not only enhance the enzyme repertoire by the discovery of novel commercially useful enzymes (CUEs) but can also reveal better functional variants for existing CUEs. We prepared a catalogue of CUEs using text mining of PubMed abstracts and other publicly available information, and manually curated the data to identify 510 CUEs. Further, in order to identify novel homologues of these CUEs, we identified potential ORFs in publicly available metagenomic datasets from 10 diverse sources. Using this strategy, we have developed a resource called MetaBioME (http://metasystems.riken.jp/metabiome/) that comprises (i) a database of CUEs and (ii) a comprehensive platform to facilitate homology-based computational identification of novel homologous CUEs from metagenomic and bacterial genomic datasets. Using MetaBioME, we have identified several novel homologues to known CUEs that can potentially serve as leads for further experimental verification.Entities:
Mesh:
Year: 2009 PMID: 19906710 PMCID: PMC2808964 DOI: 10.1093/nar/gkp1001
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Steps in the construction of the MetaBioME database.
Distribution of CUEs showing significant homology with novel ORFs in metagenomic datasets
| Application | aTotal potential homologs (%) | Homologous ORFs predicted in Metagenomic Datasets (Coverage: ≥90%, Identity ≥50%)b | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Human gut | Mouse gut | Termite gut | Marine | Mine drainage | Sludge | Soil | Microbial mat | Whale fall | ||
| Agriculture | 14 (34) | 11 | 1 | 4 | 12 | 3 | 7 | 5 | 4 | 6 |
| Biosensor | 34 (51) | 27 | 1 | 3 | 30 | 1 | 19 | 10 | 7 | 11 |
| Biotechnology | 104 (54) | 79 | 7 | 18 | 92 | 15 | 53 | 22 | 14 | 33 |
| Energy | 6 (55) | 6 | 1 | 2 | 5 | 0 | 1 | 0 | 1 | 0 |
| Environment | 31 (49) | 17 | 3 | 2 | 29 | 1 | 11 | 7 | 5 | 8 |
| Food and Nutrition | 46 (47) | 39 | 1 | 6 | 39 | 1 | 13 | 6 | 3 | 12 |
| Medical | 38 (42) | 33 | 5 | 7 | 32 | 5 | 21 | 13 | 7 | 16 |
| Miscellaneous | 8 (47) | 7 | 1 | 4 | 8 | 1 | 4 | 2 | 2 | 2 |
| Other industries | 7 (35) | 6 | 0 | 1 | 5 | 0 | 2 | 1 | 1 | 2 |
aTotal number of homologous ORFs and their percentages (in brackets) that showed ≥50% sequence identity and ≥90% alignment coverage with CUEs, taken together for all metagenomic datasets.
bThe number of homologous CUEs out of the total number of CUEs in that category.