| Literature DB >> 26664751 |
Satish Kumar1, Kishore Kumar Krishnani1, Bharat Bhushan2, Manoj Pandit Brahmane1.
Abstract
In recent years, metagenomics has emerged as a powerful tool for mining of hidden microbial treasure in a culture independent manner. In the last two decades, metagenomics has been applied extensively to exploit concealed potential of microbial communities from almost all sorts of habitats. A brief historic progress made over the period is discussed in terms of origin of metagenomics to its current state and also the discovery of novel biological functions of commercial importance from metagenomes of diverse habitats. The present review also highlights the paradigm shift of metagenomics from basic study of community composition to insight into the microbial community dynamics for harnessing the full potential of uncultured microbes with more emphasis on the implication of breakthrough developments, namely, Next Generation Sequencing, advanced bioinformatics tools, and systems biology.Entities:
Year: 2015 PMID: 26664751 PMCID: PMC4664791 DOI: 10.1155/2015/121735
Source DB: PubMed Journal: Biotechnol Res Int ISSN: 2090-3146
Biological functions derived from the metagenomes from diverse habitats.
| Type of activity exhibited by the metagenomic clone | Library type | Number of clones screened/size of DNA used for library construction | Sampling site | Screening method | Reference |
|---|---|---|---|---|---|
| Lipase | Plasmid and fosmid | 29.3 Gb of cloned soil DNA | German forest soil (horizon A) | Phenotypic detection | [ |
| Fosmid | 200,000 clones | Qiongdongnan basin, South China Sea (water depth 778.5 m) | Phenotypic screening | [ | |
| Fosmid | 15,000 clones | Peat-swamp forest soil from Narathiwat Province, Thailand | Phenotypic detection | [ | |
|
| |||||
| Esterase | Plasmid | 20,000 clones | High Andean forest soil | Phenotypic detection | [ |
| Fosmid | 20000 | Deep-sea sediment | Phenotypic detection | [ | |
| Fosmid | 142,900 | Red pepper plant rhizosphere and strawberry plant rhizosphere | Phenotypic detection | [ | |
|
| |||||
| Protease | Fosmid | 17000 | Surface sand from the Gobi and Death Valley deserts | Phenotypic detection | [ |
| Plasmid | 70,000 | Goat skin surface | Phenotypic detection (skimmed milk) | [ | |
|
| |||||
| Laccase | Plasmid | 8000 | Mangrove soil | Phenotypic detection | [ |
| Phagemid | Not mentioned | Bovine rumen microflora | Phenotypic detection | [ | |
|
| |||||
| Agarase | Cosmids | 1,532 | Soil from uncultivated field (Germany) | Phenotypical detection | [ |
|
| |||||
| Amidase | Plasmids | 193,000 | Soil and enrichment cultures from marine sediment, goose pond, lakeshore, and an agricultural field (Netherlands) | Heterologous complementation | [ |
|
| |||||
| Alcohol oxidoreductase | Plasmids | 900,000 and 400,000 | Soil and enrichment cultures from a sugar beet field (Germany), | Phenotypic detection | [ |
|
| |||||
| Antibiotics and bioactive compounds with anti-infective properties | Fosmid | 80,500 clones from Yuseong and 33,200 clones from Jindong Valley forest soil | Forest soil from Jindong Valley | Phenotypic detection | [ |
| Cosmids | Not mentioned | Bromeliad tank water (Costa Rica) | Phenotypical detection | [ | |
| BAC | 24,546 | Soil | Phenotypic detection | [ | |
|
| |||||
| DNA polymerase 1 | Plasmid | 21,198 Sanger | Octopus hot spring (93°C) in Yellowstone National Park | Activity-based screening | [ |
|
| |||||
| Na+/H+ antiporters | Plasmid | 8,000 | Chaerhan Salt Lake, China | Heterologous complementation | [ |
|
| |||||
| Cellulases and xylanases | Fosmid library | Not mentioned | Hindgut of wood-feeding termite | AZCL-HE cellulose and AZCL-Xylan based assay | [ |
|
| |||||
| Phytases | Fosmid library | 14,440 | Soil | Functional screening | [ |
Comparison of the unique features of NGS platforms widely applied in metagenomic research.
| Sequencer | Roche/454 GS FLX Titanium | HiSeq 2000 | SOLiDv4 |
|---|---|---|---|
| NGS chemistry | Pyrosequencing | Sequencing by synthesis | Sequencing by ligation and exact call chemistry |
|
| |||
| Library/template preparation | Emulsion PCR (emPCR) | Solid phase amplification | Emulsion PCR for fragment/mate-pair end sequencing |
|
| |||
| Average read length | 250–310 bp (highest among the NGS platforms) | Initially it was 36, now approaching 150 | 35 |
|
| |||
| Run time (days) | 24 hours (fastest of all) | 4 days (fragment run) | 7 days (fragment run) |
|
| |||
| Output data/run | 0.7 Gb | 600 Gb | 120 Gb |
|
| |||
| Advantage | Longer reads | High throughput | Highest accuracy due to ECC (exact call chemistry) |
|
| |||
| Limitations | High error rate in homopolymer region | Short read length | Long run time |
A brief description of bioinformatic tools commonly employed for postsequencing analysis of metagenomic sequence data.
| Postsequencing task | Bioinformatic tool | Brief description | URL | Reference |
|---|---|---|---|---|
| Metagenomic assembly tool | MetaVelvet | Decomposes a de Bruijn graph into individual subgraphs on the basis of coverage (abundance) difference and graph connectivity. |
| [ |
| Meta-IDBA | Implies partitioning the de Bruijn graph into isolated components of different species by grouping similar regions of similar subspecies and partitioning the graph into components based on the topological structure of the graph. |
| [ | |
| Genovo | Uses Bayesian approach and generative probabilistic model of read generation which works by discovering likely sequence reconstructions under the model. |
| [ | |
| Bambus 2 | Uses mate-pair information during the assembly process which is not used by Meta-IDBA, MetaVelvet, and Genovo. |
| [ | |
|
| ||||
| Short read alignment and mapping to reference genome | Bowtie | An ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences which employs Burrows-Wheeler index based on the full-text minute-space (FM) index having low memory footprint (1.3 GB only) |
| [ |
| BWA | Employed for mapping low-divergent sequences against a large reference genome. |
| [ | |
| SOAP 3 | Fast, accurate, and sensitive GPU-based short read aligner which delivers high speed and sensitivity simultaneously. |
| [ | |
| mrsFAST | A cache oblivious mapper that is designed to map short reads to reference genome. |
| [ | |
|
| ||||
| Microbial diversity analysis | MLST | Exploits unambiguous nature and electronic portability of nucleotide sequence data for the characterization of microorganisms. |
| [ |
| Axiome | Streamlines and manages analysis of small subunit (SSU) rRNA marker data in QIIME and mothur. |
| [ | |
| PHACCS | Uses the contig spectrum from shotgun DNA based on modified Lander-Waterman algorithm sequence assemblies to predict structure of viral communities and make predictions about diversity. |
| [ | |
|
| ||||
| Functional annotation | RAMMCAP | An ultrafast method that can cluster and annotate one million metagenomic reads in only hundreds of CPU hours. |
| [ |
|
| ||||
| Gene annotation/gene calling | FragGeneScan | Combines sequencing error models and codon usages in a hidden Markov model to improve the prediction of protein-coding region in short reads. |
| [ |
| MetaGeneMark | An ab initio gene prediction tool with updated heuristic models designed for metagenomic sequences. |
| [ | |
| MetaGeneAnnotator | Precisely predicts all kinds of prokaryotic genes from a single or a set of anonymous genomic sequences having a variety of lengths. |
| [ | |
|
| ||||
| Binning | TETRA | Based on statistical analysis of tetranucleotide usage patterns in genomic fragments which automate the task of comparative tetranucleotide frequency analysis and outperform (G+C) content based analysis. |
| [ |
| MetaCluster 5.0 | A two-round binning method that separates reads of high-abundance species from those of low-abundance species in two different rounds and aims at identifying both low-abundance and high-abundance species in the presence of a large amount of noise due to many extremely low-abundance species. |
| [ | |
| Phymm | Uses interpolated Markov models (IMMs) to characterize variable-length oligonucleotides typical of a phylogenetic grouping. |
| [ | |
|
| ||||
| Automated platforms/servers for comparative and functional analysis of metagenomic sequence data | MG-RAST | MG-RAST (the Metagenomics RAST) server is an automated analysis platform which provides upload, quality control, automated annotation, and analysis for prokaryotic metagenomic shotgun samples. |
| [ |
| MetAMOS | An open source and modular metagenomic assembly and analysis pipeline leveraging over 20 existing tools with some new tools integrated as well. |
| [ | |
| MEGAN 4 | Released in 2011 for taxonomic analysis, comparative analysis, and functional analysis methods based on the SEED and KEGG (Kyoto Encyclopedia for Genes and Genomes) |
| [ | |
| IMG/M | A data management and analysis system for microbial community genomes (metagenomes) hosted at the Department of Energy's (DOE) Joint Genome Institute (JGI). |
| [ | |
| CAMERA | Provides access to raw environmental sequence data, with associated metadata, precomputed annotation, and analyses. |
| [ | |
| GALAXY | A publicly available web service, with software system that provides support for analysis of genomic, comparative genomic, and functional genomic data through a framework that gives experimentalists simple interfaces to powerful tools while automatically managing the computational details. |
| [ | |