| Literature DB >> 33833738 |
Alla L Lapidus1, Anton I Korobeynikov1.
Abstract
Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of the metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers - computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly, a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes that was upgraded to become applicable for metagenomics.Entities:
Keywords: SPAdes; algorithms; metagenomic assembly; metagenomics; microbiota
Year: 2021 PMID: 33833738 PMCID: PMC8021871 DOI: 10.3389/fmicb.2021.613791
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1High-level overview of metaSPAdes assembly pipeline with the crucial steps and data flow outlined. “AG” denotes an ordinary (strain-level) assembly graph and “CAG” is a consensus assembly graph.
Figure 2A pre-genomic tree of life representing three main domains: Bacteria, Archaea, and Eukarya. Stylized image reproduced from Woese and Fox (1997).
Figure 3Next generation sequencing based extended tree of life representing the tree that includes 92 bacterial phyla, 26 archaeal phyla, and all 5 Eukaryotic supergroups. Stylized image reproduced from Hug et al. (2016).