| Literature DB >> 32670511 |
Clémence Frioux1,2, Dipali Singh3, Tamas Korcsmaros2,4, Falk Hildebrand2,4.
Abstract
Metagenomic sequencing of complete microbial communities has greatly enhanced our understanding of the taxonomic composition of microbiotas. This has led to breakthrough developments in bioinformatic disciplines such as assembly, gene clustering, metagenomic binning of species genomes and the discovery of an incredible, so far undiscovered, taxonomic diversity. However, functional annotations and estimating metabolic processes from single species - or communities - is still challenging. Earlier approaches relied mostly on inferring the presence of key enzymes for metabolic pathways in the whole metagenome, ignoring the genomic context of such enzymes, resulting in the 'bag-of-genes' approach to estimate functional capacities of microbiotas. Here, we review recent developments in metagenomic bioinformatics, with a special focus on emerging technologies to simulate and estimate metabolic information, that can be derived from metagenomic assembled genomes. Genome-scale metabolic models can be used to model the emergent properties of microbial consortia and whole communities, and the progress in this area is reviewed. While this subfield of metagenomics is still in its infancy, it is becoming evident that there is a dire need for further bioinformatic tools to address the complex combinatorial problems in modelling the metabolism of large communities as a 'bag-of-genomes'.Entities:
Keywords: Bioinformatics; Gene Functions; Metabolic modelling; Metagenomic-assembled genomes; Metagenomics; Microbiota; Systems biology
Year: 2020 PMID: 32670511 PMCID: PMC7347713 DOI: 10.1016/j.csbj.2020.06.028
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Overview of strategies for functional inference using metagenomics. SSU: Small Sub-Unit. LSU: Large Sub-Unit. DB: database. MAG: Metagenome-Assembled Genome. GSM: Genome-scale Metabolic Model.
Comparison of amplicon sequencing and shotgun metagenomics approaches.
| Amplicon sequencing – Advantages | Metagenomics – Advantages |
|---|---|
Easy to use & cost-effective Standardised approaches & mature bioinformatics Clearly defined taxonomy Good software for reference-free “species” delineation | PCR free approach Genomes of actual strains in sample can be assembled MAGs can be the basis of or associated to GSMs Very diverse analyses possible |
| Amplicon sequencing – Disadvantages | Metagenomics – Disadvantages |
Taxonomic biases in amplification Resolution limited to genus or species level Abundance estimates unreliable due to 16S/18S copy number variations and PCR biases Actual gene content of species unknown Taxonomic representation dependent on primer choice (e.g. Archaea require specific primers | Limitations imposed by sequencing depth, coverage requirements for successful assembly usually only met for dominant microbes Complex bioinformatics & analysis Still costly |
Fig. 2Diversity of methods for metabolic modelling in communities of organisms.
Comparison of some tools and frameworks for GSM-based modelling of interactions in communities. BU: 'bottom-up' i.e. association of individual GSMs into small communities. TD: 'top-down' i.e. analyses starting from large metagenomic-identified communities.
| Tool/Framework | Modelling | Application | Approach |
|---|---|---|---|
| DMMM | dynamic steady-state | a community of 2 bacteria | BU |
| OptCom | steady-state | multi-objective optimisation of communities from 2 to 4 species | BU |
| dOptCom | dynamic steady-state | multi-objective & multi-level optimisation of 3-species communities | BU |
| CASINO | steady-state | 6-species communities | BU |
| COMETS | dynamic steady-state + spatial | 2 and 3-species communities | BU |
| BacArena | dynamic steady-state + spatial | 7-species community | BU |
| SteadyCom | steady-state | 4 and 9-species communities | BU |
| Greenblum | topological | ‘bag-of-genes' per sample | TD |
| Metage2Metabo | network expansion | de novo GSM reconstruction, global analyses and community reduction | TD |
| MMinte | steady-state | pairwise analyses and interactions | TD |
| MICOM | steady-state | metagenomic samples mapped to existing GSMs or newly reconstructed GSMs drafts from genomes following OTUs alignment | TD |