| Literature DB >> 33920040 |
Joao Pedro Saraiva1, Anja Worrich1, Canan Karakoç1,2, Rene Kallies1, Antonis Chatzinotas1,2,3, Florian Centler1, Ulisses Nunes da Rocha1.
Abstract
Mining interspecies interactions remain a challenge due to the complex nature of microbial communities and the need for computational power to handle big data. Our meta-analysis indicates that genetic potential alone does not resolve all issues involving mining of microbial interactions. Nevertheless, it can be used as the starting point to infer synergistic interspecies interactions and to limit the search space (i.e., number of species and metabolic reactions) to a manageable size. A reduced search space decreases the number of additional experiments necessary to validate the inferred putative interactions. As validation experiments, we examine how multi-omics and state of the art imaging techniques may further improve our understanding of species interactions' role in ecosystem processes. Finally, we analyze pros and cons from the current methods to infer microbial interactions from genetic potential and propose a new theoretical framework based on: (i) genomic information of key members of a community; (ii) information of ecosystem processes involved with a specific hypothesis or research question; (iii) the ability to identify putative species' contributions to ecosystem processes of interest; and, (iv) validation of putative microbial interactions through integration of other data sources.Entities:
Keywords: ecosystem processes; microbial communities; multi-omics; synergistic interactions
Year: 2021 PMID: 33920040 PMCID: PMC8070991 DOI: 10.3390/microorganisms9040840
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1Interplay between microbial community size, functional potential and interspecies interactions. Functional richness and redundancy increase with microbial community richness. Three microbial communities (A) are represented with different levels of species richness (i, ii and iii), i.e. different number of unique species (represented by a geometric shape). Each species is capable of performing a number of functions that are represented by a specific color (B). For example, community is composed by two species each capable of performing two different sets of functions. The number of unique functions illustrates the functional richness of each microbial community. Functional redundancy is determined by the number of microbes with the genetic potential to perform the same function. Thus, an increase in the number of unique species is more likely to result in an increase of the functional redundancy (B1) and richness (B2) of a microbial community. For example, from community to community there is an increase of two species and one unique function but the number of species capable of performing multiple functions (i.e., functional redundancy) also doubled for four functions: orange, yellow, blue and red. Furthermore, the combinations of interspecies interactions (C) is not only dependent on the individual microorganism’s genetic potential but also determined by the environmental conditions. For example, growth in Environment 1 requires a microbe’s ability to perform two functions (blue and yellow) while for Environment 2 three functions are required (orange, blue and red). Although not a linear relationship, the higher the number of species in a microbial community, the higher the probability of an increased number of interspecies interactions (as long as the genetic potential is present).
Pros and cons of current modelling approaches to predict microbial interactions, environments where the selected approaches have been used and respective references.
| Approach | Pros | Cons | Environments | References |
|---|---|---|---|---|
| Supra-organism | Global reaction network is possible and allows for prediction of shifts in pathway activity by measuring gene relative abundance. | Genetic potential of individual species not determined. | Anaerobic mixed culture fermentations | [ |
| Contribution of individual species to shifts in pathway activity not determined since interactions are based on genes/reactions. | Agricultural soil and seep sea “whale fall” carcasses | [ | ||
| Population-based | Species boundaries explicitly defined. | High computational and manual curation efforts since full genome-scale metabolic models for each species is required. | Corals | [ |
| Anoxic sediments | [ | |||
| Batch and Continuous cultures | [ | |||
| Synthetic microbial systems | [ | |||
| Guild-based | Less complex models since grouping of species is based on their known functional traits. | Requires previous knowledge on functional traits. | Soil | [ |
Outcomes and limitations of different methods to study microbial interactions. We assigned four validation strategies to confirm microbial interaction as following: (1) Expression or activity assays (e.g., transcriptomics, proteomics, metabolomics, RT-PCR, FBA); (2) 3D structure and spatial variability; (3) Substrate specificity; and, (4) Temporal variability.
| Outcome | Limitations | Methods | Environment | Validation | Ref. a | |||
|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | |||||
| Improvement in the identification of microbial community species. | Lack of mechanistic understanding of species interactions. | Combination of MALDI-TOF MS b analysis and high-throughput sequencing 16S rRNA c. | Kimchi | ✓ | O | O | ✓ | [ |
| 16S rRNA gene sequencing. | Human oral environments | O | ✓ | O | O | [ | ||
| Demonstration of the influence of abiotic factors on microbial community dynamics. | High computational and data requirements for reconstruction of individual metabolic models. | Metagenomics, metabolic network reconstruction and FBA d. | Anaerobic digestion microbiomes | ✓ | O | ✓ | O | [ |
| Lack of mechanistic understanding of species interactions. | PLS-PM e | Rice soil rhizosphere | O | ✓ | O | ✓ | [ | |
| 16S rRNA gene sequencing. | Urban and forest park soil litter layers | O | ✓ | O | ✓ | [ | ||
| In vivo experiment of meadow steppe soil under different precipitation regimes. | Topsoil | ✓ | ✓ | O | ✓ | [ | ||
| High computational and data requirements for reconstruction of individual metabolic models and complex wet-lab experiments required for validation. | Metabolic network reconstruction, EFM f and FBA. | Acid-sulfate-chloride springs | ✓ | O | ✓ | O | [ | |
| Demonstration of the influence of interspecies interactions on microbial community dynamics. | Lack of mechanistic understanding of species interactions. | Co-culture of isolates, RNA-Seq g and RT-qPCR h. | Wine fermentation | ✓ | O | O | ✓ | [ |
| qPCRi and 16S rRNA gene sequencing. | Mixed bacterial consortia | ✓ | O | O | ✓ | [ | ||
| Improved mechanistic understanding of interspecies interactions. | Complex wet-lab experiments required for validation. | SIP j and Metagenomics. | Continuous up-flow anaerobic sludge blanket reactors | ✓ | O | ✓ | ✓ | [ |
| Pure and co-cultures and cyclic voltammetry analysis. | Palm oil mill effluent | O | O | ✓ | ✓ | [ | ||
| High computational and data requirements for reconstruction of individual metabolic models. | Mono- and co-culture, metabolic network reconstruction, bipartite graphs, HPLC k, CGQ l, GC-MS m; SIP. | In silicon experiments with pure and co-culture | ✓ | O | ✓ | ✓ | [ | |
| Metabolic network reconstruction and cFBA n. | In silicon experiments pure cultures | ✓ | O | ✓ | O | [ | ||
| Metabolic network reconstruction, evolutionary game theory and FBA. | In silicon experiments pure cultures | ✓ | O | O | O | [ | ||
| Metagenomics, Metatranscriptomics. | Synthetic human gut | ✓ | ✓ | O | O | [ | ||
a Ref., numbers in between brackets represent references for the different studies; b MALDI-TOF: matrix-assisted laser desorption/ionization; c rRNA: Ribosomal ribonucleic acid; d FBA: Flux Balance Analysis; e PLS-PM: Partial least squares - path model; f EFM: elementary flux mode; g RNA-Seq: Ribonucleic acid sequencing; h RT-qPCR: Real Time quantitative polymerase chain reaction; i qPCR: Quantitative polymerase chain reaction; j SIP: stable isotope probing; k HPLC: High-performance liquid chromatography; l CGQ: cell growth quantifier; m GC-MS: Gas chromatography mass spectrometry; n cFBA: Community Flux Balance Analysis.
Figure 2Theoretical work frame for prediction of interspecies interactions. Here, a microbial community is composed of four species (A) and a hypothetical ecosystem process/pathway of interest (B) requires the presence of five single protein-encoding genes and five protein-encoding genes that participate in the formation of two protein complexes. First, annotation of individual genomes from the microbial community is performed using only the set of target genes (C). From here, one can determine which species possess the complete functional potential to perform the target pathway (D). Additionally, one can also determine which species or groups of species possess a combined genomic potential to perform the complete ecosystem process (E) putative interacting species. Further refinement of the generated lists can be achieved by the inclusion of experimental data, species absolute abundances, literature searches, specialized databases and other omics data types (e.g., transcriptomics, metabolomics and proteomics) (F) leading to increased robustness of predictions and reduction of the number of interspecies interactions for experimental validation (G).