| Literature DB >> 35712047 |
Ksenia Guseva1, Sean Darcy1, Eva Simon1,2, Lauren V Alteio1, Alicia Montesinos-Navarro3, Christina Kaiser1.
Abstract
Network analysis has been used for many years in ecological research to analyze organismal associations, for example in food webs, plant-plant or plant-animal interactions. Although network analysis is widely applied in microbial ecology, only recently has it entered the realms of soil microbial ecology, shown by a rapid rise in studies applying co-occurrence analysis to soil microbial communities. While this application offers great potential for deeper insights into the ecological structure of soil microbial ecosystems, it also brings new challenges related to the specific characteristics of soil datasets and the type of ecological questions that can be addressed. In this Perspectives Paper we assess the challenges of applying network analysis to soil microbial ecology due to the small-scale heterogeneity of the soil environment and the nature of soil microbial datasets. We review the different approaches of network construction that are commonly applied to soil microbial datasets and discuss their features and limitations. Using a test dataset of microbial communities from two depths of a forest soil, we demonstrate how different experimental designs and network constructing algorithms affect the structure of the resulting networks, and how this in turn may influence ecological conclusions. We will also reveal how assumptions of the construction method, methods of preparing the dataset, and definitions of thresholds affect the network structure. Finally, we discuss the particular questions in soil microbial ecology that can be approached by analyzing and interpreting specific network properties. Targeting these network properties in a meaningful way will allow applying this technique not in merely descriptive, but in hypothesis-driven research. Analysing microbial networks in soils opens a window to a better understanding of the complexity of microbial communities. However, this approach is unfortunately often used to draw conclusions which are far beyond the scientific evidence it can provide, which has damaged its reputation for soil microbial analysis. In this Perspectives Paper, we would like to sharpen the view for the real potential of microbial co-occurrence analysis in soils, and at the same time raise awareness regarding its limitations and the many ways how it can be misused or misinterpreted.Entities:
Keywords: Co-occurrence networks; Ecological networks; Microbial community structure; Microbial network analysis; Soil microbial ecology
Year: 2022 PMID: 35712047 PMCID: PMC9125165 DOI: 10.1016/j.soilbio.2022.108604
Source DB: PubMed Journal: Soil Biol Biochem ISSN: 0038-0717 Impact factor: 8.546
Fig. 1The hidden relationships (left diagram) within an ecosystem are reconstructed (inferred) from limited observations and measurements, in this case of associations between microorganisms (right diagram).
Fig. 2Workflow for the preparation of a dataset which precedes the network construction. It involves filtering out some ASVs and a data transformation, such as scaling the number of reads a by the total read sum in a sample S, or taking a centered log ratio (clr). The transformed dataset contains relative abundance values constrained by a constant (cte) sum T, or even inferred absolute abundance values for each ASV in each individual sample.
Definitions used in this Perspective Paper.
| Useful definition | |
|---|---|
| Co-occurrence or association networks | Networks constructed from co-occurrence or co-exclusion (abundance) data. |
| Interaction networks | Network of causal relations that exist between organisms in nature. |
| Network reconstruction | The aim to reconstruct the underlying ecological reality, for example the network of interactions in nature, from observed data. |
| Network model construction or network construction | Construction of a model network from observed data, which may or may not capture the underlying ecological reality. |
| Prevalence | Number of samples where a species is present across the complete set of samples. |
| Compositional data | Data where its components represent proportions (parts of some-thing), in other words their sum has a constant value. |
| Subcompositional coherence | Results obtained for a whole data set should not contradict the results obtained from any of its parts. Here in particular if we take into account all taxa or filter some of them out. |
| Prevalence threshold | A threshold, which cuts off taxa with low prevalence in environmental samples. Done previous to network construction. |
| Total sum threshold | A threshold, which filters taxa with low total number of reads, in other words low average read numbers in all environmental samples. Done before network construction. |
| Edge threshold | Threshold for pairwise association measure (e.g. correlation value) defining the establishment of an edge between taxa. Done for network construction. |
Fig. 3The choice of the threshold, to establish an association edge, changes the sparsity of the constructed networks. (a) Frequency, N of the Spearman's rank correlation coefficients ρ obtained for all ASV pairs in the upper soil core (in gray). The selected ρ values by the use of different edge thresholds τ are marked with different colors (red and dark blue). In light blue we show ρ selected as significant by comparison with a null model (for that we use 1000 shuffled versions of our dataset, and p-value = 0.05). (b) Number of positive edges (ρ > 0) in networks constructed with different edge thresholds. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 4Association networks of microbial taxa in a Beech forest soil at different soil depths constructed from Spearman's rank correlation coefficient at different edge thresholds. Shown are networks constructed from upper and lower soil layers combined (a, b), and from each soil layer separately (c–f), for edge thresholds τ = 0.7 (a, c, e) and 0.8 (b, d, f). Notice how the selected value for the edge threshold changes the number of edges in the network as well as the network structure (for details on preparation of the data set see Supplemental material). The network of the combined dataset (a, b) shows not the total 595, but only 461 ASVs shared with either one of the two other datasets (lower panels). The network with the phylogenetic classifications of nodes is also provided in the Supplement. Network visualization was done with graph-tool (Peixoto, 2014).
Fig. 5Networks of positive associations for microorganisms in a Beech forest soil at different soil depths constructed by different network inference methods. Networks constructed from upper and lower soil layers combined are shown in (a, b), and from each soil layer separately in (c–f). Networks were constructed by (left panel) SparCC or by (right panel) SPIEC-EASI-like approach (transformed with clr, and then using GLASSO combined with Stars). To improve visibility nodes with no edges are not shown.
Fig. 6The edges detected by different network construction methods, using Spearman correlation (edge threshold 0.7), SPARCC (threshold 0.65) and GLASSO (STARS for model selection) with and the overlap among them.
Fig. 7Some network properties that can be used to characterise microbial communities, with the respective examples of questions that can be addressed by their use.
Examples of ecological questions that can be assessed with particular network properties depending on the meaning of the links in the network.
| Network property | If network edges can be interpreted as interactions | If network edges can be interpreted as co-occurrences |
|---|---|---|
| Node degree | Are there species more generalist in their interactions than others? | Are there species with broader niche preference than others? |
| How important are species for ecosystem functioning? ( | ||
| Degree centrality | Do highly connected taxa (i.e. hubs) support higher levels of ecosystem functions? | |
| Betweenness centrality | Are there taxa which act as brokers, transmitting the effect of multiple interactions, as many paths between taxa pass through them? | |
| Assortative patterns (Modularity) | Do we find groups of species that tend to interact more among each other than with other species (specificity of interactions)? ( | Do some species tend to co-occur with each other more often than with other species? ( |
| How are environmental factors reflected in the co-occurrence patterns? ( | ||
| Do some factors (i.e. invasion processes) enhance a cluster pattern of co-occurrence? ( | ||
| Disassortative patterns (Nestedness) | Do interaction specialists interact more with generalists than with each other? ( | Do species co-occurring with few species tend to co-occur with species that co-occur with many others? (Co-occurrence patterns mirror confounders, e.g. relative abundance of species or broad/narrow niche preference) |
| Path distance (of pairs or average) | Are the interactions overall tight (direct) or loose (indirect)? ( | Can distances in the network help us to understand the community assembly process? ( |
| Connectance | Which proportion of all potential interactions are actually realized (complexity/stability of the network)? ( | |
| Are there redundant pathways in the interaction network? | ||
| Transitivity | Do we find positive (negative) feedback loops? ( | |
| Ratio of +/− | What is the ratio of cooperation to competition? ( | |
Fig. 8Use of network analysis in exploratory vs hypothesis driven research. (a) Taking into account that community composition is shaped by underlying ecological mechanisms driven by the physical environment or by interactions among taxa. (b) In general, the measured co-occurrence patterns are used to construct association networks, which are different from the interaction networks. (c) Hypothesis-driven research is key to design comparative experiments aiming to isolate and contrast the community assembly mechanism of interest. In particular, established ecological theories can complement hypothesis generation.