| Literature DB >> 35749381 |
Natalia Favila1, David Madrigal-Trejo2, Daniel Legorreta1, Jazmín Sánchez-Pérez2, Laura Espinosa-Asuar2, Luis E Eguiarte2, Valeria Souza2,3.
Abstract
Applications of network theory to microbial ecology are an emerging and promising approach to understanding both global and local patterns in the structure and interplay of these microbial communities. In this paper, we present an open-source python toolbox which consists of two modules: on one hand, we introduce a visualization module that incorporates the use of UMAP, a dimensionality reduction technique that focuses on local patterns, and HDBSCAN, a clustering technique based on density; on the other hand, we have included a module that runs an enhanced version of the SparCC code, sustaining larger datasets than before, and we couple the resulting networks with network theory analyses to describe the resulting co-occurrence networks, including several novel analyses, such as structural balance metrics and a proposal to discover the underlying topology of a co-occurrence network. We validated the proposed toolbox on 1) a simple and well described biological network of kombucha, consisting of 48 ASVs, and 2) we validate the improvements of our new version of SparCC. Finally, we showcase the use of the MicNet toolbox on a large dataset from Archean Domes, consisting of more than 2,000 ASVs. Our toolbox is freely available as a github repository (https://github.com/Labevo/MicNetToolbox), and it is accompanied by a web dashboard (http://micnetapplb-1212130533.us-east-1.elb.amazonaws.com) that can be used in a simple and straightforward manner with relative abundance data. This easy-to-use implementation is aimed to microbial ecologists with little to no experience in programming, while the most experienced bioinformatics will also be able to manipulate the source code's functions with ease.Entities:
Mesh:
Year: 2022 PMID: 35749381 PMCID: PMC9231805 DOI: 10.1371/journal.pone.0259756
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Description of several network metrics and properties currently used in biological networks, including some of their prospective interpretations.
| Metric/Network property | Definition | Prospective biological Interpretation in microbial co-occurrence networks | References |
|---|---|---|---|
| Total Nodes/Vertices | Total Entities within a network. | Total number of taxa (species, OTUs/ASVs) in a network (species richness); number of connected taxa; common measure of ecosystem state in response to perturbations | [ |
| Edges, Links, Relationship, connection | Relationship or associations between nodes. For co-occurrence networks, relationships exist between pairs of nodes. | Ecological associations, including interspecific interactions, niche overlap, cross-feeding, abiotic co-occurrence drivers, among others | [ |
| Density, Connectance, Complexity (network scale), Interactions diversity, Probability of connection | Fraction of edges that are actually present in the network with respect to all possible edges. | Reflection of the incidence of ecosystemic processes; Possible measure of ecological resilience; organization level of the community; measure of complexity in the microbial network | [ |
| Connectivity | Total number of relationships in a network | Total number of ecological associations within a biological network | [ |
| Connected component | Sets of nodes, where every pair of nodes have a path between them. | Microbial network where every OTU/ASV have an indirect ecological association with every other OUT/ASV | [ |
| Average degree, Complexity (taxon scale), Connectedness (normalized degree) | Average number of edges connected to a node; average number of neighbors for a given node. | Measure of complexity in the microbial network | [ |
| Degree centrality | Centrality of a node based on degree. i.e., nodes with higher degree are more central to the network. It is a measurement of popularity. | Keystone taxa; taxa that interacts the most within the community | [ |
| Closeness centrality | Centrality of a node based on its proximity to all other nodes in the network. It is a measure of broadcaster nodes, that is, nodes that can influence the network fastest | Keystone taxa; taxa that, if perturbed, influence the network the fastest. | [ |
| Betweenness centrality | Centrality of a node based on how often a node is situated on paths between other nodes. It is a measurement of bridge nodes | Keystone taxa; taxa more important in communication in the network. | [ |
| PageRank | Centrality measure that computes a ranking of the nodes based on the structure of the incoming links. It identifies hub nodes. | Keystone taxa | [ |
| Negative:Positive relationship ratio, Behavior | Ratio of positive and negative relationships. If > 1 there are more negative interactions, if < 1 there are more positive interactions present in the network. | Potential measure of cooperation level within the community; measure of community stability (ecological resilience and resistance) | [ |
| Average shortest path length (AL), Average geodesic path | Average number of steps in the shortest paths from one node to another. It is calculated for all pairs and then averaged. | Microbial networks usually present small AL: measure of network’s response speed to perturbations (ecological resilience); community cohesion; measure of information and substance flow | [ |
| Diameter, Longest geodesic path | Length of the longest finite geodesic path anywhere in the network. | Measure of information and substance flow | [ |
| Small world index (SW) | Index based on a tradeoff between high clustering coefficient and short path length, the defining characteristics of small-world networks. Networks with SW > 1 are said to have more “small-worldness”. | Microbial network topological property. Small-world microbial networks suggests that any two members in the community could interact with each other through a few intermediaries. | [ |
| Clustering coefficient, Transitivity | Average probability that two nodes neighbors of a third node are also connected between each other. | Presence of tripartite relationships (e.g., higher-order biological interactions) within the community; possible measure for redundance. | [ |
| Modularity, Assortativity (when normalized) | Quantification of compartmentalization into subgroups. Loosely speaking, high modularity means that there are more edges within groups and fewer between groups. | Modules/Clusters have been interpreted as niches; shared ecological functions among taxa; spatial compartmentalization; similar habitat preferences; measure of community stability (ecological resilience and resistance) | [ |
| Triad motifs and Balanced triads fraction | Motifs are overrepresented subnetworks (patterns). Triad motifs are classified by balanced or imbalanced based on the relationship types (positive or negative). | Motifs can be relevant in information flow (e.g., quorum sensing); potential biomarkers for microbiome perturbed state. | [ |