| Literature DB >> 22646978 |
Ye Deng1, Yi-Huei Jiang, Yunfeng Yang, Zhili He, Feng Luo, Jizhong Zhou.
Abstract
BACKGROUND: Understanding the interaction among different species within a community and their responses to environmental changes is a central goal in ecology. However, defining the network structure in a microbial community is very challenging due to their extremely high diversity and as-yet uncultivated status. Although recent advance of metagenomic technologies, such as high throughout sequencing and functional gene arrays, provide revolutionary tools for analyzing microbial community structure, it is still difficult to examine network interactions in a microbial community based on high-throughput metagenomics data.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22646978 PMCID: PMC3428680 DOI: 10.1186/1471-2105-13-113
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Overview of the Random Matrix Theory (RMT)-based molecular ecological network analysis. Two major parts are included, network construction and network analyses. In each of them, several key steps are outlined.
Figure 2Process of random matrix theory-based approach for automatically detecting threshold to construct molecular ecological networks.
The network topological indexes used in this study
| Connectivity | It is also called node degree. It is the most commonly used concept for desibing the topological property of a node in a network. | [ | ||
| Stress centrality | It is used to desibe the number of geodesic paths that pass through the ith node. High Stress node can serve as a broker. | [ | ||
| Betweenness | It is used to desibe the ratio of paths that pass through the ith node. High Betweenness node can serve as a broker similar to stress centrality. | [ | ||
| Eigenvector centrality | It is used to desibe the degree of a central node that it is connected to other central nodes. | [ | ||
| Clustering coefficient | It desibes how well a node is connected with its neighbors. If it is fully connected to its neighbors, the clustering coefficient is 1. A value close to 0 means that there are hardly any connections with its neighbors. It was used to desibe hierarchical properties of networks. | [ | ||
| Vulnerability | It measures the deease of node i on the system performance if node i and all associated links are removed. | [ | ||
| Average connectivity | Higher | [ | ||
| Average geodesic distance | A smaller | [ | ||
| Geodesic efficiency | all parameters shown above. | It is the opposite of | [ | |
| Harmonic geodesic distance | The reciprocal of | [ | ||
| Centralization of degree | max( | It is close to 1 for a network with star topology and in contrast close to 0 for a network where each node has the same connectivity. | [ | |
| Centralization of betweenness | max( | It is close to 0 for a network where each node has the same betweenness, and the bigger the more difference among all betweenness values. | [ | |
| Centralization of stress centrality | max( | It is close to 0 for a network where each node has the same stress centrality, and the bigger the more difference among all stress centrality values. | [ | |
| Centralization of eigenvector centrality | max( | It is close to 0 for a network where each node has the same eigenvector centrality, and the bigger the more difference among all eigenvector centrality values. | [ | |
| Density | It is closely related to the average connectivity. | [ | ||
| Average clustering coefficient | It is used to measure the extent of module structure present in a network. | [ | ||
| Transitivity | Sometimes it is also called the entire clustering coefficient. It has been shown to be a key structural property in social networks. | [ | ||
| Connectedness | It is one of the most important measurements for summarizing hierarchical structures. | [ | ||
Common characters of complex networks
| It is a most notable characteristic in complex systems. It was used to desibe the finding that most nodes in a network have few neighbors while few nodes have large amount of neighbors. In most cases, the connectivity distribution asymptotically follows a power law [ | |
| It is a terminology in network analyses to depict the average distance between nodes in a network is short, usually logarithmically with the total number of nodes [ | |
| It was used to demonstrate a network which could be naturally divided into communities or modules [ | |
| It was used to depict the networks which could be arranged into a hierarchy of groups representing in a tree structure. Several studies demonstrated that metabolic networks are usually accompanied by a hierarchical modularity [ |
Topological properties of the empirical molecular ecological networks (MENs) of additional miobial communities and their associated random MENs
| Habitats of communitiesb | Empirical networks | Random networks | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Similarity threshold ( | Network size ( | R2 of power law | R2 of scaling law | Average path (GD) | Average Clustering coefficient ( | Modularity & (the number of modules) | Average path (GD) | Average clustering coefficient ( | Modularity ( | |
| Grassland soils under elevated CO2, MN (i) | 0.80 | 254 | 0.79 | 0.25 | 3.09 | 0.22 | 0.44 (18) | 3.00 ± 0.03 | 0.099 ± 0.009 | 0.31 ± 0.01 |
| Grassland soils under ambient CO2, MN (i) | 0.80 | 184 | 0.88 | 0.11 | 4.21 | 0.10 | 0.65 (16) | 3.84 ± 0.06 | 0.028 ± 0.007 | 0.52 ± 0.01 |
| Lake sediment, Lake DePue, WI (ii) | 0.92 | 151 | 0.85 | 0.73 | 3.47 | 0.09 | 0.48 (8) | 3.46 ± 0.05 | 0.046 ± 0.010 | 0.45 ± 0.01 |
| Groundwater, Well 101–2, Oak Ridge, TN (iii) | 0.95 | 107 | 0.74 | 0.44 | 3.12 | 0.29 | 0.52 (11) | 3.13 ± 0.07 | 0.081 ± 0.017 | 0.40 ± 0.01 |
| Groundwater Well 102–2, Oak Ridge, TN (iii) | 0.89 | 140 | 0.79 | 0.21 | 4.22 | 0.17 | 0.67 (12) | 3.89 ± 0.08 | 0.033 ± 0.012 | 0.53 ± 0.01 |
| Groundwater Well 102–3, Oak Ridge, TN (iii) | 0.87 | 117 | 0.85 | 0.19 | 3.57 | 0.25 | 0.64 (13) | 3.54 ± 0.09 | 0.049 ± 0.013 | 0.48 ± 0.01 |
| Grassland soils under warming, Norman, OK (iv) | 0.76 | 177 | 0.83 | 0.48 | 3.91 | 0.13 | 0.67 (18) | 3.94 ± 0.20 | 0.020 ± 0.008 | 0.44 ± 0.01 |
| Grassland soils under unwarming, Norman, OK (iv) | 0.76 | 152 | 0.88 | 0.10 | 2.71 | 0.09 | 0.61 (20) | 3.39 ± 0.23 | 0.038 ± 0.010 | 0.47 ± 0.01 |
| Grassland soils under elevated CO2, MN (i) | 0.78 | 263 | 0.89 | 0.26 | 3.95 | 0.25 | 0.81 (34) | 3.98 ± 0.22 | 0.015 ± 0.006 | 0.61 ± 0.02 |
| Grassland soils under ambient CO2, MN (i) | 077 | 292 | 0.87 | 0.22 | 4.26 | 0.27 | 0.85 (36) | 4.10 ± 0.20 | 0.017 ± 0.005 | 0.59 ± 0.01 |
| Agricultural soil, Africa (v) | 0.77 | 384 | 0.86 | 0.20 | 4.99 | 0.34 | 0.86 (32) | 3.99 ± 0.04 | 0.020 ± 0.004 | 0.48 ± 0.01 |
| Human intestine, Stanford, CA (vi) | 0.86 | 215 | 0.92 | 0.18 | 3.55 | 0.13 | 0.69 (27) | 4.23 ± 0.10 | 0.025 ± 0.009 | 0.58 ± 0.01 |
aVarious parameters of the empirical networks and generation of random networks are explained in the Table 1.
bSample sources: (i) the grassland soils under elevated and ambient CO2 were collected from a free-air CO2 enrichment field in Minnesota which were analyzed with both GeoChip3.0 and 16 S pyrosequencing [49]. The fMENs analysis was desibed in Zhou et al. [27] and pMENs analysis was desibed in Zhou et al. [28]. (ii) The lake sediment samples from Lake DePue were analyzed with GeoChip 2.0. (iii) The groundwater samples from three different Wells in Oak Ridge, Tennessee were analyzed with GeoChip 2.0 [50]. (iv) The grassland samples under warming and unwarming were collected from the long term warming experiment at Oklahoma [51] and analyzed with 16 S pyrosequencing [48]. (v) The pyrosequencing data of agricultural soils from Africa and the groundwater samples from Oak Ridge was provided by Dr. Tiedje and his colleagues at Michigan State University. (vi) The human intestine sample from Stanford was desibed elsewhere [52].
Figure 3The robustness to noise of RMT-based MEN construction. Ineasing levels of Gaussian noise were added to the pyrosequencing datasets under experimental warming. The mean of noise was zero and standard deviation (σnoise) was set to 5, 10, 20, 30 to 100 % of the average of relative abundance of whole dataset. The thresholds (S) of all permutated datasets were set to 0.76 that was consistent with original dataset.
Figure 4The submodules of the warming pMEN. (A) The network graph with submodule structure by the fast greedy modularity optimization method. Each node signifies an OTU, which could correspond to a miobial population. Colors of the nodes indicate different major phyla. A blue edge indicates a positive interaction between two individual nodes, while a red edge indicates a negative interaction. (B) The correlations and heatmap to show module eigengenes of warming pMEN. The upper part is the hierarchical clustering based on the Pearson correlations among module eigengenes and the below heatmap shows the coefficient values (r). Red color means higher correlation whereas green color signified lower correlation. (C) ZP-plot showing distribution of OTUs based on their module-based topological roles. Each dot represents an OTU in the dataset of warming (red), or unwarming (green). The topological role of each OTU was determined according to the scatter plot of within-module connectivity (z) and among-module connectivity (P) [55,60].
The partial Mantel tests on connectivity vs. the OTU significances of soil geochemical variables and soil temperature in warming pyrosquencing molecular ecological network
| All detected OTUs | 177 | 0.104 | 0.159 | ||
| 35 | 0.059 | 0.234 | −0.054 | 0.800 | |
| 63 | −0.033 | 0.650 | 0.077 | 0.135 | |
| 5 | −0.339 | 0.663 | 0.367 | 0.108 | |
| 6 | −0.082 | 0.521 | −0.202 | 0.788 | |
| 26 | −0.057 | 0.721 | 0.096 | 0.155 | |
| 12 | 0.590 | −0.001 | 0.430 | ||
| 6 | 0.338 | 0.088 | −0.298 | 0.877 | |
| 4 | 0.030 | 0.772 | 0.796 | 0.243 | |
| 5 | 0.926 | −0.755 | 1.000 | ||
aSoil variables used for OTU significance calculations: pH values, NO3-Nitrogen and soil carbon contents.
bCorrelation coefficient based on Mantel test.
cThe significance (probability) of Mantel test.
Figure 5The correlations between module eigengenes and environmental traits in the warming pMEN. The color of each plot indicates the correlation between corresponding module eigengene and environmental trait. Red color means highly positive correlation and green color means highly negative correlation. The numbers in each plot are the correlation coefficient (r) and significance (p) in parentheses. The environmental traits include soil pH value (pH), NO3-nitrogen content (NO3N), soil carbon content (SC) and average soil temperature (avgT).
Figure 6An overview of molecular ecological network analysis pipeline (MENAP).