| Literature DB >> 34254819 |
Gongchao Jing1,2, Yufeng Zhang3, Lu Liu1,2, Zengbin Wang1,2, Zheng Sun1,2, Rob Knight4, Xiaoquan Su3, Jian Xu1,2.
Abstract
Microbiomes are inherently linked by their structural similarity, yet the global features of such similarity are not clear. Here, we propose as a solution a search-based microbiome transition network. By traversing a composition-similarity-based network of 177,022 microbiomes, we show that although the compositions are distinct by habitat, each microbiome is on-average only seven neighbors from any other microbiome on Earth, indicating the inherent homology of microbiomes at the global scale. This network is scale-free, suggesting a high degree of stability and robustness in microbiome transition. By tracking the minimum spanning tree in this network, a global roadmap of microbiome dispersal was derived that tracks the potential paths of formulating and propagating microbiome diversity. Such search-based global microbiome networks, reconstructed within hours on just one computing node, provide a readily expanded reference for tracing the origin and evolution of existing or new microbiomes. IMPORTANCE It remains unclear whether and how compositional changes at the "community to community" level among microbiomes are linked to the origin and evolution of global microbiome diversity. Here we propose a microbiome transition model and a network-based analysis framework to describe and simulate the variation and dispersal of the global microbial beta-diversity across multiple habitats. The traversal of a transition network with 177,022 samples shows the inherent homology of microbiome at the global scale. Then a global roadmap of microbiome dispersal derived from the network tracks the potential paths of formulating and propagating microbiome diversity. Such search-based microbiome network provides a readily expanded reference for tracing the origin and evolution of existing or new microbiomes at the global scale.Entities:
Keywords: beta diversity; data mining; microbiome transition; network; scale-free
Year: 2021 PMID: 34254819 PMCID: PMC8407412 DOI: 10.1128/mSystems.00394-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
Distribution of samples among the habitats
| Habitat | Source | No. of samples | |
|---|---|---|---|
| Human-associated | Gut | Feces, etc. | 51,706 |
| Skin | Hand, arm, head, leg, etc. | 19,455 | |
| Oral | Tongue, saliva, plaque, etc. | 10,896 | |
| Other human body-site | Hair, lung, blood, eye, etc. | 3,018 | |
| Urogenital | Vagina, urine, etc. | 1,204 | |
| Nose | Nostril | 489 | |
| Animal-associated | Mammal animal | Mouse, rabbit, dog, deer, etc. | 29,918 |
| Nonmammal animal | Sponge, fish, insect, etc. | 11,172 | |
| Environmental | Building | Indoor environment, etc. | 11,248 |
| Soil | Grass cover, cropland, soil sediment, etc. | 10,507 | |
| Marine water | Sea water | 6,090 | |
| Lake | Lake water, lake sediment, etc. | 4,234 | |
| Plant | Plant rhizosphere, plant surface, etc. | 3,456 | |
| Freshwater | Blank control, tap water, etc. | 3,112 | |
| River | River water, river sediment, etc. | 2,248 | |
| Milk | Tanker milk, blended solo milk, etc. | 1,636 | |
| Sand | Beach, desert, sand sediment, etc. | 968 | |
| Food | Food surface, etc. | 780 | |
| Other | Other | Other environment | 4,074 |
| Mock | Mock microbiome | 811 | |
| Total | 177,022 | ||
FIG 1Distribution of pairwise similarity in n = 177,022 microbiome samples. (A) A P value of <0.01 for significant similarity values in the permutation determined the threshold of 0.868 (under the shadow) for putative direct transition. (B) The threshold has P value = 0.0022 among between-habitat similarity distribution. (C) The threshold is higher than the upper boundary of within-habitat similarities for most habitats. The three panels use the same y axis. P values are calculated by permutation test.
FIG 2Global microbiome network predicts the microbiome habitat. (A) Frequency of within-habitat direct transition is significantly higher than that of between-habitat. P value is calculated by a two-sided t test. (B) The habitat of 89.28% of samples is correctly predicted by the microbiome network. The inner ring represents the proportion of real habitats and the outer ring is the proportion of predicted habitats.
Prediction of habitat based on the microbiome network
| Habitat | No. of samples | No. of correctly predicted samples | % Accuracy | Within-habitat transition frequency | Between-habitat transition frequency |
|---|---|---|---|---|---|
| Gut | 51,706 | 50,431 | 97.53 | 66.89 | 5.40 |
| Skin | 19,455 | 17,464 | 89.77 | 52.11 | 17.82 |
| Oral | 10,896 | 10,070 | 92.42 | 55.10 | 10.98 |
| Other human body-site | 3,018 | 1,777 | 58.88 | 24.64 | 37.92 |
| Urogenital | 1,204 | 1,046 | 86.88 | 44.16 | 17.29 |
| Nose | 489 | 91 | 18.61 | 5.00 | 55.29 |
| Mammal animal | 29,918 | 28,010 | 93.62 | 50.67 | 8.87 |
| Nonmammal animal | 11,172 | 8,077 | 72.30 | 19.81 | 19.95 |
| Building | 11,248 | 8,942 | 79.50 | 40.24 | 24.54 |
| Soil | 10,507 | 9,978 | 94.97 | 54.49 | 7.28 |
| Marine water | 6,090 | 3,960 | 65.02 | 24.29 | 17.77 |
| Lake | 4,234 | 3,983 | 94.07 | 49.64 | 7.35 |
| Plant | 3,456 | 3,127 | 90.48 | 46.91 | 15.16 |
| Freshwater | 3,112 | 1,671 | 53.70 | 22.51 | 28.66 |
| River | 2,248 | 2,011 | 89.46 | 36.84 | 15.55 |
| Milk | 1,636 | 1,565 | 95.66 | 55.21 | 8.23 |
| Sand | 968 | 864 | 89.26 | 44.72 | 23.54 |
| Food | 780 | 677 | 86.79 | 39.01 | 26.57 |
| Other | 4,074 | 3,573 | 87.70 | 38.88 | 13.64 |
FIG 3Robustness of the global microbiome network. (A) Node degree (number of linked neighbors) of the network follows the Poisson distribution, suggesting the network is scale-free. (B) The effect of random node removal on the main closure in as function of sample rate. (C) The mean shortest transition step and maximum transition step (diameter).
FIG 4Roadmap of the global microbiome transition among habitats. (A) Bold lines are the roadmap that represents the maximum overall similarity, in which arrows indicate the transitions are bidirectional. The number of samples in each habitat is scaled by the node size, and the within-habitat transition frequency is represented by the node color depth (compared to the rim). Thin lines show the high frequent transitions between habitats. (B) Principle-coordinate analysis (PCoA) parsed from a subset of 140 microbiomes demonstrates the roadmap by the equivalent topology. (C) The phylum-level compositional shift of a microbiome transition route for marine to gut environment. (D) The genus-level compositional shift of a transition case from a freshwater microbiome to gut samples.
FIG 5Transition of the human microbiome across time and body sites. (A to C) The within-habitat microbiome transition of gut (A), oral (B), and skin (C) of individual I across 396 time points. (D) The oral-skin microbiome transition of individual I across time. Only selected samples are shown. The transition patterns of individual II are shown in Fig. S5 in the supplemental material. (E and F) PCoA of the two individuals’ time-series microbiomes: skin and oral microbiomes are linked in a closure by direct transition (highlighted by gray dotted line) and gut samples form another closure.
FIG 6Microbiome transition across habitats and geographical locations. (A) The 3,850 samples from six habitats are included in three isolated transition closures, of which the sample proportions are 43.22%, 7.90%, and 45.53%, respectively. (B) After adding an extra 1,635 bridge samples from the MSE database, the three closures merged by direct transition into a single closure, which contains 97.74% of the samples.