| Literature DB >> 35641510 |
Yun Kit Yeoh1,2,3, Yang Sun4,5, Lawrence Yuk Ting Ip1,6, Lan Wang4,5, Francis K L Chan1,2,6, Yinglei Miao7,8, Siew C Ng9,10,11.
Abstract
Prevotella species in the human gut microbiome are primarily comprised of Prevotella copri, and its diversity and function were recently investigated in detail. Much less is known about other Prevotella species in the human gut. Here, we examined the composition of Prevotella species in human guts by mapping publicly available gut metagenomes to a dereplicated set of metagenome-assembled genomes (MAGs) representing Prevotella lineages found in human guts. In most human cohorts, P. copri is the most relatively abundant species (e.g. up to 14.3% relative abundance in Tangshan, China). However, more than half of the metagenome reads in several cohorts mapped to Prevotella MAGs representing P. stercorea and several other species sister to P. stercorea and P. copri. Analyses of genes encoded in these genomes indicated that P. stercorea and related lineages lacked many hemicellulose degrading enzymes and were thus less likely to metabolise hemicelluloses compared with P. copri and copri-related lineages. Instead, P. stercorea genomes possess several carbohydrate esterases that may be involved in releasing ester modifications from carbohydrates to facilitate their degradation. These findings reveal unexplored Prevotella diversity in the human gut and indicate possible niche partitions among these related species.Entities:
Mesh:
Year: 2022 PMID: 35641510 PMCID: PMC9156738 DOI: 10.1038/s41598-022-12721-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Phylogenetic tree representing phylogenetic relationships among Prevotella isolates and metagenome-assembled genomes (MAGs) recovered from human gut metagenomes. Five Paraprevotella genomes were used as outgroup to root the tree. MAGs included in this tree were downloaded from publicly available repositories[15,17,18] and from healthy human gut metagenome surveys of Hong Kong[24] and Yunnan populations[25]. Genomes were dereplicated at 99% average nucleotide identity (ANI) and a concatenated amino acid alignment consisting of 120 phylogenetically informative marker genes (from GTDB) was used to infer phylogenetic trees using IQTREE2. Taxonomy labels from GTDB are shown for species represented by at least 20 genomes (i.e., lower odds of spurious misassemblies or genome binning) (see Table S2 for genome counts); species boundaries are indicated by the alternating grey shades. Note that the GTDB-based taxonomy used here is not comparable to the four P. copri clades A–D from Tett et al.[1]- these four clades from Tett et al.[1] are indicated separately in the figure. The outer colour strip indicates source country/continent of the respective genomes shaded by continent.
Figure 2Estimated Prevotella relative abundances across several human gut metagenome data sets. Phylogenetic tree indicates relationships of the Prevotella genomes dereplicated at the species level (> 95% average nucleotide identity). Only genomes with reads mapped are included in this figure. Taxonomy labels are shown for species with at least 1% relative abundance (indicated with *) or at least 50% prevalence (indicated with ^) in any one data set. See Table S4 for relative abundance and prevalence values.
Distribution of total Prevotella in the human gut.
| Data set | Relative abundance ± standard deviation (%) | Prevalence (%) | Number of metagenomes |
|---|---|---|---|
| Austria | 0.09 ± 0.35 | 11.11 | 63 |
| Denmark | 2.42 ± 6.19 | 41.35 | 474 |
| Fiji | 36.08 ± 17.64 | 97.83 | 138 |
| France | 2.26 ± 5.99 | 52.46 | 61 |
| Hong Kong | 6.16 ± 13.21 | 33.27 | 547 |
| Israel | 5.82 ± 10.34 | 58.00 | 150 |
| Japan | 8.15 ± 14.75 | 41.95 | 236 |
| Lima (Peru) | 0.37 ± 1.56 | 19.48 | 77 |
| Mongolia | 19.61 ± 16.05 | 91.82 | 110 |
| El Salvador | 10.24 ± 9.70 | 94.74 | 114 |
| Spain | 3.38 ± 9.69 | 39.68 | 378 |
| Sweden | 2.58 ± 9.87 | 37.00 | 100 |
| Tangshan | 20.48 ± 24.84 | 56.63 | 196 |
| Tanzania | 7.36 ± 8.85 | 79.10 | 67 |
| USA | 1.12 ± 4.85 | 17.60 | 358 |
| Yunnan rural | 8.22 ± 12.32 | 69.32 | 427 |
| Yunnan urban | 8.70 ± 13.67 | 56.39 | 454 |
| Zhejiang | 4.87 ± 10.66 | 37.93 | 145 |
Most relatively abundant and prevalent Prevotella species in the human gut.
| Species (GTDB taxonomy) | Average relative abundance ± standard deviation (%) | Average prevalence (%) |
|---|---|---|
| 3.27 ± 3.62 | 30.73 ± 26.43 | |
| 1.10 ± 1.59 | 20.11 ± 25.05 | |
| 0.59 ± 0.82 | 16.89 ± 25.17 | |
| 0.58 ± 0.92 | 14.44 ± 20.81 | |
| 0.37 ± 0.61 | 15.53 ± 25.16 | |
| 0.28 ± 0.43 | 10.22 ± 14.77 | |
| 0.25 ± 0.29 | 8.59 ± 8.51 | |
| 0.25 ± 0.36 | 14.18 ± 22.96 | |
| 0.20 ± 0.38 | 10.48 ± 18.11 | |
| 0.19 ± 0.36 | 14.47 ± 23.20 | |
| 0.16 ± 0.37 | 8.05 ± 16.36 | |
| 0.11 ± 0.16 | 7.31 ± 13.58 | |
| 0.10 ± 0.21 | 14.75 ± 25.02 | |
| 0.09 ± 0.15 | 8.83 ± 16.88 | |
| 0.09 ± 0.21 | 7.43 ± 15.23 |
Figure 3Distribution of carbohydrate active enzyme (CAZy) families in human gut Prevotella (average copies per genome). Only genomes with > 1% relative abundance or > 50% prevalence as indicated in Fig. 2 are included. CAZy families were identified in the genomes using Hidden Markov Models. Asterisks indicate families with statistically different copy counts (FDR < 0.001 Kruskal Wallis test with false discovery rate adjustment) and with at least one copy in any one of the three groups (copri, stercorea or other). See Table S7 for copy numbers. CE carbohydrate esterase, GH glycoside hydrolase, GT glycosyl transferase, PL polysaccharide lyase.