| Literature DB >> 31911589 |
Chang Liu1,2, Nan Zhou1,2, Meng-Xuan Du1, Yu-Tong Sun1, Kai Wang3,4, Yu-Jing Wang1,3, Dan-Hua Li1, Hai-Ying Yu1, Yuqin Song1,2, Bing-Bing Bai5, Yuhua Xin6, Linhuan Wu6, Cheng-Ying Jiang1,2,3, Jie Feng1,2, Hua Xiang1, Yuguang Zhou6, Juncai Ma6, Jun Wang3,5, Hong-Wei Liu7,8, Shuang-Jiang Liu9,10,11.
Abstract
Mice are widely used as experimental models for gut microbiome (GM) studies, yet the majority of mouse GM members remain uncharacterized. Here, we report the construction of a mouse gut microbial biobank (mGMB) that contains 126 species, represented by 244 strains that have been deposited in the China General Microorganism Culture Collection. We sequence and phenotypically characterize 77 potential new species and propose their nomenclatures. The mGMB includes 22 and 17 species that are significantly enriched in ob/ob and wild-type C57BL/6J mouse cecal samples, respectively. The genomes of the 126 species in the mGMB cover 52% of the metagenomic nonredundant gene catalog (sequence identity ≥ 60%) and represent 93-95% of the KEGG-Orthology-annotated functions of the sampled mouse GMs. The microbial and genome data assembled in the mGMB enlarges the taxonomic characterization of mouse GMs and represents a useful resource for studies of host-microbe interactions and of GM functions associated with host health and diseases.Entities:
Mesh:
Year: 2020 PMID: 31911589 PMCID: PMC6946648 DOI: 10.1038/s41467-019-13836-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1The taxonomic diversity and contribution of mGMB to the mouse intestinal bacterial collection.
a The cladogram displays the taxonomic diversity of bacteria in the mGMB and miBC. The mGMB and miBC together have 180 bacterial species from 110 genera, 33 families, and 5 phyla. The background is color-coded according to phyla. All 104 species unique to the mGMB are marked with star symbols, and the 77 new species are indicated with red stars. All 62 genera uniquely represented by mGMB members are indicated with pentagon symbols, and the 43 newly identified genera are indicated with red pentagons. The five families unique to the mGMB are symbolized by solid circles. The 126 species in the mGMB are indicated by the first-level purple external ring (labeled as mGMB in panel a). The 76 species in the miBC are indicated by the second-level green external ring (labeled as miBC in panel a). The 22 species in both the mGMB and miBC are indicated by the third-level yellow external ring (labeled as in common in panel a). b The Venn diagram displays the read coverage of mGMB and miBC to the 16S rRNA gene amplicon dataset of mouse samples (n = 93) from miBC work[12] at species level (sequence identity > 97%). The miBC-specific (2.27 ± 0.47%): the proportion of reads uniquely covered by the miBC; The mGMB-specific (25.92 ± 1.12%): the proportion of reads uniquely covered by the mGMB; The In-common (16.11 ± 1.58%): the proportion of reads shared by both collections. The coverage rate is present as mean ± SEM. Source data are provided as a Source Data file.
Fig. 2The 16S rRNA gene amplicon-based analysis of the prevalence of novel species in the mGMB.
a Radar plot depicting the prevalence of 77 novel taxa within the host-associated microbiotas from diverse hosts. The numbers of novel species that got hits on OTUs in datasets were marked in orange. OB.CNA: the gut microbiota of ob/ob mice from China (n = 12), WT.CNA: the gut microbiota of C57BL/6J mice from China (n = 12), OB.DEN: the gut microbiota of ob/ob mice from Denmark (n = 239), WT.DEN: the gut microbiota of C57BL/6J mice from Denmark (n = 120), OB.tre = t: the gut microbiota of ob/ob mice from China treated with anti-metabolic-syndrome medicine SA-7 (n = 31), DIO.USA: the gut microbiota of diet-induced obese mice from the USA (n = 25), CD1.USA: the gut microbiota of outbred CD1 mice from the USA (n = 208), miBC: the gut microbiota of mice with different genetic backgrounds and housed in various facilities in Europe and America from miBC (n = 93), HG.AUS: the gut microbiota of humans from Australia (n = 300), HG.USA: the gut microbiota of humans from USA (n = 97), MG: the gut microbiota of rhesus monkeys from China (n = 160), HV: the microbiota of the human vagina (n = 20), HO: the microbiota of the human oral cavity (n = 66). b, c The novel taxa improved the annotation rate of the 16S rRNA gene amplicon data of murine GMs at the genus level (b) and at the species level (c). LTP version_132 (olive drab): data annotated using LTP database version 132, LTP version_mGMB (light yellow): data annotated using a customized LTP database by supplementing the 16S rRNA gene sequences of 77 novel species; data are shown in box-and-whiskers plot, center line: median, bounds of box: quartile, whiskers: Tukey extreme; the mean ± SEM of annotation rates using different databases were given in the panels and were statistically determined to be significantly different (p < 0.001) by t test. The n numbers represent the biologically independent samples. Source data are provided as a Source Data file.
Fig. 3The mGMB defines the core- and pan-microbiota of mouse gut microbiota.
a The mGMB coverage of the core- and pan-genera of the ob/ob mice. b, The mGMB coverage of the core- and pan-genera of the diverse-background mice. The bar chart shows the frequency of occurrence (FO) of each genus in the analyzed samples (definition: FO = 100% is defined when a genus is present in all samples, while FO = 0 is defined when a genus is present in none of the samples), n = 274 in (a), n = 93 in (b); the box-and-whiskers plot shows the relative abundance (RA) of each taxon, center line: median, bounds of box: quartile, whiskers: Tukey extreme. The RA is exhibited in the percentage value logarithm. Core-genera: genera with FO > 80% and an average RA > 0.1% (log 10 (RA) > −3); pan-genera: genera with FO > 5%. The cutoff values for core- and pan-genera are marked with vertical dashed lines in the panel; purple/blue: genera covered by the mGMB; cream: genera not covered by the mGMB; purple triangle markers: core-genera for both the ob/ob and diverse-background mice[12]. The n numbers represent the biologically independent samples. Source data are provided as a Source Data file.
Fig. 4The metagenomic functions covered by the mGMB and the GM features associated with mouse phenotypes.
a The mapping profile of metagenomic short reads of OB and WT to the mGMB pan-genome. OB: metagenomes of ob/ob mouse GMs (n = 6); WT: metagenomes of C57BL/6J mouse GMs (n = 6). The Anvi’o tree displays the hierarchical clustering of the mGMB pan-genome; WT layers a–f represent the detection of each split in WT; OB a–f represents the detection of each split in OB; the outermost yellow layer displays the splits in the pan-genome mapped by the metagenomic reads; the most outer blue layer displays the splits in the pan-genome unmapped to the metagenomic reads. b KO-based coverage of mouse gut microbiomes by genomes randomly picked from the mGMB. OB: the KEGG Ortholog (KO) pool of metagenomes of ob/ob mouse GMs (n = 6); WT: the KO pool of metagenomes of C57BL/6J mouse GMs (n = 6); OB-specific: KOs appearing specifically in OB; WT-specific: KOs appearing specifically in WT; iMGMC gene catalog: the integrated mouse gut metagenome catalog (iMGMC) comprising 4.6 million unique genes and 660 high-quality metagenome-assembled genomes[27]. Data are shown with box plot, center line: median, bounds of box: extreme. c The features of the gut microbiota at the genus level associated wild-type (C57BL/6J) and metabolic-syndrome (ob/ob C57BL/6J) mice. OB: the 16S rRNA gene amplicons of ob/ob mouse cecal samples (n = 12) used for previous bacterial isolation; WT: the 16S rRNA gene amplicons of the counterpart wild-type C57BL/6J mice (n = 12). Data are shown in a box-and-whiskers plot, center line: median, bounds of box: quartile, whiskers: Tukey extreme; statistical comparisons of the data between the two groups used the t test, ∗P < 0.05; ∗∗P < 0.01; ∗∗∗P < 0.001; purple: enriched in OB, blue: enriched in WT; mint-colored background: genera exclusively presenting in either the OB or WT groups; cream-colored background: genera presenting in both groups but with significant difference in RA; gray genus names: genera that were not recovered by the mGMB. The n numbers represent the biologically independent samples. P-values are provided as a Source Data file.
Relative abundances of mGMB bacteria in ob/ob and C57BL/6J mice.
| Taxon number | Taxonomy | RA of WT | RA of OB | Significance | Taxon number | Taxonomy | RA of WT | RA of OB | Significance | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Taxon 151 | 7.05E–03 | 2.34E–04 | *** | WT | Taxon 43 | 7.75E–04 | 3.33E–03 | *** | OB | ||
| ns | Taxon 61 | 1.36E–03 | 4.06E–03 | ns | |||||||
| Taxon 4 | 0.00E+00 | 1.40E–04 | ns | Taxon 62 | 1.00E–03 | 1.50E–03 | ns | ||||
| Taxon 73 | 8.53E–03 | 1.06E–02 | ns | ||||||||
| Taxon 8 | 2.29E–06 | 1.15E–05 | Taxon 74 | 1.17E–03 | 2.40E–03 | ns | |||||
| Taxon 111 | 9.18E–06 | 9.18E–06 | ns | Taxon 78 | 5.57E–04 | 5.30E–04 | ns | ||||
| Taxon 81 | 2.56E-03 | 6.26E–04 | *** | WT | |||||||
| Taxon 33 | 1.91E–02 | 1.25E–02 | ns | Taxon 82 | 2.52E–05 | 2.02E–04 | ns | ||||
| Taxon 53 | 3.92E–02 | 2.58E–02 | ns | Taxon 83 | 8.49E–05 | 9.41E–05 | ns | ||||
| Taxon 54 | 3.38E–03 | 3.55E–03 | ns | Taxon 85 | 1.08E–04 | 9.41E–05 | ns | ||||
| Taxon 141 | 1.74E–04 | 7.57E–05 | *** | WT | Taxon 86 | 1.08E–04 | 1.61E–04 | ns | |||
| Taxon 152 | 1.02E–03 | 3.88E–04 | * | WT | Taxon 87 | 6.31E–03 | 4.60E–03 | ns | |||
| Taxon 153 | 1.67E–02 | 1.29E–02 | ns | Taxon 88 | 1.22E–03 | 9.08E–04 | ns | ||||
| Taxon 89 | 1.82E–03 | 1.36E–03 | ns | ||||||||
| Taxon 17 | 7.69E–04 | 2.89E–04 | ns | Taxon 90 | 2.66E–04 | 9.34E–04 | ns | ||||
| Taxon 71 | 7.11E–05 | 3.21E–05 | * | WT | Taxon 93 | 1.09E–03 | 1.85E–03 | ns | |||
| Taxon 94 | 5.74E–05 | 1.72E–04 | ** | OB | |||||||
| Taxon 10 | 7.96E–04 | 6.42E–04 | * | OB | Taxon 99 | 3.52E–03 | 1.16E–02 | * | OB | ||
| Taxon 42 | 6.77E–04 | 1.51E–03 | ns | Taxon 101 | 1.00E–03 | 2.66E–03 | ns | ||||
| Taxon 45 | 8.81E–04 | 4.28E–03 | ** | OB | Taxon 114 | 2.64E–03 | 6.09E–03 | * | OB | ||
| Taxon 47 | 4.82E–04 | 2.04E–03 | ** | OB | Taxon 116 | 2.64E–03 | 6.09E–03 | * | OB | ||
| Taxon 50 | 9.81E–03 | 6.86E–03 | ns | Taxon 117 | 2.64E–03 | 6.09E–03 | * | OB | |||
| Taxon 92 | 7.02E–04 | 3.53E–03 | ** | OB | Taxon 118 | 1.40E–02 | 1.69E–02 | ns | |||
| Taxon 96 | 4.57E–04 | 2.95E–03 | * | OB | Taxon 119 | 1.87E–02 | 2.11E–02 | ns | |||
| Taxon 103 | 1.86E–03 | 3.93E–03 | ns | Taxon 120 | 3.18E–03 | 4.70E–03 | ns | ||||
| Taxon 115 | 1.45E–02 | 7.07E–03 | ns | Taxon 131 | 5.05E–05 | 4.07E–03 | ns | ||||
| Taxon 13 | 1.62E–03 | 2.23E–03 | ns | Taxon 29 | 2.00E–02 | 1.63E–02 | ** | WT | |||
| Taxon 40 | 2.98E–05 | 9.64E–05 | ns | Taxon 52 | 6.88E–04 | 4.57E–04 | ns | ||||
| Taxon 104 | 9.06E–04 | 9.61E–04 | ns | Taxon 64 | 6.62E–03 | 5.07E–03 | ns | ||||
| Taxon 105 | 1.62E–03 | 2.23E–03 | ns | Taxon 72 | 5.05E–02 | 2.95E–02 | ** | WT | |||
| Taxon 41 | 1.18E–02 | 9.03E–03 | ns | Taxon 68 | 4.36E–05 | 8.05E–04 | ns | ||||
| Taxon 60 | 1.18E–02 | 9.01E–03 | ns | ||||||||
| Taxon 76 | 3.05E–03 | 1.01E–02 | * | OB | Taxon 123 | 3.26E–03 | 3.83E–03 | ns | |||
| Taxon 130 | 1.18E–02 | 1.00E–02 | ns | Taxon 125 | 3.20E–03 | 2.48E–03 | ns | ||||
| Taxon 132 | 4.42E–03 | 2.44E–03 | ** | WT | Taxon 126 | 7.10E–03 | 7.51E–03 | ns | |||
| Taxon 133 | 1.18E–02 | 1.00E–02 | ns | Taxon 127 | 8.42E–04 | 1.03E–03 | ns | ||||
| Taxon 129 | 2.82E–03 | 4.12E–03 | ns | ||||||||
| Taxon 51 | 6.88E–05 | 1.70E–04 | ns | ||||||||
| Taxon 55 | 2.28E–03 | 2.45E-03 | ns | ||||||||
| Taxon 19 | 4.86E–04 | 7.11E–05 | ** | WT | Taxon 106 | 6.88E–06 | 1.61E-05 | ns | |||
| Taxon 22 | 3.23E–04 | 4.96E–04 | ns | ||||||||
| Taxon 23 | 3.23E–04 | 3.62E–04 | ns | Taxon 145 | 9.18E–06 | 1.93E–04 | ns | ||||
| Taxon 25 | 6.88E–06 | 1.86E–04 | ns | ||||||||
| Taxon 142 | 2.84E–04 | 2.71E–04 | ns | Taxon 67 | 3.04E–03 | 1.39E–03 | *** | WT | |||
| Taxon 143 | 2.84E–04 | 4.22E–04 | ns | ||||||||
| Taxon 144 | 2.84E–04 | 4.22E–04 | ns | Taxon 11 | 5.02E–04 | 1.50E–03 | ** | OB | |||
| Taxon 12 | 2.07E–03 | 4.15E–03 | ns | ||||||||
| Taxon 5 | 3.44E–05 | 2.00E–04 | ns | Taxon 37 | 1.17E–03 | 3.23E–03 | *** | OB | |||
| Taxon 6 | 3.44E–05 | 4.36E–05 | ns | Taxon 38 | 4.29E–04 | 1.07E–03 | ** | OB | |||
| Taxon 108 | 3.44E–05 | 3.90E–05 | ns | Taxon 39 | 4.29E–04 | 9.27E–04 | ** | OB | |||
| Taxon 109 | 3.44E–05 | 3.90E–05 | ns | Taxon 56 | 5.05E–05 | 5.05E–05 | ns | ||||
| Taxon 75 | 5.04E–03 | 9.86E–03 | ns | ||||||||
| Taxon 59 | 9.68E–04 | 2.06E–05 | ** | WT | Taxon 95 | 9.81E–03 | 6.92E–03 | ns | |||
| Taxon 66 | 2.37E–03 | 1.72E–04 | *** | WT | Taxon 121 | 2.17E–03 | 2.38E–03 | ns | |||
| Taxon 122 | 4.29E–04 | 1.03E–03 | * | OB | Taxon 139 | 1.17E–03 | 3.04E–03 | *** | OB | ||
| Taxon 140 | 1.99E–03 | 5.30E–04 | *** | WT | |||||||
| Taxon 97 | 9.66E–04 | 2.55E–04 | ns | ||||||||
| Taxon 98 | 2.12E–03 | 2.66E–04 | *** | WT | Taxon 110 | 3.44E–05 | 2.06E–05 | ns | |||
| Taxon 1 | 3.47E–03 | 1.22E–02 | *** | OB | Taxon 34 | 5.01E–03 | 1.44E–03 | ** | WT | ||
| Taxon 14 | 9.41E–05 | 4.43E–04 | ** | OB | Taxon 77 | 7.46E–04 | 7.80E–04 | ns | |||
| Taxon 15 | 3.49E–04 | 2.20E–03 | ** | OB | Taxon 149 | 5.01E–03 | 2.13E–03 | * | WT | ||
| Taxon 2 | 1.73E–02 | 2.23E–02 | ns | Taxon 154 | 5.01E–03 | 1.37E–03 | ** | WT | |||
| Taxon 3 | 1.74E–02 | 3.64E–02 | * | OB | |||||||
| Taxon 107 | 0.00E+00 | 1.84E–05 | ns |
The statistical analysis of data between OB and WT groups used the t test, and the test significance is marked with asterisks: ∗P < 0.05; ∗∗P < 0.01; ∗∗∗P < 0.001, (ns) not significant. The taxon enriched in OB group is marked with OB next to its significant marks, while taxon enriched in WT is marked with WT next to its significant marks