| Literature DB >> 30559407 |
Tommi Vatanen1, Damian R Plichta1, Juhi Somani2, Philipp C Münch3,4, Timothy D Arthur1, Andrew Brantley Hall1, Sabine Rudolf5, Edward J Oakeley6, Xiaobo Ke1,6, Rachel A Young6, Henry J Haiser6, Raivo Kolde1, Moran Yassour1,7, Kristiina Luopajärvi8,9, Heli Siljander8,9,10, Suvi M Virtanen11,12,13, Jorma Ilonen14,15, Raivo Uibo16, Vallo Tillmann17, Sergei Mokurov18, Natalya Dorshakova19, Jeffrey A Porter6, Alice C McHardy3, Harri Lähdesmäki2, Hera Vlamakis1, Curtis Huttenhower1,20, Mikael Knip8,9,10,21, Ramnik J Xavier22,23,24,25.
Abstract
The human gut microbiome matures towards the adult composition during the first years of life and is implicated in early immune development. Here, we investigate the effects of microbial genomic diversity on gut microbiome development using integrated early childhood data sets collected in the DIABIMMUNE study in Finland, Estonia and Russian Karelia. We show that gut microbial diversity is associated with household location and linear growth of children. Single nucleotide polymorphism- and metagenomic assembly-based strain tracking revealed large and highly dynamic microbial pangenomes, especially in the genus Bacteroides, in which we identified evidence of variability deriving from Bacteroides-targeting bacteriophages. Our analyses revealed functional consequences of strain diversity; only 10% of Finnish infants harboured Bifidobacterium longum subsp. infantis, a subspecies specialized in human milk metabolism, whereas Russian infants commonly maintained a probiotic Bifidobacterium bifidum strain in infancy. Groups of bacteria contributing to diverse, characterized metabolic pathways converged to highly subject-specific configurations over the first two years of life. This longitudinal study extends the current view of early gut microbial community assembly based on strain-level genomic variation.Entities:
Mesh:
Year: 2018 PMID: 30559407 PMCID: PMC6384140 DOI: 10.1038/s41564-018-0321-5
Source DB: PubMed Journal: Nat Microbiol ISSN: 2058-5276 Impact factor: 17.745
DIABIMMUNE microbiome cohort statistics.
Distribution of study subjects, stool samples with sequencing data and several other external variables across the study sites. The table shows the number of study subjects (N) per category unless otherwise specified. T1D autoantibody and diagnosis information as of Nov. 2016.
| Finland | Estonia | Russia | |
|---|---|---|---|
| Study subjects | 140 | 80 | 73 |
| Samples profiled by 16S rRNA gene sequencing (median per subject) | 2,080 (9) | 501 (6) | 623 (7) |
| Samples profiled by metagenomic sequencing (median per subject) | 616 (4) | 221 (3) | 317 (3) |
| Males/Females | 78/62 | 39/41 | 40/33 |
| Caesarean sections | 9 | 6 | 12 |
| Mean maternal age at birth (sd) | 31.1 (4.9) | 29.1 (5.1) | 27.8 (4.7) |
| Born in rural household | 10 (7.7%) | 19 (23.8%) | 0 |
| T1D AAB seropositive subjects | 11 | 4 | 1 |
| Subjects with T1D diagnosis | 5 | 1 | 1 |
Figure 1.Strain diversity across species in early gut metagenomes.
A SNP haplotype similarities per species based on all pairwise comparisons (dominant strain per species per sample) and stratified to intra-subject and inter-subject comparisons. Species containing >10 comparisons in both strata are shown. B Gene content similarities (the percentage of shared genes in the smallest of the two genomes) per species, evaluated on pangenomes generated by metagenomic assembly. Boxplots as in panel A. The box (A, B) shows the interquartile range (IQR), the vertical line shows the median and the whiskers show the range of the data (up to 1.5 times IQR). Sample size (n) per boxplot in panel A gives number of comparisons per panels A and B. C The size of core and accessory genomes per species stratified by the functional annotation of genes using eggNOG. Panels A-C are ordered according to the size of the metagenomic pangenome. D Pearson’s correlation coefficients between SNP- and gene content-based similarity measures between strains. Sample size (n) is indicated. E B. dorei strains’ SNP and gene content similarities show low Pearson’s correlation (r=0.2, n=8646 comparisons from metagenomes, n=136 comparisons of isolate genomes). Comparisons between isolate genomes are shown in orange for reference. F E. coli strains’ SNP and gene content similarities (Pearson’s r=0.88, n=16,110 comparisons).
Figure 2.Bifidobacterium strains in DIABIMMUNE children.
A Phylogenetic tree of B. longum strains in DIABIMMUNE stool samples and 18 NCBI B. longum isolate genomes based on SNP haplotypes. Highlighted B. infantis strains (red) include two reference sequences (ATCC 15697). The heatmap illustrates strain-specific carriage of 21 genes in the B. infantis HMO gene cluster responsible for intracellular HMO degradation, evaluated using the metagenomic data. B B. longum relative abundance stratified by country and B. longum strain; B. infantis (highlighted red in panel A) has, on average, higher relative abundance compared to other B. longum strains (mixed effects logistic regression p=0.00049). The box shows the interquartile range (IQR), the vertical line shows the median and the whiskers show the range of the data (up to 1.5 times IQR). Number of samples (n) are indicated below each box and include samples from subjects with no breastfeeding information. C Relative abundance of B. bifidum longitudinally stratified by the countries up to 24 months of age (n = 864). Russians have more B. bifidum, especially during the first year of life. The curves show locally weighted scatterplot smoothing (LOESS) fits for the relative abundances, and shaded areas show 95% confidence interval for each fit, as implemented in geom_smooth function in ggplot2 R package. D Phylogenetic tree of B. bifidum strains in the DIABIMMUNE stool samples based on SNP haplotypes. Strains with >99.5% sequence similarity have been collapsed into a single tip. A known strain, B. bifidum 791, was found in 79 stool samples. The scale bars on phylogenetic trees denote difference in sequence similarity of SNP haplotypes.
Figure 3.Contributional diversity of microbial pathways.
A-B We applied alpha- (A) and beta-diversity (B) to the distribution of species contributing to functional categories (GO biological process terms), measuring their contributional diversities. The histograms show the mean alpha- (A) and beta-diversities (B) per GO term stratified by time windows. Colored shapes show (A) examples of pathways with different trends and (B) mean intra- and inter-subject beta-diversities of taxonomic profiles. C-D Species contributing to (C) siderophore biosynthetic process and (D) siderophore transport. Colors displaying the contributions of individual species are linearly scaled within the log-scaled total bar height depicting the total abundance of the pathway.