| Literature DB >> 33329454 |
Sein Park1,2, Martin Steinegger2,3, Ho-Seong Cho4, Jongsik Chun1,2,3.
Abstract
Limosilactobacillus reuteri is a model symbiont that colonizes the guts of vertebrates in studies on host adaptation of the gut symbiont. Previous studies have investigated host-specific phylogenetic and functional properties by isolating the genomic sequence. This dependency on genome isolation is a significant bottleneck. Here, we propose a method to study the association between L. reuteri and its hosts directly from metagenomic reads without strain isolation using pan-genomes. We characterized the host-specificity of L. reuteri in metagenomic samples, not only in previously studied organisms (mice and pigs) but also in dogs. For each sample, two types of profiles were generated: (1) genome-based strain type abundance profiles and (2) gene composition profiles. Our profiles showed host-association of L. reuteri in both phylogenetic and functional aspects without depending on host-specific genome isolation. We observed not only the presence of host-specific lineages, but also the dominant lineages associated with the different hosts. Furthermore, we showed that metagenome-assembled genomes provide detailed insights into the host-specificity of L. reuteri. We inferred evolutionary trajectories of host-associative L. reuteri strains in the metagenomic samples by placing the metagenome-assembled genomes into a phylogenetic tree and identified novel host-specific genes that were unannotated in existing pan-genome databases. Our pan-genomic approach reduces the need for time-consuming and expensive host-specific genome isolation, while producing consistent results with previous host-association findings in mice and pigs. Additionally, we predicted associations that have not yet been studied in dogs.Entities:
Keywords: Limosilactobacillus reuteri; host-specificity; host-symbiont interaction; metagenome; pan-genome
Year: 2020 PMID: 33329454 PMCID: PMC7717999 DOI: 10.3389/fmicb.2020.585622
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Overview of analysis. (A) Construction of the Kraken database of genome-based strain types (GSTs) based on the phylogenetic tree. (B) Construction of the pan-genome database. (C) Profiling the metagenome samples based on the reference databases built above.
FIGURE 2GST abundance profiles. (A) Population structures of L. reuteri in metagenome samples are visualized as Principal Coordinate Analysis (PCoA) plots with weighted UniFrac distance metric. (B) GST abundance profiles are represented as a heatmap, visualizing the relative abundance of the GSTs. Phylogenetic relationships between the GSTs are illustrated as a tree on the left (branch lengths are ignored). (C) A heatmap visualizing the relative abundance of host-specific lineages assigned from the GSTs. The host groups are shown in different colors on the bottom. Scatter and box plots representing the relative abundance of (D) GSTs and (E) host-specific lineages of each host group and sample.
FIGURE 3Phylogenetic trees of MAGs. Phylogenetic trees display the placement of L. reuteri MAGs assembled from the reads of the metagenome samples in the (A) “Pig” group, (B) “Mouse 1” and “Mouse 3” groups, (C) “Mouse 2” and “Mouse 4” groups and (D) “Dog” group, respectively. (E) A phylogenetic tree inferred from the MAGs without reference genomes.
FIGURE 4Host-specific functional structures of each host group. The stacked bar plots visualize the proportion of the functional categories of host-specific genes. (A) represents the host-specific genes identified from the pan-genome database, while (B) represents those identified from the pan-genome database and the MAGs.