| Literature DB >> 32397065 |
Nathanael J Bangayan1, Baochen Shi1, Jerry Trinh1, Emma Barnard1, Gabriela Kasimatis1, Emily Curd1,2,3, Huiying Li1,3.
Abstract
The microbiome plays an important role in human physiology. The composition of the human microbiome has been described at the phylum, class, genus, and species levels, however, it is largely unknown at the strain level. The importance of strain-level differences in microbial communities has been increasingly recognized in understanding disease associations. Current methods for identifying strain populations often require deep metagenomic sequencing and a comprehensive set of reference genomes. In this study, we developed a method, metagenomic multi-locus sequence typing (MG-MLST), to determine strain-level composition in a microbial community by combining high-throughput sequencing with multi-locus sequence typing (MLST). We used a commensal bacterium, Propionibacterium acnes, as an example to test the ability of MG-MLST in identifying the strain composition. Using simulated communities, MG-MLST accurately predicted the strain populations in all samples. We further validated the method using MLST gene amplicon libraries and metagenomic shotgun sequencing data of clinical skin samples. MG-MLST yielded consistent results of the strain composition to those obtained from nearly full-length 16S rRNA clone libraries and metagenomic shotgun sequencing analysis. When comparing strain-level differences between acne and healthy skin microbiomes, we demonstrated that strains of RT2/6 were highly associated with healthy skin, consistent with previous findings. In summary, MG-MLST provides a quantitative analysis of the strain populations in the microbiome with diversity and richness. It can be applied to microbiome studies to reveal strain-level differences between groups, which are critical in many microorganism-related diseases.Entities:
Keywords: MLST; Propionibacterium acnes; metagenomics; method; microbiome; strain
Year: 2020 PMID: 32397065 PMCID: PMC7284976 DOI: 10.3390/microorganisms8050684
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Figure 1The strain composition predicted by STRUCTURE is highly consistent with the expected P. acnes populations based on simulated data. The top panel shows the ribotype (RT) group composition predicted by STRUCTURE based on the simulated communities. The bottom panel shows the expected RT group composition. Each column represents the relative abundances of the RT groups in each sample. The expected composition of each simulated community is listed in Table S2. Set A communities were generated to contain a single RT group per sample. Set B communities were generated to randomly have varying relative abundances of the RT groups. Five representative samples among the total 100 simulated communities are shown. Figure S2 lists all 100 communities of Set B. Set C communities were generated to mimic the population structures from previously characterized clinical skin samples [5]. Pearson’s correlations were calculated to compare the predicted population composition with the expected data as shown on the top.
Figure 2The strain composition predicted by metagenomic multi-locus sequence typing (MG-MLST) is highly consistent with the P. acnes population structure based on 16S ribotyping. The first column of each sample represents the predicted strain composition using the sequence data obtained from the 454 amplicon library. The second column represents the strain composition based on the 16S ribotype data [5]. Pearson’s correlations between the two methods are shown on the top.
Figure 3Comparison of the MLST schemes in predicting strain composition using MG-MLST. (a) Ribotype (RT) group composition predicted by STRUCTURE based on the Aahrus four gene set (fba, lac, recA, and zno). (b) RT group composition predicted by STRUCTURE based on the Belfast MLST4 scheme (aroE, guaA, tly, and camp2). (c) RT group composition predicted by STRUCTURE based on the combined eight gene set (Aarhus–Belfast) (fba, lac, recA, zno, aroE, guaA, tly, and camp2). (d) RT group composition based on 16S ribotype data. Pearson’s correlations are shown in Table S3.
Comparison of top strain assignment between MG-MLST and MetaMLST.
| Sample | MG-MLST | MetaMLST | |||
|---|---|---|---|---|---|
| Sequence Type (ST) Assigned | Ribotype (RT) Assigned | Relative Abundance of RT | Sequence Type (ST) Assigned | Ribotype (RT) Assigned | |
| H01 | 6,7,25,27,28,30 | 2/6 | 0.869 | 7 | 6 |
| H02 | 6,7,25,27,28,30 | 2/6 | 0.704 | 30 | 2 |
| H03 | 6,7,25,27,28,30 | 2/6 | 0.886 | 100 | New |
| H04 | 1,5 | 1 | 0.656 | New | New |
| H05 | 2 | 3 | 0.691 | 2 | 3 |
| H06 | 1,5 | 1 | 0.511 | 5 | 1 |
| H07 | 1,5 | 1 | 0.634 | New | New |
| H08 | 1,5 | 1 | 0.378 | New | New |
| H09 | 1,5 | 1 | 0.477 | 4 | 8 |
| H10 | 4,13,21 | 8 | 0.443 | New | New |
| H11 | 4,13,21 | 8 | 0.922 | 4 | 8 |
| H12 | 4,13,21 | 8 | 0.912 | 4 | 8 |
| H13 | 1,5 | 1 | 0.414 | 115 | 1 |
| A01 | 1,5 | 1 | 0.935 | 1 | 1 |
| A02 | 1,5 | 1 | 0.840 | New | New |
| A03 | 1,5 | 1 | 0.851 | 53 | New |
| A04 | 2,22,23,24,36,91 | 3 | 0.779 | 2 | 3 |
| A05 | 2 | 3 | 0.537 | New | New |
| A06 | 1,5 | 1 | 0.825 | New | New |
| A07 | 2 | 3 | 0.862 | New | New |
| A08 | 6,7,25,27,28,30 | 1 | 0.360 | New | New |
| A09 | 3,10,11,17,70 | 4/5 | 0.468 | New | New |
| A10 | 1,5 | 1 | 0.414 | New | New |
| A11 | 3,10,11,17,70 | 4/5 | 0.460 | New | New |
| A12 | 2,22,23,24,36,91 | 3 | 0.747 | 22 | 3 |
| A13 | 4,13,21 | 8 | 0.538 | New | New |
Samples highlighted in gray have consistent assignment between the two methods.