| Literature DB >> 35531325 |
Vitor Heidrich1,2, Lilian T Inoue1, Paula F Asprino1, Fabiana Bettoni1, Antonio C H Mariotti3, Diogo A Bastos4, Denis L F Jardim4, Marco A Arap5, Anamaria A Camargo1.
Abstract
Accessibility to next-generation sequencing (NGS) technologies has enabled the profiling of microbial communities living in distinct habitats. 16S ribosomal RNA (rRNA) gene sequencing is widely used for microbiota profiling with NGS technologies. Since most used NGS platforms generate short reads, sequencing the full-length 16S rRNA gene is impractical. Therefore, choosing which 16S rRNA hypervariable region to sequence is critical in microbiota profiling studies. All nine 16S rRNA hypervariable regions are taxonomically informative, but due to variability in profiling performance for specific clades, choosing the ideal 16S rRNA hypervariable region will depend on the bacterial composition of the habitat under study. Recently, NGS allowed the identification of microbes in the urinary tract, and urinary microbiota has become an active research area. However, there is no current study evaluating the performance of different 16S rRNA hypervariable regions for male urinary microbiota profiling. We collected urine samples from male volunteers and profiled their urinary microbiota by sequencing a panel of six amplicons encompassing all nine 16S rRNA hypervariable regions. Systematic comparisons of their performance indicate V1V2 hypervariable regions better assess the taxa commonly present in male urine samples, suggesting V1V2 amplicon sequencing is more suitable for male urinary microbiota profiling. We believe our results will be helpful to guide this crucial methodological choice in future male urinary microbiota studies.Entities:
Keywords: 16S amplicon sequencing; 16S rRNA primers ; bladder microbiota; urinary microbiota; urobiome
Mesh:
Substances:
Year: 2022 PMID: 35531325 PMCID: PMC9069555 DOI: 10.3389/fcimb.2022.862338
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 6.073
Figure 1Sequencing output and taxonomic resolution for each 16S rRNA amplicon-specific dataset. (A) Number of reads generated and retained after filtering steps for each amplicon-specific dataset. (B) Relative frequency of reads retained after filtering steps averaged over all libraries for each amplicon-specific dataset. (C) Relative frequency of reads retained after filtering steps per library for each amplicon-specific dataset. (D) Percentage of sequences with assigned taxonomy (per taxonomic level) for each amplicon-specific dataset.
Figure 2Richness and phylogenetic diversity across 16S rRNA amplicon-specific datasets. (A) Amplicon sequence variant (ASV) richness per amplicon-specific dataset. (B) Taxonomic richness (phylum to species level) per amplicon-specific dataset. (C) Faith’s Phylogenetic Diversity (PD) across amplicon-specific datasets. (D) Sequence variability (entropy) along ASVs nucleotide positions (20-nucleotides rolling average) for each amplicon-specific dataset. Only nucleotide positions up to the median ASV size per amplicon-specific dataset are considered.
Figure 3Taxonomic composition across 16S rRNA amplicon-specific datasets. (A) Barplot depicting intersections between the genera detected in each amplicon-specific dataset. Total rIchness at genus level is shown in the lower-left subplot. (B) Boxplot comparing dissimilarities between different libraries and within the same libraries as profiled with different amplicons. Dissimilarity metrics considered are Bray-Curtis (BC) and Jaccard (J). Statistical significance was evaluated by the Mann-Whitney U test. The boxes highlight the median value and cover the 25th and 75th percentiles, with whiskers extending to the more extreme value within 1.5 times the length of the box. (C) Average genera relative abundance per amplicon-specific dataset. Only the 32 most abundant genera are shown (based on minimum relative abundance in at least one sample, which is adjusted for each plot). ****P < 0.0001.
Figure 4Richness and taxonomic composition of Sidle-reconstructed datasets. (A) Taxonomic richness (phylum to species level) per amplicon-specific or Sidle-reconstructed dataset. (B) Ambiguity in taxonomic assignment per amplicon-specific or Sidle-reconstructed dataset. (C) Average genera relative abundance per amplicon-specific or Sidle-reconstructed dataset. Only the 32 most abundant genera are shown (based on minimum relative abundance in at least one sample, which is adjusted for each plot).