| Literature DB >> 29531289 |
Ahmed Moustafa1, Weizhong Li1,2, Harinder Singh3, Kelvin J Moncera2, Manolito G Torralba2, Yanbao Yu3, Oriol Manuel4, William Biggs1, J Craig Venter1,2,3, Karen E Nelson1,2,3, Rembert Pieper5,6, Amalio Telenti7.
Abstract
Urine culture and microscopy techniques are used to profile the bacterial species present in urinary tract infections. To gain insight into the urinary flora, we analyzed clinical laboratory features and the microbial metagenome of 121 clean-catch urine samples. 16S rDNA gene signatures were successfully obtained for 116 participants, while metagenome sequencing data was successfully generated for samples from 49 participants. Although 16S rDNA sequencing was more sensitive, metagenome sequencing allowed for a more comprehensive and unbiased representation of the microbial flora, including eukarya and viral pathogens, and of bacterial virulence factors. Urine samples positive by metagenome sequencing contained a plethora of bacterial (median 41 genera/sample), eukarya (median 2 species/sample) and viral sequences (median 3 viruses/sample). Genomic analyses suggested cases of infection with potential pathogens that are often missed during routine urine culture due to species specific growth requirements. While conventional microbiological methods are inadequate to identify a large diversity of microbial species that are present in urine, genomic approaches appear to more comprehensively and quantitatively describe the urinary microbiome.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29531289 PMCID: PMC5847550 DOI: 10.1038/s41598-018-22660-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Definition of clinical and laboratory groups. The study used an unbiased approach to the classification of specimens using 20 parameters from the laboratory analysis of urine. (A) Explained variance from PCA; the first two PCs were retained for downstream clustering analyses. (B) Contributing factors (loadings) to the first two PCs. Note that the directionality of the loadings reflect enrichment independently of sign and direction. (C) Clustering of samples is based on the method of partitioning around medoids (pam).
Microbiome Sequencing performance rate. Clusters are defined on the basis of clinical and laboratory metadata. Cluster 1 is interpreted as reflecting contamination, Cluster 2 is most consistent with urinary infection, Cluster 2 is of unclear nature.
| Microbiome | Cluster 1 | Cluster 2 | Cluster 3 | Total | ||||
|---|---|---|---|---|---|---|---|---|
| Count | Percent | Count | Percent | Count | Percent | Count | Percent | |
| MG + 16S | 5 | 56% | 35 | 56% | 7 | 14% | 47 | 39% |
| MG | 0 | 0% | 1 | 2% | 1 | 2% | 2 | 2% |
| 16S | 4 | 44% | 27 | 43% | 38 | 78% | 69 | 57% |
| None | 0 | 0% | 0 | 0% | 3 | 6% | 3 | 2% |
| Total | 9 | 63 | 49 | 121 | ||||
MG: metagenome sequencing, 16S: 16S rDNA sequencing.
Figure 2Normalized abundance of bacterial genera across the clinical groups using 16S rDNA. 116 samples were successfully analysed by 16S rDNA sequencing and grouped according to the clinical laboratory clusters. Proteobacteria were the predominant phylum in Cluster 2 - the cluster that represents infection, with prominent identification of Citrobacter, Enterobacter, Escherichia. Clusters 1 and 3 were more diverse in composition (Fig. S1).
Figure 3Metagenome sequencing mapped reads per sample. 49 samples were successfully sequenced and grouped according to the clinical laboratory clusters. Each point represents a sample. The thick line in the boxplot represents the median number of reads for the cluster.
Figure 4Ranking of bacterial genera by counts from metagenome sequencing across clinical laboratory clusters. Shown are bacteria observed with at least 1% of total reads in a sample. Analyses reflect results from 49 samples that were successfully sequenced and grouped according to the clinical laboratory clusters. Each point represents a genus in a sample. The horizontal represents the median number of reads for the genus.
Relationship between metagenome sequencing and routine culture.
| Clinical and laboratory Cluster | Majority species (≥10E5 reads) | Number of samples | Culture results |
|---|---|---|---|
| Cluster 1 | 1 | Negative | |
| 1 | Negative | ||
| 1 | Negative | ||
| 1 | Negative | ||
| Cluster 2 |
| 5 | |
|
| 3 | Negative | |
|
| 2 | Negative | |
|
| 1 | Mixed | |
| 1 | Mixed | ||
| 1 |
| ||
| 1 |
| ||
| 1 |
| ||
| 1 | Negative | ||
|
| 1 |
| |
|
| 1 | Mixed | |
| 1 | Negative | ||
|
| 1 | Negative | |
| 1 | Negative | ||
| 1 |
| ||
| 1 | Mixed | ||
| Cluster 3 |
| 1 | Negative |
Figure 5Virulence factors across clinical laboratory clusters. Metagenome sequencing data was used to search for open reading frames (ORFs) compared against the database VFDB[25] to identify virulence factor genes with over 95% sequence identity. Listed are the factors identified in the dataset, grouped by taxonomic binning, with the VFDB accession number in parenthesis. The left panel shows enrichment in the abundance of ORFs across clusters. Here, the abundance is the depth of coverage of the genome where the ORFs were predicted. The right panel shows prevalence of samples that contain organisms carrying the corresponding virulence factor in each cluster.
Figure 6Eukarya read counts across clinical laboratory clusters. Shown are eukarya observed with at least 10 sequence reads in a sample. Analyses reflect results from 49 samples that were successfully sequenced and grouped according to the clinical laboratory clusters. Each point represents a species in a sample.
Figure 7Viral read counts across clinical laboratory clusters. Shown are viruses observed with at least one sequence read in a sample. Analyses reflect results from 49 samples that were successfully sequenced and grouped according to the clinical laboratory clusters. Each point represents a virus in a sample.