| Literature DB >> 35585088 |
Jinyuan Yan1, Chen Liao2, Bradford P Taylor3, Emily Fontana4, Luigi A Amoretti4, Roberta J Wright4, Eric R Littmann5, Anqi Dai6, Nicholas Waters6, Jonathan U Peled6,7, Ying Taur4, Miguel-Angel Perales6,7, Benjamin A Siranosian8, Ami S Bhatt8,9,10, Marcel R M van den Brink6,7, Eric G Pamer5, Jonas Schluter11, Joao B Xavier12.
Abstract
Hospitalized patients receiving hematopoietic cell transplants provide a unique opportunity to study the human gut microbiome. We previously compiled a large-scale longitudinal dataset of fecal microbiota and associated metadata, but we had limited that analysis to taxonomic composition of bacteria from 16S rRNA gene sequencing. Here we augment those data with shotgun metagenomics. The compilation amounts to a nested subset of 395 samples compiled from different studies at Memorial Sloan Kettering. Shotgun metagenomics describes the microbiome at the functional level, particularly in antimicrobial resistances and virulence factors. We provide accession numbers that link each sample to the paired-end sequencing files deposited in a public repository, which can be directly accessed by the online services of PATRIC to be analyzed without the users having to download or transfer the files. Then, we show how shotgun sequencing enables the assembly of genomes from metagenomic data. The new data, combined with the metadata published previously, enables new functional studies of the microbiomes of patients with cancer receiving bone marrow transplantation.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35585088 PMCID: PMC9117330 DOI: 10.1038/s41597-022-01302-9
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1The metagenomic samples cover the majority of microbiome compositional states observed in fecal samples from allo-HCT patients. (a) The t-SNE plot built using the taxonomic composition obtained by 16S amplicon sequencing of >10,000 samples from >1,000 unique patients;[13] the different colors indicate the most abundant taxon in each sample. (b) Location of nested subset of 395 samples from 49 unique patients with shotgun sequencing is broadly distributed across the entire map. (c) The sequencing depth of shotgun sequenced samples varies between 106 reads to 108 reads, with outliers in the liquid samples whose microbiome may yield different sizes of libraries.
Details for normalization and inclusion of taxa in the 16S vs. shotgun comparison.
| Analysis step | Input | Output | Procedure |
|---|---|---|---|
| Normalization of shotgun taxa | Table of reads aligned to taxa produced by the Kraken2 pipeline implemented in PATRIC[ | Table of relative abundances or bacterial genus. | A table of read counts per genus was first made for each sample. The total number of genus-aligned reads in that sample was then used to normalize the abundance of each genus. |
| Joining the tables of 16S abundances and shotgun abundances | Two relative abundance tables representing the genera present in each sample, one for 16S and one for shotgun. | A single table showing the relative abundances of each genus computed by the two methods. | The tables were joined using the logic of inner joining: only the genera present in both tables were included in the output table. Genera undetected in any of the samples (missing data) were set to 0. |
| Comparing shotgun and 16S at higher taxonomic levels | The genus-level table produced in the step above. | A table at each taxonomic level (family, order, class phylum) | The method aggregates the genus at each higher taxonomic level by adding relative abundances. |
Fig. 2Taxonomic composition of the microbiome in patient stool samples agrees in general between shotgun sequencing and 16S rRNA amplicon sequencing, with some notable differences. (a,b) The taxonomic composition is determined by 16S rRNA sequencing (A) and shotgun metagenomics (B) for the samples from a single patient (PatientID 1252). The samples are ordered in time and the dashed line separates the samples collected before and after allo-HCT. (c) The median composition (red dot) in Firmicutes can be notably different when determined using the two approaches (ranksum test, p < 0.05).
Fig. 3Correlation between the taxonomic classifications obtained by shotgun sequencing and 16S rRNA amplicon sequencing. The correlation between the two approaches is different at each taxonomic level, but seems unaffected by the read depth of each sample (a–e). Formed stool mainly contains samples with higher diversity, and high diversity samples usually display lower correlation between the two sequencing pipelines (f–j). Each point is a taxon from one of 395 samples. Black dots in (f–j) indicate the median of each category. The numbers on the x-axes display the number of samples in different diversity groups.
Fig. 4The shotgun sequencing detects the presence of antibiotic resistance genes, using the PATRIC service with the CARD database. (a) Localization of the vanA(+/−) samples in 16S clustering map shows a high concentration of vanA(+) samples in the region of domination by Enterococcus (green in Fig. 1a). (b) PCR(+) samples have higher relative abundance of the vanA gene detected by shotgun sequencing. (c) The vanA and vanB genes are practically mutually exclusive in patients’ stool samples. The samples with two genes simultaneously detected represent a very small fraction of the total samples. The abundances of the two genes are not correlated.
Fig. 5Shotgun sequencing data provide metagenomically-assembled genomes (MAGs) that compare well with the genomes of isolates from the same patient stool samples. MAGs from E. faecium obtained from different samples collected from patient 1044 reveal an intraspecies diversity. The phylogenetic tree contains the 7 MAGs and 26 E. faecium genomes obtained from isolates and analyzed in a previous study[34]. The number of days after each sample is the day relative to the HCT of this patient.
| Measurement(s) | Metagenomics |
| Technology Type(s) | next generation DNA sequencing |
| Sample Characteristic - Organism | Bacteria |
| Sample Characteristic - Environment | gut microbiome |
| Sample Characteristic - Location | New York City |