| Literature DB >> 33952321 |
Jonathan L Golob1, Amy D Willis2, Samuel S Minot3, Kevin C Barry4, Caroline Kasman5.
Abstract
Researchers must be able to generate experimentally testable hypotheses from sequencing-based observational microbiome experiments to discover the mechanisms underlying the influence of gut microbes on human health. We describe geneshot, a novel bioinformatics tool for identifying testable hypotheses based on gene-level metagenomic analysis of WGS microbiome data. By applying geneshot to two independent previously published cohorts, we identify microbial genomic islands consistently associated with response to immune checkpoint inhibitor (ICI)-based cancer treatment in culturable type strains. The identified genomic islands are within operons involved in type II secretion, TonB-dependent transport, and bacteriophage growth.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33952321 PMCID: PMC8097837 DOI: 10.1186/s13059-021-02355-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Schematic diagram of the geneshot tool for gene-level metagenomic analysis of microbiome experiments. a Description of the data flow through geneshot, with the coordinated execution of multiple constituent bioinformatic processes in order to analyze the experimental data on the basis of protein-coding genes encoded by microbes present in each physical specimen. De novo assembly is performed independently for each biological specimen, and the resulting protein-coding sequences are aggregated across all assemblies prior to deduplication. Alignment of WGS reads to the deduplicated gene catalog is performed independently for each specimen, and the resulting abundance data is aggregated across all specimens in order to compute pairwise gene co-abundance values. b Summary of the geneshot analysis to identify microbes associated with ICI response from published WGS datasets, indicating the number of biological features identified at each step
Fig. 2Genomic coordinates of ICI response-associated CAGs for Odoribacter splanchnicus DSM 220712. The relative abundance of each CAG (a, b, c, i) is shown across specimens, grouping specimens by cohort (x-position) and ICI response outcome (color). The genomic coordinates of individual CAGs (d, e, f) are shown with orange arrows for the region of alignment for each gene and blue arrows with text indicating the NCBI annotations in that region. The occurrence of CAGs across multiple reference genomes (g) as well as stool microbiome specimens from this dataset (h) differentiates the core genomic elements (underlined in blue) from the accessory (or strain-specific) genetic elements (underlined in orange)