| Literature DB >> 29577086 |
Jeremiah J Minich1, Qiyun Zhu2, Stefan Janssen2, Ryan Hendrickson3, Amnon Amir2, Russ Vetter4, John Hyde4, Megan M Doty5, Kristina Stillwell5, James Benardini3, Jae H Kim5, Eric E Allen1,6, Kasthuri Venkateswaran3, Rob Knight2,7,6.
Abstract
Microbiome analyses of low-biomass samples are challenging because of contamination and inefficiencies, leading many investigators to employ low-throughput methods with minimal controls. We developed a new automated protocol, KatharoSeq (from the Greek katharos [clean]), that outperforms single-tube extractions while processing at least five times as fast. KatharoSeq incorporates positive and negative controls to reveal the whole bacterial community from inputs of as few as 50 cells and correctly identifies 90.6% (standard error, 0.013%) of the reads from 500 cells. To demonstrate the broad utility of KatharoSeq, we performed 16S rRNA amplicon and shotgun metagenome analyses of the Jet Propulsion Laboratory spacecraft assembly facility (SAF; n = 192, 96), 52 rooms of a neonatal intensive care unit (NICU; n = 388, 337), and an endangered-abalone-rearing facility (n = 192, 123), obtaining spatially resolved, unique microbiomes reproducible across hundreds of samples. The SAF, our primary focus, contains 32 sOTUs (sub-OTUs, defined as exact sequence matches) and their inferred variants identified by the deblur algorithm, with four (Acinetobacter lwoffii, Paracoccus marcusii, Mycobacterium sp., and Novosphingobium) being present in >75% of the samples. According to microbial spatial topography, the most abundant cleanroom contaminant, A. lwoffii, is related to human foot traffic exposure. In the NICU, we have been able to discriminate environmental exposure related to patient infectious disease, and in the abalone facility, we show that microbial communities reflect the marine environment rather than human input. Consequently, we demonstrate the feasibility and utility of large-scale, low-biomass metagenomic analyses using the KatharoSeq protocol. IMPORTANCE Various indoor, outdoor, and host-associated environments contain small quantities of microbial biomass and represent a niche that is often understudied because of technical constraints. Many studies that attempt to evaluate these low-biomass microbiome samples are riddled with erroneous results that are typically false positive signals obtained during the sampling process. We have investigated various low-biomass kits and methods to determine the limit of detection of these pipelines. Here we present KatharoSeq, a high-throughput protocol combining laboratory and bioinformatic methods that can differentiate a true positive signal in samples with as few as 50 to 500 cells. We demonstrate the application of this method in three unique low-biomass environments, including a SAF, a hospital NICU, and an abalone-rearing facility.Entities:
Keywords: 16S rRNA amplicon; Acinetobacter; NICU; Staphylococcus; Vibrio; abalone; built environment; low biomass; metagenomics; microbial ecology; neonatal intensive care unit
Year: 2018 PMID: 29577086 PMCID: PMC5864415 DOI: 10.1128/mSystems.00218-17
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1 Low-biomass microbiome kit evaluation. (a) Experimental design of low-biomass kit evaluating negative and positive controls (5, 50, 500, and 5,000 bacterial cells) across three solid-phase and two magnetic-bead-based DNA extraction methods. LT, low throughput; HT, high throughput. (b) Libraries of 16S rRNA amplicons were sequenced, and the limit of detection of B. subtilis from DNA extraction was determined by comparing the composition of the expected target in known inputs of cells with that in negative controls by a nonparametric Kruskal-Wallis test (Benjamini-Hochberg FDR of 0.05). (c) The limit of detection of the assay and approximate background noise were determined by calculating the K1/2 value by using the allosteric sigmoidal equation on the Bacillus composition. (d) The median read counts (interquartile range) were compared against blanks by using a nonparametric Kruskal-Wallis test (Benjamini-Hochberg FDR of 0.05) to distinguish signal from noise. (e) Read counts were plotted against the expected composition of the target and fitted with an allosteric sigmoidal equation to describe the number of reads where 50% of the read composition was the expected target. This was performed for DNA extraction positive controls from the Mo Bio PowerMag kit, which performed the best, with a limit of detection of 50 cells.
FIG 2 Description of the low-biomass microbiome bench protocol and computational analysis. KatharoSeq-specific recommendations are highlighted throughout the pipeline from sample collection to bioinformatic sample exclusion.
FIG 3 Optimized KatharoSeq protocol applied to a JPL cleanroom facility. (a) The number of deblurred reads represented by the DNA extraction and PCR positive controls is plotted against the composition of the target organism. The distribution is fitted with an allosteric sigmoidal equation that describes the number of reads when 50% of the composition is recorded (K1/2). (b) The numbers of reads (median and interquartile range) per control type and floor samples are depicted, with the red dotted line indicating the sample exclusion value of 1,696 reads. NC, negative control. (c) Beta diversity PCoA plots calculated from weighted UniFrac distances from the JPL samples (n = 59) and controls (n = 15) with at least 1,696 deblurred reads, colored by foot traffic frequency (human exposure) and control type. (d) All 32 sOTUs found to be associated with the JPL floor community are shown in a heatmap in comparison with controls. (e) Associations of human foot traffic with sample composition of the top four most abundant sOTUs. (f) The most abundant sOTU in the samples, A. lwoffii, is mapped onto the JPL facility 2D map, with the samples deviating from the cluster noted with a red asterisk.
FIG 4 Microbial diversity was broadly compared across three unique built-environment study sites for 16S rRNA amplicon and shotgun metagenome sequencing. (a) Microbial richness of 16S rRNA amplicon data was calculated for various sample types within the built environment and organized by human exposure (H, high human exposure; M, medium human exposure; L, low human exposure). LBM, low biomass, indicates the kit testing controls from the Mobio Powermag kit. Beta diversity PCoA plots of weighted UniFrac distances of 16S rRNA amplicon (c) and Bray-Curtis distances of shotgun metagenomic samples (d) demonstrate study-specific microbial communities. The floor samples within the three built environments are solid spheres in the 16S rRNA amplicon plot (c), while other samples are transparent. (d) Heatmap depicting the top 464 of the 16,417 most abundant sOTUs across the 438 samples that passed QC. Floor samples are indicated by a line to demonstrate the similarity and differences of microbes on the floors of the three built environments.