| Literature DB >> 30809204 |
Laura M Carroll1, Martin Wiedmann1, Manjari Mukherjee2, David C Nicholas3, Lisa A Mingle4, Nellie B Dumas4, Jocelyn A Cole4, Jasna Kovac2.
Abstract
The Bacillus cereus group comprises multiple species capable of causing emetic or diarrheal foodborne illness. Despite being responsible for tens of thousands of illnesses each year in the U.S. alone, whole-genome sequencing (WGS) is not yet routinely employed to characterize B. cereus group isolates from foodborne outbreaks. Here, we describe the first WGS-based characterization of isolates linked to an outbreak caused by members of the B. cereus group. In conjunction with a 2016 outbreak traced to a supplier of refried beans served by a fast food restaurant chain in upstate New York, a total of 33 B. cereus group isolates were obtained from human cases (n = 7) and food samples (n = 26). Emetic (n = 30) and diarrheal (n = 3) isolates were most closely related to B. paranthracis (group III) and B. cereus sensu stricto (group IV), respectively. WGS indicated that the 30 emetic isolates (24 and 6 from food and humans, respectively) were closely related and formed a well-supported clade distinct from publicly available emetic group III genomes with an identical sequence type (ST 26). The 30 emetic group III isolates from this outbreak differed from each other by a mean of 8.3 to 11.9 core single nucleotide polymorphisms (SNPs), while differing from publicly available emetic group III ST 26 B. cereus group genomes by a mean of 301.7-528.0 core SNPs, depending on the SNP calling methodology used. Using a WST-1 cell proliferation assay, the strains isolated from this outbreak had only mild detrimental effects on HeLa cell metabolic activity compared to reference diarrheal strain B. cereus ATCC 14579. We hypothesize that the outbreak was a single source outbreak caused by emetic group III B. cereus belonging to the B. paranthracis species, although food samples were not tested for presence of the emetic toxin cereulide. In addition to showcasing how WGS can be used to characterize B. cereus group strains linked to a foodborne outbreak, we also discuss potential microbiological and epidemiological challenges presented by B. cereus group outbreaks, and we offer recommendations for analyzing WGS data from the isolates associated with them.Entities:
Keywords: Bacillus cereus; SNP calling; cytotoxicity; emetic disease; foodborne outbreak; genomic epidemiology; whole-genome sequencing
Year: 2019 PMID: 30809204 PMCID: PMC6379260 DOI: 10.3389/fmicb.2019.00144
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Variant calling pipelines tested in this study.
| CFSAN | Read mapping | Yes | PE reads (fastq) | Bowtie2 | Varscan | |
| Freebayes | Read mapping | Yes | PE reads (fastq) | BWA MEM | Freebayes | |
| kSNP3 | No | Contigs (fasta) | Not applicable | kSNP3 | ||
| LYVE-SET | Read mapping | Yes | PE reads (fastq) | SMALT | Varscan | |
| Parsnp | Core genome alignment | Yes | Contigs (fasta) | Not applicable | Parsnp | |
| Samtools | Read mapping | Yes | PE reads (fastq) | BWA MEM | Samtools/Bcftools |
CFSAN, U.S. Food and Drug Administration (FDA) Center for Food Safety and Applied Nutrition SNP pipeline; LYVE-SET, U.S. Centers for Disease Control and Prevention (CDC) Listeria, Yersinia, Vibrio, and Enterobacteriaceae SNP Extraction Tool.
PE reads, Illumina paired-end reads.
Reference genomes used for reference-based variant calling in this study.
| IV | All 33 isolates from two groups (groups III and IV) | 98.8–98.9 (group IV) | NC_004722.1 | Complete Genome | ||
| III | All 33 isolates from two groups (groups III and IV); 30 emetic group III isolates | 92.0–92.2 (group IV) | NC_011658.1 | Complete Genome | Human clinical isolate associated with an emetic outbreak in 1972 (cooked rice, United Kingdom); identical virulotype, MLST sequence type, | |
| VII | All 33 isolates from two groups (groups III and IV) | 82.6–82.7 (group IV) | NC_009674.1 | Complete Genome | Type strain of | |
| FOOD_10_19_16_RSNT1_2H_R9-6393 | III | 30 emetic group III isolates | 92.0–92.2 (group IV) | SRR6825038 | Contigs | Emetic isolate from the outbreak reported here; assembly had high per-base coverage, as well as the fewest number of contigs of all genome assemblies from isolates in this outbreak |
Phylogenetic group determined via panC group assignment function in BTyper version 2.2.0.
Data set(s) in this study for which given genome was used as a reference genome for reference-based SNP calling.
Minimum and maximum average nucleotide identity (ANI) values of reference strain relative to group IV and group III genomes sequenced in this outbreak (n = 3 and 30, respectively) calculated using FastANI.
Minimum ANI value was <100 prior to rounding.
List of outbreak isolates and corresponding metadata, single- and multi-locus sequence types, and species.
| FOOD_10_18_16_LFTOV_NA_R9-6400 | Food | Leftovers | 9-Oct-16 | 18-Oct-16 | Unknown | III | 26 | 125 | |
| FOOD_10_18_16_LFTOV_NA_R9-6401 | Food | Leftovers | 9-Oct-16 | 18-Oct-16 | Unknown | III | 26 | 125 | |
| FOOD_10_18_16_LFTOV_NA_R9-6402 | Food | Leftovers | 9-Oct-16 | 18-Oct-16 | Unknown | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_1B_R9-6388 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 1/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_1B_R9-6389 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 1/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_1B_R9-6390 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 1/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_1B_R9-6391 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 1/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2A_R9-6386 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2A_R9-6387 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2H_R9-6392 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/H | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2H_R9-6393 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/H | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2H_R9-6394 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/H | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2H_R9-6395 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/H | III | 26 | 125 | |
| FOOD_10_19_16_RSNT1_2H_R9-6396 | Food | Restaurant 1 | 6-Oct-16 | 19-Oct-16 | 2/H | III | 26 | 125 | |
| FOOD_10_19_16_RSNT2_2A_R9-6397 | Food | Restaurant 2 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT2_2A_R9-6398 | Food | Restaurant 2 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT2_2A_R9-6399 | Food | Restaurant 2 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT3_1E_R9-6407 | Food | Restaurant 3 | 6-Oct-16 | 19-Oct-16 | 1/E | III | 26 | 125 | |
| FOOD_10_19_16_RSNT3_2A_R9-6403 | Food | Restaurant 3 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT3_2A_R9-6404 | Food | Restaurant 3 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT3_2A_R9-6405 | Food | Restaurant 3 | 6-Oct-16 | 19-Oct-16 | 2/A | III | 26 | 125 | |
| FOOD_10_19_16_RSNT4_2B_R9-6408 | Food | Restaurant 4 | 6-Oct-16 | 19-Oct-16 | 2/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT4_2B_R9-6409 | Food | Restaurant 4 | 6-Oct-16 | 19-Oct-16 | 2/B | III | 26 | 125 | |
| FOOD_10_19_16_RSNT5_1C_R9-6411 | Food | Restaurant 5 | 6-Oct-16 | 19-Oct-16 | 1/C | III | 26 | 125 | |
| HUMN_10_18_16_FECAL_NA_R9-6384 | Human | Feces | 7-Oct-16 | 18-Oct-16 | NA | III | 26 | 125 | |
| HUMN_10_18_16_FECAL_NA_R9-6385 | Human | Feces | 8-Oct-16 | 18-Oct-16 | NA | III | 26 | 125 | |
| HUMN_10_18_16_FECAL_NA_R9-6412 | Human | Feces | 8-Oct-16 | 18-Oct-16 | NA | III | 26 | 125 | |
| HUMN_10_19_16_FECAL_NA_R9-6381 | Human | Feces | 7-Oct-16 | 19-Oct-16 | NA | III | 26 | 125 | |
| HUMN_10_19_16_FECAL_NA_R9-6382 | Human | Feces | 7-Oct-16 | 19-Oct-16 | NA | III | 26 | 125 | |
| HUMN_10_19_16_FECAL_NA_R9-6383 | Human | Feces | 7-Oct-16 | 19-Oct-16 | NA | III | 26 | 125 | |
| FOOD_10_19_16_RSNT3_1E_R9-6406 | Food | Restaurant 3 | 6-Oct-16 | 19-Oct-16 | 1/E | IV | 24 | 92 | |
| FOOD_10_19_16_RSNT5_1C_R9-6410 | Food | Restaurant 5 | 6-Oct-16 | 19-Oct-16 | 1/C | IV | 24 | 92 | |
| HUMN_10_26_16_FECAL_NA_R9-6413 | Human | Feces | 8-Oct-16 | 26-Oct-16 | NA | IV | 142 | 92 |
Production date is designated by either 1 or 2; batch is one of A through H.
panC group assigned in silico using BTyper 2.2.0.
Multi-locus sequence typing (MLST) sequence type (ST) assigned in silico using BTyper 2.2.0.
rpoB allelic type (AT) determined using Sanger sequencing and verified in silico using BTyper 2.2.0.
ANI, average nucleotide identity calculated using FastANI.
Figure 1Maximum likelihood phylogeny of core SNPs identified in 33 isolates sequenced in conjunction with a B. cereus outbreak, as well as genomes of the 18 currently recognized B. cereus group species (shown in gray). Core SNPs were identified in all genomes using kSNP3. Heatmap corresponds to presence/absence of B. cereus group virulence genes detected in each sequence using BTyper. Tip labels in maroon and teal correspond to the seven human clinical isolates and 26 isolates from food sequenced in conjunction with this outbreak, respectively. Phylogeny is rooted at the midpoint, and branch labels correspond to bootstrap support percentages out of 500 replicates. Due to the short lengths and low bootstrap support (all values <10) of branches within the outbreak clade, bootstrap support percentages are not shown on branches within the outbreak clade.
Figure 2Percentage viability of HeLa cells when treated with supernatants of each isolate as determined by the WST-1 assay. Viability was calculated as ratio of corrected absorbance of solution when HeLa cells were treated with supernatants to the ratio of corrected absorbance of solution when HeLa cells were treated with BHI (i.e., negative control), converted to percentages. The columns represent the mean viabilities, while the error bars represent standard deviations for 12 technical replicates. Any two bars that do not share a common alphabetic character had significantly different percentage viability values (P < 0.05).
Figure 3Number of core SNPs identified in 33 B. cereus group isolates from two phylogenetic groups (30 and 3 isolates from groups III and IV, respectively), sequenced in conjunction with a foodborne outbreak. Combinations of five reference-based variant calling pipelines and three reference genomes, as well as one reference-free SNP calling method (kSNP3), were tested.
Figure 4Comparison of core SNP positions reported by five reference-based variant-calling pipelines for 33 B. cereus group strains isolated in association with a foodborne outbreak, with the chromosomes of (A) B. cereus AH187 (group III), (B) B. cereus s.s. ATCC 14579 (group IV), and (C) B. cytotoxicus NVH 391-98 (group VII) used as reference genomes. Ellipses represent each pipeline.
Figure 5(A) Number of core SNPs and (B) total number of SNPs identified in 30 emetic B. cereus group III strains isolated in association with a foodborne outbreak. Combinations of (A) five and (B) four reference-based variant calling pipelines and two reference genomes (either dustmasked or unmasked) were tested, along with one reference-free SNP calling method (kSNP3). Because the Parsnp pipeline reports core SNPs by definition, it was excluded from Figure 5B (total SNPs). For quantification of the total number of SNPs (Figure 5B), all sites with more than one unique character were counted.
Figure 6Ranges of pairwise (A) core SNP differences and (B) total SNP differences between 30 emetic group III B. cereus group strains isolated in conjunction with a foodborne outbreak. Combinations of (A) five and (B) four reference-based variant calling pipelines and two reference genomes (either dustmasked or unmasked), as well as one reference-free SNP calling method (kSNP3) were tested. Lower and upper box hinges correspond to the first and third quartiles, respectively. Lower and upper whiskers extend from the hinge to the smallest and largest values no more distant than 1.5 times the interquartile range from the hinge, respectively. Points represent pairwise distances that fall beyond the ends of the whiskers. Because the Parsnp pipeline reports core SNPs by definition, it was excluded from Figure 6B (pairwise differences in total SNPs). For quantification of pairwise differences in the total number of SNPs (Figure 6B), all sites with more than one unique character were included.
Figure 7Comparison of core SNP positions reported by five variant-calling pipelines for 30 emetic group III B. cereus group outbreak isolates. Ellipses represent each pipeline, all of which used the chromosome of emetic group III B. cereus AH187 as a reference for variant calling.
Maximum likelihood phylogenies of 30 emetic group III outbreak isolates considered to be more topologically similar than would be expected by chance (P < 0.05).
| AH187_CFSAN_NOdust_all | AH187_CFSAN_NOdust_core | 0 |
| AH187_CFSAN_NOdust_all | AH187_LYVE-SET_NOdust_all | 0 |
| AH187_CFSAN_NOdust_all | AH187_LYVE-SET_NOdust_core | 0.0171 |
| AH187_CFSAN_NOdust_all | AH187_LYVE-SET_YESdust_all | 0 |
| AH187_CFSAN_NOdust_all | AH187_LYVE-SET_YESdust_core | 0.0171 |
| AH187_CFSAN_NOdust_core | AH187_LYVE-SET_NOdust_all | 0 |
| AH187_CFSAN_NOdust_core | AH187_LYVE-SET_NOdust_core | 0.0171 |
| AH187_CFSAN_NOdust_core | AH187_LYVE-SET_YESdust_all | 0 |
| AH187_CFSAN_NOdust_core | AH187_LYVE-SET_YESdust_core | 0.0171 |
| AH187_Freebayes_NOdust_core | AH187_Freebayes_YESdust_core | 0.0342 |
| AH187_LYVE-SET_NOdust_all | AH187_LYVE-SET_NOdust_core | 0.0171 |
| AH187_LYVE-SET_NOdust_all | AH187_LYVE-SET_YESdust_all | 0 |
| AH187_LYVE-SET_NOdust_all | AH187_LYVE-SET_YESdust_core | 0.0171 |
| AH187_LYVE-SET_NOdust_core | AH187_LYVE-SET_YESdust_core | 0 |
| AH187_LYVE-SET_YESdust_all | AH187_LYVE-SET_YESdust_core | 0.0171 |
| AH187_Parsnp_NOdust_core | AH187_Parsnp_YESdust_core | 0.0171 |
Obtained from pairwise tests of tree topologies using a Z test based on the Kendall-Colijn metric (Kendall and Colijn, .
Names of reference and query phylogenies denote reference genome (“AH187” for reference-based pipelines, “NOREF” for reference-free kSNP pipeline), pipeline (“CFSAN,” “Freebayes,” “kSNP,” “LYVE-SET,” “Parsnp,” or “Samtools”), reference genome masking (“NOdust” for an unmasked reference genome, “YESdust” for a dustmasked reference genome, or “NAdust” for reference-free kSNP pipeline, for which dustmasking is not applicable), and SNPs used to construct the phylogeny (“core” for core SNPs, or “all” for core and accessory SNPs), separated by an underscore (“_”).
Bonferroni-corrected P-values for all tests that were significant at the α = 0.05 level.
Figure 8Maximum likelihood phylogenies of 30 emetic group III isolates (ST 26) sequenced in conjunction with a B. cereus outbreak, as well as all other emetic group III ST 26 genomes available in NCBI (n = 25; shown in black). Trees were constructed using core SNPs identified using (A) kSNP3 or (B) Parsnp. Tip labels in maroon and teal correspond to the six human clinical isolates and 24 isolates from food sequenced in conjunction with this outbreak, respectively. Branch labels correspond to bootstrap support percentages out of 1,000 replicates. Due to the short lengths and low bootstrap support of branches within the outbreak clade, bootstrap support percentages are not shown on branches within the outbreak clade.