| Literature DB >> 28625989 |
Laura M Carroll1, Jasna Kovac2, Rachel A Miller1, Martin Wiedmann1.
Abstract
The Bacillus cereus group comprises nine species, several of which are pathogenic. Differentiating between isolates that may cause disease and those that do not is a matter of public health and economic importance, but it can be particularly challenging due to the high genomic similarity within the group. To this end, we have developed BTyper, a computational tool that employs a combination of (i) virulence gene-based typing, (ii) multilocus sequence typing (MLST), (iii) panC clade typing, and (iv) rpoB allelic typing to rapidly classify B. cereus group isolates using nucleotide sequencing data. BTyper was applied to a set of 662 B. cereus group genome assemblies to (i) identify anthrax-associated genes in non-B. anthracis members of the B. cereus group, and (ii) identify assemblies from B. cereus group strains with emetic potential. With BTyper, the anthrax toxin genes cya, lef, and pagA were detected in 8 genomes classified by the NCBI as B. cereus that clustered into two distinct groups using k-medoids clustering, while either the B. anthracis poly-γ-d-glutamate capsule biosynthesis genes capABCDE or the hyaluronic acid capsule hasA gene was detected in an additional 16 assemblies classified as either B. cereus or Bacillus thuringiensis isolated from clinical, environmental, and food sources. The emetic toxin genes cesABCD were detected in 24 assemblies belonging to panC clades III and VI that had been isolated from food, clinical, and environmental settings. The command line version of BTyper is available at https://github.com/lmc297/BTyper In addition, BMiner, a companion application for analyzing multiple BTyper output files in aggregate, can be found at https://github.com/lmc297/BMinerIMPORTANCE Bacillus cereus is a foodborne pathogen that is estimated to cause tens of thousands of illnesses each year in the United States alone. Even with molecular methods, it can be difficult to distinguish nonpathogenic B. cereus group isolates from their pathogenic counterparts, including the human pathogen Bacillus anthracis, which is responsible for anthrax, as well as the insect pathogen B. thuringiensis By using the variety of typing schemes employed by BTyper, users can rapidly classify, characterize, and assess the virulence potential of any isolate using its nucleotide sequencing data.Entities:
Keywords: Bacillus cereus group; taxonomy; virulence genes; whole-genome sequencing
Year: 2017 PMID: 28625989 PMCID: PMC5561296 DOI: 10.1128/AEM.01096-17
Source DB: PubMed Journal: Appl Environ Microbiol ISSN: 0099-2240 Impact factor: 4.792
FIG 1BTyper command line workflow for various types of data and default typing methods. Input datum type is listed in the left margin, while typing methods are listed at the top of the chart. Command line parameters associated with a particular typing method are shown in parentheses. FSL, Food Safety Lab.
Percentage of isolates in which BTyper correctly identified the presence/absence of eight virulence genes, MLST, rpoB AT, and panC clade
| Data set | Virulence gene (%) | MLST ST (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Training ( | |||||||||||
| Assemblies | 100 | 100 | 100 | 100 | 95.5 | 100 | 90.9 | 95.5 | 100 | 100 | 100 |
| PE reads | 100 | 90.9 | 100 | 90.9 | 95.5 | 95.5 | 90.9 | 95.5 | 100 | 100 | 100 |
| Validation ( | |||||||||||
| Assemblies | 91.7 | 100 | 95.8 | 87.5 | 95.8 | 100 | 100 | 91.7 | 95.8 | 100 | 100 |
| PE reads | 91.7 | 100 | 91.7 | 87.5 | 95.8 | 100 | 100 | 91.7 | 95.8 | 100 | 100 |
| Total ( | |||||||||||
| Assemblies | 95.7 | 100 | 97.8 | 93.5 | 95.7 | 100 | 95.7 | 93.5 | 97.8 | 100 | 100 |
| PE reads | 95.7 | 95.7 | 95.7 | 89.1 | 95.7 | 97.8 | 95.7 | 93.5 | 97.8 | 100 | 100 |
Presence/absence of eight virulence genes from previously published WGS data (training set) or PCR (validation set).
Multilocus sequence typing (MLST) results from previously published WGS data (training set) or Sanger sequencing (validation set).
rpoB allelic typing (AT) results from previously published WGS data (training set) or Sanger sequencing (validation set).
panC clade typing results from previously published WGS data.
Illumina paired-end (PE) reads.
FIG 2Percentage (%) of B. cereus group assemblies in which a particular virulence gene was detected. Minimum identity and coverage thresholds of 50 and 70%, respectively, were used for virulence gene detection.
FIG 3Closest-matching phylogenetic clade using the panC loci from 662 B. cereus group genome assemblies. A panC locus could not be assigned in 4 genome assemblies, which is denoted by “NA.”
Virulence genes significantly associated with 5 B. cereus group phylogenetic clades after a Bonferroni correction
| Clade | Genes |
|---|---|
| II | |
| III | |
| IV | |
| V | |
| VI |
Significant at a P value of <0.05. For exact corrected P values, see Table S7.
Indicates a virulence gene that was detected only in its respective clade (includes clades I and VII).
Indicates a virulence gene that was detected in all members of its respective clade.
FIG 4Principal-component analysis (PCA) of 662 B. cereus group genome assemblies based on presence/absence of virulence genes. Virulence gene typing was carried out using BTyper, while PCA was performed using BMiner. Principal components 1 (PC1) and 2 (PC2) are plotted on the x and y axes, respectively, while principal component 3 (PC3) corresponds to point size. Plots are colored by isolate species, as found in NCBI (A), and assigned cluster using k-medoids (B). To view interactive versions of these plots containing isolate names and metadata, all BTyper final results files and metadata can be downloaded from https://github.com/lmc297/BTyper/tree/master/sample_data and viewed in BMiner.
FIG 5k-medoids clusters based on presence/absence of virulence genes detected using BTyper. Size corresponds to the number of assemblies assigned to a given cluster, while panC corresponds to panC clades found in the cluster, with an asterisk denoting one or more assemblies that could not be placed into a panC clade. Numbers within cells correspond to the proportion of assemblies in a given cluster in which the corresponding virulence gene was detected. Green shading corresponds to a virulence gene detected in more than 90% of all assemblies in a cluster, while red shading corresponds to a virulence gene detected in fewer than 10% of all assemblies in a cluster. Yellow shading corresponds to B. anthracis-associated genes detected in fewer than 90% but greater than 0% of assemblies in a cluster.
Non-anthracis Bacillus assemblies in which anthrax toxin genes cya, lef, and/or pagA were detected using BTyper
| Cluster | NCBI species classification | GenBank accession no. | Strain | Isolate source (reference) | Gene(s) detected? | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | III | 03BB102 | Human with fatal pneumonia, San Antonio, TX, USA ( | + | + | + | − | + | + | ||
| 1 | III | Biovar anthracis strain CI | Chimpanzee with fatal anthrax, Taï National Park, Côte d'Ivoire ( | + | + | + | + | + | + | ||
| 22 | III | G9241 | Human with pneumonia, nausea, and vomiting, LA, USA ( | + | + | + | + | + | − | ||
| 22 | III | BcFL2013 | Human with anthrax-like skin lesion, FL, USA ( | + | + | + | + | + | − | ||
| 22 | III | 03BB87 | Human with fatal pneumonia, Lubbock, TX, USA ( | + | + | + | + | + | − | ||
| 22 | III | LA2007 | Human with fatal pneumonia and septic shock, Galliano, LA, USA ( | + | + | + | + | + | − | ||
Clusters were assigned using a k-medoids approach (k = 31).
panC clades were assigned using BTyper.
Multiple accession numbers are given for strains associated with multiple assemblies.
FIG 6Nonmetric multidimensional scaling (NMDS) plot of Bacillus cereus group clusters that (i) possessed at least one assembly that was classified as Bacillus anthracis in NCBI, and/or (ii) possessed at least one assembly in which at least one B. anthracis-associated virulence gene (cya, lef, pagA, atxA, hasA, and/or capABCDE) was detected using BTyper. NMDS was performed in BMiner using virulence gene presence/absence data and a Jaccard dissimilarity metric. Isolates are represented by points, and convex hulls and shading correspond to the assigned k-medoids cluster. Virulence genes are plotted in dark gray.
Non-anthracis Bacillus assemblies in which B. anthracis-associated genes were detected, excluding anthrax toxin genes cya, lef, and pagA and regulator atxA
| Cluster | NCBI species classification | GenBank accession no. | Strain/isolate ID | Isolate source (reference) | Gene detected? | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | III | JRS1 | − | + | + | + | − | − | |||
| 6 | III | AH1273 | Human blood, Iceland ( | − | + | + | + | + | + | ||
| 6 | III | AH1272 | Amniotic fluid, Iceland ( | − | + | − | + | + | + | ||
| 6 | III | 03BB108 | Dust containing pneumonia-causing | − | + | + | + | + | + | ||
| 6 | IV | Schrouff | Food ( | − | + | + | + | + | + | ||
| 6 | IV | K-5975c | Food ( | − | + | + | + | + | + | ||
| 6 | IV | HuB4-4 | Soil, Belgium ( | − | + | − | + | + | + | ||
| 6 | III | Serovar Monterrey strain BGSC 4AJ1 | Mexico ( | − | + | + | + | + | + | ||
| 6 | IV | BGSC 4C1 | − | + | + | + | + | + | |||
| 17 | VI | FSL H8-0485 | Soil, USA ( | + | − | − | − | − | − | ||
| 17 | III | Et10/1 | Geothermal spring, Lirima thermal springs, Chile ( | − | − | + | + | − | − | ||
| 21 | IV | F65185 | Open fracture, NY, USA ( | − | + | + | + | + | + | ||
| 21 | V | VD115 | Soil, Guadeloupe ( | − | + | + | + | + | + | ||
| 21 | IV | BGSC 4BT1 | Red soil, China ( | − | + | + | + | + | − | ||
| 29 | III | MOD1_Bc119 | Whole black pepper, USA ( | − | + | + | + | + | + | ||
ID, identification.
capE was detected at a lower amino acid identity (47.7%, compared to the default threshold of 50%).
Multiple accession numbers are given for strains associated with multiple assemblies.
B. cereus group assemblies in which emetic toxin genes cesABCD were detected
| Cluster | NCBI species classification | GenBank accession no. | Strain | Isolate source (reference) | |
|---|---|---|---|---|---|
| 12 | III | AH187 | Vomit of a person who ate cooked rice; isolate was associated with an emetic outbreak in 1972 ( | ||
| 12 | III | BDRD-ST26 | BDRD stock strain ( | ||
| 12 | III | H3081.97 | Food; emetic toxin-producing isolate from 1997 outbreak linked to rice, TX, USA ( | ||
| 12 | III | NC7401 | Emetic isolate ( | ||
| 12 | III | IS075 | Wild mammal (vole) ( | ||
| 12 | III | AND1407 | Black currant ( | ||
| 12 | III | MSX-A12 | Not available ( | ||
| 12 | III | IS845/00 | Bank vole, Poland ( | ||
| 12 | III | IS195 | Bank vole, Poland ( | ||
| 12 | III | F1-15 | Foodborne source ( | ||
| 12 | III | MB.15 | Food, Munich, Germany ( | ||
| 12 | III | MB.18 | Food, Munich, Germany ( | ||
| 12 | III | MB.16 | Food, Munich, Germany ( | ||
| 12 | III | MB.17 | Food, Munich, Germany ( | ||
| 12 | III | MB.21 | Food, Munich, Germany ( | ||
| 12 | III | MB.8 | Food, Munich, Germany ( | ||
| 12 | III | MB.8-1 | Food, Munich, Germany ( | ||
| 12 | III | MB.20 | Food, Munich, Germany ( | ||
| 12 | III | MB.22 | Food, Munich, Germany ( | ||
| 24 | VI | MC67 | Sandy loam, Møn, Denmark ( | ||
| 24 | VI | CER074 | Raw milk ( | ||
| 24 | VI | CER057 | Parsley ( | ||
| 24 | VI | BtB2-4 | Forest soil ( | ||
| 24 | VI | MC118 | Sandy loam, Møn, Denmark ( |
BDRD, Biological Defense Research Directorate.