| Literature DB >> 27930729 |
Susan R Leonard1, Mark K Mammel1, David W Lacher1, Christopher A Elkins1.
Abstract
Consumption of fresh bagged spinach contaminated with Shiga toxin-producing Escherichia coli (STEC) has led to severe illness and death; however current culture-based methods to detect foodborne STEC are time consuming. Since not all STEC strains are considered pathogenic to humans, it is crucial to incorporate virulence characterization of STEC in the detection method. In this study, we assess the comprehensiveness of utilizing a shotgun metagenomics approach for detection and strain-level identification by spiking spinach with a variety of genomically disparate STEC strains at a low contamination level of 0.1 CFU/g. Molecular serotyping, virulence gene characterization, microbial community analysis, and E. coli core gene single nucleotide polymorphism (SNP) analysis were performed on metagenomic sequence data from enriched samples. It was determined from bacterial community analysis that E. coli, which was classified at the phylogroup level, was a major component of the population in most samples. However, in over half the samples, molecular serotyping revealed the presence of indigenous E. coli which also contributed to the percent abundance of E. coli. Despite the presence of additional E. coli strains, the serotype and virulence genes of the spiked STEC, including correct Shiga toxin subtype, were detected in 94% of the samples with a total number of reads per sample averaging 2.4 million. Variation in STEC abundance and/or detection was observed in replicate spiked samples, indicating an effect from the indigenous microbiota during enrichment. SNP analysis of the metagenomic data correctly placed the spiked STEC in a phylogeny of related strains in cases where the indigenous E. coli did not predominate in the enriched sample. Also, for these samples, our analysis demonstrates that strain-level phylogenetic resolution is possible using shotgun metagenomic data for determining the genomic relatedness of a contaminating STEC strain to other closely related E. coli.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27930729 PMCID: PMC5145215 DOI: 10.1371/journal.pone.0167870
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of STEC strains included in this study.
| EC1276 | Sakai | O157:H7 | E | Stx1a, Stx2a | Japan, sprout outbreak, 1996 | BA000007 |
| EC1611 | MI06-63, TW14359 | O157:H7 | E | Stx2a, Stx2c | USA (MI), spinach outbreak, 2006 | CP001368 |
| EC1738 | 550659 | O157:H7 | E | Stx2c | USA, cookie dough outbreak, 2009 | AKMN01 |
| EC1971 | MI10-01, TW16058 | O145:H28 | E | Stx2a | USA (MI), romaine lettuce outbreak, 2010 | JHEZ01 |
| EC2002 | 0202 2501, TW14998 | O145:H28 | E | Stx1a, Stx2a | USA (CT), clinical isolate, 2004 | SRS1263853 |
| EC1660 | MT#2, TW08023 | O121:H19 | B1 | Stx2a | USA (MT), clinical isolate, 1998 | SRS1263852 |
| EC1705 | EC-6804, TW08980 | O121:H19 | B1 | Stx2a | Japan, well water (clinical isolate), 2002 | SRS1263851 |
| EC1917 | 2012000615 | O26:H11 | B1 | Stx1a | USA (OH), clover sprout outbreak, 2012 | SRS1263855 |
| EC1894 | 2011C-3493 | O104:H4 | B1 | Stx2a | Germany, sprout outbreak, 2011 | CP003289 |
| EC2634 | B2F1 | O91:H21 | B1 | Stx2d1, Stx2d2 | Canada, patient with HUS, 1985 | AFDQ01 |
| EC1623 | CL-3, TW01391 | O113:H21 | B1 | Stx2a, Stx2d | Canada, patient with HUS, 1980 | AGTH01 |
| EC2635 | C165-02 | O73:H18 | D | Stx2d | Patient with bloody diarrhea | AFDR01 |
Fig 1Microbial community associated with enriched STEC-spiked spinach.
Spinach samples were spiked with STEC at a level of 0.1 CFU/g and enriched. Triplicate experiments for each strain were performed using bagged spinach purchased at different times and designated m1, m2, and m3. The E. coli phylogroup of the spiked STEC strain is indicated at the top of the figure. Bacterial genera or species contributing ≥1% to the total identified population in at least one sample are shown and the sum of all other species identified in lower abundances is included in “other”. Samples in which the serogroup of the spiked STEC was the only serogroup detected (green asterisk) and samples for which the serotype or virulence genes were not detected (red asterisk) are indicated.
Read counts mapping to virulence genes and serotyping loci.
| EC1276m1* | 807 | 1026 | 0 | 0 | 2153 | 1189 | 728 | 532 | 674 |
| EC1276m2 | 29 | 31 | 0 | 0 | 82 | 36 | 39 | 30 | 71 |
| EC1276m3 | 285 | 297 | 0 | 0 | 852 | 448 | 334 | 220 | 314 |
| EC1611m1* | 558 | 537 | 0 | 1461 | 685 | 470 | 382 | 409 | |
| EC1611m2* | 0 | 307 | 288 | 0 | 912 | 322 | 282 | 223 | 273 |
| EC1611m3 | 0 | 405 | 404 | 0 | 1074 | 439 | 276 | 230 | 386 |
| EC1738m1* | 0 | 0 | 510 | 0 | 1318 | 484 | 376 | 326 | 386 |
| EC1738m2 | 0 | 0 | 27 | 0 | 76 | 38 | 27 | 26 | 29 |
| EC1738m3* | 0 | 0 | 313 | 0 | 1104 | 420 | 189 | 185 | 271 |
| EC1971m1 | 0 | 537 | 0 | 0 | 1371 | 490 | 280 | 219 | 231 |
| EC1971m2 | 0 | 69 | 0 | 0 | 199 | 51 | 14 | 18 | 24 |
| EC1971m3 | 0 | 182 | 0 | 0 | 516 | 198 | 77 | 74 | 78 |
| EC2002m1* | 474 | 374 | 0 | 0 | 1340 | 593 | 251 | 186 | 194 |
| EC2002m2 | 119 | 119 | 0 | 0 | 429 | 150 | 72 | 69 | 61 |
| EC2002m3 | 441 | 294 | 0 | 0 | 1185 | 682 | 154 | 143 | 197 |
| EC1660m1 | 0 | 444 | 0 | 0 | 902 | 104 | 400 | 284 | 229 |
| EC1660m2* | 0 | 236 | 0 | 0 | 512 | 172 | 186 | 106 | 131 |
| EC1660m3 | 0 | 199 | 0 | 0 | 414 | 254 | 223 | 143 | 111 |
| EC1705m1* | 0 | 112 | 0 | 0 | 233 | 118 | 103 | 85 | 59 |
| EC1705m2 | 0 | 2 | 0 | 0 | 3 | 0 | 3 | 0 | 0 |
| EC1705m3 | 0 | 35 | 0 | 0 | 70 | 51 | 23 | 19 | 13 |
| EC1917m1* | 190 | 0 | 0 | 0 | 428 | 78 | 73 | 74 | 90 |
| EC1917m2 | 24 | 0 | 0 | 0 | 77 | 18 | 46 | 31 | 22 |
| EC1917m3 | 79 | 0 | 0 | 0 | 246 | 65 | 33 | 53 | 19 |
| EC1894m1 | 139 | 0 | 0 | 0 | 149 | 79 | 24 | ||
| EC1894m2* | 0 | 213 | 0 | 0 | 0 | 0 | 280 | 221 | 50 |
| EC1894m3* | 0 | 999 | 0 | 0 | 0 | 0 | 979 | 669 | 195 |
| EC2634m1 | 0 | 0 | 373 | 616 | 112 | 126 | 84 | ||
| EC2634m2* | 0 | 0 | 0 | 354 | 0 | 377 | 190 | 179 | 50 |
| EC2634m3* | 0 | 0 | 0 | 1991 | 0 | 2157 | 579 | 593 | 304 |
| EC1623m1 | 0 | 86 | 0 | 77 | 0 | 221 | 62 | 54 | 21 |
| EC1623m2* | 0 | 151 | 0 | 148 | 0 | 282 | 116 | 102 | 59 |
| EC1623m3* | 0 | 507 | 0 | 510 | 0 | 875 | 395 | 247 | 155 |
| EC2635m1 | 1 | 3 | 2 | 4 | |||||
| EC2635m2* | 0 | 0 | 0 | 24 | 0 | 0 | 12 | 32 | 17 |
| EC2635m3* | 0 | 0 | 0 | 270 | 0 | 0 | 140 | 140 | 146 |
Values shown in grey italic font are not expected for the spiked STEC strain. Locus query lengths are: stx1 (1227 bp), stx2 (1182 bp), eae (2805–2847 bp), ehxA (2997 bp), wzx (1236–1395 bp), wzy (1023–1332 bp), fliC (204–678 bp)
a Samples for which the spiked strain serogroup was the only serogroup detected are marked *.
b Allele specific for the serotype of the spiked STEC strain.
c The read mapped to a location that is identical among the stx2a, stx2c, and stx2d subtypes, so the actual subtype present in this sample could not be determined, but was expected to be stx2d since that is the allele strain EC2635 possesses.
Percent reads mapping to STEC genome and genome coverage.
| Sample | Mapped reads (%) | Average genome coverage (x) |
|---|---|---|
| EC1276m1 | 81.2 | 100 |
| EC1611m1 | 77.1 | 67 |
| EC1611m2 | 42.6 | 50 |
| EC1738m1 | 72.1 | 55 |
| EC1738m3 | 48.3 | 48 |
| EC2002m1 | 81.0 | 59 |
| EC1660m2 | 39.7 | 32 |
| EC1705m1 | 15.4 | 16 |
| EC1917m1 | 35.7 | 24 |
| EC1894m2 | 54.7 | 38 |
| EC1894m3 | 87.5 | 145 |
| EC2634m2 | 39.2 | 35 |
| EC2634m3 | 89.6 | 137 |
| EC1623m2 | 32.2 | 28 |
| EC1623m3 | 83.9 | 74 |
| EC2635m2 | 5.56 | 5 |
| EC2635m3 | 31.5 | 35 |
a Only samples in which the spiked STEC strain was the only E. coli detected are included.
Number of contigs, single nucleotide polymorphic (SNP) sites and nucleotide mismatches at SNP sites.
| EC1276 | 327 | 873248 | 7583 | 844403 | 32842 | 1801 | 818245 | 4710 | |
| EC1611 | 948 | 873445 | 4908 | 872810 | 1314 | 851317 | 4077 | ||
| EC1738 | 1230 | 871720 | 8212 | 840233 | 33899 | 3519 | 871667 | ||
| EC1971 | 228 | 874165 | 0 | 7091 | 836773 | 26915 | 5973 | 803805 | 22507 |
| EC2002 | 193 | 873248 | 10861 | 806822 | 29025 | 1475 | 873252 | 11 | |
| EC1660 | 312 | 875599 | 0 | 4805 | 875394 | 881 | 868009 | 217 | |
| EC1705 | 4756 | 868669 | 6324 | 872445 | 25311 | 7590 | 861015 | 19182 | |
| EC1917 | 2018 | 873292 | 4947 | 847663 | 11072 | 4784 | 856966 | 15496 | |
| EC1894 | 2902 | 853263 | 1743 | 2480 | 876238 | 469 | 876620 | ||
| EC2634 | 7299 | 825772 | 1713 | 4703 | 875152 | 126 | 875165 | ||
| EC1623 | 3964 | 867138 | 118 | 4625 | 872914 | 158 | 873595 | ||
| EC2635 | 4189 | 851205 | 45934 | 7509 | 511437 | 278 | 871661 |
a Number of contigs in the metagenome (filtered to retain contigs >500 bp and >5x coverage)
b Number of SNP locations covered in common between the spiked STEC genome and the assembled metagenome out of 876,716 included in the core gene analysis scheme.
c Values for samples in which the spiked STEC was the only E. coli detected are underlined.
Fig 2Phylogenetic relationships resulting from E. coli core gene SNP analysis of spiked STEC genomes, metagenomic samples, and other E. coli strains.
This neighbor-joining tree was constructed using a p distance matrix and 500 bootstrap replications. Metagenomic samples are color-coded as follows: green, spiked STEC only E. coli detected in sample; blue, indigenous E. coli present but clusters with spiked strain; and red, indigenous E. coli present and does not cluster with spiked strain. Closed genome strains are labeled with their strain name and GenBank accession number. Phylogroups are indicated by the square brackets. Bootstrap values >80% are given at the internal nodes.
Fig 3Finer-scale resolution of the phylogenetic relationships resulting from E. coli core gene SNP analysis.
Neighbor-joining trees for the O157:H7 and O77:H18 clonal groups were constructed using a p distance matrix and 500 bootstrap replications. The spiked STEC strains and metagenomic samples are indicated by the black and gray circles, respectively. Bootstrap values >80% are given at the internal nodes.
Fig 4Finer-scale resolution of the phylogenetic relationships resulting from E. coli core gene SNP analysis.
Neighbor-joining trees for the O91:H21, O104:H4, O121:H19, O26:H11, O145:H28, and O113:H21 clonal groups were constructed using a p distance matrix and 500 bootstrap replications. The spiked STEC strains and metagenomic samples are indicated by the black and gray circles, respectively. Bootstrap values >80% are given at the internal nodes.