Literature DB >> 27258142

Whole Genome DNA Sequence Analysis of Salmonella subspecies enterica serotype Tennessee obtained from related peanut butter foodborne outbreaks.

Mark R Wilson1, Eric Brown2, Chris Keys2, Errol Strain2, Yan Luo2, Tim Muruvanda2, Christopher Grim2,3, Junia Jean-Gilles Beaubrun3, Karen Jarvis3, Laura Ewing3, Gopal Gopinath3, Darcy Hanes3, Marc W Allard2, Steven Musser2.   

Abstract

Establishing an association between possible food sources and clinical isolates requires discriminating the suspected pathogen from an environmental background, and distinguishing it from other closely-related foodborne pathogens. We used whole genome sequencing (WGS) to Salmonella subspecies enterica serotype Tennessee (S. Tennessee) to describe genomic diversity across the serovar as well as among and within outbreak clades of strains associated with contaminated peanut butter. We analyzed 71 isolates of S. Tennessee from disparate food, environmental, and clinical sources and 2 other closely-related Salmonella serovars as outgroups (S. Kentucky and S. Cubana), which were also shot-gun sequenced. A whole genome single nucleotide polymorphism (SNP) analysis was performed using a maximum likelihood approach to infer phylogenetic relationships. Several monophyletic lineages of S. Tennessee with limited SNP variability were identified that recapitulated several food contamination events. S. Tennessee clades were separated from outgroup salmonellae by more than sixteen thousand SNPs. Intra-serovar diversity of S. Tennessee was small compared to the chosen outgroups (1,153 SNPs), suggesting recent divergence of some S. Tennessee clades. Analysis of all 1,153 SNPs structuring an S. Tennessee peanut butter outbreak cluster revealed that isolates from several food, plant, and clinical isolates were very closely related, as they had only a few SNP differences between them. SNP-based cluster analyses linked specific food sources to several clinical S. Tennessee strains isolated in separate contamination events. Environmental and clinical isolates had very similar whole genome sequences; no markers were found that could be used to discriminate between these sources. Finally, we identified SNPs within variable S. Tennessee genes that may be useful markers for the development of rapid surveillance and typing methods, potentially aiding in traceback efforts during future outbreaks. Using WGS can delimit contamination sources for foodborne illnesses across multiple outbreaks and reveal otherwise undetected DNA sequence differences essential to the tracing of bacterial pathogens as they emerge.

Entities:  

Mesh:

Year:  2016        PMID: 27258142      PMCID: PMC4892500          DOI: 10.1371/journal.pone.0146929

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Salmonella enterica one of the most common causes of foodborne illness outbreaks. Although most serotypes are able to cause human disease, only about 20 of the over 2,500 identified Salmonella serotypes are typically associated with human disease. [1,2,3]. However, even serotypes that are infrequently reported can become significant threats to public health. For example, The Tennessee serovar has historically been uncommon among the Salmonella serotypes reported from food sources. In fact, the average reported cases of S. enterica Tennessee infection once represented only about 0.01% of all reported Salmonella serotypes [2]. Between 1994–2004, there were only 52 cases in which S. Tennessee was the main cause of foodborne infections [2], and only one outbreak of S. Tennessee infection, associated with powdered milk products and infant formula was reported to the Centers for Disease Control (CDC) in 1993 [4,5]. However, in November 2006, public health officials at CDC and state health departments detected a substantial increase in the reported incidence of isolates of Salmonella serotype Tennessee. As of May 22, 2007, a total of 628 persons infected with an outbreak strain of Salmonella serotype Tennessee had been reported from 47 states since August 1, 2006. In a multistate case-control study conducted during February 5–13, 2007, illness was strongly associated with consumption of either of two brands (Brand 1 and Brand 2) of peanut butter produced at the same plant [2,4,6,7]. Based on these findings, the plant ceased production and recalled both products on February 14, 2007 [6,8,9]. The outbreak strain of Salmonella Tennessee was subsequently isolated from several opened and unopened jars of Brand 1 and Brand 2 peanut butter and from environmental samples obtained from the plant. New case reports decreased drastically after the product recall. In 2008–2009, a second national outbreak associated with peanut butter occurred. In these cases the peanut butter was found to have been contaminated with Salmonella Typhimurium. Larger numbers of children were infected in these later cases [6,10]. Interviews conducted with infected patients revealed that the outbreak occurred within 3 large institutions (2 care facilities and 1 elementary school) where the patients ate their meals [6]. Further investigation and review of food menus revealed a common food source eaten by infected patients [6]. Interestingly, during this outbreak investigation, CDC’s PulseNet identified and confirmed the presence of Salmonella serotypes other than Typhimurium in both food and environmental samples. Further investigation determined that an S. Tennessee isolate detected during this second outbreak had a pulse-field gel electrophoresis (PFGE) pattern that was indistinguishable from those S. Tennessee outbreak strains found during the 2006–2007 outbreaks, obtained from unopened and opened jars of one of the same brands of peanut butter. These findings suggested a possible association between the two outbreaks, despite being separated by an approximately two-year time frame [6,10]. Interestingly, the two implicated production plants are located approximately 70 km from one another. However, in the later outbreak the S. Tennessee strains were not directly associated with human illness [6,10]. If one accepts a common-source hypothesis of the S. Tennessee serovars in these outbreaks, it demonstrates not only the potential for widespread illness arising from locally contaminated products which are then broadly distributed, but also the possibility of illnesses arising from bacterial serovars that have not been previously implicated in major foodborne illness outbreaks in the United States. From what is known about the ability of Salmonella to thrive in particular environments, this hypothesis is reasonable. These organisms may contaminate peanuts during growth, harvest, or storage, and are able to survive high temperatures in a high-fat, low-water environment [11]. Therefore, although peanut butter typically undergoes heat treatment up to temperatures >158°F (>70°C), such heating may not always eliminate salmonellae [12]. It is also possible that processed peanut butter may be contaminated by bacteria that enter the production environment after heat treatment is complete, through raw peanuts or other sources, such as animals in the production plant. The bacteria may be brought into the plant on containers, humans from the outside environment, or other ingredients used to make peanut butter. These outbreaks suggest that the contamination of processed foods can occur after a heat-treatment step, underscoring the need for additional preventive controls in food-processing plants, and ongoing food safety surveillance. Establishing an association between possible sources of food contamination and clinical isolates requires discriminating the suspected pathogen from the environmental background, and distinguishing it from other closely-related foodborne pathogens [13–16, 17–21]. The accurate subtyping and subsequent clustering of bacterial isolates associated with a foodborne outbreak event is important for a successful epidemiological investigation and the eventual traceback to a specific food or environmental source. However, phylogenetically closely related strains from a phylogenetic perspective can confound these investigations because of the limited genetic differentiation among serovars, such as Salmonella Enteritidis [22–29, 30]. Therefore, to provide a more rigorous analysis of the diversity found within these outbreaks, we performed the first whole genome DNA sequence analysis of S. Tennessee outbreak strains, and proceeded to perform a detailed phylogenetic analysis. We performed whole genome shotgun sequencing (WGS) on isolates related to the S. Tennessee-peanut butter outbreak and other isolates derived from the same serovar. Samples of S. Tennessee obtained from cilantro food sources were sequenced for comparative purposes. Whole genome shotgun sequencing is an emerging molecular epidemiological tool [30-34]. Recent studies have shown that the voluminous amount of DNA sequence data accumulated via WGS can be used to distinguish among very closely related isolates, far beyond what close inspection of PFGE patterns and MLVA typing can reveal [30]. Further, WGS can identify the nature of the specific molecular difference(s) among sets of isolates, leading to the identification of characteristics that can be placed onto phylogenetic trees to show evolutionary relationships among the taxa under scrutiny. The phylogenetic trees can also serve the purpose of showing, in graphical form, the scale of the evolutionary distances between isolates that have different PFGE patterns. In order to evaluate how WGS could assist in the identification of these isolates, we generated one closed genome sequence and 70 draft genomes of S. Tennessee isolates, including 28 isolates with two different PFGE patterns (JNXX01.0011 and JNXX01.0010) from the peanut butter outbreak, four related historical clinical isolates, eight environmental isolates with matching PFGE JNXX01.0011 profiles, three internal isolates, and 28 background isolates to establish the phylogenetic context of the diversity. Fig 1 shows the genome organization while Fig 2 depicts the phylogenetic results from these analyses.
Fig 1

Whole genome alignment showing placement of mobile elements (outer rings) in representative samples of this study, GC skew (inner ring) and GC content by strand (second ring).

Fig 2

Cladogram of S. Tennessee serovar diversity showing major clades C1-C4, and the number of SNPs (in green) defining and residing within each clade.

Materials and Methods

Growth of bacterial strains, and genomic and plasmid DNA isolation

Genomic DNA was isolated from overnight cultures as follows: each initial pure culture sample was taken from frozen stock, plated on Trypticase Soy Agar, and incubated overnight at 37°C. After incubation, cells were taken from the plate and inoculated into Trypticase Soy Broth and cultured for DNA extraction. All samples were representative cultures from a full-plate inoculation and were not single colonies. Genomic DNA was extracted using Qiagen DNeasy kits. The cilantro samples were provided through the U.S. Department of Agriculture (USDA) Microbiological Data Program (MDP). Samples collected in Michigan, Florida, New York, Ohio, and Washington were shipped overnight at room temperature and processed immediately upon receipt for the presence of S. enterica. Cilantro was weighed into sterile Whirl-Pak bags, 100 g per sample, and 500 ml of modified Buffered Peptone Water (mBPW) [35] was added to each bag. The samples were manually mixed for 2 min and then incubated overnight at 37°C. The overnight enrichment cultures were subcultured into Tetrathionate Broth (TB) and Rappaport-Vassiliadis (RV) media and incubated according to the Bacteriological Analytical Manual (BAM) Chapter 5 Salmonella [36]. Following overnight incubation the TB and RV cultures were streaked onto Hektoen Enteric (HE), Xylose-Lysine-Tergitol 4 (XLT-4), and Bismuth Sulfite (BS) agar plates and the plates were incubated overnight at 37°C. Colonies demonstrating typical S. enterica morphology on each selective agar plate were subcultured onto 5% Sheep Blood Agar (SBA) plates for further characterization. Colonies from SBA plates were confirmed as Salmonella using the Vitek® 2 Compact. The serotype was determined using the Premitest ® following the manufacturer’s instructions and a PCR serotyping method [37]. The PFGE pattern for each isolate was also determined using the CDC method for S. enterica.

Library construction and genome sequencing

For this study, 71 S. Tennessee isolates from a variety of sources were sequenced. Of these, 42 isolates were shotgun sequenced using the Roche 454 GS Titanium NGS technology [38]. Each isolate was run on one quarter of a Titanium plate, producing roughly 250,000 reads per draft genome and providing an average genome coverage of ~20X. Illumina MiSeqTM was used to sequence 28 isolates. The remaining isolate served as our reference for mapping; it was used to prepare a single 10 kb library following the Pacific Biosciences sample preparation methods for C2 chemistry. That 10 kb library was then sequenced using PacBio RS II on 4 single-molecule real-time (SMRT) cells using a 120-minute collection protocol, which provided a closed genome with an average genome coverage of > 200X. Our taxon sampling also included one S. Kentucky and one S. Cubana genome (Table 1), which were sequenced using Roche 454 GS Titanium and Illumina MiseqTM chemistries, respectively. These two Salmonella serotypes, Cubana (Genbank accession APAG0000000) and Kentucky (Genbank accession AOYZ00000000) had previously been shown to be close relatives to S. Tennessee [39], and hence served as outgroups in this study.
Table 1

Isolates and metadata from the peanut butter-derived sources, including samples obtained from outbreak-associated foods and clinical samples.

Metadata associated with isolates in this studyAccession no(s)
Tree LabelSalmonella enterica subsp. enterica Serovar and StrainCollect-ion locationIsolation sourcePFGE pattern (primary enzyme)PFGE pattern (secondary enzyme)Plat_ formBio-ProjectWGSSRA
1339_Fishmeal_2008_USA:TX(PB)Tennessee str. CFSAN001339USA:TXfishmealJNXX01.0011Pacbio65249AOXR00000000SRR955294
1340_Lamb-based_Meal_2008_NewZealand(PB)Tennessee str. CFSAN001340New Zealandmeat and bonemeal, lamb-basedJNXX01.001145465247AOXQ00000000SRR955295
1341_Poultry_Feathermeal_2008_USA:TX(PB)Tennessee str. CFSAN001341USA:TXpoultry feathermeal, hydrolizedJNXX01.0011454183850SRR1048310
1342_Fishmeal_2009_USA:TX(PB)Tennessee str. CFSAN001342USA:TXfishmealJNXX01.0011454183848SRR1048336
1343_Cotton_Seed_Hulls_2009_USA:TX(PB)Tennessee str. CFSAN001343USA:TXcotton seed hullsJNXX01.0011454183851SRR1048291
1344_Soy_Bean_Hull_Pellets_2009_USA:TX(PB)Tennessee str. CFSAN001344USA:TXsoy bean hull pelletsJNXX01.0011454183847SRR1048289
1345_Peanut_Butter_2007_USATennessee str. CFSAN001345USApeanut butter45466695
1346_Environmental_2006_USA:GA(PB)Tennessee str. CFSAN001346USA:GAenvironmentalJNXX01.0011454183850
1347_Environmental_2007_USA:GA(PB)Tennessee str. CFSAN001347USA:GAenvironmentalJNXX01.0011454183848 
1348_Clinical_2004_USA:IA(PB)Tennessee str. CFSAN001348USA:IAstoolJNXX01.0011454183851SRR1048318
1349_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001349USA:IAstoolJNXX01.0011JNXA26.0001454183847SRR1048241
1350_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001350USA:IAstoolJNXX01.0011JNXA26.0001454183850SRR1048273
1351_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001351USA:IAstoolJNXX01.0011454183848SRR1048258
1352_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001352USA:IAmissingJNXX01.0011JNXA26.0001454183851
1353_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001353USA:IAurine sampleJNXX01.0011JNXA26.0001454183847SRR1048254
1354_Clinical_2006_USA:IA(PB)Tennessee str. CFSAN001354USA:IAstoolJNXX01.0011JNXA26.0001454183850SRR1048250
1355_Peanut_Butter_2007_USA:IA(PB)Tennessee str. CFSAN001355USA:IApeanut butterJNXX01.0011454183848SRR1048333
1365_Clinical_2004_USA:MA(PB)Tennessee str. CFSAN001365USA:MAstool sampleJNXX01.0010454183851SRR1048252
1366_Clinical_2005_USA:MA(PB)Tennessee str. CFSAN001366USA:MAstool sampleJNXX01.0026454183847SRR1048329
1367_Clinical_2006_USA:MA(PB)Tennessee str. CFSAN001367USA:MAwoundJNXX01.0011JNXA26.0001454183850SRR1048246
1368_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001368USA:GApeanut butterJNXX01.0010454183848SRR1048300
1369_Environmental_2007_USA:GA(PB)Tennessee str. CFSAN001369USA:GAenvironmentalJNXX01.0011454183851SRR1048323
1370_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001370USA:GApeanut butterJNXX01.0011454183847SRR1048330
1371_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001371USA:GApeanut butterJNXX01.0011454183850
1372_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001372USA:GApeanut butterJNXX01.0011454183848SRR1048334
1373_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001373USA:GApeanut butterJNXX01.0011454183851SRR1048312
1374_Clinical_2006_USA:MA(PB)Tennessee str. CFSAN001374USA:MAstool sampleJNXX01.0011JNXA26.0001454183847SRR1048255
1375_Clinical_2006_USA:MA(PB)Tennessee str. CFSAN001375USA:MAurine sampleJNXX01.0011JNXA26.0001454183850SRR1048626
1376_Clinical_2004_USA:TN(PB)Tennessee str. CFSAN001376USA:TNhumanJNXX01.0011454183848SRR1048242
1377_Peanut_Butter_From_Patient_2007_USA:TN(PB)Tennessee str. CFSAN001377USA:TNpeanut butter from sick patientJNXX01.0011JNXA26.0001454183851SRR1048238
1378_Peanut_Butter_From_Patient_2007_USA:KY(PB)Tennessee str. CFSAN001378USA:KYpeanut butter from sick patientJNXX01.0011JNXA26.0001454183847SRR1048247
1379_Peanut_Butter_2009_USA:GA(PB)Tennessee str. CFSAN001379USA:GApeanut butterJNXX01.0011JNXA26.0001454183850SRR1048331
1380_Blood_Meal_2010_USA:NY(PB)Tennessee str. CFSAN001380USA:NYblood mealJNXX01.0011454183848SRR1048311
1381_Hydrolyzed_Vegetable_Protein_Powder_2010_USA:NV(PB)Tennessee str. CFSAN001381USA:NVhydrolyzed vegetable protein powderJNXX01.0189JNXA26.0016454183851SRR1048335
1382_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001382USA:GApeanut butterJNXX01.0011454183847SRR1048290
1383_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001383USA:GApeanut butterJNXX01.0011454183850SRR1048315
1384_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001384USA:GApeanut butterJNXX01.0010454183848SRR1048298
1385_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001385USA:GApeanut butterJNXX01.0011454183851SRR1048285
1386_Environmental_2006_USA:GA(PB)Tennessee str. CFSAN001386USA:GAenvironmentalJNXX01.0011454183847SRR1048622
1387_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001387USA:GApeanut butterJNXX01.0010454183850SRR1048319
1388_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001388USA:GApeanut butterJNXX01.0011454183848SRR1048236
1389_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001389USA:GApeanut butterJNXX01.0011454183851SRR1048305
1390_Peanut_Butter_2007_USA:GA(PB)Tennessee str. CFSAN001390USA:GApeanut butterJNXX01.0011454183847SRR1048239
2961_Produce_Field_Drag_Swab_2012_USA:NYTennessee str. CFSAN002961USA:NYproduce field—drag swab Miseq183850SRR1012301
3018_Peanut_Butter_2007_USA:NYTennessee str. CFSAN003018USA:NYpeanut butterJNXX01.0011Miseq183850SRR949423
3019_Peanut_Butter_2007_USA:NYTennessee str. CFSAN003019USA:NYpeanut butterJNXX01.0011Miseq183850SRR949424
3170_Cilantro_2011_USA:FLTennessee str. CFSAN003170USA:FLcilantro Miseq186035SRR1033577
3171_Cilantro_2011_USA:FLTennessee str. CFSAN003171USA:FLcilantro Miseq186035SRR1033550
3172_Cilantro_2011_USA:FLTennessee str. CFSAN003172USA:FLcilantro Miseq186035SRR952679
3173_Cilantro_2011_USA:FLTennessee str. CFSAN003173USA:FLcilantro Miseq186035SRR1033559
3174_Cilantro_2011_USA:FLTennessee str. CFSAN003174USA:FLcilantro Miseq186035SRR1033540
3175_Cilantro_2011_USA:FLTennessee str. CFSAN003175USA:FLcilantro Miseq186035SRR1033508
3176_Cilantro_2011_USA:FLTennessee str. CFSAN003176USA:FLcilantro Miseq186035SRR1033543
3186_Cilantro_2011_USA:NYTennessee str. CFSAN003186USA:NYcilantro Miseq186035SRR1043945
3187_Cilantro_2011_USA:NYTennessee str. CFSAN003187USA:NYcilantro Miseq186035SRR1041887
3188_Cilantro_2011_USA:NYTennessee str. CFSAN003188USA:NYcilantro Miseq186035SRR1043946
3189_Cilantro_2011_USA:NYTennessee str. CFSAN003189USA:NYcilantro Miseq186035SRR1033469
3190_Cilantro_2011_USA:NYTennessee str. CFSAN003190USA:NYcilantro Miseq186035SRR1036442
3191_Cilantro_2011_USA:NYTennessee str. CFSAN003191USA:NYcilantro Miseq186035SRR1036432
3192_Cilantro_2011_USA:NYTennessee str. CFSAN003192USA:NYcilantro Miseq186035SRR1033568
3193_Cilantro_2011_USA:NYTennessee str. CFSAN003193USA:NYcilantro Miseq186035SRR1041894
3194_Cilantro_2011_USA:NYTennessee str. CFSAN003194USA:NYcilantro Miseq186035SRR952681
3196_Cilantro_2011_USA:NYTennessee str. CFSAN003196USA:NYcilantro Miseq186035SRR1033564
3197_Cilantro_2011_USA:NYTennessee str. CFSAN003197USA:NYcilantro Miseq186035SRR1033473
3198_Cilantro_2011_USA:NYTennessee str. CFSAN003198USA:NYcilantro Miseq186035SRR1033528
3199_Cilantro_2012_USA:OHTennessee str. CFSAN003199USA:OHcilantro Miseq186035SRR1033579
3202_Cilantro_2012_USA:OHTennessee str. CFSAN003202USA:OHcilantro Miseq186035SRR1036448
3203_Cilantro_2012_USA:OHTennessee str. CFSAN003203USA:OHcilantro Miseq186035SRR1036443
5186_Celery_Stalk_Leaf_2011_USA:ILTennessee str. CFSAN005186USA:ILcelery stalk and leafJNXX01.0112Miseq186035SRR1048302
5226_Environmental_Swab_2010_USA:UTTennessee str. CFSAN005226USA:UTswab JNXA26.0001Miseq186035SRR1049678
5302_Sunflower_Kernels_2010_ChinaTennessee str. CFSAN005302Chinasunflower kernelsJNXX01.0002Miseq186035SRR1049693
1337_S.KentuckyKentucky str. CFSAN001337USA:PAfecal sample45466693AOYZ00000000SRR955293
1083_S.CubanaCubana str. CFSAN001083Philippinesdessicated coconut Miseq167394APAG00000000SRR955257
Libraries were constructed from cilantro-derived samples using the Nextera XT DNA sample preparation kit (Illumina, San Diego, CA), and whole-genome sequencing was performed on a MiSeqTM benchtop sequencer (Illumina, San Diego, CA), using 500-cycle paired-end reagent kit v2.

Genome assembly and annotation

De novo assemblies were created for each isolate, using Roche Newbler package (v. 2.6), CLC Genomic Workbench 6.5.1, and SMRT analysis 2.0.1, for isolates sequenced by 454, MiseqTM, and PacBio, respectively. All draft genomes were annotated using NCBI’s Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP, [40]). The reference genome used for mapping reads was CFSAN001339, which is comprised of 1 single circular chromosome. Hence, positional information is specific for the reference. (GenBank accession: CP007505). Phylogenetic trees were constructed using GARLI [41, 42] under the maximum likelihood criterion. The phylogenetic tree in Fig 2 was constructed using GARLI under the GTR + gamma model of nucleotide evolution. Phylogenetic analyses of the data set, including multiple outgroups, were performed on the concatenated SNP matrix described above.

Phylogenomic analysis

The raw reads of each sample were mapped to the closed reference genome, CFSAN001339, using Novoalign V2.08.02 (http://www.novocraft.com), and the variants were called using SAMtools and stored in a VCF file [43]. A custom Python script was used to read through each VCF file and construct a SNP matrix for further phylogenetic analyses, as follows. First, we estimated the site SNP allele frequencies of the strongest non-reference allele [43] and placed them into a list by collecting all of the instances which met the criteria of being present at positions in the reference where one or more isolates differed with a read depth ≥10 and an allele frequency equal to one. Insertions and deletions (indels) in VCF files were ignored. Second, pileup files were generated for each isolate based on the above-mentioned list to determine the appropriate nucleotide state for positions in the list for each isolate based on the following rules: a) if there was no mapped reads at a position it was treated as missing data; b) if different nucleotides were called at the position, the one with frequency larger than 50% was the consensus call for that position; and c) if different nucleotides were called at a position but none had a frequency larger than 50%, that position for that individual isolate was coded as missing data. Third, the mapped consensus base for each isolate at the reference SNP positions were concatenated in a multiple FASTA file for phylogenetic analysis. The maximum likelihood (ML) tree was constructed using GARLI [41,42] with 200 ML replicates and 1000 bootstrap replicates. All GARLI analyses were performed with the default parameter settings and the GTR+gamma nucleotide substitution model. Detailed descriptions of the data analysis pipeline is available [44] as well as github (see https://github.com/CFSAN-Biostatistics/snp-mutator).

Accessions

The whole genome shotgun accessions (WGS), Bioproject accession numbers, and metadata for all the isolates sequenced in this study are listed in Table 1. The NCBI accession numbers for the comparative plasmids discussed herein are: Citrobacter freundii plasmid pCAV1741-110 (CP011655); S. Typhi plasmid pHCM2 (AL513384); and Yersinia pestis pMT (CP010021).

Results

Genome Size, Order and Conservation

We present new draft genomes for 73 Salmonella isolates including CFSAN001337, and CFSAN001083, closely related outgroups, S. Kentucky and S. Cubana, respectively (Table 1). While synteny and genome organization among these isolates was largely conserved, genome size differences were observed due to variations in the presence or absence of several phages and plasmids. Phylogenomic analysis of the S. Tennessee data set, including multiple serovars, was performed on the set of SNPs obtained from the analysis described in the methods. We used the resultant phylogenetic trees to make hypotheses about both the evolution of S. Tennessee subtypes and the outbreak strains and also to support traceback investigations. A list of genes from which the SNPs that characterize the S. Tennessee clade were derived is provided in Table 2. A representative SNP from each of these genes is also provided in the table along with the subgroup that it defines the SNP base pair coordinates. Many of these genes were annotated previously with assigned names and functions; however, additional regions that provided signature SNPs are hypothetical and, as such, are cross-referenced by locus tags only. It is notable that a partial and select set of SNPs from these genes are nonsynonymous, and many cluster two or more S. Tennessee subgroups together, as shown in Table 2 and Fig 1, and many are protein-altering in nature. These data are intriguing given an NGS report documenting positive selection among a significant subset of core genes in adapted Salmonella serovars [45].
Table 2

Annotation of clade-specific SNPs found in serovar S. Tennessee.

LocationAccessionAnnot.Locus_tagPos. in codingNuc. changeAmino acid changeSyn/NonStrandProduct name
Clade C1
1361235CP007505codingSEET0819_0639522GAT->TATD->YN-transaldolase
Node1
1042280CP007505codingSEET0819_0494514TCG->TTGS->LN+phosphate-starvation-inducible protein PsiE
1486959CP007505codingSEET0819_069901203CTG->CTAL->LS+chitinase
248590CP007505codingSEET0819_012602258CAC->CTCH->LN-maltodextrin phosphorylase
4310906CP007505codingSEET0819_20670799GAA->TAAE->StopN-hypothetical protein
Clade C2
1261850CP007505codingSEET0819_05950646CGC->AGCR->SN+UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase
1325788CP007505codingSEET0819_06255711CTG->CTAL->LS-multifunctional aminopeptidase A
1559936CP007505intergenicC->T
1659427CP007505codingSEET0819_07735609ATG->ATAM->IN+multicopper oxidase
2013944CP007505codingSEET0819_09365212CCG->CTGP->LN+hypothetical protein
266867CP007505codingSEET0819_01325771CCG->CCAP->PS+ribokinase
2745712CP007505codingSEET0819_12935360GCC->GCTA->AS+peptide ABC transporter ATP-binding protein
2967521CP007505codingSEET0819_141051042GCT->ACTA->TN+transcriptional regulator
2977833CP007505codingSEET0819_14155400TTC->ATCF->IN-dimethyl sulfoxide reductase
3099086CP007505codingSEET0819_14720358GGT->AGTG->SN-XRE family transcriptional regulator
4290447CP007505codingSEET0819_206201930GGC->AGCG->SN+large repetitive protein
4415903CP007505intergenicT->G
4675774CP007505codingSEET0819_224301074TAC->TATY->YS+S-adenosylmethionine synthetase
4779893CP007505codingSEET0819_22980627CTG->CTAL->LS+disulfide oxidoreductase
4862200CP007505codingSEET0819_233701544AGC->AACS->NN-DEAD/DEAH box helicase
968881CP007505codingSEET0819_045702709CTG->CTAL->LS+DNA-directed RNA polymerase subunit beta
Clade C3
213029CP007505codingSEET0819_011152210GAT->GTTD->VN+transcription accessory protein
224483CP007505codingSEET0819_011651533CTG->CTTL->LS-maltose phosphorylase
281038CP007505codingSEET0819_01400142TGG->TGTW->CN-leucine/isoleucine/valine transporter permease subunit
401342CP007505codingSEET0819_018903GTG->GTAV->VS-hypothetical protein
416161CP007505codingSEET0819_0196057CTG->CTAL->LS+bifunctional glyoxylate/hydroxypyruvate reductase B
451802CP007505codingSEET0819_02130196ACG->CCGT->PN+xylulose kinase
492375CP007505codingSEET0819_02295648TTC->TTTF->FS-glycosyl transferase
516347CP007505intergenicC->A
702891CP007505intergenicA->G
766356CP007505codingSEET0819_03620777ACC->ACAT->TS+phospholipase A
972049CP007505codingSEET0819_04585561GGA->GGTG->GS+
1049743CP007505codingSEET0819_0497548CTG->CTAL->LS+maltose-binding protein
1065827CP007505codingSEET0819_05055429TTC->TTTF->FS+aromatic amino acid aminotransferase
1188083CP007505codingSEET0819_05585428TAT->TGTY->CN-fumarate reductase
1201717CP007505intergenicT->A
1255014CP007505codingSEET0819_059202699GAG->GGGE->GN+hypothetical protein
1271868CP007505codingSEET0819_05995351GGA->GGGG->GS+inosose dehydratase
1315853CP007505codingSEET0819_0621094ACC->CCCT->PN+toxin-antitoxin biofilm protein TabA
1403676CP007505codingSEET0819_065801151CCC->CACP->HN+sigma-54 dependent transcriptional regulator
1583841CP007505codingSEET0819_07400275TAT->TTTY->FN+transcriptional regulator
1623141CP007505codingSEET0819_075602526ATG->ATAM->IN+preprotein translocase subunit SecA
2002950CP007505intergenicT->G
2293435CP007505codingSEET0819_10755774GAG->GAAE->ES+LysR family transcriptional regulator
2402138CP007505codingSEET0819_11250447GAG->GACE->DN+peptidase M15
2462555CP007505codingSEET0819_11600552CTG->CTTL->LS-hypothetical protein
2478416CP007505codingSEET0819_116601670CCT->CTTP->LN+Clp protease ClpX
2488851CP007505codingSEET0819_11710154CTG->TTGL->LS+leucine-responsive transcriptional regulator
2547852CP007505codingSEET0819_11930537ATG->ATAM->IN+amino acid:proton symporter
2560677CP007505codingSEET0819_11970593GAT->GCTD->AN+paraquat-inducible membrane protein A
2592656CP007505codingSEET0819_121551222GCC->CCCA->PN-Pyoverdin chromophore biosynthetic protein pvcC
2693215CP007505codingSEET0819_12675287GTA->GCAV->AN+membrane protein
2823636CP007505codingSEET0819_13360692CTC->CACL->HN+cyclic di-GMP regulator CdgR
2901526CP007505codingSEET0819_13765303GTG->GTTV->VS+secretion system apparatus protein SsaU
2902776CP007505codingSEET0819_137801352ATT->AATI->NN-multidrug transporter
2915852CP007505intergenicA->G
2926925CP007505codingSEET0819_13915214TAC->CACY->HN-glutathione S-transferase
2990295CP007505codingSEET0819_14220633GAC->GAAD->EN-malonic semialdehyde reductase
3062233CP007505codingSEET0819_14570559ACG->GCGT->AN+TetR family transcriptional regulator
3141458CP007505codingSEET0819_149301564GTG->TTGV->LN-hypothetical protein
3166809CP007505codingSEET0819_15030319TTT->ATTF->IN+hypothetical protein
3288278CP007505codingSEET0819_15625248CGC->CACR->HN-peptide chain release factor 1
3353430CP007505codingSEET0819_1595524AAG->AAAK->KS-transcriptional regulator
3461711CP007505codingSEET0819_1657042CAG->CAAQ->QS-glycosyl hydrolase family 88
3556791CP007505codingSEET0819_17120110ACT->AATT->NN-acyl carrier protein
3564805CP007505intergenicT->C
3631271CP007505codingSEET0819_17515656ACC->ATCT->IN+imidazoleglycerol-phosphate dehydratase
3835332CP007505codingSEET0819_18455244TCT->GCTS->AN+transcriptional regulator
3976686CP007505intergenicT->C
3977014CP007505codingSEET0819_19075177TCG->TCTS->SS+integrase
3981933CP007505codingSEET0819_19095732GCT->GCAA->AS-hypothetical protein
3981936CP007505codingSEET0819_19095729CCG->CCAP->PS-hypothetical protein
3981954CP007505codingSEET0819_19095711ACG->ACTT->TS-hypothetical protein
3981966CP007505codingSEET0819_19095699GCC->GCAA->AS-hypothetical protein
3981968CP007505codingSEET0819_19095697GCC->TCCA->SN-hypothetical protein
3998142CP007505codingSEET0819_19180105GGG->GGAG->GS-hypothetical protein
4007005CP007505intergenicT->G
4007065CP007505intergenicA->C
4007316CP007505intergenicC->T
4007326CP007505intergenicA->G
4007341CP007505intergenicG->A
4007389CP007505intergenicA->G
4007514CP007505intergenicG->A
4007522CP007505intergenicT->C
4007523CP007505intergenicA->C
4007528CP007505intergenicA->G
4007551CP007505intergenicA->G
4007874CP007505codingSEET0819_19270528GAT->GAAD->EN-replication protein
4008036CP007505codingSEET0819_19270366TCC->TCAS->SS-replication protein
4008048CP007505codingSEET0819_19270354AAC->AATN->NS-replication protein
4008441CP007505intergenicG->A
4008502CP007505intergenicA->G
4008537CP007505intergenicT->G
4010438CP007505codingSEET0819_19300124GGC->TGCG->CN+hypothetical protein
4010464CP007505codingSEET0819_19300150CAT->CAGH->QN+hypothetical protein
4010470CP007505codingSEET0819_19300156GTC->GTTV->VS+hypothetical protein
4010508CP007505codingSEET0819_19300194TCG->TTGS->LN+hypothetical protein
4010580CP007505codingSEET0819_19300266GCT->GTTA->VN+hypothetical protein
4010593CP007505codingSEET0819_19300279GGA->GGGG->GS+hypothetical protein
4012278CP007505codingSEET0819_1933545TTC->TTTF->FS+regulatory protein
4012281CP007505codingSEET0819_1933548TAC->TATY->YS+regulatory protein
4324886CP007505intergenicG->A
4669017CP007505intergenicT->A
Clade C4
1068806CP007505codingSEET0819_05080360AAT->AAAN->KN-lipoprotein
1935930CP007505codingSEET0819_08985966AAT->AACN->NS+S-adenosylmethionine:tRNA ribosyltransferase-isomerase
3265625CP007505codingSEET0819_15515211GGG->AGGG->RN+hypothetical protein
Node2
1001360CP007505codingSEET0819_04720856GCG->TCGA->SN+isocitrate dehydrogenase
1086809CP007505codingSEET0819_051158047GAT ->AATD->NN+membrane protein
1389803CP007505codingSEET0819_065401764ATT->ATCI->IS-type I restriction-modification protein subunit S
2186959CP007505codingSEET0819_1020529TCT->TTTS->FN-LPS biosynthesis protein
314299CP007505codingSEET0819_01560115GTC->ATCV->IN+copper resistance protein
331099CP007505intergenicG->A
334506CP007505intergenicA->T
4620448CP007505intergenicC->T

Genetic Variation within the Tennessee serovar

As shown in Table 1, the isolates derived from the peanut butter-derived sources, including samples obtained from outbreak-associated foods and clinical samples, were observed to have distinct PFGE profiles. The S. Tennessee isolates from the 2006–2007 outbreak displayed four closely related primary (XbaI-derived) PFGE patterns: JNXX01.0010, JNXX01.0011, JNXX01.0026. [2,6,10]. Secondary patterns (BlnI-derived) for PFGE type JNXX01.0011 were all classified as JNXA26.0001. A set of non-peanut butter-derived S. Tennessee isolates also exhibited one of the same PFGE patterns as found in the peanut butter-derived samples: JNXX01.0011. Samples in this set included isolates from fishmeal (CFSAN001339 and CFSAN001342); lamb from New Zealand (CFSAN001340), poultry (CFSAN001341), cotton seeds (CFSAN001343), and soy beans (CFSAN001344). Other S. Tennessee serovar isolates included in this study that came from non-peanut butter sources also exhibited different PFGE patterns; for example: celery (CFSAN005186, PFGE pattern JNXX01.0112); an environmental swab, (CFSAN005226, PFGE pattern JNXA26.0001); sunflower kernels from China (CFSAN005302, PFGE pattern JNXX01.0002); hydrolyzed vegetable protein powder (CFSAN001381, primary PFGE pattern JNXX01.0189, a secondary PFGE pattern of JNXA26.0016; this isolate also carried a 30kb phage PsP3, discussed further below). The PFGE patterns for all 22 S. Tennessee isolates obtained from cilantro were identical (JNXX01.0011). Analyses of these whole genome sequences revealed that all 22 cilantro isolates of S. Tennessee formed a distinct group. Our PFGE and WGS analyses suggest a common source for these isolates, even though the isolates were collected from 3 states. Cilantro is typically grown in only 2 or 3 areas of the country and provided to the consumer through a complex distribution network, such that the state of collection for this study may not be the state where the cilantro was grown. Further examination of this distribution network revealed that eight of these isolates originated in California, and five originated in Mexico; the origin of the remaining nine could not be determined. A recent report on the potential enhanced virulence of the peanut butter-derived Salmonella isolates [46] led us to compare the genic origin of the SNPs found within the peanut butter-derived strains in our study to the SNPs found in isolates obtained from non-peanut butter sources (Table 2). Many of the observed SNP differences were non-synonymous, coding for amino-acid changes. Further investigation is needed to determine whether or not these coding changes result in virulence changes. Cluster analyses also revealed 13 isolates with the same PFGE pattern as the most common pattern in this outbreak (JNXX01.0011) that do not belong in the outbreak clade. These isolates include those collected from the 2008 peanut butter outbreak, three clinical isolates from MA, two clinical isolate from IA, and seven isolates from animal feed. Additionally, eight of the 13 clinical and two of the environmental isolates in this study are in the outbreak clade. None of the SNPs we identified in this study were specific to clinical or environmental sources. It is noteworthy that no increases in substitutions were identified among the isolates that passed through patients compared to their environmental sources. Had there been an increase or expansion in genetic diversity among the clinical isolates we studied in comparison to isolates collected from other food and environmental sources, we would have expected that genetic diversity to have been visible as longer branch lengths among the terminal tree nodes leading back to the clinical isolates found in the tree.

Phylogenetic analysis

The phylogenetic tree arising from this analysis is depicted in Fig 2. For discussion purposes, we have identified four intra-serovar clades, C1-C4. C1 consists of 31 isolates, all closely related, containing both clinical and environmental sources, and each separated by a single SNP. The node (node 1) defining this clade consists of four unique SNPs. C2 is a small clade of three isolates defined by 16 SNPs. C3 contains 22 isolates, differentiated by 82 total SNPs. All of the C3 isolates were obtained from a cilantro food source. Node 2, defining clades C2-C4, contains 8 unique SNPs. The Tennessee clade is identified by a total of 1,153 SNPs, most of which (1,061) map to the long branch separating the outgroups from the Tennessee-specific isolates. Interestingly, the singleton-containing branches consisting of isolate numbers 1381, 2961, and 5226 all contain large mobile elements.

Specific Genes and SNP-based genetic variation defining the Tennessee serovar

A total of 114 SNPs were found in S. Tennessee genes, and including representatives from each of the four S. Tennessee clades (Table 2). Although many of these changes are synonymous, many others are non-synonymous (discussed further below). Similar to earlier studies, we observed changes in the S. enterica multicopper oxidase gene, (locus tag SEET0819_07735, position 609), a gene reported to harbor many changes within S. Enteritidis strains. Although the gene and protein alignments show many of the same non-synonymous SNP differences that appear in all the S. Tennessee isolates we examined [21], we also identified a change in the S. Tennessee serovars at genome position 1659427, resulting in a M-I amino acid change in the multicopper oxidase gene. Other non-synonymous SNP changes affected genes involved in redox-type chemical reactions. In particular, we found an F-I change in the dimethyl sulfoxide reductase gene at position 2977833; this is a molybdenum-containing enzyme capable of reducing dimethyl sulfoxide (DMSO) to dimethyl sulfide (DMS). This enzyme serves as the terminal reductase under anaerobic conditions in some bacterial species, with DMSO serving as the terminal electron acceptor. At genome position 1261850 there was a change from R-S within the UDP-N-acetylmuramate: L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase gene, a gene involved in peptidoglycan recycling that reutilizes the intact tripeptide L-alanyl-gamma-D-glutamyl-meso-diaminopimelate by linking it to UDP-N-acetylmuramic acid. At position 136125 we found a change from D-Y in the transaldolase gene, an enzyme of the non-oxidative phase of the pentose phosphate pathway. Eleven non-synonymous SNPs fell within hypothetical proteins (at positions 4310906, 2013944, 3265625, 1255014, 3141458, 3166809, 3981968, 4010438, 4010464, 4010508 and 4010580). One SNP mapped to a lipoprotein (1068806), while another fell within a large repetitive protein (4290447). Many of SNPs resulting in amino acid changes were involved in transcriptional regulation or DNA structural modifications related to gene expression. These include changes in the XRE family of transcriptional regulator genes at position 3099086; three generic transcriptional regulator changes at positions 2967521, 1583841, and 3835332, and a change at position 4862200, a S-N alteration in the DEAD/DEAH box helicase gene, a family of DNA-unwinding and RNA-processing proteins (Table 2).

Mobile Elements

Natural selection has been reported in Salmonella and appears to be a major component of the evolution of this pathogen [33, 47]. Some of the variable genes in Salmonella are found in the mobilome, consisting of phages and plasmids, which are often the most promiscuous portions of the bacterial genomes [31, 30, 48–50]. This evolutionary strategy could provide a mechanism whereby highly selected genes could be shaped by natural selection, and then be easily distributed among the members of a serotype and other, more distant, lineages through mobile genetic elements. We have also identified several new plasmids (Table 3) suggesting that whole genome sequencing will continue to provide novel information about the Salmonella genome. Genes contributing to virulence are often carried on mobile elements, therefore it is especially important to study these elements in pathogenic strains.
Table 3

Mobile elements and plasmids found in the S. Tennessee serovar.

 12kb_Insertion30kb_PhagePsP363kb_Insertion110kb_PhageSSU5260kb_PlasmidR478
CFSAN002961++
CFSAN001365+
CFSAN001368+
CFSAN001387+
CFSAN001381+
CFSAN005226+++

Notes: The 63kb_Insertion region was found only in CFSAN005226, which is a singleton in the tree.

CFSAN002961 also has a very large plasmid (similar to Serratia marcescens plasmid R478, 274762 bp) that is not shared by any other isolates. CFSAN002961 is also a singleton in the tree.

CFSAN001365 (2004 MA clinical), 1368 (2007 GA peanut butter), and 1387 (2007 GA peanut butter) are isolates from clade C1, and they seem to share a 110 kb phage (see text for further discussion), which is found to be similar to Salmonella phage SSU5 (103299 bp).

Notes: The 63kb_Insertion region was found only in CFSAN005226, which is a singleton in the tree. CFSAN002961 also has a very large plasmid (similar to Serratia marcescens plasmid R478, 274762 bp) that is not shared by any other isolates. CFSAN002961 is also a singleton in the tree. CFSAN001365 (2004 MA clinical), 1368 (2007 GA peanut butter), and 1387 (2007 GA peanut butter) are isolates from clade C1, and they seem to share a 110 kb phage (see text for further discussion), which is found to be similar to Salmonella phage SSU5 (103299 bp). We found five mobile elements within the Tennessee serovar (Table 3). CFSAN001365 (2004 MA clinical), CFSAN001368 (2007 GA peanut butter), and CFSAN001387 (2007 GA peanut butter) cluster together in clade C1, and they all share a 110 kb phage, which is found to be similar to Salmonella phage SSU5 (103,299 bp). This phage was originally described in S. enterica serovar Typhimurium, and its whole genome was sequenced and analyzed [51]. The double-stranded DNA genome of SSU5 encodes 130 open reading frames with one tRNA for asparagine. Genomic analysis revealed that SSU5 might be the phylogenetic origin of cryptic plasmid pHCM2, harbored by Salmonella Typhi CT18. Our investigation shows that this sequence shares 77% sequence similarity (query cover) with approximately 99% sequence identity with the Citrobacter freundii plasmid pCAV1741-110 and with S. Typhi plasmid pHCM2. Further, it shows some similarity (57%) with 90% sequence identity with the virulence-associated plasmid pMT from Yersinia pestis. Further investigation is warranted to determine whether or not this sequence is carried on a distinct plasmid in Salmonella. Table 3 lists the remaining mobile elements identified here in S. Tennessee, including a 12 kb insertion, a 30 kb PhagePsP3-like element, a 63 kb insertion, the previously mentioned 110 kb phage (SSU5-like), and a 260 kb plasmid R478-like mobile element [52]. Comparison of Figs 1 and 2 shows the relationship between the mobile elements and the phylogenetic signal which accompanies each.

Discussion

The phylogenomic analysis of the S. Tennessee serovar samples contained in this study demonstrates a number of important points that are relevant to foodborne outbreak investigations. First, these results continue to underscore the power of whole genome sequencing in outbreak investigations. Although in most cases PFGE patterns will provide sufficient resolution to determine the relationships between closely related isolates, in some cases additional resolution provides information that would not be available from PFGE patterns alone. Second, the power of genome sequencing leads to the identification of classes of SNPs and mobile elements that help us understand the molecular mechanisms of pathogen virulence. This knowledge will serve to establish new typing methods that are focused on particular genetic changes present in genomes, and may also lead to insights that will affect the development of treatments designed to protect human health. Like other molecular epidemiology studies of Salmonella employing genomic technologies [30-34], this work demonstrates that comparative WGS methods can be employed to clearly augment food contamination investigations by genetically linking the implicated sources of contamination with environmental and clinical isolates. The genomic evidence herein corroborates epidemiological conclusions from outbreak investigations based on statistical analysis and source tracking leads. However, with WGS, one can gain additional detailed micro-evolutionary knowledge within the associated outbreak and reference isolates; thus providing additional evidence linking implicated sources to some of the clinical isolates but not to others that might have initially been associated with this foodborne contamination. Moreover, the level of genetic resolution obtained using WGS methods permits delimiting the scope of an outbreak in the context of an investigation, even for the most genetically homogeneous salmonellae [30]. Phylogenetic evolutionary hypotheses can help us identify reliable diagnostic nucleotide motifs (SNPs, rearrangements, and gene presences) for detecting outbreak strains and understanding the mechanisms that drive the outbreak occurrences. These methods allow both the rapid characterization of the genomes of foodborne pathogenic bacteria and can help to identify the particular source of contamination in the food supply. Using the comparative WGS results and full genomic data reported here we can confirm that some clinical isolates collected during the time of the peanut butter contamination event have the same PFGE Pattern, JNXX01.0011, which has been linked to the implicated environmental isolates previously studied. Importantly, while most of the isolates collected during this time period that share a common PFGE pattern fall into the same clades (Fig 2) with the environmental isolates, several strains known to be unrelated to the outbreak, including historical isolates from earlier analyses, interrupt these lineages, indicating additional potential sources of contamination. Our results corroborate those from a previous study [30]. We found no apparent increase in substitutions among the clinical isolates that passed through patients compared to the environmental clones of those isolates. Fig 2 shows that both clinical and environmental peanut butter isolates cluster within the same clade, with no apparent differences attributable to human gastrointestinal passage. From the data presented, as well as from other published data on mobile elements, it would appear that the elements identified herein are not restricted to closely related isolates in the phylogenetic context. For example, a recently discovered Salmonella plasmid (pSEEE1729_15) has a DNA sequence similar to an E. coli 0157:H7 strain EC4115 [53], suggesting that parts of the mobilome may be transferred between enterobacterial species, while raising the possibility of new acquisitions into the S. Enteritidis pan genome [48]. Consistent with other studies, we did not find any distinctive differences between isolates recovered from food sources and those obtained from clinical samples. A further comparative analysis of the structure and gene organization in the mobile elements in the isolates recovered from peanut butter will be the subject of a subsequent paper. Mining the data of these novel S. Tennessee genomes should provide new genetic targets for pathogen detection by public health laboratories, and support investigations of outbreaks that consist of closely related Salmonella pathogens. Akin to earlier findings of NGS-based differentiation of S. Montevideo isolates associated with pepper and spiced meats [30-32], the signature genetic differences uncovered here will provide additional insight into what will likely remain a common pattern of S. Tennessee associated with the food supply. By identifying unique genetic patterns that can rapidly distinguish among multiple serotypes of closely related pathogens and PFGE types, WGS has become an invaluable tool for future molecular epidemiology investigations.

Conclusions

It appears that, at least in the case of Salmonella, the natural variation observed among strains is both stable and sufficient to allow for high-resolution traceback of food and clinical isolates using NGS. It will be interesting to see whether ample genomic diversity can drive similar outcomes in other problematic taxa and closely related Salmonella serotypes. By providing the phylogenetic context on which to interpret other facile subtyping approaches that focus on more rapidly evolving genetic markers such as MLVA, rep-PCR, and CRISPRs [6–10, 22] NGS can provide a novel suite of SNPs that will be critical to partitioning common Salmonella outbreak strains. Combined with phylogenetic analysis, WGS can illuminate the genetic and evolutionary diversity of important serovars of Salmonella and expand our understanding of the associated epidemiological pathways surrounding specific outbreak strains [28, 29, 31, 32].

CFSAN, strain names, assemblies, and WGS accession numbers.

(XLSX) Click here for additional data file.
  42 in total

1.  pJIE137 carrying blaCTX-M-62 is closely related to p271A carrying blaNDM-1.

Authors:  Sally R Partridge; Ian T Paulsen; Jonathan R Iredell
Journal:  Antimicrob Agents Chemother       Date:  2012-01-17       Impact factor: 5.191

2.  Salmonella Enteritidis strains from poultry exhibit differential responses to acid stress, oxidative stress, and survival in the egg albumen.

Authors:  Devendra H Shah; Carol Casavant; Quincy Hawley; Tarek Addwebi; Douglas R Call; Jean Guard
Journal:  Foodborne Pathog Dis       Date:  2012-02-03       Impact factor: 3.171

3.  Identification of a salmonellosis outbreak by means of molecular sequencing.

Authors:  E Kurt Lienau; Errol Strain; Charles Wang; Jie Zheng; Andrea R Ottesen; Christine E Keys; Thomas S Hammack; Steven M Musser; Eric W Brown; Marc W Allard; Guojie Cao; Jianghong Meng; Robert Stones
Journal:  N Engl J Med       Date:  2011-02-23       Impact factor: 91.245

4.  Whole-genome sequencing and social-network analysis of a tuberculosis outbreak.

Authors:  Jennifer L Gardy; James C Johnston; Shannan J Ho Sui; Victoria J Cook; Lena Shah; Elizabeth Brodkin; Shirley Rempel; Richard Moore; Yongjun Zhao; Robert Holt; Richard Varhol; Inanc Birol; Marcus Lem; Meenu K Sharma; Kevin Elwood; Steven J M Jones; Fiona S L Brinkman; Robert C Brunham; Patrick Tang
Journal:  N Engl J Med       Date:  2011-02-24       Impact factor: 91.245

5.  A phage-typing scheme for Salmonella enteritidis.

Authors:  L R Ward; J D de Sa; B Rowe
Journal:  Epidemiol Infect       Date:  1987-10       Impact factor: 2.451

6.  From the Centers for Disease Control and Prevention. Salmonella serotype Tennessee in powdered milk products and infant formula--Canada and United States, 1993.

Authors: 
Journal:  JAMA       Date:  1993-07-28       Impact factor: 56.272

7.  The complete nucleotide sequence of the resistance plasmid R478: defining the backbone components of incompatibility group H conjugative plasmids through comparative genomics.

Authors:  Matthew W Gilmour; Nicholas R Thomson; Mandy Sanders; Julian Parkhill; Diane E Taylor
Journal:  Plasmid       Date:  2004-11       Impact factor: 3.466

8.  High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi.

Authors:  Kathryn E Holt; Julian Parkhill; Camila J Mazzoni; Philippe Roumagnac; François-Xavier Weill; Ian Goodhead; Richard Rance; Stephen Baker; Duncan J Maskell; John Wain; Christiane Dolecek; Mark Achtman; Gordon Dougan
Journal:  Nat Genet       Date:  2008-07-27       Impact factor: 38.330

9.  Evolution of MRSA during hospital transmission and intercontinental spread.

Authors:  Simon R Harris; Edward J Feil; Matthew T G Holden; Michael A Quail; Emma K Nickerson; Narisara Chantratita; Susana Gardete; Ana Tavares; Nick Day; Jodi A Lindsay; Jonathan D Edgeworth; Hermínia de Lencastre; Julian Parkhill; Sharon J Peacock; Stephen D Bentley
Journal:  Science       Date:  2010-01-22       Impact factor: 47.728

10.  Predicting Salmonella enterica serotypes by repetitive sequence-based PCR.

Authors:  Mark G Wise; Gregory R Siragusa; Jodie Plumblee; Mimi Healy; Paula J Cray; Bruce S Seal
Journal:  J Microbiol Methods       Date:  2008-09-14       Impact factor: 2.363

View more
  20 in total

Review 1.  Transforming bacterial disease surveillance and investigation using whole-genome sequence to probe the trace.

Authors:  Biao Kan; Haijian Zhou; Pengcheng Du; Wen Zhang; Xin Lu; Tian Qin; Jianguo Xu
Journal:  Front Med       Date:  2018-01-09       Impact factor: 4.592

2.  A Bioinformatic Pipeline for Improved Genome Analysis and Clustering of Isolates during Outbreaks of Legionnaires' Disease.

Authors:  Wolfgang Haas; Pascal Lapierre; Kimberlee A Musser
Journal:  J Clin Microbiol       Date:  2021-01-21       Impact factor: 5.948

3.  Nonsynonymous Polymorphism Counts in Bacterial Genomes: a Comparative Examination.

Authors:  Sara L Loo; Anna Ong; Wunna Kyaw; Loïc M Thibaut; Ruiting Lan; Mark M Tanaka
Journal:  Appl Environ Microbiol       Date:  2020-12-17       Impact factor: 4.792

4.  Utility of Combining Whole Genome Sequencing with Traditional Investigational Methods To Solve Foodborne Outbreaks of Salmonella Infections Associated with Chicken: A New Tool for Tackling This Challenging Food Vehicle.

Authors:  Samuel J Crowe; Alice Green; Kimberly Hernandez; Vi Peralta; Lyndsay Bottichio; Stephanie Defibaugh-Chavez; Aphrodite Douris; Laura Gieraltowski; Kelley Hise; Karen La-Pham; Karen P Neil; Mustafa Simmons; Glenn Tillman; Beth Tolar; Darlene Wagner; Jamie Wasilenko; Kristin Holt; Eija Trees; Matthew E Wise
Journal:  J Food Prot       Date:  2017-04       Impact factor: 2.077

5.  National outbreak of Salmonella Give linked to a local food manufacturer in Malta, October 2016.

Authors:  A Donachie; T Melillo; L Bubba; H Hartman; M-L Borg
Journal:  Epidemiol Infect       Date:  2018-06-26       Impact factor: 4.434

6.  Strain-Level Discrimination of Shiga Toxin-Producing Escherichia coli in Spinach Using Metagenomic Sequencing.

Authors:  Susan R Leonard; Mark K Mammel; David W Lacher; Christopher A Elkins
Journal:  PLoS One       Date:  2016-12-08       Impact factor: 3.240

7.  Early Recovery of Salmonella from Food Using a 6-Hour Non-selective Pre-enrichment and Reformulation of Tetrathionate Broth.

Authors:  Ninalynn Daquigan; Christopher J Grim; James R White; Darcy E Hanes; Karen G Jarvis
Journal:  Front Microbiol       Date:  2016-12-27       Impact factor: 5.640

Review 8.  A Review of Temperature, pH, and Other Factors that Influence the Survival of Salmonella in Mayonnaise and Other Raw Egg Products.

Authors:  Thilini Piushani Keerthirathne; Kirstin Ross; Howard Fallowfield; Harriet Whiley
Journal:  Pathogens       Date:  2016-11-18

9.  High-Throughput Tabular Data Processor - Platform independent graphical tool for processing large data sets.

Authors:  Piotr Madanecki; Magdalena Bałut; Patrick G Buckley; J Renata Ochocka; Rafał Bartoszewski; David K Crossman; Ludwine M Messiaen; Arkadiusz Piotrowski
Journal:  PLoS One       Date:  2018-02-12       Impact factor: 3.240

10.  A clade of Listeria monocytogenes serotype 4b variant strains linked to recent listeriosis outbreaks associated with produce from a defined geographic region in the US.

Authors:  Laurel S Burall; Christopher J Grim; Atin R Datta
Journal:  PLoS One       Date:  2017-05-02       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.