| Literature DB >> 23409020 |
Guojie Cao1, Jianghong Meng, Errol Strain, Robert Stones, James Pettengill, Shaohua Zhao, Patrick McDermott, Eric Brown, Marc Allard.
Abstract
Salmonella Newport has ranked in the top three Salmonella serotypes associated with foodborne outbreaks from 1995 to 2011 in the United States. In the current study, we selected 26 S. Newport strains isolated from diverse sources and geographic locations and then conducted 454 shotgun pyrosequencing procedures to obtain 16-24 × coverage of high quality draft genomes for each strain. Comparative genomic analysis of 28 S. Newport strains (including 2 reference genomes) and 15 outgroup genomes identified more than 140,000 informative SNPs. A resulting phylogenetic tree consisted of four sublineages and indicated that S. Newport had a clear geographic structure. Strains from Asia were divergent from those from the Americas. Our findings demonstrated that analysis using whole genome sequencing data resulted in a more accurate picture of phylogeny compared to that using single genes or small sets of genes. We selected loci around the mutS gene of S. Newport to differentiate distinct lineages, including those between invH and mutS genes at the 3' end of Salmonella Pathogenicity Island 1 (SPI-1), ste fimbrial operon, and Clustered, Regularly Interspaced, Short Palindromic Repeats (CRISPR) associated-proteins (cas). These genes in the outgroup genomes held high similarity with either S. Newport Lineage II or III at the same loci. S. Newport Lineages II and III have different evolutionary histories in this region and our data demonstrated genetic flow and homologous recombination events around mutS. The findings suggested that S. Newport Lineages II and III diverged early in the serotype evolution and have evolved largely independently. Moreover, we identified genes that could delineate sublineages within the phylogenetic tree and that could be used as potential biomarkers for trace-back investigations during outbreaks. Thus, whole genome sequencing data enabled us to better understand the genetic background of pathogenicity and evolutionary history of S. Newport and also provided additional markers for epidemiological response.Entities:
Mesh:
Year: 2013 PMID: 23409020 PMCID: PMC3569456 DOI: 10.1371/journal.pone.0055687
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of Salmonella Newport strains used in the study.
| ID | Tree Label | PFGE Pattern Number | Antimicrobial Resistance Profile | WGS Accession Number | Draft Genome Size (Mbp) | Number of Contigs |
| 180 | bison_TN_2004 | JJPX01.0218 | SUL | AHTJ00000000 | 4.71 | 95 |
| 181 | caprine_TN_2004 | JJPX01.0381 | SUL | AHTK00000000 | 4.75 | 72 |
| 182 | chicken_MO | JJPX01.0030 | NA | AHTL00000000 | 4.71 | 71 |
| 183 | ground_turkey_MD_2003 | JJPX01.0502 | NA | AHTM00000000 | 4.80 | 88 |
| 184 | equine_TN_2004_1 | JJPX01.0025 | SUL | AHTN00000000 | 4.71 | 66 |
| 185 | turkey_CO | NA | NA | AHTO00000000 | 4.74 | 64 |
| 186 | frog_Vietnam | JJPX01.3333 | NA | AHTP00000000 | 4.67 | 59 |
| 187 | fish_Hong_Kong | JJPX01.0327 | TET | AHTQ00000000 | 4.70 | 76 |
| 188 | fish_Vietnam | JJPX01.1947 | NA | AHTR00000000 | 4.67 | 53 |
| 189 | equine_TN_2004_2 | NA | SUL | AHTS00000000 | 4.96 | 72 |
| 190 | swine_TX | NA | NA | AHTT00000000 | 4.92 | 73 |
| 191 | cattle_NC_2003 | JJPX01.0042 | AMC,AMP,FOX,CHL,KAN,STR,SUL,TET,TIO | AHTU00000000 | 4.90 | 72 |
| 192 | chicken_GA | JJPX01.0238 | NA | AHTV00000000 | 4.93 | 70 |
| 193 | cattle_AZ_2003 | JJPX01.0014 | AMC,AMP,FOX,CHL,STR,SUL,TET,TIO | AHTW00000000 | 4.93 | 69 |
| 194 | canine_AZ_2003 | JJPX01.0014 | AMC,AMP,FOX,CHL,STR,SUL,TET,TIO | AHTX00000000 | 5.02 | 384 |
| 195 | ground_turkey_NM_2008 | JJPX01.0238 | TET | AHTY00000000 | 4.93 | 85 |
| 196 | ground_beef_GA_2004 | JJPX01.0042 | AMC,AMP,FOX,AXO,CHL,STR,SUL,TET,TIO | AHTZ00000000 | 4.89 | 77 |
| 197 | swine_IL_2001 | NA | AMC,AMP,FOX,CHL,GEN,KAN,STR,SUL,TET,TIO | AHUA00000000 | 4.69 | 44 |
| 198 | shrimp_India | NA | NA | AHUB00000000 | 4.81 | 70 |
| 199 | spinach_CO_2008 | JJPX01.0538 | NA | AHUC00000000 | 4.80 | 49 |
| 200 | cheese_Mexico | JJPX01.0372 | NA | AHUD00000000 | 4.65 | 74 |
| 201 | squid_Vietnam | NA | NA | AHUE00000000 | 4.73 | 84 |
| 202 | pepper_Vietnam | NA | NA | AHUF00000000 | 4.65 | 70 |
| 203 | pig_ear_CA | NA | NA | AHUG00000000 | 4.73 | 62 |
| 117 | farm_1_VA_2007 | NA | NA | AJMN00000000 | 4.81 | 91 |
| 118 | farm_15_VA_2007 | NA | NA | AJMO00000000 | 4.81 | 75 |
| NA |
| NA | AMP, CHL, GEN, STR,AXO,SUL,TET | ABEN01000000 | 4.83 | 0 |
| NA |
| NA | NA | ABEW00000000 | 4.95 | 63 |
AMC = Amoxicillin/Clavulanic Acid, AMP = Ampicillin, FOX = Cefoxitin, AXO = Ceftriaxone, CHL = Chloramphenicol, GEN = Gentamicin, KAN = Kanamycin, STR = Streptomycin, SUL = Sulfamethoxazole or Sulfisoxazole, TET = Tetracycline, TIO = Ceftiofur.
These two samples were received from Eastern Shore of Virginia in 2007. Isolates may have been collected earlier than 2007.
Figure 1Parsimony phylogenetic tree of S. Newport and outgroup genomes.
This phylogenetic tree was reconstructed by TNT [38] with 100,000 iterations based on 147,780 genome wide SNPs. All S. Newport strains were grouped into two major clusters, S. Newport Lineages II and III. Lineage II was further grouped into three sublineages, IIA, IIB and IIC. S. Newport displayed clear geographic structure. Asian strains were grouped together and divergent from ones from Americas. At the locus between invH and mutS genes, Lineage II and all outgroup genomes shared Gene Cluster 1; however, Lineage III strains shared Gene Cluster 2. Gene Cluster 3 was only found in strain from fish_Hong_Kong at the 3′ end of Gene Cluster 1. GC1 = Gene Cluster 1; GC2 = Gene Cluster 2; GC3 = Gene Cluster 3. Additionally, Node M includes most MDR strains in the current study.
Figure 2MLST analysis of S. Newport and outgroup genomes.
Seven housekeeping genes were selected and MLST dendrogram was performed by TNT [38] with 100,000 iterations. S. Newport was divided into two major clusters, which were separated by outgroup genomes. Lineage II was divided into three sublineages, which display minor differences compared with the parsimony tree. Sublineage IIA showed closer relatedness with IIC than IIB.
Average pairwise distance (no. of nucleotide difference) for the major groups shown in Figure 1.
| sublineage III | Hadar | Typhimurium | sublineage IIC | sublineage IIA | sublineage IIB | Dublin | Gallinarum | |
| sublineage III | ||||||||
| Hadar | 34418 (93) | |||||||
| Typhimurium | 36094 (90) | 35900 (105) | ||||||
| sublineage IIC | 35048 (128) | 37133 (147) | 38640 (144) | |||||
| sublineage IIA | 35627 (108) | 37320 (152) | 38893 (154) | 17497 (95) | ||||
| sublineage IIB | 36812 (106) | 38529 (122) | 38752 (131) | 15605 (91) | 25768 (85) | |||
| Dublin | 39879 (118) | 40575 (133) | 39275 (154) | 40314 (175) | 40878 (130) | 41749 (136) | ||
| Gallinarum | 43027 (100) | 43758 (159) | 42824 (151) | 43453 (158) | 42666 (133) | 44822 (124) | 22070 (144) | |
| Kentucky | 49260 (106) | 49409 (80) | 48612 (90) | 50194 (96) | 50694 (128) | 48236 (146) | 50955 (98) | 53464 (96) |
The valve refers to number of SNPs differences (standard deviation) between different selected groups and strains. The numbers of base differences per sequence from averaging over all sequence pairs between groups were shown.
Characteristics of genes/open reading frames (ORFs) between invH and mutS genes in Gene Cluster 1 of S. Newport SL254 and Gene Cluster 2 of strain from chicken_MO.
| ORF | Gene Name | Size (bps) | GC% | Best Blastp Hit | Super Family | ||||
| Description | Source | E Value | Locus Tag | ||||||
|
| |||||||||
| A3107 | 282 | 49.3 | Putative ABC-type transport system |
| 2e-45 | NP_457295.1 | DUF1778 | ||
| A3108 | 528 | 47.2 | Acetyltransferase, gnat family |
| 7e-99 | NP_457296.1 | NA | ||
| A3109 |
| 438 | 55.9 | Transposase |
| 1e-81 | AAV66983.1 | NA | |
| A3110 |
| 657 | 41.7 | Serine/threonine-specific proteinphosphatase 2 |
| 1e-125 | AAL21787.1 | MPP | |
| A3111 | 495 | 48.7 | Membrane protein |
| 7e-92 | ACH74700.1 | NA | ||
| A3112 | 669 | 54.1 | Hypothetical protein |
| 1e-125 | EDZ12689.1 | NA | ||
|
| |||||||||
| 11075 |
| 738 | 52.3 | Transposase InsF for insertionsequence IS3A/B/C/D/E/fA |
| 0 | EDX51569.1 | rve | |
| 11080 |
| 171 | 55 | Transposase InsF for insertionsequence IS3A/B/C/D/E/fA |
| 0 | EDX51569.1 | rve | |
| 11085 | 402 | 50.5 | ISPsy11, transposase OrfA |
| 2e-92 | EDX52090.1 | NA | ||
| 11090 |
| 684 | 50.1 | Integrase, catalytic region(ISPsy11, transposase OrfB) |
| 2e-170 | EDX51974.1 | rve | |
| 11095 | 258 | 47.3 | ISEhe3 OrfA |
| 9e-57 | EDX52144.1 | HTH_Hin | ||
| 11100 |
| 90 | 46.7 | Hypothetical protein |
| 6e-21 | ACH75076.1 | NA | |
Differences between Gene Cluster 1 and 2 demonstrated the mosaic genomic structure around mutS gene. Transposase and integrase were found in both sequences, indicating that both of them could be the hot spots for recombination events. The genes in both S. Newport SL254 and strain from chicken_MO are ordered top to bottom as their synteny on bacterial chromosome from 5′ to 3′.
Characteristics of genes/open reading frames (ORFs) between relA and mazG genes of S. Newport SL254 and SL317.
| ORF | Gene Name | Size (bps) | GC% | Best Blastp Hit | Super Family | |||
| Description | Source | E Value | Locus Tag | |||||
|
| ||||||||
| A3171 |
| 588 | 49.3 | putative fimbrial subunit |
| 3e-136 | ACF63661.1 | Fimbrial |
| A3172 |
| 2646 | 55.5 | fimbrial usher protein |
| 0 | ACF64468.1 | PRK15223 |
| A3173 |
| 774 | 55 | chaperone protein PapD |
| 0 | ACF62389.1 | Pili_assembly |
| A3174 |
| 507 | 56.6 | fimbrial subunit |
| 2e-118 | ACF63171.1 | Fimbrial |
| A3175 |
| 471 | 50.7 | fimbrial subunit |
| 1e-110 | ACF62527.1 | Fimbrial |
| A3176 |
| 537 | 52.3 | fimbrial subunit |
| 1e-128 | ACF62131.1 | Fimbrial |
|
| ||||||||
| A4073 | 288 | 48.3 | putative addiction moduleantidote protein |
| 2e-47 | NP_457351.1 | RHH_2 | |
| A4074 | 297 | 38.7 | plasmid stabilizationsystem protein, RelE/ParE family |
| 2e-51 | ZP_02697812.2 | Plasmid_stabil | |
We listed the detailed information of genes between relA and mazG genes. S. Newport SL254 and SL317 were selected. Our data indicated the genomic diversity of this region between Lineages II and III. Interestingly, ORF SNSL254_A3176 and SNSL317_A4073 were found adjoining together in S. Typhi CT18. The existence of ste fimbrial operon might enable Lineage II strains to infect variable hosts. The genes in both S. Newport SL254 and SL317 are ordered top to bottom as their synteny on bacterial chromosome from 5′ to 3′.
Figure 3Parsimony phylogenetic tree for cas genes.
We constructed this parsimony tree with 100,000 iterations by TNT [38] based on concatenated sequences of the cas genes. This dendrogram indicated that cas genes of Lineages II and III were originated from distinct sources.
Figure 4Pulsed Field Gel Electrophoresis (PFGE) profile digested with XbaI.
We performed PFGE analysis of 24 S. Newport strains (without two environmental farm isolates) isolated from diverse sources and geographic locations. PFGE profiles divided these strains into two major clusters with different groupings compared with the phylogenetic tree based on whole genome wide SNPs.