| Literature DB >> 34686701 |
Lijun Hu1, Guojie Cao1, Eric W Brown1, Marc W Allard1, Li M Ma2, Guodong Zhang3.
Abstract
Rapid and sensitive detection of Salmonella is a critical step in routine food quality control, outbreak investigation, and food recalls. Although various genes have been the targets in the design of rapid molecular detection methods for Salmonella, there is limited information on the diversity of these target genes at the level of DNA sequence and the encoded protein structures. In this study, we investigated the diversity of ten target genes (invA, fimA, phoP, spvC, and agfA; ttrRSBCA operon including 5 genes) commonly used in the detection and identification of Salmonella. To this end, we performed whole genome sequencing of 143 isolates of Salmonella serotypes (Enteritidis, Typhimurium, and Heidelberg) obtained from poultry (eggs and chicken). Phylogenetic analysis showed that Salmonella ser. Typhimurium was more diverse than either Enteritidis or Heidelberg. Forty-five non-synonymous mutations were identified in the target genes from the 143 isolates, with the two most common mutations as T ↔ C (15 times) and A ↔ G (13 times). The gene spvC was primarily present in Salmonella ser. Enteritidis isolates and absent from Heidelberg isolates, whereas ttrR was more conserved (0 non-synonymous mutations) than ttrS, ttrB, ttrC, and ttrA (7, 2, 2, and 7 non-synonymous mutations, respectively). Notably, we found one non-synonymous mutation (fimA-Mut.6) across all Salmonella ser. Enteritidis and Salmonella ser. Heidelberg, C → T (496 nt postion), resulting in the change at AA 166 position, Glutamine (Q) → Stop condon (TAG), suggesting that the fimA gene has questionable sites as a target for detection. Using Phyre2 and SWISS-MODEL software, we predicted the structures of the proteins encoded by some of the target genes, illustrating the positions of these non-synonymous mutations that mainly located on the α-helix and β-sheet which are key elements for maintaining the conformation of proteins. These results will facilitate the development of sensitive molecular detection methods for Salmonella.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34686701 PMCID: PMC8536731 DOI: 10.1038/s41598-021-00224-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic tree of 64 Salmonella ser. Enteritidis isolates. Using core SNPs determined by FDA CFSAN SNP pipeline, the tree was constructed using the GTR-CAI model of FASTree 2. The analysis involved 552–565 total SNP positions. Clades and subclades are indicated in colors (A, blue; and B, red) with numbers of total isolates in each clade or subclade in parentheses. Isolates sourced from chicken samples were highlighted under red line, the rest were from egg samples.
Figure 2Phylogenetic tree of 40 Salmonella ser. Typhimurium isolates. Using core SNPs determined by FDA CFSAN SNP pipeline, the tree was constructed under the GTR-CAI model of FASTree 2. The analysis involved 673–1141 total SNP positions. Clades and subclades are indicated in colors (A, blue; and B, red) with numbers of total isolates in each clade or subclade in parentheses. Isolates sourced from eggs and clustered in subclade B2 are highlighted in green, the rest were from chicken samples.
Figure 3Phylogenetic tree of 39 Salmonella ser. Heidelberg isolates. Using core SNPs determined by FDA CFSAN SNP pipeline, the tree was constructed under the GTR-CAI model of FASTree 2. The analysis involved 19–95 total SNP positions. Clades and subclades are indicated in colors (A, blue; B, red; and C, green) with numbers of total isolates in each clade or subclade in parentheses. Isolates sourced from chicken and clustered in subclade B2 are highlighted in red, the rest were from egg samples.
Non-synonymous mutations observed in each target gene from all studied Salmonella isolates.
| Genes | Non-synonymous mutation | Nucleotide (position) | NT change | Protein (position) | AA change | Associated isolates |
|---|---|---|---|---|---|---|
| 770 | C → G | 257 | T → S | All isolates | ||
| 771 | G → C | 257 | T → S | All isolates | ||
| 898 | G → T | 300 | A → S | All isolates | ||
| 1012 | A → T | 338 | T → S | All isolates | ||
| 1125 | T → G | 375 | D → E | All isolates | ||
| 1198 | G → C | 400 | V → L | All isolates | ||
| 1201 | A → G | 401 | S → D | All isolates | ||
| 1202 | G → A | 401 | S → D | All isolates | ||
| 1301 | C → T | 434 | A → V | All isolates | ||
| 1302 | T → G | 434 | A → V | All | ||
| 1325 | C → T | 442 | T → I | All isolates | ||
| 1327 | T → C | 443 | Y → H | All | ||
| 1332 | T → G | 444 | H → Q | All isolates | ||
| 986 | G → A | 329 | S → N | All | ||
| 987 | C → T | 329 | S → N | All | ||
| 1117 | T → G | 373 | C → G | All | ||
| 1450 | G → T | 484 | A → S | |||
| 1465 | A → G | 489 | I → V | All | ||
| 1486 | A → C | 496 | N → H | All | ||
| 1502 | T → C | 501 | V → A | All | ||
| 389 | T → C | 130 | I → T | |||
| 650 | C → A | 217 | P → Q | |||
| 560 | G → A | 187 | R → H | 12 | ||
| 740 | G → T | 247 | C → F | All | ||
| 488 | T → C | 163 | V → A | All | ||
| 634 | C → T | 212 | R → C | 41 | ||
| 1024 | C → G | 342 | Q → E | All | ||
| 2057 | A → G | 686 | H → R | All | ||
| 2066 | C → T | 689 | A → V | All | ||
| 2123 | A → G | 708 | H → R | All | ||
| 2170 | A → G | 724 | T → A | All | ||
| 19 | T → C | 7 | S → P | All | ||
| 86 | C → A | 29 | P → H | 24 | ||
| 127 | A → G | 43 | I → V | All | ||
| 257 | A → G | 86 | D → G | |||
| 319 | C → T | 107 | P → S | |||
| 496 | C → T | 166 | Q → Stop Codon | All | ||
| 630 | A → C | 210 | Q → H | All | ||
| 709 | T → C | 237 | Y → H | All | ||
| 745 | T → C | 249 | Y → H | All | ||
| 751 | T → C | 251 | F → L | All | ||
| 794 | T → G | 265 | V → G | All | ||
| 851 | G → A | 284 | R → H | All | ||
| 857 | A → G | 286 | H → R | All | ||
| 268 | G → A | 90 | E → K | 25 |
a12 Salmonella ser. Typhimurium isolates: CFSAN040229, CFSAN036174, CFSAN035417, CFSAN027862, CFSAN017095, CFSAN017094, CFSAN017093, CFSAN015380, CFSAN015378, CFSAN015377, CFSAN014205, CFSAN013737.
b41 Salmonella ser. Enteritidis isolates: CFSAN057841, CFSAN035309, CFSAN035308,CFSAN035291, CFSAN035289, CFSAN035276, CFSAN035272, CFSAN034232, CFSAN034231, CFSAN034151, CFSAN033543, CFSAN033541, CFSAN032971, CFSAN032970, CFSAN032964, CFSAN032962, CFSAN032959, CFSAN032958, CFSAN030852, CFSAN030839, CFSAN030835, CFSAN030823, CFSAN030816, CFSAN030496, CFSAN030097, CFSAN030086, CFSAN030081, CFSAN030067, CFSAN030066, CFSAN028530, CFSAN027394, CFSAN027378, CFSAN027377, CFSAN025717, CFSAN025708, CFSAN025700, CFSAN024840, CFSAN024814, CFSAN017081, CFSAN002042, CFSAN058030.
c24 Salmonella ser. Typhimurium isolates: CFSAN041965, CFSAN041947, CFSAN041940, CFSAN041934, CFSAN041887, CFSAN041878, CFSAN041875, CFSAN041835, CFSAN041824, CFSAN041662, CFSAN036367, CFSAN036362, CFSAN036272, CFSAN036257, CFSAN036186, CFSAN036183, CFSAN036179, CFSAN036177, CFSAN036172, CFSAN035575, CFSAN035560, CFSAN035525, CFSAN029123, CFSAN029101.
d25 Salmonella ser. Enteritidis isolates: CFSAN035309, CFSAN035308, CFSAN035291, CFSAN035289, CFSAN035276, CFSAN035272, CFSAN033543, CFSAN033541, CFSAN032971, CFSAN032970, CFSAN032964, CFSAN032962, CFSAN030816, CFSAN030496, CFSAN030086, CFSAN030081, CFSAN030067, CFSAN030066, CFSAN028530, CFSAN027394, CFSAN027378, CFSAN025717, CFSAN025708, CFSAN024840, CFSAN024814.
Figure 4Phylogenetic trees of all sequenced Salmonella isolates based on nucleotide sequences of the target genes agfA (A), invA (B), and fimA (C). The trees were constructed by the neighbor jointing method using CLC Genomics Workbench. The scale indicates the sequence percentage of base distance (percent divergence).
Figure 5Phylogenetic trees of all sequenced Salmonella isolates based on the nucleotide sequences of the target genes phoP (A) and spvC (B). The trees were constructed by the neighbor jointing method using CLC Genomics Workbench. The scale indicates the sequence percentage of base distance (percent divergence).
Figure 6Phylogenetic trees of all sequenced Salmonella isolates based on the nucleotide sequences of the target genes ttrS (A), ttrB (B), ttrC (C), and ttrA (D). The trees were constructed by the neighbor jointing method using CLC Genomics Workbench. The scale indicates the sequence percentage of base distance (percent divergence).
Figure 7Predicted protein structures of the target genes invA (A), fimA (B), phoP (C), and spvC (D). Positions of amino acid mutations in each target protein are shown in the circled box and labeled in red in the predicted model.
Figure 8Predicted protein structures of the target genes ttrS (A), ttrB (B), ttrC (C), and ttrA (D). Positions of amino acid mutations in each target protein are shown in the circled box and labeled in red in the predicted model.