| Literature DB >> 35208824 |
Ruimin Gao1,2, Hongsheng Huang1, Jérémie Hamel3, Roger C Levesque4, Lawrence D Goodridge5, Dele Ogunremi1,5.
Abstract
We have developed a targeted, amplicon-based next-generation sequencing method to detect and analyze 227 virulence genes (VG) of Salmonella (AmpliSeqSalm_227VG) for assessing the pathogenicity potential of Salmonella. The procedure was developed using 80 reference genomes representing 75 epidemiologically-relevant serovars associated with human salmonellosis. We applied the AmpliSeqSalm_227VG assay to (a) 35 previously characterized field strains of Salmonella consisting of serovars commonly incriminated in foodborne illnesses and (b) 34 Salmonella strains with undisclosed serological or virulence attributes, and were able to divide Salmonella VGs into two groups: core VGs and variable VGs. The commonest serovars causing foodborne illnesses such as Enteritidis, Typhimurium, Heidelberg and Newport had a high number of VGs (217-227). In contrast, serovars of subspecies not commonly associated with human illnesses, such as houtenae, arizonae and salame, tended to have fewer VGs (177-195). Variable VGs were not only infrequent but, when present, displayed considerable sequence variation: safC, sseL, sseD, sseE, ssaK and stdB showed the highest variation and were linked to strain pathogenicity. In a chicken infection model, VGs belonging to rfb and sse operons showed differences and were linked with pathogenicity. The high-throughput, targeted NGS-based AmpliSeqSalm_227VG procedure provided previously unknown information about variation in select virulence genes that can now be applied to a much larger population of Salmonella for evaluating pathogenicity of various serovars of Salmonella and for risk assessment of foodborne salmonellosis.Entities:
Keywords: AmpliSeq; Salmonella; food safety; pathogenicity; presence/absence; serovar; variant; virulence genes
Year: 2022 PMID: 35208824 PMCID: PMC8879106 DOI: 10.3390/microorganisms10020369
Source DB: PubMed Journal: Microorganisms ISSN: 2076-2607
Salmonella reference genomes used in the design of AmpliSeqSalm_227VG panel and their distribution of 227 known virulence genes.
| ID | Subspecies/Serovar | a Strain | Size | b VGs (Max = 227) |
|---|---|---|---|---|
| CP006602.1 | 4,[5],12:i:- | 08-1736 | 4,822,189 | 227 |
| CP007532.1 | Abaetetuba | ATCC 35640 | 4,547,600 | 199 |
| CP007534.1 | Abony | 0014 | 4,737,447 | 225 |
| CP001138.1 | Agona | SL483 | 4,798,660 | 222 |
| CP019177.1 | Albany | ATCC 51960 | 4,805,448 | 216 |
| CP007531.1 | Anatum | ATCC BAA-1592 | 4,706,101 | 211 |
| CP019116.1 | Antsalova | S01-0511 | 4,648,086 | 196 |
| CP019403.1 | Apapa | SA20060561 | 4,801,658 | 206 |
| CP006693.1 |
| RKS2983 | 4,574,846 | 177 |
| CP006053.1 | Bareilly | CFSAN000189 | 4,730,612 | 210 |
| CP019405.1 | Bergen | ST350 | 4,801,835 | 207 |
| CP019406.1 | Blegdam | S-1824 | 4,693,979 | 222 |
| CP019407.1 | Borreze | SA20041063 | 4,777,558 | 223 |
| HF969015.2 | Bovismorbificans | 3114 | 4,680,283 | 216 |
| CP022490.1 | Braenderup | SA20026289 | 4,734,880 | 209 |
| CP007533.1 | Bredeney | CFSAN001080 | 4,603,849 | 211 |
| CP012833.1 | Cerro | CFSAN001588 | 4,651,400 | 200 |
| CP019178.1 | Chester | ATCC 11997 | 4,660,922 | 214 |
| AE017220.1 | Choleraesuis | SC-B67 | 4,755,700 | 210 |
| CP019408.1 | Crossness | 1422-74 | 4,847,468 | 227 |
| CP006055.1 | Cubana | CFSAN002050 | 4,977,480 | 209 |
| CP022494.1 | Derby | SA20035215 | 4,850,334 | 223 |
| CP019409.1 | Djakarta | S-1087 | 4,668,861 | 195 |
| CP001144.1 | Dublin | CT_02021853 | 4,842,908 | 222 |
| NC_011294.1 | Enteritidis | P125109 | 4,685,848 | 222 |
| NC_011274.1 | Gallinarum | 287/91 | 4,658,697 | 220 |
| CP024165.1 | Gaminara | CFSAN070644 | 4,801,841 | 194 |
| CP017719.1 | Hayindogo | CFSAN050752 | 4,765,719 | 210 |
| CP001120.1 | Heidelberg | SL476 | 4,888,768 | 225 |
| CP019410.1 | Hillingdon | N1529-D3 | 4,618,056 | 194 |
| CM001471.1 |
| ATCC BAA-1581 | 4,672,567 | 175 |
| CP022503.1 | Hvittingfoss | SA20014981 | 4,940,239 | 210 |
| CP022015.1 | India | SA20085604 | 5,395,280 | 192 |
| CP022450.1 | Indiana | D90 | 4,779,514 | 213 |
| LN649235.1 | Infantis | NA | 4,710,675 | 209 |
| CP019181.1 | Inverness | ATCC 10720 | 4,865,682 | 211 |
| LT571437.1 | Java | NCTC5706 | 4,756,780 | 221 |
| CP004027.1 | Javiana | CFSAN001992 | 4,634,161 | 207 |
| CP019411.1 | Johannesburg | ST203 | 4,651,794 | 196 |
| CP022500.1 | Kentucky | SA20030505 | 4,782,363 | 216 |
| CP019412.1 | Koessen | S-1501 | 4,566,169 | 208 |
| CP019413.1 | Krefeld | SA20030536 | 4,942,273 | 216 |
| CP022117.1 | Macclesfield | ST-1643 | 4,822,139 | 202 |
| CP019414.1 | Manchester | ST278 | 4,532,753 | 204 |
| CP022497.1 | Manhattan | SA20084699 | 4,732,484 | 216 |
| CP019183.1 | Mbandaka | ATCC 51958 | 4,905,181 | 198 |
| CP022489.1 | Mbandaka | SA20026234 | 4,796,292 | 207 |
| CP019184.1 | Minnesota | ATCC 49284 | 4,592,393 | 195 |
| CP007530.1 | Montevideo | 507440-20 | 4,694,375 | 195 |
| CP019415.1 | Moscow | S-1843 | 4,690,402 | 222 |
| CP019201.1 | Muenster | 420 | 4756014 | 203 |
| NC_011080.1 | Newport | SL254 | 4,827,641 | 217 |
| CP019404.1 | Newport | SA20113257 | 4,849,139 | 218 |
| CP019416.1 | Nitra | S-1687 | 4,691,807 | 222 |
| CP022034.1 | Onderstepoort | SA20060086 | 4,774,926 | 194 |
| CP022116.1 | Ouakam | SA20034636 | 4,874,915 | 212 |
| CP012346.1 | Panama | ATCC 7378 | 4,555,576 | 208 |
| CP000026.1 | Paratyphi A | ATCC 9150 | 4,585,229 | 210 |
| CP000886.1 | Paratyphi B | SPB7 | 4,858,887 | 227 |
| CP000857.1 | Paratyphi C | RKS4594 | 4,833,080 | 210 |
| CP019186.1 | Pomona | ATCC 10729 | 4,482,117 | 196 |
| CP019189.1 | Poona | ATCC BAA-1673 | 4,876,720 | 192 |
| CP012347.1 | Pullorum | ATCC 9120 | 4,694,842 | 221 |
| CP019192.1 | Rubislaw | ATCC 10717 | 4,572,929 | 198 |
| CP022491.1 | Saintpaul | SA20031783 | 4,775,303 | 226 |
| CP022139.1 |
| NA | 4,859,044 | 187 |
| CP001127.1 | Schwarzengrund | CVM19633 | 4,709,075 | 210 |
| LN868943.1 | Senftenberg | NCTC10384 | 3,746,274 | NA |
| CP012349.1 | Sloterdijk | ATCC 15791 | 4,817,791 | 218 |
| CP017723.1 | Stanleyville | SARB61 | 4,888,463 | 226 |
| CP007505.1 | Tennessee | TXSC_TXSC08-19 | 4,864,410 | 207 |
| CP006717.1 | Thompson | RM6836 | 4,707,648 | 210 |
| NC_003198.1 | Typhi | CT18 | 4,809,037 | 207 |
| HF937208.1 | Typhimurium | DT104 | 4,933,631 | 227 |
| NC_003197.2 | Typhimurium | LT2 | 4,857,450 | 227 |
| CP006048.1 | Typhimurium var. 5- | CFSAN001921 | 4,859,931 | 227 |
| CP019417.1 | Wandsworth | SA20092095 | 4,916,040 | 211 |
| CP022138.1 | Waycross | SA20041608 | 4,812,886 | 198 |
| LN890520.1 | Weltevreden | NA | 5,129,845 | 211 |
| CP019418.1 | Yovokome | S-1850 | 4,640,929 | 215 |
a NA: not available; b VGs = virulence genes.
Figure 1Workflow for the adaptation of AmpliSeq technology to develop and construct the AmpliSeqSalm_227VG panel. VGs: virulence genes.
Figure 2Distribution of 227 virulence genes in 80 reference Salmonella genomes used in designing the AmpliSeqSalm_227VG panel. Red and blue colors represent gene presence and absence, respectively. The x axis contains the list of 7680refs variable genes which are absent in at least one genome and the y axis shows the 80 Salmonella enterica reference genomes.
Results of AmpliSeq analysis performed on strains of Salmonella with known serovar designation.
| Serovars/Strains | Predicted No. | Detected No. | Mapped Reads | Total Reads | a On Target | Assigned | b Variant |
|---|---|---|---|---|---|---|---|
| Agona 07D669 3-5 | 222 | 224 | 331,304 | 337,261 | 0.98 | Agona | 1 |
| Enteritidis SE864 | 222 | 221 | 306,812 | 312,339 | 0.98 | Enteritidis | 8 |
| Enteritidis PT8 | 222 | 222 | 313,320 | 319,143 | 0.98 | Enteritidis | 7 |
| Gallinarum SA20103326 | 220 | 226 | 285,325 | 290,275 | 0.98 | Indiana | 28 |
| Heidelberg D1005841 | 225 | 224 | 298,529 | 305,742 | 0.98 | Heidelberg | 0 |
| Indiana 11SU006 5-3 | 213 | 217 | 330,510 | 335,680 | 0.98 | Indiana | 1 |
| Infantis 39 | 209 | 226 | 276,145 | 282,107 | 0.98 | Infantis | 4 |
| Javiana OLF16127-1 | 207 | 211 | 292,639 | 298,829 | 0.98 | Javiana | 47 |
| Kentucky D0827783 | 216 | 215 | 278,679 | 283,095 | 0.98 | Kentucky | 2 |
| Mbandaka 06D1004 8-11 | 207 | 208 | 257,337 | 264,129 | 0.97 | Mbandaka | 1 |
| Montevideo 08D107 18-3 | 195 | 196 | 310,953 | 317,627 | 0.98 | Montevideo | 277 |
| Newport S443 | 211 | 216 | 254,846 | 258,566 | 0.99 | Newport | 21 |
| Newport S447 | 218 | 219 | 301,846 | 308,710 | 0.98 | Newport | 23 |
| Nitra SA20103329 | 222 | 222 | 260,241 | 265,458 | 0.98 | Enteritidis | 390 |
| Paratyphi A SA20114389 | 210 | 214 | 313,073 | 318,553 | 0.98 | Paratyphi A | 5 |
| Pullorum ATCC19945 | 221 | 222 | 283,317 | 288,600 | 0.98 | Pullorum | 2 |
| S.Kentucky | 216 | 217 | 273,264 | 279,534 | 0.98 | Kentucky | 1 |
| S.Typhimurium 07-12345 | 227 | 226 | 248,137 | 252,351 | 0.98 | Typhimurium | 11 |
| S.Typhimurium07-12335 | 227 | 227 | 257,763 | 264,195 | 0.98 | Typhimurium | 11 |
| S.Typhimurium07-22495 | 227 | 227 | 333,463 | 342,427 | 0.97 | Typhimurium | 15 |
| S.Typhimurium07-22792 | 227 | 227 | 292,454 | 300,721 | 0.97 | Typhimurium | 5 |
| S.Typhimurium07-24355 | 227 | 227 | 344,674 | 355,565 | 0.97 | Typhimurium | 14 |
| S.Typhimurium07-29216 | 227 | 227 | 258,620 | 263,517 | 0.98 | Typhimurium | 14 |
| S.Typhimurium07-32529 | 227 | 226 | 263,450 | 271,411 | 0.97 | Typhimurium | 11 |
| S.Typhimurium07-35522 | 227 | 227 | 305,560 | 313,980 | 0.97 | Typhimurium | 11 |
| S.Typhimurium07-7666 | 227 | 226 | 289,621 | 298,397 | 0.97 | Typhimurium | 11 |
| S.Typhimurium07-7670 | 227 | 226 | 294,378 | 302,460 | 0.97 | Typhimurium | 11 |
| S.Worthington_43 | 209 | 209 | 303,102 | 314,934 | 0.96 | Mbandaka | 371 |
| S.Worthington_70 | 209 | 209 | 304,260 | 317,665 | 0.96 | Mbandaka | 372 |
| Schwarzengrund ADRI40 | 210 | 212 | 290,699 | 295,369 | 0.98 | Schwarzengrund | 3 |
| Senftenberg OLF13070 | 217 | 217 | 304,695 | 313,491 | 0.97 | Senftenberg | - |
| Tennesse D1019361 | 207 | 209 | 291,490 | 298,976 | 0.97 | Tennesse | 1 |
| Thompson 08D021 13-6 | 210 | 213 | 292,588 | 297,576 | 0.98 | Thompson | 0 |
| Typhimurium LT2 SA1930 | 227 | 227 | 278,222 | 284,938 | 0.98 | Typhimurium | 0 |
| Weltevreden OLF18006-1 | 211 | 213 | 280,421 | 286,778 | 0.98 | Weltevreden | 0 |
a On target: the proportion of sequenced amplicon reads mapping to the target genes in the reference genome. b Two rounds of reads mapping were performed for the amplicons of each strain. First round of mapping was an automated real time alignment of sequence reads to the preloaded 80 Salmonella reference genomes resulting in serovar identification from the highest reads match. The identified serovar is depicted as the “Assigned Serovar” in this table. Subsequently, using the “Assigned Serovar” genome as reference, the second round amplicon reads mapping was carried out to find the number of variants present between the tested strain (listed in column named “Serovars/strains”) and its respective reference genome. Except for the serovar Montevideo, high variant number was predictive of wrong serotype designation and could be used as a means of assessing suspicious results. For instance, during the 1st round mapping, sequence reads from Worthington, Nitra and Gallinarum incorrectly mapped to the serovars Mbandaka, Enteritidis and Indiana, respectively and showed relatively high to very high number of variants.
Figure 3Analysis metrics for Ion Torrent Personal Genome Machine (PGM) amplicon sequencing for virulence genes present in a pool of 23 Salmonella strains. (A) Heatmap showing an optimal Ion Sphere Particle (ISP) density loading of 81% on the surface of PGM 318 chip. Red and blue colors represented the highest and lowest loading density areas, respectively. (B) A total of 6.9 million reads was obtained from the sequencing run and 82% reads were clonal. (C) Read length distribution of this representative run, with mean, median, mode length of 247 bp, 265 bp and 298 bp, respectively.
Number of virulence gene variants present in each of 29 Salmonella enterica strains identified with the correct serovar designation as a result of automated, real time mapping of sequence reads to preloaded reference genomes by the Ion Torrent sequencer.
| Gene | No. | Gene | No. | Gene | No. | Gene | No. | Gene | No. | Gene | No. |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 26 |
| 5 |
| 2 |
| 2 |
| 1 |
| 1 |
|
| 13 |
| 5 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 11 |
| 4 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 10 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 10 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 10 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 9 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 9 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 9 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 9 |
| 3 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 9 |
| 2 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 6 |
| 2 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 5 |
| 2 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 26 |
| 5 |
| 2 |
| 2 |
| 1 |
| 1 |
|
| 13 |
| 5 |
| 2 |
| 1 |
| 1 |
| 1 |
|
| 11 |
| 4 |
| 2 |
| 1 |
| 1 |
| 1 |
Virulence genes involved in the four known steps of Salmonella pathogenesis.
| VG Type | No. | Gene Names |
|---|---|---|
| Attachment | 49 | |
| Invasion/intracellular survival | 73 | |
| Intramacrophage survival | 39 | |
| Dissemination | 66 | |
| Total | 227 |
Where virulence genes are also known by alternate names, these are indicated in parenthesis, namely: STM1056 (msgA), STM4315 (rtsA), and STM1410 (ssaX).
Figure 4Heatmap illustration showing presence/absence of variable virulence genes (VGs) of Salmonella. (A) The 35 OLF strains representing 23 serovars. (B) The 34 SalFos strains. The x axis contains the list of variable VGs out of 227 used to design primers for the AmpliSeqSalm_227VG tool. The y axis depicts the Salmonella strains and their identities. Red and blue colors represent presence and absence of a VG, respectively. The following serovars are highlighted because more than one strain was used in this study: Typhimurium ●, Enteritidis ▲, Worthington ▲, Kentucky ●. The remainder of the 227 VGs which were found in all strains (core VGs) are not shown in this illustration.
Validation of AmpliSeqSalm_227VG based on blind assessment of virulence and serovar designation for 34 Salmonella strains.
| Strain ID | SalFoS ID | Serovar | a VGs No. |
|---|---|---|---|
| UofLaval23 | S158 | NA | 196 |
| UofLaval4 | S785 | 11:b:e,n,x | 182 |
| UofLaval10 | S774 | 58:d:z6 | 183 |
| UofLaval20 | S361 | Adelaide | 199 |
| UofLaval32 | S1393 | Apapa | 214 |
| UofLaval7 | S603 | Bareilly | 216 |
| UofLaval16 | S333 | Berta | 223 |
| UofLaval29 | S256 | Bovismorbificans | 218 |
| UofLaval12 | S209 | Braenderup | 209 |
| UofLaval33 | S325 | Bredeney | 213 |
| UofLaval14 | S364 | Cerro | 200 |
| UofLaval30 | S1426 | Cubana | 222 |
| UofLaval5 | S229 | Derby | 218 |
| UofLaval31 | S718 | Derby | 223 |
| UofLaval9 | S354 | Gaminara | 198 |
| UofLaval8 | S219 | Hadar | 215 |
| UofLaval24 | S1603 | Havana | 208 |
| UofLaval1 | S1288 | Javiana | 206 |
| UofLaval17 | S551 | Johannesburg | 220 |
| UofLaval3 | S246 | Kentucky | 216 |
| UofLaval19 | S267 | Kiambu | 211 |
| UofLaval25 | S334 | Kottbus | 216 |
| UofLaval6 | S494 | Lockleaze | 192 |
| UofLaval22 | S238 | Mbandaka | 201 |
| UofLaval13 | S1395 | Mississippi | 207 |
| UofLaval2 | S1925 | Muenchen | 218 |
| UofLaval15 | S206 | Muenchen | 215 |
| UofLaval27 | S317 | Ohio | 203 |
| UofLaval28 | S307 | Poona | 197 |
| UofLaval21 | S212 | Stanley | 223 |
| UofLaval18 | S761 | Stanleyville | 218 |
| UofLaval26 | S193 | Thompson | 210 |
| UofLaval34 | S164 | Typhimurium | 226 |
| UofLaval11 | S277 | Uganda | 209 |
a VG = Virulence gene.
Figure 5Heatmap illustration of reads presence/absence for 72 variable virulence genes (variable69strains VGs) from a total of 69 Salmonella strains (35 OLF known and 34 SalFos strains). The x axis shows the 72 variable69strainsVGs while the y axis represents all the tested 69 Salmonella strains. Red and blue colors represent presence and absence of a VG, respectively. For annotation purpose: Typhimurium ●, Enteritidis ▲, Worthington ▲, Kentucky ●. The illustration is a pool of the data from Figure 4A,B to show the clustering of the Salmonella serovars based on analysis of VGs sequences.
Relative distribution of core and variable virulence genes and their participation in the different stages of pathogenesis in Salmonella enterica (n = 69 strains).
| Pathogenesis Stage | Virulence Genes | ||
|---|---|---|---|
| Core69strains | Variable69strains | Total | |
| Attachment | 25 | 24 | 49 |
| Invasion | 54 | 19 | 73 |
| Intramacrophage survival | 32 | 7 | 39 |
| Systemic dissemination | 44 | 22 | 66 |
| Total | 155 | 72 | 227 |
Pathogenicity of 10 Salmonella Typhimurium strains in a chicken infection model and number of variant virulence genes detected with the AmpliSeq tool using Number 8 strain ST07-22495 as index.
| ID/Group | Host | Antigens | Phage Typing | PFGE Patterns | Survival (7d) | Survival (>13d) | 1st Sign | b Variant Number |
|---|---|---|---|---|---|---|---|---|
| 1: ST07-7666 | Sparrow | 4,5:12 | 160 | XAI 0280/BNI 0071 | 50% | 40% | 3d | 10 |
| 2: ST07-7670 | Sparrow | 4,5:12 | 160 | XAI 0280/BNI 0071 | 90% | 70% | 3d | 10 |
| 3: ST07-12345 | Sparrow | 4,5:12 | 160 | XAI 0280/BNI 0071 | 100% | 90% | 3d | 10 |
| 4: ST07-32529 | Sparrow | 4,5:12 | 160 | XAI 0280/BNI 0296 | 60% | a ND | 2d | 10 |
| 5: ST07-12335 | Sparrow | 4,5:12 | 146 | XAI 0021/BNI 0096 | 70% | a ND | 2d | 10 |
| 6: ST07-35522 | Sparrow | 4,5:12 | 40 | XAI 0075/BNI 0297 | 60% | a ND | 2d | 9 |
| 7: ST07-22792 | Cormorant | 4,5:12 | 1 | XAI 0654/BNI 0295 | 100% | 100% | 13d | 17 |
| 8: ST07-22495 | Gull | 4,5:12 | 41 | XAI 0269/BNI 0081 | 60% | 40% | 2d | Index |
| 9: ST07-24355 | Gull | 4,5:12 | 125 | XAI 0269/BNI 0081 | 70% | 40% | 2d | 1 |
| 10: ST07-29216 | Chicken | 4,5:12 | 126 | XAI 0269/BNI 0081 | 100% | 40% | 3d | 1 |
a ND: not determined. b AmpliSeq tool results: variant number represents each instance of sequence variation among the virulence genes as detected following the mapping of the amplicon reads to the Salmonella Typhimurium reference genome.
Figure 6Survival rate plot of chickens inoculated with Salmonella Typhimurium strains and virulence gene variants detected among the strains, n = 10. (A) Survival rate plot of chicken inoculated with 10 S. Typhimurium strains. The sample name is labeled with ID: strain name, day post-infection when clinical signs were first observed, for instance, the number 1 sample was labeled as “1: ST07-7666, 3d”; the sample name was highlighted with the same color as its survival plot line color. The most virulent strain was ST07-22495 No. 8 strain (red color line with red short horizontal line highlighted) whereas the least virulent strain was No. 7 ST07-22792 strain (green color line with green square highlighted). (B) Detected variants among the 10 S. Typhimurium strains used for the infection experiment, using No. 8_ST07-22495 as index. For easy identity, the same color was used for strain names as well in panel (A). The variants causing non-synonymous amino acid changes are highlighted in red.