| Literature DB >> 36159272 |
Emeline Cherchame1, Guy Ilango1, Véronique Noël1, Sabrina Cadel-Six1.
Abstract
Salmonella is the most common cause of gastroenteritis in the world. Over the past 5 years, whole-genome analysis has led to the high-resolution characterization of clinical and foodborne Salmonella responsible for typhoid fever, foodborne illness or contamination of the agro-food chain. Whole-genome analyses are simplified by the availability of high-quality, complete genomes for mapping analysis and for calculating the pairwise distance between genomes, but unfortunately some difficulties may still remain. For some serovars, the complete genome is not available, or some serovars are polyphyletic and knowing the serovar alone is not sufficient for choosing the most appropriate reference genome. For these serovars, it is essential to identify the genetically closest complete genome to be able to carry out precise genome analyses. In this study, we explored the genomic proximity of 650 genomes of the 58 Salmonella enterica subsp. enterica serovars most frequently isolated in humans and from the food chain in the United States (US) and in Europe (EU), with a special focus on France. For each serovar, to take into account their genomic diversity, we included all the multilocus sequence type (MLST) profiles represented in EnteroBase with 10 or more genomes (on 19 July 2021). A phylogenetic analysis using both core- and pan-genome approaches was carried out to identify the genomic proximity of all the Salmonella studied and 20 polyphyletic serovars that have not yet been described in the literature. This study determined the genetic proximity between all 58 serovars studied and revealed polyphyletic serovars, their genomic lineages and MLST profiles. Finally, we enhanced the open-access databases with 73 new genomes and produced a list of high-quality complete reference genomes for 48 S. enterica subsp. enterica serovars among the most isolated in the US, EU, and France.Entities:
Keywords: MLST profile; Salmonella enterica; cgMLST; genomic proximity; pan-phylogenetic analysis; polyphyletic serovars; reference complete genomes
Mesh:
Year: 2022 PMID: 36159272 PMCID: PMC9493441 DOI: 10.3389/fpubh.2022.963188
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
List of the 58 Salmonella serovars identified as being the most frequently isolated in humans and the agri-food sector over a period of 10 years (from 2006 to 2016) in the United States, Europe and France.
| Agona | Derby | Johannesburgb | Muenster | Schwarzengrund |
| Albany | Dublin | Kedougouc | Napoli | Senftenberg |
| Anatum | Enteritidis | Kentucky | Newport | Stanleyc |
| Bananaa | Gallinarumc | Kottbus | Ohio | Tennessee |
| Bareillyb | Give | Livingstone | Oranienburg | Thompson |
| Bovismorbificans | Goldcoastc | London | Panama | Typhi |
| Braenderup | Hadar | Manhattan | Paratyphi B and Java | Typhimurium |
| Brandenburg | Havana | Mbandaka | Poona | Uganda |
| Bredeney | Heidelberg | Minnesota | Readingb | Venezianaa |
| Cerro | Indiana | Mississippib | Rissen | Virchow |
| Chester | Infantis | Montevideo | ||
| Coeln | Javianab | Muenchen | Saintpaul |
Forty-seven serovars were selected because they were common to the United States, Europe and France according to the Venn analyses in Figure 1. Eleven other serovars (visualized by gray fill color in the list) were added because they belong to the top 20 serovars from each country but were not common to all countries. a: serovar from to the top 20 in FR; b: serovars from the top 20 in the US; c: serovars from the top 20 in EU.
Figure 1Venn diagrams illustrating the intersection between the most frequently isolated serovars in the United States (US), Europe (EU) and France (FR) from human cases and from the agri-food sector. Venn analysis was carried out in two steps showed in (A,B). (A) Intersections between the top 25 serovars in human cases and the agri-food sector. The leading 25 human US, EU and FR serovars are included in the orange-, green- and red-outlined areas, respectively. The leading 25 agri-food sector US, EU and FR serovars are included in the yellow-, purple- and blue-outlined areas, respectively. The logical relation between the top 25 serovars in human cases and agri-food sector revealed 33 common serovars. The numbers within the intersections correspond to the common serovars. (B) Intersections between the leading 50 US, EU and FR agri-food sector serovars with the 33 serovars previously selected in (A). The top 50 US, EU and FR agri-food sector serovars are included in the green-, blue- and red-shaded areas, respectively. The previous 33 selected serovars in (A) are included in the purple-shaded area. The logical relation between these four groups revealed 14 other common serovars. The new 14 common serovars are surrounded by black circles. The numbers within the intersections correspond to common serovars. The resulting 47 common serovars obtained by Venn analyses are showed in the Supplementary Figure 1.
Figure 2Phylogenetic cgMLST distance tree of the 650 genome set of Salmonella. The tree is rooted on the Salmonella entrerica subsp. arizonae, diarizonae, houtenae, indica and salamae genomes. For better visualization of the groups, the tree is shown without branch lengths. Branches are colored to distinguish the four groups. Due to the high number of strains in the tree, strain labels are not shown and serovars are indicated along corresponding branches. Strongly supported subgroups are shaded in different colors (for the description of these subgroups see also Figure 3 and Table 3).
Figure 3Phylogenetic pan-genome kmer distance tree of the 219 genome subset of Salmonella. The tree is rooted on the Salmonella entrerica subsp. arizonae, diarizonae, houtenae, indica and salamae genomes. The tree is shown with branch lengths. Polyphyletic serovars are shaded in different colors. Complete genomes are indicated with circles and contigs with triangles. Labels contain the serovar and the MLST profile of each strain. The strongly supported subgroups listed in the Table 3 are highlighted in the margin of the tree. For example, A1 corresponds to the subgroup called “section Livingstone” in Table 3.
Polyphyletic and monophyletic serovars studied.
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|
|
| ST7024 | ST4745 | ST683, 5,220 | ST1035 | Brandenburg | ST20, 65, 249, 873, 2,577 |
|
| ST5146 | ST464, 1,612, 2,129, 2,270 | ST203, 362, 909, 2,553 | Coeln | ST1995, 2,015 | |
|
| ST682 | ST39, 40 | ST71, 72 | Dublin | ST10, 73, 4,406 | |
|
| ST1669 | ST582 | ST212, 808 | Enteritidis | ST11, 183 | |
|
| ST5, 118, 350 | ST156, 166 | ST31, 45, 46, 132, 157, 614 | Gallinarum | ST78, 331 | |
|
| ST23, 169, 174, 1,515, 1,675, 3,613 | ST179, 1,392, 1,512 | ST864 | Goldcoast | ST358 | |
|
| ST1628 | ST412 | ST93 | Hadar | ST33, 473 | |
|
| ST95, 2118 | ST49 | ST27, 50, 680, 1,934, 3,602 | Havana | ST578, 588, 872, 1,237, 1,524, 4,040, 7,676 | |
|
| ST142, 377, 1,499 | ST150 | Heidelberg | ST15 | ||
|
| ST214, 306, 897 | ST505 | Indiana | ST17, 2,040 | ||
|
| ST367, 1,593, 2,407 | ST1291 | Javiana | ST24, 371, 437, 589, 1,547, 1,674, 2,500 | ||
|
| ST411, 1,954 | ST343, 2,063 | Johannesburg | ST471, 515 | ||
|
| ST516, 524 | ST654 | Kedougou | ST1543 | ||
|
| ST32, 603, 2,146, 2,283 | ST79 | London | ST155 | ||
|
| ST152, 314, 2,132 | ST198 | Manhattan | ST18, 44, 2,200 | ||
|
| ST543, 1,941, 2,247 | ST457, 638 | Minnesota | ST548 | ||
|
| ST3016 | ST413, 1,602, 2,141 | Montevideo | ST4, 81, 138, 195, 316, 699, 2,269 | ||
|
| ST448, 5,834 | ST425 | Muenster | ST321, 2,692 | ||
|
| ST83 | ST82, 112, 1,606, 2,769 | Napoli | ST474, 1,637, 2,019, 2,008, 2,095, 5,168 | ||
|
| ST28 | ST43, 86, 88, 110, 127, 149, 307 | Ohio | ST329, 2,029 | ||
|
| ST2794 | ST469 | Panama | ST48 | ||
|
| ST2250 | ST96, 322 | Poona | ST308, 447, 964, 1,069, 2,889 | ||
|
| ST14, 210 | ST185, 217 | Stanley | ST29, 2,045 | ||
|
| ST26 | ST2125, 2,417 | Tennessee | ST319, 1,565 | ||
|
| ST16, 181, 303, 359 | ST197, 1,750 | Typhi | ST1, 2 | ||
| Agona | ST13 | Typhimuriumd | ST19, 34, 36 | |||
| Albany | ST292 | Uganda | ST684 | |||
| Anatum | ST64 | Veneziana | ST2207 | |||
| Braenderup | ST22, 311 | |||||
The polyphyletic and monophyletic serovars identified in the pan-genome phylogenetic analysis (kmer approach) are indicated with the corresponding MLST profiles. The polyphyletic serovars are indicated in bold. a: The difference between kmer and cgMLST phylogenetic approaches involves the Saintpaul MLST profile ST1934, which in the cgMLST tree is associated with ST49; b: the MLST profiles ST28, 43, 86, 88, 110, 127, 149 and 307 comprise the genomes of serovars Paratyphi B and Java; c: the MLST profiles ST14 and 185 comprise the genomes of serovars Senftenberg and S. 1,3,19:z27:-; d: the MLST profiles ST19 and ST34 comprise the genomes of serovar Typimurium and S. 4,[5],12:i:-.
Identified subgroups.
|
| |
|---|---|
|
|
|
| Livingstone A1 | Livingstone ST543, 1,941, 2,247 |
| Ohio ST329, 2,029 | |
| Typhimurium A4 | Saintpaul ST27, 49, 50, 680, 1,934, 3,602 |
| Coeln ST1995, 2,015 | |
| Typhimurium ST19, 34, 36 | |
| Reading ST1628 | |
| Heidelberg ST15 | |
| Enteritidis A5 | Enteritidis ST11, 183 |
| Gallinarum ST78, 331 | |
| Dublin ST10, 73, 4,406 | |
| Berta ST435 | |
| Muenchen A10 | Muenchen ST82, 83, 112, 1,606, 2,769 |
| Manhattan ST18, 44, 2,200 | |
| Newport A11 | Newport ST31,45,46,132, 157,614 |
| Kottbus ST212, 808 | |
| Montevideo B9 | Oranienburg ST179, ST864, 1,392, 1,512 |
| Montevideo ST4, 81, 138, 195, 316, 2,269 | |
| Bredeney B14 | Schwarzengrund ST96, 322 |
| Give ST654 | |
| Bredeney ST241, 306, 897, 505 | |
| Indiana C | Banana ST7024 |
| Bareilly ST5146 | |
| Indiana ST17, 2,040 | |
| Cerro ST367, 1,593, 2,407 | |
| Typhi D | Typhi ST1, 2 |
| Mississippi ST448, 5,834 | |
| Veneziana ST2207 | |
| Napoli ST474, 1,637, 2,008, 2,019, 2,095, 5,168 | |
The subgroups called “section Livingstone, Typhimurium, Enteritidis, Muenchen and Newport” belong to Group A. The subgroup referred to as “sections Montevideo or Bredeney” belong to Group B. The subgroup “section Indiana” belongs to Group C and “section Typhy” to Group D.