| Literature DB >> 33820940 |
Marie A Chattaway1, Gemma C Langridge2, John Wain2,3.
Abstract
Salmonella enterica nomenclature has evolved over the past one hundred years into a highly sophisticated naming convention based on the recognition of antigens by specific antibodies. This serotyping scheme has led to the definition of over 2500 serovars which are well understood, have standing in nomenclature and, for the majority, biological relevance. Therefore, it is highly desirable for any change in naming convention to maintain backwards compatibility with the information linked to these serovars. The routine use of whole genome sequencing and the well-established link between sequence types and serovars presents an opportunity to update the scheme by incorporating the phylogenetically relevant sequence data whilst preserving the best of serotyping nomenclature. Advantages include: overcoming the variability in antibody preparations; removing the need to use laboratory animals and implementing a truly universal system. However, the issue of trying to reproduce the phenotyping gold standard needs to be relaxed if we are to fully embrace the genomic era. We have used whole genome sequence data from over 46,000 isolates of Salmonella enterica subspecies enterica to define clusters in two stages: Multi Locus Sequence Typing followed by antigen prediction. Sequence type-serotype discrepancies were resolved using core SNP clustering to determine the phylogenetic groups and this was confirmed by overlaying the antigenic prediction onto the core SNP clusters and testing the separation of clusters using cgMLST Hierarchical Clustering. This allowed us to define any major antigenic clusters within an ST-here called the MAC type and written as ST-serovar. Using this method, 99.96% of Salmonella isolates reported in the UK were assigned a MAC type and linked to a serovar name taken from the Kauffmann and White scheme. We propose a change for reporting of Salmonella enterica sub-types using the ST followed by serovar.Entities:
Year: 2021 PMID: 33820940 PMCID: PMC8021552 DOI: 10.1038/s41598-021-86243-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Proposed reporting guidelines for Salmonella using genome sequence data to define the major antigenic clusters (MACs).
| ST | Serotypinga | Differential antigenb | Phylogeny grouping | No. Isolates | % | Proposed MAC type/namec | Main hierarchical clustering level to differentiate | Figure |
|---|---|---|---|---|---|---|---|---|
| 20 | Brandenburg | H: l,v | Distinct | 24 | 0.05 | ST20-Brandenburg | HC400_11346 | Figure |
| 20 | Sandiego | H1: e,h | Distinct | 26 | 0.06 | ST20-Sandiego | HC400_2255 | Figure |
| 1985 | Bahati or Durham | O:22 or O:23 | Mixed | 66 | 0.14 | ST1985-Durham | N/A—interspersed | Figure |
| 49 | Saintpaul | H: e,h | Distinct | 75 | 0.16 | ST49-Saintpaul | HC200_8 | Figure |
| 49 | Haifa | H: z10 | Distinct | 130 | 0.28 | ST49-Haifa | HC200_1433 | Figure |
| 582 | Kottbus | H: e,h | Distinct | 42 | 0.09 | ST582-Kottbus | HC900_7054 | Figure |
| 582 | Chailey | H: z23 | Distinct | 6 | 0.01 | ST582-Chailey | HC900_17 | Figure |
| 22 | Braenderup | H2: e,n, z15 | Distinct | 607 | 1.31 | ST22-Braenderup | HC100_185 | Figure |
| 22 | Larochelle | H2: 1,2 | Distinct | 19 | 0.04 | ST22-Larochelle | HC100_1136, 25669, 2664, 28707 | Figure |
| 241 | Bredeney | H: l,v | Distinct | 45 | 0.10 | ST241-Bredeney | HC200_2494, HC200_1335 | Figure |
| 241 | Schwarzengrund | H: d | Distinct | 12 | 0.03 | ST241-Schwarzengrund | HC200_17392 | Figure |
| 897 | Bredeney | H2: 1,7 | Distinct | 8 | 0.02 | ST897-Bredeney | HC400_31544 | Figures |
| 897 | Kimuenza | H2: e,n,x | Distinct | 9 | 0.02 | ST897-Kimuenza | HC400_24937 | Figure |
| 48 | Panama | H: l,v | Distinct | 197 | 0.43 | ST48-Panama | HC400_369 | Figure |
| 48 | Miami | H: a | Distinct | 4 | 0.01 | ST48-Miami | HC400_2307, HC400_67476 | Figure |
| 2019 | Napoli or Zaiman | H:z13 or H:v | Mixed | 21 | 0.05 | ST2019-Napoli | N/A—interspersed | Figure |
| 226 | Carrau | O:6,14 | Distinct | 4 | 0.01 | ST226-Carrau | HC400_363 | Figure |
| 226 | Gatow | O:6,7 | Distinct | 3 | 0.01 | ST226-Gatow | HC400_59526 | Figure |
| 684 | Uganda or Sinstorf | H:l,v,or H: l,z13 | Mixed | 75 | 0.16 | ST684-Uganda | N/A—interspersed | Figure |
| 909 | Bareilly | H: 1,5 | Distinct | 299 | 0.65 | ST909-Bareilly | HC200_899 | Figure |
| 909 | Richmond | H: 1,2 | Distinct | 86 | 0.19 | ST909-Richmond | HC200_101 | Figure |
| 2256 | Brunei or tananarive | O: 8, 20 or O: 6, 8 | Mixed | 7 | 0.02 | ST2256-Brunei | N/A—interspersed | Figure |
| 101 | Bochum | H: r | Distinct | 4 | 0.01 | ST101-Bochum | HC900_491 | Figure |
| 101 | Wien | H: b | Distinct | 5 | 0.01 | ST101-Wien | HC900_95 | Figure |
aNot used in MAC typing but presented for comparison and explanation.
bAny serotype prediction programme can be used though some antigens cannot be distinguished.
cFigure shows core SNP cluster for each antigenic type.
dHierarchical Clustering Level at which the serovars can be differentiated genetically.
No. = number of MAC type, % of the 46,268 Salmonella reported.
Figure 1(A) Phylogenetic analysis of MAC types Brandenburg and Sandiego (ST20). Representative strains were serotyped and serotype result is highlighted in blue (Sandiego) or red (Brandenburg). (B) Phylogenetic analysis of MAC types Bahati and Durham (ST1985). Representative strains were serotyped and serotype result is highlighted in blue (Bahati) or red (Durham).
Number of Novel STs identified each year at PHE.
| Year | No. novel ST | Total Salmonella reported | % of novel ST to Salmonella reported |
|---|---|---|---|
| 2014* | 18 | 6406 | 0.3* |
| 2015 | 165 | 8374 | 2.0 |
| 2016 | 217 | 8930 | 2.4 |
| 2017 | 180 | 9381 | 1.9 |
| 2018 | 180 | 10,143 | 1.8 |
| 2019 (Jan–June) | 67 | 3034 | 2.2 |
| Total | 827 | 46,268 | 1.8 |
*Automated Novel ST pipeline implemented in 2015 so Novel STs underrepresented in 2014 in this analysis.