| Literature DB >> 32152449 |
Sangeeta Banerji1, Sandra Simon1, Andreas Tille2, Angelika Fruth1, Antje Flieger3.
Abstract
Salmonella enterica is the second most reported bacterial cause of food-borne infections in Europe. Therefore molecular surveillance activities based on pathogen subtyping are an important measure of controlling Salmonellosis by public health agencies. In Germany, at the federal level, this work is carried out by the National Reference Center for Salmonella and other Bacterial Enteric Pathogens (NRC). With rise of next generation sequencing techniques, the NRC has introduced whole-genome-based typing methods for S. enterica in 2016. In this study we report on the feasibility of genome-based in silico serotyping in the German setting using raw sequence reads. We found that SeqSero and seven gene MLST showed 98% and 95% concordance, respectively, with classical serotyping for the here evaluated serotypes, including the most common German serotypes S. Enteritidis and S. Typhimurium as well as less frequently found serotypes. The level of concordance increased to >99% when the results of both in silico methods were combined. However, both tools exhibited misidentification of monophasic variants, in particular monophasic S. Typhimurium and therefore need to be fine-tuned for reliable detection of this epidemiologically important variant. We conclude that with adjustments Salmonella genome-based serotyping might become the new gold standard.Entities:
Year: 2020 PMID: 32152449 PMCID: PMC7062728 DOI: 10.1038/s41598-020-61254-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Overview of serotype prediction with SeqSero and MLST.
| Serotype | Sequen-ced Isolates | Correlation | Ambiguous | Prediction failure | Miscorrelation | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Seq-Sero | MLST | Seq-Sero+MLST | Seq-Sero | MLST | Seq-Sero+MLST | Seq-Sero | MLST | Seq-Sero + MLST | Seq-Sero | MLST | Seq-Sero+MLST | ||
| Agona | 3 | 3 | 3 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Choleraesuis | 33 | 0 | 33 | 33 | 30 or Typhisuis or Paratyphi C | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 |
| Choleraesuis monophasic | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 3 | 3 |
| Derby | 55 | 55 | 55 | 55 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 11:z41:e,n,z15 (novel serovar) | 10 | 10 | 10 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Enteritidis | 115 | 115 | 115 | 115 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Infantis | 50 | 49 | 50 | 50 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| Kentucky | 7 | 7 | 7 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Kintambo | 3 | 0 | 3 | 3 | 3 or Washington | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Kottbus | 12 | 0 | 12 | 12 | 12 or Ferruch | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Mbandaka | 15 | 15 | 15 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Mikawasima | 10 | 10 | 10 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Muenchen | 25 | 0 | 25 | 25 | 25 or Virginia | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Paratyphi B | 6 | 6 | 6 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Paratyphi B monophasic | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Total (%) | 100.0 | 84.0 | 95.2 | 99.4 | 14.0 | 0 | 0 | 1.0 | 0 | 0 | 1.0 | 4.8 | 0.6 |
| Overall (%) | 100.0 | 98.0 | 95.2 | 99.4 | — | — | — | 1.0 | 0 | 0 | 1.0 | 4.8 | 0.6 |
| Strathcona | 2 | 2 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Stourbridge | 14 | 14 | 14 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Sundsvall | 1 | 0 | 1 | 1 | 1 or Soahanina or Sundsvall | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Typhi | 74 | 74 | 74 | 74 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Typhimurium biphasic | 52 | 52 | 32 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 | 0 |
| Typhimurium monophasic | 19 | 17 | 17 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 0 |
| Serologically rough | 6 | 5 | 6 | 6 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| Total number | 520 | 437 | 495 | 517 | 73 | 0 | 0 | 5 | 0 | 0 | 5 | 25 | 3 |
| Total (%) | 100.0 | 84.0 | 95.2 | 14.0 | 0 | 0 | 0 | 1.0 | 0 | 0 | 1.0 | 4.8 | 0.6 |
| Overall (%) | 100.0 | 98.0 | 95.2 | 99.4 | — | — | — | 1.0 | 0 | 0 | 1.0 | 4.8 | 0.6 |
Serotype was first determined by classical serotyping. Whole genome sequences were then analyzed with SeqSero or MLST. Correlation means that the predicted serotype was the same as the classically determined serovar. Ambiguous means that the correct serotype was listed among others. Prediction failure means that no complete antigenic formula was derived. Miscorrelation means that a wrong antigenic formula was derived. Overall (%) is the sum of Ambiguous (%) and Correlation (%). Final results are shown, i.e. after resequencing if data quality was not met.
Figure 1Unweighted Pair Group Method with Arithmetic mean (UPGMA) tree of all investigated isolates based on 7-gene MLST. The tree shows that serovars correlate with STs and different STs can belong to the same eBG. Colors are based on ST. ND means no official eBG number available from Enterobase.
Figure 2Minimal Spanning tree of monophasic and biphasic S. Typhimurium isolates based on the Enterobase core genome MLST scheme and 7-gene MLST. The tree reveals that S. Typhimurium isolates cluster according to ST rather than expression of flagellin. Colors are based on phase and STs.
Overview of Serovars with corresponding MLST sequence types and e-Burst groups.
| Sequence type | e-Burst Group | Number of Isolates | |
|---|---|---|---|
| Agona | 13 | 54 | 3 |
| Choleraesuis | 139 | 6 | 1 |
| Choleraesuis | 145 | 6 | 36 |
| Derby | 39 | 57 | 6 |
| Derby | 774 | 57 | 1 |
| Derby | 40 | 57 | 5 |
| Derby | 71 | 244 | 2 |
| Derby | 682 | 264 | 41 |
| Enteritidis | 11 | 4 | 110 |
| Enteritidis | 183 | 4 | 5 |
| 11:z41:e,n,z15 | 2914 | 472 | 10 |
| Infantis | 32 | 31 | 49 |
| Infantis | 2283 | 31 | 1 |
| Kentucky | 198 | 56 | 7 |
| Kintambo | 407 | 400 | 1 |
| Kintambo | 2839 | ND | 1 |
| Kintambo | 5841 | ND | 1 |
| Kottbus | 212 | 64 | 11 |
| Kottbus | 1669 | 63 | 1 |
| Mikawasima | 1815 | 247 | 10 |
| Mbandaka | 413 | 62 | 15 |
| Muenchen | 82 | 8 | 25 |
| Paratyphi B | 86 | 5 | 6 |
| Paratyphi B mono (var Java) | 42 | 32 | 1 |
| Paratyphi C | 146 | 20 | 2 |
| Poano | 557 | 87 | 2 |
| Strathcona | 2559 | ND | 2 |
| Stourbridge | 736 | 438 | 8 |
| Stourbridge (only RKI data) | 3736 | 464 | 6 |
| Sundsvall (first typed as Poano) | 488 | 305.2 | 1 |
| Subsp. II | 781 | 340 | 1 |
| Typhi | 1 | 13 | 38 |
| Typhi | 2 | 13 | 32 |
| Typhi | 2173 | 13 | 1 |
| Typhi | 2209 | 13 | 1 |
| Typhi | 3677 | 13 | 2 |
| Typhimurium & monophasic var. | 19 | 1 | 36 |
| Typhimurium & monophasic var. | 34 | 1 | 39 |
| Total | 39 | >26 | 520 |
ND: No official eBG number available from Enterobase.
Overview of advantages and drawbacks of the investigated typing methods and their sources of errors.
| Typing Method | Advantage | Drawback | Main reasons for errors | How to address sources of errors |
|---|---|---|---|---|
| Serotyping | Directly determines phenotype | No typing of rough strains possible | Lack of experience with serotyping | Intensively trained staff |
| Well established method | Requires high quality antisera | Quality control mechanism | ||
| SeqSero | Classification analogous to classical serotyping | Genotype may not correspond to phenotype due to undetected mutations | Low sequence data quality | Quality control mechanism, e.g. of sequencing process |
| No assembly required | High quality sequencing data required (e.g. coverage, contamination) | Monophasic variants are only determined by lack of | Improve detection method for monophasic variants | |
| Can be automated | ||||
| MLST-based typing | Provides phylogenetic information | High quality sequencing data required (e.g. coverage, contamination) | Low sequence data quality | Quality control mechanism, e.g. of sequencing process |
| Can be automated | Assembly recommended |
Concerning classical serotyping we also referred to Hendriksen et al.[13].