| Literature DB >> 31682222 |
Michael Payne1, Sophie Octavia1, Laurence Don Wai Luu1, Cristina Sotomayor-Castillo2,3, Qinning Wang3, Alfred Chin Yen Tay4, Vitali Sintchenko2,3, Mark M Tanaka1, Ruiting Lan1.
Abstract
Salmonella enterica serovar Typhimurium is the leading cause of salmonellosis in Australia, and the ability to identify outbreaks and their sources is vital to public health. Here, we examined the utility of whole-genome sequencing (WGS), including complete genome sequencing with Oxford Nanopore technologies, in examining 105 isolates from an endemic multi-locus variable number tandem repeat analysis (MLVA) type over 5 years. The MLVA type was very homogeneous, with 90 % of the isolates falling into groups with a five SNP cut-off. We developed a new two-step approach for outbreak detection using WGS. The first clustering at a zero single nucleotide polymorphism (SNP) cut-off was used to detect outbreak clusters that each occurred within a 4 week window and then a second clustering with dynamically increased SNP cut-offs were used to generate outbreak investigation clusters capable of identifying all outbreak cases. This approach offered optimal specificity and sensitivity for outbreak detection and investigation, in particular of those caused by endemic MLVA types or clones with low genetic diversity. We further showed that inclusion of complete genome sequences detected no additional mutational events for genomic outbreak surveillance. Phylogenetic analysis found that the MLVA type was likely to have been derived recently from a single source that persisted over 5 years, and seeded numerous sporadic infections and outbreaks. Our findings suggest that SNP cut-offs for outbreak cluster detection and public-health surveillance should be based on the local diversity of the relevant strains over time. These findings have general applicability to outbreak detection of bacterial pathogens.Entities:
Keywords: Salmonella Typhimurium; bacterial population genomics; genetic clustering; genomic epidemiology; outbreak detection
Mesh:
Substances:
Year: 2021 PMID: 31682222 PMCID: PMC8627665 DOI: 10.1099/mgen.0.000310
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Frequency of the 3-9-7-12-523 MLVA type in NSW, Australia, from 2007 to 2014.
|
Year |
No. of 3-9-7-12-523 isolates |
Total no. of isolates |
Percentage |
Rank* |
|---|---|---|---|---|
|
2007 |
3 |
247 |
1.21 |
11 (115) |
|
2008 |
28 |
1240 |
2.26 |
9 (233) |
|
2009 |
211 |
1604 |
13.15 |
2 (292) |
|
2010 |
36 |
973 |
3.70 |
3 (247) |
|
2011 |
292 |
2104 |
13.87 |
1 (375) |
|
2012 |
116 |
1677 |
6.91 |
3 (293) |
|
2013 |
55 |
1980 |
2.77 |
9 (379) |
|
2014 |
116 |
2287 |
5.07 |
4 (322) |
*Numbers in brackets are the total number of MLVA types in that year.
Fig. 1.Maximum parsimony phylogeny of 105 . Typhimurium isolates from MLVA type 3-9-7-12-523 from NSW from 2010 to 2014. (a) An overview of the phylogeny, which includes the three major clades (A in red, B in green and C in blue) that include 101 of 105 isolates in the MLVA type. The remaining isolates are shown in black, as is the LT2 reference that is used as an outgroup. (b) A detailed view of the three large clades including the three known outbreaks from 2010, 2012 and 2014. Ancestral nodes for each of clade A, B and C are highlighted in red, green and blue, and have eight, four and seven branches, respectively. All of these branches arise from a polytomy in each case, highlighting the low genetic diversity of the source strain. The year (Y), month (M), day (D) and region of isolation for each isolate are shown, as are single linkage SNP clusters with zero to five SNP cut-offs. At each cut-off, isolates are grouped into clusters, and each cluster is assigned a cluster number and corresponding colour.
Fig. 2.Characteristics of zero to five SNP clusters derived from 105 MLVA type 3-9-7-12-523 isolates. (a) The number of isolates that are grouped into SNP clusters by single linkage clustering at each of six cut-offs from zero to five SNPs. The number of clusters at each level is shown above each column. (b) The period of time between isolation of the first isolate and the last (lifespan) for each cluster in days at each SNP cut-off. Median cluster length is shown with horizontal bars.