| Literature DB >> 34730485 |
Isabelle Bernaquez1, Christiane Gaudreau2,3, Pierre A Pilon4,5, Sadjia Bekal1,3.
Abstract
Entities:
Keywords: SNVPhyl; Shigella; mobile genetic elements; outbreak; whole-genome sequencing; whole/core genome MLST
Mesh:
Year: 2021 PMID: 34730485 PMCID: PMC8743557 DOI: 10.1099/mgen.0.000672
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Scatterplot of all outbreak-related pairwise differences of coreMLST, cgMLST and wgMLST against SNVPhyl for (a) and (b).
Summary of the SNVPhyl, coreMLST, cgMLST and wgMLST subtyping results, diversity indexes and statistical significances for the complete 91 . and 232 . isolate collections
|
Species |
Method |
No. of hqSNV locations/loci* |
No. of ST |
Size of largest ST |
Simpson’s DI |
Jackknife pseudo-values 95 % CIs |
| ||
|---|---|---|---|---|---|---|---|---|---|
|
SNVPhyl |
coreMLST |
cgMLST | |||||||
|
|
SNVPhyl |
2717 |
76 |
5 |
0.995 |
(0.990–0.999) | |||
|
coreMLST |
2511 |
63 |
5 |
0.987 |
(0.980–0.994) |
0.039 | |||
|
cgMLST |
3398 |
86 |
2 |
0.999 |
(0.997–1.000) |
0.060 |
<0.001 | ||
|
wgMLST |
4051 |
89 |
2 |
1.000 |
(0.999–1.000) |
0.036 |
<0.001 |
0.213 | |
|
|
SNVPhyl |
1072 |
128 |
35 |
0.970 |
(0.957–0.984) | |||
|
coreMLST |
2511 |
110 |
28 |
0.970 |
(0.960–0.980) |
0.976 | |||
|
cgMLST |
3613 |
196 |
4 |
0.998 |
(0.997–0.999) |
<0.001 |
<0.001 | ||
|
wgMLST |
4426 |
209 |
4 |
0.999 |
(0.998–1.000) |
<0.001 |
<0.001 |
0.013 | |
*Number of sites used to generate phylogeny by SNVPhyl. Number of alleles included in the exported categorical data matrixes from BioNumerics.
†Significant differences in discriminatory power are indicated in green (P-value <0.05 and non-overlapping CIs).
CG, core genome; CI, confidence interval; DI, diversity index; hqSNV, high-quality single-nucleotide variant; MLST, multi-locus sequence typing; SNVPhyl, single-nucleotide phylogenomics pipeline; ST, sequence type; WG, whole genome.
Features, intra-outbreak/subcluster genetic distances and largest minimum spanning distances determined by SNVPhyl, coreMLST, cgMLST and wgMLST of the four and five outbreaks under study in addition to the six and six subclusters
|
Species |
Outbreaks/ Subclusters |
Year |
Serotypes |
No. of sequenced isolates |
Duration (days)* |
Sex (M/F) |
Age range |
Epidemiology (frequency) |
Intra-outbreak distance ranges† |
Largest minimum spanning distances‡ | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
SNVPhyl |
coreMLST |
cgMLST |
wgMLST |
SNVPhyl |
coreMLST |
cgMLST |
wgMLST | ||||||||||
|
|
SF-01 |
2017–2019 |
2a and Y |
31 |
1012 |
31/0 |
23–86 |
MSM (14) |
0–37 (17) |
0–30 (13) |
1–41 (19) |
1–53 (28) |
23 |
16 |
23 |
26 | |
|
SF-01a |
2018–2019 |
2a |
4 |
289 |
4/0 |
23–67 |
MSM (4)+Travel (1) |
1–6 (3.5) |
0–2 (1) |
1–5 (3) |
1–8 (4) |
4 |
1 |
3 |
5 | ||
|
SF-01b |
2019 |
2a |
6 |
257 |
6/0 |
23–86 |
MSM (2) |
0–4 (2) |
0–4 (1) |
1–5 (3) |
2–7 (4) |
3 |
3 |
3 |
3 | ||
|
SF-01c |
2019 |
2a |
5 |
69 |
5/0 |
36–59 |
MSM (3) |
0–3 (1.5) |
0–2 (1) |
2–6 (3) |
2–8 (4.5) |
2 |
1 |
3 |
4 | ||
|
SF-01d |
2018–2019 |
Y |
7 |
322 |
7/0 |
29–78 |
MSM (1)+Travel (1) |
0–10 (5) |
0–9 (4) |
2–16 (8) |
5–18 (13) |
5 |
4 |
8 |
8 | ||
|
SF-02 |
2018–2019 |
1b and 3b |
42 |
529 |
40/2 |
24–72 |
MSM (23)+Travel (1) |
0–17 (6) |
0–10 (3) |
0–14 (6) |
0–26 (9) |
8 |
4 |
6 |
11 | ||
|
SF-02a |
2018–2019 |
1b and 3b |
15 |
510 |
14/1 |
36–68 |
MSM (9) |
0–10 (4) |
0–4 (2) |
0–8 (4) |
1–12 (6) |
3 |
2 |
3§ |
5§ | ||
|
SF-02b |
2019 |
1b |
3 |
76 |
2/1 |
38–46 |
MSM (2) |
0–1 (1) |
0–1 (1) |
0–3 (2) |
2–5 (2) |
1 |
1 |
2 |
2 | ||
|
SF-03 |
2018–2019 |
2a |
7 |
226 |
7/0 |
39–71 |
MSM (6)+Travel (1) |
0–6 (3) |
0–3 (2) |
1–6 (4) |
2–8 (5) |
3 |
2 |
4 |
4 | ||
|
SF-04 |
2018 |
1b |
5 |
25 |
2/3 |
4–37 |
Food (3)+Familial (1) |
0–2 (1) |
0 (0) |
1–6 (3) |
1–7 (3) |
1 |
0 |
3 |
3 | ||
|
|
SS-01 |
2013 |
– |
4 |
5 |
0/4 |
24–54 |
Non-MSM (4) |
0 (0) |
0–1 (0.5) |
1–3 (2) |
1–4 (2) |
0 |
1 |
2 |
2 | |
|
SS-02 |
2014 |
– |
3 |
22 |
0/3 |
46–53 |
Travel (3) |
1–7 (6) |
1–3 (1) |
2–7 (4) |
2–8 (5) |
6 |
1 |
4 |
4 | ||
|
SS-03 |
2017–2019 |
– |
57 |
797 |
54/3 |
21–75 |
MSM (29)+Travel (1) |
0–16 (5) |
0–9 (3) |
0–16 (6) |
0–34 (8) |
6 |
4 |
6 |
16 | ||
|
SS-03a |
2017–2018 |
– |
16 |
155 |
16/0 |
23–75 |
MSM (12)+Travel (1) |
0–3 (1) |
0–2 (1) |
0–6 (2) |
0–8 (4) |
1 |
1 |
4§ |
4§ | ||
|
SS-03b |
2018–2019 |
– |
21 |
603 |
20/1 |
22–70 |
MSM (13) |
0–4 (2) |
0–5 (1) |
0–10 (4) |
0–29 (6) |
2 |
4 |
4 |
16 | ||
|
SS-04 |
2018 |
– |
7 |
33 |
3/4 |
1–76 |
Food (7) |
0–1 (0) |
0–1 (0) |
0–3 (1) |
0–5 (2) |
1 |
1 |
1 |
2 | ||
|
SS-05 |
2018–2019 |
– |
120 |
407 |
116/4 |
1–71 |
MSM (28)+Travel (2) |
0–23 (8) |
0–16 (6) |
0–24 (10) |
0–117 (15) |
9 |
8 |
10 |
19 | ||
|
SS-05a |
2018–2019 |
– |
69 |
396 |
67/2 |
24–70 |
MSM (19)+Travel (1) |
0–7 (1) |
0–5 (2) |
0–10 (4) |
0–57 (5) |
3 |
2 |
5 |
7 | ||
|
SS-05b |
2018–2019 |
– |
36 |
351 |
36/0 |
21–71 |
MSM (8) |
0–8 (2) |
0–6 (1) |
0–10 (4) |
0–75 (6) |
3 |
3 |
4 |
31§ | ||
|
SS-05c |
2019 |
– |
3 |
281 |
3/0 |
28–47 |
Unknown |
0–4 (4) |
0–3 (3) |
2–6 (5) |
2–7 (6) |
1 |
3 |
5 |
5 | ||
|
SS-05d |
2019 |
– |
4 |
29 |
2/2 |
1–41 |
Unknown |
0–1 (0.5) |
0–1 (1) |
2–4 (3) |
2–5 (4) |
4 |
1 |
3 |
4 | ||
*Total number of days between the isolate dates of the first and last outbreak/subcluster-related isolates.
†Minimum and maximum pairwise differences between outbreak/subcluster-related isolates. Median pairwise genetic differences are in parentheses.
‡Largest genetic distance between neighbouring outbreak/subcluster-related isolates according to the minimum spanning trees (n differences).
§All isolates did not group together into a single minimum spanning cluster. The distance reported is the largest sum of minimum spanning distances between subcluster-related isolates.
CG, core genome; MLST, multi-locus sequence typing; MSM, men who have sex with men; SNVPhyl, single-nucleotide variant phylogenomics pipeline; WG, whole genome.
Fig. 2.WgMLST ML phylogenetic tree and plasmid profiles of the 57 isolates involved in the SS-03 outbreak. aPlasmid clusters are defined as a group of closed reference plasmids with high sequence similarity by MOB-cluster. Contigs of plasmid origin in the draft assemblies of the isolates under study were then assigned a plasmid cluster.
Fig. 3.Summary of the reference-based hqSNV locations and BioNumerics loci used to generate phylogeny via SNVPhyl, coreMLST, cgMLST and wgMLST for all 10 (a) and 11 (b) outbreaks and subclusters.
Fig. 4.Box and Tukey whisker plots of the pairwise genetic distances for all 10 (a) and 11 (b) outbreaks and subclusters via the use of isolated (I) outbreak/subcluster-specific collections and the complete (C) species-specific isolate collections as inputs to SNVPhyl, coreMLST, cgMLST and wgMLST. The black dots inside the boxes represent the mean genetic distances.