| Literature DB >> 29593690 |
Anne-Catherine Portmann1, Coralie Fournier2, Johan Gimonet1, Catherine Ngom-Bru1, Caroline Barretto1, Leen Baert1.
Abstract
Whole genome sequencing (WGS), using high throughput sequencing technology, reveals the complete sequence of the bacterial genome in a few days. WGS is increasingly being used for source tracking, pathogen surveillance and outbreak investigation due to its high discriminatory power. In the food industry, WGS used for source tracking is beneficial to support contamination investigations. Despite its increased use, no standards or guidelines are available today for the use of WGS in outbreak and/or trace-back investigations. Here we present a validation of our complete (end-to-end) WGS workflow for Listeria monocytogenes and Salmonella enterica including: subculture of isolates, DNA extraction, sequencing and bioinformatics analysis. This end-to-end WGS workflow was evaluated according to the following performance criteria: stability, repeatability, reproducibility, discriminatory power, and epidemiological concordance. The current study showed that few single nucleotide polymorphism (SNPs) were observed for L. monocytogenes and S. enterica when comparing genome sequences from five independent colonies from the first subculture and five independent colonies after the tenth subculture. Consequently, the stability of the WGS workflow for L. monocytogenes and S. enterica was demonstrated despite the few genomic variations that can occur during subculturing steps. Repeatability and reproducibility were also demonstrated. The WGS workflow was shown to have a high discriminatory power and has the ability to show genetic relatedness. Additionally, the WGS workflow was able to reproduce published outbreak investigation results, illustrating its capability of showing epidemiological concordance. The current study proposes a validation approach comprising all steps of a WGS workflow and demonstrates that the workflow can be applied to L. monocytogenes or S. enterica.Entities:
Keywords: Listeria monocytogenes; SNP; Salmonella enterica; WGS; outbreak; phylogeny; source tracking; validation
Year: 2018 PMID: 29593690 PMCID: PMC5861296 DOI: 10.3389/fmicb.2018.00446
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Stability and Repeatability dataset for L. monocytogenes and S. enterica.
| PIR00542 | SAMN08125517 | 1/2a | ||
| PIR00543 | SAMN08125518 | 1/2a | ||
| PIR00545 | SAMN08125520 | 1/2b | ||
| PIR00540 | SAMN08125515 | 1/2c | ||
| PIR00541 | SAMN08125516 | 1/2c | ||
| PIR00544 | SAMN08125519 | 4b | ||
| PIR00546 | SAMN08125521 | 4b | ||
| PIR00547 | SAMN08125522 | 4b | ||
| PIR00532 | SAMN08125525 | Enteritidis | ||
| PIR00558 | SAMN08125532 | Enteritidis | ||
| PIR00533 | SAMN08125526 | Hadar | ||
| PIR00534 | SAMN08125527 | Hadar | ||
| PIR00537 | SAMN08125530 | Tennessee | ||
| PIR00535 | SAMN08125528 | Tennessee | ||
| PIR00536 | SAMN08125529 | Typhimurium | ||
| PIR00538 | SAMN08125531 | Typhimurium |
S. enterica and L. monocytogenes sequences dataset and their Sequence Read Archive numbers used for the stability and repeatability experiments.
| PIR00540 | SRR6347524 | SRR6347523 | SRR6347352 | SRR6347349 | SRR6347348 | SRR6347347 | SRR6347542 | SRR6347538 | SRR6347486 | SRR6347426 |
| PIR00541 | SRR6347510 | SRR6347509 | SRR6347512 | SRR6347511 | SRR6347506 | SRR6347505 | SRR6347508 | SRR6347507 | SRR6347514 | SRR6347513 |
| PIR00542 | SRR6347487 | SRR6347488 | SRR6347489 | SRR6347490 | SRR6347491 | SRR6347492 | SRR6347493 | SRR6347494 | SRR6347484 | SRR6347485 |
| PIR00543 | SRR6347473 | SRR6347472 | SRR6347471 | SRR6347470 | SRR6347469 | SRR6347468 | SRR6347467 | SRR6347466 | SRR6347465 | SRR6347464 |
| PIR00544 | SRR6347427 | SRR6347432 | SRR6347424 | SRR6347425 | SRR6347442 | SRR6347443 | SRR6347403 | SRR6347441 | SRR6347428 | SRR6347429 |
| PIR00545 | SRR6347407 | SRR6347406 | SRR6347409 | SRR6347408 | SRR6347411 | SRR6347410 | SRR6347413 | SRR6347412 | SRR6347405 | SRR6347404 |
| PIR00546 | SRR6347388 | SRR6347389 | SRR6347390 | SRR6347391 | SRR6347384 | SRR6347385 | SRR6347386 | SRR6347387 | SRR6347382 | SRR6347383 |
| PIR00547 | SRR6347353 | SRR6347402 | SRR6347351 | SRR6347350 | SRR6347357 | SRR6347356 | SRR6347355 | SRR6347354 | SRR6347360 | SRR6347359 |
| PIR00532 | SRR6347380 | SRR6347381 | SRR6347378 | SRR6347379 | SRR6347376 | SRR6347377 | SRR6347374 | SRR6347375 | SRR6347372 | SRR6347373 |
| PIR00533 | SRR6347545 | SRR6347544 | SRR6347547 | SRR6347546 | SRR6347541 | SRR6347540 | SRR6347543 | SRR6347527 | SRR6347539 | SRR6347525 |
| PIR00534 | SRR6347478 | SRR6347479 | SRR6347480 | SRR6347481 | SRR6347474 | SRR6347475 | SRR6347476 | SRR6347477 | SRR6347482 | SRR6347483 |
| PIR00535 | SRR6347498 | SRR6347497 | SRR6347496 | SRR6347495 | SRR6347502 | SRR6347501 | SRR6347500 | SRR6347499 | SRR6347504 | SRR6347503 |
| PIR00536 | SRR6347358 | SRR6347361 | SRR6347521 | SRR6347522 | SRR6347519 | SRR6347520 | SRR6347517 | SRR6347518 | SRR6347515 | SRR6347516 |
| PIR00537 | SRR6347535 | SRR6347534 | SRR6347537 | SRR6347536 | SRR6347531 | SRR6347530 | SRR6347533 | SRR6347532 | SRR6347529 | SRR6347528 |
| PIR00538 | SRR6347364 | SRR6347365 | SRR6347366 | SRR6347367 | SRR6347368 | SRR6347369 | SRR6347370 | SRR6347371 | SRR6347362 | SRR6347363 |
| PIR00558 | SRR6347399 | SRR6347398 | SRR6347397 | SRR6347396 | SRR6347395 | SRR6347394 | SRR6347393 | SRR6347392 | SRR6347401 | SRR6347400 |
| Stability | x | x | x | x | x | x | x | x | x | x |
| Repeatability | x | x | x | x | x | |||||
For each strain ten genome sequences were generated: five from independent colonies of the first subculture (A, B, C, D, and E) and five from independent colonies after the tenth subculture (A1, A2, A3, A4, and A5). For each analysis the de novo assembled Illumina reads of isolate (A) were used as reference.
×:used in the experimental setup.
L. monocytogenes sequence data used for assessing the reproducibility by hqSNP analysis.
| PIR00542 | *SRR6347440 | DEO1 | SO1 | SL1 | |
| SRR6347433 | DEO1 | SO2 | SL1 | ||
| SRR6347434 | DEO1 | SO3 | SL2 | ||
| SRR6347435 | DEO2 | SO1 | SL1 | ||
| SRR6347436 | DEO2 | SO2 | SL1 | ||
| SRR6347430 | DEO2 | SO3 | SL2 | ||
| PIR00544 | *SRR6347431 | DEO1 | SO1 | SL1 | |
| SRR6347459 | DEO1 | SO2 | SL1 | ||
| SRR6347458 | DEO1 | SO3 | SL2 | ||
| SRR6347457 | DEO2 | SO1 | SL1 | ||
| SRR6347456 | DEO2 | SO2 | SL1 | ||
| SRR6347463 | DEO2 | SO3 | SL2 | ||
| PIR00547 | *SRR6347462 | DEO1 | SO1 | SL1 | |
| SRR6347461 | DEO1 | SO2 | SL1 | ||
| SRR6347460 | DEO1 | SO3 | SL2 | ||
| SRR6347455 | DEO2 | SO1 | SL1 | ||
| SRR6347454 | DEO2 | SO2 | SL1 | ||
| SRR6347526 | DEO2 | SO3 | SL2 |
Two different operators performed the DNA extractions (DEO1 and DEO2).
Three different sequencing operators (SO1, SO2, and SO3) sequenced the DNA at
two different sequencing laboratories (SL1 and SL2).
For each analysis two references were used and evaluated: de novo assembled Illumina reads of isolate (.
S. enterica sequence data used for assessing the reproducibility by hqSNP analysis.
| PIR00534 | *SRR6347416 | DEO1 | SO1 | SL1 | |
| SRR6347417 | DEO1 | SO2 | SL1 | ||
| SRR6347414 | DEO1 | SO3 | SL2 | ||
| SRR6347415 | DEO2 | SO1 | SL1 | ||
| SRR6347420 | DEO2 | SO2 | SL1 | ||
| SRR6347421 | DEO2 | SO3 | SL2 | ||
| PIR00536 | *SRR6347418 | DEO1 | SO1 | SL1 | |
| SRR6347419 | DEO1 | SO2 | SL1 | ||
| SRR6347422 | DEO1 | SO3 | SL2 | ||
| SRR6347423 | DEO2 | SO1 | SL1 | ||
| SRR6347445 | DEO2 | SO2 | SL1 | ||
| SRR6347444 | DEO2 | SO3 | SL2 | ||
| PIR00503 | *SRR6347447 | DEO1 | SO1 | SL1 | |
| SRR6347446 | DEO1 | SO2 | SL1 | ||
| SRR6347449 | DEO1 | SO3 | SL2 | ||
| SRR6347448 | DEO2 | SO1 | SL1 | ||
| SRR6347451 | DEO2 | SO2 | SL1 | ||
| SRR6347450 | DEO2 | SO3 | SL2 |
Two different operators performed the DNA extractions (DEO1 and DEO2).
Three different sequencing operators (SO1, SO2, and SO3) sequenced the DNA at
two different sequencing laboratories (SL1 and SL2).
For each analysis two references were used and evaluated: de novo assembled Illumina reads of isolate (.
L. monocytogenes sequence dataset for assessing the discriminatory power by hqSNP analysis.
| 1/2c | ATCC 51779 (Belgium) Independent colonies from the first subculture | PIR00541 (A) | SRR6347510 | Group 1 |
| PIR00541 (B) | SRR6347509 | |||
| PIR00541 (C) | SRR6347512 | |||
| 1/2a | ATCC 51775 (Belgium) Independent colonies from the first subculture | PIR00542 (A) | SRR6347487 | Group 2 |
| PIR00542 (B) | SRR6347488 | |||
| PIR00542 (C) | SRR6347489 | |||
| 4b | ATCC 13932 (Germany) Independent colonies from the first subculture | PIR00547 (A) | SRR6347353 | Group 3 |
| PIR00547 (B) | SRR6347402 | |||
| PIR00547 (C) | SRR6347351 | |||
| NCTC 11994 (UK) | PIR00492 | SRR6347453 | Group 4 | |
| SRR6347452 | ||||
| PIR00548 | SRR6347437 | |||
| SRR6347438 | ||||
| SRR6347439 |
For this isolate two genome sequences were generated from independent subcultures.
For this isolate three genome sequences were generated from independent subcultures.
S. enterica sequence dataset for assessing the discriminatory power by hqSNP analysis.
| PIR00532 (A) | SRR6347380 | Independent colonies from the first subculture | Group 1 | |
| PIR00532 (B) | SRR6347381 | |||
| PIR00532 (C) | SRR6347378 | |||
| PIR00558 (A) | SRR6347399 | ATCC BAA-708 Independent colonies from the first subculture | Group 2 | |
| PIR00558 (B) | SRR6347398 | |||
| PIR00558 (C) | SRR6347397 | |||
| PHE: 32476 | SRR1965122 | European egg outbreak | Group 3 | |
| PHE: 21785 | SRR1965313 | |||
| PHE: 32477 | SRR1966289 | |||
| S14BD01753 | SRR2088895 | Belgian egg outbreak | Group 4 | |
| S14FP01877 | SRR2088898 | |||
| PIR00535 (A) | SRR6347498 | ATCC 10722 Independent colonies from the first subculture | Group 1 | |
| PIR00535 (B) | SRR6347497 | |||
| PIR00535 (C) | SRR6347496 | |||
| PIR00537 (A) | SRR6347535 | Independent colonies from the first subculture | Group 2 | |
| PIR00537 (B) | SRR6347534 | |||
| PIR00537 (C) | SRR6347537 | |||
| CFSAN001387 | SRR965704 | American peanut butter outbreaks | Group 3 | |
| CFSAN001349 | SRR1177176 | |||
| PIR00536 (A) | SRR6347358 | ATCC13311 Independent colonies from the first subculture | Group 1 | |
| PIR00536 (B) | SRR6347361 | |||
| PIR00536 (C) | SRR6347521 | |||
| PIR00538 (A) | SRR6347364 | Independent colonies from the first subculture | Group 2 | |
| PIR00538 (B) | SRR6347365 | |||
| PIR00538 (C) | SRR6347366 | |||
| Styph-0803T57157 | ERR277220 | Large Denmark outbreak | Group 3 | |
| Styph-0808S61603 | ERR277226 | |||
| 1687 | SRR1645473 | Australian chocolate mousse outbreak | Group 4 | |
| 1700 | SRR1645486 |
Sequence data obtained from the stability experiment described in Table .
Maximum SNPs observed during the stability test of L. monocytogenes between five independent colonies from the first subculture and after the tenth subculture.
| PIR00542 | 1/2a | 0 | 0 |
| PIR00543 | 1/2a | 0 | 0 |
| PIR00545 | 1/2b | 0 | 0 |
| PIR00540 | 1/2c | 0 | 0 |
| PIR00541 | 1/2c | 1 | 1 |
| PIR00544 | 4b | 2 | 3 |
| PIR00546 | 4b | 0 | 0 |
| PIR00547 | 4b | 0 | 0 |
Maximum SNPs observed during the stability test of Salmonella enterica between five independent colonies from the first subculture and after the tenth subculture.
| PIR00532 | Enteritidis | 0 | 0 |
| PIR00558 | Enteritidis | 0 | 1 |
| PIR00533 | Hadar | 0 | 0 |
| PIR00534 | Hadar | 0 | 0 |
| PIR00537 | Tennessee | 0 | 0 |
| PIR00535 | Tennessee | 0 | 1 |
| PIR00536 | Typhimurium | 0 | 0 |
| PIR00538 | Typhimurium | 0 | 1 |
Figure 1Phylogenetic tree based on SNP differences from selected patient isolates and the lettuce reference isolate. *corresponds to the recalled lettuce isolate. The SRR1263956 corresponds to the Ohio patient isolate. Tree was generated with Garli and drawn with Figtree.
Figure 2Phylogenetic tree based on SNP differences between all patient isolates and the reference lettuce isolate. *corresponds to the recalled lettuce isolate. All the other isolates are patients' isolates. The SRR1263956 corresponds to the Ohio patient isolate. Tree was generated with Garli and drawn with Figtree.
Figure 3Phylogenetic tree based on SNPs differences from S. Enteritidis sequences shared by the authors. Blue: previous UK cases isolates, red: Spanish isolates, black: UK patients' isolates, purple: supplementary data provided by the authors (not linked to the outbreak) and green: the food isolate. Tree was generated with Garli and drawn with Figtree.
Figure 4Phylogenetic tree based on SNPs differences from S. Enteritidis obtained with the food isolate as reference. Blue: previous UK cases isolates, red: Spanish isolates, black: UK patients' isolates, light green: supplementary data provided by the authors (not linked to the outbreak) and green: the food isolate. Tree was generated with Garli and drawn with Figtree.