| Literature DB >> 30042741 |
Arthur W Pightling1, James B Pettengill1, Yan Luo1, Joseph D Baugher1, Hugh Rand1, Errol Strain1.
Abstract
Whole-genome sequence (WGS) analysis has revolutionized the food safety industry by enabling high-resolution typing of foodborne bacteria. Higher resolving power allows investigators to identify origins of contamination during illness outbreaks and regulatory activities quickly and accurately. Government agencies and industry stakeholders worldwide are now analyzing WGS data routinely. Although researchers have published many studies that assess the efficacy of WGS data analysis for source attribution, guidance for interpreting WGS analyses is lacking. Here, we provide the framework for interpreting WGS analyses used by the Food and Drug Administration's Center for Food Safety and Applied Nutrition (CFSAN). We based this framework on the experiences of CFSAN investigators, collaborations and interactions with government and industry partners, and evaluation of the published literature. A fundamental question for investigators is whether two or more bacteria arose from the same source of contamination. Analysts often count the numbers of nucleotide differences [single-nucleotide polymorphisms (SNPs)] between two or more genome sequences to measure genetic distances. However, using SNP thresholds alone to assess whether bacteria originated from the same source can be misleading. Bacteria that are isolated from food, environmental, or clinical samples are representatives of bacterial populations. These populations are subject to evolutionary forces that can change genome sequences. Therefore, interpreting WGS analyses of foodborne bacteria requires a more sophisticated approach. Here, we present a framework for interpreting WGS analyses that combines SNP counts with phylogenetic tree topologies and bootstrap support. We also clarify the roles of WGS, epidemiological, traceback, and other evidence in forming the conclusions of investigations. Finally, we present examples that illustrate the application of this framework to real-world situations.Entities:
Keywords: Escherichia coli; Listeria monocytogenes; Salmonella enterica; genomic epidemiology; interpretation; outbreak investigation; phylogenetics; whole-genome sequence
Year: 2018 PMID: 30042741 PMCID: PMC6048267 DOI: 10.3389/fmicb.2018.01482
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Maximum pairwise SNPs measured during investigations into foodborne illness outbreaks and contamination events.
| Organism | Maximum SNP count (number) | Maximum SNP count (range) | Reference | ||
|---|---|---|---|---|---|
| <21 | 21–100 | >100 | |||
| 4 | X | ||||
| 15 | X | ||||
| 9 | X | ||||
| 12 | X | ||||
| 18 | X | ||||
| 20 | X | ||||
| 21 | X | ||||
| 28 | X | ||||
| 29 | X | ||||
| 42 | X | ||||
| 67 | X | ||||
| 2 | X | ||||
| 3 | X | ||||
| 3 | X | ||||
| 6 | X | ||||
| 12 | X | ||||
| 30 | X | ||||
Conditions used to determine whether whole-genome sequence analyses support a match between two or more genomes.
| Supports | Neutral | Does not support | |
|---|---|---|---|
| SNP distance | <21 | 21–100 | >100 |
| Bootstrap support | >0.89 | 0.80–0.89 | <0.80 |
| Tree topology | Monophyletic | Paraphyletic | Polyphyletic |
Characteristics of the examples presented in this paper.
| Example | SNP distance | Bootstrap support | Tree topology | Epidemiology, traceback, or compliance findings | Conclusion |
|---|---|---|---|---|---|
| Identifying the source of an | Supports | Supports | Supports | Supports | Match |
| Matching food isolates from one firm to environmental isolates from another firm | Supports | Supports | Supports | Supports | Match |
| Identifying a resident pathogen | Supports | Supports | Supports | Not applicable | Not applicable |
| Populations of environmental isolates can be very diverse | Neutral | Supports | Supports | Not applicable | Not applicable |
| Analyzing paraphyletic relationships | Supports | Neutral | Neutral | Does not support | No match |
| Evidence that isolates arose from the same source by WGS does not necessarily mean that they are linked | Supports | Supports | Supports | Does not support | No match |