| Literature DB >> 31354679 |
Silin Tang1, Renato H Orsi2, Hao Luo1, Chongtao Ge1, Guangtao Zhang1, Robert C Baker1, Abigail Stevenson1, Martin Wiedmann2.
Abstract
The food industry is facing a major transition regarding methods for confirmation, characterization, and subtyping of Salmonella. Whole-genome sequencing (WGS) is rapidly becoming both the method of choice and the gold standard for Salmonella subtyping; however, routine use of WGS by the food industry is often not feasible due to cost constraints or the need for rapid results. To facilitate selection of subtyping methods by the food industry, we present: (i) a comparison between classical serotyping and selected widely used molecular-based subtyping methods including pulsed-field gel electrophoresis, multilocus sequence typing, and WGS (including WGS-based serovar prediction) and (ii) a scoring system to evaluate and compare Salmonella subtyping assays. This literature-based assessment supports the superior discriminatory power of WGS for source tracking and root cause elimination in food safety incident; however, circumstances in which use of other subtyping methods may be warranted were also identified. This review provides practical guidance for the food industry and presents a starting point for further comparative evaluation of Salmonella characterization and subtyping methods.Entities:
Keywords: MLST; PFGE; Salmonella; WGS; food industry; serotyping; subtyping
Year: 2019 PMID: 31354679 PMCID: PMC6639432 DOI: 10.3389/fmicb.2019.01591
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Overview of Salmonella characterization and subtyping methods.
| Classical White–Kauffman serotyping | While | Very poor subtype discrimination; only valuable as subtyping method for rare and unusual serovars. | 2–17 days (usually >5 days) ( | 2–4 weeks | Classical serotyping is likely to be replaced rapidly by WGS-based serovar prediction. Main value for industry is as a rapid confirmation and subtype screen if access exists to lab that can provide rapid turnaround time. | $5–65 ( | ∼$175 |
| Pulsed-field gel electrophoresis (PFGE) | Intermediate ability to predict serovars | Good subtyping discrimination for most serovars. Some PFGE patterns are very common within some serovars (e.g., Pattern 4 for | 4–6 days ( | 2–3 weeks | Has been the gold standard subtyping method for | $7–50 ( | $130–200 |
| Multiple locus variable number of tandem repeats (VNTR) analysis (MLVA) | Intermediate ability to predict serovars | Good subtyping discrimination for most serovars. May perform better than PFGE for some serovars but worse for others. | 1–2 days | NA2 | Has been used as a secondary subtyping method to compensate the low discriminatory power of serotyping and PFGE for some | $9–36 ( | NA2 |
| Legacy multilocus sequence typing (legacy MLST) | Intermediate ability to predict serovars | Better than conventional serotyping and riboprinting, worse than PFGE and WGS. | 1–2 days | 2–3 days | Main value for industry is as a rapid confirmation and subtype screen, can be used to select the reference genome for WGS data analysis. | $30–82 ( | ∼$280 |
| Whole-genome sequencing (WGS) | Currently available serovar-prediction software using WGS data work well for less common serovars. May not work for extremely rare serovars. | Best discrimination among molecular subtyping approaches | 3–17 days ( | 2–8 weeks | For companies with high demand of isolates to be subtyped, WGS is probably the most affordable and fastest method that provides the best discrimination. In addition, | $60–230 ( | $100 (using Illumina HiSeq X series)–up to more than $500 (using Illumina MiSeq) |
FIGURE 1Timeline of the development of selected molecular subtyping and characterization methods for Salmonella (; Gilson et al., 1990; Threlfall and Frost, 1990; Hulton et al., 1991; Martin et al., 1992; Lindstedt et al., 2003, 2013; Healy et al., 2005; Grimont and Weill, 2007; Zou et al., 2010; Wattiau et al., 2011; PulseNet, 2014; CDC, 2016a, 2019; Nadon et al., 2017).
Comparison of molecular characterization methods for prediction of Salmonella1 serovars.
| PFGE | ||||
| 80 | 6 | Turkey processing plant | 99 | |
| 68 | 10 | Swine farms | 84 | |
| 674 | 12 | Swine | 85 | |
| 866 | 8 | Food animals, production facilities, and clinical samples | 96 | |
| 1,128 | 31 | Food, animals, humans, natural environment, and processing plants | 97 | |
| 46 | 40 | Human and cattle | 75 | |
| 1,486 | 110 | New York State Department of Health, isolates received in 2012; human clinics | 96 | |
| 1,437 | 131 | New York State Department of Health, isolates received in 2013; human clinics | 91 | |
| 1,558 | 107 | New York State Department of Health, isolates received in 2014; human clinics | 90 | |
| Legacy MLST | ||||
| 25 | 7 | Chickens | 92 | |
| 66 | 1 | Cattle, birds, horses, and other animals | 99 | |
| 110 | 25 | Human and veterinary source | 98 | |
| 152 | 33 | Reference collection | 100 | |
| 4,257 | 554 | Reference collection | 88 | |
| 46 | 40 | Human and cattle | 91 | |
| 42,400 | 624 | SRA collection | 91 | |
| 7,338 | 263 | Human | 96 | |
| WGS-(SeqSero) | ||||
| 308 | 72 | CDC collection | 99 | |
| 3,306 | 228 | Genome Trakr collection | 93 | |
| 354 | 44 | GenBank collection | 92 | |
| WGS-(SISTR) | ||||
| 4,291 | 246 | SRA and NCBI Assembly collections | 95 | |
| 42,400 | 624 | SRA collection | 97 | |
Examples of serovars incorrectly predicted by PFGE.
| Montevideo (clustered with Senftenberg) | 6,7 | g,m,s | No phase 2 antigen | |
| Senftenberg (clustered with Montevideo) | 1,3,19 | g,s,t | No phase 2 antigen | |
| Typhimurium var. Copenhagen (clustered with 4,[5],12:i:- and Typhimurium) | 1,4,12 | I | 1,2 | |
| 4,5,12:i:- (clustered with Typhimurium var. Copenhagen and Typhimurium) | 4,5,12 | I | No phase 2 antigen | |
| Typhimurium (clustered with Typhimurium var. Copenhagen and 4,[5],12:i:-) | 1,4,5,12 | I | 1,2 | |
| Saintpaul (clustered with Typhimurium var. Copenhagen and Typhimurium) | 1,4,5,12 | e,h | 1,2 | |
| Putten (clustered with Agona) | 13, 23 | D | l, w | |
| Agona (clustered with Putten) | 4,12 | f,g,s | No phase 2 antigen | |
| Paratyphi B | 1,4,5,12 | B | 1,2 | |
| Give | 3,10 | l,v | 1,7 | |
| Newport | 6,8 | e,h | 1,2 |
Proposed evaluation criteria for Salmonella characterization methods that may be used routinely in the food industry1.
| Stability | Consistency of the typing result for an isolate after its primary isolation and during laboratory storage and subculture. | Typing results should be stable during laboratory storage and subculture; strain markers should not mutate too rapidly to change the strain’s position in the epidemiological context; data on the stability of the markers should be available. | Rapid mutations and recombination of the marker(s) during storage and subculture could lead to poor reproducibility. | 0 – Extremely poor stability |
| Typeability | Ability to assign a type to all isolates tested by it. | Typeability should be as high as possible. | Poor typeability could be found in assays using a scheme that does not cover genetic variation in full; typeability may also be reduced if some isolates show high endogenous nuclease activity. | 0 – Extremely poor typeability |
| Discriminatory power | Ability to assign a different type to two unrelated strains; discriminatory power can be expressed using Simpson’s index of diversity (SID) | Discriminatory power should be as high as possible. For highly discriminatory methods, clustering using phylogenetic analysis tools can be used to define isolates that share a recent common ancestor. | Discriminatory power is highly dependent on the marker(s) selected for typing. | 0 – Extremely poor discriminatory power (<80%, SID <0.80) |
| Epidemiological concordance | Ability to reflect, agree with, and possibly further illuminate the available epidemiological information about the cases under study. | Epidemiological concordance should be as high as possible; strains from the same outbreak or strains that are otherwise linked by epidemiological evidence should be classified into the same subtype (or phylogenetically characterized as sharing a recent common ancestor). | Low epidemiological concordance could be found in assays that either target “low stability markers” or an assay with limited discriminatory power, which will group together isolates that are epidemiologically unrelated. | 0 – Extremely poor epidemiological concordance; <80% isolates are classified correctly. |
| Reproducibility | Ability to perform reproducibly in different laboratories and with different personnel. | Results should be highly reproducible (>99%). | Poor reproducibility could be the results of (i) technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), (ii) reagents not standardized sufficiently, (iii) equipment not performing reproducibly, (iv) poorly optimized typing system, (v) sensitivity of equipment or assay system to environmental factors (e.g., humidity, temperature), (vi) bias in observing, recording, analysis, and interpretation of the results; (vii) or assays targeting biologically highly variable markers (e.g., some of the surface antigens targeted by classical serotyping). | 0 – Extremely poor reproducibility; <80%; meaning for >20% of isolates results are not reproducible between labs |
| Repeatability | Ability to produce the same results in the same laboratory with the same equipment and personnel | Results should be highly repeatable ( > 99%) | Poor repeatability could be the result of i) technically difficult assay (leading to technical errors by personnel, e.g., cross-contamination), ii) reagents not standardized sufficiently, iii) equipment not performing reproducibly. | 0 – Extremely low repeatability (<90%; meaning for >10% of isolates results are not repeatable) |
| Serovar prediction ability | Ability to accurately predict the serovar of a given strain. | Range, as the number of identifiable serovars, and accuracy (i.e., percentage of isolates with correct serovar identification) should be maximized. Accuracy should be given priority over range as misclassification may lead to worse decisions than non-classification. | Poor serovar prediction could be a result of (i) limited database coverage of different serovars, (ii) low discriminatory power, (iii) low typeability, (iv) no standard protocol of serovar prediction with produced data. | 0 – Extremely low serovar prediction accuracy (serovar is correctly predicted for <70% of serovars) |
| Speed | Time to results from pure single colony | <5 days | Speed can be influenced by throughput, equipment, and data analysis program used for a given assay | 0 – >1 month |
| Ease of use | Ease of use encompasses technical simplicity, workload, suitability for high throughput test, ease of data analysis, and result interpretation | Ease of use is important for the implementation of an assay in the internal laboratories of food industry, less important when using services provided by a commercial laboratory. | Poor ease of use is usually caused by the high level of expertise and experience required by a given assay, e.g., bioinformatics expertise to analyze data produced by the assay. | 0 – The given assay requires extremely high level of expertise and experience in specific techniques (PhD level scientist with >4 days of specialized training) |
| Cost | Total cost encompasses cost of equipment reagent/consumables, data analysis platform, and staffing. For routine use, we usually just assess the reagent cost per isolate. Staffing cost can vary considerably in different regions/countries within a given turnaround time, thus needs to be assessed separately with actual local situations. | A balance between efficiency/effectiveness and cost of a given assay is more important than pursuing low cost, because low cost may potentially lead to larger economic loss and extra investigation time caused by poor quality of typing result. | High cost per isolate for routine test is usually caused by high reagent cost and long turnaround time (leading to high staffing cost). | We recommend to use the actual reagent cost per isolate plus staffing cost estimated with given turnaround time to compare the assay being validated to the currently/previously used methods by food industry; data here are based on costs from commercial laboratories in North America and Europe: |