| Literature DB >> 29997582 |
Matthias Kiel1, Pierre Sagory-Zalkind2, Céline Miganeh2, Christoph Stork1, Andreas Leimbach1, Camilla Sekse3, Alexander Mellmann1, François Rechenmann2, Ulrich Dobrindt1.
Abstract
It would be desirable to have an unambiguous scheme for the typing of Shiga toxin-producing Escherichia coli (STEC) isolates to subpopulations. Such a scheme should take the high genomic plasticity of E. coli into account and utilize the stratification of STEC into subgroups, based on serotype or phylogeny. Therefore, our goal was to identify specific marker combinations for improved classification of STEC subtypes. We developed and evaluated two bioinformatic pipelines for genomic marker identification from larger sets of bacterial genome sequences. Pipeline A performed all-against-all BLASTp analyses of gene products predicted in STEC genome test sets against a set of control genomes. Pipeline B identified STEC marker genes by comparing the STEC core proteome and the "pan proteome" of a non-STEC control group. Both pipelines defined an overlapping, but not identical set of discriminative markers for different STEC subgroups. Differential marker prediction resulted from differences in genome assembly, ORF finding and inclusion cut-offs in both workflows. Based on the output of the pipelines, we defined new specific markers for STEC serogroups and phylogenetic groups frequently associated with outbreaks and cases of foodborne illnesses. These included STEC serogroups O157, O26, O45, O103, O111, O121, and O145, Shiga toxin-positive enteroaggregative E. coli O104:H4, and HUS-associated sequence type (ST)306. We evaluated these STEC marker genes for their presence in whole genome sequence data sets. Based on the identified discriminative markers, we developed a multiplex PCR (mPCR) approach for detection and typing of the targeted STEC. The specificity of the mPCR primer pairs was verified using well-defined clinical STEC isolates as well as isolates from the ECOR, DEC, and HUSEC collections. The application of the STEC mPCR for food analysis was tested with inoculated milk. In summary, we evaluated two different strategies to screen large genome sequence data sets for discriminative markers and implemented novel marker genes found in this genome-wide approach into a DNA-based typing tool for STEC that can be used for the characterization of STEC from clinical and food samples.Entities:
Keywords: O157; STEC; comparative genomics; multiplex PCR; non-O157
Year: 2018 PMID: 29997582 PMCID: PMC6028524 DOI: 10.3389/fmicb.2018.01321
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
PCR primer sequences and concentrations used in this study.
Reference strains used in this study.
| Reference strains | MLST | Serotype | ||
|---|---|---|---|---|
| HUSEC003 | ST11 (CC11) | O157:H7 | 2 | Positive |
| HUSEC007 | ST17 (CC20) | O103:H2 | 2 | Positive |
| HUSEC011 | ST16 (CC29) | O111:H8 | 1 + 2 | Positive |
| HUSEC017 | ST21 (CC29) | O26:H11 | 1 + 2 | Positive |
| HUSEC021 | ST32 (CC32) | O145:H28 | 2 | Positive |
| HUSEC031 | ST306 | OR:H- | 1 | Positive |
| HUSEC035 | ST655 | O121:H19 | 2 | Positive |
| HUSEC041 | ST678 | O104:H4 | 2 | Negative |
| LB408196i1 | ST301 (CC165) | O45:H2 | 2 | Positive |
Amount of biomarkers after checkpoints given in Figure for each genome subgroup used for bioinformatics pipeline A.
| STEC subgroup | Number of genomes | Number of biomarkers after pipeline (Checkpoint 1) | Number of biomarkers after BLASTp verification (Checkpoint 2) | Number of biomarkers used for | Number of biomarkers after |
|---|---|---|---|---|---|
| CC11 | 40 | 42 | 9 | 8 | 4 |
| CC11 (pipeline B) | 6 | 70 | 6 | 0 | 0 |
| CC20 | 11 | 5 | 0 | 0 | 0 |
| CC20 (pipeline B) | 4 | 22 | 0 | 4 | 3 |
| CC29 | 24 | 1 | 1 | 1 | 1 |
| ST32 | 7 | 155 | 5 | 4 | 2 |
| ST306 | 9 | 251 | 15 | 5 | 1 |
| ST678 | 6 | 112 | 4 | 4 | 1 |
| O26 | 10 | 2 | 2 | 2 | 2 |
| O45 | 1 | 307 | 12 | 4 | 3 |
| O103 | 9 | 2 | 2 | 2 | 2 |
| O104 | 8 | 8 | 6 | 6 | 2 |
| O111 | 13 | 8 | 6 | 6 | 3 |
| O121 | 1 | 103 | 14 | 5 | 1 |
| O145 | 11 | 8 | 3 | 3 | 1 |
| Sum | 160 | 1096 | 85 | 54 | 26 |
In silico and in vitro specificity of the predicted biomarkers.
| Primer name | ||||
|---|---|---|---|---|
| CC11 | 547/556 | 98.38% | 15/15 | 100.00% |
| O104 | 121/121 | 100.00% | 15/15 | 100.00% |
| O111 | 120/120 | 100.00% | 11/11 | 100.00% |
| O26 | 66/66 | 100.00% | 14/14 | 100.00% |
| O45 | 10/10 | 100.00% | 5/5 | 100.00% |
| O121 | 6/5 | 83.33% | 6/6 | 100.00% |
| CC20 | 43/76 | 56.58% | 11/11 | 100.00% |
| ST306 | 9/9 | 100.00% | 11/11 | 100.00% |
| O103 | 30/31 | 96.77% | 11/11 | 100.00% |
| CC29 | 140/136 | 97.14% | 27/27 | 100.00% |
| ST32 | 27/12 | 44.44% | 11/11 | 100.00% |
| ST678 | 119/112 | 94.12% | 13/13 | 100.00% |
| O145 | 26/26 | 100.00% | 18/18 | 100.00% |
| 389/395 | 98.48% | 48/48 | 100.00% | |
| 585/652 | 89.72% | 87/87 | 100.00% | |
| 898/925 | 97.08% | 97/97 | 100.00% | |
| 2898/3015 | 96.12% | 127/127 | 100.00% | |
Protein function of biomarker.
Results of milk dilution series with 28 PCR cycles.
| Marker | PCR product (bp) | 90000 CFU/PCR | 9000 CFU/PCR | 900 CFU/PCR | 90 CFU/PCR | 9 CFU/PCR |
|---|---|---|---|---|---|---|
| CC11 | 93 | xx | x | - | - | – |
| O104 | 218 | xx | xx | - | - | – |
| O111 | 300 | xx | xx | x | - | – |
| O26 | 441 | xx | xx | (x) | - | – |
| O45 | 591 | xx | xx | - | – | – |
| CC20 | 195 | xx | xx | - | – | – |
| O103 | 310 | xx | xx | - | – | – |
| CC29 | 437 | xx | xx | x | (x) | – |
| ST32 | 613 | xx | xx | (x) | – | – |
| ST678 | 138 | xx | x | - | – | – |
| O145 | 298 | xx | x | - | – | – |
| 542 | xx | xx | x | – | – | |
| 715 | xx | x | (x) | – | – | |
| 842 | xx | xx | (x) | – | – | |
| ST306 | 240 | xx | xx | x | – | – |
| O121 | 395 | xx | xx | x | – | – |
| 1075 | xx | xx | x | – | – | |