| Literature DB >> 24105689 |
Stefan Bletz1, Martina Bielaszewska, Shana R Leopold, Robin Köck, Anika Witten, Jörg Schuldes, Wenlan Zhang, Helge Karch, Alexander Mellmann.
Abstract
Enterohemorrhagic Escherichia coli (EHEC) O26:H11/H⁻ is the predominant non-O157 EHEC serotype among patients with diarrhea, bloody diarrhea, and hemolytic uremic syndrome (HUS) worldwide. To elucidate their phylogeny and association between their phylogenetic background and clinical outcome of the infection, we investigated 120 EHEC O26:H11/H⁻ strains isolated between 1965 and 2012 from asymptomatic carriers and patients with diarrhea or HUS. Whole-genome shotgun sequencing (WGS) was applied to ten representative EHEC O26 isolates to determine single nucleotide polymorphism (SNP) localizations within a predefined set of core genes. A multiplex SNP assay, comprising a randomly distributed subset of 48 SNPs, was established to detect SNPs in 110 additional EHEC O26 strains. Within approximately 1 Mb of core genes, WGS resulted in 476 high-quality bi-allelic SNP localizations. Forty-eight of these were subsequently investigated in 110 EHEC O26 and four different SNP clonal complexes (SNP-CC) were identified. SNP-CC2 was significantly associated with the development of HUS. Within the subsequently established evolutionary model of EHEC O26, we dated the emergence of human EHEC O26 to approximately 19,700 years ago and demonstrated a recent evolution within humans into the 4 SNP-CCs over the past 1,650 years. WGS and subsequent SNP typing enabled us to gain new insights into the evolution of EHEC O26 suggesting a common theme in this EHEC group with analogies to EHEC O157. In addition, the SNP-CC analysis may help to assess a risk in infected individuals for the progression to HUS and to implement more specific infection control measures.Entities:
Keywords: EHEC O26; SNP typing; enterohemorrhagic E. coli; evolution; whole-genome shotgun sequencing
Mesh:
Year: 2013 PMID: 24105689 PMCID: PMC3814194 DOI: 10.1093/gbe/evt136
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FDistribution of the investigated 1,130 core genome ORFs (in green), of the discovered 476 SNPs (in blue), and the 48 SNPs of the multiplex assay (in red) illustrated in a circular map of reference genome of O26:H11 strain 11368.
List of the 48 Synaptomorphic SNPs in 47 Loci in This Study Based on the Genome Sequence of O26:H11 Strain 11368 (GenBank accession number NC_013361.1)
| Locus Tag | Gene (No. of SNPs) | Absolute SNP Position | SNP | SNP Effect | SNP Allele Frequency |
|---|---|---|---|---|---|
| ECO26_0083 | 90659 | T → G | Synonymous | 78.33 | |
| ECO26_0094 | 102806 | T → G | Nonsynonymous (Tyr → Asp) | 5.83 | |
| ECO26_0341 | 363280 | T → G | Synonymous | 56.67 | |
| ECO26_0370 | 398342 | T → C | Nonsynonymous (Tyr → His) | 56.67 | |
| ECO26_0554 | 598509 | A → G | Nonsynonymous (Thr → Ala) | 5.83 | |
| ECO26_0653 | 693357 | A → C | Nonsynonymous (Lys → Thr) | 89.17 | |
| ECO26_0785 | 841192 | C → G | Nonsynonymous (Asp → Glu) | 6.67 | |
| ECO26_0787 | 844779 | T → G | Synonymous | 10.00 | |
| ECO26_0968 | 1020736 | T → A | Synonymous | 6.67 | |
| ECO26_1012 | 1066493 | G → C | Nonsynonymous (Arg → Thr) | 56.67 | |
| ECO26_1062 | 1135005 | T → G | Nonsynonymous (Ile → Ser) | 89.17 | |
| ECO26_1083 | ycbG (1) | 1157959 | A → G | Nonsynonymous (Asn → Asp) | 89.17 |
| ECO26_1434 | ptsG (1) | 1447293 | A → C | Nonsynonymous (Asp → Ala) | 100.00 |
| ECO26_1531 | 1528765 | C → T | Nonsynonymous (Ala → Val) | 89.17 | |
| ECO26_1687 | 1652262 | A → G | Synonymous | 100.00 | |
| ECO26_1741 | 1711223 | T → C | Nonsynonymous (Val → Ala) | 89.17 | |
| ECO26_1835 | 1787173 | T → C | Nonsynonymous (Met → Thr) | 100.00 | |
| ECO26_1890 | 1844594 | G → A | Nonsynonymous (Gly → Asp) | 89.17 | |
| ECO26_2286 | 2221328 | A → T | Nonsynonymous (Glu → Val) | 56.67 | |
| ECO26_2339 | 2265373 | T → C | Nonsynonymous (Leu → Pro) | 89.17 | |
| ECO26_2367 | 2297263 | C → G | Synonymous | 56.67 | |
| ECO26_2432 | 2368018 | C → A | Nonsynonymous (Leu → Ile) | 6.67 | |
| ECO26_2433 | 2368309 | C → A | Nonsynonymous (Pro → Thr) | 99.17 | |
| ECO26_2838 | 2735850 | A → G | Nonsynonymous (His → Arg) | 89.17 | |
| ECO26_3081 | 3018001 | T → C | Nonsynonymous (Val → Ala) | 89.17 | |
| ECO26_3092 | 3030971 | A → T | Nonsynonymous (Ser → Cys) | 89.17 | |
| ECO26_3306 | 3232688 | G → T | Nonsynonymous (Arg → Leu) | 56.67 | |
| ECO26_3433 | 3349414 | G → A | Synonymous | 6.67 | |
| ECO26_3489 | 3410529 | T → G | Nonsynonymous (Cys → Gly) | 6.67 | |
| ECO26_3612 | 3551044 | T → A | Synonymous | 89.17 | |
| ECO26_3961 | 3914494 | A → G | Synonymous | 88.33 | |
| ECO26_3979 | 3939467 | G → A | Nonsynonymous (Val → Ile) | 6.67 | |
| ECO26_4164 | 4129239 | A → G | Synonymous | 100.00 | |
| ECO26_4280#1 | 4253102 | T → C | Synonymous | 98.33 | |
| ECO26_4280#2 | 4254024 | T → G | Synonymous | 56.67 | |
| ECO26_4302 | 4272578 | T → C | Nonsynonymous (Val → Ala) | 56.67 | |
| ECO26_4343 | 4314205 | T → C | Nonsynonymous (Cys → Arg) | 56.67 | |
| ECO26_4351 | 4326496 | G → T | Synonymous | 89.17 | |
| ECO26_4486 | 4446501 | A → C | Synonymous | 6.67 | |
| ECO26_4837 | 4858567 | T → C | Nonsynonymous (Val → Ala) | 90.00 | |
| ECO26_4917 | 4943021 | A → C | Synonymous | 89.17 | |
| ECO26_5089 | 5143059 | C → A | Nonsynonymous (Asn → Lys) | 89.17 | |
| ECO26_5096 | 5151319 | T → C | Synonymous | 100.00 | |
| ECO26_5099 | 5157488 | C → T | Nonsynonymous (Leu → Phe) | 90.00 | |
| ECO26_5139 | 5206902 | T → C | Nonsynonymous (Val → Ala) | 89.17 | |
| ECO26_5227 | 5302486 | A → G | Nonsynonymous (Tyr → Cys) | 89.17 | |
| ECO26_5302 | 5389233 | C → G | Synonymous | 89.17 | |
| ECO26_5384 | 5467060 | T → G | Synonymous | 89.17 |
aSNP allele frequency of 120 isolates used in this study.
bcanSNP for SNP-CC4.
cSNP differentiating SNP-CC2 and SNP-CC3
dCanonical SNP (canSNP) for SNP-CC1.
FThe minimum spanning tree (MST) shows the molecular phylogeny of 120 EHEC O26 isolates. The different colors represent the symptoms of the infected patients. Each node represents a unique SNP profile. SNP clonal clusters (SNP-CCs) are numbered (SNP-CC1 to SNP-CC4). The node size reflects the number of isolates. Small numbers on connecting lines display the distance (number of differing SNPs) between two nodes.
FNeighbor-joining tree based on 1,130 concatenated ORFs of ten representative isolates (gray) and reference strains O26:H11 (11368) and O111:H− (11128) (white). The SNP clonal clusters (SNP-CCs) are marked and demonstrate the quartering of the phylogenetic tree. Phylogenetic analysis generated by MEGA5 (Tamura et al. 2011).
FEvolutionary model and calculated age distances for EHEC O26 pathogens based on the neighbor-joining tree (fig. 3) and inserted in the SNP clonal clusters (SNP-CC1 to SNP-CC4). Blue boxes are the EHEC O26 isolates with shotgun genome sequencing data. In gray, hypothetical founders of O26 isolates are shown (A01 to A10). The ancestry is calculated based on the phylogeny displayed in figure 3. White boxes show the two EHEC O26:H11 and EHEC O111:H− reference strains (strains 11368 and 11128, respectively) that are fully sequenced; EHEC O111:H− is assumed to be the closest relative of serogroup O26 (Whittam et al. 1988). Blue lines connect the isolates and the hypothetical ancestors and red numbers show the synonymous/nonsynonymous SNPs between these genotypes. The gray line connects the O111:H− reference strain and the first O26 ancestor A10 as the common O26/O111 ancestor is not known. Distances are not drawn to scale.
Distribution of Diseases over Four Different SNP-CC of 120 EHEC O26 Strains Isolated from Patients and Associations of EHEC O26 SNP-CCs with HUS
| SNP-CC (Total No. Isolates) | Disease (HUS/BD/D/A/U) | OR (95% CI) (HUS) | |
|---|---|---|---|
| SNP-CC1 (8) | 5/0/2/0/1 | 0.96 (0.19–5.41) | 0.96 |
| SNP-CC2 (60) | 47/0/11/1/1 | 3.86 (1.63–9.30) | <0.01 |
| SNP-CC3 (39) | 22/3/11/3/0 | 0.65 (0.27–1.52) | 0.28 |
| SNP-CC4 (13) | 2/5/6/0/0 | 0.08 (0.01–0.42) | <0.01 |
Note.—BD, bloody diarrhea; D, diarrhea; A, asymptomatic; U, unknown; OR, odds ratio; CI, confidence interval.