| Literature DB >> 28932209 |
Sabine Delannoy1, Patricia Mariani-Kurkdjian2,3,4, Hattie E Webb5, Stephane Bonacorsi2,3,4, Patrick Fach1.
Abstract
Shiga toxin-producing Escherichia coli of serotype O26:H11/H- constitute a diverse group of strains and several clones with distinct genetic characteristics have been identified and characterized. Whole genome sequencing was performed using Illumina and PacBio technologies on eight stx2-positive O26:H11 strains circulating in France. Comparative analyses of the whole genome of the stx2-positive O26:H11 strains indicate that several clones of EHEC O26:H11 are co-circulating in France. Phylogenetic analysis of the French strains together with stx2-positive and stx-negative E. coli O26:H11 genomes obtained from Genbank indicates the existence of four clonal complexes (SNP-CCs) separated in two distinct lineages, one of which comprises the "new French clone" (SNP-CC1) that appears genetically closely related to stx-negative attaching and effacing E. coli (AEEC) strains. Interestingly, the whole genome SNP (wgSNP) phylogeny is summarized in the cas gene phylogeny, and a simple qPCR assay targeting the CRISPR array specific to SNP-CC1 (SP_O26-E) can distinguish between the two main lineages. The PacBio sequencing allowed a detailed analysis of the mobile genetic elements (MGEs) of the strains. Numerous MGEs were identified in each strain, including a large number of prophages and up to four large plasmids, representing overall 8.7-19.8% of the total genome size. Analysis of the prophage pool of the strains shows a considerable diversity with a complex history of recombination. Each clonal complex (SNP-CC) is characterized by a unique set of plasmids and phages, including stx-prophages, suggesting evolution through separate acquisition events. Overall, the MGEs appear to play a major role in O26:H11 intra-serotype clonal diversification.Entities:
Keywords: E. coli; Escherichia coli; STEC; Shiga toxin-producing E. coli; comparative genomics; mobile genetic elements; phylogenetic relationship; stx2
Year: 2017 PMID: 28932209 PMCID: PMC5592225 DOI: 10.3389/fmicb.2017.01625
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Genetic characteristics of the E. coli O26:H11 strains included in this study.
| 36084 | ST21 | CC3 | − | France (2013) | HUS | Delannoy et al., | ||
| 36079 | ST21 | CC3 | − | France (2013) | HUS | This study | ||
| 36708 | ST29 | CC2 | − | France (2013) | HUS | Delannoy et al., | ||
| 34827 | ST29 | CC1 | + | France (2012) | HUS | Delannoy et al., | ||
| 34870 | ST29 | CC1 | + | France (2012) | HUS | Delannoy et al., | ||
| 36348 | ST29 | CC1 | + | France (2013) | HUS | Delannoy et al., | ||
| 36293 | ST29 | CC1 | + | France (2013) | HUS | Delannoy et al., | ||
| 36493 | ST29 | CC1 | + | France (2013) | HUS | Delannoy et al., | ||
| DEC10D | − | ST21 | CC3 | −# | France (1952) | Diarrhea | Hazen et al., | |
| DEC9A | − | ST29 | CC1 | +# | USA (1961) | Diarrhea | Hazen et al., | |
| DEC9B | − | ST29 | CC1 | +# | USA (1979) | Diarrhea | Hazen et al., | |
| DEC9C | − | ST29 | CC1 | +# | Switzerland (1952) | Diarrhea | Hazen et al., | |
| DEC9D | − | ST29 | CC1 | +# | Denmark (1967) | Diarrhea | Hazen et al., | |
| DEC9E | − | ST29 | CC1 | +# | Mexico (1986) | Diarrhea | Hazen et al., | |
| FHI4 | ST21 | CC3 | −# | Norway (2002) | HUS | Haugum et al., | ||
| FHI24 | ST29 | CC2 | −# | Norway (2007) | HUS | Haugum et al., | ||
| FHI27 | ST29 | CC2 | −# | Norway (2008) | HUS | Haugum et al., | ||
| 11368 | ST21 | CC4 | −# | Japan (2001) | Diarrhea | Ogura et al., |
The genome sequenced extracted from Genbank were tested in silico for the SP_O26_E marker (indicated by .
Figure 1Phylogenetic overview of the O26:H11 isolates. The whole genomes of the 18 isolates were compared using Gegenees 2.1 and Mega 6.06. The heat-plot comparison matrix is based on a fragmented alignment using BLASTN (200/100) with a threshold set at 30%. The percentage similarities between the conserved regions of the genomes are indicated in the comparison matrix, where the colors vary from red (low similarity) to green (high similarity). The distance matrix was used to produce a dendrogram in Mega using the neighbor-joining method. The optimal tree with the sum of branch length = 0.05552887 is displayed (branch lengths are shown above the branches).
Figure 2Minimum spanning tree based on 48 SNPs (Bletz et al., 2013). Each node represents a unique SNP profile. The node size is proportional to the number of isolates. The four clonal complexes (SNP-CCs) are represented by different colors: SNP-CC1 is shown in green, SNP-CC2 in yellow, SNP-CC3 in red and SNP-CC4 in blue. This figure was created with Splitstree (version 4.12.6). Representative strains for each SNP-CC and sequenced by Bletz et al. (2013) are indicated in black, French strains sequenced in this study are indicated in light blue and strains extracted from published WGS projects are indicated in purple.
Figure 3Phylogenetic relationships of O26:H11 strains. The phylogenetic relationships of the 18 strains were assessed by whole-genome SNPs (wgSNP) analysis using CSI Phylogeny 1.2 on the CGE server. The SNPs alignment generated was imported and analyzed in Mega6. The evolutionary history was inferred using the Maximum Likelihood method based on the Jukes-Cantor model. The bootstrap consensus tree inferred from 100 replicates is displayed. Branches corresponding to partitions reproduced in <50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (100 replicates) is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. All positions containing gaps and missing data were eliminated. There were a total of 4,419 positions in the final dataset. The stx subtype and insertion site(s) of the stx-phage are indicated next to the isolate name. *Indicates that the insertion site was inferred.
Figure 4Evolutionary relationships of the O26:H11 strains based on the concatenated ORFs of the eight cas genes. (A) Schematic representation of the type I-E CRISPR-Cas system. The cas genes are drawn to scale. (B) Maximum Likelihood tree based on the concatenated ORFs of the eight cas genes. The evolutionary history was inferred by using the Maximum Likelihood method based on the Jukes-Cantor model. The bootstrap consensus tree inferred from 100 replicates is displayed. Branches corresponding to partitions reproduced in <50% bootstrap replicates are collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (100 replicates) is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach, and then selecting the topology with superior log likelihood value. All positions containing gaps and missing data were eliminated. There were a total of 8421 positions in the final dataset. Evolutionary analyses were conducted in MEGA6. Isolates DEC9A, DEC9E, and DEC10D were not included in the analysis due to the presence of frameshifts.
Insertion sites and selected characteristics of the stx2 prophages.
| 34827 | Contig 2 | Incomplete ( | 25 | ||
| Contig 17 | Incomplete ( | 34 | |||
| 34870 | Contig 5 | Complete ( | 52 | ||
| 36079 | Contig 1 | Incomplete ( | 25 | ||
| Contig 4 | Incomplete ( | 20 | |||
| 36084 | Contig 14 | Incomplete ( | 20 | ||
| 36293 | Contig 10 | Incomplete ( | 22 | ||
| Contig 13 | Incomplete ( | 22 | |||
| 36348 | Contig 21 | Incomplete | 43 | ||
| 36493 | Contig 1 | Complete ( | 46 | ||
| 36708 | Contig 6 | Complete ( | 60 | ||
| FHI24 | Contig 37 | Complete ( | 60 | ||
| FHI27 | Contig 8 | Incomplete | 25 | ||
| FHI4 | Contig 38 | Incomplete ( | 37 |
Indicates that the insertion site was inferred after scanning the known insertions sites yecE, wrbA, yehV, sbcB, Z2577, argW, prfC, and torST by in silico PCR.
When identified, the attL (integrase side) and attR (tail proteins side) sites are indicated between brackets. The sequence of the att sites at the yecE and yciD insertion sites was 5′-CAYGCAGTTAA-3′ and 5′-TTGAAACSAT-3′ at the wrbA insertion sites.
Indicates that the prophage is presumed incomplete.
Figure 5Comparative analysis of the various stx-prophages in the O26:H11/H- strains. (A) Comparison of SNP-CC1 Stx2a. (B) Comparison of SNP-CC1 Stx2d prophages. (C) Comparison of SNP-CC2 Stx2a prophages. (D) Comparison of SNP-CC3 Stx2a prophages. The sequences of the stx-prophages were compared with EasyFig. The homologous regions are connected. The color of the zone connecting the strains is related to the direct or reverse homology between the strains according to the scale present at the left of each comparison. The arrows depict ORFs identified with PROKKA. The direction of the arrow represents the transcription orientation. The ORFs are color-coded according to their predicted function. Black arrows indicate ORFs outside of the predicted prophages.
Size of the mobilome (plasmids and phages) of the O26:H11/H- isolates.
| 36,084 | 5,849,490 | 5,757,624 | 1 | 91,866 | 22 | 904,600 |
| 36,079 | 5,735,083 | 5,658,663 | 1 | 76,420 | 21 | 859,100 |
| 36,708 | 5,894,762 | 5,770,096 | 1 | 124,666 | 24 | 987,000 |
| 34,870 | 5,798,318 | 5,598,857 | 2 | 199,461 | 23 | 897,200 |
| 34,827 | 5,933,672 | 5,639,586 | 4 | 294,086 | 18 | 840,500 |
| 36,293 | 5,790,328 | 5,544,723 | 2 | 245,605 | 20 | 757,200 |
| 36,493 | 5,758,521 | 5,537,274 | 2 | 221,247 | 20 | 720,700 |
| 36,348 | 5,978,287 | 5,732,373 | 2 | 245,914 | 22 | 939,100 |
| FHI4 | 5,486,605 | 5,293,109 | 2 | 193,496 | 18 | 638,800 |
| FHI24 | 5,612,104 | 5,504,369 | 1 | 107,735 | 23 | 831,300 |
| FHI27 | 5,588,374 | 5,460,140 | 1 | 128,234 | 23 | 802,700 |
| DEC9A | 5,408,446 | 5,408,446 | – | – | 17 | 564,900 |
| DEC9B | 5,361,604 | 5,361,604 | – | – | 16 | 630,100 |
| DEC9C | 5,194,722 | 5,194,722 | – | – | 13 | 450,700 |
| DEC9D | 5,485,621 | 5,422,169 | 1 | 63,452 | 19 | 648,400 |
| DEC9E | 5,430,771 | 5,430,771 | – | – | 18 | 647,700 |
| DEC10D | 5,404,073 | 5,295,503 | 2 | 108,570 | 17 | 660,200 |
Figure 6Phylogenetic relationships of the phageome of O26:H11/H- strains. The phage regions of the isolates as determined with PHASTER were compared using Gegenees 2.1 and Mega 6.06. The heat-plot comparison matrix is based on a fragmented alignment using BLASTN (200/100). The average scores of all fragment comparisons are indicated in the comparison matrix. The distance matrix was exported and used to produce a dendrogram in Mega using the Neighbor-joining method. The optimal tree with the sum of branch length = 1.78224617 is shown. The topology only of the tree is displayed with the branch length indicated next to the branches.
Distribution of large plasmids in the O26:H11/H- isolates.
| SNP-CC3 | 36084 | 91,866 | pO26-1 ( | ||
| 36079 | 76,420 | pO26-1 ( | |||
| FHI4 | 80,100 | pO26-1 ( | |||
| 113,396 | Large complex resistance locus ( | pE17-16 ( | |||
| DEC10D | 85,715 | pO26-1 ( | |||
| 22,855 | Type IV conjugative transfer system (incomplete) | pO26-2 ( | |||
| SNP-CC2 | 36708 | 124,666 | pO145-12761 ( | ||
| FHI24 | 107,735 | pO145-12761 ( | |||
| FHI27 | 128,234 | pO145-12761 ( | |||
| 34870 | 161,837 | pHUSEC2011-1 ( | |||
| SNP-CC1 | 34827 | 37,624 | Type IV secretion system conjugation apparatus | pOLA52 ( | |
| 161,706 | pHUSEC2011-1 ( | ||||
| 35,343 | Type IV secretion system conjugation apparatus | pOLA52 ( | |||
| 31,084 | pCROD2 ( | ||||
| 65,953 | pRM12761 ( | ||||
| 36293 | 162,963 | pHUSEC2011-1 ( | |||
| 82,642 | Type IV transfer system | pO113 ( | |||
| 36348 | 153,502 | pHUSEC2011-1 ( | |||
| 92,412 | pO113 ( | ||||
| 36493 | 132,780 | pHUSEC2011-1 ( | |||
| 88,467 | pEC-B24 ( | ||||
| DEC9D | 63,452 | Type IV conjugative transfer system | pEC-B24 ( |
Indicates plasmids present on a single contig.
Indicates plasmids where putative phage-related sequences were identified by the PHASTER server.