| Literature DB >> 19575794 |
Galeb S Abu-Ali1, David W Lacher, Lukas M Wick, Weihong Qi, Thomas S Whittam.
Abstract
BACKGROUND: Evolutionary analyses of enterohemorrhagic Escherichia coli (EHEC) have identified two distantly related clonal groups: EHEC 1, including serotype O157:H7 and its inferred ancestor O55:H7; and EHEC 2, comprised of several serogroups (O26, O111, O118, etc.). These two clonal groups differ in their virulence and global distribution. Although several fully annotated genomic sequences exist for strains of serotype O157:H7, much less is known about the genomic composition of EHEC 2. In this study, we analyzed a set of 24 clinical EHEC 2 strains representing serotypes O26:H11, O111:H8/H11, O118:H16, O153:H11 and O15:H11 from humans and animals by comparative genomic hybridization (CGH) on an oligoarray based on the O157:H7 Sakai genome.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19575794 PMCID: PMC2713265 DOI: 10.1186/1471-2164-10-296
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Properties of strains used in this study sorted by serotype.
| Straina | Serotypeb | Host | Clinicalc | Location | Dated | STe | Sourcef, Reference | |
| DEC 9f | O26: [h11] | Human | diarrhea | USA, S. D. | 1974 | - | 106 | CDC, [ |
| DEC 10e | O26:H11 | Calf | scours | USA, S. D. | 1989 | 1 | 106 | Francis, D., [ |
| F5863 | O26:H11 | Human | diarrhea | USA, Nebr. | 1998 | 1 | 106 | Fey, P., [ |
| 97–3250 | O26:H11 | Human | HUS | USA, Idaho | 1997 | 1,2 | 104 | O'Brien, [ |
| 413/89-1 | O26: [h11] | Calf | diarrhea | Germany | 1998 | 1 | 106 | Wieler, L., [ |
| DA-22 | O26: [h11] | Human | diarrhea | USA, D.C. | 1999 | 1 | 106 | Acheson, D. W. |
| 03-ST-296 | O26:H11 | Human | b. d. | USA, Mich. | 2003 | 1 | 106 | MDCH, [ |
| CB 7505 | O26:H11 | Calf | no data | Germany | 1998 | 1 | 106 | Beutin, L., [ |
| DEC 8c | O111: [h11] | Calf | scours | USA, S. D. | 1986 | 1 | 107 | Francis, D., [ |
| DEC 8d | O111:H11 | Human | diarrhea | Cuba | 1953 | - | 106 | Orskov, F., [ |
| C408 | O111: [h8] | Calf | diarrhea | Scotland | 1993 | 1 | 106 | Hart, C. A., [ |
| BCL71 | O111: [h8] | Calf | diarrhea | USA, Calif. | 1993 | 1,2 | 106 | Love, B.C. |
| ML178190 | O111: [h8] | Human | diarrhea | USA, Nebr. | 1998 | 1,2 | 106 | Fey, P., [ |
| W29104 | O111:H8 | Human | diarrhea | USA, Nebr. | 1998 | 1,2 | 106 | Fey, P., [ |
| EK34 | O111: [h8] | Human | diarrhea | USA, Wash. | 1999 | 1 | 106 | Tarr, P., [ |
| EK35 | O111:H8 | Human | diarrhea | USA, Wash. | 2001 | 1 | 106 | Tarr, P., [ |
| RW2030 | O118: [h16] | Calf | diarrhea | Germany | 1994 | 1 | 106 | Wieler, L., [ |
| RW1302 | O118: [h16] | Calf | diarrhea | Germany | 1994 | 1 | 106 | Wieler, L., [ |
| 666/89 | O118:H16 | Calf | diarrhea | Germany | 1989 | 1 | 106 | Wieler, L., [ |
| 05482 | O118:H16 | Human | HUS | Germany | 1996 | 1 | 106 | Beutin, L., [ |
| EK36 | O118:H16 | Human | diarrhea | USA, Wash. | 2001 | 1 | 106 | Tarr, P., [ |
| EK37 | O118:H16 | Human | diarrhea | USA, Wash. | 2000 | 1 | 106 | Tarr, P., [ |
| RDEC-1 | O15: [h11] | Rabbit | diarrhea | USA, S.C. | 1970s | - | 681 | ECRC, [ |
| 02–3751 | O153: [h11] | Rabbit | HUS | USA, Mass. | 2002 | 1 | 104 | Fox, J., [ |
| 97–3256 | O55:H7 | Human | diarrhea | USA, Mich. | 1997 | 2 | 73 | O'Brien, [ |
a. designations assigned to strains deposited in the STEC Reference Center
b. [h] – flagellar allele determined by fliC gene sequencing; H – expression of flagellar type confirmed by reaction to antisera. To avoid confusion in text, flagellar type will be denoted as H, regardless whether it was determined by sequencing or serologic typing.
c. b. d. – bloody diarrhea; HUS – hemolytic uremic syndrome; scours – neonatal calf diarrhea.
d. Year of isolation.
e. ST – sequence type based on MLST of 7 housekeeping genes (aspC, clpX, fadD, icdA, lysP, mdh, and uidA).
f. MDCH – Michigan Dept. of Community Health.
Figure 1Phylogenetic relationships of EHEC and EPEC sequence types. The sequence types (STs) of EHEC 2 belong to a clonal group (CG 14), which is more closely related to EPEC 2 (CG 17), than EHEC 1 STs (CG 11). The phylogenetic tree was constructed using the Neighbor-joining algorithm based on the Kimura 2-parameter distance matrix of nucleotide substitution. Bootstrap confidence values were based on 1000 replicates. Only those higher than 70% are shown.
Percentage of Sakai genes that are present, divergent/absent or variably absent or present (VAP) in all 24 EHEC 2 strains.
| Backbone (shared with K-12) | Sakai-specific | ||
| phage-related | Bacterial | ||
| Present | 80.9% | 5.8% 6.5% | |
| Divergent/absent | 1.1% | 9.5% 53.0% | |
| VAPa | 18.0% | 84.7% 40.5% | |
a – genes that were detected in at least one of the 24 EHEC 2 strains, but not in all EHEC 2 strains.
Percentages of Sakai genes found in tested strains sorted by serotype.
| Serotype | Strain | Sakai genes on array | Backbone (shared with K-12) | Sakai-specific | |
| phage-related | Bacterial | ||||
| O26: [h11] | DEC 9f | 78% | 94% | 30% | 20% |
| O26:H11 | DEC 10e | 82% | 96% | 51% | 22% |
| O26:H11 | F5863 | 84% | 97% | 56% | 25% |
| O26:H11 | 97–3250 | 85% | 97% | 65% | 23% |
| O26: [h11] | 413/89-1 | 83% | 96% | 56% | 20% |
| O26: [h11] | DA-22 | 84% | 97% | 57% | 24% |
| O26:H11 | 03-ST-296 | 84% | 97% | 54% | 22% |
| O26:H11 | CB 7505 | 84% | 96% | 59% | 30% |
| O26:H11 | 83% | 96% | 54% | 23% | |
| 2.2% | 1% | 10.3% | 3.2% | ||
| O111: [h11] | DEC 8c | 82% | 94% | 55% | 19% |
| O111:H11 | DEC 8d | 77% | 93% | 31% | 21% |
| O111: [h8] | C408 | 82% | 95% | 49% | 24% |
| O111: [h8] | BCL71 | 83% | 95% | 58% | 24% |
| O111: [h8] | ML178190 | 82% | 95% | 52% | 23% |
| O111:H8 | W29104 | 81% | 95% | 48% | 23% |
| O111: [h8] | EK34 | 81% | 95% | 47% | 24% |
| O111:H8 | EK35 | 80% | 94% | 49% | 23% |
| O111:H8 | 82% | 95% | 51% | 24% | |
| 1% | 0.4 | 4% | 0.5% | ||
| O118: [h16] | RW2030 | 84% | 96% | 58% | 23% |
| O118: [h16] | RW1302 | 82% | 95% | 56% | 19% |
| O118:H16 | 666/89 | 83% | 95% | 57% | 21% |
| O118:H16 | 05482 | 82% | 96% | 53% | 22% |
| O118:H16 | EK36 | 83% | 96% | 53% | 21% |
| O118:H16 | EK37 | 84% | 97% | 55% | 25% |
| O118:H16 | 83% | 96% | 55% | 22% | |
| 0.9% | 0.8% | 2.1% | 2% | ||
| O153: [h11] | 02–3751 | 84% | 97% | 60% | 24% |
| O15: [h11] | RDEC-1 | 80% | 94% | 42% | 22% |
| O55:H7 | 97–3256 | 84% | 97% | 33% | 70% |
a – Stan. Dev., standard deviation.
Figure 2Distribution of Sakai genes among EHEC 2 clinical strains. The three histograms represent distribution trends of three Sakai gene groups in EHEC 2 strains: Sakai bacterial genes (left histogram – hatched bars), Sakai phage genes (middle histogram, open bars), and backbone genes (right histogram – hatched bars). The levels of Sakai gene content conservation were calculated for each EHEC 2 strain by dividing the number of Sakai genes, from a particular gene group, found in a strain by the total number of Sakai genes from the respective gene group, represented on the oligoarray; these values were expressed as percentages. Each bar represents the number of EHEC 2 strains that were found to have the same percentage of Sakai gene content conservation. Each strain is represented on each histogram and the bars in each histogram add up to 24, the total number of strains investigated. One exception is the bar representing Sakai phage gene content conservation in strain DEC9f, which is hidden by the hatched bar representing the Sakai bacterial gene content conservation in strain CB7505. As can be seen in Table 3, strain DEC9f has 30% of Sakai phage genes and strain CB7505 has 30% of Sakai bacterial genes, causing the bars to overlap. Numbers above each plot represent the average for each group of genes and the range of the distribution is given in parentheses.
Figure 3Split decomposition analysis of Sakai genes in 24 EHEC 2 strains. The network was generated based on the presence/absence of 4800 Sakai genes among 24 EHEC 2 strains. 144 genes were excluded because their probe intensities were below those of randomized negative controls in the various Sakai/EHEC 2 hybridizations. Node labels refer to strain names (listed in Table 1). Parallel edges represent phylogenetic incompatibilities in the data set, which are indicative of parallel gene gain/loss by multiple transduction events. The network was generated in Splitstree 4.3, using neighbor net with the uncorrected p distance. Scale bar represents number of gene differences (present or divergent/absent) per gene site.
Figure 4Split decomposition analysis of compatible parsimony informative genes and singleton genes in 24 EHEC 2 strains. Gray ovals encompass serotype-specific clusters of O118:H16 and O111:H8 strains. Node labels refer to strain names (listed in Table 1). The network was generated in Splitstree 4.3, using neighbor net with the uncorrected p distance. Scale bar represents number of gene differences (present or divergent/absent) per gene site. Percent bootstrap confidence values based on 1000 replicates are shown for selected edges.
Figure 5Gene content of Sakai phage genes and the LEE island in EHEC 2 strains. Sakai phage genes inferred as present or divergent/absent were grouped and sorted according to the Sakai annotation. Colormaps, with dendrograms, of individual phages were generated in R software (v 2.4.0.), using the 'gplots' package (v 2.3.2). Present genes are depicted as black, absent/divergent as white. Gray squares symbolize genes that have been classified as present after the cutoff was relaxed for 20%, representing a 'low' level of gene divergence. Dendrogram labels refer to strain names (Table 1). Labels with asterisks in the Sp15 and Sp5 colormaps refer to strains that were positive for stx1 and stx2 genes, respectively. Labels with open boxes in the LEE colormap represent animal strains. Arrows and numerals atop the LEE colormap represent operons and the direction of their transcription. The ECs numbers for the phage genes depicted, and the distribution of these genes, are provided in Additional file 2. Sp – Sakai prophage, SpLE – Sakai prophage-like element, TAI – tellurite resistance and adherence island.
Conservation of O157 LEE operons in a set of 24 EHEC 2 strains.
| LEE1 (9)a | LEE2 (6) | LEE3 (7) | LEE5 (3) | LEE4 (8) | |
| Humanb | 8.8 ± 0.6 | 3.1 ± 1.7 | 4.8 ± 1.1 | 1.0 ± 0.0 | 4.4 ± 0.5 |
| Animalc | 3.2 ± 1.2 | 0.6 ± 0.9 | 2.7 ± 0.9 | 0.1 ± 0.2 | 3.8 ± 0.9 |
a. The number of genes in each operon is given in parentheses.
b. Refers to the 10 human isolates in the top cluster of the dendrogram in the LEE image in Figure 5, not including O55:H7. Values represent average number of genes in an operon, with standard deviation.
c. Refers to the 11 animal and 3 human isolates in the bottom cluster of the dendrogram in the LEE image in Figure 5. Values represent average number of genes in an operon, with standard deviation.