| Literature DB >> 34878971 |
Ruriko Nishida1, Keiji Nakamura1, Itsuki Taniguchi1, Kazunori Murase2, Tadasuke Ooka3, Yoshitoshi Ogura4, Yasuhiro Gotoh1, Takehiko Itoh5, Atsushi Toyoda6, Jacques Georges Mainil7, Denis Piérard8, Kazuko Seto9, Tetsuya Harada9, Junko Isobe10, Keiko Kimata10, Yoshiki Etoh11, Mitsuhiro Hamasaki11, Hiroshi Narimatsu12, Jun Yatsuyanagi13, Mitsuhiro Kameyama14, Yuko Matsumoto15, Yuhki Nagai16, Jun Kawase17, Eiji Yokoyama18, Kazuhiko Ishikawa19, Takayuki Shiomoto20, Kenichi Lee21, Dongchon Kang1, Koichi Akashi1, Makoto Ohnishi21, Sunao Iyoda21, Tetsuya Hayashi1.
Abstract
Shiga toxin (Stx)-producing Escherichia coli (STEC) are foodborne pathogens causing serious diseases, such as haemorrhagic colitis and haemolytic uraemic syndrome. Although O157:H7 STEC strains have been the most prevalent, incidences of STEC infections by several other serotypes have recently increased. O121:H19 STEC is one of these major non-O157 STECs, but systematic whole genome sequence (WGS) analyses have not yet been conducted on this STEC. Here, we performed a global WGS analysis of 638 O121:H19 strains, including 143 sequenced in this study, and a detailed comparison of 11 complete genomes, including four obtained in this study. By serotype-wide WGS analysis, we found that O121:H19 strains were divided into four lineages, including major and second major lineages (named L1 and L3, respectively), and that the locus of enterocyte effacement (LEE) encoding a type III secretion system (T3SS) was acquired by the common ancestor of O121:H19. Analyses of 11 complete genomes belonging to L1 or L3 revealed remarkable interlineage differences in the prophage pool and prophage-encoded T3SS effector repertoire, independent acquisition of virulence plasmids by the two lineages, and high conservation in the prophage repertoire, including that for Stx2a phages in lineage L1. Further sequence determination of complete Stx2a phage genomes of 49 strains confirmed that Stx2a phages in lineage L1 are highly conserved short-tailed phages, while those in lineage L3 are long-tailed lambda-like phages with notable genomic diversity, suggesting that an Stx2a phage was acquired by the common ancestor of L1 and has been stably maintained. Consistent with these genomic features of Stx2a phages, most lineage L1 strains produced much higher levels of Stx2a than lineage L3 strains. Altogether, this study provides a global phylogenetic overview of O121:H19 STEC and shows the interlineage genomic differences and the highly conserved genomic features of the major lineage within this serotype of STEC.Entities:
Keywords: Shiga toxin-producing Escherichia coli O121:H19; bacteriophage; comparative genomics; phylogenetic analysis; plasmid; population structure
Mesh:
Substances:
Year: 2021 PMID: 34878971 PMCID: PMC8767318 DOI: 10.1099/mgen.0.000716
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Phylogenetic relationships between O121:H19 strains and their close relatives. An unrooted ML tree was constructed based on the recombination-free SNPs (4056 sites) identified on the chromosomal backbone sequence (3424039 bp). Strain names are displayed on each tip with their STs or serotypes indicated in parentheses. The O121:H19 branches are shown in magenta. Information on allele IDs for MLST, the distribution of stx1, stx2, eae, and hlyA genes (major virulence genes of STEC/EPEC) in ST655, and the SLVs and DLVs of ST655 are presented in the inset. Asterisks indicate STs with no available genome sequence information. Regarding the distribution of virulence genes of each ST, when the gene was detected in at least one strain in an ST, the ST was regarded as positive. The proportions (%) of positive strains in each ST were indicated in parentheses. NI, no information. Bar, the mean number of nucleotide substitutions per site.
The O121:H19 strain set analysed in this study
|
Country |
Sources |
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
|
Human |
Animals |
Foods |
Environment |
No information |
Total |
|
|
|
Negative | |
|
Japan |
196 |
2 |
0 |
1 |
2 |
201 |
1 |
5 |
195 |
0 |
|
United States |
242 |
21 |
55 |
8 |
34 |
360 |
10 |
4 |
338 |
8 |
|
Canada |
38 |
0 |
15 |
1 |
3 |
57 |
0 |
0 |
57 |
0 |
|
European countries* |
8 |
0 |
1 |
0 |
3 |
12 |
0 |
0 |
12 |
0 |
|
Other countries† |
1 |
1 |
0 |
0 |
1 |
3 |
0 |
0 |
0 |
3 |
|
No information |
0 |
0 |
0 |
0 |
5 |
5 |
0 |
0 |
4 |
1 |
|
Total |
485 |
24 |
71 |
10 |
48 |
638 |
11 |
9 |
606 |
12 |
*Belgium (n=5), Finland (n=3), Denmark (n=2), Italy (n=1), France (n=1)
†Korea (n=1), New Zealand (n=1), South Africa (n=1)
Fig. 2.Phylogenetic relationship of the 442 O121:H19 strains. ML trees were reconstructed based on the recombination-free SNPs (7371 sites) identified on the chromosomal backbone sequence (3312074 bp) with an O64:H19 strain (NCTC9064) belonging to ST2952 (DLV of ST655) as an outgroup. In panel (a), four lineages identified by hierBAPS (L1-L4) are indicated with the names of the main ST and the number of strains in parentheses. L1 includes strains belonging to four STs (ST5536, ST6000, ST6689, and ST7245). L3 includes ST5772 and ST8892 (one strain each), which are indicated by dots. In panel (b), strain information was mapped on an ML tree. Branches for non-L1 lineages are indicated in orange with a genetic distance scale different from that for L1. Completely sequenced strains (n=11) are indicated (1A, 51104; 1B, 2014C-3599; 1C, RM8352; 1D, 16–9255; 1E, 2015C-3107; 1F, FWSEC0006; 1G, 2014C-3655; 1H, 2014C-4423; 1I, E15042; 3A, SE14002; and 3B, CEC14159). The presence or absence of the stx1, stx2, eae, and hlyA genes is indicated by a filled or open box, respectively. JPN: Japan, USA: United States of America, CAN: Canada, Eur: European countries, NI: No information. Bar, the mean number of nucleotide substitutions per site.
General genomic features of the eleven completely sequenced O121:H19 STEC strains
|
Sequence Type (ST) |
655 |
655 |
655 |
655 |
655 |
655 |
655 |
655 |
655 |
655 |
655 |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
Lineage* |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
3 |
3 |
|
Strain |
51104 |
2014C- 3599† |
RM8352‡ |
16–9255‡ |
2015C- 3107† |
FWSEC0006‡ |
2014C- 3655† |
2014C- 4423† |
E15042 |
SE14002 |
CEC14159 |
|
Strain ID in this paper |
1A |
1B |
1C |
1D |
1E |
1F |
1G |
1H |
1I |
3A |
3B |
|
Accession No. |
AP024471- 2 |
CP027435- 6 |
CP028110- 1 |
CP022407- 8 |
CP027317- 8 |
CP031910-1 |
CP027350- 1 |
CP027454- 6 |
AP024478- 9 |
AP024473- 4 |
AP024475- 7 |
|
Reference |
This study |
Patel PN, |
Parker CT, |
Robertson J, |
Patel PN, |
Tyson S, |
Patel PN, |
Patel PN, |
This study |
This study |
This study |
|
Chromosome (kb) |
5391 |
5400 |
5391 |
5398 |
5388 |
5399 |
5443 |
5339 |
5365 |
5236 |
5221 |
|
CDSs |
5205 |
5228 |
5249 |
5249 |
5229 |
5226 |
5291 |
5199 |
5194 |
5000 |
4993 |
|
rRNA operons |
7 |
7 |
7 |
7 |
7 |
7 |
7 |
7 |
7 |
7 |
7 |
|
tRNAs |
99 |
102 |
102 |
102 |
102 |
102 |
102 |
101 |
101 |
102 |
105 |
|
Prophages |
15 |
15 |
15 |
15 |
15 |
15 |
16 |
15 |
14 |
13 |
14 |
|
Integrative elements |
5 |
5 |
5 |
5 |
5 |
5 |
5 |
5 |
5 |
6 |
5 |
|
Plasmid (kb) |
82 |
84 |
83 |
82 |
82 |
81 |
97§ |
80/73 |
81 |
88 |
89/88 |
|
CDSs (plasmid total) |
84 |
88 |
89 |
80 |
83 |
78 |
125 |
175 |
82 |
89 |
181 |
|
Total genome size (kb) |
5473 |
5484 |
5474 |
5480 |
5470 |
5480 |
5540 |
5492 |
5446 |
5324 |
5398 |
*Determined by hierBAPS.
†Annotated using DFAST in this study.
‡Re-annotated using DFAST in this study.
§Plasmid is not closed.
Fig. 3.Conservation and variation of the prophages and integrative elements in the 11 complete O121:H19 genomes. The chromosomal integration sites of prophages and integrative elements (IEs) identified in the 11 complete genomes are shown on the prophage-removed chromosome backbone (K-12 ∆PP) of K-12 MG1655. Insertion sites are indicated by gene names or intergenic regions with tRNA and tmRNA genes marked by asterisks. ‘att in PP’ indicated by a dagger (†) indicates an attB site within the prophage genome integrated into the serU gene (see Fig. S3 for more details). Prophages encoding stx2 and stx1 are indicated by red and green, respectively. The IEs corresponding to the LEE and SpLE1 are indicated by blue and magenta, respectively.
Fig. 4.Variation in the Stx2 production level and Stx2a phage genome sequence between O121:H19 strains. The left panel shows an ML tree of 52 strains used in this analysis and seven completely sequenced lineage L1 strains. The tree was constructed based on the recombination-free SNPs (3439 sites) identified on the chromosome backbone (4010134 bp in total), and the geographic information and strain IDs of each strain (see Table S3 for details) are also shown. The seven complete genomes downloaded from the public database are indicated by an asterisk. In the centre panel, the MMC-induced Stx2 production levels are shown by the mean values with standard deviations of biological triplicates. The experimental conditions for Stx2 production measurement were optimized using four strains (see Fig. S1 for details). The right panel schematically presents the alignment of genome sequences of Stx2a phages of the strains belonging to L1. The sequences were aligned with the IS-free genome sequence of the Stx2a phage of strain 4151 (ID07) shown at the top. All these phages are integrated into the argW gene. SNPs, indels, and IS insertions detected at the same site are indicated by broken lines. The sequences of the two regions depicted by open rectangles were highly divergent and unable to be aligned with the reference sequence, and their genetic structures are shown in Fig. S8. The Stx2a phages of two lineage L3 strains were integrated into yecE, and their genome sequences were different from those of the Stx2a phages of L1 strains. See Fig. S8 for their genome sequences and comparison with the Stx2a phages in lineage L1. JP: Japan, US: United States of America, CA: Canada, BE: Belgium. Bar, the mean number of nucleotide substitutions per site.