| Literature DB >> 30849935 |
Vijay K Sharma1, Suryatej Akavaram2, Robert G Schaut2,3, Darrell O Bayles4.
Abstract
BACKGROUND: Escherichia coli O157:H7 (O157) has been linked to numerous foodborne disease outbreaks. The ability to rapidly sequence and analyze genomes is important for understanding epidemiology, virulence, survival, and evolution of outbreak strains. In the current study, we performed comparative genomics to determine structural and functional features of the genome of a foodborne O157 isolate NADC 6564 and infer its evolutionary relationship to other O157 strains.Entities:
Keywords: Bacteriophages; Genomic islands; Genomics; Mobile elements; O157; Stress response
Mesh:
Substances:
Year: 2019 PMID: 30849935 PMCID: PMC6408774 DOI: 10.1186/s12864-019-5568-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1a Circular map of the chromosome of E. coli O157:H7 strain NADC 6564 generated from its annotated sequence. The map was constructed by downloading the chromosomal sequence into the Blast Ring Image Generator. The legend on the top right corner shows rings representing GC content, GC skew, inserted phage or phage-like elements (red), and ORFs on – (orange) and + (blue) DNA strands. b Comparing chromosomal organization of E. coli O157:H7 strain NADC 6564 to published chromosomal sequences of other E. coli O157:H7 strains (NADC 6565, EDL933, Sakai, FRIK2533, TW14359), non-O157 strains (O111 and O26) and a non-pathogenic E. coli K12 strain MG1655. Linear Conservative Blocks or LCB (5–18 when counting from the left) that are underlined with a long black arrow were inverted in their arrangement in strain NADC 6564 compared to the other strains included in this comparison. The LCB 18 and 19 that are underlined with a shorter black arrow had inverse orientation in Sakai compared to the orientation of these two LCB in other strains. Chromosomal sequences were aligned using Mauve
Comparison of basic features of the strain NADC 6564 chromosome and reference O157:H7 strains
| Strainsa | Size (Mb) | CDSb | tRNA genes | rRNA genes | Genomic Islands (GI) | Size range GI (kb) | % GC | Reference | Source |
|---|---|---|---|---|---|---|---|---|---|
| NADC 6564 | 5.46 | 5542 | 103 | 22 | 53 | 4–85 | 50.0 | [ | Ground Beef |
| EDL933 | 5.53 | 5587 | 100 | 22 | 63 | 3–88 | 50.4 | [ | Ground Beef |
| Sakai | 5.49 | 5504 | 103 | 22 | 71 | 4–59 | 50.5 | [ | Sprouts |
| TW14359 | 5.52 | 5555 | 108 | 22 | 44 | ND | 50.5 | [ | Spinach |
| EC4115 | 5.57 | 5608 | 109 | 22 | NDc | ND | 50.5 | [ | Spinach |
aAccession numbers: CP017251.1 (NADC 6564), CP008957.1 (EDL933), BA000007.2 (Sakai), CP001368.1 (TW14359), and NC_011353.1 (EC4115); bCDS Protein-coding sequence, cND Not determined
Bacteriophage profile of strain NADC 6564
| Bacteriophage | Sequence | Number of Phage Proteins | Chromosomal | Genes Flankinga | GC% | Accession |
|---|---|---|---|---|---|---|
| 1/Stx2-converting phage 1717 | 21 | 7 | 1,172,843–1,193,884 | BHW77_05970 and BHW77_06120 | 41.96 | NC_011357 |
| 2/ | 6.4 | 4 | 1,473,190–1,479,623 | BHW77_07465 and tRNA-Arg | 46.18 | NC_005344 |
| 3/Enterobacteria phage YYZ-2008 | 32.1 | 18 | 1,723,923–1,756,041 | BHW77_08615 and BHW77_08855 | 52.10 | NC_011356 |
| 4/Enterobacteria phage BP-4795 | 6.8 | 3 | 1,899,932–1,906,773 | BHW77_09510 and BHW77_09560 | 51.91 | NC_004813 |
| 5/Enterobacteria phage BP-4795 | 56.5 | 14 | 1,922,013–1,978,603 | BHW77_09625 and BHW77_09965 | 51.37 | NC_004813 |
| 6/Enterobacteria phage 933 W | 70.2 | 65 | 2,015,661–2,085,889 | BHW77_10140 and BHW77_10630 | 50.16 | NC_000924 |
| 7/Enterobacteria phage BP-4795 | 53.1 | 24 | 2,308,130–2,361,308 | BHW77_11820 and BHW77_12200 | 49.29 | NC_004813 |
| 8/ | 15.2 | 4 | 2,364,182–2,379,410 | BHW77_12210 and BHW77_12335 | 51.17 | NC_031264 |
| 9/ | 53.3 | 29 | 2,385,028–2,438,396 | BHW77_12355 and BHW77_12685 | 50.26 | NC_001416 |
| 10/Stx2-converting phage 1717 | 2.6 | 3 | 2,428,442–2,431,075 | BHW77_12620 and BHW77_12650 | 55.88 | NC_011357 |
| 11/Enterobacteria phage YYZ-2008 | 113.4 | 31 | 2,514,193–2,627,682 | BHW77_13055 and BHW77_13845 | 51.50 | NC_011356 |
| 12/Enterobacteria phage BP-4795 | 60 | 29 | 2,800,143–2,860,221 | BHW77_14595 and BHW77_14980 | 50.25 | NC_004813 |
| 13/Enterobacteria phage BP-4795 | 57.4 | 32 | 2,967,268–3,024,670 | BHW77_15525 and BHW77_15920 | 50.94 | NC_004813 |
| 14/Enterobacteria phage P88 | 24.8 | 18 | 3,361,891–3,386,762 | BHW77_17640 and BHW77_17830 | 50.31 | NC_026014 |
| 15/ | 30.1 | 14 | 3,440,475–3,470,640 | BHW77_18140 and BHW77_18340 | 54.39 | NC_001416 |
| 16/ | 24.1 | 6 | 3,463,508–3,487,685 | BHW77_18270 and BHW77_18470 | 47.40 | NC_025434 |
| 17/Enterobacteria phage cdtI | 48.2 | 22 | 3,709,595–3,757,796 | BHW77_19495 and BHW77_19800 | 50.11 | NC_009514 |
| 18/Enterobacteria phage SfI | 26.1 | 6 | 4,330,960–4,357,158 | BHW77_22420 and BHW77_22605 | 45.57 | NC_027339 |
| 19/Acidianus tailed spindle virus | 10.1 | 1 | 4,801,573–4,811,702 | BHW77_24685 and BHW77_24755 | 46.57 | NC_029316 |
aGenes flanking the phage insertion sites are represented as locus-tags in the annotated sequence (Accession number: CP017251.1)
Fig. 2Linear maps showing bacteriophage insertion sites in linear conserved blocks generated by Mauve alignment of the chromosomal sequences of strains NADC 6564 and reference strains EDL933 and Sakai. Phages are numbered from 1 to 19 and arrows indicate sites of phage insertion in LCB
Fig. 3a Comparison of Blast Ring Image Generated for E. coli O157:H7 strain NADC 6564 (black innermost ring) with similar images generated for other lineage I (EDL933, red ring; Sakai, purple ring), lineage II (FRIK2533, green ring), and lineage I/II (TW14359, blue ring) E. coli O157:H7 strains, non-O157 strains (O111, yellow ring; O26, blue ring) and a non-pathogenic E. coli K12 strain MG1655 (outermost orange ring). The colored legend to the right shows the degree of nucleotide sequence homology between the sequences of strain NADC 6564 to other O157:H7 and non-O157:H7 E. coli strains. The white gaps in rings represent genomic regions that are missing in the corresponding genomes compared to strain NADC 6564. Locations of genomic islands (GI) and phage or phage-like elements (P) are indicated on the two outer rings. b Magnified picture of the 39-kb insertion element (IE) identified in the chromosome of E. coli O157:H7 strain NADC 6564 but lacking in the chromosomes of other strains listed in the legend on the right side panel. This element is bounded at both ends by a 56 bp repeat sequence (not shown in the figure) and inserted at the tRNA-leu gene in the chromosome. The outer most ring shows relative locations of various genes and their encoded proteins and associated functions
Specific genes and their functions encoded in the 39-kb insertion element
| Genes | Proteins/enzymes | Functions |
|---|---|---|
|
| Phosphoethanolamine transferase (PEAX)a | LPS modification/colistin resistance |
|
| DNA sulfur modification proteinsb | DNA phosphorothioation (replacing non-bridging phosphate oxygen with sulfur in the DNA backbone)/oxidative stress response |
|
| DNA phosphorothioation-dependent restriction enzymesa | DNA restriction of phosphorothioated DNA/foreign DNA recognition and restriction |
|
| ProQ | Osmoprotectant transport activator |
aPEAX represents phosphoethanolamine transferase encoded by the peaX gene located on the 39-kb IE of NADC 6564
bActivities of these enzymes were verified by specific in vitro assays as described in Methods and Results
Fig. 4Confirming DNA phosphorothioation and DNA restriction activities encoded by dndBCDE and dptFGH operons. a Equal amounts of genomic DNA isolated from NADC 6564 and other strains listed on the right side of the gel picture was either treated or not treated with iodine. The treated DNA was loaded in to lanes marked with + and untreated DNA loaded in to lanes marked with – signs. Smearing effect in the lane loaded with iodine-treated DNA is indicative of DNA degradation compared to the presence of intact DNA (approximately 25 to 50 kb in size based on the size of molecular weight markers loaded in lanes marked M) in lanes loaded with untreated DNA. b and c Confirmation of S-DNA modification-dependent restriction activity encoded by the dpt operon. The pUC18 plasmid DNA lacking S-DNA modification was electroporated into strains NADC 6564 (dpt operon-positive) and EDL933 (dpt operon-negative) and recovery of this plasmid was determined by comparing number of colonies produced on LB agar-carbenicillin (100 μg/mL) by the electroporated cultures of these strains (b). Absolute colony counts shown represent mean plus and minus standard deviation of three independent experiments (c). d Table showing comparisons of nucleotide sequence homology of peaX gene encoded on the IE to other four pea genes located elsewhere on the chromosome of NADC 6564 and to mcr genes encoding colistin resistance in E. coli. Numbers in colored checker boxes represent homology between compared genes numbered 1–7 and labeled with corresponding gene name on the outer margins of the Table. e Graph showing the plot of minimum inhibitory concentration (MIC) of colistin in strain NADC 6564 compared to strains EDL933 and Sakai lacking the peaX gene and the 39 kb insertion element. E. coli TOP10 (sensitive to colistin) and E. coli BEAR119605 (resistant to colistin) were used as negative and positive controls in the MIC assay. Results are mean plus the standard deviation of three independent assays. **** = p < 0.0001
Fig. 5Cladograms showing evolutionary relationship of E. coli O157 strain NADC 6564 to other pathogenic O157 and non-O157 strains. a Cladogram produced by constructing concatenated core genomes from whole genome sequences of 40 bacterial strains listed in Additional file 6: Table S1. A total of 2284 core genes were identified among the 40 strains used for constructing core genomes. These core genomes were used to infer a maximum likelihood phylogenetic tree using IQTREE. The generated tree was visualized using FigTree. Since there was no outgroup included in the tree, FigTree’s midpoint rooting method was used to root the tree. b Cladogram inferred using the parSNP-Gingr pipeline. Whole genomes for the sequences being compared were provided as input to parSNP using the NADC 6564 annotated GenBank file as the reference. Evolutionary trees were created by visualizing the parSNP output in Gingr