| Literature DB >> 30519380 |
Hyein Jang1, Jungha Woo1, Youyoung Lee1, Flavia Negrete1, Samantha Finkelstein1, Hannah R Chase1, Nicole Addy1, Laura Ewing1, Junia Jean Gilles Beaubrun1, Isha Patel1, Jayanthi Gangiredla1, Athmanya Eshwar2, Ziad W Jaradat3, Kunho Seo4, Srikumar Shabarinath5,6, Séamus Fanning5,6, Roger Stephan2, Angelika Lehner2, Ben D Tall1, Gopal R Gopinath1.
Abstract
Cronobacter sakazakii is a Gram-negative opportunistic pathogen that causes life- threatening infantile infections, such as meningitis, septicemia, and necrotizing enterocolitis, as well as pneumonia, septicemia, and urinary tract and wound infections in adults. Here, we report 26 draft genome sequences of C. sakazakii, which were obtained from dried spices from the USA, the Middle East, China, and the Republic of Korea. The average genome size of the C. sakazakii genomes was 4393 kb, with an average of 4055 protein coding genes, and an average genome G + C content of 56.9%. The genomes contained genes related to carbohydrate transport and metabolism, amino acid transport and metabolism, and cell wall/membrane biogenesis. In addition, we identified genes encoding proteins involved in osmotic responses such as DnaJ, Aquaproin Z, ProQ, and TreF, as well as virulence-related and heat shock-related proteins. Interestingly, a metabolic island comprised of a variably-sized xylose utilization operon was found within the spice-associated C. sakazakii genomes, which supports the hypothesis that plants may serve as transmission vectors or alternative hosts for Cronobacter species. The presence of the genes identified in this study can support the remarkable phenotypic traits of C. sakazakii such as the organism's capabilities of adaptation and survival in response to adverse growth environmental conditions (e.g. osmotic and desiccative stresses). Accordingly, the genome analyses provided insights into many aspects of physiology and evolutionary history of this important foodborne pathogen.Entities:
Keywords: Cronobacter sakazakii; Draft Genomes; Dried Spices; Plant-origin; WGS
Year: 2018 PMID: 30519380 PMCID: PMC6267090 DOI: 10.1186/s40793-018-0339-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Transmission electron photomicrograph of a typical Cronobacter sakazakii strain (ES632) grown on Trypticase soy agar supplemented with 1% sodium chloride, and incubated at 37 °C for 22 h. The cells were negatively stained with 0.5% sodium phosphotungstate (pH 6.8). Note the presence of numerous peritrichously expressed flagella (arrow). Bar represents 1 μm
Classification and general features of C. sakazakii strains used in this study
| MGS ID | Property | Term | Evidence Codea |
|---|---|---|---|
| Classification | Domain: Bacteria | ||
| Phylum: Proteobacteria | |||
| Class: Gammaproteobacteria | |||
| Order: Enterobacteriales | |||
| Family: | |||
| Genus: | |||
| Species: | |||
| Strains: MOD1_AS-2, MOD1_AS-4, MOD1_AS-13, MOD1_AS-15, MOD1_Jor20, MOD1_Jor22, MOD1_Jor44, MOD1_Jor93, MOD1_Jor96, MOD1_Jor103, MOD1_Jor146, MOD1_Jor148, MOD1_Jor151, MOD1_Jor154, MOD1_Jor172, MOD1_Jor173, MOD1_Jor178, MOD1_Jor183, MOD1_KW3, MOD1_KW13, MOD1_O21–13, MOD1_O21–16, MOD1_O26–1, MOD1_O26–4, MOD1_O23mB, MOD1_788569 | |||
| Gram stain | Negative | TAS [ | |
| Cell shape | Rod-shaped | TAS [ | |
| Motility | Motile by peritrichous flagella | TAS [ | |
| Sporulation | Non-sporulating | TAS [ | |
| Temperature range | 6 to 45 °C | TAS [ | |
| Optimum temperature | 37 °C | TAS [ | |
| pH range | pH 5 to 10 | TAS [ | |
| Carbon source | α-D-glucose, β-D-fructose, D-galactose, trehalose, D-mannose, α-melibiose, sucrose, raffinose, maltotriose, maltose, α-lactose, 1–0-methyl α/β-galactopyranoside, cellobiose, β-gentiobiose, 1–0-methyl β-D-glucopyranoside, aesculin, L-arabinose, D-xylose, glycerol, D-mannitol, L-malate, D-glucuronate, D-galacturonate, 2-keto-D-gluconate, N-acetyl D-glucosamine, arbutin, DL-α-glycerol-phosphate, dihydroxyacetone, D-ribose, L-lyxose, pyruvic acid, D-gluconate, DL-lactate, succinate, fumarate, DL-glycerate, D-glucosamine, L-aspartate, L-glutamate, L-proline, D-alanine, L-alanine and L-serine. | TAS [ | |
| MIG5–6 | Habitat | Environment, Eukaryotic plant-origin, Human | TAS [ |
| Energy source | Chemoheterotrophic | TAS [ | |
| MIG6–3 | Salinity | Grows up to 10% NaCl | TAS [ |
| MIG5–22 | Oxygen requirement | Facultatively anaerobic | TAS [ |
| MIG5–15 | Biotic relationship | Eukaryotic plant-origin, Human | TAS [ |
| MIG5–14 | Pathogenicity | Human pathogen | TAS [ |
| MIG5–23 | Isolation | Bacteriological Analytical Manual, ISO/TS 22964:2017 | TAS [ |
| MIG5–4 | Geographic location | USA, Europe, Asia, Central America, South America | TAS [ |
| MIG5–5 | Sample collection | Plant-origin | TAS [ |
| MIG5–4.1 | Latitude | variable | TAS [ |
| MIG5–4.2 | Longitude | variable | TAS [ |
| MIG5–4.4 | Altitude | variable | TAS [ |
aEvidence codes: TAS Traceable author statement (i.e., a direct report exists in the literature). These codes are from the Gene Ontology project [42]
Minimum information about a genome sequence (MIGS); project information for the 26 spice- associated C. sakazakii strains
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Improved high-quality draft |
| MIGS-28 | Libraries used | Illumina Nextera XT, pair-end |
| MIGS 29 | Sequencing platforms | Illumina MiSeq |
| MIGS 31.2 | Fold coverage | 50X |
| MIGS 30 | Assemblers | de novo assembly, CLC Genomics Workbench version 9.0 |
| MIGS 32 | Gene calling method | RAST annotation server [ |
| Locus Tag | See Table | |
| Genbank ID | See Table | |
| GenBank Date of Release | 2018/03/07 | |
| GOLD ID | SEE Table | |
| BIOPROJECT | PRJNA258403 ( | |
| Project relevance | Food Safety, source attribution |
Draft genomes, source, geographic locale, genome size, topology, %G + C content, No. of CDS, sequence type (ST), accession numbers, GOLD project ID, and locus tag of strains captured under the FDA-CFSAN Cronobacter GenomeTrakr NCBI BioProject PRJNA258403 and used in this study
| Strain Name | Source | Geographic Locale | Genome Size (kb) | Topology | G + C content (%) | No. of CDS | ST | NCBI Accession no. | GOLD Analysis Project IDb | Locus tag |
|---|---|---|---|---|---|---|---|---|---|---|
| MOD1_Jor173 | Unknown Spice | Jordan | 4403 | Circular | 56.9 | 4030 | 1, CC1 | PVCG00000000 | Ga0259519 | PVCG01 |
| MOD1_Jor146 | Liquorice | Jordan | 4409 | Circular | 56.9 | 4059 | 3, CC3 | PVMV00000000 | Ga0259523 | PVMV01 |
| MOD1_Jor96 | Fennel | Jordan | 4667 | Circular | 56.6 | 4337 | 4, CC4 | PVCE00000000 | Ga0259516 | PVCE01 |
| MOD1_Jor148 | Unknown Spice | Jordan | 4573 | Circular | 56.8 | 4251 | 4, CC4 | PVCF00000000 | Ga0259517 | PVCF01 |
| MOD1_Jor154 | Unknown Spice | Jordan | 4392 | Circular | 56.9 | 4064 | 4, CC4 | NITP00000000 | Ga0260550 | NITP01 |
| MOD1_Jor178 | Chamomile | Jordan | 4787 | Circular | 56.4 | 4409 | 4, CC4 | PVBV00000000 | Ga0259520 | PVBV01 |
| MOD1_KW13 | Dried Garlic | Republic of Korea | 4493 | Circular | 56.9 | 4176 | 13, CC13 | NITD00000000 | Ga0260553 | NITD01 |
| MOD1_Jor183 | Unknown Spice | Jordan | 4326 | Circular | 56.9 | 3934 | 21, CC21 | NITN00000000 | Ga0260551 | NITN01 |
| MOD1_788569 | Siberian Ginseng, | China | 4503 | Circular | 56.8 | 4162 | 31, CC31 | PVCL00000000 | Ga0259506 | PVCL01 |
| MOD1_KW3 | Dried Hot Pepper | Republic of Korea | 4372 | Circular | 56.9 | 4042 | 40, CC40 | NITH00000000 | Ga0260552 | NITH01 |
| MOD1_AS-2 | Allspice | USA | 4306 | Circular | 57.0 | 3987 | 64, CC64 | PVCH00000000 | Ga0259508 | PVCH01 |
| MOD1_AS-4 | Allspice | USA | 4297 | Circular | 57.0 | 3975 | 64, CC64 | PVCI00000000 | Ga0259509 | PVCI01 |
| MOD1_AS-13 | Allspice | USA | 4312 | Circular | 57.0 | 3980 | 64, CC64 | PVCJ00000000 | Ga0259510 | PVCJ01 |
| MOD1_AS-15 | Allspice | USA | 4313 | Circular | 57.0 | 3983 | 64, CC64 | PVCK00000000 | Ga0259511 | PVCK01 |
| MOD1_Jor172 | Unknown Spice | Jordan | 4331 | Circular | 57.0 | 4012 | 64, CC64 | NCWD00000000 | Ga0260555 | NCWD01 |
| MOD1_O21_16 | Oregano | USA | 4407 | Circular | 57.0 | 4071 | 99, CC99 | PVSQ00000000 | Ga0260560 | PVSQ01 |
| MOD1_O26_1 | Oregano | USA | 4408 | Circular | 57.0 | 4071 | 99, CC99 | PVBX00000000 | Ga0259522 | PVBX01 |
| MOD1_O21_13 | Oregano | USA | 4375 | Circular | 57.0 | 4059 | 219, CC155 | PVBW00000000 | Ga0259521 | PVBW01 |
| MOD1_O23mB | Oregano | USA | 4339 | Circular | 56.9 | 3991 | 226, CC8 | PVBZ00000000 | Ga0259507 | PVBZ01 |
| MOD1_O26_4 | Oregano | USA | 4338 | Circular | 56.9 | 3972 | 226, CC8 | PVBY00000000 | Ga0260554 | PVBY01 |
| MOD1_Jor20 | Unknown Spice | Jordan | 4468 | Circular | 56.7 | 4117 | 226, CC8 | PVCA00000000 | Ga0259512 | PVCA01 |
| MOD1_Jor22 | Chamomile | Jordan | 4469 | Circular | 56.7 | 4112 | 226, CC8 | PVCB00000000 | Ga0259513 | PVCB01 |
| MOD1_Jor44 | Unknown Spice | Jordan | 4482 | Circular | 56.9 | 4133 | 8, CC8a | PVCC00000000 | Ga0259514 | PVCC01 |
| MOD1_Jor151 | Unknown Spice | Jordan | 4489 | Circular | 56.9 | 4142 | 8, CC8a | PVMW00000000 | Ga0259518 | PVMW01 |
| MOD1_Jor93 | Unknown Spice | Jordan | 4331 | Circular | 57.1 | 3973 | 643 | PVCD00000000 | Ga0259515 | PVCD01 |
| MOD1_Jor103 | Unknown Spice | Jordan | 4425 | Circular | 57.0 | 4014 | 643 | NITR00000000 | Ga0260549 | NITR01 |
aSix exact matches (100% homology) of the allelic profiles (allele profile number in parentheses) for the Cronobacter MLST genes: (8) fusA, (7) glnS, (5) gltB, (8) gyrB, (15) infB and (10) pps, and the closest match of these strains in the MLST database is strain 2274, MLST ID 1390 (alias, L1). The closest ST match is ST8, CC8 except that the allelic profile number for atpD was 121 for these strains which differs from the reported allelic profile number 11 for this ST.
bJGI IMG/MER study ID number is Gs0133658
Summary of the genome statistics of the 26 C. sakazakii strains evaluated in this studya
| Attribute | Value | Range | % of Total |
|---|---|---|---|
| Genome size (kb) | 4393 | 4052-4716 | 100.0 |
| DNA coding (kb) | 3898 | 3779-4160 | 88.3 |
| Number of DNA G + C bases (kb) | 2510 | 2438-2664 | 56.9 |
| DNA scaffolds | 46.2 | 23–100 | 100.0 |
| Total genes | 4207 | 4090-4541 | 100.0 |
| Protein coding genes | 4055 | 3937-4383 | 96.4 |
| RNA genes | 151.6 | 142–162 | 3.6 |
| Pseudo genesd | 100.6 | 73–157 | - c |
| Genes in internal clusters | 887.1 | 829–962 | 21.0 |
| Genes assigned to COGs | 3,151b | 3101-3251 | 74.9 |
| Genes with Pfam domain | 3877 | 3595-3879 | 87.4 |
| Genes with signal peptides | 413.5 | 403–436 | 9.8 |
| Genes with transmembrane proteins | 994.6 | 978–1038 | 23.7 |
| CRISPR repeatsd | 2.6 | 2–4 | -c |
aData was obtained from the JGI IMG pipeline. Note: Genome statistics for each individual strain is shown in Additional file 1: Table S1
bThe number of genes assigned to COGs by NCBI was 3902 compared to the value (3151 genes) assigned by the JGI IMG pipeline
cNCBI pipeline did not have the % total for the CRISPR repeats and pseudo genes
dData was obtained from the NCBI, https://www.ncbi.nlm.nih.gov/nuccore
Summary of the average number of genes and percentage of each genome representing each COG functional category associated with the 26 C. sakazakii strains evaluated in this studya
| Code | Value | %age | Description |
|---|---|---|---|
| J | 177 | 4.5 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 304 | 7.8 | Transcription |
| L | 168 | 4.3 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 46 | 1.2 | Cell cycle control, Cell division, chromosome partitioning |
| V | 54 | 1.4 | Defense mechanisms |
| T | 149 | 3.8 | Signal transduction mechanisms |
| M | 245 | 6.3 | Cell wall/membrane biogenesis |
| N | 75 | 1.9 | Cell motility |
| U | 64 | 1.6 | Intracellular trafficking and secretion |
| O | 146 | 3.8 | Posttranslational modification, protein turnover, chaperones |
| C | 206 | 5.3 | Energy production and conversion |
| G | 323 | 8.3 | Carbohydrate transport and metabolism |
| E | 281 | 7.2 | Amino acid transport and metabolism |
| F | 102 | 2.6 | Nucleotide transport and metabolism |
| H | 148 | 3.8 | Coenzyme transport and metabolism |
| I | 84 | 2.1 | Lipid transport and metabolism |
| P | 235 | 6.0 | Inorganic ion transport and metabolism |
| Q | 46 | 1.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 0 | 0.0 | General function prediction only |
| S | 894 | 22.9 | Function unknown |
| – | 154 | 4.0 | Not in COGs |
The total is based on the total average number of protein coding genes (3902) for the genome. aNote: A summary of the total number of COG alleles per strain is shown in Additional file 2: Table S2. Individual strain’s genome statistics is shown in Additional file 3: Table S3
Prevalence and distribution of pESA3 alleles associated with the virulence plasmid and pESA2/pCTU3 plasmids harbored by 26 spice-associated C. sakazakii isolates
| No. of | pESA3/pCTU1 (incFIB, | No. of isolates with the indicated plasmidotypea | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| T6SS | FHA | Iron acquisition | Other plasmidsb | |||||||
|
| Int L |
| R end | Int R |
|
|
| pESA2/pCTU2 | pCTU3 (incH1) | ||
| 26 | 26 (100) | 26 (100) | 26 (100) | 7 (27) | 11 (42) | 3 (12) | 6 (23) | 26 (100) | 26 (100) | 1 (4) | 5 (21) |
aNumbers within parentheses are the percentage of PCR-positive strains for each gene locus in relation to the total number of plasmid- harboring spice-associated C. sakazkaii strains
bOnly 24 strains were analyzed by PCR for presence of pESA2 and pCTU3 (MOD1_788569 and MOD1_O123mB strains were not analyzed). Therefore, the percent positive for pESA2 and pCTU3 were calculated using a total number of 24 strains
Fig. 2Phylogenetic analysis of Cronobacter sakazakii strains isolated from spices, compared with eight representative Cronobacter species strains (marked with superscripted ‘T’ after each strain’s name). NCBI GenBank Accession numbers of type strains: C. malonaticus LMG 23826T (NZ_CP013940), C. turicensis LMG 23827T (NC_013282), C. universalis NCTC 9529T (NZ_CP012257), C. muytjensii ATCC 51329T (NZ_CP012268), C. dublinensis subsp. dublinensis LMG 23823T (NZ_CP012266), C. dublinensis subsp. lactaridi LMG 23825T (NZ_AJKX00000000), C. dublinensis subsp. lausannensis LMG 23824T (NZ_AJKY00000000), and C. condimenti LMG 26250T (NZ_CP012264). Whole genome SNP analysis was carried out using kSNP3 software [48]. The phylogenetic tree was built using neighbor-joining method [65] and the evolutionary distances were computed using the Maximum Composite Likelihood method [66] available on MEGA7 phylogenetic suite [67]. The bootstrap values obtained from 500 bootstrap replicates are reported as percentages at the nodes [68]. Sequence type (ST) information was obtained by uploading each strain’s genome assembly to the Cronobacter MLST website (http://pubmlst.org/cronobacter/) after which the ST information was manually overlaid onto the tree with different color. Note that the phylogeny among the strains followed ST evolutionary lineages. The scale bar indicates 0.10 substitutions per nucleotide position
Fig. 3Schematic map made using XPlas, Map DNA for Mac OS XAp (http://www.iayork.com) showing the annotated xylose utilization operon from MOD1_AS-15 C. sakazakii strain (a). Xylose utilization operon for C. sakazakii strains MOD1_Jor22, _Jor151, AS-15,_ KW3 which were extracted from PairWise Alignments using Geneious (https://www.geneious.com/) showing the identical sequence repeat regions which are associated with each gene of the operon and captured using the identical repeat sequence region function in Geneious (b). Repeat regions are denoted by bronzed colored lines below each operon gene. These repeated regions are also shown in Additional file 5: Table S5. c Nucleotide sequence alignment captured in Geneious for repeat region five in xylB for MOD1_AS-15 and MOD-1_Jor22 showing the presence of the repeat region in MOD1_Jor22 (56,266 to 56,280, see red box). Note that bax can contain two to three identical repeat regions which suggest that this is an important highly regulated gene. bax has been shown to induce cell apoptosis of Arabidopsis protoplast cells through reactive oxygen independent and dependent processes namely DNA fragmentation, increased vacuolation, and loss of plasma membrane integrity [61]
Summary of inverted repeat, palindrome, and direct repeat present in C. sakazakii strains MOD1_Jor151 and MOD1_Jor173 genomesa
| Type of repeats | Strain | Gene | Sequence |
|---|---|---|---|
| IRb | MOD1_Jor151 (108,510-108,524)c |
| GCCTTTCGCCAGCGG… |
| MOD1_Jor151 (117,747-117,761) |
| …CCGCTGGCGAAAGGC | |
| MOD1_Jor151 (120,139-120,154) |
| GACAAATGGCAGCCAG… | |
| MOD1_Jor151 (122,314-122,329) |
| …CTGGCTGCCATTTGTC | |
| MOD1_Jor151 (119,330-119,345) |
| GCTGTTTCGCGAAGGC… | |
| MOD1_Jor151 (122,381-122,396) |
| …GCCTTCGCGAAACAGC | |
| P | MOD1_Jor151 (116,843-116,858) |
| CATGGTCG CGACCATG… |
| MOD1_Jor151 (116,843-116,858) |
| …CATGGTCG CGACCATG | |
| DR | MOD1_Jor151 (113,704-113,719) |
| TCACCAGCTGGTGCAG… |
| MOD1_Jor151 (123,862-123,877) |
| TCACCAGCTGGTGCAG… | |
| MOD1_Jor151 (116,936-116,950) |
| GTAACGCTTCGCGAT… | |
| MOD1_Jor151 (123,587-123,601) |
| GTAACGCTTCGCGAT… | |
| MOD_Jor173 (74,114-74,128) |
| TGTGCTGGTGCCGCC… | |
| MOD_Jor173 (81,051-81,065) |
| TGTGCTGGTGCCGCC… |
aGenome assemblies were analyzed using the sequence repeat finder algorithm within Geneious. These two examples represent the various sequence repeat permutations found among the 26 spice-associated strains. For specific locations of the sequence repeats for each stain please refer to Additional file 5: Table S5
bAbbreviations: IR Inverted repeat, P Palindrome, DR Direct repeat
cNumbers within the parenthesis refer to the start and end base position of sequence repeats within Geneous
Fig. 4Schematic representation of xylose utilization by C. sakazakii strain MOD1_Jor22. The proposed model for xylose utilization involves activation of the xylose regulon by the binding of D-xylose with XylR. It is thought that D-xylose enters the cell either through diffusion or transport via XylE or XylFGH. In addition, xylanase A (XynA) is secreted to the extracellular milieu through an unknown type 2 secretion component where it can digest xylan to β-xyloside which is then brought into the cell via a xyloside transporter (XynT, a putative β-xyloside transporter) where XynB (β-xylosidase) converts it to D-xylose. Though unconfirmed, α- xyloside is thought to be transported into the cell where XylS (α-xylosidase) converts to D- xylose. D-xylose then is converted to D-xylulose by XylA (xylose isomerase) and then converted to D-xylulose-5P by XylB (xylulokinase). This physiological pathway is identical to that of E. coli. Similar to that of E. coli, Cronobacter also have anaerobic metabolic pathway where D- xylose is converted to xylitol by oxidoreductase and then converted to D-xyloulose using NAD(P)-dependent alcohol dehydrogenase. D-xylulose-5P is then shunted into pentose- phosphate pathway