Literature DB >> 28293682

Genome Sequence of the Thermotolerant Foodborne Pathogen Salmonella enterica Serovar Senftenberg ATCC 43845 and Phylogenetic Analysis of Loci Encoding Increased Protein Quality Control Mechanisms.

Scott V Nguyen1, Gregory P Harhay1, James L Bono1, Timothy P L Smith1, Dayna M Harhay1.   

Abstract

Salmonella enterica subsp. enterica bacteria are important foodborne pathogens with major economic impact. Some isolates exhibit increased heat tolerance, a concern for food safety. Analysis of a finished-quality genome sequence of an isolate commonly used in heat resistance studies, S. enterica subsp. enterica serovar Senftenberg 775W (ATCC 43845), demonstrated an interesting observation that this strain contains not just one, but two horizontally acquired thermotolerance locus homologs. These two loci reside on a large 341.3-kbp plasmid that is similar to the well-studied IncHI2 R478 plasmid but lacks any antibiotic resistance genes found on R478 or other IncHI2 plasmids. As this historical Salmonella isolate has been in use since 1941, comparative analysis of the plasmid and of the thermotolerance loci contained on the plasmid will provide insight into the evolution of heat resistance loci as well as acquisition of resistance determinants in IncHI2 plasmids. IMPORTANCE Thermal interventions are commonly used in the food industry as a means of mitigating pathogen contamination in food products. Concern over heat-resistant food contaminants has recently increased, with the identification of a conserved locus shown to confer heat resistance in disparate lineages of Gram-negative bacteria. Complete sequence analysis of a historical isolate of Salmonella enterica serovar Senftenberg, used in numerous studies because of its novel heat resistance, revealed that this important strain possesses two distinct copies of this conserved thermotolerance locus, residing on a multireplicon IncHI2/IncHI2A plasmid. Phylogenetic analysis of these loci in comparison with homologs identified in various bacterial genera provides an opportunity to examine the evolution and distribution of loci conferring resistance to environmental stressors, such as heat and desiccation.

Entities:  

Keywords:  IncHI2; Salmonella; genome analysis; phylogenetic analysis; thermotolerance

Year:  2017        PMID: 28293682      PMCID: PMC5347186          DOI: 10.1128/mSystems.00190-16

Source DB:  PubMed          Journal:  mSystems        ISSN: 2379-5077            Impact factor:   6.496


INTRODUCTION

Heat-resistant bacterial contaminants have increasingly become a source of concern in the last several years especially with regard to food safety and human health. Although many members of the family Enterobacteriaceae exhibit variable heat resistance after heat shock, relatively few strains have been found to demonstrate an innate thermotolerant phenotype. Recently, however, a genetic locus found to confer heat resistance has been identified in certain strains of Cronobacter sakazakii, Escherichia coli, and Klebsiella species (1–3). Using a top-down proteomic approach, Williams et al. identified a protein biomarker sequence in Cronobacter correlating with thermotolerance to reverse engineer the DNA sequence of the protein that was homologous to a protein found in thermotolerant Methylobacillus flagellatus (4). Gajdosova and coworkers expanded upon this work and sequenced an 18-kbp region containing this open reading frame (ORF) and determined that the full locus is required for highest thermotolerance (1). Independently, Bojer et al. (3) isolated several Klebsiella pneumoniae strains over the course of a 2-year surveillance study of patients in a Danish hospital; this study demonstrated multiple antibiotic resistance phenotypes (3). These nosocomial isolates were associated with persistence in contaminating reusable endoscopes, and Bojer et al. (3) hypothesized that the adaptability of the isolates was due to their resiliency to harsh environmental stressors such as heat. Bojer et al. then identified a locus encoding small heat shock proteins and a novel Clp ATPase termed ClpK (3). This locus showed high similarity to the same chromosomal region from Methylobacillus that Williams et al. (4) identified and Gajdosova later sequenced (1). Subsequently, Bojer and coworkers reported that the clpK gene was cotransferred with plasmids encoding multiple drug resistance hosted by emerging K. pneumoniae clones associated with nosocomial outbreaks (5). More recently, Mercer et al. (2) and Dlusskaya et al. (6) sequenced an E. coli beef isolate that was able to survive in beef patties grilled to 71°C. Genomic comparisons of highly heat-resistant E. coli isolates resulted in the identification of a 14-kbp locus of heat resistance with greater than 99% identity to the aforementioned loci in Cronobacter and Klebsiella (2). Lee et al. expanded on the thermotolerance genetic module, noting that other ORFs on the loci are homologous to proteins involved in protein homeostasis and proposed that the loci be called transmissible loci for protein quality control (TLPQC) (7, 8). This locus appears to have been horizontally acquired in some bacterial lineages, as it is found not only in a number of clinically relevant members of the family Enterobacteriaceae but also more broadly in other Proteobacteria species (7). A striking example of innate thermotolerance in the foodborne pathogen Salmonella enterica subsp. enterica has been reported for a particular strain of S. enterica serovar Senftenberg (9). This isolate, S. Senftenberg ATCC 43845 (originally referred to as strain 775W), has been used extensively in thermal resistance studies over the last several decades (9–12); it was first reported to be extremely thermotolerant in 1946 by Winter et al. (13), having been isolated previously by E. Beckler in 1941 from Chinese egg powder at the Massachusetts Department of Public Health (14). While dried egg products are not commonly sold in modern times, dried egg powder was used in rations in many Western countries, especially England, during the onset of World War II. Indeed, multiple reports of salmonellosis in the 1940s were associated with consumption of powdered eggs (15, 16). Despite being used in numerous studies examining thermotolerance, the genetic determinants conferring heat resistance in S. Senftenberg ATCC 43845 have not been identified previously. Analysis of the complete genome of S. Senftenberg ATCC 43845 revealed the presence of two closely related loci with more than 98% identity to the thermotolerance loci previously studied in Cronobacter, Escherichia, Klebsiella, and Pseudomonas (1–3, 8) on a potentially conjugable IncHI2 plasmid. As the synteny of the genetic loci is highly conserved broadly across diverse members of the phylum Proteobacteria, here we designate the putative thermotolerance loci in S. Senftenberg TLPQC-1 and TLPQC-2 (Fig. 1) in keeping with the nomenclature proposed by Lee et al. (7). Comparative analysis of the S. Senftenberg TLPQCs, as well as other known homologs, provides a unifying framework for understanding the origin and evolution of loci conferring mechanisms of increased protein maintenance and reveals the prevalence and conservation of these loci within the Proteobacteria.
FIG 1 

Genetic synteny is highly conserved between TLPQC (transmissible loci for protein quality control). The synteny of the TLPQC is highly conserved, suggesting an origin of horizontal gene transfer between several genera (Salmonella, Methylobacillus, Cronobacter, Escherichia, and Klebsiella). Despite reductions in the loci, the synteny of the loci is conserved in the various clades. Conserved ORFs with predicted functions are colored in the figure. The functions are indicated by the following letters (and colors): A, small heat shock proteins (dark blue); B, Clp protease (green); C, cardiolipin synthase (teal); D, FtsH protease (pink); E, YfdX family protein (magenta); F, thioredoxin (black); G, KefB glutathione-regulated potassium efflux pump (purple); H, Zn-dependent protease (red); and I, periplasmic serine protease (dark green). Letters within parentheses indicate that the gene is truncated. The E. coli AW 1.7 nucleotide accession number is NZ_LDYJ01000141.

Genetic synteny is highly conserved between TLPQC (transmissible loci for protein quality control). The synteny of the TLPQC is highly conserved, suggesting an origin of horizontal gene transfer between several genera (Salmonella, Methylobacillus, Cronobacter, Escherichia, and Klebsiella). Despite reductions in the loci, the synteny of the loci is conserved in the various clades. Conserved ORFs with predicted functions are colored in the figure. The functions are indicated by the following letters (and colors): A, small heat shock proteins (dark blue); B, Clp protease (green); C, cardiolipin synthase (teal); D, FtsH protease (pink); E, YfdX family protein (magenta); F, thioredoxin (black); G, KefB glutathione-regulated potassium efflux pump (purple); H, Zn-dependent protease (red); and I, periplasmic serine protease (dark green). Letters within parentheses indicate that the gene is truncated. The E. coli AW 1.7 nucleotide accession number is NZ_LDYJ01000141.

RESULTS AND DISCUSSION

Genomic features of S. Senftenberg ATCC 43845.

S. Senftenberg ATCC 43845 has been used extensively in thermal resistance studies over the last several decades, having been identified as an extremely thermotolerant strain of Salmonella in 1946 by Winter et al. (13). A draft genome assembly consisting of 114 contigs (139-kb contig N50; accession number NZ_AOXX00000000) and a total length of 5,184,857 bases have previously been reported for this strain (17). Here we present the complete and closed, finished-quality genome sequence of S. Senftenberg ATCC 43845 based on long-read sequencing. The circularized chromosome consists of 4,920,660 bp with a chromosomal GC content of 52.2%, comparable to other finished Salmonella (Fig. 2). Annotation by the NCBI Prokaryotic Genome Annotation Pipeline (version 3.3) indicated the presence of 5,105 total genes, with 4,980 coding sequences (CDS) and 125 genes encoding RNAs (GenBank accession number CP016837). This includes 3,930 predicted proteins with homology to known proteins, in addition to 979 hypothetical or “miscellaneous” predicted proteins. In addition, the chromosome contains potential restriction-modification (RM) systems, including a DNA (cytosine-5)-methyltransferase motif, as well as phage integration-related N6-adenine methylation systems. Genome analysis using PHASTER (18) revealed the presence of two intact prophages and one questionable prophage (Fig. 2). Manual inspection revealed a Mu-like prophage, with up to 77% nucleotide identity but only 11% coverage of phage D108 by PHASTER at nucleotide positions 30,172 to 82,470. The Mu-like prophage is integrated downstream of mnmE (also known as thdF or trmE) which is a hot spot integration site for other mobile genetic elements in Salmonella (19). The other intact phage identified by PHASTER is a prophage related to the Haemophilus influenzae HP1 phage at positions 738,135 to 768,638. Manual inspection of the phage region identified as questionable by PHASTER analysis showed what appears instead to be an intact prophage related to the Salmonella phage vB_SosS_Oslo at nucleotides 1,549,983 to 1,599,675. No known virulence factors were associated with any of the prophages. A putative integrative conjugative element (ICE) identified by BLASTn is integrated into tRNAPhe at nucleotide positions 4,324,610 to 4,433,343. BLAST analysis of the ICE showed no virulence factors; however, the element appears to encode an arsenic resistance locus and a copper homeostasis and silver resistance island (CHASRI) (20). Significantly, there were no predicted thermotolerance loci on the chromosome of the isolate revealed by BLAST analysis.
FIG 2 

Genome map of S. Senftenberg ATCC 43845. The outer ring displays the mobile genetic elements found in S. Senftenberg ATCC 43845. The second and third rings depict predicted CDS on the forward and reverse strands, respectively. The innermost rings depict GC content and skew.

Genome map of S. Senftenberg ATCC 43845. The outer ring displays the mobile genetic elements found in S. Senftenberg ATCC 43845. The second and third rings depict predicted CDS on the forward and reverse strands, respectively. The innermost rings depict GC content and skew.

Features of the pSSE ATCC-43845 plasmid.

The finished assembly of S. Senftenberg ATCC 43845 revealed the presence of a large 341,373-bp circular plasmid, not previously described for this strain, and is here designated pSSE ATCC-43845. Combining the plasmid length with the chromosome takes the total genome complement of the strain to 5,262,033 bases, slightly exceeding that predicted by the short-read draft assembly (17). Annotation of this plasmid sequence via the same method as the chromosome predicts 336 genes on the plasmid, including two complete thermotolerance loci with distinct sequences between nucleotides 189,272 to 211,036 and 134,033 to 148,118 of GenBank accession number CP016838 (Fig. 3). These two loci are designated TLPQC-1 and TLPQC-2. Both thermotolerance islands have characteristic high average GC content (61%), contain the canonical clp protease gene, and are flanked by small heat shock proteins and predicted DNA-binding genes, in keeping with previously reported heat tolerance islands (Fig. 1) (1–3, 7). The GC content of the islands suggests that the origins of TLPQC-mediated heat resistance lie in Proteobacteria with higher average GC content than Salmonella species.
FIG 3 

Map of the pSSE ATCC-43845 plasmid. The thermotolerance islands are indicated as orange regions on the map and demonstrate increased high GC content (black track). pSSE ATCC-43845 shares high homology with the backbone of the R478 plasmid (red tBLASTx). Base modification signals generated by BaseModFunctions v2.1.R and Circos are overlaid on the plasmid map. Qmod values are height proportional to the interpulse distance ratio for each of the modified bases on the positive strand (red outermost track) and negative strand (red innermost track). Blue hash marks correspond to the CGCAN8TCNG motifs present on the plasmid.

Map of the pSSE ATCC-43845 plasmid. The thermotolerance islands are indicated as orange regions on the map and demonstrate increased high GC content (black track). pSSE ATCC-43845 shares high homology with the backbone of the R478 plasmid (red tBLASTx). Base modification signals generated by BaseModFunctions v2.1.R and Circos are overlaid on the plasmid map. Qmod values are height proportional to the interpulse distance ratio for each of the modified bases on the positive strand (red outermost track) and negative strand (red innermost track). Blue hash marks correspond to the CGCAN8TCNG motifs present on the plasmid. BLASTn comparison of the entire plasmid sequence to microbial sequences in GenBank indicates that it is closely related to the lncHI2 R478 plasmid, sharing up to 98% identity with more than 61% coverage. In silico analysis of the plasmid sequence with PlasmidFinder (21) identified two replicon regions, IncHI2 and IncHI2A. Comparative analyses further revealed that the pSSE ATCC-43845 plasmid has a comparable GC content value of 48.0% to the GC content of R478 at 45.5% (Fig. 3) (22, 23). Further evidence that pSSE ATCC-43845 is a member of the IncHI2 plasmid family is found by comparing transposon insertion sites which have been used as markers to track closely related plasmids of the same family. In R478 and closely related IncHI2 plasmids with antibiotic resistance, Tn10 and the linked tetracycline resistance genes have been used for this purpose (24), and in pSSE ATCC-43845, the Tn10 transposition site (BFF41_25185 [Fig. 4]) was found to be intact. As with R478, several heavy metal resistance determinants are present on pSSE ATCC-43845, including a tellurite resistance island and another copy of CHASRI (23, 25). As illustrated in Fig. 4, the variable regions of R478 and pSSE ATCC-43845 are bracketed by the tellurite resistance operon and CHASRI. This variable region in R478 encodes mercury resistance and chloramphenicol (cat) and kanamycin (aphA) resistance, while in pSSE ATCC-43845, this region contains the two TLPQC loci. DNA sequence analyses showed that pSSE ATCC-43845 does not contain genes encoding drug resistance determinants. In keeping with this finding, antimicrobial sensitivity phenotyping (as evaluated by broth microdilution testing methods; TREK, Sensititre CMV3AGNF plates) showed S. Senftenberg ATCC-43845 to be pansusceptible to 15 antimicrobial agents tested on the panel. This finding is novel and suggests that pSSE ATCC-43845 is a relic of IncHI2 plasmids before the era of therapeutic antibiotic use, as present-day examples of IncHI2 plasmids have frequently been identified as carriers of antibiotic resistance genes (26–28).
FIG 4 

BLASTN comparison of the R478 and pSSE ATCC-43845 plasmids. The backbone for R478 is highly conserved in pSSE ATCC-43845. The majority of differences between the two plasmids reside between the tellurite resistance island and CHASRI (copper homeostasis and silver resistance island). The lack of any known antibiotic resistance genes, such as aphA (kanamycin) or cat (chloramphenicol), on pSSE ATCC-43845 is also noted. The integration site for Tn10 with tetracycline resistance is in BFF41_25185 in pSSE ATCC-4384. Heavy metal resistance genes (purple), antibiotic resistance genes (teal), TLPQC genes (red), transposase genes (blue), CHASRI genes (silver), and rep (magenta) and the mercury resistance (mer), methylase (dam), tellurite resistance (ter), arsenic resistance (ars), and replicase (rep) genes are indicated.

BLASTN comparison of the R478 and pSSE ATCC-43845 plasmids. The backbone for R478 is highly conserved in pSSE ATCC-43845. The majority of differences between the two plasmids reside between the tellurite resistance island and CHASRI (copper homeostasis and silver resistance island). The lack of any known antibiotic resistance genes, such as aphA (kanamycin) or cat (chloramphenicol), on pSSE ATCC-43845 is also noted. The integration site for Tn10 with tetracycline resistance is in BFF41_25185 in pSSE ATCC-4384. Heavy metal resistance genes (purple), antibiotic resistance genes (teal), TLPQC genes (red), transposase genes (blue), CHASRI genes (silver), and rep (magenta) and the mercury resistance (mer), methylase (dam), tellurite resistance (ter), arsenic resistance (ars), and replicase (rep) genes are indicated.

Methylation analysis.

Sequence analysis using the Pacific Biosciences platform produces data that can detect base modification in the source DNA (29). Detecting these modifications is dependent on both the depth of sequence coverage and the magnitude of the effect of the modification on the kinetics of nucleotide incorporation. Nucleotide modification is predicted from the interpulse distance data (see Materials and Methods) with N6-adenine methylation producing a strong signal that is easily detected at low coverage, while N5-cytosine methylation demonstrates a weaker signal that requires deeper coverage. Methylation motif analysis of the S. Senftenberg sequence data and comparisons to established motifs in REBASE (30) showed adenine N6-methylation of the sites GATC, CAGAG, and ATGCAT (the adenine methylated is shown underlined), as observed previously with other Salmonella serovars (31). Additional motifs were identified in S. Senftenberg and are summarized in Table 1. Of particular interest was methylation of the CGCAN8TCNG motif, as the presence of a cluster of three of these motifs was observed in each of the novel clp protease genes found in TLPQC-1 and TLPQC-2. This is illustrated in Fig. 3, where modification values (Qmod) indicating methylation of this site on the positive and negative strands are visualized as the outermost and innermost tracks, respectively, on pSSE ATCC-43845. Depiction of the methylation state of this site on the plasmid also reveals a skew of the presence of these motifs, which is likely due to the chimerism of the R478 backbone. The presence of these motifs in TLPQC-1 and -2 is noteworthy and hints at the possibility that this motif may play some regulatory role in the expression of genes contained in the TLPQC loci. Additional base modification motif maps of the plasmid are visualized in the supplemental figures (see Fig. S1 and S2 in the supplemental material).
TABLE 1 

Base modification summary table

MotifaMTase ORFbNo. of motifs in genomecNo. of motifs modifiedc% modifiedcType/subtypedCoveragee
CAGAGBFF41_177606,6076,57699.5III beta82.9
ATGCATBFF41_0269577624837.7//37.7II beta88.8
ACYN6GTTC BFF41_00175797//797793//79699.5//99.9I gamma80.6
CGCAN8TCNGf BFF41_20045849//849847//83899.8//98.7I gamma82.3
GAAN7RTACfBFF41_210251,036//1,0361,019//1,03098.4//99.4I gamma82.0
GATCgBFF41_0220541,31241,20299.7//99.7II82.6

Modified adenine bases are shown in bold type; cognate bases are shown in bold font.

Motifs are assigned to their respective putative methylases (methylase [MTase] ORFs).

Numbers separated by double slashes indicate the values for motifs detected on the positive and negative strand shown before and after the double slashes, respectively, if known.

The type and subtype refer to the class of methylase.

Coverage indicates the depth of sequence at the minimum quality threshold set in the protocol.

Motifs determined to be unique to S. Senftenberg ATCC 43845, as determined by REBASE.

The GATC motif cannot be assigned unambiguously, but BFF41_02205 was assigned as the most likely candidate.

GATC methylation for pSSE ATCC-43845. The map of plasmid pSSE ATCC-43845 is shown, with the thermotolerance locus islands indicated as orange regions. Base modification signals generated by BaseModFunctions v2.1.R and Circos are overlaid on the plasmid map. Qmod values are height proportional to the interpulse distance ratio for each of the modified bases on the positive strand (red outmost track) and negative strand (red innermost track). Blue hash marks correspond to the GATC motifs present on the plasmid. Download FIG S1, PDF file, 2.2 MB. CAGAG methylation for pSSE ATCC-43845. The map of plasmid pSSE ATCC-43845 is shown, with the thermotolerance locus islands indicated as orange regions. Base modification signals generated by BaseModFunctions.v.2.1.R and Circos are overlaid on the plasmid map. Qmod values are height proportional to the interpulse distance ratio for each of the modified bases on the positive strand (red outmost track) and negative strand (red innermost track). Green hash marks correspond to the CAGAG motifs present on the plasmid. Download FIG S2, PDF file, 1.6 MB. Base modification summary table Modified adenine bases are shown in bold type; cognate bases are shown in bold font. Motifs are assigned to their respective putative methylases (methylase [MTase] ORFs). Numbers separated by double slashes indicate the values for motifs detected on the positive and negative strand shown before and after the double slashes, respectively, if known. The type and subtype refer to the class of methylase. Coverage indicates the depth of sequence at the minimum quality threshold set in the protocol. Motifs determined to be unique to S. Senftenberg ATCC 43845, as determined by REBASE. The GATC motif cannot be assigned unambiguously, but BFF41_02205 was assigned as the most likely candidate.

Comparison of TLPQC loci.

A phylogenetic analysis (see Materials and Methods) was performed to determine the relationship of the Salmonella heat tolerance islands to those in Klebsiella plasmids, Pseudomonas chromosomes, and other bacteria. Heat tolerance islands were identified by searching the literature (previous reports) or by searching for similarity of nucleotide sequence records in GenBank to representative TLPQC sequences by BLASTn (see Materials and Methods). A total of 91 heat tolerance islands, including 48 reported to be located on chromosomes, 42 reported to reside on plasmids, and one presumably on a plasmid (Table 2), as well as the two TLPQC islands from plasmid pSSE ATCC-43845, were included in the analysis. For each species, the thermotolerance island was manually curated from the reported sequence (Table 2). Previously, two clades (A and B) of heat tolerance islands, which were termed “locus of heat resistance” (LHR), were identified (2). We chose to use instead the nomenclature of TLPQC proposed and adopted by Lee et al. (7), since a number of ORFs within the LHR islands have been shown or predicted to encode protein quality control functions involved in surviving a range of stressors beyond heat, including antibiotic exposure, desiccation, and oxidative stress. However, we propose to amend the TLPQC classification by Lee et al. (7) and incorporate phylogenetic analysis from our work and Mercer et al. (2) for multiple clades (Fig. 5). Thus, the broader phylogeny incorporating the pSSE ATCC-43845 islands (Fig. 5) indicates three clades, TLPQC-1, -2, and -3, with the third clade comprised of a reduced TLPQC island primarily observed in plasmids of Klebsiella. There is high genetic conservation within clades and less genetic conservation between clades, and genetic synteny is maintained across multiple genera (Fig. 1 and 5). The two pSSE ATCC-43845 islands fall within clades 1 and 2, consistent with the names assigned here, and are no more closely related to other plasmid-borne islands than to chromosome-borne islands.
TABLE 2 

Strain information for the sequence data used in the phylogenetic analysis of TLPQC loci in this study

SpeciesStrain or plasmidAccession no.TLPQCaLocation
Achromobacter xylosoxidansFDAARGOS_162CP0140651Chromosome
MN001CP0120461Chromosome
Alicycliphilus denitrificansK601CP0026571Chromosome
Burkholderia multivoransATCC 17616CP0008681Chromosome
Burkholderia multivorans chromosome 1ATCC BAA-247CP0098321Chromosome
Cupriavidus gilardii chromosome 1CR3CP0105161Chromosome
Dechlorosoma suillumPSCP0031531Chromosome
Desulfovibrio alaskensisG20CP0001121Chromosome
Klebsiella pneumoniaeJ1 plasmid 1CP0137121Plasmid
Marinobacter aquaeoleiVT8CP0005141Chromosome
Methylobacillus flagellatusKTCP0002841Chromosome
Pandoraea apistaAU2161CP0115011Chromosome
DSM 16535CP0134811Chromosome
TF80G25CP0112791Chromosome
TF81F4CP0105181Chromosome
Pantoea sp.PSNIH2CP0098661Chromosome
Pseudomonas aeruginosaATCC 27853CP0151171Chromosome
F30658CP0088571Chromosome
F9670CP0088731Chromosome
PA38182HG5300681Chromosome
S04 90CP0113691Chromosome
S86968CP0088651Chromosome
T38079CP0088661Chromosome
W36662CP0088701Chromosome
W45909CP0088711Chromosome
W60856CP0088641Chromosome
Carb01 63CP0113171Chromosome
Pseudomonas balearicaDSM 6083CP0075111Chromosome
Pseudomonas pseudoalcaligenesCECT 5344HG9168261Chromosome
Pseudomonas resinovoransNBRC 106553AP0130681Chromosome
Pseudomonas sp.VLB120CP0039611Chromosome
Pseudomonas stutzeriCGMCC 1.1803CP0028811Chromosome
DSM 4166CP0026221Chromosome
Ralstonia mannitolilytica chromosome 1SN82F48CP0107991Chromosome
Ralstonia pickettii chromosome 112DCP0016441Chromosome
DTP0602CP0066671Chromosome
Salmonella enterica SenftenbergpSSE ATCC-43845CP0168381Plasmid
Vibrio parahaemolyticus chromosome 1UCM-V493CP0070041Chromosome
Cronobacter malonaticusLMG 23826CP0139402Chromosome
CMCC 45402CP0067312Chromosome
Cronobacter sakazakiiATCC 29544CP0110472Chromosome
Enterobacter asburiaeCAV1043 pCAV1043-51CP0115872Plasmid
Enterobacter cloacaeECNIH5 pENT-22eCP0098552Plasmid
ECR091 pENT-4bdCP0089072Plasmid
Escherichia coliP12bCP0022912Chromosome
Klebsiella oxytocapKO_JKo3_1 DNAAP0149522Plasmid
Klebsiella pneumoniaeCAV1193 pCAV1193-258CP0133232Plasmid
CAV1344 pCAV1344-250CP0116232Plasmid
KPNIH39 pKPN-332CP0147632Plasmid
pKPN_CZJX4244242Plasmid
Klebsiella pneumoniae subsp. pneumoniaeKPNIH27 pKPN-262CP0077342Plasmid
Obesumbacterium proteusDSM 2777CP0146082Chromosome
Salmonella enterica HavanaCFSAN024771JWQI01000019.12Unknown
Salmonella enterica SenftenbergpSSE ATCC-43845CP0168382Plasmid
Yersinia enterocolitica(Type O:5) YE53/03HF5719882Chromosome
FORC-002CP0094562Chromosome
Cronobacter sakazakiiSP291CP0040913Chromosome
NCTC 8155CP0122533Chromosome
Enterobacter asburiaeCAV1043CP0115913Chromosome
Enterobacter cloacae complex chromosome 135734CP0121623Chromosome
Enterobacter cloacae subsp. cloacaeATCC 13047CP0019183Chromosome
Klebsiella pneumoniae207M1D0 KPN207_p3LT2164393Plasmid
30660/NJST258_1 pNJST258N1CP0069273Plasmid
32192pCP0105743Plasmid
34618 p34618CP0103933Plasmid
500_1420 p500_1420CP0119813Plasmid
BK32179 pBK32179JX4304483Plasmid
CAV1392 pCAV1392-131CP0115773Plasmid
DMC1097 pDMC1097CP0119773Plasmid
JM45 p1CP0066573Plasmid
KP-1 pKP1-19CP0128843Plasmid
KpN01 pKpN01-SILCP0129893Plasmid
KpN06 pKpN06-SILCP0129943Plasmid
Kpn555 pKPN-d90CP0151323Plasmid
KPNIH36 pKPN-fffCP0146493Plasmid
O6CO7 pIT-6C07LT0096883Plasmid
pKN-LS6JX4429743Plasmid
pKp848CTXLM9947173Plasmid
Plasmid 1CP0158233Plasmid
PMK1 PMK1-ACP0089303Plasmid
pUUH239.2CP0024743Plasmid
ST15 pKP02022KF7199723Plasmid
ST23 pKP007KF7199713Plasmid
ST258 pKPN-ITJN2337043Plasmid
ST48 pKP09085KF7199703Plasmid
U25 PU25001KT2032863Plasmid
UHKPC07 pUHKPC07CP0119873Plasmid
UHKPC33 pUHKPC33CP0119903Plasmid
Klebsiella pneumoniae subsp. pneumoniaeKPNIH10 pKPN-498CP0077293Plasmid
KPNIH24 pKPN-e44CP0088003Plasmid
KPNIH32 pKPN-a68CP0097773Plasmid
KPX pKPX-1AP0120553Plasmid
MGH 78578 pKPN3CP0006483Plasmid

The TLPQC-1, TLPQC-2, and TLPQC-3 loci are indicated by 1, 2, and 3, respectively, in the table.

FIG 5 

Genomic comparison of thermotolerance islands. MrBayes phylogeny tree reveals three different clades for TLPQC-1 (red branches), TLPQC-2 (blue branches), and TLPQC-3 (green branches). Graphic representations of representatives indicated in bold along the tree are shown to the right in the inset to illustrate conservation, with gray connections between the loci indicating sequence homology. The thermotolerance islands of Proteobacteria show high degrees of conservation, indicating an origin of horizontal gene transfer. Very high genetic conservation is observed within the clade, with less conservation between clades. The truncated cardiolipin synthase and FtsH CDS is conserved across all members of TLPQC-2. Strains in the inset are as follows: Methylobacillus flagellatus (MF), S. Senftenberg TLPQC-1 (SS1), Cronobacter sakazakii (CS), S. Senftenberg TLPQC-2 (SS2), Klebsiella pneumoniae (KP), and Enterobacter asburiae (EA). The functions of the genes are indicated by the following letters (and colors): A, small heat shock proteins (dark blue); B, Clp protease (green); C, cardiolipin synthase (teal); D, FtsH protease (pink); E, YfdX family protein (magenta); F, thioredoxin (black); G, KefB glutathione-regulated potassium efflux pump (purple); H, conserved protein with GGDEF domain (yellow); I, Zn-dependent protease (red); J, periplasmic serine protease (dark green). CDS in orange indicate hypothetical proteins. Black stars indicate TLPQCs that are found within bioinformatically identified integrative conjugative elements.

Strain information for the sequence data used in the phylogenetic analysis of TLPQC loci in this study The TLPQC-1, TLPQC-2, and TLPQC-3 loci are indicated by 1, 2, and 3, respectively, in the table. Genomic comparison of thermotolerance islands. MrBayes phylogeny tree reveals three different clades for TLPQC-1 (red branches), TLPQC-2 (blue branches), and TLPQC-3 (green branches). Graphic representations of representatives indicated in bold along the tree are shown to the right in the inset to illustrate conservation, with gray connections between the loci indicating sequence homology. The thermotolerance islands of Proteobacteria show high degrees of conservation, indicating an origin of horizontal gene transfer. Very high genetic conservation is observed within the clade, with less conservation between clades. The truncated cardiolipin synthase and FtsH CDS is conserved across all members of TLPQC-2. Strains in the inset are as follows: Methylobacillus flagellatus (MF), S. Senftenberg TLPQC-1 (SS1), Cronobacter sakazakii (CS), S. Senftenberg TLPQC-2 (SS2), Klebsiella pneumoniae (KP), and Enterobacter asburiae (EA). The functions of the genes are indicated by the following letters (and colors): A, small heat shock proteins (dark blue); B, Clp protease (green); C, cardiolipin synthase (teal); D, FtsH protease (pink); E, YfdX family protein (magenta); F, thioredoxin (black); G, KefB glutathione-regulated potassium efflux pump (purple); H, conserved protein with GGDEF domain (yellow); I, Zn-dependent protease (red); J, periplasmic serine protease (dark green). CDS in orange indicate hypothetical proteins. Black stars indicate TLPQCs that are found within bioinformatically identified integrative conjugative elements. Members of the TLPQC-1 clade are broadly distributed within the Proteobacteria (Fig. 5) with examples found in environmental isolates such as Methylobacillus flagellatus and Dechlorosoma suillum (32, 33), as well as within clinically relevant bacteria such as Pseudomonas, Pandorea, and Burkholderia (8). The majority of TLPQC-1 were found to be chromosomally located, with the exception of S. Senftenberg TLPQC-1 and another on plasmid 1 of Klebsiella pneumoniae J1 (GenBank accession number CP013712). Comparative analyses of these chromosomally bound TLPQCs suggest that they are inserted into larger mobile genetic elements (data not shown). TLPQC-1 genes in strains marked with black stars (Fig. 5) indicate that the loci are found within an integrative conjugative element (ICE) that is integrated into tRNAGly. Attachment repeats flanking the putative ICE were identified with the core att nucleotide sequence 5′ TGGAGCGGGCGATGGGAA. Members of the TLPQC-1 lineage contain the longest islands of the three lineages (~15-kbp average), with core genes predicted to encode a phage-like transcriptional regulator, two small heat shock protein variants, a novel Clp protease, cardiolipin synthase, the membrane-associated ATP-dependent protease FtsH, two variants of YfdX protein family with potential chaperone activity, thioredoxin, a glutathione regulated K+ efflux pump related to KefB, a phosphate starvation-inducible PsiE family protein, a Zn-dependent protease, and a periplasmic serine protease (7) (Fig. 5). Although most of the ORFs in TLPQC-1 have not been experimentally verified, one of the small heat shock proteins, sHsp20c, was recently characterized and found to aggregate in a 24-mer that plays a role in ameliorating stress resistance as a protein chaperone in Pseudomonas (8). Additionally, the novel Clp protease, ClpGG1, has been shown to confer thermal resistance in Cronobacter, Escherichia, Pseudomonas, and other Enterobacteriaceae (3, 8). The rest of the ORFs in TLPQC-1 have not been studied in depth; however, it has been shown that the full TLPQC is needed for the greatest thermal protection (1). The conservation of TLPQC and protein domains within the uncharacterized ORFs to known proteases, chaperones, and other quality control proteins suggests that these additional ORFs may contribute to survival in other adverse conditions such as osmotic shock or desiccation resistance. The second lineage observed, TLPQC-2, is the best-studied TLPQC thus far, having first been identified in Cronobacter sakazakii contaminants of powdered infant formula (1). Islands in this lineage are shorter than TLPQC-1 (~14 kb) primarily because of the truncation of genes predicted to encode cardiolipin synthase (cls) and FtsH (hflB). Although these truncated ORFs are conserved across various TLPQC-2s, it is unknown whether they are transcribed and expressed or what function they might serve. TLPQC-2 islands were observed on both chromosomal and plasmid locations and were predominantly found in members of the Enterobacteriaceae, including Escherichia, Obesumbacterium, Klebsiella, Cronobacter, Salmonella, and Yersinia (Fig. 5) (1–3). Bacterial species that possess TLPQC-2 are of great interest to food industry and human health communities, as many are relevant to food safety such as Obesumbacterium proteus as brewery yeast contaminants (34) and C. sakazakii as infant formula contaminants (1). Given that a number of TLPQC-2 loci are found on plasmids, the potential exists for transmission and subsequent conferring of heat resistance to foodborne pathogens. In S. Senftenberg, the presence of the two TLPQCs on a multireplicon conjugative plasmid highlights the potentially mobile nature of these environmental stress resistance determinants. The third lineage, TLPQC-3, is the shortest of the three loci (mean length of 6.5 kb), and composed only of the ORFs that encode a heat shock protein, the Clp protease, a truncated version of cardiolipin synthase, the Zn-dependent protease, and the periplasmic serine protease (Fig. 5). Found primarily within Klebsiella pneumoniae plasmids, this TLPQC has been associated with the emerging multidrug-resistant K. pneumoniae ST16 clone (5, 35). The Clp protease in TLPQC-3 has been characterized and was named ClpK, sharing 98.3% amino acid identity with the pSSE ATCC-43845 ClpGG1 (3). The high degree of conservation of the core elements within these islands underscores their apparent importance to host fitness.

Conclusions.

We describe the presence of two broadly disseminated but uncommon TLPQCs present in the important pathogen Salmonella enterica. The discovery of two TLPQCs in S. Senftenberg ATCC 43845 underscores its uniqueness and suggests a likely mechanism for the unusually thermotolerant phenotype repeatedly observed for this strain (9–13). As this strain of S. Senftenberg has been maintained in collections since at least 1941, S. Senftenberg ATCC 43845 is also one of the oldest known isolates to harbor TLPQCs. The presence of TLPQC-1 and TLPQC-2 on a single plasmid raises questions about the selective pressures and genetic events that led to their evolution. As the original source of S. Senftenberg ATCC 43845 was powdered eggs, it is tempting to suggest a correlation between the acquisition of the TLPQC loci and exposure to conditions likely encountered in the production of powdered egg. Regardless of how it evolved, the complete genome sequence and methylome data of this unusually thermotolerant strain, as well as the phylogenetic analysis of these broadly distributed islands conferring enhanced mechanisms of protein maintenance, will facilitate future studies on the role of TLPQC loci in extreme heat and desiccation resistance within Gram-negative bacteria important to human health and food safety. In addition, the complete sequence of an IncHI2 R478-like plasmid isolated at the dawn of the era of therapeutic antibiotic use will aid in furthering our understanding of the evolution of resistance plasmids.

MATERIALS AND METHODS

DNA isolation and genome sequencing.

S. Senftenberg ATCC 43845 was acquired from ATCC and cultured statically at 37°C in Trypticase soy broth (Becton, Dickinson, Franklin Lakes, NJ) for 18 to 20 h. DNA was purified using Qiagen genomic-tip 100/G columns and the DNeasy blood and tissue DNA isolation kit (Qiagen, Valencia, CA) using the manufacturer’s recommended protocol. Sequencing libraries were prepared following the recommended Pacific Bioscience protocols. Single molecule real-time sequencing (SMRT) was performed using P6/C4 chemistry on a Pacific Bioscience (PacBio) RS II instrument (Pacific Biosciences, Menlo Park CA), resulting in average subreads of >7 kbp and mean coverage of 172.59×. The HGAP3 protocol in smrtanalysis v2.3 was used to assemble sequence reads and polish the contig. To circularize the contigs, a self/self dot plot of the contig sequences was generated in Geneious 9.1.5 (Biomatters Ltd., New Zealand) and used to identify duplicated sequence at the contig ends. The duplicated sequence was trimmed to generate a circularized sequence (36). OriFinder was used to determine the origin of replication and to reset base position 1 of the chromosome. The position of circularization was confirmed by mapping all the reads to the renumbered contig using the resequencing protocol in smrtanalysis v2.3, which also provided a second round of polishing via the included quiver routine. Genome and plasmid sequence data were annotated using the NCBI Prokaryotic Genome Annotation Pipeline and deposited into GenBank (GenBank accession numbers CP016837 and CP016838, respectively).

Genome analysis.

The sequences of the thermoresistance cluster originally identified in C. sakazakii (GenBank accession number FR714908) (1) and the thermotolerance loci in S. Senftenberg were used to identify related islands in the GenBank nr database using BLASTn. Genomes with strong similarity to these sequences were identified from 20 genera, and genome sequences were used to extract 91 additional thermotolerance islands by manual inspection to determine the island boundaries (Table 2). Sequence inspection and manipulation were performed in Geneious vR9.1.5. Manual inspection included BLAST comparisons of flanking mobile element genes, which revealed little homology between the different TLPQCs so these flanking sequences were trimmed and discarded to define the boundaries of the TLPQC. The extracted TLPQCs were aligned by MAFFT (37) in Geneious 9.1.5, and the resulting aligned sequences were analyzed by jModelTest 2 to determine the parameters for MrBayes (36, 38, 39). The best model for the full-aligned sequence of the TLPQC was determined to be TRN+I+G (Tamura-Nei model with invariant sites and discrete gamma distribution) using the Akaike information criterion (40). MrBayes 3.2.6, utilized as a Geneious plug-in, was used to calculate a Bayesian inference tree of the various TLPQCs. As the TRN+I+G model is not possible in MrBayes 3.2.6, the GTR+I+G model (generalized time reversible model with invariant sites and discrete gamma distribution) was used, as overparameterizeration gives substantially less bias than underparameterizeration (41), and 3,000,000 iterations were performed with these defined parameters. The first 25% of the iterations were discarded as burn-in, as the Markov chains have not reached stationary and may alter the final result. Visualization of the tracer output and examining the standard deviation of the split frequencies were used to assess the quality of the tree. As the standard deviation of the split frequencies reached only <0.04, the chains have truly not converged but were acceptable, given the extensive computing resources to run the calculations. Additional maximum likelihood trees generated by PhyML (42) gave further support for the Bayesian consensus tree (not shown). Representatives of TLPQCs derived from the Bayesian phylogenetic tree were extracted and visualized in an Easyfig BLASTn comparison (43). CGView Server was used to visualize chromosome and plasmid sequence data with default tBLASTx settings (44).

Base modification analysis.

Genome analysis of nucleotide base modifications were detected using the RS_Modification Motif_Analysis.1 protocol in smrtanalysis v2.3 (Pacific Biosciences, Menlo Park, CA) with the default threshold quality value (QV) of 30. The resulting motif_summary.csv files were uploaded to the restriction enzyme database (REBASE) and assessed for the presence of novel restriction modification systems (30). Additionally, kinetic Qmod values [defined by the log-transformed P value from the t test, −10log(P value)] with a threshold value of 55 were extracted and processed by BaseModFunctions v2.1.R (45) for visualization in Circos (46). The threshold value of 55 was based on the bimodal distribution of modification values observed in the kinetic detection map. Base modification data have been uploaded to GenBank.

Accession number(s).

Accession numbers for the strains used in this study can be found in Table 2.
  42 in total

1.  Selection of models of DNA evolution with jModelTest.

Authors:  David Posada
Journal:  Methods Mol Biol       Date:  2009

2.  Thermal resistance of Salmonella senftenberg 775W in dry animal feeds.

Authors:  T S Liu; G H Snoeyenbos; V L Carlson
Journal:  Avian Dis       Date:  1969-08       Impact factor: 1.577

3.  A top-down proteomics approach for differentiating thermal resistant strains of Enterobacter sakazakii.

Authors:  Tracie L Williams; Steven R Monday; Sharon Edelson-Mammel; Robert Buchanan; Steven M Musser
Journal:  Proteomics       Date:  2005-11       Impact factor: 3.984

4.  Evolution of IncHI2 plasmids via acquisition of transposons carrying antibiotic resistance determinants.

Authors:  Amy K Cain; Ruth M Hall
Journal:  J Antimicrob Chemother       Date:  2012-02-14       Impact factor: 5.790

5.  The complete nucleotide sequence of the resistance plasmid R478: defining the backbone components of incompatibility group H conjugative plasmids through comparative genomics.

Authors:  Matthew W Gilmour; Nicholas R Thomson; Mandy Sanders; Julian Parkhill; Diane E Taylor
Journal:  Plasmid       Date:  2004-11       Impact factor: 3.466

6.  Genome of Methylobacillus flagellatus, molecular basis for obligate methylotrophy, and polyphyletic origin of methylotrophy.

Authors:  Ludmila Chistoserdova; Alla Lapidus; Cliff Han; Lynne Goodwin; Liz Saunders; Tom Brettin; Roxanne Tapia; Paul Gilna; Susan Lucas; Paul M Richardson; Mary E Lidstrom
Journal:  J Bacteriol       Date:  2007-04-06       Impact factor: 3.490

7.  Heat resistance mediated by a new plasmid encoded Clp ATPase, ClpK, as a possible novel mechanism for nosocomial persistence of Klebsiella pneumoniae.

Authors:  Martin Saxtorph Bojer; Carsten Struve; Hanne Ingmer; Dennis Schrøder Hansen; Karen Angeliki Krogfelt
Journal:  PLoS One       Date:  2010-11-09       Impact factor: 3.240

8.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

9.  Phylogenetic diversity of the enteric pathogen Salmonella enterica subsp. enterica inferred from genome-wide reference-free SNP characters.

Authors:  Ruth E Timme; James B Pettengill; Marc W Allard; Errol Strain; Rodolphe Barrangou; Chris Wehnes; Joann S Van Kessel; Jeffrey S Karns; Steven M Musser; Eric W Brown
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

10.  PHASTER: a better, faster version of the PHAST phage search tool.

Authors:  David Arndt; Jason R Grant; Ana Marcu; Tanvir Sajed; Allison Pon; Yongjie Liang; David S Wishart
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

View more
  13 in total

1.  Salmonella enterica subsp. enterica Serovar Heidelberg Food Isolates Associated with a Salmonellosis Outbreak Have Enhanced Stress Tolerance Capabilities.

Authors:  Andrea J Etter; Alyssa M West; John L Burnett; Sophie Tongyu Wu; Deklin R Veenhuizen; Raeya A Ogas; Haley F Oliver
Journal:  Appl Environ Microbiol       Date:  2019-08-01       Impact factor: 4.792

2.  Functional Analysis of Genes Comprising the Locus of Heat Resistance in Escherichia coli.

Authors:  Ryan Mercer; Oanh Nguyen; Qixing Ou; Lynn McMullen; Michael G Gänzle
Journal:  Appl Environ Microbiol       Date:  2017-09-29       Impact factor: 4.792

3.  Stand-alone ClpG disaggregase confers superior heat tolerance to bacteria.

Authors:  Changhan Lee; Kamila B Franke; Shady Mansour Kamal; Hyunhee Kim; Heinrich Lünsdorf; Jasmin Jäger; Manfred Nimtz; Janja Trček; Lothar Jänsch; Bernd Bukau; Axel Mogk; Ute Römling
Journal:  Proc Natl Acad Sci U S A       Date:  2017-12-20       Impact factor: 11.205

4.  Genetic Characteristics of the Transmissible Locus of Stress Tolerance (tLST) and tLST Harboring Escherichia coli as Revealed by Large-Scale Genomic Analysis.

Authors:  Peipei Zhang; Xianqin Yang
Journal:  Appl Environ Microbiol       Date:  2022-03-14       Impact factor: 5.005

5.  Isolation and genomic characterization of a pathogenic Providencia rettgeri strain G0519 in turtle Trachemys scripta.

Authors:  Minghao Ye; Xiucai Hu; Aijun Lü; Jingfeng Sun; Chengxun Chen
Journal:  Antonie Van Leeuwenhoek       Date:  2020-09-20       Impact factor: 2.271

6.  Genetic Determinants of Stress Resistance in Desiccated Salmonella enterica.

Authors:  Zhiying Wang; Tongbo Zhu; Zhao Chen; Jianghong Meng; David J Simpson; Michael G Gänzle
Journal:  Appl Environ Microbiol       Date:  2021-09-29       Impact factor: 4.792

7.  Complete Genome Sequence of the Novel Virulent Phage PMBT28 with Lytic Activity against Thermotolerant Salmonella enterica subsp. enterica Serovar Senftenberg ATCC 43845.

Authors:  Sabrina Koberg; Erik Brinks; Viktoria Albrecht; Horst Neve; Charles M A P Franz
Journal:  Genome Announc       Date:  2018-06-28

Review 8.  Horizontal Transmission of Stress Resistance Genes Shape the Ecology of Beta- and Gamma-Proteobacteria.

Authors:  Shady Mansour Kamal; David J Simpson; Zhiying Wang; Michael Gänzle; Ute Römling
Journal:  Front Microbiol       Date:  2021-07-06       Impact factor: 5.640

9.  Why? - Successful Pseudomonas aeruginosa clones with a focus on clone C.

Authors:  Changhan Lee; Jens Klockgether; Sebastian Fischer; Janja Trcek; Burkhard Tümmler; Ute Römling
Journal:  FEMS Microbiol Rev       Date:  2020-11-24       Impact factor: 16.408

10.  Research Note: Repetitive element-based polymerase chain reaction genotyping improves efficiency of Salmonella surveillance in a model broiler production system.

Authors:  G K Walker; M M Suyemoto; L B Borst; J Brake
Journal:  Poult Sci       Date:  2020-03-11       Impact factor: 4.014

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.