A K M Firoj Mahmud1, Kristina Nilsson1, Anna Fahlgren1, Roberto Navais1, Rajdeep Choudhury1, Kemal Avican1, Maria Fällman2. 1. Department of Molecular Biology, Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå Centre for Microbial Research (UCMR), Umeå University, Umeå, Sweden. 2. Department of Molecular Biology, Laboratory for Molecular Infection Medicine Sweden (MIMS), Umeå Centre for Microbial Research (UCMR), Umeå University, Umeå, Sweden maria.fallman@umu.se.
Abstract
RpoN, an alternative sigma factor commonly known as σ54, is implicated in persistent stages of Yersinia pseudotuberculosis infections in which genes associated with this regulator are upregulated. We here combined phenotypic and genomic assays to provide insight into its role and function in this pathogen. RpoN was found essential for Y. pseudotuberculosis virulence in mice, and in vitro functional assays showed that it controls biofilm formation and motility. Mapping genome-wide associations of Y. pseudotuberculosis RpoN using chromatin immunoprecipitation coupled with next-generation sequencing identified an RpoN binding motif located at 103 inter- and intragenic sites on both sense and antisense strands. Deletion of rpoN had a large impact on gene expression, including downregulation of genes encoding proteins involved in flagellar assembly, chemotaxis, and quorum sensing. There were also clear indications of cross talk with other sigma factors, together with indirect effects due to altered expression of other regulators. Matching differential gene expression with locations of the binding sites implicated around 130 genes or operons potentially activated or repressed by RpoN. Mutagenesis of selected intergenic binding sites confirmed both positive and negative regulatory effects of RpoN binding. Corresponding mutations of intragenic sense sites had less impact on associated gene expression. Surprisingly, mutating intragenic sites on the antisense strand commonly reduced expression of genes carried by the corresponding sense strand.IMPORTANCE The alternative sigma factor RpoN (σ54), which is widely distributed in eubacteria, has been implicated in controlling gene expression of importance for numerous functions including virulence. Proper responses to host environments are crucial for bacteria to establish infection, and regulatory mechanisms involved are therefore of high interest for development of future therapeutics. Little is known about the function of RpoN in the intestinal pathogen Y. pseudotuberculosis, and we therefore investigated its regulatory role in this pathogen. This regulator was indeed found to be critical for establishment of infection in mice, likely involving its requirement for motility and biofilm formation. The RpoN regulon involved both activating and suppressive effects on gene expression which could be confirmed with mutagenesis of identified binding sites. This is the first study of its kind of RpoN in Y. pseudotuberculosis, revealing complex regulation of gene expression involving both productive and silent effects of its binding to DNA, providing important information about RpoN regulation in enterobacteria.
RpoN, an alternative sigma factor commonly known as σ54, is implicated in persistent stages of Yersinia pseudotuberculosis infections in which genes associated with this regulator are upregulated. We here combined phenotypic and genomic assays to provide insight into its role and function in this pathogen. RpoN was found essential for Y. pseudotuberculosis virulence in mice, and in vitro functional assays showed that it controls biofilm formation and motility. Mapping genome-wide associations of Y. pseudotuberculosis RpoN using chromatin immunoprecipitation coupled with next-generation sequencing identified an RpoN binding motif located at 103 inter- and intragenic sites on both sense and antisense strands. Deletion of rpoN had a large impact on gene expression, including downregulation of genes encoding proteins involved in flagellar assembly, chemotaxis, and quorum sensing. There were also clear indications of cross talk with other sigma factors, together with indirect effects due to altered expression of other regulators. Matching differential gene expression with locations of the binding sites implicated around 130 genes or operons potentially activated or repressed by RpoN. Mutagenesis of selected intergenic binding sites confirmed both positive and negative regulatory effects of RpoN binding. Corresponding mutations of intragenic sense sites had less impact on associated gene expression. Surprisingly, mutating intragenic sites on the antisense strand commonly reduced expression of genes carried by the corresponding sense strand.IMPORTANCE The alternative sigma factor RpoN (σ54), which is widely distributed in eubacteria, has been implicated in controlling gene expression of importance for numerous functions including virulence. Proper responses to host environments are crucial for bacteria to establish infection, and regulatory mechanisms involved are therefore of high interest for development of future therapeutics. Little is known about the function of RpoN in the intestinal pathogen Y. pseudotuberculosis, and we therefore investigated its regulatory role in this pathogen. This regulator was indeed found to be critical for establishment of infection in mice, likely involving its requirement for motility and biofilm formation. The RpoN regulon involved both activating and suppressive effects on gene expression which could be confirmed with mutagenesis of identified binding sites. This is the first study of its kind of RpoN in Y. pseudotuberculosis, revealing complex regulation of gene expression involving both productive and silent effects of its binding to DNA, providing important information about RpoN regulation in enterobacteria.
During infection of a host, bacteria are exposed to rapid changes in the environment, such as changes in temperature, pH, osmolarity, and nutrient levels, and immune cell attacks. Bacteria usually cope with these types of changes through stress responses, alterations of their gene expression that are adaptive to the new environment (1). This type of infection-associated transcriptomic reprogramming was obvious in a previous in vivo transcriptomic study of Yersinia pseudotuberculosis isolated from cecal lymphoid compartments of infectedmice (2). In that model, plasmid-carried virulence genes known to be necessary for tissue invasion and resistance toward initial attacks from phagocytes were highly expressed during the early phase of the infection. After about one-and-a-half months of symptomless infection, the expression pattern had changed so that genes encoding proteins involved in adaption and resistance to different types of stresses dominated, while expression of the plasmid-carried virulence genes was considerably reduced. This highlights the importance for bacteria of both adapting to new environments and the regulatory mechanisms involved. Hence, increasing our understanding of bacterial function during infection is of great interest. Mechanisms of bacterial adaptation inform choices of potential targets for new antibiotics, with gene products required to maintain infections considered more promising than those of classical virulence genes.Transcriptional reprogramming is commonly controlled by various transcriptional regulators that are activated in response to external signals. A major class of transcriptional regulators are sigma factors, which upon activation associate with the core RNA polymerase (RNAP), promoting its binding to specific initiation sites and subsequent open complex formation for transcription of downstream genes (3). There are different types of sigma factors in bacteria, where RpoD or sigma 70 (σ70) is the primary and housekeeping sigma factor active during exponential growth (4–6). Other alternative sigma factors such as RpoE, RpoS, RpoH, and RpoN, which recognize promoter sequences distinct from that of RpoD, regulate transcription under specific conditions, allowing expression of genes required for the bacteria to cope with and adapt to particular situations (5). One of the sigma factors that attracted interest during the analysis of data from our previous in vivo transcriptomic analysis of Y. pseudotuberculosis was RpoN or sigma 54 (σ54), which together with many of its associated proteins, including activators and modulating proteins, was upregulated during the persistent stages of infection, when the expression of genes important for adaptation to the tissue environment dominated (2). RpoN has been reported to control regulation of genes involved in nitrogen metabolism, flagella, and motility, but biofilm formation and quorum sensing can also be affected in rpoN mutant strains (7–10). In some species, RpoN also influences regulation of type III and type VI secretion (8, 11–13), and there are many reports implicating RpoN as a regulator of bacterial virulence (9, 14, 15).RpoN is structurally and functionally distinct from other sigma factors in that transcription initiation commonly depends on its binding to activating proteins termed bacterial enhancer-binding proteins (EBPs) (6). These EBPs use ATP catalysis to remodel RNAP DNA binding to initiate transcription (16). The affinity of RpoN for the core RNAP is higher than that of most other RpoD-related alternative sigma factors, allowing it to compete efficiently for RNAP binding. Regulation by RpoN can be either direct or indirect via activation of different positive or negative regulators, including other sigma factors (5, 17, 18). Compared with other sigma factors, direct cross talk whereby different sigma factors regulate the same gene is particularly high for RpoN and is also commonly seen for genes that encode proteins involved in complex processes with different levels of regulation, such as adaption, chemotaxis, adhesion, and protein secretion (19). Adding to the complexity is the variation between different bacteria in the specific signals regulating RpoN and the specific downstream outcomes. One example is biofilm formation, where a deletion of the rpoN gene results in severe effects on the capacity to form biofilms in many bacteria but where the opposite is seen in some other bacteria (20, 21).This study investigated the regulatory role of RpoN in Y. pseudotuberculosis, which had not previously been addressed in detail. Neither has the RpoN regulon been defined for this pathogen. The results revealed that RpoN is crucial in Y. pseudotuberculosis to establish infection and is required for biofilm formation and motility. Chromatin immunoprecipitation coupled with next-generation sequencing (ChIP-seq) was used to determine genome-wide binding and revealed more than 100 RpoN binding sites with both inter- and intragenic locations. Transcriptomic data from bacteria lacking rpoN implied a complex regulatory network with direct or indirect effects. Matching the locations of ChIP peaks with transcriptomic data allowed retrieval of more than 130 genes potentially regulated by RpoN, some novel and some known from previous studies. Mutagenesis of selected RpoN binding sites confirmed both activating and suppressive roles of upstream intergenic RpoN binding. This was not seen for sites of intragenic binding to the sense strand. In contrast, mutation of RpoN binding motifs on the antisense strand commonly resulted in suppressed expression of the gene on the sense strand, implicating a novel regulatory mechanism.
RESULTS AND DISCUSSION
RpoN participates in regulation of Y. pseudotuberculosis biofilm formation and motility and is required for virulence.
To reveal the importance of RpoN in Y. pseudotuberculosis virulence, an rpoN deletion mutant strain was constructed to be used in mouseinfection studies. The resulting ΔrpoN strain was tested for eventual defects in growth at 26°C reaching stationary phase and during virulence induction at 37°C and also tested for acid sensitivity (see Fig. S1A and B in the supplemental material). The mutant strain was here found to grow and tolerate acid to the same extent as the wild-type (WT) strain, excluding possible negative effects on the ability of the mutant to reach and proliferate at the infection site caused by the passage through the acidic environment in the stomach. The halted growth that occurs when shifting to 37°C and depleting extracellular Ca2+ is a known consequence of increased expression of the virulence plasmid (22), which was seen also for the ΔrpoN strain. In accordance with prior results using this acute virulence model of Y. pseudotuberculosis oral infection, the WT strain initially colonized all infectedmice, and all showed clear signs of disease and succumbed at day 5 to 7 postinfection (Fig. 1A). The ΔrpoN strain, on the other hand, had colonized all mice to a limited extent 1 day after oral infection and showed increased colonization at day 3, but then the infection declined, and the majority were cleared at day 14 to 21. At 28 days postinfection (dpi), all mice had cleared the infection (Fig. 1B and Fig. S1C). None of the miceinfected with the ΔrpoN strain showed signs of disease during the infection period, indicating clear attenuation. We also employed the Y. pseudotuberculosis infection model for persistent infection of the cecal lymphoid compartment, where a fraction of miceinfected with low doses of the WT strain develop acute disease and another fraction carry bacteria in cecal tissue for prolonged times without showing symptoms of disease (23). Upon low-dose oral infection with the ΔrpoN strain, only 75% of infectedmice were initially colonized compared with miceinfected with the corresponding WT strain, all of which were colonized (Fig. S1C). Further, none of the ΔrpoN strain-infectedmice developed acute infection, whereas the fraction causing symptomless persistent infection was almost similar to that of the WT strain (Fig. S1C), suggesting a role for this regulator during the early stages of infection. We next tested the ΔrpoN strain in different phenotypic assays and found that the mutant strain was deficient in biofilm formation and motility (Fig. 1C and D). The biofilm and motility phenotypes could be complemented by expressing RpoN in trans (ΔrpoN/prpoN). Hence, it is obvious that the ΔrpoN mutant strain has limited functional capacity that likely contributes to the observed attenuation in virulence. Together, these data indicate a pivotal role for RpoN in Y. pseudotuberculosis virulence.
FIG 1
RpoN is required for Y. pseudotuberculosis virulence and essential for motility and biofilm formation. (A) Female FVB/N mice (n = 6) were orally infected with 4 × 108 CFU/ml of Y. pseudotuberculosis WT (dashed line) or the isogenic ΔrpoN mutant strain (solid line). Data are presented as percent live mice during the course of infection (0 to 28 days). (B) IVIS in vivo images showing anesthetized mice during indicated days of infection. Pseudocolors show levels of light emitted by bioluminescent bacteria. Mice infected with Y. pseudotuberculosis WT became sick at day 5 to 7 and were sacrificed. (C) Biofilm formation by Y. pseudotuberculosis WT and corresponding strains lacking the rpoN gene (ΔrpoN), ΔrpoN complemented in trans with a plasmid expressing RpoN (ΔrpoN/prpoN), V5-tagged rpoN in the chromosome under its native promoter (rpoN::V5), and ΔrpoN complemented in trans with a plasmid expressing V5-tagged rpoN (ΔrpoN/prpoN::V5). Biofilm mass was determined by dissolving biofilm material and measuring the OD590. Data are presented as means ± SEM for six individual experiments. (D) Motility in agarose by the same strains as indicated in panel C. One representative experiment of three replicates is shown.
RpoN is required for Y. pseudotuberculosis virulence and essential for motility and biofilm formation. (A) Female FVB/N mice (n = 6) were orally infected with 4 × 108 CFU/ml of Y. pseudotuberculosis WT (dashed line) or the isogenic ΔrpoN mutant strain (solid line). Data are presented as percent live mice during the course of infection (0 to 28 days). (B) IVIS in vivo images showing anesthetized mice during indicated days of infection. Pseudocolors show levels of light emitted by bioluminescent bacteria. Miceinfected with Y. pseudotuberculosis WT became sick at day 5 to 7 and were sacrificed. (C) Biofilm formation by Y. pseudotuberculosis WT and corresponding strains lacking the rpoN gene (ΔrpoN), ΔrpoN complemented in trans with a plasmid expressing RpoN (ΔrpoN/prpoN), V5-tagged rpoN in the chromosome under its native promoter (rpoN::V5), and ΔrpoN complemented in trans with a plasmid expressing V5-tagged rpoN (ΔrpoN/prpoN::V5). Biofilm mass was determined by dissolving biofilm material and measuring the OD590. Data are presented as means ± SEM for six individual experiments. (D) Motility in agarose by the same strains as indicated in panel C. One representative experiment of three replicates is shown.High- and low-dose infection by Y. pseudotuberculosis WT and ΔrpoN in female FVB/N mice. (A) Growth curves for WT and ΔrpoN strains in LB at 26°C and during virulence induction at 37°C. (B) Acid sensitivity test. Overnight cultures were diluted to 108 CFU/ml and exposed to different pHs for 2 h. Tenfold serial dilutions were spotted on LA plates. (C) Mice (one experiment, n = 6) were orally infected with a mean of 4 × 108 CFU/ml for high-dose infection, and for low-dose infection, mice (one experiment: WT, n = 40, and ΔrpoN, n = 20) were infected with a mean of 0.8 × 106 CFU/ml. Mice that succumbed to acute infection were immediately sacrificed. The percentage of mice developing acute infection, the mice with symptom-free infection, and the fraction that cleared the infection were determined at 28 days after infection and related to total number of mice initially infected. Download FIG S1, PDF file, 1.4 MB.
RpoN binds to multiple sites on the Y. pseudotuberculosis chromosome.
Since previous studies of RpoN-mediated events have indicated variation in the regulatory network between different bacterial species, and since studies of RpoN in Y. pseudotuberculosis are limited, we set out to uncover the regulatory mechanisms involved. As a first step, we employed ChIP-seq to determine the RpoN DNA binding sites in Y. pseudotuberculosis. ChIP-seq strongly depends on an appropriate antibody, and we chose to fuse the C terminus of RpoN to the 3×V5 epitope (14 amino acids found in C-terminal sequence of the P and V proteins of simian virus 5), which has been shown to function in similar approaches in other bacteria (24). We generated a vector construct to overexpress RpoN-V5 (prpoN::V5) and also a strain expressing RpoN-V5 from its native promoter, by inserting the sequence for the V5 tag in the 3′ end of the rpoN gene (rpoN::V5). Repeating the assays for biofilm formation and motility and including the strains expressing V5-tagged RpoN (ΔrpoN/prpoN::V5 and the rpoN::V5 strains) showed that the V5 tag did not interfere with RpoN function (Fig. 1C and D). It is commonly assumed that RpoN levels and binding to RNAP and to DNA are relatively stable and that the EBPs play the regulatory role. However, our finding of differential expression of rpoN in vivo compared with its expression level in vitro (2), and the possibility of chromosomal structural changes influencing gene expression under certain conditions (2, 25, 26), prompted us to map the binding of RpoN in bacteria subjected to more than one condition. The conditions used were exponential growth at 26°C, where RpoN-V5 either is overexpressed in trans (prpoN::V5) or is expressed in cis under its native promoter (rpoN::V5), the latter to ensure proper stoichiometry and thereby avoid side effects of competition with other sigma factors. We also included samples subjected to virulence-inducing conditions, with a shift to 37°C and depletion of extracellular Ca2+ for 75 min using the rpoN::V5 strain.The resulting ChIP-seq data were subjected to a high-stringency bioinformatic analysis with a cutoff of a 2.5-fold difference over genomic noise. The analysis that was done for all samples individually identified totally 119 ChIP-seq peaks representing putative sites for RpoN binding. The number of peaks was in the range previously shown for Escherichia coli, Salmonella enterica serovar Typhimurium, and Vibrio cholerae (8, 27, 28). Some of the peaks were narrow and distinct, covering 200 to 300 nucleotides, whereas others were relatively broad and covered 300 to 800 nucleotides (see Table S1 in the supplemental material). All 119 predicted peaks were used to identify a common sequence motif in the ±50-bp regions of the peak center. We identified a motif that resembled the RpoN −24/12 promoter element found in other bacteria (27, 29, 30) (Fig. 2A). To determine motif strength, defined by level of enrichment of RpoN within a binding site, NN-GG-N9-TGC-NN was used as the base for position-weight matrix calculations. The identified motifs had PSSM (position-specific scoring matrix) scores ranging from 3 to 12, and the motif sequences were found to cover the peak center area (Fig. 2B). The fact that the majority of the predicted motifs are found around the peak center implies that they are genuine RpoN binding sites.
FIG 2
ChIP-seq identifies RpoN binding sites in Y. pseudotuberculosis genome. (A) RpoN consensus binding motif in Y. pseudotuberculosis derived from 119 ChIP-seq peaks, determined using BCRANK (E value = 1.8e−110). (B) Position of motifs relative to peak centers calculated using the Centrimo tool, MEME suite. The graph indicates the average density of motif position for all 119 motif-containing regions, using 10-bp bins from position −50 to +50 relative to the ChIP-seq peak. For identification of an RpoN consensus binding motif (A) and its position (B), all 3 ChIP-seq data sets (rpoN::V5 during logarithmic growth and virulence induction and prpoN::V5 during logarithmic growth) were used with a stringent cutoff of false-discovery rate (FDR) (1E−30). (C) Venn diagram showing the number of RpoN binding sites identified by ChIP sequencing of Y. pseudotuberculosis under different growth conditions. (D) Venn diagram showing number of RpoN binding sites with homologous genomic positions in Y. pseudotuberculosis, E. coli, and S. Typhimurium. Those genes which were found to be conserved based on function and phylogenetic distance ratios among all three bacteria and whose corresponding gene was reported (by a ChIP-seq study) to be regulated by RpoN were included in the Venn diagram. (E) Schematic illustration of four classes of RpoN binding sites based on their positions in relation to gene coding sequences. (F) Distribution of each class of RpoN binding site.
ChIP-seq identifies RpoN binding sites in Y. pseudotuberculosis genome. (A) RpoN consensus binding motif in Y. pseudotuberculosis derived from 119 ChIP-seq peaks, determined using BCRANK (E value = 1.8e−110). (B) Position of motifs relative to peak centers calculated using the Centrimo tool, MEME suite. The graph indicates the average density of motif position for all 119 motif-containing regions, using 10-bp bins from position −50 to +50 relative to the ChIP-seq peak. For identification of an RpoN consensus binding motif (A) and its position (B), all 3 ChIP-seq data sets (rpoN::V5 during logarithmic growth and virulence induction and prpoN::V5 during logarithmic growth) were used with a stringent cutoff of false-discovery rate (FDR) (1E−30). (C) Venn diagram showing the number of RpoN binding sites identified by ChIP sequencing of Y. pseudotuberculosis under different growth conditions. (D) Venn diagram showing number of RpoN binding sites with homologous genomic positions in Y. pseudotuberculosis, E. coli, and S. Typhimurium. Those genes which were found to be conserved based on function and phylogenetic distance ratios among all three bacteria and whose corresponding gene was reported (by a ChIP-seq study) to be regulated by RpoN were included in the Venn diagram. (E) Schematic illustration of four classes of RpoN binding sites based on their positions in relation to gene coding sequences. (F) Distribution of each class of RpoN binding site.ChIP-seq peaks and predicted motif coordinates. Download Table S1, PDF file, 0.8 MB.We set a cutoff for the PSSM of scores of >7 for high-confidence peaks, which yielded 103 peaks encompassing 112 binding sites (Fig. 2C; Tables 1 and 2). There were relatively few ChIP-seq peaks without motifs (16 out of 119; Table S2) compared with the findings in some other studies (24, 27, 28). This probably reflects the high-stringency analysis used, which limits the number of false positives commonly found in ChIP data from highly transcribed regions (31, 32). The majority of the peak-associated RpoN binding sites were found in bacteria expressing RpoN-V5 from its native promoter during logarithmic growth. The corresponding samples from bacteria overexpressing RpoN-V5 lacked six of those peaks, and even more peaks were missing from the samples from virulence-inducing conditions. The reasons for these differences are not obvious, but in the case of peaks missing in bacteria induced for expression of the virulence plasmid, the availability of exposed sites might have been affected by structural changes in the chromosome per se that can be part of mechanisms suppressing chromosomal gene expression (2, 25, 26). In bacteria overexpressing RpoN-V5, there might be a saturation effect at high RpoN-V5 concentrations, with the precipitation of RpoN molecules not associated with any binding site that dilutes samples resulting in relatively less precipitated DNA where DNA fragments from low-abundance binding sites are missed. Notably, unlike other sigma factors, RpoN can bind its DNA sequence without RNAP, although the binding is 10-fold less efficient than the RpoN:RNAP complex (33). ChIP-seq should therefore also detect RpoN-DNA interactions that are independent of active transcription.
TABLE 1
Intergenic sense (A) and antisense (B) RpoN binding sites
Peak
PC
PSL
PSI
PSO
Motif sequence
Strand (+/−)
PSSM
Associated σ factor(s)
Downstream gene(s)
Distance from PC
Intergenic sense (A sites)
IGS1
117503
10.6
13.9
9.3
CTGGCTTGATTTATGCAA
+
10.26
NA
YPK_0100
NA
IGS2
353423
9.6
11.0
8.7
TTGGCTTGTTTATTGCTT
−
11.14
NA
YPK_0329
NA
IGS3
779812
10.5
11.5
8.6
CTGGCACGTTAGTTGCAT
+
11.29
NA
YPK_0719
NA
IGS4
1127431
7.7
5.8
7.1
TAGGCTTGGATCTTGCTT
+
10.15
NA
YPK_1001
NA
IGS5
1384233
13.1
14.2
9.3
TTGGCACGCTAACTGCAT
+
10.82
NA
YPK_1257, YPK_1258S
NA
IGS6
1469648
5.2
NA
3.4
CTGGCATGAGTCGTGCTC
−
10.08
σ24,54,70
YPK_1333
29, −70
IGS7
1700633
7.4
4.9
9.7
CTGGCACGATTACTGCAA
+
10.75
σ54,70
YPK_1538
69
IGS8
1770585
5.9
4.4
7.3
CTGGCACGATCTTTTCAT
−
10.32
σ70,54
YPK_1600E,S
−23
IGS9
2092867
16.8
8.4
6.5
TGGGCACGCATTTTGCCT
−
10.54
σ70,28,54
YPK_1886
−28, −127
IGS10
2101765
16.5
15.0
13.5
TTGGCACGACTCTTGATT
+
11.44
σ70,54
YPK_1894E,S,V
−37
IGS11
2153679
4.7
5.4
NA
CTGGTGCAGATTTTGCAG
−
9.6
σ70,24,54
YPK_1940
−4, −14
IGS12
2194598
16.6
16.8
13.8
TTGGCACGATAACTGCTT
−
10.96
σ38,70,54
YPK_1974
−14, −43
IGS12
2194598
16.6
16.8
13.8
AGGGGTTAATATTTGCGT
+
9.34
σ54,70,38
YPK_1975
61, 141
IGS13
2298808
7.2
3.5
5.2
TTGGGATGGTTGATGCAT
−
10.22
σ54,70
YPK_2070
39
IGS13
2298808
7.2
3.5
5.2
TTGGAATAGTTGATGCAT
−
9.94
σ54,70
YPK_2070
39
IGS14
2473234
4.3
NA
4.5
GTGGCACGAAAGCTGCTG
−
10
σ54,24,38,70
YPK_2229E,S
12, 38, 52
IGS15
2990747
16.9
7.7
8.0
ATGGCATCTTATTTGCTC
+
10.51
σ54,38,70
YPK_2707
6, 31
IGS16
3172389
16.6
14.6
11.7
TTGGCATGGCAATTGCGC
−
10.63
NA
YPK_2873
NA
IGS17
3205379
15.4
15.3
11.8
ATGGCATGATAATTGCTT
−
11.42
σ24,54,70
YPK_2908E,S
44, −44
IGS17
3205379
15.4
15.3
11.8
TTGGCATAGGAATTGCCT
+
10.63
σ54,24,70
YPK_2909E,S
66, 104
IGS18
3225401
16.1
9.7
8.8
TTGGTCCAAGAGTTGCTT
+
10.11
NA
YPK_2927
NA
IGS19
3300593
8.9
7.7
6.2
TTGGCGTGTTTTTTGCAT
+
11.21
σ54,70
YPK_3001
39
IGS20
3311541
12.6
13.3
10.6
CTGGCACAAACCTTGCAT
+
10.73
σ54,70,38,24
YPK_3010S
15, 17, 23
IGS20
3311541
12.6
13.3
10.6
ATGGCTGCTTTACTGCCT
+
9.24
σ54,70,38,24
YPK_3010S
15, 17, 23
IGS20
3311541
12.6
13.3
10.6
CAGGGCTATATTCCGCTT
+
8.12
σ54,70,38,24
YPK_3010S
15, 17, 23
IGS21
3520103
13.5
9.2
10.8
ATGGCATAGCCTTTGCTT
−
10.36
NA
YPK_3220E,S
NA
IGS22
3650370
3.6
NA
2.7
TTGGCATGGTACTTGCAA
−
11.11
σ24,70,54
YPK_3329
−29, −39
IGS23
3651607
11.2
6.3
5.4
TTGTCAGGTTTCGTGCTG
−
10.18
σ32,24,54,70
YPK_3330
−40, −9
IGS23
3651607
11.2
6.3
5.4
CTGGAACAGCTCTTGCTT
+
10.08
σ70
YPK_3331
60
IGS24
4250134
12.0
13.5
10.1
TTGGCACGTTTCTTGTAA
−
10.43
NA
YPK_3857S,E
NA
IGS25
4358596
5.6
3.3
4.0
TTGGCGCGATTCATGCCT
−
10.45
σ70,38
YPK_3950
−154, 136
IGS25
4358596
5.6
3.3
4.0
GAGGCATGAATCGCGCCA
+
8.2
NA
YPK_3951
NA
IGS26
4621308
10.7
13.2
9.0
TTGGTGCCTCATTTGCGC
−
9.18
NA
YPK_4188
NA
IGS26
4621308
10.7
13.2
9.0
TTGGCATAGATTTCGCAA
+
9.96
σ70,28,24
YPK_4189E,S,V
−112, −169, 178
Intergenic antisense (B sites)
IGAs1
2892926
5.4
5.2
4.6
TTGGCGTGAATTTTGCGC
−
10.68
σ32,28
YPK_2636
−17, 14
IGAs2
3172389
16.6
14.6
11.7
AGGGCATAAACGGTGCAA
+
8.71
NA
YPK_2874
NA
IGAs3
4432739
9.6
11.8
8.2
CTGGCACGCTAAGTGCAA
−
10.25
NA
YPK_4019
NA
Abbreviations: IGS, intergenic sense; IGAs, intergenic antisense; PC, peak center base pair; PSL, peak strength logarithmic phase; PSI, peak strength induction; PSO, peak strength over expression of RpoN; PSSM, position-specific score matrix; XE/S/V, prefix indicating existence of RpoN binding motif in similar location in relation to orthologous genes in E. coli (E), V. cholerae (V), or S. Typhimurium (S); NA, not applicable.
TABLE 2
Intragenic sense (C) and antisense (D) RpoN binding sites
Peak
PC
PSL
PSI
PSO
Motif sequence
Strand (+/−)
PSSM
Associated σ factor(s)
CDS
Downstream gene(s)
Distance from PC
Intragenic sense (C sites)
IrGS1
173082
7.9
3.8
4.3
ATGGCACGATTAATGCCA
+
10.3
σ54,32,28
YPK_0145S
YPK_0147, YPK_0148, YPK_0149
52, 140
IrGS2
506362
6.0
5.9
4.9
ATGGTCCGTTTATTGCGT
+
10.2
σ54,32,24
YPK_0464
YPK_0465
85, 123
IrGS3
535257
3.8
NA
NA
ATGGCACAAAATGTGCTG
−
10.1
σ54,24,32
YPK_0482
YPK_0481
57, 187
IrGS4
613541
10.8
5.5
7.0
TTGGCTCGATTTATGCGT
−
10.7
σ54,70,24
YPK_0560
YPK_0559
4, 21
IrGS5
653552
6.5
5.3
3.3
ATGGCGCAGTTTATGCGT
+
9.99
σ54,32
YPK_0589
YPK_0591, YPK_0592
42
IrGS6
818023
4.2
NA
NA
GCGGCATGGTTGTTGCAA
−
9.99
σ38,54,70,28,32
YPK_0744
NA
−16, 35, 36, 39
IrGS7
821601
4.7
2.9
3.3
GCGGCATGGTTGTTGCAA
−
9.99
σ54,28,32
YPK_0746
YPK_0745
175, 132
IrGS8
940388
4.1
NA
3.4
TTGGCATTGTTGATGCTC
+
10.1
σ24,28,54,38
YPK_0824
YPK_0825
−26, −26, 64
IrGS9
1141660
6.5
4.2
3.4
TTGGTATGATTTATGCCT
+
10.6
σ54,70,38,32,24
YPK_1016
YPK_1018
35, 36, 76, 78
IrGS10
1225391
3.3
NA
NA
TTGGCTGAAAACTTGCAG
+
9.88
σ54,24
YPK_1084
YPK_1085
28
IrGS11
1308478
3.5
NA
NA
CAGGCATGGATTATGCAA
+
9.59
σ70,24,32,38,54
YPK_1170
YPK_1171, YPK_1172
−4, −9, −19, −27
IrGS12
1475860
6.0
4.2
3.5
TTGGCCCACGCCTTGCTT
+
9.92
σ24,54,28
YPK_1338
YPK_1339
−13, 16
IrGS13
1528613
6.6
4.6
3.7
CCGGCACGTTTTGTGCAG
+
9.86
σ24,54,38
YPK_1386
YPK_1387
−19, 14
IrGS14
1605153
18.0
9.3
9.0
TTGGTCTGACTTTTGCTT
−
10.4
σ54,32,38
YPK_1454
YPK_1452, YPK_1453
21, 45
IrGS15
1930700
3.6
NA
2.8
TTGGCGGGAATATTGCTT
−
10.7
σ54,28
YPK_1742
NA
86
IrGS16
2058280
6.7
4.8
3.7
TTGGTATGTTATTTGCAA
+
10.7
σ24,28,54,38,70
YPK_1853
NA
−44, −44, 22, 135
IrGS16
2058280
6.7
4.8
3.7
TAAGCATGAAACGTGCAT
+
9.04
σ32,70,54,38,28
YPK_1853
NA
−44, −44, 22, 135
IrGS17
2211354
7.4
3.4
4.5
TTGGTACATTTATTGCGC
−
10.4
σ24,54,38
YPK_1988
NA
−49, 123
IrGS18
2408285
4.5
NA
3.6
CTGGCACGTCTGATGCAA
+
9.98
σ32,54
YPK_2169
YPK_2170, YPK_2171
−19
IrGS19
2536664
7.6
4.1
5.5
TTGGCATGGTAATTGAAT
−
10.7
σ24,54,28,32
YPK_2282
YPK_2281
−44, 7, 8
IrGS20
2598060
7.1
4.7
6.2
TTGGCACAAAGCTTGCTC
−
10.6
σ32,54,28,24
YPK_2364
YPK_2362
−16, 4, 15
IrGS21
2599974
6.9
7.5
6.3
CTGGCACGTAAATTGTAT
−
9.67
σ24,70,54,28,32
YPK_2365
YPK_2364
−30, −50, 100, 112
IrGS22
2663968
11.3
6.5
5.8
TTGGCCCGCTTCTTGCGC
−
10.6
σ70,24,54,28,32
YPK_2431V
YPK_2429S, YPK_2430
−29, −31, 32, 46
IrGS23
2760688
8.9
6.0
3.7
ACGGAGCACTTCTTGCAT
−
9.25
σ54,24,32
YPK_2519
YPK_2517, YPK_2518
69, 120
IrGS24
2815385
3.0
NA
NA
ACGGTATAATTATTGCGT
−
10
σ28,54,70
YPK_2564
NA
−1, 70
IrGS25
2870433
3.2
NA
NA
ATGGTATAAAAATTGCGC
−
9.85
NA
YPK_2615
NA
NA
IrGS26
2978162
15.7
8.2
7.7
TTGGCACGATTGATGCTC
−
10.9
σ70,28,38,54
YPK_2696
YPK_2695
−26, −53, −86
IrGS27
3075166
4.1
3.3
2.2
CTGGCTTAATGTTTGCAT
+
10.5
NA
YPK_2785
YPK_2786, YPK_2787
NA
IrGS28
3195552
4.4
NA
NA
CTGGCAAAAATTATGCCC
+
9.63
σ32,38,28,54,24
YPK_2901
YPK_2902
−7, −24, −44, 146
IrGS29
3266581
3.4
NA
NA
CTGGAACGTTATTTGCAG
−
10.4
σ38,32,54,24,70
YPK_2968
YPK_2967
−1, −13, 20, 148
IrGS30
3276693
3.3
NA
3.1
TTGGCATCAAAGTTGCCG
+
10.3
σ54,38,24
YPK_2976
YPK_2977
67, 147
IrGS31
3386998
4.7
NA
2.9
TCGGCCCGTTTATTGCTC
−
10.4
σ54,28,32,38,28
YPK_3080
YPK_3079
44, 86, 91
IrGS32
3448213
2.6
2.4
NA
GTGGCGCGTTATTTGCGT
−
10.5
σ28,24
YPK_3152
NA
80, 152
IrGS33
3491164
5.6
4.1
3.5
GTGGAACAGGTTTTGCAC
−
9.94
σ28,32,54,70
YPK_3192
YPK_3191
−5, −10, 181
IrGS34
3558532
6.5
5.8
4.2
TTGGTACGTTACTTGCTC
+
10.7
σ28,54,32,38,24
YPK_3253
NA
−44, 73, 89, 95
IrGS35
3621908
4.7
5.0
4.4
CTGGCAAATTTTCTGAAA
−
9.43
σ54,32,28
YPK_3308
YPK_3307
2, 57
IrGS36
3658670
6.0
5.5
3.9
TTGGCACAAAACGTGCGT
+
10.4
σ54,32,38
YPK_3338
YPK_3339
137, 198
IrGS37
3674947
4.2
5.4
3.6
ATGGAACAGGACTTGCAT
−
9.85
σ24,54
YPK_3349
NA
−42
IrGS38
3948038
5.0
3.0
2.3
CCGGCCCGCATGATGCCG
−
8.67
σ54,38,24
YPK_3583
NA
10, 38
IrGS39
4007370
3.1
NA
NA
TTGGTGCGGATTATGCAG
−
9.57
σ38,24,54
YPK_3630
YPK_3629
−30, −197
IrGS40
4106580
3.0
NA
2.5
CTGGCACCAGTTATGCGT
−
10.6
σ24,38,32,54,28
YPK_3718
YPK_3717
−47, −49, −50, 107
IrGS41
4186557
5.2
3.9
3.1
AAGGGGCAATCTTTGCAA
−
8.94
σ38,54,24
YPK_3799
YPK_3798
−134, 194
IrGS42
4216234
11.9
9.6
7.5
TTGGTATAGGTTTTGCAG
+
10.1
σ32,54
YPK_3826
YPK_3827, YPK_3828
−167
IrGS43
4341322
3.8
NA
NA
ATGGCACAGTTACTGCAG
+
7.68
NA
YPK_3934
NA
NA
IrGS44
4441905
10.2
14.2
9.2
TTGGCACGGAAAATGCTA
−
10.5
σ24,32,54
YPK_4027S
YPK_4026
−104, −137
IrGS45
4448335
8.6
11.2
7.7
TTGGCATTGTTCTTGCTT
−
11.1
σ24,54
YPK_4033
YPK_4032E,S
−148
IrGS46
4561060
5.1
2.7
3.2
TTGGTATGGTTTTCGCCG
−
9.67
σ32,54,70
YPK_4127
YPK_4126
−179, 123
IrGS47
4584577
4.5
4.3
3.3
TTGGTACGGGGTTTGCGT
+
10.1
σ54,70,24
YPK_4151
NA
133, 191
IrGS48
4602274
5.6
4.2
4.0
TTGGCGCACAAATTGCTC
+
10.1
NA
YPK_4174
NA
NA
IrGS49
4612122
5.6
4.2
4.0
ATGGCACGCTAGTTGCGG
−
10.6
σ54,70,24
YPK_4181V
NA
173, 123
Intragenic antisense (D sites)
IrGAs1
128696
3.0
NA
NA
CTGGCCCAATACATGCAT
−
10.1
σ54,28,24,32
YPK_0107
YPK_105, YPK_106
2, 11, 27
IrGAs2
318299
NA
2.9
NA
CTGGCTTGTCGGATGCAC
−
9.06
σ38,54,32
YPK_0280
YPK_0289
−44, 132
IrGAs3
549836
4.7
3.3
NA
TTGGTCCAGTAATTGCTG
+
9.9
σ54,32,38
YPK_0496
YPK_0495
15, 164
IrGAs4
630656
3.3
2.6
NA
ACGGCTCAATTGTTGCAT
−
9.97
σ24,54,32,28
YPK_0712
YPK_0713, YPK_0714
−19, 20, 140
IrGAs5
1053922
3.6
NA
NA
CAGGCACGATACGTGCAT
+
10.1
σ54,24,38
YPK_0933
168, 172
IrGAs6
1099750
2.8
NA
NA
CCGGCATGCATCATGCAC
+
9.39
σ24,54,38
YPK_0978
YPK_0979
−8, 159
IrGAs7
1204307
2.5
NA
NA
TTGGTACAGTTTTTTCCA
−
9.45
σ54,32,28
YPK_1066
YPK_1065
109, 112
IrGAs8
1265403
12.0
7.4
8.0
TTGGCATGCATATTGCGC
−
10.9
σ38,54,24,32,70
YPK_1125
YPK_1126
−27, 4, 8, 21
IrGAs9
1357208
5.8
4.3
3.1
CTGGCATGTCATTTGCTG
−
10.6
σ54,70,38,28
YPK_1228
NA
63, 152, 182
IrGAs10
1894351
3.5
NA
NA
TGGGCACACTCGTTGCAT
−
10
σ54,28,24,38
YPK_1706
NA
42, 44, 167
IrGAs11
2058280
6.7
4.8
3.7
AATGCACGTTTCATGCTT
−
9.09
σ54,28,32,38
YPK_1853
NA
67, 152, 182
IrGAs12
2760688
8.9
6.0
3.7
TAGGGTTTCTGGCTGCGG
+
7.27
σ32,24,54,28
YPK_2519
YPK_2520
−39, −22, 128
IrGAs13
3195552
4.2
NA
3.5
TGGGCATAATTTTTGCCA
−
10.6
σ54,28
YPK_2901
NA
19
IrGAs14
3296918
10.0
8.8
7.9
TAGGCACGAGATTTGCAT
−
10.7
σ54,32,28,38
YPK_2997
YPK_2996E
168, 189, 199
IrGAs15
3347078
3.5
NA
NA
TTGGCAAGAAATCTGCAC
+
10.7
σ38,70,54,28
YPK_3045
NA
−5, −7, 3
IrGAs16
3383005
13.6
11.8
9.3
ATGGCACGGATTTTGCCC
−
10.8
σ38,24,54,70
YPK_3075
YPK_3074
−43, −50, 72
IrGAs17
3435112
5.4
3.7
NA
AAGGCACGATAGTTGCGT
−
10.5
σ54,28,32,38
YPK_3144
NA
52, 104, 119
IrGAs18
3603190
3.1
NA
NA
CTGGCACTGATATTGCCG
−
10.2
σ70,32,54,38
YPK_3293
NA
−26, −8, 187
IrGAs19
3621908
4.7
5.0
4.4
ATGGCATCAATATTGCTA
+
10.5
σ54,32,28
YPK_3308
YPK_3309
106, 113
IrGAs20
3984487
6.0
5.0
3.6
CTGGCTTAAATCTTGCGT
+
10.4
σ54,24,38,32
YPK_3613
YPK_3614
75, 110, 125
IrGAs21
4135862
4.4
NA
3.5
CTGGCACGCTTTCTGCAA
+
10.6
σ32,54,24,38
YPK_3743E
NA
−15, 22, 123
IrGAs22
4186557
5.2
3.9
3.1
CAGGAACGGATCTTGCAA
+
9.38
NA
YPK_3799
NA
IrGAs23
4288435
11.2
13.7
10.2
CTGGCTCAATTAATGCAT
−
10.3
σ24,28,54
YPK_3887
NA
−102, −178
IrGAs24
4309656
5.1
5.0
3.4
CTGGTGTAAATATTGCAC
+
9.89
NA
YPK_3909
NA
IrGAs25
4371420
11.5
7.9
5.2
GCGGCGCATTTTTTGCAT
+
9.83
σ24,32,54
YPK_3962
NA
−143, −198
Abbreviations: IrGS, intragenic sense; IrGAs, intragenic antisense; PC, peak center base pair; PSL, peak strength logarithmic phase; PSI, peak strength induction; PSO, peak strength over expression of RpoN; PSSM, position-specific score matrix; XE/S/V, prefix indicating existence of RpoN binding motif in similar location in relation to orthologous genes in E. coli (E), V. cholerae (V), or S. Typhimurium (S); NA, not applicable.
Intergenic sense (A) and antisense (B) RpoN binding sitesAbbreviations: IGS, intergenic sense; IGAs, intergenic antisense; PC, peak center base pair; PSL, peak strength logarithmic phase; PSI, peak strength induction; PSO, peak strength over expression of RpoN; PSSM, position-specific score matrix; XE/S/V, prefix indicating existence of RpoN binding motif in similar location in relation to orthologous genes in E. coli (E), V. cholerae (V), or S. Typhimurium (S); NA, not applicable.Intragenic sense (C) and antisense (D) RpoN binding sitesAbbreviations: IrGS, intragenic sense; IrGAs, intragenic antisense; PC, peak center base pair; PSL, peak strength logarithmic phase; PSI, peak strength induction; PSO, peak strength over expression of RpoN; PSSM, position-specific score matrix; XE/S/V, prefix indicating existence of RpoN binding motif in similar location in relation to orthologous genes in E. coli (E), V. cholerae (V), or S. Typhimurium (S); NA, not applicable.Low-confidence peaks of RpoN binding site. Download Table S2, PDF file, 0.3 MB.The robustness of the analysis was further verified by the identification of RpoN binding sites at intergenic regions upstream of genes previously shown to be regulated by RpoN in other bacteria. Examples here are YPK_1894 (pspA), YPK_3220 (glnK), YPK_3857 (pspG), and YPK_4189 (glnA) (8, 27, 28). Comparing our high-stringency peaks with the peaks identified by ChIP-seq in E. coli and S. Typhimurium, it was obvious that the location of many of the peaks in relation to coding DNA sequences (CDS) was conserved, and some were also shared with V. cholerae (Fig. 2D and Tables 1, 2, and 3). The conserved locations of RpoN binding sites included locations upstream of pspA and glnA with orthologs in E. coli, S. Typhimurium, and V. cholerae, for example, YPK_2229, YPK_2908, and YPK_1600, with orthologs in E. coli and S. Typhimurium (Table 3). There were also many novel binding sites identified in Y. pseudotuberculosis, where a majority were intragenic with some also on the noncoding strand (Fig. 2E and F). The presence of sense and antisense intragenic binding sites for RpoN is in accordance with what has been previously found for S. Typhimurium and E. coli (27, 28). The function and mechanisms of RpoN intragenic binding are generally unknown, but there are examples where this binding can drive transcription of downstream genes with long 5′ untranslated regions (UTRs) (27, 34). The putative RpoN binding sites identified were divided into groups (A to D) based on their position and orientation. Groups A (26 sites) and B (3 sites) comprise binding sites in intergenic regions. Group A sites are oriented toward the 5′ end of the nearest coding sequence, and those in group B are oriented toward the 3′ end of the neighboring coding sequence (Fig. 2E and F; Table 1). Groups C (49 sites) and D (25 sites) comprise intragenic binding sites; those in group C are oriented in the sense direction and those in group D in the antisense direction (Fig. 2E and F; Table 2). In general, binding sites in group A were associated with stronger peaks than the sites in groups C and D (Tables 1 and 2). Further, compared with the intergenic binding sites among which a relatively large fraction appears to be conserved among E. coli, S. Typhimurium, and Y. pseudotuberculosis, the fraction of conserved intragenic RpoN binding sites was considerably lower (Tables 1, 2, and 3).
TABLE 3
Existence of RpoN binding motif with similar location in relation to orthologous genes
Abbreviations and symbols: IGS, intergenic sense; IrGS, intragenic sense; locus tag, locus tag of the gene in relation to predicted RpoN binding site in Y. pseudotuberculosis YPIII; √, reported RpoN binding motif with similar location in relation to orthologous genes.
Existence of RpoN binding motif with similar location in relation to orthologous genesAbbreviations and symbols: IGS, intergenic sense; IrGS, intragenic sense; locus tag, locus tag of the gene in relation to predicted RpoN binding site in Y. pseudotuberculosis YPIII; √, reported RpoN binding motif with similar location in relation to orthologous genes.To reveal possible sigma factor cross talk in Y. pseudotuberculosis, we also screened for RpoD, RpoE, RpoS, RpoH, and FliA binding sites close (±200 nt) to the identified RpoN binding sites. This screen showed many potential dual and sometimes triple sigma factor binding regions close to each other in 68% of all A sites (Table 1). Even more multiple binding regions were found associated with intragenic sites, with clusters of 2 to 5 sigma factor sites in more than 90% of all C and D sites (Table 2). Notably, all sigma factor binding regions associated with A sites had a binding site for RpoD (σ70), and nearly half of them contained sites for RpoE (σ24) and/or RpoS (σ38). In contrast, binding sites for FliA (σ28) were very rare and sites for RpoH (σ32) were absent. The multiple binding sites associated with C sites included sites for RpoE (60%), RpoH (54%), RpoS (40%), FliA (42%), and RpoD (30%). D sites were similar, with binding sites for RpoS (64%), RpoH (60%), FliA (52%), RpoE (48%), and RpoD (20%).
Deletion of rpoN has a large impact on the Y. pseudotuberculosis transcriptome, with both direct and indirect effects.
Next, we aimed to determine whether the identified binding sites could indeed be coupled to gene expression in Y. pseudotuberculosis. For this we employed transcriptome sequencing (RNA-seq) on WT and ΔrpoN bacteria at 26°C in stationary phase and at 37°C with virulence induction. Analysis of differentially expressed genes revealed a markedly different gene expression pattern in the ΔrpoN strain compared with the isogenic WT strain. More than 500 genes were found to be differentially regulated at 26°C in stationary phase: 294 were downregulated and 213 were upregulated in ΔrpoN (Fig. 3A and B). The effect was even more pronounced at 37°C under virulence-inducing conditions: almost 1,700 genes were affected, with 766 genes downregulated and 929 upregulated. The reason for this discrepancy, with a much higher number of genes affected under virulence-inducing conditions compared with 26°C at stationary phase, might be a higher degree of stress associated with the former. This is a condition known to involve activation of different alternative sigma factors as well as other global regulators. Functional annotation analysis of the differentially expressed genes showed downregulation of genes involved in nitrogen metabolism, flagellar assembly, chemotaxis, and quorum sensing under both conditions (Fig. 3C and Fig. S2). There was also downregulation of fatty acid biosynthesis and metabolism, but this was not seen in samples of bacteria subjected to virulence-inducing conditions. For these samples, additional pathways were affected by the deletion of rpoN, including, for example, low expression of genes involved in the type III secretion system (T3SS) that normally is highly upregulated under this condition, downregulation of DNA replication and amino acid biosynthesis, and upregulation of genes involved in carbon metabolism, gluconeogenesis, and ribosomal organization, with the latter possibly reflecting the stress of translational reprogramming.
FIG 3
Deletion of rpoN has a substantial influence on the Y. pseudotuberculosis transcriptome. (A) Volcano plot showing differentially expressed genes in the Y. pseudotuberculosis ΔrpoN strain compared with the WT strain at stationary phase (OD = 2.0 at 26°C) and during virulence induction (Ca2+ depletion, at 37°C). Upregulated genes are indicated in red and downregulated genes in green. (B) Venn diagram showing the number of differentially expressed genes under the indicated conditions. The criteria for differential expression shown in panels A and B were fold changes of >1.5 with a P value of <0.05. (C) Pathway enrichment (KEGG) of genes differentially expressed both in stationary phase and after virulence induction. Pathways are ranked by the negative log10 of the P value of the enrichment score. The P values were calculated using the Bonferroni correction. (D) Plots showing expression of genes associated with intergenic or intragenic high-confidence peaks (A, B, C, and D sites). The y axis indicates expression values from RNA-seq at stationary phase and virulence induction (log2 fold change of ΔrpoN/WT), and the x axis indicates the strength of the associated ChIP peaks (the RpoN binding peak over control in log2 fold) where higher peak strength corresponds to stronger enrichment of RpoN within a binding site. Significantly upregulated (red circles) and downregulated (green circles) genes, as well as nonsignificant changes (black circles), are indicated for the different binding site groups. The portion of potential RpoN binding sites associated with differential gene expression was 88% for group A, 79% for group C, and 68% for group D.
Deletion of rpoN has a substantial influence on the Y. pseudotuberculosis transcriptome. (A) Volcano plot showing differentially expressed genes in the Y. pseudotuberculosis ΔrpoN strain compared with the WT strain at stationary phase (OD = 2.0 at 26°C) and during virulence induction (Ca2+ depletion, at 37°C). Upregulated genes are indicated in red and downregulated genes in green. (B) Venn diagram showing the number of differentially expressed genes under the indicated conditions. The criteria for differential expression shown in panels A and B were fold changes of >1.5 with a P value of <0.05. (C) Pathway enrichment (KEGG) of genes differentially expressed both in stationary phase and after virulence induction. Pathways are ranked by the negative log10 of the P value of the enrichment score. The P values were calculated using the Bonferroni correction. (D) Plots showing expression of genes associated with intergenic or intragenic high-confidence peaks (A, B, C, and D sites). The y axis indicates expression values from RNA-seq at stationary phase and virulence induction (log2 fold change of ΔrpoN/WT), and the x axis indicates the strength of the associated ChIP peaks (the RpoN binding peak over control in log2 fold) where higher peak strength corresponds to stronger enrichment of RpoN within a binding site. Significantly upregulated (red circles) and downregulated (green circles) genes, as well as nonsignificant changes (black circles), are indicated for the different binding site groups. The portion of potential RpoN binding sites associated with differential gene expression was 88% for group A, 79% for group C, and 68% for group D.Pathway enrichment in differentially expressed genes (ΔrpoN/WT) at stationary phase and under virulence induction. The pathway enrichment score was calculated as the probability of a gene’s enrichment in a pathway other than by random chance. Pathways are ranked by the negative log10 of the P value of the enrichment score. The P values were calculated using the Bonferroni correction and used to evaluate the significance of the pathway under a particular condition. Up- and downregulation of the pathway were predicted by calculating the Z score of differential expression of the genes in a certain pathway. Download FIG S2, PDF file, 0.1 MB.Although the expected effects of RpoN deletion, such as reduced expression of genes involved in nitrogen metabolism, flagella, and quorum sensing were obvious, the impact on the Y. pseudotuberculosis transcriptome was massive, clearly reflecting deletion of a global regulator. This accords with previous studies, which found that the absence of RpoN commonly results in a global effect involving both direct and indirect effects on gene expression (9, 24, 35). The differential gene expression analysis revealed changed expression levels of other sigma factors in the ΔrpoN strain (Table S3). Expression of RpoD, for example, was significantly upregulated, whereas expression of RpoE was downregulated both in stationary phase and under virulence-inducing conditions. RpoS, on the other hand, was upregulated in stationary phase but downregulated under inducing conditions. In addition to other sigma factors, the mRNA levels of other transcriptional regulators were affected, such as those of CpxR, RovA, FlhCD, and others (Table S3). Hence, these effects, together with effects on the transcription of other sigma factors, are expected to contribute extensively to indirect effects on gene expression, adding further complexity to the data set.Global regulators differentially expressed in ΔrpoN strain. Download Table S3, PDF file, 0.4 MB.
Identified RpoN binding sites mediate both positive and negative regulation of gene expression.
Given the complexity, including indirect effects, in the RNA-seq data set, we next aimed to reveal direct effects of RpoN. The RpoN binding sites identified in the ChIP-seq analysis likely include both active promoters driving transcription and suppressive and silent binding of RpoN to the chromosome. The potential RpoN binding sites identified were therefore matched with detected changes in gene expression levels (Fig. 3D). Among the identified binding sites in group A, 88% were associated with differential gene expression, involving both upregulated and downregulated genes (Fig. 3D). The intragenic C and D sites were also associated with differential gene expression, 79% for the C sites and 68% for the D sites. For C sites, the differential gene expression included both upregulation and downregulation of genes containing the motif as well as downstream genes. For the antisense D sites, there was a larger fraction of upregulated than downregulated genes, suggesting potential suppressive effects of the RpoN binding (Fig. 3D). In general, a relatively large portion of the differentially expressed genes associated with RpoN binding showed increased expression in the ΔrpoN strain. Binding by RpoN might suppress transcription by nearby sigma factors and possibly other transcription factors binding to the same region, where the absence of RpoN would then allow transcription to occur (8, 27). Also, collision as a consequence of RpoN intragenic binding has been suggested (36, 37).To explore the potential importance of the RpoN binding sites identified in Y. pseudotuberculosis, we set out to mutate some of them to reveal their effects on gene expression. For this, we chose binding sites with locations indicative of putative positive or negative regulation by RpoN, which was the case for many A and D sites. We could not identify any C-site ChIP-seq peak indicative of transcriptional activation of downstream genes in our data set. The sites were mutated by the exchange of 3 to 6 nucleotides in the conserved TGG and TGC sequences of the RpoN binding motif. In intragenic motifs, the exchanged nucleotides were selected in order to minimize changes in the encoded protein (see Table S5 for details). The selected sites included seven A and five D sites (Fig. 4 and Table S4), and the effects of the mutations on gene expression were verified by qPCR. For all putative activating A sites, those represented by downstream genes downregulated in the absence of RpoN, point mutations in the RpoN binding motifs in the WT strain resulted in reduced transcription. This class of binding sites also showed the highest degree of conservation. The most prominent effect of the disruptive nucleotide exchange was seen for pspA (YPK_1894), a gene known to be activated by RpoN (38, 39). There were also indications of inhibitory effects of RpoN binding to A sites, where the expression of downstream genes was increased in both the ΔrpoN strain and the corresponding binding site mutant. Intriguingly, four of the five binding site mutations in D sites resulted in increased expression of the CDS on the opposite strand, suggesting a suppressive effect of RpoN binding. We also mutated some of the C sites, but here no effect on transcription compared with that in the WT strain could be seen. Notably, peaks associated with C sites were commonly flatter and broader than the peaks covering the mutated A and D binding sites (Fig. 4 and Fig. S3).
FIG 4
Validation of productive RpoN binding. (Left) Relative expression of indicated genes in bacteria when the associated RpoN binding site is mutated (white bars) and in the ΔrpoN strain (black bars). Expression of genes in the mutant strains was determined by qPCR and presented as log2 fold relative expression compared to expression in the WT strain, where positive values represent suppressed transcription and negative values indicate activated transcription. Results are shown for expression under virulence-inducing conditions for YPK_0329, YPK_1886, YPK_1894, YPK_2927, YPK_2997, YPK_3010, YPK_3075, YPK_3613, YPK_3887, and YPK_3962 and for expression during stationary phase at 26°C for YPK_2908 and YPK_2909. Values represent means ± SEM of three independent experiments. (Right) Schematic illustration of the shapes and genomic contexts of ChIP peaks together with the positions and orientations of RpoN binding of associated motifs.
Validation of productive RpoN binding. (Left) Relative expression of indicated genes in bacteria when the associated RpoN binding site is mutated (white bars) and in the ΔrpoN strain (black bars). Expression of genes in the mutant strains was determined by qPCR and presented as log2 fold relative expression compared to expression in the WT strain, where positive values represent suppressed transcription and negative values indicate activated transcription. Results are shown for expression under virulence-inducing conditions for YPK_0329, YPK_1886, YPK_1894, YPK_2927, YPK_2997, YPK_3010, YPK_3075, YPK_3613, YPK_3887, and YPK_3962 and for expression during stationary phase at 26°C for YPK_2908 and YPK_2909. Values represent means ± SEM of three independent experiments. (Right) Schematic illustration of the shapes and genomic contexts of ChIP peaks together with the positions and orientations of RpoN binding of associated motifs.Validations of productive RpoN binding. Download Table S4, PDF file, 0.3 MB.List of strains used in this study. Download Table S5, PDF file, 0.6 MB.Schematic representation of selected intragenic sense (C) RpoN binding sites for qPCR. The arrow indicates the position and direction of the RpoN binding motif. ChIP peaks are shown in blue, representing their coverage in the genome of logarithmically grown bacteria (26°C). Download FIG S3, PDF file, 0.4 MB.Taken together, we have by mutagenesis been able to verify effects of RpoN binding to the binding motifs identified in a ChIP-seq screen of Y. pseudotuberculosis. Activating as well as suppressing effects of intergenic RpoN binding were verified by mutating intergenic A sites. Among these verified productive RpoN binding sites, some were known, such as those in PspA, PspP, and GlnA, whereas RpoN regulation of TppB, UgpB, and DkgA is described for the first time. There were also indications of inhibitory effects of RpoN binding to intragenic D sites. As discussed earlier, how RpoN suppresses transcription is less clear, but for binding to the sense strand it might occur by steric hindrance, either by the RpoN-RNA polymerase complex or by RpoN alone binding to the DNA. How this can affect expression from the opposite strand, as would be the case for the observed inhibitory effect of mutating RpoN binding D sites, is less clear, but it might involve disturbed strand separation. For intragenic RpoN binding, we saw effects on transcription only by mutating binding sites on the antisense strand. Thus, intragenic binding by RpoN to coding regions on the sense strand is likely silent and may be used for storage of RpoN-RNAP. RpoN-mediated suppression by binding to internal antisense sequences of genes carried on the opposite strand has not been shown previously, and its mechanism remains to be elucidated.
MATERIALS AND METHODS
Bacterial strains and growth conditions.
Strains and plasmids are listed in Table S5 in the supplemental material. Yersinia pseudotuberculosis strain YPIII was used in this study. Escherichia coli S17-1 λpir was used for cloning and conjugation. Antibiotics were used at the following concentrations: ampicillin (100 μg/ml), kanamycin (50 μg/ml), and chloramphenicol (25 μg/ml). Motility was tested on Luria-Bertani (LB) medium with 0.6% agar. Biofilm assays were carried out as described previously, using LB medium in glass tubes (40). All strains were routinely grown at 26°C in LB medium containing kanamycin (50 μg/ml). For ChIP- and RNA-seq analyses, cultures were grown in LB medium to the desired OD600. Arabinose (0.005%) was used for 30-min induction at 26°C for overexpression of rpoN. To reach the virulence induction condition, overnight bacterial cultures were diluted to an OD600 of 0.05 in LB and grown at 26°C. After 1 h, calcium was depleted by adding 5 mM EGTA and 20 mM MgCl2 and cultures were shifted to 37°C (22).
Strain construction.
In-frame gene deletion and insertion of the V5 epitope and binding-site mutations in Y. pseudotuberculosis were performed using an In-Fusion HD cloning kit (Clontech) according to the manufacturer’s instructions. Briefly, the flanking regions of the respective gene were amplified by PCR and cloned into the suicide vector pDM4. This construct was used to transform S17-1 and then transferred into recipient strains through conjugation. Conjugants were purified and incubated on 5% sucrose to recombine out the vector together with WT sequence. Deletion or mutation was confirmed by PCR. For trans-complementation, the gene was PCR amplified and cloned into the pBAD24 plasmid. For gene induction, rpoN and a C-terminal 3×V5 epitope tag were PCR amplified and cloned into the pBAD18 plasmid. All constructs were verified by sequencing. Primers used in this study are listed in Table S6.Primers used in this study. Download Table S6, PDF file, 0.5 MB.
Ethics statement.
Mice were housed and treated in accordance with the Swedish National Board for Laboratory Animals guidelines. All the animal procedures were approved by the Animal Ethics Committee of Umeå University (Dnr A108-10). Mice were allowed for 1 week to conform to the new environment before the experiments started.
Mouse infections.
Female FVB/N mice (Taconic) 8 weeks old were deprived of food and water for 16 h before infection. For infection of mice, overnight cultures of the Y. pseudotuberculosis strains were suspended in sterilized tap water supplemented with 150 mM NaCl, reaching an approximate CFU count of 106/ml for low-dose infection and 108 CFU/ml for acute infection. Mice were allowed to drink for 6 h. The infection dose was calculated based on viable count and the volume of drinking water supplemented with bacteria that was consumed. Frequent inspections of mice were carried out routinely to ensure no prominent clinical signs were overlooked. Infectedmice showing notable clinical signs were euthanized promptly to prevent suffering.The infections were monitored using the IVIS Spectrum in vivo imaging system (Caliper Life Sciences) routinely every 3rd day until 15 days postinfection (dpi) and later every week up to 28 dpi. The mice were anesthetized using the XGI-8 gas anesthesia system (Caliper Life Sciences) and 2.5% IsoFloVet (Orion Pharma, Abbott Laboratories Ltd., Great Britain) in oxygen for initial anesthesia and 0.5% isoflurane in oxygen during IVIS imaging. After infection, some mice were euthanized and dissected to analyze bacterial localization and presence in various organs, including the intestine, mesenteric lymph nodes, liver, and spleen. The organs were imaged using IVIS. Living Image software, version 3.1 (Caliper Life Sciences, Inc.), was used for image acquisition and data analysis.
Motility and biofilm assays.
Determination of swimming motility was performed as described previously (41). A 5-μl aliquot of a diluted overnight culture (OD600 of 1.0) was spotted at the center of a 0.6% LB soft agar plate and incubated at 26°C for 24 h. Bacterial motility was determined by measuring the diameter of the bacterial growth area.Biofilm formation was determined as previously described (40). Overnight cultures were diluted to an OD600 of 0.05 and grown to an OD600 of 0.5 in a 26°C shaking water bath. A 1-ml aliquot of the bacterial culture was pelleted and dissolved in 2 ml fresh LB. The suspension was transferred to glass tubes and incubated for 48 h at 37°C without shaking. After incubation, the bacterial suspension was discarded and tubes were gently washed 3 times with PBS and stained with 0.1% crystal violet (Sigma-Aldrich) for 15 min, followed by 3 successive washes with PBS. The biofilms on the tube surface were thereafter dissolved with 33% acetic acid for 15 min, and the absorbance at 590 nm was measured with an Ultrospec 2100 Pro spectrophotometer (Amersham Biosciences, Piscataway, NJ).
Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq).
Y. pseudotuberculosis ΔrpoN expressing RpoN::V5 in trans under the inducible araBAD promoter (YPIII, ΔrpoN/pIBX, prpoN::V5) and Y. pseudotuberculosis expressing RpoN:V5 in cis under its native promoter (YPIII, rpoN::V5/pIBX) were grown overnight in LB, diluted to an OD600 of 0.05, and grown for 4 h at 26°C, at which point the prpoN::V5 plasmid was induced with 0.005% arabinose for 30 min. Virulence-induced cultures were shifted to 37°C after 1 h at 26°C and incubated for 75 min under calcium-depleted conditions. Nontagged WT Y. pseudotuberculosis grown at 26°C for 4 h was used as a negative control. Cultures were cross-linked with 1% formaldehyde at room temperature for 10 min. For subsequent ChIP-seq, 10 OD600 units was used for each biological replicate. Chromatin immunoprecipitation was performed as described previously, with slight modifications (27). The 1-ml samples were sonicated in AFA milliTUBEs using a Covaris E220 sonicator with the following settings: peak power, 140 W; duty factor, −5%; cycles/burst, 200; time, 16 min. Immunoprecipitation was done using 90 μl anti-V5 agarose affinity gel (Sigma A 7345) overnight at 4°C. ChIP DNA was eluted in 100 μl elution buffer, treated with RNase A and proteinase K, and purified using a ChIP DNA Clean & Concentrator kit (Zymo Research, USA). AMPure XP magnetic beads (Beckman Coulter), 1.5× and 0.8×, were used to remove adapters and clean and concentrate the DNA. Failsafe polymerase (Nordic Biolabs AB, Sweden) was used for efficient library amplification.
ChIP-seq data analysis.
A custom pipeline in Python was used for ChIP-seq data analysis, data visualization, and downstream bioinformatic and computational analyses (the pipeline is available on request). The quality of raw reads generated from HiSeq 2500 was checked using FastQC (42). Raw reads were trimmed by 5 nt at both the 5′ and 3′ ends and aligned with the reference Y. pseudotuberculosis YPIII chromosome (NC_010465) and plasmid (NC_006153) using BWA (43). Aligned reads were converted to BAM files using SAMtools (44), and duplicate reads were removed with Picard (45) by deduplicate function. Peaks and RpoN binding sites are calculated in all three ChIP-seq samples with a stringent cutoff of false-discovery rate (FDR) (1E−30). Before peak calling, two biological replicates for each sample and the input (control) were merged using SAMtools. Peak calling was done using MACS (2.1.2) (46) with the following custom settings: log2FC > 1 over input, –broad –g, –broad-cutoff = 0.1. Identified peak coordinates in the genome were used to identify the probable regulated genes using the R packages ChIPpeakAnno and ChIPseeker (47, 48). De novo motif prediction was done using the bioconductor package BCRANK (49). A position-weight matrix (PWM) of the predicted motif was used to map the motif presence around the peak center using the FIMO-meme package (Linux version) (50). A region of 50 bases upstream and downstream of the peak center was used as input for searching for the PWM of the motif. The SIGffRid tool was used to predict the binding sites of other sigma factors within 200 bases upstream and downstream of each predicted RpoN binding peak center. To check the conservation of RpoN-regulated genes, cross-species homology prediction was done by comparing Y. pseudotuberculosis YPIII, V. cholerae 037, E. coli O157, and S. Typhimurium LT2 using OrthologueDB (51). Yersinia genes having orthologs in at least one of the species mentioned above were checked for previous ChIP-seq studies (24, 27, 28). Figures are plotted using the ggplot2 package in R (Linux version 4.0.2).
RNA isolation and Illumina sequencing.
Total RNA was isolated from three independent biological replicates of WT Y. pseudotuberculosis and an isogenic rpoN deletion strain at stationary phase and under virulence-inducing conditions. For isolation and purification of RNA, the TRIzol method (Ambion Life Technologies, Carlsbad, CA) and the Direct-zol RNA kit (Zymo Research, USA) were used. To remove unwanted DNA, purified RNA was treated with DNase for 30 min at room temperature according to the manufacturer’s instructions. RNA quantity was measured by Qubit (Nordic Biolabs AB, Sweden). RNA from stationary-phase bacteria was collected after 8 h of growth (OD600 of ∼2) at 26°C in LB. For virulence-inducing conditions, RNA was isolated from cells grown exponentially at 26°C, shifted to 37°C, and incubated in Ca2+-depleted medium for 3 h.For sequencing, cDNA libraries were prepared using the ScriptSeq Complete kit (Epicentre, Madison, WI, USA) according to the manufacturer’s instructions. rRNA was depleted from total RNA using a Ribo-Zero rRNA removal kit for bacteria (Epicentre, Madison, WI, USA) following the manufacturer’s protocol. The resulting cDNA libraries were purified using AMPure XP (Beckman Coulter) and quantified using an Agilent 2100 Bioanalyzer. Sequencing was done using an Illumina MiSeq.Raw RNA-seq reads were trimmed from 5′ and 3′ ends by Trimmomatic (52) until all the adapter and low-quality bases (Phred quality score >30) were removed and the sequences passed quality checking by FastQC (42). ProkSeq, a complete RNA-seq data analysis package for prokaryotes, was used for further RNA-seq data processing, quality control, and visualization (53). It includes all the tools mentioned below as well as SAMtools (44) and BEDTools (54). Reads were aligned to the Y. pseudotuberculosis YPIII chromosome (NC_010465) and plasmid (NC_006153) using Bowtie 2 (55) with the unique-mapping option. Postmapped read quality checking was done using RSeQC (56), and the numbers of reads for each gene were counted using featureCounts (57). Differential gene expression was determined using DESeq2, using the shrinkage estimation of dispersion option (lfcSrink =True) to generate more accurate estimates of differential expression in fold changes (58). Differential expression of a gene was defined using absolute log2 fold change values of ≥1.5 and false-discovery rate values of <0.05. Figures are plotted using the ggplot2 package in R (Linux version 4.0.2) and GraphPad Prism (version 8.0).
cDNA preparation and qPCR.
To validate the ChIP-seq and RNA-seq results, qPCR was performed using qPCRBio SyGreen Mix (PCR Biosystems) and a Bio-Rad CFX Maestro real-time PCR machine. Bacterial strains were grown for 2.5 h at 37°C under inducing conditions in LB (OD600 of ∼0.6) or at 26°C for 8 h, and total RNA was extracted as described above. Isolated RNA was used as the template for cDNA synthesis using a RevertAid First-Strand cDNA synthesis kit (Thermo Scientific). Experiments were done in triplicate for each mutant. Among 8 genes tested, YPK_0831 was the most stable under relevant conditions and was selected as an internal control in order to calculate the relative expression levels of tested genes, using appropriate primers (see Table S6 in the supplemental material).
Data availability.
The RNA-seq and ChIP-seq data files have been deposited in Gene Expression Omnibus (GEO) under accession numbers GSE155606, GSE155607, and GSE155608. All the computer code and pipeline used in these studies are available on request.
Authors: Kemal Avican; Anna Fahlgren; Mikael Huss; Ann Kathrin Heroven; Michael Beckstette; Petra Dersch; Maria Fällman Journal: PLoS Pathog Date: 2015-01-15 Impact factor: 6.823
Authors: Megan G Lloyd; Benjamin R Lundgren; Clayton W Hall; Luke B-P Gagnon; Thien-Fah Mah; Jennifer F Moffat; Christopher T Nomura Journal: Sci Rep Date: 2017-10-03 Impact factor: 4.379