Literature DB >> 23470992

A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium.

Lars Barquist1, Gemma C Langridge, Daniel J Turner, Minh-Duy Phan, A Keith Turner, Alex Bateman, Julian Parkhill, John Wain, Paul P Gardner.   

Abstract

Salmonella Typhi and Typhimurium diverged only ∼50 000 years ago, yet have very different host ranges and pathogenicity. Despite the availability of multiple whole-genome sequences, the genetic differences that have driven these changes in phenotype are only beginning to be understood. In this study, we use transposon-directed insertion-site sequencing to probe differences in gene requirements for competitive growth in rich media between these two closely related serovars. We identify a conserved core of 281 genes that are required for growth in both serovars, 228 of which are essential in Escherichia coli. We are able to identify active prophage elements through the requirement for their repressors. We also find distinct differences in requirements for genes involved in cell surface structure biogenesis and iron utilization. Finally, we demonstrate that transposon-directed insertion-site sequencing is not only applicable to the protein-coding content of the cell but also has sufficient resolution to generate hypotheses regarding the functions of non-coding RNAs (ncRNAs) as well. We are able to assign probable functions to a number of cis-regulatory ncRNA elements, as well as to infer likely differences in trans-acting ncRNA regulatory networks.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23470992      PMCID: PMC3632133          DOI: 10.1093/nar/gkt148

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Salmonella enterica subspecies enterica serovars Typhi (S. Typhi) and Typhimurium (S. Typhimurium) are important human pathogens with distinctly different lifestyles. S. Typhi is host-restricted to humans and causes typhoid fever. This potentially fatal systemic illness affects at least 21 million people annually, primarily in developing countries (1–3) and is capable of colonizing the gall bladder creating asymptomatic carriers; such individuals are the primary source of this human restricted infection, exemplified by the case of ‘Typhoid Mary’ (4). S. Typhimurium, conversely, is a generalist, infecting a wide range of mammals and birds in addition to being a leading cause of foodborne gastroenteritis in human populations. Control of S. Typhimurium infection in livestock destined for the human food chain is of great economic importance, particularly in swine and cattle (5,6). Additionally, S. Typhimurium causes an invasive disease in mice, which has been used extensively as a model for pathogenicity in general and human typhoid fever specifically (7). Despite this long history of investigation, the genomic factors that contribute to these differences in lifestyle remain unclear. More than 85% of predicted coding sequences are conserved between the two serovars in sequenced genomes of multiple strains (8–11). The horizontal acquisition of both plasmids and pathogenicity islands during the evolution of the salmonellae is believed to have impacted upon their disease potential. A 100 kb plasmid, encoding the spv (Salmonella plasmid virulence) genes, is found in some S. Typhimurium strains and contributes significantly towards systemic infection in animal models (12,13). S. Typhi is known to have harboured IncHI1 plasmids conferring antibiotic resistance since the 1970’s (14), and there is evidence that these strains present a higher bacterial load in the blood during human infection (15). Similar plasmids have been isolated from S. Typhimurium (16–18). Salmonella pathogenicity islands (SPI)-1 and -2 are common to both serovars and are required for invasion of epithelial cells [reviewed in (19)] and survival inside macrophages respectively (20,21). S. Typhi additionally incorporates SPI-7 and SPI-10, which contain the Vi surface antigen and a number of other putative virulence factors (22–24). Acquisition of virulence determinants is not the sole explanation for the differing disease phenotypes displayed in humans by S. Typhimurium and S. Typhi; genome degradation is an important feature of the S. Typhi genome, in common with other host-restricted serovars such as S. Paratyphi A (humans) and S. Gallinarum (chickens). In each of these serovars, pseudogenes account for 4–7% of the genome (9,25–27). Loss of function has occurred in a number of S. Typhi genes that have been shown to encode intestinal colonization and persistence determinants in S. Typhimurium (28). Numerous sugar transport and degradation pathways have also been interrupted (9) but remain intact in S. Typhimurium, potentially underlying the restricted host niche occupied by S. Typhi. In addition to its history as a model organism for pathogenicity, S. Typhimurium has recently served as a model organism for the elucidation of non-coding RNA (ncRNA) function (29). These include cis-acting switches, such as RNA-based temperature and magnesium ion sensors (30,31), together with a host of predicted metabolite-sensing riboswitches. Additionally, a large number of trans-acting small RNAs (sRNAs) have been identified within the S. Typhimurium genome (32), some with known roles in virulence (33). These sRNAs generally control a regulon of mRNA transcripts through an antisense binding mechanism mediated by the protein Hfq in response to stress. The functions of these molecules have generally been explored in either S. Typhimurium or Escherichia coli, and it is unknown how stable these functions and regulons are over evolutionary time (34). Transposon mutagenesis has previously been used to assess the requirement of particular genes for cellular viability. The advent of next-generation sequencing has allowed simultaneous identification of all transposon insertion sites within libraries of up to 1 million independent mutants (35–38), enabling us to answer the basic question of which genes are required for in vitro growth with extremely fine resolution. By using transposon mutant libraries of this density, which in S. Typhi represents on average >80 unique insertions per gene (35), shorter regions of the genome can be interrogated, including ncRNAs (38). In addition, once these libraries exist, they can be screened through various selective conditions to further reveal which functions are required for growth/survival. Using Illumina-based transposon-directed insertion-site sequencing [TraDIS (35)] with large mutant libraries of both S. Typhimurium and S. Typhi, we investigated whether these Salmonellae require the same protein-coding and ncRNA gene sets for competitive growth under laboratory conditions, and whether there are differences that reflect intrinsic differences in the pathogenic niches these bacteria inhabit.

MATERIALS AND METHODS

Strains

S. Typhimurium strain SL3261 was used to generate the large transposon mutant library, and contains a deletion relative to the parent strain SL1344. The 2166 bp deletion ranges from 153 bp within aroA (normally 1284 bp) to the last 42 bp of cmk, forming two pseudogenes and deleting the intervening gene SL0916 completely. For comparison, we used our previously generated S. Typhi Ty2 transposon library (35).

Annotation

For S. Typhimurium strain SL3261, we used feature annotations drawn from the SL1344 genome (EMBL-Bank accession FQ312003.1), ignoring the deleted aroA, ycaL and cmk genes. We re-analysed our S. Typhi Ty2 transposon library with features drawn from an updated genome annotation (EMBL-Bank accession AE014613.1). We supplemented the EMBL-Bank annotations with ncRNA annotations drawn from Rfam 10.1 (39), Sittka et al. (40), Chinni et al. (41), Raghavan et al. (42) and Kröger et al. (32). Selected protein-coding gene annotations were supplemented using the HMMER webserver (43) and Pfam (44).

Creation of S. Typhimurium transposon mutant library

S. Typhimurium was mutagenized using a Tn5-derived transposon as described previously (35). Briefly, the transposon was combined with the EZ-Tn5 transposase (Epicenter, Madison, USA) and electroporated into S. Typhimurium. Transformants were selected by plating on LB agar containing 15 μg/ml kanamycin and harvested directly from the plates following overnight incubation. A typical electroporation experiment generated a batch of between 50 000 and 150 000 individual mutants. Ten batches were pooled together to create a mutant library comprising ∼930 000 transposon mutants.

DNA manipulations and sequencing

Genomic DNA was extracted from the library pool samples using tip-100 g columns and the genomic DNA buffer set from Qiagen (Crawley, UK). DNA was prepared for nucleotide sequencing as described previously (35). Before sequencing, a 22-cycle PCR was performed as previously described (35). Sequencing took place on a single end Illumina flowcell using an Illumina GAII sequencer, for 36 cycles of sequencing, using a custom sequencing primer and 2× Hybridization Buffer (35). The custom primer was designed such that the first 10 bp of each read was transposon sequence.

Sequence analysis

The Illumina FASTQ sequence files were parsed for 100% identity to the 5′ 10 bp of the transposon (TAAGAGACAG). Sequence reads, which matched, were stripped of the transposon tag and subsequently mapped to the S. Typhimurium SL1344 or S. Typhi Ty2 chromosomes using Maq version maq-0.6.8 (45). Approximately 12 million sequence reads were generated from the sequencing run, which used two lanes on the Illumina flowcell. Precise insertion sites were determined using the output from the Maq mapview command, which gives the first nucleotide position to which each read mapped. The number and frequency of insertions mapping to each nucleotide in the appropriate genome was then determined.

Statistical analysis of required genes

The number of insertion sites for any gene is dependent on its length; therefore, the values were made comparable by dividing the number of insertion sites by the gene length, giving an ‘insertion index’ for each gene. As before (35), the distribution of insertion indices was bimodal, corresponding to the required (mode at 0) and non-required models. We fitted gamma distributions for the two modes using the R MASS library (http://www.r-project.org). Log2-likelihood ratios (LLR) were calculated between the required and non-required models, and we called a gene required if it had an LLR of less than −2, indicating it was at least four times more likely according to the required model than the non-required model. ‘Non-required’ genes were assigned for an LLR of >2. Genes falling between the two thresholds were considered ‘ambiguous’ for the purpose of this analysis. This procedure lead to genes being called as required in S. Typhimurium when their insertion index was <0.020 and ambiguous between 0.020 and 0.027. The equivalent cut-offs for the S. Typhi library are 0.0147 and 0.0186, respectively. We calculated a P-value for the observed number of insertion sites per gene using a Poisson approximation with rate R = N/G where N is the number of unique insert sites (549 086) and G is the number of bases in the genome (4 878 012). The P-value for at least X consecutive bases without an insert site is e(-RX), giving a 5% cut-off at 27 bp and a 1% cut-off at 41 bp. For every gene g with n reads observed in S. Typhi and n reads observed in S. Typhimurium, we calculated the log2 fold change ratio S(n]. The correction of 100 reads smoothes out the high scores for genes with very low numbers of observed reads. We fitted a normal model to the mode +/− 2 sample standard deviations of the distribution of S and calculated P-values for each gene according to the fit. We considered genes with a P-value of ≤0.05 under the normal model to be uniquely required by one serovar.

RESULTS AND DISCUSSION

TraDIS assay of every Salmonella Typhimurium protein-coding gene

Approximately 930 000 mutants of S. Typhimurium were generated using a Tn5-derived transposon. In all, 549 086 unique insertion sites were recovered from the mutant library using short-read sequencing with transposon-specific primers. This is a substantially higher density than the 371 775 insertions recovered from S. Typhi previously (35). The S. Typhimurium library contains an average of one insertion every 9 bp or >100 unique inserts per gene (Figure 1). The large number of unique insertion sites allowed every gene to be assayed; assuming random insertion across the genome, a region of 41 bp without an insertion was statistically significant (P < 0.01). As previously noted in S. Typhi, the distribution of length-normalized insertions per gene is bimodal (see Supplementary Figure S1), with one mode at 0. We interpret genes falling in to the distribution around this mode as being required for competitive growth within a mixed population under laboratory conditions (hereafter ‘required’). Of these, 57 contained no insertions whatsoever and were mostly involved in core cellular processes (see Table 1, Supplementary Data Set).
Figure 1.

Genome-wide transposon mutagenesis of S. Typhimurium. Circular plot showing gene content, distribution of required genes and insertion density along the S. Typhimurium chromosome. The outer scale is marked in megabases. Circular tracks range from 1 (outer track) to 6 (inner track). Track 1, all forward-strand genes (colour-coded according to function: dark blue, pathogenicity/adaptation; black, energy metabolism; red, information transfer; dark green, membranes/surface structures; cyan, degradation of macromolecules; purple, degradation of small molecules; yellow, central/intermediary metabolism; light blue, regulators; pink, phage/IS elements; orange, conserved hypothetical; pale green, unknown function; brown, pseudogenes.); track 2, all reverse-strand genes (colour-coded as for forward-strand genes); track 3, S. Typhimurium required genes (red); track 4, 56 genes required by S. Typhimurium, but not by S. Typhi (dark blue, see also Table 1); track 5, transposon insertion density; track 6, GC bias [(G − C)/(G + C)]), khaki indicates values >1; purple <1.

Table 1.

Core gene functions in S. Typhimurium

Biological processSub-processRequired genesNon-required genes
Cell divisionftsALKQWYZ, minE, mukB, SL2391ftsHJNX*, minCD, sdiA, cedA, sulA
DNA replicationPolymerases I, II and IIIdnaENQX, holABpolAB, holCDE
SupercoilinggyrAB, parCE
Primosome-associateddnaBCGT, priA, ssbpriB*C, rep
TranscriptionRNA polymeraserpoABC
Sigma, elongation, anti- and termination factorsnusBG, rpoDH, rhonusA, rpoENS
TranslationtRNA-synthetasesalaS, argS, asnS, aspS, cysS, glnS, gltX, glyQS, hisS, ileS, leuS, lysS, metG, pheST, proS, serS, thrS, tyrS, valS,trpS, trpS2
Ribosome componentsrplBCDEFJKLMNOPQRSTUVWXY, rpmABCDHI, rpsABCDEFGHIJKLMNPQSTrplAI, rpmEE2, rpmFGJJ2, rpsOR*U*V
Initiation, elongation and peptide chain release factorsfusA, infABC, prfAB, tsf, yrdCefp, prfCH, selB, tuf
Biosynthetic pathways
PeptidoglycanmurABCDEFGI,ddl, ddlA
Fatty acidsaccABCD, fabABDGHIZ

Protein-coding genes providing fundamental biological functions in S. Typhimurium. Genes in bold are required in S. Typhi (LLR between required and non-required models less than −2; see ‘Materials and Methods’ section). Asterisk indicates genes ambiguous in S. Typhimurium, having a LLR between −2 and 2.

Genome-wide transposon mutagenesis of S. Typhimurium. Circular plot showing gene content, distribution of required genes and insertion density along the S. Typhimurium chromosome. The outer scale is marked in megabases. Circular tracks range from 1 (outer track) to 6 (inner track). Track 1, all forward-strand genes (colour-coded according to function: dark blue, pathogenicity/adaptation; black, energy metabolism; red, information transfer; dark green, membranes/surface structures; cyan, degradation of macromolecules; purple, degradation of small molecules; yellow, central/intermediary metabolism; light blue, regulators; pink, phage/IS elements; orange, conserved hypothetical; pale green, unknown function; brown, pseudogenes.); track 2, all reverse-strand genes (colour-coded as for forward-strand genes); track 3, S. Typhimurium required genes (red); track 4, 56 genes required by S. Typhimurium, but not by S. Typhi (dark blue, see also Table 1); track 5, transposon insertion density; track 6, GC bias [(G − C)/(G + C)]), khaki indicates values >1; purple <1. Core gene functions in S. Typhimurium Protein-coding genes providing fundamental biological functions in S. Typhimurium. Genes in bold are required in S. Typhi (LLR between required and non-required models less than −2; see ‘Materials and Methods’ section). Asterisk indicates genes ambiguous in S. Typhimurium, having a LLR between −2 and 2. There was a bias in the frequency of transposon insertion towards the origin of replication. This likely occurred as the bacteria were in exponential growth phase immediately before transformation with the transposon. In this phase of growth, multiple replication forks would have been initiated, meaning genes closer to the origin were in greater copy number and hence more likely to be a target for insertion. We also observed a bias for transposon insertions in A + T rich regions, as was previously observed in the construction of an S. Typhi mutant library (35). However, the insertion density achieved is sufficient to discriminate between required and non-required genes easily. As was first seen in S. Typhi (35), we observed transposon insertions into genes upstream of required genes in the same operon, suggesting that most insertions do not have polar effects leading to the inactivation of downstream genes. Analysis of the S. Typhimurium mutant library allowed us to identify 353 coding sequences required for growth under laboratory conditions, and 4112 non-required coding sequences (see Supplementary Data Set). We were unable to assign 65 genes to either the required or non-required category. Sixty of these genes, which we will refer to as ‘ambiguous’, had LLRs between −2 and 2. The final five unassigned genes had lengths <60 bases, and they were removed from the analysis. All other genes contained enough insertions or were of sufficient length to generate credible LLR scores. Thus, every gene was assayed, and we were able to draw conclusions for 98.7% of the coding genome in a single sequencing run (Figure 1).

Cross-species comparison of genes required for growth

Gene essentiality has previously been assayed in Salmonella using insertion-duplication mutagenesis (46). Knuth et al. estimated 490 genes are essential to growth in clonal populations, though 36 of these have subsequently been successfully deleted (47). Although TraDIS assays gene requirements after a brief period of competitive growth on rich media, we identify a smaller required set than Knuth et al. of ∼350 genes in each serovar, closer to current estimates of ∼300 essential genes in E. coli (48). To demonstrate that TraDIS does identify genes known to have strong effects on growth, as well as to test our predictive power for determining gene essentiality, we compared our required gene sets in S. Typhimurium and S. Typhi to essential genes determined by systematic single-gene knockouts in the E. coli K-12 Keio collection (48). We identified orthologous genes in the three data sets by best reciprocal FASTA hits exhibiting >30% sequence identity for the amino acid sequences. Required orthologous genes identified in this manner share a significantly higher average percentage sequence identity with their E. coli counterparts than expected for a random set of orthologues, at ∼94% identity as compared with ∼87% for all orthologous genes. In 100 000 randomly chosen gene sets of the same size as our required set, we did not find a single set where the average shared identity exceeded 90%, indicating that required genes identified by TraDIS are more highly conserved at the nucleotide level than other orthologous protein-coding sequences. Baba et al. (48) have defined an essentiality score for each gene in E. coli based on evidence from four experimental techniques for determining gene essentiality: targeted knockouts using λ-red mediated homologous recombination (48), genetic footprinting (49,50), large-scale chromosomal deletions (51) and transposon mutagenesis (52). Scores range from −4 to 3, with negative scores indicating evidence for non-essentiality and positive scores indicating evidence for essentiality. Comparing the overlap between essential gene sets in E. coli, S. Typhi and S. Typhimurium, we find a set of 228 E. coli genes, which have a Keio essentiality score of at least 0.5 (i.e. there is evidence for gene essentiality; see Figure 2) that have TraDIS-predicted required orthologues in both S. Typhi and S. Typhimurium, constituting ∼85% of E. coli genes with evidence for essentiality indicating that gene requirements are largely conserved between these genera. Including orthologous genes that are only predicted to be essential by TraDIS in S. Typhi or S. Typhimurium raises this figure to nearly 93%. The majority of shared required genes between all three bacteria are responsible for fundamental cell processes, including cell division, transcription and translation. A number of key metabolic pathways are also represented, such as fatty acid and peptidoglycan biosynthesis (Table 1). A recent study in the alphaproteobacteria Caulobacter crescentus reported 210 shared essential genes with E. coli, despite C. crescentus sharing less than a third as many orthologous genes with E. coli as Salmonella serovars (38). This suggests the existence of a shared core of ∼200 essential proteobacterial genes, with the comparatively rapid turnover of 150–250 ‘non-core’ lineage-specific essential genes.
Figure 2.

Comparison of required genes. Venn diagrams showing (A) the overlap of all genes (outer circles, light colours) and required genes (inner circles, dark colours) between S. Typhimurium and S. Typhi (excluding genes required in one serovar only, which do not have significantly different read-counts). Black numbers refer to all genes, white numbers to required genes. (B) the overlap of all required genes between S. Typhimurium (blue), S. Typhi (green) and E. coli (purple). White numbers refer to genes with Keio essentiality scores ≥0.5.

Comparison of required genes. Venn diagrams showing (A) the overlap of all genes (outer circles, light colours) and required genes (inner circles, dark colours) between S. Typhimurium and S. Typhi (excluding genes required in one serovar only, which do not have significantly different read-counts). Black numbers refer to all genes, white numbers to required genes. (B) the overlap of all required genes between S. Typhimurium (blue), S. Typhi (green) and E. coli (purple). White numbers refer to genes with Keio essentiality scores ≥0.5. If we make the simplistic assumption that gene essentiality should be conserved between E. coli and Salmonella, we can use the overlap of our predictions with the Keio essential genes to provide an estimate of our TraDIS libraries’ accuracy for predicting that a gene will be required in a clonal population. Of the 2632 orthologous E. coli genes, which have a Keio essentiality score of less than −0.5 (i.e. there is evidence for gene non-essentiality), only 33 are predicted to be required by TraDIS in both Salmonella serovars. S. Typhi contains the largest number of genes predicted by TraDIS to be required with E. coli orthologues with negative Keio essentiality scores. However, even if we assume these are all incorrect predictions of gene essentiality, this still gives a gene-wise false positive rate (FPR) of ∼2.7% (81 of 2981 orthologues) and a positive predictive value (PPV) of ∼75% (247 with essentiality scores ≥0.5 of 328 predictions with some Keio essentiality score). Under these same criteria, the S. Typhimurium data set has a lower gene-wise FPR of ∼1.6% (51 of 3122 orthologues) and a higher PPV of ∼82% (234 of 285 predictions as before), as we would expect given the library’s higher insertion density. In reality, these FPRs and PPVs are only estimates; genes that are not essential in E. coli may become essential in the different genomic context of Salmonella serovars and vice versa, particularly in the case of S. Typhi where wide-spread pseudogene formation has eliminated potentially redundant pathways (26,27). Additionally, TraDIS will naturally over-predict essentiality in comparison with targeted knockouts, as our library creation protocol necessarily contains a short period of competitive growth between mutants during the recovery from electro-transformation and selection. As a consequence, genes that cause major growth defects, but not necessarily a complete lack of viability in clonal populations, may be reported as ‘required’.

Serovar-specific genes required for growth

Many of the required genes present in only one serovar encoded phage repressors, for instance the cI proteins of Fels-2/SopE and ST35 (see Supplementary Tables S2 and S3). Repressors maintain the lysogenic state of prophage, preventing transcription of early lytic genes (53). Transposon insertions into these genes will relieve this repression and trigger the lytic cycle, resulting in cell death, and consequently mutants are not represented in the sequenced library. This again broadens the definition of ‘required’ genes; such repressors may not be required for cellular viability in the traditional sense, but once present in these particular genomes, their maintenance is required for continued viability as long as the rest of the phage remains intact. Serovars Typhimurium and Typhi both contain eight apparent large phage-derived genomic regions (54,55). We were able to identify required repressors in all the intact lambdoid, P2-like and P22-like prophage in both genomes, including Gifsy-1, Gifsy-2 and Fels-2/SopE (see Supplementary Tables S2 and S3). With the exception of the SLP203 P22-like prophage in S. Typhimurium, all of these repressors lack the peptidase domain of the classical lambda repressor gene cI. This implies that the default anti-repression mechanism of Salmonella prophage may be more similar to a trans-acting mechanism recently discovered in Gifsy phage (56) than to the phage lambda repressor’s RecA-induced self-cleavage mechanism. We are also able to confirm that most phage remnants and fusions contained no active repressors, with the exception of the SLP281 degenerate P2-like prophage in S. Typhimurium. This degenerate prophage contains both intact replication and integration genes, but appears to lack tail and head proteins, suggesting it may depend on another phage for production of viral particles. Both genomes also encode P4-like satellite prophage, which rely on ‘helper’ phage for lytic functions and use a complex antisense-RNA based regulation mechanism for decision pathways regarding cell fate (57) using structural homologs of the IsrK (58) and C4 ncRNAs (59), known as seqA and CI RNA in the P4 literature, respectively. Although the mechanism of P4 lysogenic maintenance is not known, the IsrK-like ncRNAs of two potentially active P4-like prophage in S. Typhi are required under TraDIS. This sequence element has previously been shown to be essential for the establishment of the P4 lysogenic state (60), and we predict based on our observations that it may be necessary for lysogenic maintenance as well. The fact that some lambdoid prophage in S. Typhimurium encode non-coding genes structurally similar to the IsrK-C4 immunity system of P4 raises the possibility that these systems may be acting as a defense mechanism of sorts, protecting the prophage from predatory satellite phage capable of co-opting its lytic genes. In addition to repressors, 4 prophage cargo genes in S. Typhimurium and one in S. Typhi are required (See Tables 2 and 3; Supplementary Tables S2 and S3). The S. Typhimurium prophage cargo genes encode a PhoPQ-regulated protein, a protein predicted to be involved in natural transformation, an endodeoxyribonuclease and a hypothetical protein. The S. Typhi prophage cargo gene encodes a protein containing the DNA-binding HIRAN domain (62) believed to be involved in the repair of damaged DNA. These warrant further investigation, as they are genes that have been recently acquired and become necessary for survival in rich media.
Table 2.

Genes uniquely required in S. Typhimurium

Ty insertsTy readsSL insertsSL readsSL IDSL gene lengthTy IDTy gene lengthNameFunction
No orthologue in S. Typhi
18123SL07421269putative cation transporter
980SL0830516conserved hypothetical protein
421SL0831855putative electron transfer flavoprotein (beta subunit)
00SL0950323predicted bacteriophage protein, potential phage repressor Gifsy-2
1175SL1179789envFlipoprotein
318SL1480249antitoxin Phd_YefM, type II toxin-antitoxin system
432SL1527264ydcXputative inner membrane protein
13SL1560717putative membrane protein
750SL1601859putative transcriptional regulator (pseudogene)
436SL1799201bacteriophage encoded pagK (phoPQ-activated protein)
522SL1830A434conserved hypothetical protein (pseudogene)
327SL1967677predicted bacteriophage protein, potential phage repressor SLP203
115SL2045A63yoeIshort ORF
17107SL2066900rfbJCDP-abequose synthase
334SL2549209endodeoxyribonuclease
4149SL2593449putative DNA-binding protein, potential phage repressor Gifsy-1 SLP272
37SL2633846putative repressor protein, phage SLP281
21SL2695978smfputative competence protein
539SL4132291hypothetical protein
545SL4354A303conserved hypothetical protein
Present in S. Typhi but required only in S. Typhimurium*
    36474526SL0032441t0033306putative transcriptional regulator
    713491148SL0623642t2232576lipBlipoate-protein ligase B
    15135461064SL0702897t2156894putative glycosyl transferase
    1943007961SL07031134t21551134galactosyltransferase
    23134991567SL07061779t21521780putative glycosyltransferase, cell wall biogenesis
    84104124SL0707834t2151834putative glycosyltransferase, cell wall biogenesis
    493671470SL07221569t21361569cydAcytochrome d ubiquinol oxidase subunit I
    741613522SL1069693t1789693putative secreted protein
    2019911SL1203150t1146156hypothetical protein
    2029015SL1264315t1209315putative membrane protein
    84384626SL1327402t1261384spiCputative pathogenicity island 2 secreted effector protein
    66769535SL1331270t1265327sseAT3SS chaperone
    3630725SL1341228t1275228ssaHputative pathogenicity island protein
    4740713SL1342249t1276249ssaIputative pathogenicity island protein
    1443197514SL1343750t1277750ssaJputative pathogenicity island lipoprotein
    63847526SL1354267t1288267ssaSputative type III secretion protein
    73762444SL1355780t1289780ssaTputative type III secretion protein
    302261248SL1386693t1322693rnfE/ydgQElectron transport complex protein rnfE
    265333729165SL14731557t14631557pqaAPhoPQ-activated protein
    85765635SL1532951t1511951sifBputative virulence effector protein
    2215616174SL15611227t1534a141sseJSalmonella translocated effector protein (SseJ)
    11916391044SL1563762t1536762putative periplasmic amino acid-binding protein
    1072440544SL1564648t1537648putative ABC amino acid transporter permease
    183164620118SL16281355t16121364hypothetical protein
    2317715SL1659183t1640183yciGconserved hypothetical protein
    7861716104SL16841014t16641014Hnrputative regulatory protein
    37277425SL1785396t1022396conserved hypothetical protein
    1662823927SL1793915t1016915pagOinner membrane protein
    28311322SL1794159t1015159putative inner membrane protein
    2315514SL1823972t0988972msbBlipid A acyltransferase
    604021158SL20641002t07861002rfbVputative glycosyl transferase
    87524759SL20651293t07851299rfbXputative O-antigen transporter
    665591374SL2069774t0780774rfbFglucose-1-phosphate cytidylyltransferase
    41204514SL38281830t36581830glmSglucosamine-fructose-6-phosphate aminotransferase
    27288523SL4250288t4220288putative GerE family regulatory protein
    14826331689SL4251876t4221876araC family regulatory protein

Genes determined to be uniquely required in S. Typhimurium. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene. Shaded rows indicate genes shown to be H-NS repressed in (61).

*P-value (associated with log2 read ratio) < 0.05.

asseJ is a pseudogene in S. Typhi.

Table 3.

Genes uniquely required in S. Typhi

SL insertsSL readsTy insertsTy readsTy IDTy gene lengthSL IDSL gene lengthNameFunction
No orthologue in S. Tm
15t1332132malYpseudogene
232t1920405possible repressor protein, prophage 10/Gifsy-2
212t3157165conserved hypothetical protein
212t3166228spurious ORF annotation overlapping the RnaseP/M1 RNA
6196t3402570cIrepressor protein, cs 73 prophage
458t3415741HIRAN-domain family gene, potential DNA repair
16t4531150hypothetical secreted protein
Present in S. Typhimurium but required only in S. Typhi*
    19917921859t00951287SL00931287surAsurvival protein SurA precursor
    45498322t0123459SL0119459yabB/mraZconserved hypothetical protein
    1205891132t02031281SL02031281hemLglutamate-1-semialdehyde 2,1-aminomutase
    123982225t02241353SL02241353yaeL/rsePZinc metallopeptidase
    67452114t0270576SL2604576rpoERNA polymerase sigma-E factor
    14076000t05872286SL22462286nrdAribonucleoside-diphosphate reductase 1 alpha chain
    1136411542t21402802SL07182802sucA2-oxoglutarate dehydrogenase E1 component
    1167531336t21771641SL06801641pgmphosphoglucomutase
    80542915t22761008SL05801008fepDferric enterobactin transport protein FepD
    9359122t2277990SL0579990fepGferric enterobactin transport protein FepG
    6450856t2278795SL0578795fepCferric enterobactin transport ATP-binding protein FepC
    201112912116t24102355SL04442355lonLon protease
    95518820t27301062SL28091062recAarecA protein
    1357191639t29961992SL30521947tktAtransketolase
    7635839t31201434SL31731434rfaEADP-heptose synthase
    2131976650t32651071SL33211071degSserine protease
    43448310t3326606SL3925606yigPconserved hypothetical protein
    1245711736t33842025SL38722025repATP-dependent DNA helicase
    1751208621t36212787SL39472787polADNA polymerase I
    117797913t38081047SL36771047waaFADP-heptose-LPS heptosyltransferase II
    17616281459t41531080SL41831080alralanine racemase
    14011271038t4411951SL4294951miaAtRNA delta-2-isopentenylpyrophosphate transferase

Genes determined to be uniquely required in S. Typhi. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene.

*P-value (associated with log2 read ratio) < 0.05.

aThe assignment of recA as a required gene has been described previously (35), but briefly is believed to be due to the presence of the priC pseudogene in Typhi.

Genes uniquely required in S. Typhimurium Genes determined to be uniquely required in S. Typhimurium. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene. Shaded rows indicate genes shown to be H-NS repressed in (61). *P-value (associated with log2 read ratio) < 0.05. asseJ is a pseudogene in S. Typhi. Genes uniquely required in S. Typhi Genes determined to be uniquely required in S. Typhi. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene. *P-value (associated with log2 read ratio) < 0.05. aThe assignment of recA as a required gene has been described previously (35), but briefly is believed to be due to the presence of the priC pseudogene in Typhi. To compare differences between requirements for orthologous genes in both serovars, we calculated log-fold read ratios to eliminate genes, which were classified differently in S. Typhi and S. Typhimurium but did not have significantly different read densities (see ‘Materials and Methods’ section.) Even after this correction, 36 S. Typhimurium genes had a significantly lower frequency of transposon insertion compared with the equivalent genes in S. Typhi (P < 0.05), including four encoding hypothetical proteins (Table 2). This indicates that these gene products play a vital role in S. Typhimurium, but not in S. Typhi when grown under laboratory conditions. A major difference between the two serovars is in the requirement for genes involved in cell wall biosynthesis (see Figure 3). A set of four genes (SL0702, SL0703, SL0706 and SL0707) in an operonic structure putatively involved in cell wall biogenesis is required in S. Typhimurium, but not in S. Typhi. The protein encoded by SL0706 is a pseudogene in S. Typhi (Ty2 unique ID: t2152) owing to a 1 bp deletion at codon 62 that causes a frameshift (Figure 4a). This operon contains an additional two pseudogenes in S. Typhi (t2154 and t2150), as well as a single different pseudogene (SL0700) in S. Typhimurium, indicating that this difference in gene requirements reflects the evolutionary adaptation of these serovars to their respective niches. Similarly, four genes (rfbV, rfbX, rfbJ and rfbF) within an O-antigen biosynthetic operon are required by S. Typhimurium, but not S. Typhi. There appears to have been a shuffling of O-antigen biosynthetic genes since the divergence between the two serovars, and rfbJ, encoding a CDP-abequose synthase, has been lost from S. Typhi altogether. These broader requirements for cell wall-associated biosynthetic and transporter genes suggest that surface structure biogenesis is of greater importance in S. Typhimurium.
Figure 3.

Comparison of cell surface operon structure and requirements. Diagram illustrating cell surface operons with different requirement patterns in S. Typhimurium and S. Typhi. The top figure is of an uncharacterized operon putatively involved in cell wall biogenesis, whereas the bottom figure shows a portion of the rfb operon involved in O-antigen biosynthesis. Plots along the top and bottom of each figure show insertions in S. Typhimurium and S. Typhi, respectively, with read depth on the y-axis with a maximum cut-off of 100 reads. Genes in blue are required in S. Typhimurium, genes in white are pseudogenes; others are in grey. Grey rectangles represent BLAST hits between orthologous genes, with percentage nucleotide identity coloured on the scale to the right of each figure.

Figure 4.

Proposed differences in sRNA utilization. Diagram illustrating inferred required sRNA regulatory networks under TraDIS. Molecules required in S. Typhi are in black and in S. Typhimurium are in white. RseA, in black/grey check, is ambiguous in S. Typhi. Non-required molecules are in grey. Diamonds indicate sRNAs, circles regulatory proteins, ovals proteases, oblong shapes are membrane-anchored proteins, and rounded squares are outer membrane porins.

Comparison of cell surface operon structure and requirements. Diagram illustrating cell surface operons with different requirement patterns in S. Typhimurium and S. Typhi. The top figure is of an uncharacterized operon putatively involved in cell wall biogenesis, whereas the bottom figure shows a portion of the rfb operon involved in O-antigen biosynthesis. Plots along the top and bottom of each figure show insertions in S. Typhimurium and S. Typhi, respectively, with read depth on the y-axis with a maximum cut-off of 100 reads. Genes in blue are required in S. Typhimurium, genes in white are pseudogenes; others are in grey. Grey rectangles represent BLAST hits between orthologous genes, with percentage nucleotide identity coloured on the scale to the right of each figure. Proposed differences in sRNA utilization. Diagram illustrating inferred required sRNA regulatory networks under TraDIS. Molecules required in S. Typhi are in black and in S. Typhimurium are in white. RseA, in black/grey check, is ambiguous in S. Typhi. Non-required molecules are in grey. Diamonds indicate sRNAs, circles regulatory proteins, ovals proteases, oblong shapes are membrane-anchored proteins, and rounded squares are outer membrane porins. We also identified seven genes from the shared pathogenicity island SPI-2 that appear to contain few or no transposon insertions only in S. Typhimurium under laboratory conditions. These genes (spiC, sseA and ssaHIJT) are thought to encode components of the SPI-2 type III secretion system apparatus (T3SS) (63). In addition, the effector genes sseJ and sifB, whose products are secreted through the SPI-2-encoded T3SS (64,65), also fell into the ‘required’ category in S. Typhimurium alone. All of these genes display high A + T nucleotide sequence and have been previously shown (in S. Typhimurium) to be strongly bound by the nucleoid-associated protein H-NS, encoded by hns (61,66). Therefore, rather than being ‘required’, it is instead possible that access for the transposon was sufficiently restricted that very few insertions occurred at these sites. In further support of this hypothesis, a comparison of the binding pattern of H-NS detected in studies using S. Typhimurium LT2 with the TraDIS results from the SPI-2 locus indicated that high regions of H-NS enrichment correlated well with both the ssa genes described here and with sseJ (61,66) (see Supplementary Figure S1). An earlier study also suggests that high-density DNA-binding proteins can block Mu, Tn5 and Tn10 insertion (67); however, a genome-wide study of the effects of H-NS binding on transposition would be necessary to confirm this effect. Indeed, the generation of null S. Typhimurium mutants in sseJ and sifB, as well as many others generated at the SPI-2 locus suggest that these genes are not truly a requirement for growth in this serovar (65,68–70). Although this is a reminder that the interpretation of gene requirement needs to be made with care, the effect of H-NS on transposon insertion is not genome-wide. If this were the case, there would be an under-representation of transposon mutants in high A + T regions (known for H-NS binding), which is not what we observed. In total, only 21 required genes fall into the ‘hns-repressed’ category described in Navarre et al. (61) (see Table 2; Supplementary Table S1); the remainder (almost 400) contained sufficient transposon insertions to conclude they were non-required. In addition, all SPI-1 genes that encode another T3SS and are of high A + T content were also found to be non-required. This phenomenon was not observed in S. Typhi, possibly because the strain used harbours the pHCM1 plasmid, which encodes the H-NS-like protein sfh and has been shown to affect H-NS binding (71,72). Twenty-two S. Typhi genes had a significantly lower frequency of transposon insertion compared with orthologues in S. Typhimurium (P < 0.05), indicating that they are required only in S. Typhi for growth under laboratory conditions (Table 3), including the fepBDGC operon. This indicates a requirement for ferric [Fe(III)] rather than ferrous [Fe(II)] iron. This can be explained by the presence of Fe(III) in the bloodstream, where S. Typhi can be found during typhoid fever (15). These genes function to recover the ferric chelator enterobactin from the periplasm, acting with a number of proteins known to aid the passage of this siderophore through the outer membrane (73). It has long been noted that aroA mutants of S. Typhi, deficient in their ability to synthesize enterobactin, exhibit severe growth defects on complex media, whereas similar mutants of S. Typhimurium grow normally under the same conditions (74), though the mechanism has not been clear. Our results suggest that this difference in growth of aroA mutants is caused by a requirement for iron uptake through the fep system in S. Typhi. During host adaptation, S. Typhi has accumulated pseudogenes in many iron transport and response systems (27), presumably because they are not necessary for survival in the niche S. Typhi occupies in the human host, which may have led to this dependence on fep genes. In contrast, S. Typhimurium generally causes intestinal rather than systemic infection and is able to use a wider range of iron sources, including Fe(II), a soluble form of iron present under anaerobic conditions such as those found in the intestine (75).

TraDIS provides resolution sufficient to evaluate ncRNA contributions to fitness

Under a Poisson approximation to the transposon insertion process, a region of 41 (in S. Typhimurium) or 60 bases (in S. Typhi) has only a 1% probability of not containing an insertion by chance. NcRNAs tend to be considerably shorter than their protein-coding counterparts, but this gives us sufficient resolution to assay most of the non-coding complement of the Salmonella genome. As a proof of principle, we performed an analysis of the best-understood class of small ncRNAs, the tRNAs. Francis Crick hypothesized that a single tRNA could recognize more than one codon through wobble recognition (76), where a non-canonical G-U base pair is formed between the first (wobble) position of the anticodon and the third nucleotide in the codon. As a result, some codons are covered by multiple tRNAs, whereas others are covered non-redundantly by a single tRNA. We expect that singleton wobble-capable tRNAs, i.e. wobble tRNAs which recognize a codon uniquely, will be required. In addition, we inferred the requirement for other tRNAs through the non-redundant coverage of their codons and used this to benchmark our ability to use TraDIS to reliably interrogate short genomic intervals. The S. Typhi and S. Typhimurium genomes encode 78 and 85 (plus one pseudogene) tRNAs, respectively, with 40 anticodons, as identified by tRNAscan-SE (77). In S. Typhi, 10 of 11 singleton wobble tRNAs are predicted to be required or ambiguous, compared with 16 tRNAs below the ambiguous LLR cut-off overall (significant enrichment at the 0.05 level, two-tailed Fisher’s exact test P-value: 6.4e-08). Similarly in S. Typhimurium, 9 of 11 singleton wobble tRNAs are required or ambiguous compared with 15 required or ambiguous tRNAs overall, again showing a significant enrichment of required tRNAs in this subset (Fisher’s exact test P-value: 5.2e-07). The one singleton wobble tRNA, which is consistently not required in both serovars is the tRNA-Pro(GGG), which occurs within a four-member codon family. It has previously been shown in S. Typhimurium that tRNA-Pro(UGG) can read all four proline codons in vivo owing to a cmo5U34 modification to the anticodon, obviating the need for a functional tRNA-Pro(GGG) (78) and making this tRNA non-required. The other non-required singleton wobble tRNA in S. Typhimurium, tRNA-Leu(GAG), is similarly a member of a four-member codon family. We predict tRNA-Leu(TAG) may also be capable of recognizing all four leucine codons in this serovar; such a leucine ‘four-way wobble’ has been previously inferred in at least one other bacterial species (79,80). Of the six required non-wobble tRNAs in each serovar, four are shared. These include two non-wobble singleton tRNAs covering codons uniquely, as well as a tRNA with the ATG anticodon, which is post-transcriptionally modified by the required protein mesJ/tilS to recognize the isoleucine codon ATA (80). An additional two required tRNAs in both serovars, one shared and one with a differing anticodon, contain Gln anticodons and are part of a polycistronic tRNA operon containing other required tRNAs. This operon is conserved in E. coli with the exception of an additional tRNA-Gln at the 3′ end that has been lost in the Salmonella lineage. It is possible that transposon insertions early in the operon may interfere with processing of the polycistronic transcript in to mature tRNAs. Finally, we do not observe insertions in a tRNA-Met and a tRNA-Val in S. Typhi and S. Typhimurium, respectively. Using this analysis of the tRNAs, we estimate a worst-case PPV for these short molecules (∼76 bases) at 81%, in line with our previous estimates for conserved protein-coding genes, and a FPR of <4%, higher than for protein-coding genes but still well within the typical tolerance of high-throughput experiments. This assumes that the ‘required’ operonic tRNA-Glns and the serovar-specific tRNA-Met and tRNA-Val are all false positives; it is not clear that this is in fact the case. Surveying the shared required ncRNA content of both serovars (see Table 4), we find that the RNA components of the signal recognition particle (SRP) and RNaseP, two universally conserved ncRNAs, are required as expected. The SRP is an essential component of the cellular secretion machinery, whereas RnaseP is necessary for the maturation of tRNAs. We also find a number of required known and potential cis-regulatory molecules associated with genes required for growth under laboratory conditions in both serovars. The RFN riboswitch controls ribB, a 3,4-dihydroxy-2-butanone 4-phosphate synthase involved in riboflavin biosynthesis, in response to flavin mononucleotide concentrations (83). Additionally, we are able to assign putative functions to a number of previously uncharacterized required non-coding transcripts through their 5′ association with required genes. SroE, a 90 nt molecule discovered in an early sRNA screen (84), is consistently located at the 5′ end of the required hisS gene across its phylogenic distribution in the Enterobacteriaceae. Given this consistent association and the function of HisS as a histidyl-tRNA synthetase, we hypothesize that this region may act in a manner similar to a T-box leader, inducing or repressing expression in response to tRNA-His levels. The thrU leader sequence, recently discovered in a deep-sequencing screen of E. coli (42), appears to regulate a polycistronic operon of required singleton wobble tRNAs. Three additional required cis-regulatory elements, t44, S15 and StyR-8, are associated with required ribosomal proteins, highlighting the central role ncRNA elements play in regulating fundamental cellular processes.
Table 4.

Candidate required ncRNAs greater than 60 nt in length, excluding rRNA and tRNA

Element nameRfam accessionFunctionHfq-bindingDownstream protein-coding gene(s)Downstream gene requiredReferences
Required or ambiguous in both S. Typhi and S. Typhimurium
    SRPRF00169RNA component of the signal recognition particle(81)
    RNase PRF00010RNA component of RNase PybaZN(82)
    RFNRF00050FMN-sensing riboswitch controlling the ribB generibBY(83)
    SroERF00371Putative cis-regulatory element controlling the hisS genehisSY(84)
    ThrU LeaderNAPutative cis-regulatory element controlling the ThrU tRNA operon(42)
    t44RF00127Cis-regulatory element controlling the ribosomal rpsB generpsBY(85-87)
    S15bRF00114Translational regulator of the ribosomal S15 proteinrpsOY(88)
    StyR-8NAPutative cis-regulatory element controlling the ribosomal rpmB generpmBY(41)
    MicARF00078sRNA involved in cellular response to extracytoplasmic stressYluxSN(29)
    DsrAbRF00014sRNA regulator of H-NSYmngBN(89)
    STnc10NAPutative sRNAnhaAN(40)
    STnc60bNAPutative sRNA(40)
    STnc840NAVerified sRNA derived from 3' UTR of the flgL geneY(90)
    IS0420a,bNAPutative ncRNArmfN(42,91)
    RGO0bNAPutative sRNA identified in E. coli(42)
Required or ambiguous in S. Typhimurium only
    rne5RF00040RNase E autoregulatory 5' elementrneY(92)
    RydCRF00505sRNA regulator of the yejABEF ABC transporterY(93)
    RydBRF00118Putative ncRNA(94)
    STnc510NAPutative sRNApagD/pagCY/N(40)
    STnc460bNAPutative sRNA(40)
    STnc170NAPutative sRNASL1458N(40)
    STnc130NAPutative sRNAdmsAN(40)
    RseXRF01401sRNA regulator of OmpA and OmpCY(95)
    IsrJRF01393sRNA regulator of SPI-1 effector protein secretion(40,58)
    IsrIRF01392Island-encoded Hfq-binding sRNAYSL1028Y(40,58,90)
Required or ambiguous in S. Typhi only
    RybBRF00110sRNA involved in cellular response to extracytoplasmic stressY(29)
    tk5aNAPutative ncRNA(42,96)
    STnc750NAVerified sRNAYSpeBN(32,90)
    StyR-44aRF01830Putative multicopy (2/6 copies required in S. Typhi) ncRNA associated with ribosomal RNA operon23S rRNAN(41)
    tp2NAPutative ncRNAaceEN(42,96)
    RdlDRF01813RdlD RNA anti-toxin, 1/2 copies required in S. Typhi(97)
    STnc120aNAPutative sRNA(40)
    tp28aNAPutative ncRNAfurN(42,96)
    Phe LeaderaRF01859Phenylalanine peptide leader sequence associated with the required PheST operonPheSY(98)
    RimP LeaderRF01770Putative cis-regulator of the rimP-nusA-infB operonrimPY(99)
    GlmYRF00128Trans-acting regulator of the GlmS gene(100)

Known and putative non-coding elements classified as required or ambiguous in this screen. Required ncRNAs have an LLR between required and non-required models of less than −2; see ‘Materials and Methods’ section.

a,bncRNAs that are amibiguous (LLR between −2 and 2) in S. Typhi(a) or in S. Typhimurium(b). Hfq-binding annotations are taken from (90). The downstream protein-coding genes columns report annotated CDS or ribosomal RNA start sites within 100 bases of each candidate required non-coding element on either strand and whether these downstream sequences are also classified as required.

Candidate required ncRNAs greater than 60 nt in length, excluding rRNA and tRNA Known and putative non-coding elements classified as required or ambiguous in this screen. Required ncRNAs have an LLR between required and non-required models of less than −2; see ‘Materials and Methods’ section. a,bncRNAs that are amibiguous (LLR between −2 and 2) in S. Typhi(a) or in S. Typhimurium(b). Hfq-binding annotations are taken from (90). The downstream protein-coding genes columns report annotated CDS or ribosomal RNA start sites within 100 bases of each candidate required non-coding element on either strand and whether these downstream sequences are also classified as required.

The sRNAs required for competitive growth

Inferring functions for potential trans-acting ncRNA molecules, such as anti-sense binding sRNAs, from requirement patterns alone is more difficult than for cis-acting elements, as we cannot rely on adjacent genes to provide any information. It is also important to keep in mind that TraDIS assays requirements after a brief competition within a large library of mutants on permissive media. This may be particularly important when surveying the bacterial sRNAs, which are known to participate in responses to stress (29). This is demonstrated by two sRNAs involved in the σE-mediated extracytoplamic stress response, RybB and RseX, both of which can be successfully knocked out in S. Typhimurium (101). In S. Typhi, rpoE is required, as it also is in E. coli (48,102). However, in S. Typhimurium, rpoE tolerates a heavy insertion load, implying that σE mutants are not disadvantaged in competitive growth. In S. Typhimurium, the sRNA RseX is required. Overexpression of RseX has previously been shown to compensate for σE essentiality in E. coli by degrading ompA and ompC transcripts (95). This suggests that RseX may also be short-circuiting the σE stress response network in S. Typhimurium (Figure 4). To our knowledge, this is the first evidence of a native (i.e. not experimentally induced) activity of RseX. S. Typhi on the other hand requires σE along with its activating proteases RseP and DegS and anchoring protein RseA, as well as the σE-dependent sRNA RybB, which also regulates OmpA and OmpC in S. Typhimurium, along with a host of other OMPs (103). It is unclear why the σE response is required in S. Typhi, but not S. Typhimurium, though it may partially be due to the major differences in the cell wall and outer membrane between the two serovars. In addition, there are significant differences in the OMP content of the S. Typhi and S. Typhimurium membranes that may be driving alternative mechanisms for coping with membrane stress. For instance, S. Typhi completely lacks OmpD, a major component of the S. Typhimurium outer membrane (104) and a known target of RybB (29). Two additional sRNAs involved in stress response are also required by both S. Typhi and S. Typhimurium. The first, MicA, is known to regulate ompA and the lamB porin-coding gene in S. Typhimurium (105), contributing to the extracytoplasmic stress response. The second, DsrA, has been shown to negatively regulate the nucleoid-forming protein H-NS and enhance translation of the stationary-phase alternative sigma factor σS in E. coli (89), though its regulation of σS does not appear to be conserved in S. Typhimurium (106). Both have been previously deleted in S. Typhimurium and thus are not essential. H-NS knockouts have previously been shown to have severe growth defects in S. Typhimurium that can be rescued by compensatory mutations in either the phoPQ two-component system or rpoS, implying that the lack of H-NS is allowing normally silenced detrimental regions to be transcribed (61). As MicA has recently been shown to negatively regulate phoPQ expression in E. coli (107), it is tempting to speculate that MicA may be moderating the effects of DsrA-induced H-NS repression; however, it is currently unclear whether sRNA regulons are sufficiently conserved between E. coli and S. enterica to justify this hypothesis.

CONCLUSION

The extremely high resolution of TraDIS has allowed us to assay gene requirements in two very closely related Salmonellae with different host ranges. We found, under laboratory conditions, that 58 genes present in both serovars were required in only one, suggesting that identical gene products do not necessarily have the same phenotypic effects in the two different serovar backgrounds. Many of these genes occur in genomic regions or metabolic systems, which contain pseudogenes and/or have undergone reorganization since the divergence of S. Typhi and S. Typhimurium, demonstrating the complementarity of TraDIS and phylogenetic analysis. These changes may, in part, explain differences observed in the pathogenicity and host specificity of these two serovars. In particular, S. Typhimurium showed a requirement for cell surface structure biosynthesis genes; this may be partially explained by the fact that S. Typhi expresses the Vi-antigen, which masks the cell surface, though these genes are not required for survival in our assay. S. Typhi on the other hand has a requirement for iron uptake through the fep system, which enables ferric enterobactin transport. This dependence on enterobactin suggests that S. Typhi is highly adapted to the iron-scarce environments it encounters during systemic infections. Furthermore, this appears to represent a single point of failure in the S. Typhi iron utilization pathways and may present an attractive target for narrow-spectrum antibiotics. Of the ∼4500 protein-coding genes present in each serovar, only ∼350 were sufficiently depleted in transposon insertions to be classified as required for growth in rich media. This means that >92% of the coding genome has sufficient insertion density to be queried in future assays. Dense transposon mutagenesis libraries have been used to assay gene requirements under conditions relevant for infection, including S. Typhi survival in bile (35), Mycobacterium tuberculosis catabolism of cholesterol (108), drug resistance in Pseudomonas aeruginosa (109) and Haemophilus influenzae survival in the lung (110). We expect that parallel experiments querying gene requirements under the same conditions in both serovars examined in this study will yield further insights in to the differences in the infective process between Typhi and Typhimurium and ultimately the pathways that underlie host-adaptation. Both serovars possess substantial complements of horizontally acquired DNA. We have been able to use TraDIS to assay these recently acquired sequences. In particular, we have been able to identify, on a chromosome wide scale, active prophage through the requirement for their repressors. The P4 phage uses an RNA-based system to make decisions regarding cell fate, and structurally similar systems are used by P1, P7 and N15 phage (111,112). C4-like transcripts have been regarded as the primary repressor of lytic functions, though the IsrK-like sequence is known to be essential to the establishment of lysogeny in P4 and is transcribed in at least two phage types (60,112). Our observations in S. Typhi suggest an important role for the IsrK-like sequence in maintenance of the lysogenic state in P4-like phage, though the mechanism remains unclear. Recent advances in high-throughput sequencing have greatly enhanced our ability to detect novel transcripts, such as ncRNAs and short open reading frames (sORFs). Our ability to identify these transcripts now far out-strips our ability to experimentally characterize these sequences. There have been previous efforts at high-throughput characterization of bacterial sRNAs and sORFs in enteric bacteria; however, these have relied on labour-intensive directed knockout libraries (47,113). Here, we have demonstrated that TraDIS has sufficient resolution to reliably query genomic regions as short as 60 bases, in agreement with a recent high-throughput transposon mutagenesis study in the alphaproteobacteria Caulobacter crescentus (38). Our method has the major advantage that library construction does not rely on genome annotation, and newly discovered elements can be surveyed with no further laboratory work. We have been able to assign putative functions to a number of ncRNAs using TraDIS though consideration of their genomic and experimental context. In addition, ncRNA characterization generally is done in model organisms like E. coli or S. Typhimurium, and it is unclear how stable ncRNA regulatory networks are over evolutionary time. By assaying two serovars of Salmonella with the same method under the same conditions, we have seen hints that there may be differences in sRNA regulatory networks between S. Typhi and S. Typhimurium. In particular, we have found that under the same experimental conditions, S. Typhi appears to rely on the σE stress response pathway, whereas S. Typhimurium does not; it is tempting to speculate that this difference in stress response is mediated by the observed difference in requirement for two sRNAs, RybB and RseX. We believe that this combination of high-throughput transposon mutagenesis with a careful consideration of the systems context of individual genes provides a powerful tool for the generation of functional hypotheses. We anticipate that the construction of TraDIS libraries in additional organisms, as well as the passing of these libraries through relevant experimental conditions, will provide further insights into the function of bacterial ncRNAs in addition to the protein-coding gene complement.

ACCESSION NUMBERS

ERA000097, ERA000217.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Tables 1–3, Supplementary Figures 1–2 and Supplementary Data Set.

FUNDING

Wellcome Trust [WT076964, WT079643 and WT098051]; Medical Research Council. Funding for open access charge: Wellcome Trust. Conflict of interest statement. None declared.
  112 in total

1.  Characterization of the small antisense CI RNA that regulates bacteriophage P4 immunity.

Authors:  Francesca Forti; Ilaria Dragoni; Federica Briani; Gianni Dehò; Daniela Ghisotti
Journal:  J Mol Biol       Date:  2002-01-25       Impact factor: 5.469

2.  FourU: a novel type of RNA thermometer in Salmonella.

Authors:  Torsten Waldminghaus; Nadja Heidrich; Sabine Brantl; Franz Narberhaus
Journal:  Mol Microbiol       Date:  2007-07       Impact factor: 3.501

3.  Genes encoding putative effector proteins of the type III secretion system of Salmonella pathogenicity island 2 are required for bacterial virulence and proliferation in macrophages.

Authors:  M Hensel; J E Shea; S R Waterman; R Mundy; T Nikolaus; G Banks; A Vazquez-Torres; C Gleeson; F C Fang; D W Holden
Journal:  Mol Microbiol       Date:  1998-10       Impact factor: 3.501

4.  Genome-wide detection of novel regulatory RNAs in E. coli.

Authors:  Rahul Raghavan; Eduardo A Groisman; Howard Ochman
Journal:  Genome Res       Date:  2011-06-10       Impact factor: 9.043

5.  Identification of a pathogenicity island required for Salmonella survival in host cells.

Authors:  H Ochman; F C Soncini; F Solomon; E A Groisman
Journal:  Proc Natl Acad Sci U S A       Date:  1996-07-23       Impact factor: 11.205

6.  Experimental determination and system level analysis of essential genes in Escherichia coli MG1655.

Authors:  S Y Gerdes; M D Scholle; J W Campbell; G Balázsi; E Ravasz; M D Daugherty; A L Somera; N C Kyrpides; I Anderson; M S Gelfand; A Bhattacharya; V Kapatral; M D'Souza; M V Baev; Y Grechkin; F Mseeh; M Y Fonstein; R Overbeek; A-L Barabási; Z N Oltvai; A L Osterman
Journal:  J Bacteriol       Date:  2003-10       Impact factor: 3.490

7.  The essential genome of a bacterium.

Authors:  Beat Christen; Eduardo Abeliuk; John M Collier; Virginia S Kalogeraki; Ben Passarelli; John A Coller; Michael J Fero; Harley H McAdams; Lucy Shapiro
Journal:  Mol Syst Biol       Date:  2011-08-30       Impact factor: 11.429

8.  Identification of candidate structured RNAs in the marine organism 'Candidatus Pelagibacter ubique'.

Authors:  Michelle M Meyer; Tyler D Ames; Daniel P Smith; Zasha Weinberg; Michael S Schwalbach; Stephen J Giovannoni; Ronald R Breaker
Journal:  BMC Genomics       Date:  2009-06-16       Impact factor: 3.969

9.  Multidrug-resistant Salmonella enterica serovar paratyphi A harbors IncHI1 plasmids similar to those found in serovar typhi.

Authors:  Kathryn E Holt; Nicholas R Thomson; John Wain; Minh Duy Phan; Satheesh Nair; Rumina Hasan; Zulfiqar A Bhutta; Michael A Quail; Halina Norbertczak; Danielle Walker; Gordon Dougan; Julian Parkhill
Journal:  J Bacteriol       Date:  2007-03-23       Impact factor: 3.490

10.  Two seemingly homologous noncoding RNAs act hierarchically to activate glmS mRNA translation.

Authors:  Johannes H Urban; Jörg Vogel
Journal:  PLoS Biol       Date:  2008-03-18       Impact factor: 8.029

View more
  58 in total

1.  The essential gene set of a photosynthetic organism.

Authors:  Benjamin E Rubin; Kelly M Wetmore; Morgan N Price; Spencer Diamond; Ryan K Shultzaberger; Laura C Lowe; Genevieve Curtin; Adam P Arkin; Adam Deutschbauer; Susan S Golden
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-27       Impact factor: 11.205

2.  Rapid transposon liquid enrichment sequencing (TnLE-seq) for gene fitness evaluation in underdeveloped bacterial systems.

Authors:  Samuel R Fels; Grant M Zane; Sean M Blake; Judy D Wall
Journal:  Appl Environ Microbiol       Date:  2013-09-27       Impact factor: 4.792

3.  General and condition-specific essential functions of Pseudomonas aeruginosa.

Authors:  Samuel A Lee; Larry A Gallagher; Metawee Thongdee; Benjamin J Staudinger; Soyeon Lippman; Pradeep K Singh; Colin Manoil
Journal:  Proc Natl Acad Sci U S A       Date:  2015-04-06       Impact factor: 11.205

4.  The Siderophore Product Ornibactin Is Required for the Bactericidal Activity of Burkholderia contaminans MS14.

Authors:  Peng Deng; Adam Foxfire; Jianhong Xu; Sonya M Baird; Jiayuan Jia; Keren H Delgado; Ronald Shin; Leif Smith; Shi-En Lu
Journal:  Appl Environ Microbiol       Date:  2017-03-31       Impact factor: 4.792

Review 5.  Molecular phenotyping of infection-associated small non-coding RNAs.

Authors:  Lars Barquist; Alexander J Westermann; Jörg Vogel
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-11-05       Impact factor: 6.237

Review 6.  SorTn-seq: a high-throughput functional genomics approach to discovering regulators of bacterial gene expression.

Authors:  Leah M Smith; Simon A Jackson; Paul P Gardner; Peter C Fineran
Journal:  Nat Protoc       Date:  2021-08-04       Impact factor: 13.491

7.  Pyrimidine biosynthesis in pathogens - Structures and analysis of dihydroorotases from Yersinia pestis and Vibrio cholerae.

Authors:  Joanna Lipowska; Charles Dylan Miks; Keehwan Kwon; Ludmilla Shuvalova; Heping Zheng; Krzysztof Lewiński; David R Cooper; Ivan G Shabalin; Wladek Minor
Journal:  Int J Biol Macromol       Date:  2019-06-15       Impact factor: 6.953

8.  Tryptophan biosynthesis protects mycobacteria from CD4 T-cell-mediated killing.

Authors:  Yanjia J Zhang; Manchi C Reddy; Thomas R Ioerger; Alissa C Rothchild; Veronique Dartois; Brian M Schuster; Andrej Trauner; Deeann Wallis; Stacy Galaviz; Curtis Huttenhower; James C Sacchettini; Samuel M Behar; Eric J Rubin
Journal:  Cell       Date:  2013-12-05       Impact factor: 41.582

9.  Identification of common highly expressed genes of Salmonella Enteritidis by in silico prediction of gene expression and in vitro transcriptomic analysis.

Authors:  Kim Lam R Chiok; Devendra H Shah
Journal:  Poult Sci       Date:  2019-07-01       Impact factor: 3.352

10.  Selection-Driven Gene Inactivation in Salmonella.

Authors:  Joshua L Cherry
Journal:  Genome Biol Evol       Date:  2020-03-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.