| Literature DB >> 23470992 |
Lars Barquist1, Gemma C Langridge, Daniel J Turner, Minh-Duy Phan, A Keith Turner, Alex Bateman, Julian Parkhill, John Wain, Paul P Gardner.
Abstract
Salmonella Typhi and Typhimurium diverged only ∼50 000 years ago, yet have very different host ranges and pathogenicity. Despite the availability of multiple whole-genome sequences, the genetic differences that have driven these changes in phenotype are only beginning to be understood. In this study, we use transposon-directed insertion-site sequencing to probe differences in gene requirements for competitive growth in rich media between these two closely related serovars. We identify a conserved core of 281 genes that are required for growth in both serovars, 228 of which are essential in Escherichia coli. We are able to identify active prophage elements through the requirement for their repressors. We also find distinct differences in requirements for genes involved in cell surface structure biogenesis and iron utilization. Finally, we demonstrate that transposon-directed insertion-site sequencing is not only applicable to the protein-coding content of the cell but also has sufficient resolution to generate hypotheses regarding the functions of non-coding RNAs (ncRNAs) as well. We are able to assign probable functions to a number of cis-regulatory ncRNA elements, as well as to infer likely differences in trans-acting ncRNA regulatory networks.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23470992 PMCID: PMC3632133 DOI: 10.1093/nar/gkt148
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Genome-wide transposon mutagenesis of S. Typhimurium. Circular plot showing gene content, distribution of required genes and insertion density along the S. Typhimurium chromosome. The outer scale is marked in megabases. Circular tracks range from 1 (outer track) to 6 (inner track). Track 1, all forward-strand genes (colour-coded according to function: dark blue, pathogenicity/adaptation; black, energy metabolism; red, information transfer; dark green, membranes/surface structures; cyan, degradation of macromolecules; purple, degradation of small molecules; yellow, central/intermediary metabolism; light blue, regulators; pink, phage/IS elements; orange, conserved hypothetical; pale green, unknown function; brown, pseudogenes.); track 2, all reverse-strand genes (colour-coded as for forward-strand genes); track 3, S. Typhimurium required genes (red); track 4, 56 genes required by S. Typhimurium, but not by S. Typhi (dark blue, see also Table 1); track 5, transposon insertion density; track 6, GC bias [(G − C)/(G + C)]), khaki indicates values >1; purple <1.
Core gene functions in S. Typhimurium
| Biological process | Sub-process | Required genes | Non-required genes |
|---|---|---|---|
| Cell division | |||
| DNA replication | Polymerases I, II and III | ||
| Supercoiling | |||
| Primosome-associated | |||
| Transcription | RNA polymerase | ||
| Sigma, elongation, anti- and termination factors | |||
| Translation | tRNA-synthetases | ||
| Ribosome components | |||
| Initiation, elongation and peptide chain release factors | |||
| Biosynthetic pathways | |||
| Peptidoglycan | |||
| Fatty acids | – |
Protein-coding genes providing fundamental biological functions in S. Typhimurium. Genes in bold are required in S. Typhi (LLR between required and non-required models less than −2; see ‘Materials and Methods’ section). Asterisk indicates genes ambiguous in S. Typhimurium, having a LLR between −2 and 2.
Figure 2.Comparison of required genes. Venn diagrams showing (A) the overlap of all genes (outer circles, light colours) and required genes (inner circles, dark colours) between S. Typhimurium and S. Typhi (excluding genes required in one serovar only, which do not have significantly different read-counts). Black numbers refer to all genes, white numbers to required genes. (B) the overlap of all required genes between S. Typhimurium (blue), S. Typhi (green) and E. coli (purple). White numbers refer to genes with Keio essentiality scores ≥0.5.
Genes uniquely required in S. Typhimurium
| Ty inserts | Ty reads | SL inserts | SL reads | SL ID | SL gene length | Ty ID | Ty gene length | Name | Function |
|---|---|---|---|---|---|---|---|---|---|
| No orthologue in | |||||||||
| 18 | 123 | SL0742 | 1269 | putative cation transporter | |||||
| 9 | 80 | SL0830 | 516 | conserved hypothetical protein | |||||
| 4 | 21 | SL0831 | 855 | putative electron transfer flavoprotein (beta subunit) | |||||
| 0 | 0 | SL0950 | 323 | predicted bacteriophage protein, potential phage repressor Gifsy-2 | |||||
| 11 | 75 | SL1179 | 789 | envF | lipoprotein | ||||
| 3 | 18 | SL1480 | 249 | antitoxin Phd_YefM, type II toxin-antitoxin system | |||||
| 4 | 32 | SL1527 | 264 | ydcX | putative inner membrane protein | ||||
| 1 | 3 | SL1560 | 717 | putative membrane protein | |||||
| 7 | 50 | SL1601 | 859 | putative transcriptional regulator (pseudogene) | |||||
| 4 | 36 | SL1799 | 201 | bacteriophage encoded pagK (phoPQ-activated protein) | |||||
| 5 | 22 | SL1830A | 434 | conserved hypothetical protein (pseudogene) | |||||
| 3 | 27 | SL1967 | 677 | predicted bacteriophage protein, potential phage repressor SLP203 | |||||
| 1 | 15 | SL2045A | 63 | yoeI | short ORF | ||||
| 17 | 107 | SL2066 | 900 | rfbJ | CDP-abequose synthase | ||||
| 3 | 34 | SL2549 | 209 | endodeoxyribonuclease | |||||
| 4 | 149 | SL2593 | 449 | putative DNA-binding protein, potential phage repressor Gifsy-1 SLP272 | |||||
| 3 | 7 | SL2633 | 846 | putative repressor protein, phage SLP281 | |||||
| 21 | SL2695 | 978 | smf | putative competence protein | |||||
| 5 | 39 | SL4132 | 291 | hypothetical protein | |||||
| 5 | 45 | SL4354A | 303 | conserved hypothetical protein | |||||
| Present in | |||||||||
| 36 | 474 | 5 | 26 | SL0032 | 441 | t0033 | 306 | putative transcriptional regulator | |
| 71 | 349 | 11 | 48 | SL0623 | 642 | t2232 | 576 | lipB | lipoate-protein ligase B |
| 151 | 3546 | 10 | 64 | SL0702 | 897 | t2156 | 894 | putative glycosyl transferase | |
| 194 | 3007 | 9 | 61 | SL0703 | 1134 | t2155 | 1134 | galactosyltransferase | |
| 231 | 3499 | 15 | 67 | SL0706 | 1779 | t2152 | 1780 | putative glycosyltransferase, cell wall biogenesis | |
| 84 | 1041 | 2 | 4 | SL0707 | 834 | t2151 | 834 | putative glycosyltransferase, cell wall biogenesis | |
| 49 | 367 | 14 | 70 | SL0722 | 1569 | t2136 | 1569 | cydA | cytochrome d ubiquinol oxidase subunit I |
| 74 | 1613 | 5 | 22 | SL1069 | 693 | t1789 | 693 | putative secreted protein | |
| 20 | 199 | 1 | 1 | SL1203 | 150 | t1146 | 156 | hypothetical protein | |
| 20 | 290 | 1 | 5 | SL1264 | 315 | t1209 | 315 | putative membrane protein | |
| 84 | 384 | 6 | 26 | SL1327 | 402 | t1261 | 384 | spiC | putative pathogenicity island 2 secreted effector protein |
| 66 | 769 | 5 | 35 | SL1331 | 270 | t1265 | 327 | sseA | T3SS chaperone |
| 36 | 307 | 2 | 5 | SL1341 | 228 | t1275 | 228 | ssaH | putative pathogenicity island protein |
| 47 | 407 | 1 | 3 | SL1342 | 249 | t1276 | 249 | ssaI | putative pathogenicity island protein |
| 144 | 3197 | 5 | 14 | SL1343 | 750 | t1277 | 750 | ssaJ | putative pathogenicity island lipoprotein |
| 63 | 847 | 5 | 26 | SL1354 | 267 | t1288 | 267 | ssaS | putative type III secretion protein |
| 73 | 762 | 4 | 44 | SL1355 | 780 | t1289 | 780 | ssaT | putative type III secretion protein |
| 30 | 226 | 12 | 48 | SL1386 | 693 | t1322 | 693 | rnfE/ydgQ | Electron transport complex protein rnfE |
| 265 | 3337 | 29 | 165 | SL1473 | 1557 | t1463 | 1557 | pqaA | PhoPQ-activated protein |
| 85 | 765 | 6 | 35 | SL1532 | 951 | t1511 | 951 | sifB | putative virulence effector protein |
| 22 | 156 | 16 | 174 | SL1561 | 1227 | t1534 | 141 | sseJ | Salmonella translocated effector protein (SseJ) |
| 119 | 1639 | 10 | 44 | SL1563 | 762 | t1536 | 762 | putative periplasmic amino acid-binding protein | |
| 107 | 2440 | 5 | 44 | SL1564 | 648 | t1537 | 648 | putative ABC amino acid transporter permease | |
| 183 | 1646 | 20 | 118 | SL1628 | 1355 | t1612 | 1364 | hypothetical protein | |
| 23 | 177 | 1 | 5 | SL1659 | 183 | t1640 | 183 | yciG | conserved hypothetical protein |
| 78 | 617 | 16 | 104 | SL1684 | 1014 | t1664 | 1014 | Hnr | putative regulatory protein |
| 37 | 277 | 4 | 25 | SL1785 | 396 | t1022 | 396 | conserved hypothetical protein | |
| 166 | 2823 | 9 | 27 | SL1793 | 915 | t1016 | 915 | pagO | inner membrane protein |
| 28 | 311 | 3 | 22 | SL1794 | 159 | t1015 | 159 | putative inner membrane protein | |
| 23 | 155 | 1 | 4 | SL1823 | 972 | t0988 | 972 | msbB | lipid A acyltransferase |
| 60 | 402 | 11 | 58 | SL2064 | 1002 | t0786 | 1002 | rfbV | putative glycosyl transferase |
| 87 | 524 | 7 | 59 | SL2065 | 1293 | t0785 | 1299 | rfbX | putative O-antigen transporter |
| 66 | 559 | 13 | 74 | SL2069 | 774 | t0780 | 774 | rfbF | glucose-1-phosphate cytidylyltransferase |
| 41 | 204 | 5 | 14 | SL3828 | 1830 | t3658 | 1830 | glmS | glucosamine-fructose-6-phosphate aminotransferase |
| 27 | 288 | 5 | 23 | SL4250 | 288 | t4220 | 288 | putative GerE family regulatory protein | |
| 148 | 2633 | 16 | 89 | SL4251 | 876 | t4221 | 876 | araC family regulatory protein | |
Genes determined to be uniquely required in S. Typhimurium. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene. Shaded rows indicate genes shown to be H-NS repressed in (61).
*P-value (associated with log2 read ratio) < 0.05.
asseJ is a pseudogene in S. Typhi.
Genes uniquely required in S. Typhi
| SL inserts | SL reads | Ty inserts | Ty reads | Ty ID | Ty gene length | SL ID | SL gene length | Name | Function |
|---|---|---|---|---|---|---|---|---|---|
| No orthologue in | |||||||||
| 1 | 5 | t1332 | 132 | malY | pseudogene | ||||
| 2 | 32 | t1920 | 405 | possible repressor protein, prophage 10/Gifsy-2 | |||||
| 2 | 12 | t3157 | 165 | conserved hypothetical protein | |||||
| 2 | 12 | t3166 | 228 | spurious ORF annotation overlapping the RnaseP/M1 RNA | |||||
| 6 | 196 | t3402 | 570 | cI | repressor protein, cs 73 prophage | ||||
| 4 | 58 | t3415 | 741 | HIRAN-domain family gene, potential DNA repair | |||||
| 1 | 6 | t4531 | 150 | hypothetical secreted protein | |||||
| Present in | |||||||||
| 199 | 1792 | 18 | 59 | t0095 | 1287 | SL0093 | 1287 | surA | survival protein SurA precursor |
| 45 | 498 | 3 | 22 | t0123 | 459 | SL0119 | 459 | yabB/mraZ | conserved hypothetical protein |
| 120 | 589 | 11 | 32 | t0203 | 1281 | SL0203 | 1281 | hemL | glutamate-1-semialdehyde 2,1-aminomutase |
| 123 | 982 | 2 | 25 | t0224 | 1353 | SL0224 | 1353 | yaeL/rseP | Zinc metallopeptidase |
| 67 | 452 | 1 | 14 | t0270 | 576 | SL2604 | 576 | rpoE | RNA polymerase sigma-E factor |
| 140 | 760 | 0 | 0 | t0587 | 2286 | SL2246 | 2286 | nrdA | ribonucleoside-diphosphate reductase 1 alpha chain |
| 113 | 641 | 15 | 42 | t2140 | 2802 | SL0718 | 2802 | sucA | 2-oxoglutarate dehydrogenase E1 component |
| 116 | 753 | 13 | 36 | t2177 | 1641 | SL0680 | 1641 | pgm | phosphoglucomutase |
| 80 | 542 | 9 | 15 | t2276 | 1008 | SL0580 | 1008 | fepD | ferric enterobactin transport protein FepD |
| 93 | 591 | 2 | 2 | t2277 | 990 | SL0579 | 990 | fepG | ferric enterobactin transport protein FepG |
| 64 | 508 | 5 | 6 | t2278 | 795 | SL0578 | 795 | fepC | ferric enterobactin transport ATP-binding protein FepC |
| 201 | 1129 | 12 | 116 | t2410 | 2355 | SL0444 | 2355 | lon | Lon protease |
| 95 | 518 | 8 | 20 | t2730 | 1062 | SL2809 | 1062 | recA | recA protein |
| 135 | 719 | 16 | 39 | t2996 | 1992 | SL3052 | 1947 | tktA | transketolase |
| 76 | 358 | 3 | 9 | t3120 | 1434 | SL3173 | 1434 | rfaE | ADP-heptose synthase |
| 213 | 1976 | 6 | 50 | t3265 | 1071 | SL3321 | 1071 | degS | serine protease |
| 43 | 448 | 3 | 10 | t3326 | 606 | SL3925 | 606 | yigP | conserved hypothetical protein |
| 124 | 571 | 17 | 36 | t3384 | 2025 | SL3872 | 2025 | rep | ATP-dependent DNA helicase |
| 175 | 1208 | 6 | 21 | t3621 | 2787 | SL3947 | 2787 | polA | DNA polymerase I |
| 117 | 797 | 9 | 13 | t3808 | 1047 | SL3677 | 1047 | waaF | ADP-heptose-LPS heptosyltransferase II |
| 176 | 1628 | 14 | 59 | t4153 | 1080 | SL4183 | 1080 | alr | alanine racemase |
| 140 | 1127 | 10 | 38 | t4411 | 951 | SL4294 | 951 | miaA | tRNA delta-2-isopentenylpyrophosphate transferase |
Genes determined to be uniquely required in S. Typhi. SL, S. Typhimurium; Ty, S. Typhi; inserts refer to the number of unique insertion sites within a gene; reads refer to the number of sequence reads over all insertions sites within a gene.
*P-value (associated with log2 read ratio) < 0.05.
aThe assignment of recA as a required gene has been described previously (35), but briefly is believed to be due to the presence of the priC pseudogene in Typhi.
Figure 3.Comparison of cell surface operon structure and requirements. Diagram illustrating cell surface operons with different requirement patterns in S. Typhimurium and S. Typhi. The top figure is of an uncharacterized operon putatively involved in cell wall biogenesis, whereas the bottom figure shows a portion of the rfb operon involved in O-antigen biosynthesis. Plots along the top and bottom of each figure show insertions in S. Typhimurium and S. Typhi, respectively, with read depth on the y-axis with a maximum cut-off of 100 reads. Genes in blue are required in S. Typhimurium, genes in white are pseudogenes; others are in grey. Grey rectangles represent BLAST hits between orthologous genes, with percentage nucleotide identity coloured on the scale to the right of each figure.
Figure 4.Proposed differences in sRNA utilization. Diagram illustrating inferred required sRNA regulatory networks under TraDIS. Molecules required in S. Typhi are in black and in S. Typhimurium are in white. RseA, in black/grey check, is ambiguous in S. Typhi. Non-required molecules are in grey. Diamonds indicate sRNAs, circles regulatory proteins, ovals proteases, oblong shapes are membrane-anchored proteins, and rounded squares are outer membrane porins.
Candidate required ncRNAs greater than 60 nt in length, excluding rRNA and tRNA
| Element name | Rfam accession | Function | Hfq-binding | Downstream protein-coding gene(s) | Downstream gene required | References |
|---|---|---|---|---|---|---|
| Required or ambiguous in both | ||||||
| SRP | RF00169 | RNA component of the signal recognition particle | ( | |||
| RNase P | RF00010 | RNA component of RNase P | ybaZ | N | ( | |
| RFN | RF00050 | FMN-sensing riboswitch controlling the ribB gene | ribB | Y | ( | |
| SroE | RF00371 | Putative cis-regulatory element controlling the hisS gene | hisS | Y | ( | |
| ThrU Leader | NA | Putative cis-regulatory element controlling the ThrU tRNA operon | ( | |||
| t44 | RF00127 | Cis-regulatory element controlling the ribosomal rpsB gene | rpsB | Y | ( | |
| S15b | RF00114 | Translational regulator of the ribosomal S15 protein | rpsO | Y | ( | |
| StyR-8 | NA | Putative cis-regulatory element controlling the ribosomal rpmB gene | rpmB | Y | ( | |
| MicA | RF00078 | sRNA involved in cellular response to extracytoplasmic stress | Y | luxS | N | ( |
| DsrAb | RF00014 | sRNA regulator of H-NS | Y | mngB | N | ( |
| STnc10 | NA | Putative sRNA | nhaA | N | ( | |
| STnc60b | NA | Putative sRNA | ( | |||
| STnc840 | NA | Verified sRNA derived from 3' UTR of the flgL gene | Y | ( | ||
| IS0420 | NA | Putative ncRNA | rmf | N | ( | |
| RGO0b | NA | Putative sRNA identified in E. coli | ( | |||
| Required or ambiguous in | ||||||
| rne5 | RF00040 | RNase E autoregulatory 5' element | rne | Y | ( | |
| RydC | RF00505 | sRNA regulator of the yejABEF ABC transporter | Y | ( | ||
| RydB | RF00118 | Putative ncRNA | ( | |||
| STnc510 | NA | Putative sRNA | pagD/pagC | Y/N | ( | |
| STnc460b | NA | Putative sRNA | ( | |||
| STnc170 | NA | Putative sRNA | SL1458 | N | ( | |
| STnc130 | NA | Putative sRNA | dmsA | N | ( | |
| RseX | RF01401 | sRNA regulator of OmpA and OmpC | Y | ( | ||
| IsrJ | RF01393 | sRNA regulator of SPI-1 effector protein secretion | ( | |||
| IsrI | RF01392 | Island-encoded Hfq-binding sRNA | Y | SL1028 | Y | ( |
| Required or ambiguous in | ||||||
| RybB | RF00110 | sRNA involved in cellular response to extracytoplasmic stress | Y | ( | ||
| tk5a | NA | Putative ncRNA | ( | |||
| STnc750 | NA | Verified sRNA | Y | SpeB | N | ( |
| StyR-44a | RF01830 | Putative multicopy (2/6 copies required in | 23S rRNA | N | ( | |
| tp2 | NA | Putative ncRNA | aceE | N | ( | |
| RdlD | RF01813 | RdlD RNA anti-toxin, 1/2 copies required in | ( | |||
| STnc120a | NA | Putative sRNA | ( | |||
| tp28a | NA | Putative ncRNA | fur | N | ( | |
| Phe Leadera | RF01859 | Phenylalanine peptide leader sequence associated with the required PheST operon | PheS | Y | ( | |
| RimP Leader | RF01770 | Putative cis-regulator of the rimP-nusA-infB operon | rimP | Y | ( | |
| GlmY | RF00128 | Trans-acting regulator of the GlmS gene | ( | |||
Known and putative non-coding elements classified as required or ambiguous in this screen. Required ncRNAs have an LLR between required and non-required models of less than −2; see ‘Materials and Methods’ section.
a,bncRNAs that are amibiguous (LLR between −2 and 2) in S. Typhi(a) or in S. Typhimurium(b). Hfq-binding annotations are taken from (90). The downstream protein-coding genes columns report annotated CDS or ribosomal RNA start sites within 100 bases of each candidate required non-coding element on either strand and whether these downstream sequences are also classified as required.