| Literature DB >> 21490962 |
Nicola K Petty1, Theresa Feltwell, Derek Pickard, Simon Clare, Ana L Toribio, Maria Fookes, Kevin Roberts, Rita Monson, Satheesh Nair, Robert A Kingsley, Richard Bulgin, Siouxsie Wiles, David Goulding, Thomas Keane, Craig Corton, Nicola Lennard, David Harris, David Willey, Richard Rance, Lu Yu, Jyoti S Choudhary, Carol Churcher, Michael A Quail, Julian Parkhill, Gad Frankel, Gordon Dougan, George P C Salmond, Nicholas R Thomson.
Abstract
Citrobacter rodentium is a natural mouse pathogen that causes attaching and effacing (A/E) lesions. It shares a common virulence strategy with the clinically significant human A/E pathogens enteropathogenic E. coli (EPEC) and enterohaemorrhagic E. coli (EHEC) and is widely used to model this route of pathogenesis. We previously reported the complete genome sequence of C. rodentium ICC168, where we found that the genome displayed many characteristics of a newly evolved pathogen. In this study, through PFGE, sequencing of isolates showing variation, whole genome transcriptome analysis and examination of the mobile genetic elements, we found that, consistent with our previous hypothesis, the genome of C. rodentium is unstable as a result of repeat-mediated, large-scale genome recombination and because of active transposition of mobile genetic elements such as the prophages. We sequenced an additional C. rodentium strain, EX-33, to reveal that the reference strain ICC168 is representative of the species and that most of the inactivating mutations were common to both isolates and likely to have occurred early on in the evolution of this pathogen. We draw parallels with the evolution of other bacterial pathogens and conclude that C. rodentium is a recently evolved pathogen that may have emerged alongside the development of inbred mice as a model for human disease.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21490962 PMCID: PMC3072379 DOI: 10.1371/journal.ppat.1002018
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Gene differences between C. rodentium strains EX-33 and ICC168.
| CDS ID | Product | EX-33 | ICC168 | Comment |
| ROD_02831 | IS | absent | present | IS |
| ROD_02841 | Hypothetical prophage protein (CRP28) | intact | pseudo | Disrupted by IS |
| ROD_05221 | IS | absent | present | IS |
| ROD_05231 | Putative transposase | intact | pseudo | Disrupted by IS |
| ROD_15301 | Hypothetical protein | pseudo | intact | N-terminal truncated by 932 bp deletion in EX-33 |
| ROD_20681 | IS | absent | present | IS |
| ROD_28451 | IS | absent | present | IS |
| ROD_28951 | IS | absent | present | IS |
| ROD_28941 | Putative exported protein | intact | pseudo | Disrupted by IS |
| ROD_29281 | IS | absent | present | IS |
| ROD_29471 | IS | absent | present | IS |
| ROD_29461/81 | Hypothetical protein | intact | pseudo | Disrupted by IS |
| ROD_31511 | IS | absent | present | IS |
| ROD_31521 | Hypothetical protein | intact | pseudo | Disrupted by IS |
| ROD_33401 | IS | absent | present | IS |
| ROD_33391 | Hypothetical protein | intact | pseudo | Disrupted by IS |
| ROD_35171 | IS | absent | present | IS |
| ROD_42661 | ATP-binding protein of dipeptide ABC transporter | pseudo | intact | Truncated by premature stop codon due to SNP in EX-33 |
| ROD_45501 | IS | absent | present | IS |
| ROD_48241 | Hypothetical protein | pseudo | intact | C-terminal truncated by 4392 bp deletion in EX-33 |
| ROD_48251 | Putative membrane protein | absent | present | 4392 bp deletion in EX-33 |
| ROD_48261 | Putative LysR-family transcriptional regulator | absent | present | 4392 bp deletion in EX-33 |
| ROD_48271 | Putative transport protein | pseudo | intact | N-terminal truncated by 4392 bp deletion in EX-33 |
| ROD_50641 | IS | absent | present | IS |
| ROD_50631 | Putative fimbrial usher protein | intact | pseudo | C-terminal truncated by IS |
| ROD_50632 | Putative fimbrial protein | present | absent | Deletion in ICC168 due to IS |
| ROD_50633 | Putative fimbrial protein | present | absent | Deletion in ICC168 due to IS |
| ROD_50651 | Putative fimbrial adhesin | intact | pseudo | N-terminal truncated by IS |
| ROD_p2_471 | IS | absent | present | IS |
| ROD_p2_461 | Putative conjugal transfer protein TriD | intact | pseudo | Disrupted by IS |
| ROD_p4_51 | Hypothetical protein | pseudo | intact | Truncated by frameshift mutation due to SNP in EX-33 |
*Names used are those in the ICC168 genome.
Figure 1Comparison of the genome of C. rodentium with related bacteria.
Genome comparison of the DNA sequences of C. rodentium ICC168 (middle) with E. coli K12 MG1655 (U00096, top) and Salmonella Typhimurium LT2 (AE006468, bottom). Grey shading between two genomes indicates regions of nucleotide similarity (BLASTN matches with a minimum length of 1000 bp) between sequences on the same strand, green shading highlights where the matching sequences are inverted with respect to each other (percentage identity is indicated). The locations of IS elements in the C. rodentium genome are shown as blue bars. An orange pointer indicates the origin of replication (oriC) in the E. coli genome. The GC Deviation (G−C)/(G+C) plot is shown above each genome (window size 1000 bp) with the switch in GC deviation in C. rodentium highlighted by red shading. The scale bar indicates genome length. This figure was produced using Easyfig [67].
Classification and location of IS elements in C. rodentium.
| Total number | Genomic location | ||||
| IS element | (of which are remnants) | Chromosomal backbone | Genomic Island | Prophage | Plasmid |
| IS | 13 | 9 | 2 | 1 | 1 |
| IS | 1 (1) | 1 | |||
| IS | 2 (2) | 1 | 1 | ||
| IS | 3 (2) | 2 | 1 | ||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 3 (3) | 1 | 2 | ||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 2 (2) | 2 | |||
| IS | 2 (2) | 2 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 1 | 1 | |||
| IS | 1 (1) | 1 | |||
| IS | 24 (2) | 14 | 7 | 2 | 1 |
| IS | 3 | 3 | |||
| IS | 14 (3) | 9 | 3 | 2 | |
| IS | 13 | 13 | |||
| IS | 5 | 5 | |||
| IS | 6 (4) | 6 | |||
| IS | 6 | 2 | 3 | 1 | |
| IS | 1 (1) | 1 | |||
| Unclassified IS | 2 (1) | 1 | 1 | ||
|
|
|
|
|
|
|
In addition to IS elements, there are 12 unclassified transposases (five are located on the chromosome and seven on GIs), nine of which are remnants.
Unique to ICC168, not in EX-33 genome.
The number of remnants was previously reported incorrectly [20]. The correct number of IS element remnants is 34 (32 on the chromosome and 2 on plasmid pCROD1).
Bacterial strains used in this study.
| Strain | Description | Comment | Reference |
|
| |||
| EX-33 | Previously | pCROD1+ |
|
| ICC168 | Previously | pCROD1+ |
|
| ICC169 | Spontaneous NalR derivative of ICC168, original stock | PFGE profile same as for ICC168, pCROD1+ |
|
| ICC169-335 | Dougan laboratory isolate, previously ICC169 | PFGE profile same as for ICC168, pCROD1− | This study |
| ICC169-407 | Dougan laboratory isolate, previously ICC169 | Differences in PFGE profile compared to ICC168, pCROD1+ | This study |
| ICC169-474 | Frankel laboratory isolate, previously ICC169 | PFGE profile same as for ICC168, pCROD1+/− | This study |
| ICC169-476 | Salmond laboratory isolate, previously ICC169 | PFGE profile same as for ICC168, pCROD1− | This study |
| ICC169-496 | Dougan laboratory isolate, previously ICC169 | Differences in PFGE profile compared to ICC168, pCROD1− | This study |
| ICC169c3 | Isolated from ICC169 infected mouse ‘c’ faeces 3 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169c13 | Isolated from ICC169 infected mouse ‘c’ faeces 13 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169c14cae | Isolated from ICC169 infected mouse ‘c’ caecum 14 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169c14col | Isolated from ICC169 infected mouse ‘c’ colon 14 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169a3 | Isolated from ICC169 infected mouse ‘a’ faeces 3 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169a15 | Isolated from ICC169 infected mouse ‘a’ faeces 15 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-407b15 | Isolated from ICC169-407 infected mouse ‘b’ faeces 15 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-496c3 | Isolated from ICC169-496 infected mouse ‘c’ faeces 3 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-496c6 | Isolated from ICC169-496 infected mouse ‘c’ faeces 6 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-496c10 | Isolated from ICC169-496 infected mouse ‘c’ faeces 10 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-496c13 | Isolated from ICC169-496 infected mouse ‘c’ faeces 13 days post infection | PFGE profile same as for ICC168 | This study |
| ICC169-496c15 | Isolated from ICC169-496 infected mouse ‘c’ faeces 15 days post infection | PFGE profile same as for ICC168 | This study |
| ICC180 | ICC168 derivative | pCROD1− |
|
| ICC180-P10 | Isolated from ICC180 infected mouse faeces after passage through 10 mice | Differences in PFGE profile compared to ICC180, pCROD1− | This study |
|
| |||
| MG1655 | K-12 wild type, non-pathogenic |
| |
| ER2507 | K-12 derivative, F−,
| New England BioLabs | |
| ER2507 NPL | ΦNP lysogen | This study |
Figure 2PFGE profile of different C. rodentium isolates.
PFGE generated after XbaI cleavage of genomic DNA isolated from different strains and isolates of C. rodentium. Strain ICC168 shows the same PFGE pattern as for ICC169, ICC169-474, ICC169-335 and ICC169-476. Two isolates displayed significant differences in their PFGE profiles (indicated by red arrows); ICC169-407 has a band missing at approximately 340 kb and additional bands of approximately 280 kb and 420 kb; ICC169-496 is also missing the 340 kb band and has two extra bands between 145 and 200 kb. Markers are from New England BioLabs. Band sizes are indicated in kb.
Figure 3Chromosomal rearrangements identified in C. rodentium isolates ICC169-407 and ICC169-496.
Deviations from (A) the sequence of the wild-type strain ICC168, are depicted. Rearrangements are highlighted in red and show: (B) genomic inversion between two copies of ISCro4; (C) genomic inversion between two identical sequences encoding Elongation factor Tu; (D) genomic inversion between identical repeat sequences encoding the T3SS effector NleD and an adjacent transposase; (E) translocation of sequence from one location to another due to recombination between identical sequences in two Rhs elements. The GC deviation (G−C)/(G+C) plot for each genome orientation is shown above each sequence.
Figure 4Genetic organisation of the C. rodentium prophages showing transcriptionally active genes.
The genomes of each of the five intact prophages in the C. rodentium genome are shown aligned with mapped sequence reads for the whole genome transcriptome. The prophage remnant CRPr20 is also included due to its high similarity to CRP38 and the difficulty in mapping repetitive sequences. The RNA-seq data are represented as a plot showing the depth of sequences mapped to the forward strand (blue) and reverse strand (red) above each genome (window size = 200 bp). The majority of prophage genes, including those predicted to encode phage structural and lysis genes (see key), are expressed. Putative cargo genes can be identified by their relatively high levels of expression (numbered CDSs; see Table 4 for details). The scale bar indicates genome length. This figure was produced using Easyfig [67] and Artemis [68].
Highly expressed putative phage cargo genes.
| Prophage | Number | CDS ID | Transcript RPKM Value | Product |
| CRP28 | 1 | ROD_02501 | 43 | hypothetical protein |
| 2 | ROD_02551 | 51 | hypothetical protein | |
| 3 | ROD_02681 | 37 | putative membrane protein | |
| 4 | ROD_02733 | 53 | putative membrane protein | |
| 5 | ROD_02751 | 39 | putative exported protein | |
| 6 | ROD_02761 | 37 | putative exported protein | |
| CRP99 | 7 | ROD_09341 | 72 | putative lipoprotein |
| CRPr20 | 8 | ROD_19801 | 840 | putative membrane protein |
| 9 | ROD_19811 | 642 | putative host toxic membrane protein | |
| 10 | ROD_20041 | 257 | putative membrane protein | |
| ΦNP | 11 | ROD_25751 | 605 | putative damage-inducible protein |
| 12 | ROD_25752 | 512 | hypothetical protein | |
| 13 | ROD_25761 | 400 | hypothetical protein | |
| 14 | ROD_26221 | 1415 | hypothetical protein | |
| 15 | ROD_26231 | 596 | hypothetical protein | |
| 16 | ROD_26241 | 213 | hypothetical protein | |
| CRP38 | 17 | ROD_36461 | 299 | putative exported protein |
| 18 | ROD_36471 | 129 | putative membrane protein | |
| 19 | ROD_36481 | 226 | putative host toxic membrane protein | |
| 20 | ROD_36801 | 262 | putative membrane protein | |
| 21 | ROD_36851 | 121 | hypothetical protein | |
| CRP49 | 22 | ROD_47072 | 422 | hypothetical protein |
| 23 | ROD_47431 | 46 | hypothetical protein |
Putative cargo gene number used in Figure 4.
Prophage-encoded genes that have RPKM values >2x the average for that phage (average RPKM values for each prophage are: CRP28 = 18, CRP99 = 11, CRPr20 = 78, ΦNP = 101, CRP38 = 38, CRP49 = 21). Genes predicted to be involved in the phage lytic or lysogenic cycle were excluded from this list.
The prophage remnant CRPr20 is included in this list due to its high similarity to CRP38 and the difficulty in mapping repetitive sequences. The effector cargo genes encoded on CRPr13, CRPr17 and CRPr33 were described previously [20].
Figure 5Electron micrographs of ΦNP negatively stained with phosphotungstic acid.
All virions can be seen with contracted tails. Bars, 100 nm (top panels), 200 nm (bottom panel).
Figure 6Mouse shedding of C. rodentium.
Bacterial shedding in mouse faeces was monitored over the course of infection from individual mice. The mean count and standard deviation from groups of five mice infected with different C. rodentium isolates (see key) are shown.
Figure 7PFGE of XbaI-digested C. rodentium genomic DNA from isolates recovered before and after mouse inoculation.
(A) PFGE profile of isolate ICC169-496 pre mouse inoculation (left) and isolate ICC169-496c10 recovered from mouse faeces 10 days post inoculation. (B) PFGE profile of isolate ICC169-407b15 recovered from mouse faeces 15 days post inoculation (left) compared to that of the infecting isolate ICC169-407 pre mouse inoculation (right). (C) PFGE profile of the pre mouse inoculation strain ICC180 compared to that for ICC180-P10 recovered from faeces after 10 passages though the mouse. Additional/missing bands are indicated by red arrows.