| Literature DB >> 30323193 |
Federica Palma1, Gerardo Manfreda2, Mickael Silva3, Antonio Parisi4, Dillon O R Barker5, Eduardo N Taboada5, Frédérique Pasquali2, Mirko Rossi6.
Abstract
Salmonella enterica ser. Typhimurium monophasic variant 4,[5],12:i:- has been associated with food-borne epidemics worldwide and swine appeared to be the main reservoir in most of the countries of isolation. However, the monomorphic nature of this serovar has, so far, hindered identification of the source due to expansion of clonal lineages in multiple hosts and food producing systems. Since geographically structured genetic signals can shape bacterial populations, identification of biogeographical markers in S. 1,4,[5],12:i:- genomes can contribute to improving source attribution. In this study, the phylogeographical structure of 148 geographically and temporally related Italian S. 1,4,[5],12:i:- has been investigated. The Italian isolates belong to a large population of clonal S. Typhimurium/1,4,[5],12:i:- isolates collected worldwide in two decades showing up to 2.5% of allele differences. Phylogenetic reconstruction revealed that isolates from the same geographical origin form highly supported monophyletic groups, suggesting discrete geographical segregation. These monophyletic groups are characterized by the gene content of a large sopE-containing prophage. Within this prophage, genome-wide comparison identified several genes overrepresented in strains of Italian origin. This suggests that certain lineages may be characterized by the acquisition of specific accessory genetic markers useful for improving identification of the source in ongoing epidemics.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30323193 PMCID: PMC6189080 DOI: 10.1038/s41598-018-33266-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Accessory genes overrepresented in Italian strains rated by Benjiamini Hochberg (BH) P-value.
| Roary gene name | Prokka annotation | BH P-value | Italian isolates pos* | Italian isolates neg* | Non-Italian isolates pos** | Non-Italian isolates neg** | Gene details*** |
|---|---|---|---|---|---|---|---|
| group_3215 | Hypothetical protein | 1,94E-48 | 70 | 108 | 15 | 1133 | Phage BRO family/N-terminal domain protein |
| group_5738 | Hypothetical protein | 2,97E-47 | 70 | 108 | 17 | 1131 | Phage lambda NC_001416: cell lysis protein/endopeptidase |
| group_7354 | Hypothetical protein | 3,54E-47 | 69 | 109 | 16 | 1132 | Phege Clostr CDMH1 NC_024144: putative signalling/NTPase protein |
| group_7352 | Hypothetical protein | 3,54E-47 | 69 | 109 | 16 | 1132 | Phage hypotetical protein |
| group_5725 | Hypothetical protein | 4,43E-47 | 70 | 108 | 18 | 1130 | Phage Erwini phiEt88_NC_015295: DNA N-6-adenine-methyltransferase |
| group_2349 | Hypothetical protein | 4,43E-47 | 70 | 108 | 18 | 1130 | Phage Entero SfI NC_027339: Ren protein |
| group_5737 | Hypothetical protein | 4,43E-47 | 70 | 108 | 18 | 1130 | Phage Entero lambda NC_001416: Bor protein precursor |
| group_2253 | Hypothetical protein | 4,43E-47 | 70 | 108 | 18 | 1130 | Phage Entero c_1 NC_019706: lysozyme |
| group_7353 | Hypothetical protein | 8,17E-47 | 69 | 109 | 17 | 1131 | Phage Gifsy_1 NC_010392: bacteriophage antiterminator protein Q |
| group_7359 | Hypothetical protein | 3,28E-46 | 69 | 109 | 18 | 1130 | Phage Entero 933 W NC_000924: hypothetical protein |
| group_3054 | Hypothetical protein | 4,28E-46 | 72 | 106 | 23 | 1125 | Phage Shigel SfII NC_021857: hypothetical protein |
| group_4040 | Hypothetical protein | 1,98E-45 | 71 | 107 | 23 | 1125 | Phage hypothetical protein |
| rusA_2 | Crossover junction endodeoxyribonuclease | 3,80E-45 | 70 | 108 | 22 | 1126 | Phage Entero mEp237 NC_019704: Holliday junction resolvase RusA |
| group_7356 | Hypothetical protein | 5,10E-45 | 63 | 115 | 12 | 1136 | Outer membrane protein assembly factor BamE |
| group_4041 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Entero BP 4795_NC_004813: hypothetical protein |
| group_3216 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Entero SfI NC_027339: replication protein P |
| group_3214 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage hypothetical protein |
| group_3213 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage hypothetical protein |
| group_1275 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Salmon SEN34 NC_028699: replication protein O |
| group_4491 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Rha protein |
| group_4493 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Stx2 II NC_004914: hypothetical protein |
| group_7358 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Entero 933 W NC_000924: host-nuclease inhibitor protein Gam |
| group_7351 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Salmon ST64T NC_004348: holin protein |
| group_932 | Hypothetical protein | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Entero phi80 NC_021190: CII decision making protein |
| kilR | Killing protein KilR | 9,35E-45 | 70 | 108 | 23 | 1125 | Phage Entero HK225 NC_019717: Kil protein |
| group_3075 | Hypothetical protein | 2,45E-44 | 134 | 44 | 233 | 915 | Hypothetical protein |
| group_3217 | Hypothetical protein | 2,37E-43 | 69 | 109 | 24 | 1124 | Phage Entero 933 W NC_000924: Bet protein |
| group_2348 | Hypothetical protein | 1,65E-42 | 63 | 115 | 16 | 1132 | Predicted NTPase, NACHT family domain [Signal transduction mechanisms]/Ecoli |
| group_686 | Hypothetical protein | 3,66E-42 | 71 | 107 | 30 | 1118 | Putative plasmid associated gene |
| group_2346 | Hypothetical protein | 4,12E-42 | 67 | 111 | 23 | 1125 | Phage hypothetical protein |
| group_7349 | Hypothetical protein | 5,77E-42 | 68 | 110 | 25 | 1123 | Putative plasmid associated gene |
| group_7350 | Hypothetical protein | 1,87E-41 | 68 | 110 | 26 | 1122 | Putative plasmid associated gene |
| group_7348 | Hypothetical protein | 3,53E-40 | 66 | 112 | 25 | 1123 | Putative plasmid associated gene |
| group_3117 | Hypothetical protein | 9,90E-39 | 51 | 127 | 6 | 1142 | Genomic DNA |
| group_1265 | Hypothetical protein | 4,26E-35 | 70 | 108 | 45 | 1103 | Putative plasmid associated gene |
| group_7650 | Hypothetical protein | 4,42E-35 | 52 | 126 | 12 | 1136 | Genomic DNA |
| group_6567 | Hypothetical protein | 1,48E-33 | 60 | 118 | 28 | 1120 | Genommic DNA |
| group_3072 | Hypothetical protein | 1,47E-32 | 111 | 67 | 194 | 954 | Genomic DNA |
| group_7355 | Hypothetical protein | 2,21E-26 | 42 | 136 | 12 | 1136 | Genomic DNA |
| prtR | Putative HTH-type transcriptional regulator | 1,55E-25 | 41 | 137 | 12 | 1136 | O antigen synthesis gene |
| group_7816 | Hypothetical protein | 3,86E-24 | 40 | 138 | 13 | 1135 | Phage hypothetical protein |
| group_7817 | Putative HTH-type transcriptional regulator | 2,65E-23 | 39 | 139 | 13 | 1135 | Phage Salmon ST160 NC_014900: C2 phage |
| rop | Regulatory protein rop | 6,27E-20 | 79 | 99 | 140 | 1008 | Putative plasmid associated gene |
| mbeC | Mobilization protein MbeC | 2,84E-10 | 45 | 133 | 78 | 1070 | Putative plasmid associated gene |
| group_48 | Hypothetical protein | 6,61E-04 | 44 | 134 | 141 | 1007 | Genomic DNA |
| xerC_1 | Tyrosine recombinase XerC | 7,22E-04 | 80 | 98 | 334 | 814 | Phage Shigel SfII_NC_021857: integrase |
| group_7044 | Hypothetical protein | 1,00E-03 | 79 | 99 | 333 | 815 | Genomic DNA |
| group_4380 | Hypothetical protein | 1,48E-03 | 79 | 99 | 337 | 811 | Phage protein flxA |
| sopE | Guanine nucleotide exchange factor SopE | 4,38E-02 | 73 | 105 | 345 | 803 | Phage G-nucleotide exchange factor SopE |
*Number of Italian strains positive (pos) or negative (neg) for the observed gene.
**Number of non-Italian strains positive (pos) or negative (neg) for the observed gene.
***Gene details based on PHAST and on BLASTn against NCBI database.
Figure 1MVSTm prophage region alignment. Alignment of 42.9Kb prophage region of STY194 strain (in the middle) including 62 loci of which 33 (yellow) are that classified by Scoary[38] as strongly associated with Italian MVSTm (group 2 in the text). At the top, S. Typhimurium LT2 showing the insertion of the prophage between the tRNA-thrW locus downstream of proA and the transposase STM0325. At the bottom, similar prophage region of UK strain SAL_JA6411AA with divergent loci coloured in grey. The full match of shared loci (orange and magenta arrows) is showed in blue.
Figure 2Core SNPs maximum likelihood tree. The maximum likelihood tree was inferred based on 11,278 core SNPs detected on 1,289 isolates. Figure 2 shows the tree pruned on clade II with coloured branches for subclade 10 (yellow), 41 (blue) and 61 (magenta). The internal circles indicate the originating geographical area of each isolates indicated by colours as in the legend. Externally, clusters of genes (detailed in Table 1) statistically associated with Italian strains divided by colours in plasmid-related contiguous loci (green); prophage related contiguous loci (yellow); and associated loci spread across the genome (light blue). The black hits are indicating sopE gene presence.
Figure 3SopEϕ loci distribution between Italian and non-Italian isolates. The percentage of genomes with positive match for each locus included in sopE-containing prophage is reported in the chart according to Italian (blue) and non-Italian (orange) origin. Gene details for each locus is indicated on the left as reported by PHAST annotation.
Categorization of 8,787 Salmonella genomes based on the distribution of biogeographical markers.
| Group | % of positive matches of SopEϕ loci | Number of genomes | % of the total dataset of 8,787 genomes |
|---|---|---|---|
| A | ≥90% (equal to SopEϕ positive) | 79 | 1% |
| B | ≥50% < 90% | 352 | 4% |
| C | ≥30% < 50% | 4,905 | 56% |
| D | <30% or no matches | 3,451 | 39% |