| Literature DB >> 32109937 |
Bruno D'Alessandro1, Victoria Pérez Escanda1, Lucía Balestrazzi1, Florencia Grattarola2, Andrés Iriarte1, Derek Pickard3, Lucía Yim1, José Alejandro Chabalgoity1, Laura Betancor4,5.
Abstract
Salmonella enterica serovar Enteritidis is a major cause of foodborne disease in Uruguay since 1995. We used a genomic approach to study a set of isolates from different sources and years. Whole genome phylogeny showed that most of the strains are distributed in two major lineages (E1 and E2), both belonging to MLST sequence type 11 the major ST among serovar Enteritidis. Strikingly, E2 isolates are over-represented in periods of outbreak abundance in Uruguay, while E1 span all epidemic periods. Both lineages circulate in neighbor countries at the same timescale as in Uruguay, and are present in minor numbers in distant countries. We identified allelic variants associated with each lineage. Three genes, ycdX, pduD and hsdM, have distinctive variants in E1 that may result in defective products. Another four genes (ybiO, yiaN, aas, aceA) present variants specific for the E2 lineage. Overall this work shows that S. enterica serovar Enteritidis strains circulating in Uruguay have the same phylogenetic profile than strains circulating in the region, as well as in more distant countries. Based on these results we hypothesize that the E2 lineage, which is more prevalent during epidemics, exhibits a combination of allelic variants that could be associated with its epidemic ability.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32109937 PMCID: PMC7046640 DOI: 10.1038/s41598-020-60502-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Distribution of S. enterica serovars received at the National Salmonella Center over the time (1975 to 2017). The 5 epidemiological periods of isolation are indicated with numbers. In white: non epidemic periods (1, 3 and 5). In orange: epidemic periods (2 and 4). (b) Phylogenetic tree showing the two clusters among Uruguayan S. enterica serovar Enteritidis genomes (E1 in blue, E2 in red and other strains in black). Highlight over the strain designation indicate the period of isolation as explained above. The genome of reference (P125109) is highlighted in green and marked with an asterisk. The isolates are named using a number followed by an underscore and the last two digits of the year of isolation. Scale bar units represents changes per variant site.
Figure 2(a) Phylogenetic tree of the 195 genomes including 61 from Uruguay (white leaves), 12 from Argentina (light blue leaves) and 122 from Brazil (yellow leaves). Branches of E1 lineage are colored in blue. Branches of E2 lineage are indicated in red. Lanes at right represent with different colors the allelic variants for ycdX, pduD, hsdM, ybiO, yiaN, aas and aceA respectively. See colour legends inside the (b). Scale bar units represents changes per variant site. (b) Gene alignments for the different allelic variants of ycdX, pduD, hsdM, ybiO, yiaN, aas and aceA respectively. Genes are depicted as annotated in the genome containing each variant and aligned to the reference P125109. The scale of the alignment in base pairs (bp) is indicated by horizontal bars. SNPs associated to each variant respect to the reference genome are detailed in Table 1. SNPs causing non synonymous variants are marked as X → Y (in single letter aminoacid code) relative to its position in the gene. Synonymous SNPs are only marked relative to its position in the gene (syn SNP).
SNPs and gene variants of the 7 gene markers for the lineages E1 and E2.
| Gene | Variant name | SNP | E1 | E2 | Observations |
|---|---|---|---|---|---|
| ycdX-1 | — | 0% | 100% | ||
| ycdX-2 | (-→ AGCC) 552 | 99% | 0% | Frameshift 187; stop 193 | |
| ycdX-3 | (- → CCAG) 552 | 1% | 0% | Frameshift 187; stop 193 | |
| pduD-1 | — | 0% | 100% | ||
| pduD-2 | (C→ T)396 | 99% | 0% | Stop 133 | |
| — | 3% | 100% | |||
| hsdM-2 | (- → T) 171 | 77% | 0% | Frameshift 60; stop 64; late start CDS. | |
| hsdM-3 | (- →TT) 171 | 13% | 0% | Frameshift 60; stop 60 and CDS in 2nd frame | |
| hsdM-4 | (- → T)178; (G → T) 797 | 1% | 0% | Frameshift 60; stop 64 and CDS in 2nd frame | |
| hsdM-5 | (- → T)178;(T → -) 867 | 3% | 0% | Frameshift 60; stop 64; late start and truncated CDS. | |
| hsdM- 6 | (- → TTT) 178 | 3% | 0% | insert Lys 59 | |
| ybiO-1 | — | 98% | 0% | ||
| ybiO-2 | (T → G) 1803 | 0% | 100% | Gly 601 ->Arg | |
| ybiO-3 | (G → A) 1125 | 2% | 0% | Synonymous | |
| yiaN-1 | — | 100% | 0% | ||
| yiaN-2 | (G → C) 489 | 0% | 100% | Gly 163 ->Ala | |
| aas-1 | — | 94% | 0% | ||
| aas-2 | (T → C) 1301 | 0% | 100% | Synonymous | |
| aas-3 | (C → T) 233 | 5% | 0% | (Pro78 → Lys) | |
| aas-4 | (G → A) 270 | 1% | 0% | Synonymous | |
| aceA-1 | — | 99% | 0% | ||
| aceA-2 | (C → T) 198 | 0% | 100% | Synonymous | |
| aceA-3 | (G → A) 543 | 1% | 0% | Synonymous |
The variant names correspond to those showed in Fig. 2(b). SNP column show the nucleotidic change respect to the reference genome in brackets (bases as in the leading strand of the reference genome P125109) and its relative position to the gene start in subscript. Columns E1 and E2 indicate the percentage of genomes that present the corresponding variant in each lineage. Note that the total number of E1 genomes is 103 and the total number of E2 genomes is 62.