| Literature DB >> 32471418 |
Ana Victoria C Pilar1, Nicholas Petronella2, Forest M Dussault2, Adrian J Verster2, Sadjia Bekal3, Roger C Levesque4, Lawrence Goodridge5,6, Sandeep Tamber7.
Abstract
BACKGROUND: Salmonella enterica is a leading cause of foodborne illness worldwide resulting in considerable public health and economic costs. Testing for the presence of this pathogen in food is often hampered by the presence of background microflora that may present as Salmonella (false positives). False positive isolates belonging to the genus Citrobacter can be difficult to distinguish from Salmonella due to similarities in their genetics, cell surface antigens, and other phenotypes. In order to understand the genetic basis of these similarities, a comparative genomic approach was used to define the pan-, core, accessory, and unique coding sequences of a representative population of Salmonella and Citrobacter strains.Entities:
Keywords: Citrobacter; Core genome; Genomics; Network analysis; Pan-genome; Salmonella
Mesh:
Year: 2020 PMID: 32471418 PMCID: PMC7257147 DOI: 10.1186/s12864-020-06780-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of strains and genome sequences from the Salmonella enterica Foodborne Syst-OMICS (SalFoS) database that were used in this study. Highlighted strains indicate false positive Salmonella strains (blue) and rare Salmonella serovars (grey)
Fig. 1Estimation of the Salmonella-Citrobacter pan- and core genome size. Rarefaction analysis demonstrates that the average number of gene clusters in the pan-genome increases while the conserved core gene clusters reaches a plateau as more genomes are analyzed
Fig. 2Consensus tree based on Salmonella-Citrobacter core genome. Maximum likelihood phylogenetic trees were constructed from multiple alignments of concatenated core gene sequences of Salmonella and Citrobacter using RaXML. The consensus tree divided the Salmonella and Citrobacter strains into two distinct clades. The tree confirms that the false positive strains (red and green dots) belong to the genus Citrobacter. Genetic distance is defined by the scale and bootstrap values indicate percentages of 1000 replicates
Fig. 3Citrobacter core genome tree. A maximum likelihood phylogenetic tree constructed from the concatenated core amino acid sequences of 197 Citrobacter strains was used to determine the taxonomic designation of the unspeciated Citrobacter strains in this study (yellow dots). Genome distance estimation was performed using MASH and the core genome was analyzed using centreseq [27]. genes. Genetic distance is defined by the scale and bootstrap values indicate percentages of 1000 replicates. Triangles denote collapsed nodes with n representing the number of strains found in each triangle. For each triangle, the letters represent the group of related strains found in the branches of the node
Fig. 4Functional classification of Salmonella and Citrobacter core and genus-specific unique genomes. Salmonella-Citrobacter core genes are depicted in (a), Salmonella-specific unique genes in (b), and Citrobacter-specific unique genes in (c). Left, proportion of genes represented in each KEGG major category. Right, proportion of putative metabolic genes represented in each KEGG metabolism sub-category
Fig. 5Network analysis based on shared genes. Connectivity between strains indicates the presence of common gene families. Percentage thresholds depicted are 26% (a), 27% (b), 29% (c), and 34% (d). The percentage of shared genes is based on the number of genes shared by two genomes divided by the maximum number of genes found in the genome. Colors indicate the following strains: blue – Salmonella, orange – Citrobacter, red – Citrobacter strains S646, S647, S648, S1285, green – Citrobacter strain S1284