| Literature DB >> 32714310 |
Déborah Merda1, Arnaud Felten1, Noémie Vingadassalon1, Sarah Denayer2, Yacine Titouche3, Lucia Decastelli4, Bernadette Hickey5, Christos Kourtis6, Hristo Daskalov7, Michel-Yves Mistou1, Jacques-Antoine Hennekinne1.
Abstract
Food contamination by staphylococcal enterotoxins (SEs) is responsible for many food poisoning outbreaks (FPOs) each year, and they represent the third leading cause of FPOs in Europe. SEs constitute a protein family with 27 proteins. However, enzyme immunoassays can only detect directly in food the five classical SEs (SEA-SEE). Thus, molecular characterization methods of strains found in food are now used for FPO investigations. Here, we describe the development and implementation of a genomic analysis tool called NAuRA (Nice automatic Research of alleles) that can detect the presence of 27 SEs genes in just one analysis- and create a database of allelic data and protein variants for harmonizing analyses. This tool uses genome assembly data and the 27 protein sequences of SEs. To include the different divergence levels between SE-coding genes, parameters of coverage and identity were generated from 10,000 simulations and a dataset of 244 assembled genomes from strains responsible for outbreaks in Europe as well as the RefSeq reference database. Based on phylogenetic inference performed using maximum-likelihood on the core genomes of the strains in this collection, we demonstrated that strains responsible for FPOs are distributed throughout the phylogenetic tree. Moreover, 71 toxin profiles were obtained using the NAuRA pipeline and these profiles do not follow the evolutionary history of strains. This study presents a pioneering method to investigate strains isolated from food at the genomic level and to analyze the diversity of all 27 SE-coding genes together.Entities:
Keywords: bioinformatics tool; gene detection; genomic; staphylococcal enterotoxins genes; variant diversity
Year: 2020 PMID: 32714310 PMCID: PMC7344154 DOI: 10.3389/fmicb.2020.01483
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Analytical schematic of staphylococcal enterotoxin (SE) gene detection. (A) NAuRA (Nice Automatic Research of Alleles). (B) Determination of the best parameters to filter the BLAST results.
FIGURE 2Phylogeny of Staphylococcus aureus and distribution of staphylococcal enterotoxin (SE) genes. The phylogenetic inference based on the core genome was obtained using Roary on 244 of S. aureus genomes. The stars indicate strains involved in food poisoning outbreaks (FPOs). Each filled circle indicates the presence of SE genes; each color indicates SE genes linked on the same mobile genetic element (MGE). SE genes located on plasmid pIB485 are shown in blue; SE genes located on the same pathogenicity island in yellow or green; SE genes located on the egc cluster in red and all other genes not located on an MGE in black.
FIGURE 3Association pattern of staphylococcal enterotoxin (SE) genes. The UpSet plot shows all combinations of SE genes found in 244 genomes; the histogram shows the number of strains in which the combination was found. Each color background represents the SE genes linked on a given mobile genetic element (MGE). SE genes linked to no other and distributed in the genome are shown against a white background.
FIGURE 4Representation of stapylococcal enterotoxin (SE) protein diversity. (A) Histogram showing the number of protein variants (left axis) found in the 244 genomes according to the SE gene. The filled black circles (right axis) correspond to the number of strains in which the SE gene was found. (B) Boxplots show the distribution of the percentage of identity between the protein variants and the reference sequence. Each color represents the SE genes linked on the same mobile genetic element (MGE). SE genes located directly in the bacterial genome are shown in white.