| Literature DB >> 34704797 |
Mostafa Y Abdel-Glil1,2, Prasad Thomas1, Jörg Linde1, Keith A Jolley3, Dag Harmsen4, Lothar H Wieler5, Heinrich Neubauer1, Christian Seyboldt1.
Abstract
Clostridium perfringens is a spore-forming anaerobic pathogen responsible for a variety of histotoxic and intestinal infections in humans and animals. High-resolution genotyping aiming to identify bacteria at strain level has become increasingly important in modern microbiology to understand pathogen transmission pathways and to tackle infection sources. This study aimed at establishing a publicly available genome-wide multilocus sequence-typing (MLST) scheme for C. perfringens. A total of 1,431 highly conserved core genes (1.34 megabases; 50% of the reference genome genes) were indexed for a core genome-based MLST (cgMLST) scheme for C. perfringens. The scheme was applied to 282 ecologically and geographically diverse genomes, showing that the genotyping results of cgMLST were highly congruent with the core genome-based single-nucleotide-polymorphism typing in terms of resolution and tree topology. In addition, the cgMLST provided a greater discrimination than classical MLST methods for C. perfringens. The usability of the scheme for outbreak analysis was confirmed by reinvestigating published outbreaks of C. perfringens-associated infections in the United States and the United Kingdom. In summary, a publicly available scheme and an allele nomenclature database for genomic typing of C. perfringens have been established and can be used for broad-based and standardized epidemiological studies. IMPORTANCE Global epidemiological surveillance of bacterial pathogens is enhanced by the availability of standard tools and sharing of typing data. The use of whole-genome sequencing has opened the possibility for high-resolution characterization of bacterial strains down to the clonal and subclonal levels. Core genome multilocus sequence typing is a robust system that uses highly conserved core genes for deep genotyping. The method has been successfully and widely used to describe the epidemiology of various bacterial species. Nevertheless, a cgMLST typing scheme for Clostridium perfringens is currently not publicly available. In this study, we (i) developed a cgMLST typing scheme for C. perfringens, (ii) evaluated the performance of the scheme on different sets of C. perfringens genomes from different hosts and geographic regions as well as from different outbreak situations, and, finally, (iii) made this scheme publicly available supported by an allele nomenclature database for global and standard genomic typing.Entities:
Keywords: Clostridium perfringens; SNP; cgMLST; genome typing
Mesh:
Year: 2021 PMID: 34704797 PMCID: PMC8549748 DOI: 10.1128/Spectrum.00533-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Schematic of the workflow to define the Clostridium perfringens core genome MLST targets.
Discriminatory power and typing concordance of C. perfringens cgMLST and core genome SNPs
| Typing method | No. of genotypes (out of 282 genomes) | Simpson's diversity index (95% confidence interval) | Adjusted Wallace index of concordance (95% confidence interval) of: | ||
|---|---|---|---|---|---|
| Core genome MLST | Core genome SNPs | Recombination-free core genome SNPs | |||
| Core genome MLST | 259 | 0.999 (0.999–1.000) | 0.758 (0.654–0.863) | 1.000 (1.000–1.000) | |
| Core genome SNPs | 261 | 0.999 (0.999–1.000) | 0.846 (0.718–0.974) | 1.000 (1.000–1.000) | |
| Recombination-free core genome SNPs | 244 | 0.998 (0.997–0.999) | 0.402 (0.254–0.551) | 0.361 (0.261–0.461) | |
FIG 2Comparison between typing results of allele- and nucleotide-based analysis of core genome MLST target genes. (A) Topological correspondence between the cgMLST neighbor-joining tree (left) and the SNP-based maximum-likelihood tree (right) is shown as a tanglegram. The branches of the tree were color-coded based on the species phylogroups indicated on the branches. The tanglegram was generated using Dendroscope v3.2.1027. (B) Phylogenetic maximum-likelihood tree based on nonrecombinant SNPs identified in the 282 Clostridium perfringens genomes. It shows the population structure of the species with the five phylogroups highlighted next to the ML tree, followed by the predicted toxin types of the strains and the origin and country of isolation as in the legend. The phylogenetic tree was visualized using iTOL v5 (49).
Discriminatory power and typing concordance of C. perfringens cgMLST and classical MLST schemes
| Typing method | No. of genotypes (out of 277 genomes) | Simpson's diversity index (95% confidence interval) | Adjusted Wallace index of concordance (95% confidence interval) of: | |||
|---|---|---|---|---|---|---|
| Core genome MLST | Jost scheme | Hibberd scheme | Deguchi scheme | |||
| Core genome MLST | 254 | 0.999 (0.999–1.000) | 1.000 (1.000–1.000) | 1.000 (1.000–1.000) | 1.000 (1.000–1.000) | |
| Jost scheme | 166 | 0.989 (0.985–0.993) | 0.070 (0.028–0.112) | 0.650 (0.597–0.704) | 0.558 (0.436–0.680) | |
| Hibberd scheme | 180 | 0.993 (0.990–0.995) | 0.107 (0.047–0.167) | 1.000 (1.000–1.000) | 0.704 (0.604–0.805) | |
| Deguchi scheme | 191 | 0.993 (0.991–0.996) | 0.115 (0.054–0.176) | 0.919 (0.894–0.945) | 0.755 (0.698–0.813) | |
FIG 3Minimum spanning tree based on the core genome MLST genes of 52 Clostridium perfringens strains from foodborne outbreaks in the United States. Nodes represent core genome MLST sequence types and were labeled according to the outbreak. Classical MLST types of the outbreak strains are also indicated where applicable. Red boxes mark monoclonal clusters detected in the outbreak. Connected blue boxes indicate more than one cluster identified in the outbreak. Clusters were identified based on fewer than seven allelic mismatches (gray shading). Numbers on the connecting lines illustrate the number of target genes with different alleles, represented by solid lines for allele differences below 40 and dotted lines for allele differences above 40. The minimum spanning trees was generated using Ridom SeqSphere v07 (https://www.ridom.de/).
FIG 4Minimum spanning tree based on the core genome MLST genes of 71 Clostridium perfringens strains from foodborne outbreaks in the United Kingdom. Classical MLST types of the outbreak strains are indicated where applicable. Nodes represent core genome MLST sequence types. Red boxes mark monoclonal clusters detected in the outbreak. Connected blue boxes indicate more than one cluster identified in the outbreak. Clusters were identified based on fewer than seven allelic mismatches (gray shading). Numbers on the connecting lines illustrate the number of target genes with different alleles, represented by solid lines for allele differences below 40 and dotted lines for allele differences above 40. The minimum spanning tree was generated using Ridom SeqSphere v07 (https://www.ridom.de/).
FIG 5Investigation of 32 Clostridium perfringens strains from outbreaks in care homes in the United Kingdom. (A) Neighbor-joining tree of the 32 C. perfringens strains using the cgMLST allele typing system, with lineages identified based on SNP analysis highlighted as in the original study (17). (B) Minimum spanning tree using cgMLST of the same genomes. Lineages are indicated with boxes, and outbreaks are color-coded as in the legend. Clusters were identified based on fewer than seven allelic mismatches (gray shading). Numbers on the connecting lines illustrate the number of target genes with different alleles, represented by solid lines for allele differences below 40 and dotted lines for allele differences above 40. The minimum spanning tree was generated using Ridom SeqSphere v07 (https://www.ridom.de/).