| Literature DB >> 31076745 |
Pablo Cruz-Morales1,2, Camila A Orellana1, George Moutafis3, Glenn Moonen3, Gonzalo Rincon3, Lars K Nielsen1, Esteban Marcellin1.
Abstract
Clostridium is a large genus of obligate anaerobes belonging to the Firmicutes phylum of bacteria, most of which have a Gram-positive cell wall structure. The genus includes significant human and animal pathogens, causative of potentially deadly diseases such as tetanus and botulism. Despite their relevance and many studies suggesting that they are not a monophyletic group, the taxonomy of the group has largely been neglected. Currently, species belonging to the genus are placed in the unnatural order defined as Clostridiales, which includes the class Clostridia. Here, we used genomic data from 779 strains to study the taxonomy and evolution of the group. This analysis allowed us to 1) confirm that the group is composed of more than one genus, 2) detect major differences between pathogens classified as a single species within the group of authentic Clostridium spp. (sensu stricto), 3) identify inconsistencies between taxonomy and toxin evolution that reflect on the pervasive misclassification of strains, and 4) identify differential traits within central metabolism of members of what has been defined earlier and confirmed by us as cluster I. Our analysis shows that the current taxonomic classification of Clostridium species hinders the prediction of functions and traits, suggests a new classification for this fascinating class of bacteria, and highlights the importance of phylogenomics for taxonomic studies.Entities:
Keywords: zzm321990 Clostridiumzzm321990 ; pangenome; phylogenomics; taxonomy
Mesh:
Substances:
Year: 2019 PMID: 31076745 PMCID: PMC6656338 DOI: 10.1093/gbe/evz096
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Phylogenetic reconstruction of Clostridium species. Left corner: core protein families identified at different sequence identity cut-offs in all genomes and in genomes located in cluster I. The phylogeny was constructed using 27 markers conserved across 779 genomes (supplementary table S1, Supplementary Material online) deposited in the GenBank database and taxonomically defined as Clostridium. The main clades outside (blue lines) and within (369 taxa, red lines) the cluster I sensu stricto group (real clostridia) have been collapsed and defined as 17 taxonomic subgroups (table 1). Branch support is shown at each node. Uncollapsed clades for subgroups 1–17 are shown in supplementary figures S1–S14, Supplementary Material online.
Subgroup Core Genome Analysis
|
| Number of Strains | Species | Proteins Families in Subgroup Core |
| ||
|---|---|---|---|---|---|---|
| Proteins in Core | Accessory Proteins |
| ||||
| 1 | 106 |
| 890 | 205 | 673 | 2 |
| 2 | 21 |
| 1,071 | 205 | 755 | 99 |
| 3 | 10 |
| 2,214 | 205 | 1,115 | 877 |
| 4 | 16 |
| 1,063 | 205 | 752 | 96 |
| 5 | 42 |
| 1,367 | 205 | 824 | 324 |
| 6 | 11 |
| 462 | 205 | 245 | 4 |
| 7 | 48 |
| 1,319 | 205 | 925 | 178 |
| 8 | 20 |
| 994 | 205 | 769 | 10 |
| 9 | 4 |
| 2,704 | 205 | 1,452 | 1,021 |
| 10 | 8 |
| 1,350 | 205 | 1,037 | 94 |
| 11 | 15 |
| 1,028 | 205 | 779 | 32 |
| 12 | 55 |
| 2,044 | 205 | 1,276 | 547 |
| 13 | 3 |
| 2,040 | 205 | 1,091 | 732 |
| 14 | 2 | Spp. | 1,505 | 205 | 1,043 | 242 |
| 15 | 1 |
| 2,415 | 205 | 1,336 | 854 |
| 16 | 3 |
| 1,500 | 205 | 1,104 | 178 |
| 17 | 5 |
| 992 | 205 | 719 | 58 |
Subgroups within cluster I are defined by the species tree presented in figure 1.
Calculated using representative proteins for each protein family in the core of the subgroups at an identity cut-off of 0.4.
This category includes protein families that are only found in the core of a given subgroup, therefore, they represent subgroup-specific protein families.
. 2.—(A) Phylogenetic reconstruction of BotA toxin proteins. Six A–F clades are consistent with previous reports. Including non-botulinum strains, argentinense, tetani, butyricum, and baratti (marked with a dot). Three new sequences (in black) account for new unclassified toxin diversity. (B) Genome context of BotA homologs found in cluster I strains.
. 3.—Pangenome analysis of selected subgroups. The y axis shows the number of gene families and the x axis shows the number of genomes analyzed. The number of conserved genes was calculated by randomly adding genomes, with 20 replicates (if n > 20) or the same numbers as genomes (if n < 20). This analysis shows large differences in the genetic diversity of the subgroups, with less diversity and almost closed pangenomes in pathogenic subgroups.
. 4.—Phylogenomic analysis of AroA in Clostridium group I. (A) The phylogeny shows that AroA, has significantly diverged in all members of the subgroup 3 (C. tetani; clear blue), subgroup 6 (C. argentinense; yellow) and subgroup 1 (C. sporogenes–C. botulinum B; red) from the rest of the subgroups in cluster I (black). The full tree is provided as supplementary Tree 2, Supplementary Material online. (B) Genome context of AroA homologs. C. autoethanogenum is shown as a typical Group I AroA genome context, whereas divergent homologs show a genome context that includes enzymes from pyrimidine metabolism.