Literature DB >> 29348922

Taxonogenomic description of four new Clostridium species isolated from human gut: 'Clostridium amazonitimonense', 'Clostridium merdae', 'Clostridium massilidielmoense' and 'Clostridium nigeriense'.

M T Alou1, S Ndongo1, L Frégère1, N Labas1, C Andrieu1, M Richez1, C Couderc1, J-P Baudoin1, J Abrahão2, S Brah3, A Diallo1,4, C Sokhna1,4, N Cassir1, B La Scola1, F Cadoret1, D Raoult1,5.   

Abstract

Culturomics investigates microbial diversity of the human microbiome by combining diversified culture conditions, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and 16S rRNA gene identification. The present study allowed identification of four putative new Clostridium sensu stricto species: 'Clostridium amazonitimonense' strain LF2T, 'Clostridium massilidielmoense' strain MT26T, 'Clostridium nigeriense' strain Marseille-P2414T and 'Clostridium merdae' strain Marseille-P2953T, which we describe using the concept of taxonogenomics. We describe the main characteristics of each bacterium and present their complete genome sequence and annotation.

Entities:  

Keywords:  culturomics; emerging bacteria; human microbiota; taxonogenomics; ‘Clostridium amazonitimonense’; ‘Clostridium massilidielmoense’; ‘Clostridium merdae’; ‘Clostridium nigeriense’

Year:  2017        PMID: 29348922      PMCID: PMC5767839          DOI: 10.1016/j.nmni.2017.11.003

Source DB:  PubMed          Journal:  New Microbes New Infect        ISSN: 2052-2975


Introduction

As showed by a remarkable number of studies, the human gut microbiota plays a tremendous role in human health and the homeostasis of the human body [1], [2], [3]. Conversely, an increasing number of links between the gut microbiota and several diseases has been uncovered; these diseases include nutritional disorders, metabolic diseases and pathologies of the intestine [1], [4], [5], [6]. Techniques used in exploratory studies have evolved alongside technology from culture-based techniques in early studies to today's culture-independent techniques [7]. A turning point in the study of the gut microbiota is in progress, with a return to culture-based techniques. In fact, cultured strains are needed for in vivo and in vitro experiments to apply the extensive knowledge acquired thanks to metagenomic studies. Culturomics, a method invented in our laboratory, is a part of this turning point [8]. It consists of the multiplication of culture conditions with a variation of media and physicochemical parameters such as atmosphere, temperature, pH and salinity in order to exhaustively explore a microbial ecosystem [9]. As part of a global exploratory study designed to improve the known and cultivated bacterial diversity of the human gut microbiota, several previously unknown species of Clostridium were isolated. Exploratory studies have shown that the healthy gut microbiota mostly consists of anaerobic bacteria, which represent 95% of the bacterial diversity [2]. Among these anaerobic bacteria, the genus Clostridium is one of the most represented in the human gut, although taxonomic chaos is associated with this genus and its higher ranks in taxonomy, such as the family Clostridiaceae and the Clostridiales order [7]. This genus was created in 1880 by Prazmowski with the type species Clostridium butyricum [10], [11]. In early taxonomy, which was based on phenotypic characteristics, all Gram-positive, anaerobic and spore-forming bacilli were attributed to the genus Clostridium. Consequently, with the advent of DNA sequencing and phylogenetic analysis, taxonomic chaos was uncovered in this genus, and several clusters were subsequently built, among which cluster I, Clostridium sensu stricto, was built around C. butyricum [11], [12]. Among the 211 Clostridium species with standing in nomenclature, only a third are Clostridium sensu stricto species [10], [12]. These mesophilic species are mostly strict anaerobes with Gram-positive staining and the ability to sporulate. Their metabolism is highly variable because they are able to metabolize various compounds such as carbohydrates, amino acids, alcohols, amino acids and purines. Acid and alcohol can be produced from carbohydrates or via protein metabolism [10], [11]. Here we propose the creation of four putative new Clostridium sensu stricto species: ‘Clostridium amazonitimonense’ strain LF2T (= CSUR P1445 = DSM 28600), ‘Clostridium massilidielmoense’ strain MT26T (= CSUR P2255 = CCUG 69893), ‘Clostridium nigeriense’ strain Marseille-P2414T (= CSUR P2414 = DSM 102218) and one new putative Clostridium cluster IV, ‘Clostridium merdae’ strain Marseille-P2953T (= CSUR P2953 = DSM 103669), which we describe using the concept of taxonogenomics [13]. This concept of new species description is based on the presentation of the phylogenetic, phenotypic and genomic features of the studied new species and their comparison to closely related species.

Materials and methods

Sample information

In order to explore the bacterial diversity of the human gut microbiota, stool samples were collected from several individuals: a healthy 8-month-old Senegalese boy, a healthy 44-month-old Senegalese girl, a healthy 37-year-old Amazonian woman from French Guyana and a healthy 29-year-old Senegalese woman. None of the donors was receiving antibiotic treatment at the time of sample collection. Oral consent was obtained from each adult. For the children, oral consent was provided by the parents. Samples were stored at −80°C upon arrival in the La Timone hospital in Marseille. This study received approval from the Institut Fédératif de Recherche 48 (Faculty of Medicine, Marseille, France) under agreement 09-022.

Strain identification

Each stool sample was cultured using the 18 culture conditions of standardized culturomics [8], [9]. For each condition, preincubation in the medium of an anaerobic blood culture bottle from BD BACTEC (Becton Dickinson [BD], San Diego, CA, USA) was performed during 30 days with seeding on 5% sheep's blood–enriched Colombia agar (bioMérieux, Marcy l’Étoile, France) every 3 days. Colonies were purified before identification using a Microflex spectrometer and a MTP 96 matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) target plate (Bruker Daltonics, Bremen, Germany) as described previously [14], [15]. Each obtained spectrum was matched against the database from Bruker and from the La Timone hospital using MALDI Biotyper 3.0 software by standard pattern matching (with default parameter settings). Matching scores under 1.7 indicated the absence of a match in the database. For these unidentified colonies, the 16S rRNA gene was sequenced using fD1 and rP2 primers as previously described [16]. Kim et al. [17] recommended the creation of a new species at the 16S similarity level threshold of 98.65% with the closest species with standing in nomenclature. The corresponding spectra are then incremented in the database and compared to the spectra of closely related species.

Growth conditions

For each species, optimal growth conditions were determined by testing various temperatures (25, 28, 37, 45 and 56°C), atmospheres (aerobic with or without 5% CO2, microaerophilic and anaerobic atmospheres using the GENbag anaer and GENbag microaer systems, respectively; bioMérieux), pH values (6, 6.5, 7 and 8.5) and NaCl concentrations (0.5%, 1%, 5%, 7.5% and 10%).

Morphologic, biochemical and antibiotic susceptibility tests

The morphologic characteristics of each species, namely Gram staining, motility and spore formation, were determined as previously described [8]. To determine the size, and to confirm the shape of the cells as well as spore formation, the cells were negatively stained and observed with electron microscopy as follows: cells were fixed with 2.5% glutaraldehyde in 0.1 M cacodylate buffer for at least 1 hour at 4°C. A drop of cell suspension was deposited for approximately 5 minutes on glow-discharged formvar carbon film on 400 mesh nickel grids (FCF400-Ni; Electron Microscopy Sciences (EMS), Hatfield, PA, USA). The grids were dried on blotting paper, and cells were negatively stained for 10 seconds with 1% ammonium molybdate solution in filtered water at room temperature. Electron micrographs were acquired with a Tecnai G20 Cryo (FEI Company, Limeil-Brevannes, France) transmission electron microscope operated at 200 keV. Cells were then measured by ImageJ software (Image Processing and Analysis in Java; US National Institutes of Health, Bethesda, MD, USA; http://imagej.nih.gov/) [18], [19]. Cellular fatty acid methyl ester (FAME) analysis was performed by gas chromatography/mass spectrometry (GC/MS). Two samples were prepared with approximately 23 mg of bacterial biomass per tube collected from several culture plates. Fatty acid methyl esters were prepared as described by Sasser [20]. GC/MS analyses were carried out as described previously [21]. Briefly, fatty acid methyl esters were separated using an Elite 5-MS column and monitored by mass spectrometry (Clarus 500–SQ 8 S; PerkinElmer, Courtaboeuf, France). A spectral database search was performed using MS Search 2.0 operated with the Standard Reference Database 1A (National Institute of Standards and Technology, Gaithersburg, MD, USA) and the FAME mass spectral database (Wiley, Chichester, UK). The metabolic capacities of the strains were determined using API ZYM, 20A and 50CH strips according to the manufacturer's instructions. After inoculation of the strips, they were incubated in anaerobic atmosphere for 4, 24 and 48 hours. Other enzymatic features were tested separately, namely oxidase and catalase, which were tested as previously described [8]. The disc diffusion method was used to test antibiotic susceptibility according to European Committee on Antimicrobial Susceptibility Testing recommendations [22].

Genomic DNA preparation and genome sequencing

Bacterial cell lysis was achieved using pretreatments such as lysozyme incubation at 37°C for 2 hours, followed by proteinase K action. DNA was extracted by the EZ1 biorobot (Qiagen, Germantown, MD, USA) with the EZ1 DNA tissues kit. The elution volume was 50 μL. Genomic DNA (gDNA) was quantified by a Qubit assay with the high sensitivity kit (Life Technologies, Carlsbad, CA, USA) to 123.7 ng/μL. gDNA was sequenced with a MiSeq sequencer (Illumina, San Diego, CA, USA) using the mate-pair strategy. The gDNA was barcoded in order to be mixed with 11 other projects using the Nextera Mate Pair sample prep kit (Illumina). The mate-pair library was prepared with 1.5 μg of gDNA using the Nextera mate-pair Illumina guide. The gDNA sample was simultaneously fragmented and tagged with a mate-pair junction adapter. The fragmentation pattern was validated on an Agilent 2100 BioAnalyzer (Agilent Technologies, Santa Clara, CA, USA) with a DNA 7500 labchip. The DNA fragments ranged in size from 1.5 to 11 kb, with an optimal size at 4.031 kb. No size selection was performed, and 385.5 ng of tagmented fragments were circularized. The circularized DNA was mechanically sheared to small fragments with an optimal size of 1070 bp on the Covaris device S2 in T6 tubes (Covaris, Woburn, MA, USA). The library profile was visualized on a High Sensitivity Bioanalyzer LabChip (Agilent Technologies), and the final concentration library was measured at 2.40 nmol/L. The libraries were normalized at 2 nM and pooled. After a denaturation step and dilution at 15 pM, the pool of libraries was loaded onto the reagent cartridge and then onto the instrument along with the flow cell. The automated cluster generation and sequencing runs were performed in a single 2 × 301 bp run.

Genome annotation and analysis

Open reading frames (ORFs) were predicted using Prodigal [23] with default parameters, but the predicted ORFs were excluded if they spanned a sequencing gap region. The predicted bacterial protein sequences were searched against the GenBank [24] and Clusters of Orthologous Groups (COGs) databases using BLASTP (E value 1e-03, coverage 0.7, identity percentage of 30%). If no hit was found, it was searched against the NR database using BLASTP with an E value of 1e-03, a coverage of 0.7 and an identity percentage of 30%, and if the sequence length was smaller than 80 aa, we used an E value of 1e-05. The tRNAScanSE tool [25] was used to find tRNA genes, while ribosomal RNAs were found using RNAmmer [26]. Lipoprotein signal peptides and the number of transmembrane helices were predicted using Phobius [27]. Mobile genetic elements were predicted using PHAST [28] and RAST [29]. ORFans were identified if all the BLASTP performed did not give positive results (E value smaller than 1e-03 for ORFs with sequence size larger than 80 aa or E value smaller than 1e-05 for ORFs with sequence length smaller than 80 aa). Such parameter thresholds have already been used in previous studies to define ORFans. Artemis [30] and DNA Plotter [31] were used for data management and the visualization of genomic features, respectively. The Mauve 2.3.1 alignment tool was used for multiple genomic sequence alignment [32]. Comparator species for genomic comparison were identified in the 16S RNA tree using PhyloPattern software [33]. For each selected genome, the complete genome sequence, proteome genome sequence and ORFeome genome sequence were retrieved from the National Center for Biotechnology Information FTP site. An annotation of the entire proteome was performed to define the distribution of functional classes of predicted genes according to the clusters of orthologous groups of proteins (using the same method as for the genome annotation). Annotation and comparison processes were performed in the Multi-Agent software system DAGOBAH [34], which includes Figenix [35] libraries that provide pipeline analysis. To evaluate the genomic similarity between studied genomes, we determined two parameters: digital DNA-DNA hybridization (DDH), which exhibits a high correlation with DDH [36], [37]; and average genomic identity of orthologous gene sequences (AGIOS) [38], which was designed to be independent from DDH. The AGIOS score is the mean value of nucleotide similarity between all couples of orthologous proteins between the two studied genomes [38], [39].

Results

Strain identification and phylogenetic analyses

Isolation of strains Marseille-P2953 and LF2 occurred, respectively, after 7 days' and 30 days' preincubation in anaerobic blood culture bottles supplemented with rumen and sheep's blood, respectively. Strain Marseille-P2414 was first isolated after 15 days' preincubation in an aerobic blood culture bottle supplemented with rumen and sheep's blood. Finally, strain MT26 was first isolated after 30 days' preincubation in an anaerobic blood culture bottle supplemented with sheep's blood. Because the routine identification using MALDI-TOF MS failed, the 16S rRNA gene of each strain was sequenced. The 16S rRNA gene sequences of strains Marseille-P2953 (LT732642), LF2 (LK021125), Marseille-P2414 (LT161894) and MT26 (LN998063) revealed a 96.41%, 96.95%, 98.23% and 98.44% similarity level with, respectively, Clostridium sporosphaeroides strain DSM 1294, Clostridium putrefaciens strain DSM 1291, Clostridium tertium strain JCM 6289 and Clostridium novyi strain JCM 1406 (Fig. 1, Table 1). All these similarity levels were lower than the 98.65 cutoff defined to create a new species without performing DDH. We thus propose the creation of four new species within the genus Clostridium with the following names: ‘Clostridium merdae’ (strain Marseille-P2953), ‘Clostridium amazonitimonense’ (strain LF2), ‘Clostridium nigeriense’ (strain Marseille-P2414) and ‘Clostridium massilidielmoense’ (strain MT26). The MALDI-TOF MS spectra of the four strains in this study were incremented in our database and compared to those of their closest relatives (Fig. 2).
Fig. 1

Phylogenetic tree highlighting position of ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ relative to other type strains within Clostridium genus. Strains and their corresponding GenBank accession numbers for 16S rRNA genes are indicated in brackets. Sequences were aligned using Clustal W (http://www.clustal.org/clustal2/), and phylogenetic inferences were obtained using maximum-likelihood method within MEGA6 software (http://www.megasoftware.net/mega.php). Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Butyricicoccus pullicaecorum (NR_044490) was used as outgroup. Scale bar = 0.02% nucleotide sequence divergence.

Table 1

Species information

CharacteristicLF2MT26Marseille-P2953Marseille-P2414
Patient informationHealthy 37-year-old Amazonian womanHealthy 8-month-old Senegalese boyHealthy 29-year-old Senegalese womanHealthy 44-month-old girl from Niger
Isolation conditions30 days' anaerobic preincubation in blood culture bottle with rumen and sheep's blood30 days' anaerobic preincubation in blood culture bottle with sheep's blood7 days' anaerobic preincubation in blood culture bottle with rumen and sheep's blood15 days' aerobic preincubation in blood culture bottle with rumen and sheep's blood
Optimal growth temperature37°C37°C37°C37°C
AtmosphereStrict anaerobeStrict anaerobeStrict anaerobeFacultative anaerobe
Similarity percentage with closest species96.95% with Clostridium putrefaciens strain DSM 1291T (AF127024)98.44% with Clostridium novyi strain JCM 1406T (AB045606)96.41% with Clostridium sporosphaeroides strain DSM 1294T (X66002)98.23% with Clostridium tertium strain JCM 6289T (Y18174)
Fig. 2

Reference mass spectra from ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense.’ Spectra from 12 individual colonies were compared and each reference spectrum generated (A). Gel view comparing ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ to other species within genus Clostridium. Gel view displays raw spectra of loaded spectrum files arranged in pseudo–gellike look. x-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peaks, with peak intensity in arbitrary units. Displayed species are indicated at left (B).

Phylogenetic tree highlighting position of ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ relative to other type strains within Clostridium genus. Strains and their corresponding GenBank accession numbers for 16S rRNA genes are indicated in brackets. Sequences were aligned using Clustal W (http://www.clustal.org/clustal2/), and phylogenetic inferences were obtained using maximum-likelihood method within MEGA6 software (http://www.megasoftware.net/mega.php). Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Butyricicoccus pullicaecorum (NR_044490) was used as outgroup. Scale bar = 0.02% nucleotide sequence divergence. Reference mass spectra from ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense.’ Spectra from 12 individual colonies were compared and each reference spectrum generated (A). Gel view comparing ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ to other species within genus Clostridium. Gel view displays raw spectra of loaded spectrum files arranged in pseudo–gellike look. x-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peaks, with peak intensity in arbitrary units. Displayed species are indicated at left (B). Species information

Phenotypic description

Growth was observed between 28 and 45°C for all studied strains. While strain Marseille-P2414 grew in all tested atmospheres (namely aerobic, microaerophilic and anaerobic), growth was observed only in anaerobic atmosphere for strains Marseille-P2953, LF2 and MT26. All strains exhibited optimal growth at 37°C in anaerobic atmosphere with an incubation period of 24 hours for strain Marseille-P2414 and 48 hours for strains Marseille-P2953, LF2 and MT26. None of the strains was tolerant to NaCl, only growing at a concentration of 0.5%. Conversely, all strains grew at all tested pH concentrations (6, 6.5, 7, 8.5). The phenotypic characteristics of the four strains in this study and their closest relatives are summarized in Table 2. All strains were Gram positive and were spore-forming, rod-shaped cells (Fig. 3). Strains LF2 and Marseille-P2414 were motile, whereas strains Marseille-P2953 and MT26 were nonmotile. Major fatty acids are presented in Table 3. We observed that strains Marseille-P2414, LF2 and MT26 present hexadecanoic acid (16:0) as the major fatty acid composing their cell envelope, confirming these three strains belong to the same group. For strain Marseille-P2953, its major fatty acid was 9-octadecenoic acid (18:1n9), which is minor or absent in the other strains. Although phylogeny clearly classified Marseille-P2953 as a Clostridium species, these data confirmed other phenotypical observations, suggesting that this strain could belong to another Clostridium cluster.
Table 2

Differential characteristics of ‘Clostridium amazonitimonense,’ ‘Clostridium putrefaciens,’ ‘Clostridium merdae,’ ‘Clostridium sporosphaeroides,’ ‘Clostridium nigeriense,’ Clostridium tertium, ‘Clostridium massilidielmoense’ and Clostridium novyi[1], [2]

Characteristic‘C. amazonitimonense’C. putrefaciens‘C. merdae’C. sporosphaeroides‘C. nigeriense’C. tertium‘C. massilidielmoense’C. novyi
Property
 Cell shapeBacilliBacilliBacilliBacilliBacilliBacilliBacilliBacilli
 Cell size (μm)2–2.5NA1.2–1.50.5–0.60.8–1.50.5–1.40.540.6–1.4
 Gram stain++++++++
 Oxygen tolerance++
 Salt tolerance
 Motility+++++
 Endospore formation++++++++
 Major cellular fatty acid15:0 isoNA16:00NA16:00NA16:00NA
Production of:
 CatalaseNANANANA
 OxidaseNANA+NANA
 Alkaline phosphatase+NANA+NA+NA
 UreaseNANANANA
 β-GalactosidaseNANANANA
 N-acetyl-glucosamine+NANA+NA+NA
Acid from:
 l-ArabinoseNANANANA
 Ribose+NANA+NA+NA
 Mannose+NANA+NA+NA
 MannitolNANANANA
 SucroseNA+NANA
 d-Glucose+++NA+NA+NA
 d-Fructose+NANA+NA+NA
 d-Maltose+NA+NA+NA
 d-LactoseNA+NANA
HabitatHuman gutHog muscle tissueHuman gutHuman gutHuman gutHuman gutHuman gutSoil

+, positive result; −, negative result; NA, data not available.

Fig. 3

Electron micrographs of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D).

Table 3

Cellular fatty acid composition (%)

Fatty acidName‘Clostridium nigeriense’‘Clostridium merdae’‘Clostridium massilidielmoense’‘Clostridium amazonitimonense’
16:00Hexadecanoic acid38.9 ± 0.838.5 ± 2.155.0 ± 2.515.3 ± 2.3
14:00Tetradecanoic acidNDND15.7 ± 1.712.0 ± 0.6
18:00Octadecanoic acid6.7 ± 1.13.2 ± 0.63.6 ± 0.9TR
13:00Tridecanoic acidNDND3.0 ± 0.6ND
12:00Dodecanoic acid1.8 ± 0.1TR1.5 ± 1.0TR
04:00Butanoic acidNDNDND1.7 ± 1.0
15:00Pentadecanoic acidTRND2.9 ± 0.6ND
18:1n99-Octadecenoic acid20.1 ± 0.435.4 ± 0.414.4 ± 1.02.3 ± 0.6
13:0 iso11-methyl-Dodecanoic acidNDNDND22.7 ± 1.8
5:0 iso3-methyl-Butanoic acidNDNDND11.2 ± 2.1
18:1n711-Octadecenoic acid7.9 ± 0.2NDNDTR
14:01Tetradecanoic acid6.6 ± 0.35.3 ± 1.4NDND
18:2n69.12-Octadecadienoic acid5.5 ± 0.415.2 ± 0.63.9 ± 1.3TR
16:1n79-Hexadecenoic acid5.3 ± 0.2TRNDND
16:1n97-Hexadecenoic acid3.9 ± 0.2NDNDND
13:0 anteiso10-methyl-Dodecanoic acidNDNDND2.3 ± 0.3
15:0 anteiso12-methyl-Tetradecanoic acidTRTRND4.8 ± 0.5
15:0 iso13-methyl-Tetradecanoic acidTRTRND22.0 ± 1.5

ND, not detected; TR, traces.

Electron micrographs of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D). Differential characteristics of ‘Clostridium amazonitimonense,’ ‘Clostridium putrefaciens,’ ‘Clostridium merdae,’ ‘Clostridium sporosphaeroides,’ ‘Clostridium nigeriense,’ Clostridium tertium, ‘Clostridium massilidielmoense’ and Clostridium novyi[1], [2] +, positive result; −, negative result; NA, data not available. Cellular fatty acid composition (%) ND, not detected; TR, traces.

Genome description and comparison

The genomic properties of all strains are presented in Table 4 and Fig. 4. Genome sizes of the strains ranged from 2.75 Mb for strain Marseille-P2953 to 3.8 Mb for strain Marseille-P2414, while the G+C content ranged from 28.2% for strain Marseille-P2414 to 45.4% for strain Marseille-P2953. The proportion of protein-coding genes was similar in all four genomes, and the distribution into COGs categories was also similar (Table 5). However, we observed that the genome of ‘Clostridium merdae’ strain Marseille-P2953 presented a G+C% level significantly higher than the genomes of the other presented strains (Table 6). The G+C content of strain Marseille-P2953 sets it apart from the three other studied strains, as previously observed by its phenotypical features. Thus, the very large family of Clostridiaceae contains species with great phenotypical and genomic diversity, suggesting that this family, which was created before modern sequencing methods were in use, needs to be greatly reclassified. These observations demonstrate that phenotypical observations are no longer sufficient to classify all the newly discovered species. Whole genome sequencing must be used to characterize new bacterial species.
Table 4

Nucleotide content and gene count levels of genomes

Attribute
‘Clostridium amazonitimonense’
‘Clostridium merdae’
‘Clostridium nigeriense’
‘Clostridium massilidielmoense’
No. of reads
5 235 372
1 841 718
1 832 750
3 097 182
Scaffolds/contigs4/15
4/4
4/19
9/9
Value% of totalaValue% of totalaValue% of totalaValue% of totala
Size (bp)3 761 4171002 748 8601003 799 4741002 759 335100
G+C content (%)1 145 89030.551 246 25845.41 064 55628.2782 30728.3
Coding region (bp)3 129 29383.22 373 50886.33 322 70587.42 453 24388.9
Total genes3553100269910036591002772100
RNA genes1474.1592.21875.1913.3
Protein-coding genes340695.926497.8347294.9268196.7
Genes with function prediction247772.7196674.424570.5198974.2
Genes assigned to COGs224165.816763.2214961.9163961.1
Genes with peptide signals2707.927110.23078.82282.8
ORFans genes1895.51174.41193.4401.5
Genes with transmembrane helices84924.956921.593526.963923.8
Genes associated with PKS or NRPS30.080050.130.1
No. of antibiotic resistance genes10.03000000
No. of genes associated with Pfam-A domainsNANA23868831828624688

COGs, Clusters of Orthologous Groups database; NRPS, nonribosomal peptide synthase; ORF, open reading frame; PKS, polyketide synthase.

The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.

Fig. 4

Graphical circular map of genome of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D). From outside to center: Contigs (red/grey), COGs category of genes on forward strand (three circles), genes on forward strand (blue circle), genes on reverse strand (red circle), COGs category on reverse strand (three circles), G+C content. COGs, Clusters of Orthologous Groups database.

Table 5

Number of genes associated with 25 general COGs functional categories

CodeDescription‘Clostridium merdae’
‘Clostridium nigeriense’
‘Clostridium massilidielmoense’
‘Clostridium amazonitimonense’
Value% of totalValue% of totalValue% of totalValue% of totala
[J]Translation1897.162176.251927.161815.31
[A]RNA processing and modification00000000
[K]Transcription1716.481915.501254.662386.99
[L]Replication, recombination and repair772.91852.45873.241353.96
[B]Chromatin structure and dynamics00000010.03
[D]Cell cycle control, mitosis and meiosis311.17401.15391.45240.70
[Y]Nuclear structure00000000
[V]Defense mechanisms532.01752.16582.161053.08
[T]Signal transduction mechanisms973.671173.37943.501353.96
[M]Cell wall/membrane biogenesis752.841293.711063.951043.05
[N]Cell motility431.63531.52471.75611.79
[Z]Cytoskeleton00000000
[W]Extracellular structures70.2660.1770.2600
[U]Intracellular trafficking and secretion200.76260.75250.93341
[O]Posttanslational modification, protein turnover, chaperones612.31792.27752.8742.17
[X]Mobilome: prophages, transposons150.5780.23361.34NANA
[C]Energy production and conversion1064.011394.00933.471374.02
[G]Carbohydrate transport and metabolism1425.382717.801254.661654.84
[E]Amino acid transport and metabolism2128.031955.611505.592196.43
[F]Nucleotide transport and metabolism582.19852.45682.53722.11
[H]Coenzyme transport and metabolism903.411042.991114.14882.58
[I]Lipid transport and metabolism692.61842.42642.39621.82
[P]Inorganic ion transport and metabolism873.291333.83873.241534.49
[Q]Secondary metabolite biosynthesis, transport and catabolism281.06300.86130.48351.02
[R]General function prediction only1585.982025.821515.633209.39
[S]Function unknown863.261333.83913.391905.58
Not in COGs97036.741.32338.101.04238.871.16534.20

COGs, Clusters of Orthologous Groups database; NA, not applicable.

The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.

Table 6

Genome comparison of closely related species

OrganismINSDCSize (Mb)G+C (%)Total genes
Clostridium beijerinckiiaCP0007216.48305.425
Clostridium clariflavumCP0030654.9035.723.892
Clostridium cellulovoransCP002160563114.524
Clostridium perfringensaCP0002463528.382.876
‘Clostridium massilidielmoense’FXLO000000002.7628.352.681
Clostridium acetireducensLZFO000000002.4226.752.238
Clostridium acetobutylicumAE0014374330.933.848
Clostridium argentinenseAYSO000000004.7428.64.279
Clostridium carboxidivoransCP0118035.7529.934.973
Clostridium drakeiJIBU000000005.58295.029
Clostridium haemolyticumJENX000000002.6127.432.201
Clostridium homopropionicumLHUR000000003.6531.093.5
Clostridium novyiJENM000000002.4628.581.833
Clostridium tetaniJRGJ000000002.7928.572.516
‘Clostridium nigeriense’FLKB000000003.7928.73.472
Clostridium baratiiJZTY000000003727.952.839
Clostridium diolisAQQG000000005.8429.745.196
Clostridium paraputrificumAUJC000000003.5629.643.491
Clostridium sartagoformeASRV000000003.9827.913.714
‘Clostridium merdae’FXLN000000002.7545.342.64
Clostridium leptumABCB000000002.825052.482
Clostridium thermocellumCP0005683.8438.993.173
Eubacterium siraeumARNC000000002.6244.852.308
Flavonifractor plautiiAGCK000000003.8261.074.278
Intestinimonas butyricproducensCP0113073.3759.063.381
Pseudoflavonifractor capillosusAAXG00000000445914.829
Ruminococcus albusCP0024034.3343.623.991
‘Clostridium amazonitimonense’CCNN000000003.7630.563.046

INSDC, International Nucleotide Sequence Database Collaboration.

The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.

Graphical circular map of genome of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D). From outside to center: Contigs (red/grey), COGs category of genes on forward strand (three circles), genes on forward strand (blue circle), genes on reverse strand (red circle), COGs category on reverse strand (three circles), G+C content. COGs, Clusters of Orthologous Groups database. Nucleotide content and gene count levels of genomes COGs, Clusters of Orthologous Groups database; NRPS, nonribosomal peptide synthase; ORF, open reading frame; PKS, polyketide synthase. The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome. Number of genes associated with 25 general COGs functional categories COGs, Clusters of Orthologous Groups database; NA, not applicable. The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome. Genome comparison of closely related species INSDC, International Nucleotide Sequence Database Collaboration. The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome. The DDH comparison presented in Table 7 demonstrates that the four studied new species are clearly distinct new species. Indeed, the DDH level between our four new species and the other Clostridium species comprises a range delimiting a distinct bacterial species.
Table 7

Pairwise comparison of four studied species with nine other species using GGDC

234567891011121314
119.90 ± 2.218.10 ± 2.219.00 ± 2.219.40 ± 2.218.40 ± 2.122.60 ± 2.221.00 ± 2.220.30 ± 2.224.40 ± 2.339.90 ± 2.523.70 ± 2.323.20 ± 2.324.80 ± 2.3
2100.00 ± 0.019.40 ± 2.224.80 ± 2.339.80 ± 2.519.70 ± 2.221.30 ± 2.319.50 ± 2.217.40 ± 2.126.70 ± 2.326.40 ± 2.426.00 ± 2.319.60 ± 2.225.70 ± 2.3
3100.00 ± 0.019.00 ± 2.218.10 ± 2.118.50 ± 2.121.30 ± 2.220.70 ± 2.220.00 ± 2.224.30 ± 2.326.70 ± 2.428.70 ± 2.320.10 ± 2.226.60 ± 2.4
4100.00 ± 0.024.70 ± 2.319.50 ± 2.221.30 ± 2.219.10 ± 2.218.30 ± 2.128.40 ± 2.426.10 ± 2.328.30 ± 2.320.00 ± 2.223.70 ± 2.3
5100.00 ± 0.018.30 ± 2.121.30 ± 2.319.40 ± 2.118.40 ± 2.223.30 ± 2.325.70 ± 2.326.70 ± 2.320.50 ± 2.226.80 ± 2.3
6100.00 ± 0.022.10 ± 2.219.40 ± 2.118.00 ± 2.122.90 ± 2.325.80 ± 2.328.40 ± 2.421.20 ± 2.224.10 ± 2.3
7100.00 ± 0.022.20 ± 2.323.60 ± 2.323.00 ± 2.324.70 ± 2.326.00 ± 2.324.90 ± 2.324.30 ± 2.4
8100.00 ± 0.020.20 ± 2.225.30 ± 2.325.00 ± 2.328.20 ± 2.425.10 ± 2.325.10 ± 2.4
9100.00 ± 0.025.10 ± 2.324.70 ± 2.328.10 ± 2.423.40 ± 2.325.40 ± 2.3
10100.00 ± 0.022.50 ± 2.229.10 ± 2.326.80 ± 2.429.20 ± 2.4
11100.00 ± 0.025.10 ± 2.323.60 ± 2.329.70 ± 2.4
12100.00 ± 0.027.30 ± 2.425.70 ± 2.3
13100.00 ± 0.023.80 ± 2.3
14100.00 ± 0.0

DDH, DNA-DNA hybridization; GGDC, genome-to-genome distance; HSP, high-scoring pair.

Data derived from formula 2 (DDH estimates based on identities/HSP length). Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1) and phylogenetic analyses as well as GGDC results.

1, Clostridium cellulovorans; 2, Clostridium massilidielmoense; 3, Clostridium acetireducens; 4, Clostridium haemolyticum; 5, Clostridium novyi; 6, Clostridium tetani; 7, Clostridium nigeriense; 8, Clostridium diolis; 9, Clostridium sartagoforme; 10, Clostridium merdae; 11, Clostridium leptum; 12, Clostridium thermocellum; 13, Clostridium amazonitimonense; 14, Ruminococcus albus.

Pairwise comparison of four studied species with nine other species using GGDC DDH, DNA-DNA hybridization; GGDC, genome-to-genome distance; HSP, high-scoring pair. Data derived from formula 2 (DDH estimates based on identities/HSP length). Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1) and phylogenetic analyses as well as GGDC results. 1, Clostridium cellulovorans; 2, Clostridium massilidielmoense; 3, Clostridium acetireducens; 4, Clostridium haemolyticum; 5, Clostridium novyi; 6, Clostridium tetani; 7, Clostridium nigeriense; 8, Clostridium diolis; 9, Clostridium sartagoforme; 10, Clostridium merdae; 11, Clostridium leptum; 12, Clostridium thermocellum; 13, Clostridium amazonitimonense; 14, Ruminococcus albus. Strain Marseille-P2414 shared 1342, 1434, 1553, 1564, 1628 and 1664 orthologous proteins with Clostridium perfringens, Clostridium baratii, Clostridium diolis, Clostridium beijerinckii, Clostridium paraputrificum and Clostridium sartagoforme, respectively (Supplementary Table S1a). Strain MT26 shared 1111, 1114, 1151, 1165, 1253, 1254, 1261, 1294 and 1372 orthologous proteins with Clostridium acetireducens, Clostridium acetobutylicum, Clostridium argentinense, Clostridium tetani, Clostridium carboxidivorans, Clostridium drakei, Clostridium homopropionicum, Clostridium novyi and Clostridium haemolyticum, respectively (Supplementary Table S1b). Strain LF2 shared 774, 1005, 1017 and 1138 orthologous proteins with Clostridium clariflavum, Clostridium perfringens, Clostridium cellulovorans and Clostridium beijerinckii, respectively (Supplementary Table S1c). Finally, strain Marseille-P2953 shared 735 orthologous proteins with Ruminococcus albus and Clostridium thermocellum as well as 740, 810, 833, 838, 867, 889 and 907 orthologous proteins with Eubacterium siraeum, Clostridium leptum, Pseudoflavonifractor capillosus, Intestinimonas butyriciproducens, Flavonifractor plautii, Oscillibacter ruminantium and Oscillibacter valericigenes, respectively (Supplementary Table S1d). Strain Marseille-P2953 shared fewer orthologous proteins than the other described genomes, as this strain belongs to Clostridium cluster IV and has been compared to more closely related species with available genomes, which mostly belong to other genera, unlike the other described strains. AGIOS value ranges were similar among species with standing in nomenclature with compared species and with the described genomes, thus confirming that the described strains are new species.

Conclusion

Considering the 16S rRNA gene sequence similarity level with the closest species, all under the 98.65% cutoff to define a new species without performing DDH; considering the phenotypic characteristics distinguishing these strains from other known Clostridium species; and confirming that they belong to the genus Clostridium and have the genomic characteristics of these strains, we propose the creation of four new Clostridium species: ‘Clostridium nigeriense,’ ‘Clostridium massilidielmoense,’ ‘Clostridium amazonitimonense’ and ‘Clostridium merdae.’

Description of the new species ‘Clostridium nigeriense’

Clostridium nigeriense’ (ni.ge.rien'se, L. gen. adj., from nigeriense, ‘of Niger,’ referring to where the stool sample was collected) was isolated from the stool sample of a healthy girl from Niger and was deposited in the DSMZ collection under accession number DSM 102218 and in the CSUR collection under accession number CSUR P2414. Clostridium nigeriense’ is a Gram-positive bacillus that is motile, spore forming and facultative anaerobic. The test of catalase activity was negative but was positive for oxidase. ‘Clostridium nigeriense’ forms translucent colonies with a mean diameter of 3 mm. Its major cellular fatty acid is hexadecanoic acid (16:0). Its genome is 3 799 474 bp long with 28.27% G+C content. The 16S rRNA gene and genome sequences are available in the European Molecular Biology Laboratory–European Bioinformatics Institute (EMBL-EBI) database under accession numbers LT161894 and FLKB00000000, respectively. Strain Marseille-P2414T is the type strain of the new species ‘Clostridium nigeriense.’

Description of the new species ‘Clostridium amazonitimonense’

Clostridium amazonitimonense’ (am.a.zo.ni.ti.mo.nen'se, N.L. neut. adj., from amazoni (a.ma.zo.ni, N.L. neut. adj.), referring to the Amazonian forest, the location where the stool sample was collected, and timonense (ti.mo.nen'se, N.L. neut. adj.), referring to the La Timone hospital, where the strain was isolated) was isolated from the stool sample of a healthy middle-aged woman from Amazonia and is deposited in the DSMZ collection under accession number DSM 28600 and in the CSUR collection under accession number CSUR P1445. Strain LF2 is a Gram-positive bacillus that is motile, spore forming and a strict anaerobe. It is negative for catalase and oxidase activity. ‘Clostridium amazonitimonense’ forms white colonies with a mean diameter of 5 mm. The genome is 3 761 417 bp long with 30.56% G+C content. The 16S rRNA gene and genome sequences are available in the EMBL-EBI database under accession numbers LK021125 and CCNN00000000, respectively. The type strain LF2T is the type strain of the new species ‘Clostridium amazonitimonense.’

Description of the new species ‘Clostridium massilidielmoense’

Clostridium massilidielmoense’ (mas.si.li.di.el.mo.en'se, L. neut. adj., a combination of massili (ma.ssi.li, N.L. neut. adj.), from Massilia, the Roman name of Marseille, where strain MT26 was isolated; and dielmoense (di.el.mo.en'se, N.L. neut. adj.), from Dielmo, the Senegalese village where the boy from whom the strain was cultivated lived) was isolated from the stool sample of a healthy 8-month-old Senegalese boy and is deposited in the CCUG collection under accession number CCUG 69893 and in the CSUR collection under accession number CSUR P2255. Strain MT26 is a Gram-positive bacillus that is nonmotile, spore forming and a strict anaerobe. It shows positive catalase activity but no oxidase activity. ‘Clostridium massilidielmoense’ forms a translucent biofilm with cells of an average diameter of 0.54 μm. The genome is 2 759 335 bp long with 28.35% G+C content. The 16S rRNA gene and genome sequences are available in the EMBL-EBI database under accession numbers LN998063 and FXLO00000000, respectively. The type strain MT26T is the type strain of the new species ‘Clostridium massilidielmoense.’

Description of the new species ‘Clostridium merdae’

Clostridium merdae’ (mer'dae, L. gen. n., from merdae, ‘of faeces,’ referring to the fact that the strain was isolated from a stool sample) was isolated from the stool sample of a healthy 29-year-old Senegalese woman and is deposited in the DSMZ collection under accession number DSM 103669 and in the CSUR collection under accession number CSUR P2953. Strain Marseille-P2953 is a Gram-positive bacillus that is motile, spore forming and a strict anaerobe. It does not show catalase or oxidase activity. ‘Clostridium merdae’ forms small white colonies with a mean diameter of 1 mm. The genome is 2 748 860 bp long with 45.34% G+C content. The 16S rRNA gene and genome sequences are available in the EMBL-EBI database under accession numbers LT732642 and FXLN00000000, respectively. The type strain Marseille-P2953 is the type strain of the new species ‘Clostridium merdae.’

Conflict of interest

None declared.
  35 in total

1.  A combined transmembrane topology and signal peptide prediction method.

Authors:  Lukas Käll; Anders Krogh; Erik L L Sonnhammer
Journal:  J Mol Biol       Date:  2004-05-14       Impact factor: 5.469

Review 2.  Samples and techniques highlighting the links between obesity and microbiota.

Authors:  Emmanouil Angelakis; Jean-Christophe Lagier
Journal:  Microb Pathog       Date:  2016-01-30       Impact factor: 3.738

3.  A COMPLETE DESCRIPTION OF CLOSTRIDIUM PUTREFACIENS (McBRYDE).

Authors:  W S Sturges; E T Drake
Journal:  J Bacteriol       Date:  1927-09       Impact factor: 3.490

4.  Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry.

Authors:  Piseth Seng; Michel Drancourt; Frédérique Gouriet; Bernard La Scola; Pierre-Edouard Fournier; Jean Marc Rolain; Didier Raoult
Journal:  Clin Infect Dis       Date:  2009-08-15       Impact factor: 9.079

5.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

Review 6.  The human gut microbiome, a taxonomic conundrum.

Authors:  Senthil Alias Sankar; Jean-Christophe Lagier; Pierre Pontarotti; Didier Raoult; Pierre-Edouard Fournier
Journal:  Syst Appl Microbiol       Date:  2015-03-25       Impact factor: 4.022

7.  Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes.

Authors:  Mincheol Kim; Hyun-Seok Oh; Sang-Cheol Park; Jongsik Chun
Journal:  Int J Syst Evol Microbiol       Date:  2014-02       Impact factor: 2.747

8.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

9.  DNAPlotter: circular and linear interactive genome visualization.

Authors:  Tim Carver; Nick Thomson; Alan Bleasby; Matthew Berriman; Julian Parkhill
Journal:  Bioinformatics       Date:  2008-11-05       Impact factor: 6.937

10.  Genome sequence and description of Anaerosalibacter massiliensis sp. nov.

Authors:  N Dione; S A Sankar; J-C Lagier; S Khelaifia; C Michele; N Armstrong; M Richez; J Abrahão; D Raoult; P-E Fournier
Journal:  New Microbes New Infect       Date:  2016-01-11
View more
  6 in total

1.  Identification of novel autoinducer-2 receptors in Clostridia reveals plasticity in the binding site of the LsrB receptor family.

Authors:  Inês M Torcato; Meghann R Kasal; Patrícia H Brito; Stephen T Miller; Karina B Xavier
Journal:  J Biol Chem       Date:  2019-01-29       Impact factor: 5.157

2.  Effect of fiber source and crude protein level on nursery pig performance and fecal microbial communities.

Authors:  Kelsey L Batson; Alison C Neujahr; Thomas Burkey; Samodha C Fernando; Mike D Tokach; Jason C Woodworth; Robert D Goodband; Joel M DeRouchey; Jordan T Gebhardt; Hilda I Calderón
Journal:  J Anim Sci       Date:  2021-12-01       Impact factor: 3.159

3.  Comparison of intestinal permeability, morphology, and ileal microbial communities of commercial hens housed in conventional cages and cage-free housing systems.

Authors:  Maddison L Wiersema; Lucas R Koester; Stephan Schmitz-Esser; Dawn A Koltes
Journal:  Poult Sci       Date:  2020-11-05       Impact factor: 3.352

4.  Temporal changes in the gut microbiota in farmed Atlantic cod (Gadus morhua) outweigh the response to diet supplementation with macroalgae.

Authors:  C Keating; M Bolton-Warberg; J Hinchcliffe; R Davies; S Whelan; A H L Wan; R D Fitzgerald; S J Davies; U Z Ijaz; C J Smith
Journal:  Anim Microbiome       Date:  2021-01-07

5.  Deciphering the Urinary Microbiota Repertoire by Culturomics Reveals Mostly Anaerobic Bacteria From the Gut.

Authors:  Grégory Dubourg; Aurélie Morand; Fatima Mekhalif; Raphael Godefroy; Alice Corthier; Abdourahamane Yacouba; Ami Diakite; Florent Cornu; Marina Cresci; Samy Brahimi; Aurélia Caputo; Eric Lechevallier; Michel Tsimaratos; Valérie Moal; Jean-Christophe Lagier; Didier Raoult
Journal:  Front Microbiol       Date:  2020-10-16       Impact factor: 5.640

6.  Culturing and Molecular Approaches for Identifying Microbiota Taxa Impacting Children's Obesogenic Phenotypes Related to Xenobiotic Dietary Exposure.

Authors:  Ana López-Moreno; Ángel Ruiz-Moreno; Jesús Pardo-Cacho; Klara Cerk; Alfonso Torres-Sánchez; Pilar Ortiz; Marina Úbeda; Margarita Aguilera
Journal:  Nutrients       Date:  2022-01-06       Impact factor: 5.717

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.