| Literature DB >> 29348922 |
M T Alou1, S Ndongo1, L Frégère1, N Labas1, C Andrieu1, M Richez1, C Couderc1, J-P Baudoin1, J Abrahão2, S Brah3, A Diallo1,4, C Sokhna1,4, N Cassir1, B La Scola1, F Cadoret1, D Raoult1,5.
Abstract
Culturomics investigates microbial diversity of the human microbiome by combining diversified culture conditions, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and 16S rRNA gene identification. The present study allowed identification of four putative new Clostridium sensu stricto species: 'Clostridium amazonitimonense' strain LF2T, 'Clostridium massilidielmoense' strain MT26T, 'Clostridium nigeriense' strain Marseille-P2414T and 'Clostridium merdae' strain Marseille-P2953T, which we describe using the concept of taxonogenomics. We describe the main characteristics of each bacterium and present their complete genome sequence and annotation.Entities:
Keywords: culturomics; emerging bacteria; human microbiota; taxonogenomics; ‘Clostridium amazonitimonense’; ‘Clostridium massilidielmoense’; ‘Clostridium merdae’; ‘Clostridium nigeriense’
Year: 2017 PMID: 29348922 PMCID: PMC5767839 DOI: 10.1016/j.nmni.2017.11.003
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Phylogenetic tree highlighting position of ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ relative to other type strains within Clostridium genus. Strains and their corresponding GenBank accession numbers for 16S rRNA genes are indicated in brackets. Sequences were aligned using Clustal W (http://www.clustal.org/clustal2/), and phylogenetic inferences were obtained using maximum-likelihood method within MEGA6 software (http://www.megasoftware.net/mega.php). Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Butyricicoccus pullicaecorum (NR_044490) was used as outgroup. Scale bar = 0.02% nucleotide sequence divergence.
Species information
| Characteristic | LF2 | MT26 | Marseille-P2953 | Marseille-P2414 |
|---|---|---|---|---|
| Patient information | Healthy 37-year-old Amazonian woman | Healthy 8-month-old Senegalese boy | Healthy 29-year-old Senegalese woman | Healthy 44-month-old girl from Niger |
| Isolation conditions | 30 days' anaerobic preincubation in blood culture bottle with rumen and sheep's blood | 30 days' anaerobic preincubation in blood culture bottle with sheep's blood | 7 days' anaerobic preincubation in blood culture bottle with rumen and sheep's blood | 15 days' aerobic preincubation in blood culture bottle with rumen and sheep's blood |
| Optimal growth temperature | 37°C | 37°C | 37°C | 37°C |
| Atmosphere | Strict anaerobe | Strict anaerobe | Strict anaerobe | Facultative anaerobe |
| Similarity percentage with closest species | 96.95% with | 98.44% with | 96.41% with | 98.23% with |
Fig. 2Reference mass spectra from ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense.’ Spectra from 12 individual colonies were compared and each reference spectrum generated (A). Gel view comparing ‘Clostridium amazonitimonense,’ ‘Clostridium merdae,’ ‘Clostridium massilidielmoense’ and ‘Clostridium nigeriense’ to other species within genus Clostridium. Gel view displays raw spectra of loaded spectrum files arranged in pseudo–gellike look. x-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peaks, with peak intensity in arbitrary units. Displayed species are indicated at left (B).
Differential characteristics of ‘Clostridium amazonitimonense,’ ‘Clostridium putrefaciens,’ ‘Clostridium merdae,’ ‘Clostridium sporosphaeroides,’ ‘Clostridium nigeriense,’ Clostridium tertium, ‘Clostridium massilidielmoense’ and Clostridium novyi[1], [2]
| Characteristic | ||||||||
|---|---|---|---|---|---|---|---|---|
| Property | ||||||||
| Cell shape | Bacilli | Bacilli | Bacilli | Bacilli | Bacilli | Bacilli | Bacilli | Bacilli |
| Cell size (μm) | 2–2.5 | NA | 1.2–1.5 | 0.5–0.6 | 0.8–1.5 | 0.5–1.4 | 0.54 | 0.6–1.4 |
| Gram stain | + | + | + | + | + | + | + | + |
| Oxygen tolerance | − | − | − | − | + | + | − | − |
| Salt tolerance | − | − | − | − | − | − | − | − |
| Motility | + | − | + | − | + | + | − | + |
| Endospore formation | + | + | + | + | + | + | + | + |
| Major cellular fatty acid | 15:0 iso | NA | 16:00 | NA | 16:00 | NA | 16:00 | NA |
| Production of: | ||||||||
| Catalase | − | NA | − | NA | − | NA | − | NA |
| Oxidase | − | NA | − | NA | + | NA | − | NA |
| Alkaline phosphatase | + | NA | − | NA | + | NA | + | NA |
| Urease | − | NA | − | NA | − | NA | − | NA |
| β-Galactosidase | − | NA | − | NA | − | NA | − | NA |
| | + | NA | − | NA | + | NA | + | NA |
| Acid from: | ||||||||
| | − | NA | − | NA | − | NA | − | NA |
| Ribose | + | NA | − | NA | + | NA | + | NA |
| Mannose | + | NA | − | NA | + | NA | + | NA |
| Mannitol | − | NA | − | NA | − | NA | − | NA |
| Sucrose | − | − | − | NA | + | NA | − | NA |
| | + | + | + | NA | + | NA | + | NA |
| | + | NA | − | NA | + | NA | + | NA |
| | − | − | + | NA | + | NA | + | NA |
| | − | − | − | NA | + | NA | − | NA |
| Habitat | Human gut | Hog muscle tissue | Human gut | Human gut | Human gut | Human gut | Human gut | Soil |
+, positive result; −, negative result; NA, data not available.
Fig. 3Electron micrographs of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D).
Cellular fatty acid composition (%)
| Fatty acid | Name | ||||
|---|---|---|---|---|---|
| 16:00 | Hexadecanoic acid | 38.9 ± 0.8 | 38.5 ± 2.1 | 55.0 ± 2.5 | 15.3 ± 2.3 |
| 14:00 | Tetradecanoic acid | ND | ND | 15.7 ± 1.7 | 12.0 ± 0.6 |
| 18:00 | Octadecanoic acid | 6.7 ± 1.1 | 3.2 ± 0.6 | 3.6 ± 0.9 | TR |
| 13:00 | Tridecanoic acid | ND | ND | 3.0 ± 0.6 | ND |
| 12:00 | Dodecanoic acid | 1.8 ± 0.1 | TR | 1.5 ± 1.0 | TR |
| 04:00 | Butanoic acid | ND | ND | ND | 1.7 ± 1.0 |
| 15:00 | Pentadecanoic acid | TR | ND | 2.9 ± 0.6 | ND |
| 18:1n9 | 9-Octadecenoic acid | 20.1 ± 0.4 | 35.4 ± 0.4 | 14.4 ± 1.0 | 2.3 ± 0.6 |
| 13:0 iso | 11-methyl-Dodecanoic acid | ND | ND | ND | 22.7 ± 1.8 |
| 5:0 iso | 3-methyl-Butanoic acid | ND | ND | ND | 11.2 ± 2.1 |
| 18:1n7 | 11-Octadecenoic acid | 7.9 ± 0.2 | ND | ND | TR |
| 14:01 | Tetradecanoic acid | 6.6 ± 0.3 | 5.3 ± 1.4 | ND | ND |
| 18:2n6 | 9.12-Octadecadienoic acid | 5.5 ± 0.4 | 15.2 ± 0.6 | 3.9 ± 1.3 | TR |
| 16:1n7 | 9-Hexadecenoic acid | 5.3 ± 0.2 | TR | ND | ND |
| 16:1n9 | 7-Hexadecenoic acid | 3.9 ± 0.2 | ND | ND | ND |
| 13:0 anteiso | 10-methyl-Dodecanoic acid | ND | ND | ND | 2.3 ± 0.3 |
| 15:0 anteiso | 12-methyl-Tetradecanoic acid | TR | TR | ND | 4.8 ± 0.5 |
| 15:0 iso | 13-methyl-Tetradecanoic acid | TR | TR | ND | 22.0 ± 1.5 |
ND, not detected; TR, traces.
Nucleotide content and gene count levels of genomes
| Attribute | ||||||||
|---|---|---|---|---|---|---|---|---|
| No. of reads | 5 235 372 | 1 841 718 | 1 832 750 | 3 097 182 | ||||
| Scaffolds/contigs | 4/15 | 4/4 | 4/19 | 9/9 | ||||
| Value | % of total | Value | % of total | Value | % of total | Value | % of total | |
| Size (bp) | 3 761 417 | 100 | 2 748 860 | 100 | 3 799 474 | 100 | 2 759 335 | 100 |
| G+C content (%) | 1 145 890 | 30.55 | 1 246 258 | 45.4 | 1 064 556 | 28.2 | 782 307 | 28.3 |
| Coding region (bp) | 3 129 293 | 83.2 | 2 373 508 | 86.3 | 3 322 705 | 87.4 | 2 453 243 | 88.9 |
| Total genes | 3553 | 100 | 2699 | 100 | 3659 | 100 | 2772 | 100 |
| RNA genes | 147 | 4.1 | 59 | 2.2 | 187 | 5.1 | 91 | 3.3 |
| Protein-coding genes | 3406 | 95.9 | 264 | 97.8 | 3472 | 94.9 | 2681 | 96.7 |
| Genes with function prediction | 2477 | 72.7 | 1966 | 74.4 | 245 | 70.5 | 1989 | 74.2 |
| Genes assigned to COGs | 2241 | 65.8 | 167 | 63.2 | 2149 | 61.9 | 1639 | 61.1 |
| Genes with peptide signals | 270 | 7.9 | 271 | 10.2 | 307 | 8.8 | 228 | 2.8 |
| ORFans genes | 189 | 5.5 | 117 | 4.4 | 119 | 3.4 | 40 | 1.5 |
| Genes with transmembrane helices | 849 | 24.9 | 569 | 21.5 | 935 | 26.9 | 639 | 23.8 |
| Genes associated with PKS or NRPS | 3 | 0.08 | 0 | 0 | 5 | 0.1 | 3 | 0.1 |
| No. of antibiotic resistance genes | 1 | 0.03 | 0 | 0 | 0 | 0 | 0 | 0 |
| No. of genes associated with Pfam-A domains | NA | NA | 2386 | 88 | 3182 | 86 | 246 | 88 |
COGs, Clusters of Orthologous Groups database; NRPS, nonribosomal peptide synthase; ORF, open reading frame; PKS, polyketide synthase.
The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.
Fig. 4Graphical circular map of genome of ‘Clostridium nigeriense’ (A), ‘Clostridium amazonitimonense’ (B), ‘Clostridium massilidielmoense’ (C) and ‘Clostridium merdae’ (D). From outside to center: Contigs (red/grey), COGs category of genes on forward strand (three circles), genes on forward strand (blue circle), genes on reverse strand (red circle), COGs category on reverse strand (three circles), G+C content. COGs, Clusters of Orthologous Groups database.
Number of genes associated with 25 general COGs functional categories
| Code | Description | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Value | % of total | Value | % of total | Value | % of total | Value | % of total | ||
| [J] | Translation | 189 | 7.16 | 217 | 6.25 | 192 | 7.16 | 181 | 5.31 |
| [A] | RNA processing and modification | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| [K] | Transcription | 171 | 6.48 | 191 | 5.50 | 125 | 4.66 | 238 | 6.99 |
| [L] | Replication, recombination and repair | 77 | 2.91 | 85 | 2.45 | 87 | 3.24 | 135 | 3.96 |
| [B] | Chromatin structure and dynamics | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.03 |
| [D] | Cell cycle control, mitosis and meiosis | 31 | 1.17 | 40 | 1.15 | 39 | 1.45 | 24 | 0.70 |
| [Y] | Nuclear structure | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| [V] | Defense mechanisms | 53 | 2.01 | 75 | 2.16 | 58 | 2.16 | 105 | 3.08 |
| [T] | Signal transduction mechanisms | 97 | 3.67 | 117 | 3.37 | 94 | 3.50 | 135 | 3.96 |
| [M] | Cell wall/membrane biogenesis | 75 | 2.84 | 129 | 3.71 | 106 | 3.95 | 104 | 3.05 |
| [N] | Cell motility | 43 | 1.63 | 53 | 1.52 | 47 | 1.75 | 61 | 1.79 |
| [Z] | Cytoskeleton | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| [W] | Extracellular structures | 7 | 0.26 | 6 | 0.17 | 7 | 0.26 | 0 | 0 |
| [U] | Intracellular trafficking and secretion | 20 | 0.76 | 26 | 0.75 | 25 | 0.93 | 34 | 1 |
| [O] | Posttanslational modification, protein turnover, chaperones | 61 | 2.31 | 79 | 2.27 | 75 | 2.8 | 74 | 2.17 |
| [X] | Mobilome: prophages, transposons | 15 | 0.57 | 8 | 0.23 | 36 | 1.34 | NA | NA |
| [C] | Energy production and conversion | 106 | 4.01 | 139 | 4.00 | 93 | 3.47 | 137 | 4.02 |
| [G] | Carbohydrate transport and metabolism | 142 | 5.38 | 271 | 7.80 | 125 | 4.66 | 165 | 4.84 |
| [E] | Amino acid transport and metabolism | 212 | 8.03 | 195 | 5.61 | 150 | 5.59 | 219 | 6.43 |
| [F] | Nucleotide transport and metabolism | 58 | 2.19 | 85 | 2.45 | 68 | 2.53 | 72 | 2.11 |
| [H] | Coenzyme transport and metabolism | 90 | 3.41 | 104 | 2.99 | 111 | 4.14 | 88 | 2.58 |
| [I] | Lipid transport and metabolism | 69 | 2.61 | 84 | 2.42 | 64 | 2.39 | 62 | 1.82 |
| [P] | Inorganic ion transport and metabolism | 87 | 3.29 | 133 | 3.83 | 87 | 3.24 | 153 | 4.49 |
| [Q] | Secondary metabolite biosynthesis, transport and catabolism | 28 | 1.06 | 30 | 0.86 | 13 | 0.48 | 35 | 1.02 |
| [R] | General function prediction only | 158 | 5.98 | 202 | 5.82 | 151 | 5.63 | 320 | 9.39 |
| [S] | Function unknown | 86 | 3.26 | 133 | 3.83 | 91 | 3.39 | 190 | 5.58 |
| — | Not in COGs | 970 | 36.74 | 1.323 | 38.10 | 1.042 | 38.87 | 1.165 | 34.20 |
COGs, Clusters of Orthologous Groups database; NA, not applicable.
The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.
Genome comparison of closely related species
| Organism | INSDC | Size (Mb) | G+C (%) | Total genes |
|---|---|---|---|---|
| CP000721 | 6.48 | 30 | 5.425 | |
| CP003065 | 4.90 | 35.72 | 3.892 | |
| CP002160 | 56 | 311 | 4.524 | |
| CP000246 | 35 | 28.38 | 2.876 | |
| FXLO00000000 | 2.76 | 28.35 | 2.681 | |
| LZFO00000000 | 2.42 | 26.75 | 2.238 | |
| AE001437 | 43 | 30.93 | 3.848 | |
| AYSO00000000 | 4.74 | 28.6 | 4.279 | |
| CP011803 | 5.75 | 29.93 | 4.973 | |
| JIBU00000000 | 5.58 | 29 | 5.029 | |
| JENX00000000 | 2.61 | 27.43 | 2.201 | |
| LHUR00000000 | 3.65 | 31.09 | 3.5 | |
| JENM00000000 | 2.46 | 28.58 | 1.833 | |
| JRGJ00000000 | 2.79 | 28.57 | 2.516 | |
| FLKB00000000 | 3.79 | 28.7 | 3.472 | |
| JZTY00000000 | 37 | 27.95 | 2.839 | |
| AQQG00000000 | 5.84 | 29.74 | 5.196 | |
| AUJC00000000 | 3.56 | 29.64 | 3.491 | |
| ASRV00000000 | 3.98 | 27.91 | 3.714 | |
| FXLN00000000 | 2.75 | 45.34 | 2.64 | |
| ABCB00000000 | 2.82 | 505 | 2.482 | |
| CP000568 | 3.84 | 38.99 | 3.173 | |
| ARNC00000000 | 2.62 | 44.85 | 2.308 | |
| AGCK00000000 | 3.82 | 61.07 | 4.278 | |
| CP011307 | 3.37 | 59.06 | 3.381 | |
| AAXG00000000 | 44 | 591 | 4.829 | |
| CP002403 | 4.33 | 43.62 | 3.991 | |
| CCNN00000000 | 3.76 | 30.56 | 3.046 |
INSDC, International Nucleotide Sequence Database Collaboration.
The total is based on either the size of the genome in base pairs or the total number of protein-coding genes in the annotated genome.
Pairwise comparison of four studied species with nine other species using GGDC
| 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 19.90 ± 2.2 | 18.10 ± 2.2 | 19.00 ± 2.2 | 19.40 ± 2.2 | 18.40 ± 2.1 | 22.60 ± 2.2 | 21.00 ± 2.2 | 20.30 ± 2.2 | 24.40 ± 2.3 | 39.90 ± 2.5 | 23.70 ± 2.3 | 23.20 ± 2.3 | 24.80 ± 2.3 |
| 2 | 100.00 ± 0.0 | 19.40 ± 2.2 | 24.80 ± 2.3 | 39.80 ± 2.5 | 19.70 ± 2.2 | 21.30 ± 2.3 | 19.50 ± 2.2 | 17.40 ± 2.1 | 26.70 ± 2.3 | 26.40 ± 2.4 | 26.00 ± 2.3 | 19.60 ± 2.2 | 25.70 ± 2.3 |
| 3 | 100.00 ± 0.0 | 19.00 ± 2.2 | 18.10 ± 2.1 | 18.50 ± 2.1 | 21.30 ± 2.2 | 20.70 ± 2.2 | 20.00 ± 2.2 | 24.30 ± 2.3 | 26.70 ± 2.4 | 28.70 ± 2.3 | 20.10 ± 2.2 | 26.60 ± 2.4 | |
| 4 | 100.00 ± 0.0 | 24.70 ± 2.3 | 19.50 ± 2.2 | 21.30 ± 2.2 | 19.10 ± 2.2 | 18.30 ± 2.1 | 28.40 ± 2.4 | 26.10 ± 2.3 | 28.30 ± 2.3 | 20.00 ± 2.2 | 23.70 ± 2.3 | ||
| 5 | 100.00 ± 0.0 | 18.30 ± 2.1 | 21.30 ± 2.3 | 19.40 ± 2.1 | 18.40 ± 2.2 | 23.30 ± 2.3 | 25.70 ± 2.3 | 26.70 ± 2.3 | 20.50 ± 2.2 | 26.80 ± 2.3 | |||
| 6 | 100.00 ± 0.0 | 22.10 ± 2.2 | 19.40 ± 2.1 | 18.00 ± 2.1 | 22.90 ± 2.3 | 25.80 ± 2.3 | 28.40 ± 2.4 | 21.20 ± 2.2 | 24.10 ± 2.3 | ||||
| 7 | 100.00 ± 0.0 | 22.20 ± 2.3 | 23.60 ± 2.3 | 23.00 ± 2.3 | 24.70 ± 2.3 | 26.00 ± 2.3 | 24.90 ± 2.3 | 24.30 ± 2.4 | |||||
| 8 | 100.00 ± 0.0 | 20.20 ± 2.2 | 25.30 ± 2.3 | 25.00 ± 2.3 | 28.20 ± 2.4 | 25.10 ± 2.3 | 25.10 ± 2.4 | ||||||
| 9 | 100.00 ± 0.0 | 25.10 ± 2.3 | 24.70 ± 2.3 | 28.10 ± 2.4 | 23.40 ± 2.3 | 25.40 ± 2.3 | |||||||
| 10 | 100.00 ± 0.0 | 22.50 ± 2.2 | 29.10 ± 2.3 | 26.80 ± 2.4 | 29.20 ± 2.4 | ||||||||
| 11 | 100.00 ± 0.0 | 25.10 ± 2.3 | 23.60 ± 2.3 | 29.70 ± 2.4 | |||||||||
| 12 | 100.00 ± 0.0 | 27.30 ± 2.4 | 25.70 ± 2.3 | ||||||||||
| 13 | 100.00 ± 0.0 | 23.80 ± 2.3 | |||||||||||
| 14 | 100.00 ± 0.0 |
DDH, DNA-DNA hybridization; GGDC, genome-to-genome distance; HSP, high-scoring pair.
Data derived from formula 2 (DDH estimates based on identities/HSP length). Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1) and phylogenetic analyses as well as GGDC results.
1, Clostridium cellulovorans; 2, Clostridium massilidielmoense; 3, Clostridium acetireducens; 4, Clostridium haemolyticum; 5, Clostridium novyi; 6, Clostridium tetani; 7, Clostridium nigeriense; 8, Clostridium diolis; 9, Clostridium sartagoforme; 10, Clostridium merdae; 11, Clostridium leptum; 12, Clostridium thermocellum; 13, Clostridium amazonitimonense; 14, Ruminococcus albus.