| Literature DB >> 27818774 |
M Hosny1, S Benamar1, G Durand1, N Armstrong1, C Michelle1, F Cadoret1, B La Scola1, N Cassir1.
Abstract
Clostridium phoceensis sp. nov., strain GD3T (= CSUR P1929 = DSM 100334) is the type strain of C. phoceensis sp. nov., a new species within the genus Clostridium. This strain was isolated from the gut microbiota of a 28-year-old healthy French man. C. phoceensis is a Gram-negative, spore-forming, nonmotile, strictly anaerobic bacterium. We describe its complete genome sequence and annotation, together with its phenotypic characteristics.Entities:
Keywords: Anaerobe; Clostridium phoceensis; culturomics; gut microbiota; taxono-genomics
Year: 2016 PMID: 27818774 PMCID: PMC5078575 DOI: 10.1016/j.nmni.2016.09.008
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Reference mass spectrum (matrix-assisted desorption ionization–time of flight mass spectrometry) from Clostridium phoceensis strain GD3T.
Fig. 2Gram staining of Clostridium phoceensis strain GD3T.
Fig. 3Electron microscopy of Clostridium phoceensis strain GD3T.
Classification, general features and biochemical tests of Clostridium phoceensis strain GD3T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: GD3 | |
| Gram stain | Negative |
| Cell shape | Bacillus |
| Cell diameter (μm) | 0.5 μm |
| Cell length | 1.8 μm |
| Motility | No |
| Sporulation | Yes |
| Temperature range | Mesophilic |
| Production of: | |
| Alkaline phosphatase | No |
| Catalase | No |
| Oxidase | No |
| Nitrate reductase | No |
| Urease | No |
| β-Galactosidase | No |
| | No |
| Esterase | Yes |
| Acid from: | |
| | No |
| Ribose | No |
| Mannose | No |
| Mannitol | No |
| Sucrose | No |
| | Yes |
| | No |
| | No |
| | No |
| Habitat | Human |
Cellular fatty acid composition (%) of Clostridium phoceensis strain GD3T
| Fatty acid | Name | Mean relative %a |
|---|---|---|
| 16:0 | Hexadecanoic acid | 33.1 ± 1.7 |
| 18:1n9 | 9-Octadecenoic acid | 24.3 ± 0.6 |
| 18:0 | Octadecanoic acid | 20.0 ± 0.1 |
| 18:2n6 | 9,12-Octadecadienoic acid | 8.6 ± 0.1 |
| 18:1n7 | 11-Octadecenoic acid | 5.5 ± 0.5 |
| 14:0 | Tetradecanoic acid | 4.7 ± 0.8 |
| 17:0 | Heptadecanoic acid | TR |
| 16:1n7 | 9-Hexadecenoic acid | TR |
| 15:0 | Pentadecanoic acid | TR |
| 17:0 iso | 15-methyl-Hexadecanoic acid | TR |
| 17:0 anteiso | 14-methyl-Hexadecanoic acid | TR |
| 17:1n7 | 10-Heptadecenoic acid | TR |
| 15:0 iso | 13-Methyl-tetradecanoic acid | TR |
| 15:0 anteiso | 12-Methyl-tetradecanoic acid | TR |
Fig. 4Phylogenetic tree highlighting position of Clostridium phoceensis GD3T relative to other type strains within Clostridiales order. GenBank accession numbers are indicated. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using maximum-likelihood method within MEGA 4 software [23]. Numbers at nodes are bootstrap values obtained by repeating analysis 500 times to generate majority consensus tree. Scale bar represents 5% nucleotide sequence divergence. Rubidus massiliensis was used as outgroup.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Genome size (bp) | 3 453 562 | 100 |
| DNA coding region (bp) | 2 924 785 | 84.68 |
| DNA G+C content (bp) | 2 044 366 | 59.31 |
| Total genes | 3320 | 100 |
| rRNA | 5 | 0.136 |
| tRNA | 51 | 0.116 |
| Protein-coding genes | 3264 | 98.31 |
| Genes with function prediction | 3111 | 17.02 |
| Genes assigned to COGs | 2534 | 57.39 |
| Pseudo genes | 62 | 1.86 |
| Genes in internal clusters | 1060 | 31.92 |
| Genes with Pfam domains | 2817 | 84 |
| Genes with signal peptides | 414 | 12.68 |
| Genes with transmembrane helices | 722 | 22.12 |
| ORFan genes | 227 | 6.95 |
| CRISPR repeats | 14 | 0.02 |
COGs, Clusters of Orthologous Groups database; CRISPR, clustered regularly interspaced short palindromic repeat.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Fig. 5Graphical circular map of Clostridium phoceensis GD3T genome. From outside in, outer two circles show ORFs oriented in forward (colored by COGs categories) and reverse (colored by COGs categories) directions, respectively. Third circle marks tRNA genes (green). Fourth circle shows G+C% content plot. Innermost circle shows GC skew, with purple indicating negative values and olive indicating positive values. COGs, Clusters of Orthologous Groups database; ORF, open reading frame.
Number of genes associated with 25 general COGs functional categories
| Code | Description | Value | % of total |
|---|---|---|---|
| J | Translation | 150 | 4.59 |
| A | RNA processing and modification | 0 | 0 |
| K | Transcription | 212 | 6.49 |
| L | Replication, recombination and repair | 106 | 3.24 |
| B | Chromatin structure and dynamics | 0 | 0 |
| D | Cell cycle control, mitosis and meiosis | 22 | 0.67 |
| Y | Nuclear structure | 0 | 0 |
| V | Defense mechanisms | 75 | 2.29 |
| T | Signal transduction mechanisms | 94 | 2.87 |
| M | Cell wall/membrane biogenesis | 54 | 1.65 |
| N | Cell motility | 27 | 0.82 |
| Z | Cytoskeleton | 0 | 0 |
| W | Extracellular structures | 0 | 0 |
| U | Intracellular trafficking and secretion | 25 | 0.76 |
| O | Posttranslational modification, protein turnover, chaperones | 51 | 1.56 |
| C | Energy production and conversion | 94 | 2.87 |
| G | Carbohydrate transport and metabolism | 106 | 3.24 |
| E | Amino acid transport and metabolism | 217 | 6.54 |
| F | Nucleotide transport and metabolism | 49 | 1.50 |
| H | Coenzyme transport and metabolism | 63 | 1.93 |
| I | Lipid transport and metabolism | 53 | 1.62 |
| P | Inorganic ion transport and metabolism | 94 | 2.87 |
| Q | Secondary metabolites biosynthesis, transport and catabolism | 29 | 0.88 |
| R | General function prediction only | 231 | 7.07 |
| S | Function unknown | 126 | 3.86 |
| — | Not in COGs | 1684 | 51.59 |
COGS, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Fig. 6Distribution of functional classes of predicted genes in genomes from Clostridium phoceensis (Cp), Intestinimonas butyriciproducens strain ER1 (Ib), Flavonifractor plautii ATCC 29863 (Fp), Clostridium leptum DSM 753 (Cl), Clostridium cellulosi DG5 (Cc), Ethanoligenens harbinense YUAN-3 (Eh), Oscillibacter valericigenes Sjm18-20 (Ov), Clostridium viride DSM 6836 (Cv), Eubacterium siraeum V10Sc8a (Es) and Pseudoflavonifractor capillosus ATCC 29799 (PC) genomes according to clusters of orthologous groups of proteins. ATCC, American Type Culture Collection (Manassas, VA, USA).
Orthologous genes shared (upper right) and AGIOS values obtained (lower left)a
| Cp | Ib | Fp | Cl | Cc | Eh | Ov | Cv | Es | Pc | |
|---|---|---|---|---|---|---|---|---|---|---|
| Cp | 846 | 884 | 492 | 562 | 482 | 717 | 634 | 407 | 842 | |
| Ib | 70.10 | 1079 | 508 | 627 | 501 | 771 | 671 | 407 | 1016 | |
| Fp | 71.58 | 73.64 | 523 | 649 | 518 | 818 | 686 | 433 | 1039 | |
| Cl | 61.83 | 61.60 | 61.65 | 587 | 495 | 475 | 432 | 405 | 523 | |
| Cc | 57.17 | 57.91 | 56.82 | 61.62 | 619 | 604 | 562 | 465 | 616 | |
| Eh | 56.64 | 56.86 | 56.92 | 56.13 | 55.34 | 515 | 449 | 415 | 499 | |
| Ov | 65.72 | 65.34 | 65.55 | 61.60 | 58.40 | 56.54 | 653 | 409 | 710 | |
| Cv | 63.84 | 64.57 | 64.21 | 60.85 | 59.13 | 55.02 | 62.58 | 364 | 614 | |
| Es | 57.83 | 58.77 | 57.72 | 60.78 | 61.38 | 54.37 | 59.13 | 58.93 | 427 | |
| Pc | 61.55 | 61.95 | 62.89 | 55.43 | 52.37 | 57.21 | 58.02 | 56.73 | 53.24 |
AGIOS, average genomic identity of orthologous gene sequences; ATCC, American Type Culture Collection (Manassas, VA, USA).
Values in bold are gene numbers. Ten genomes were used for this study: Clostridium phoceensis (Cp), Intestinimonas butyriciproducens strain ER1 (Ib), Flavonifractor plautii ATCC 29863 (Fp), Clostridium leptum DSM 753 (Cl), Clostridium cellulosi DG5 (Cc), Ethanoligenens harbinense YUAN-3 (Eh), Oscillibacter valericigenes Sjm18-20 (Ov), Clostridium viride DSM 6836 (Cv), Eubacterium siraeum V10Sc8a (Es) and Pseudoflavonifractor capillosus ATCC 29799 (PC) genomes according to the clusters of orthologous groups of proteins.