| Literature DB >> 28224040 |
C I Lo1, M Mailhe2, D Ricaboni3, V Vitton4, A Benezech4, C Michelle2, N Armstrong2, F Bittar2, P-E Fournier2, D Raoult2, J-C Lagier2.
Abstract
Massilioclostridium coli strain Marseille-P2976T (= CSUR P2976 = DSM 103344) is a new bacterial genus isolated from the left colon of a patient who underwent colonoscopy for colorectal cancer screening. Massilioclostridium coli is a Gram-negative bacillus, strict anaerobic, nonsporogenous and nonmotile organism. We describe here the strain Marseille-P2976T and provide its complete annotated genome sequence according to taxonogenomics concepts. Its genome is 2 985 330 bp long and contains 2562 predicted genes and 75 RNA genes.Entities:
Keywords: Colon; Massilioclostridium coli; culturomics; microbiota; taxonogenomics
Year: 2017 PMID: 28224040 PMCID: PMC5310180 DOI: 10.1016/j.nmni.2017.01.004
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Classification and general features of Massilioclostridium coli strain Marseille-P2976T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: Marseille-P2976T | |
| Gram stain | Negative |
| Cell shape | Rod |
| Motility | Nonmotile |
| Sporulation | Non–spore forming |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 1Phylogenetic trees highlighting position of Massilioclostridium coli strain Marseille-P2976T (= CSUR P2976 = DSM 103344) relative to other strains within the genus Clostridium. Sequences of 16S rRNA gene were aligned by CLUSTALW. Scale bar represents 2% nucleotide sequence divergence.
Fig. 2Reference mass spectrum from Massilioclostridium coli strain Marseille-P2976T. Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 3Gel view comparing Massilioclostridium coli strain Marseille-P2976T (= CSUR P2976 = DSM 103344) to other species within Clostridiaceae family. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. X-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peak displayed and peak intensity in arbitrary units.
Fig. 4Gram staining of Massilioclostridium coli strain Marseille-P2976T.
Fig. 5Transmission electron microscopy of Massilioclostridium coli strain Marseille-P2976T using Morgani 268D (Philips, Amsterdam, The Netherlands) at operating voltage of 60 kV. Scale bar = 1 μm.
Cellular fatty acid composition (%)
| Fatty acid | Name | Mean relative % |
|---|---|---|
| 16:0 iso | 14-methyl-Pentadecanoic acid | 33.2 ± 1.3 |
| 17:0 anteiso | 14-methyl-Hexadecanoic acid | 20.4 ± 0.1 |
| 16:0 | Hexadecanoic acid | 15.3 ± 0.6 |
| 15:0 anteiso | 12-methyl-Tetradecanoic acid | 13.4 ± 0.4 |
| 15:0 iso | 13-methyl-Tetradecanoic acid | 5.0 ± 0.1 |
| 17:0 iso | 15-methyl-Hexadecanoic acid | 2.0 ± 0.2 |
| 18:1n9 | 9-Octadecenoic acid | 1.1 ± 0.1 |
| 18:2n6 | 9,12-Octadecadienoic acid | TR |
| 14:0 | Tetradecanoic acid | TR |
| 17:0 | Heptadecanoic acid | TR |
| 15:0 | Pentadecanoic acid | TR |
TR, trace amounts <1%.
Mean peak area percentage.
Differential characteristics of Massilioclostridium coli strain Marseille-P2976T, Clostridium thermocellum strain JW20T, Clostridium difficile strain 630, Clostridium dakarense strain FF1T, Clostridium beijerinckii strain NCIMB 8052T[4], [35], [36], [37]
| Property | |||||
|---|---|---|---|---|---|
| Cell diameter (μm) | 2.5 | NA | 2.5 | 1.2 | 1.5 |
| Oxygen requirement | − | − | − | − | − |
| Gram stain | − | + | − | + | + |
| Salt requirement | − | − | − | − | − |
| Motility | − | − | − | + | + |
| Endospore formation | − | + | + | + | + |
| Production of: | |||||
| Acid phosphatase | + | − | NA | + | + |
| Catalase | − | − | − | − | − |
| Oxidase | − | − | NA | − | − |
| Indole | − | − | NA | − | NA |
| Urease | − | − | NA | − | − |
| β-Galactosidase | − | − | NA | − | + |
| | − | − | NA | + | NA |
| Ribose | − | NA | NA | − | NA |
| Pyrazinamidase | − | NA | NA | NA | NA |
| Pyrrolidinyl arylamidase | − | NA | NA | NA | NA |
| Mannose | − | − | NA | − | + |
| Mannitol | − | − | + | − | + |
| Sucrose | − | − | + | − | + |
| | − | − | + | + | + |
| | − | − | + | − | + |
| | − | − | + | + | + |
| | − | − | + | − | + |
| Habitat | Human | Louisiana cotton bale | Poplar wood | Human gut | Human gut |
+, positive result; −, negative result; NA, data not available.
Nucleotide content and gene count levels of genome
| Attribute | Genome (Total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 2 985 330 | 100.0 |
| G + C content (bp) | 1 193 310 | 39.9 |
| Coding region (bp) | 2 541 428 | 85.1 |
| Total of genes | 2562 | 100.0 |
| RNA genes | 75 | 2.9 |
| Protein-coding genes | 2487 | 100.0 |
| Protein with function prediction | 1599 | 64.2 |
| Protein assigned to COGs | 1369 | 55.0 |
| Genes with peptid signals | 299 | 12.0 |
| Genes with transmembrane helices | 644 | 25.8 |
COGs, Clusters of Orthologous Groups database.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Fig. 6Graphical circular map of chromosome. From outside to center: genes on forward strain coloured by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), GC content and GC skew. COGs, Clusters of Orthologous Groups database.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 182 | 7.31 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 134 | 5.38 | Transcription |
| L | 78 | 3.13 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 33 | 1.32 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 56 | 2.25 | Defense mechanisms |
| T | 67 | 2.69 | Signal transduction mechanisms |
| M | 85 | 3.41 | Cell wall/membrane biogenesis |
| N | 6 | 0.24 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 25 | 1.00 | Intracellular trafficking and secretion |
| O | 62 | 2.49 | Posttranslational modification, protein turnover, chaperones |
| X | 23 | 0.92 | Mobilome: prophages, transposons |
| C | 84 | 3.37 | Energy production and conversion |
| G | 83 | 3.33 | Carbohydrate transport and metabolism |
| E | 132 | 5.30 | Amino acid transport and metabolism |
| F | 55 | 2.21 | Nucleotide transport and metabolism |
| H | 67 | 2.69 | Coenzyme transport and metabolism |
| I | 42 | 1.68 | Lipid transport and metabolism |
| P | 63 | 2.53 | Inorganic ion transport and metabolism |
| Q | 23 | 0.92 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 123 | 4.94 | General function prediction only |
| S | 86 | 3.45 | Function unknown |
| — | 1118 | 44.9 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Genome comparison of closely related species to Massilioclostridium coli strain Marseille-P2976T
| Species (strain) | Size (Mb) | G + C (%) | Total genes |
|---|---|---|---|
| 3.82 | 61.0 | 4278 | |
| 4.24 | 59.1 | 4829 | |
| 4.47 | 53.1 | 4723 | |
| 4.33 | 43.6 | 3991 | |
| 3.10 | 54.9 | 2929 | |
| 3.37 | 59.0 | 3381 | |
| 2.98 | 39.9 | 2487 | |
| 3.01 | 55.5 | 2701 |
Fig. 7Distribution of functional classes of predicted genes according to COGs proteins. COGs, Clusters of Orthologous Groups database.
Numbers of orthologous protein shared between genomes (upper right)a
| 778 | 741 | 759 | 751 | 700 | 736 | 731 | ||
| 56.08 | 1278 | 1108 | 1273 | 745 | 1077 | 829 | ||
| 56.06 | 63.35 | 1033 | 1234 | 733 | 1012 | 791 | ||
| 56.75 | 59.35 | 57.74 | 988 | 699 | 1228 | 754 | ||
| 57.62 | 64.32 | 63.74 | 59.87 | 765 | 979 | 804 | ||
| 54.56 | 52.44 | 53.61 | 54.53 | 54.01 | 690 | 727 | ||
| 56.60 | 58.14 | 57.85 | 65.90 | 58.53 | 59.75 | 750 | ||
| 53.40 | 49.96 | 51.11 | 52.55 | 51.23 | 61.76 | 58.23 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).