| Literature DB >> 27354918 |
M Tidjani Alou1, T-T Nguyen1, N Armstrong1, J Rathored1, S Khelaifia1, D Raoult2, P-E Fournier1, J-C Lagier1.
Abstract
Numidum massiliense gen. nov., sp. nov., strain mt3(T) is the type strain of Numidum gen. nov., a new genus within the family Bacillaceae. This strain was isolated from the faecal flora of a Tuareg boy from Algeria. We describe this Gram-positive facultative anaerobic rod and provide its complete annotated genome sequence according to the taxonogenomics concept. Its genome is 3 755 739 bp long and contains 3453 protein-coding genes and 64 RNA genes, including eight rRNA genes.Entities:
Keywords: Bacillaceae; Culturomics; Numidum massiliense genome; gut microbiota; taxonogenomics
Year: 2016 PMID: 27354918 PMCID: PMC4910145 DOI: 10.1016/j.nmni.2016.05.009
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Classification and general features of Numidum massiliense strain mt3T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: mt3 | |
| Gram stain | Positive |
| Cell shape | Rod |
| Motility | Nonmotile |
| Sporulation | Sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 1Phylogenetic tree highlighting position of Numidum massiliense gen. nov., sp. nov. strain mt3T (= CSUR P1305 = DSM 29571) relative to other strains within family Bacillaceae. Scale bar represents 1% nucleotide sequence divergence.
Fig. 2Reference mass spectrum from Numidum massiliense strain mt3T. Spectra from 12 individual colonies were compared and reference spectrum was generated.
Fig. 3Gel view comparing Numidum massiliense strain mt3T (= CSUR P1305 = DSM 29571) to other species within Bacillaceae family. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. X-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peak is displayed with and peak intensity in arbitrary units. Displayed species are indicated on left.
Fig. 4Gram staining of Numidum massiliense strain mt3T.
Fig. 5Transmission electron microscopy of Numidum massiliense strain mt3T, using Morgani 268D (Philips, Amsterdam, The Netherlands) at operating voltage of 60 kV. Scale bar represents 1 μm.
Cellular fatty acid composition (%)
| Fatty acid | IUPAC name | Mean relative % |
|---|---|---|
| 15:0 iso | 13-methyl-tetradecanoic acid | 87.6 ± 1.6 |
| 15:0 anteiso | 12-methyl-tetradecanoic acid | 5.5 ± 0.4 |
| 17:0 iso | 15-methyl-Hexadecanoic acid | 3.0 ± 0.9 |
| 16:0 | Hexadecanoic acid | 0.9 ± 0.1 |
| 16:0 iso | 14-methyl-Pentadecanoic acid | 0.6 ± 0.1 |
| 18:1n9 | 9-Octadecenoic acid | 0.6 ± 0.2 |
| 13:0 anteiso | 10-methyl-Dodecanoic acid | TR |
| 5:0 iso | 3-methyl-butanoic acid | TR |
| 17:0 anteiso | 14-methyl-Hexadecanoic acid | TR |
| 18:0 | Octadecanoic acid | TR |
| 18:2n6 | 9,12-Octadecadienoic acid | TR |
| 14:0 iso | 12-methyl-Tridecanoic acid | TR |
| 14:0 | Tetradecanoic acid | TR |
IUPAC, International Union of Pure and Applied Chemistry; TR, trace amounts (<1%).
Mean peak area percentage ± standard deviation.
Differential characteristics of Numidum massiliense strain mt3T, Bacillus mannanilyticus strain DSM 16130, Virgibacillus pantothenticus strain ATCC 14576, Virgibacillus dokdonensis DSM 16826, Ornithinibacillus contaminans DSM 22953, Bacillus polygoni strain NCIMB 14282T, Bacillus agaradhaerens strain DSM 8721, Paucisalibacillus globulus strain LMG 23148T, Bacillus barbaricus strain DSM 14730T and Virgibacillus koreensis strain JCM 12387T[33], [34], [35], [36], [37], [38], [39], [40], [41]
| Property | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 0.5–0.6 | 0.6–0.8 | 0.5–0.7 | 0.6–0.8 | 0.8–1 | 0.4–0.5 | 0.5–0.6 | 0.5 | 0.5 | 0.5–0.7 |
| Oxygen requirement | + | + | + | − | + | + | + | + | + | − |
| Gram stain | + | +/− | + | +/− | + | + | NA | + | + | + |
| Salt requirement | − | − | + | − | − | + | + | − | − | + |
| Motility | − | + | + | + | − | − | NA | − | − | + |
| Endospore formation | + | + | + | + | + | + | + | + | + | + |
| Indole | − | − | − | − | NA | − | NA | NA | − | − |
| Production of: | ||||||||||
| Alkaline phosphatase | + | NA | NA | − | NA | NA | NA | NA | NA | NA |
| Catalase | + | + | + | + | + | + | − | + | + | + |
| Oxidase | − | − | NA | + | + | − | NA | − | − | + |
| Nitrate reductase | + | − | +/− | − | NA | + | + | − | − | − |
| Urease | − | NA | NA | − | NA | NA | − | − | − | − |
| β-Galactosidase | + | NA | NA | − | NA | NA | NA | NA | NA | + |
| N-acetyl-glucosamine | + | NA | + | − | NA | NA | + | + | + | + |
| Acid from: | ||||||||||
| − | − | − | − | NA | NA | + | − | − | + | |
| Ribose | + | NA | + | + | − | − | NA | − | + | NA |
| Mannose | + | + | + | + | − | + | + | + | + | + |
| Mannitol | + | + | − | − | w | + | + | + | − | + |
| Sucrose | − | + | +/− | + | NA | + | + | + | − | NA |
| + | + | + | + | + | + | + | + | + | − | |
| + | + | + | + | NA | + | + | + | + | + | |
| + | + | + | + | NA | + | + | + | + | + | |
| + | + | +/− | + | NA | − | NA | + | +/− | NA | |
| Habitat | Human gut | Industry | Soil | Seawater | Blood | Indigo balls | Industry | Soil | Paint | Salt |
+, positive result; −, negative result; w, weakly positive result; NA, data not available.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 3 757 266 | 100 |
| G+C content (bp) | 1 955 657 | 52.05 |
| Coding region (bp) | 3 1815 69 | 84.67 |
| Total genes | 3513 | 100 |
| RNA genes | 65 | 1.85 |
| Protein-coding genes | 3448 | 98.14 |
| Genes with function prediction | 2570 | 73.15 |
| Genes assigned to COGs | 2314 | 65.86 |
| Genes with peptide signals | 229 | 6.51 |
| Genes with transmembrane helices | 977 | 27.81 |
COGs, Clusters of Orthologous Groups database.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Fig. 6Graphical circular map of chromosome. From outside to center: Genes on forward strain coloured by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), GC content and GC skew.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 150 | 4.35 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 247 | 7.16 | Transcription |
| L | 169 | 4.90 | Replication, recombination and repair |
| B | 1 | 0.03 | Chromatin structure and dynamics |
| D | 30 | 0.87 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 85 | 2.47 | Defense mechanisms |
| T | 123 | 3.57 | Signal transduction mechanisms |
| M | 143 | 4.15 | Cell wall/membrane biogenesis |
| N | 8 | 0.23 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 34 | 0.99 | Intracellular trafficking and secretion |
| O | 93 | 2.70 | Posttranslational modification, protein turnover, chaperones |
| C | 156 | 4.52 | Energy production and conversion |
| G | 234 | 6.79 | Carbohydrate transport and metabolism |
| E | 278 | 8.06 | Amino acid transport and metabolism |
| F | 63 | 1.83 | Nucleotide transport and metabolism |
| H | 95 | 2.76 | Coenzyme transport and metabolism |
| I | 117 | 3.39 | Lipid transport and metabolism |
| P | 163 | 4.73 | Inorganic ion transport and metabolism |
| Q | 75 | 2.18 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 370 | 10.73 | General function prediction only |
| S | 269 | 4.80 | Function unknown |
| — | 2903 | 84.19 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Genome comparison of closely related species to Numidum massiliense strain mt3T
| Organism | INSDC | Size (Mb) | G+C (%) | Total genes |
|---|---|---|---|---|
| 3.76 | 52.05 | 3513 | ||
| 5.29 | 39.7 | 5050 | ||
| 4.53 | 39.6 | 4369 | ||
| 4.24 | 35.8 | 4127 | ||
| JXIQ00000000.1 | 3.9 | 42.1 | 3772 | |
| 3.59 | 48.7 | 3368 | ||
| 3.32 | 48.9 | 3256 |
INSDC, International Nucleotide Sequence Database Collaboration.
Fig. 7Distribution of functional classes of predicted genes according to clusters of orthologous groups of proteins.
Numbers of orthologous protein shared between genomes (upper right)a
| 1162 | 1028 | 1191 | 1294 | 1121 | 456 | ||
| 53.11 | 1194 | 1369 | 1471 | 1174 | 511 | ||
| 55.11 | 54.58 | 1301 | 1318 | 972 | 461 | ||
| 54.81 | 56.4 | 58.15 | 1632 | 1141 | 558 | ||
| 54.39 | 56.91 | 57.56 | 66.1 | 1244 | 656 | ||
| 57.96 | 54.52 | 55.66 | 55.94 | 55.59 | 412 | ||
| 52.26 | 55.47 | 54.19 | 58.18 | 58.89 | 52.72 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Bacillus niameyensis with eight other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a
| 100% ± 00 | 2.52% ± 0.13 | 2.53% ± 0.15 | 2.52% ± 0.14 | 2.52% ± 0.13 | 2.52% ± 0.13 | 2.52% ± 0.16 | 2.52% ± 0.21 | |
| 100% ± 00 | 2.53% ± 0.10 | 2.52% ± 0.18 | 2.52% ± 0.13 | 2.52% ± 0.12 | 2.52% ± 0.19 | 2.52% ± 0.20 | ||
| 100% ± 00 | 2.53% ± 0.13 | 2.52% ± 0.12 | 2.53% ± 0.16 | 2.52% ± 0.11 | 2.52% ± 0.21 | |||
| 100% ± 00 | 2.55% ± 0.23 | 2.52% ± 0.18 | 2.52% ± 0.21 | 2.52% ± 0.21 | ||||
| 100% ± 00 | 2.52% ± 0.17 | 2.52% ± 0.23 | 2.52% ± 0.20 | |||||
| 100% ± 00 | 2.52% ± 0.06 | 2.52% ± 0.21 | ||||||
| 100% ± 00 | 2.52% ± 00 | |||||||
| 100% ± 00 |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1) and phylogenomic analyses as well as GGDC results.