| Literature DB >> 27076913 |
M Tidjani Alou1, J Rathored1, S I Traore1, S Khelaifia1, C Michelle1, S Brah2, B A Diallo3, D Raoult4, J-C Lagier1.
Abstract
Bacillus niameyensis sp. nov. strain SIT3(T) (= CSUR P1266 = DSM 29725) is the type strain of B. niameyensis sp. nov. This Gram-positive strain was isolated from the digestive flora of a child with kwashiorkor and is a facultative anaerobic rod and a member of the Bacillaceae family. This organism is hereby described alongside its complete genome sequence and annotation. The 4 286 116 bp long genome (one chromosome but no plasmid) contains 4130 protein-coding and 66 RNA genes including five rRNA genes.Entities:
Keywords: Bacillus niameyensis; culturomics; genome; malnutrition; taxonogenomics
Year: 2015 PMID: 27076913 PMCID: PMC4815930 DOI: 10.1016/j.nmni.2015.09.011
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Phylogenetic tree highlighting position of Bacillus niameyensis sp. nov. strain SIT3 (= CSUR P1266 = DSM 29725) relative to other type strains within Bacillus genus. Strains and their corresponding GenBank accession numbers for 16S rRNA genes are Bacillus amyloliquefaciensAB006920, Bacillus bataviensisAJ542508, Bacillus drentensisAJ542506, Bacillus endophyticusvAF295302, Bacillus galactosidilyticusAJ535638, Bacillus idriensisAY904033, Bacillus mojavensisAB021191, Bacillus infantisAY904032, Bacillus niaciniAB021194, Bacillus novalisAJ542512, Bacillus pseudomycoides AF13121, Bacillus pocheonensisAB245377, Bacillus thuringiensisD16281, Bacillus thermoamylovoransL27478. Sequences were aligned using Clustal W (http://www.clustal.org/clustal2/), and phylogenetic inferences were obtained using maximum-likelihood method within MEGA 6 (http://www.megasoftware.net/mega.php). Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Clostridium butyricumAJ458420 was used as outgroup. Scale bar = 1% nucleotide sequence divergence.
Fig. 2Gram staining of B. niameyensis strain SIT3T.
Fig. 3Transmission electron microscopy of B. niameyensis strain SIT3, using Morgani 268D transmission electron microscope (Philips/FEI, Hillsboro, OR, USA) at operating voltage of 60 kV. Scale bar = 500 nm.
Fig. 4Reference mass spectrum from B. niameyensis strain SIT3T. Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 5Gel view comparing B. niameyensis (= CSUR P1266 = DSM 29725) to other species within genus Bacillus. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. X-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity expressed by greyscale scheme code. Color bar and right y-axis indicate relation between color peak, with peak intensity in arbitrary units. Displayed species are indicated at left.
Classification and general features of Bacillus niameyensis strain SIT3T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: SIT3 | |
| Gram stain | Positive |
| Cell shape | Rod |
| Motility | Motile |
| Sporulation | sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Differential characteristics of Bacillus niameyensis strain SIT3T, Bacillus hackensackii, Bacillus galactosidilyticus DSM 15595, Bacillus fordii DSM 16014, Bacillus fortis DSM 16012, Bacillus oceanisediminis JCM 16506, Bacillus sporothermodurans DSM 10599, Bacills infantis JCM 13438, Bacillus horneckiae MTCC 9535, Bacillus lentus ATCC 10840 [29], [30], [31], [32], [33], [34], [35], [36]
| Property | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 0.7–1.0 | 2.0–8.0 | 0.7–0.9 | 0.6–0.8 | 0.6–0.8 | 0.6–0.8 | 0.7 | NA | 1.0–1.5 | 0.7–1.2 |
| Oxygen requirement | + | + | + | + | + | + | + | + | + | + |
| Gram stain | + | + | + | − | − | + | + | + | + | + |
| Salt requirement | NA | NA | NA | NA | NA | NA | NA | NA | − | − |
| Motility | + | + | + | + | + | NA | + | NA | + | + |
| Endospore formation | + | + | + | + | + | + | + | NA | + | + |
| Indole | − | − | − | − | − | + | NA | NA | − | NA |
| Production of | ||||||||||
| Alkaline phosphatase | − | NA | NA | NA | NA | NA | NA | NA | + | NA |
| Catalase | + | + | + | + | + | + | + | + | + | + |
| Oxidase | − | − | NA | + | + | + | + | − | − | NA |
| Nitrate reductase | + | − | + | NA | NA | + | − | NA | + | − |
| Urease | − | + | +/− | NA | NA | − | − | NA | − | + |
| β-Galactosidase | + | NA | + | NA | NA | + | NA | NA | − | NA |
| N-acetyl-glucosamine | + | NA | + | +/− | + | NA | NA | + | + | NA |
| Acid from: | ||||||||||
| − | − | +/− | w | − | − | NA | NA | + | + | |
| Ribose | − | − | +/− | w | + | − | NA | + | − | NA |
| Mannose | − | − | + | − | − | − | − | − | + | + |
| Mannitol | − | − | − | − | − | + | − | + | + | + |
| Sucrose | − | − | +/− | − | − | + | NA | + | − | + |
| − | − | w | − | + | + | − | + | − | + | |
| − | − | w | − | − | − | − | + | − | NA | |
| + | − | w | − | − | − | NA | + | + | NA | |
| − | +/− | − | − | − | − | + | − | NA | ||
| Habitat | Human gut | Blood | Raw milk | Raw milk | Milking apparatus | Marine sediment | Milk | Blood | Spacecraft assembly | soil |
NA, data not available; w, weak reaction.
Fig. 6Graphical circular map of chromosome. From outside to center: genes on forward strain colored by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), GC content and GC skew. COGs, Clusters of Orthologous Groups database.
Nucleotide content and gene count levels of the genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 4 286 116 | 100 |
| G + C content (bp) | 1 603 007 | 37.4 |
| Coding region (bp) | 3 665 154 | 85.51 |
| Total genes | 4196 | 100 |
| RNA genes | 66 | 1.57 |
| Protein-coding genes | 4130 | 98.42 |
| Genes with function prediction | 3058 | 72.88 |
| CRISPRs | 7 | 0.16 |
| Genes assigned to COGs | 2694 | 64.20 |
| Genes with peptide signals | 294 | 7.0 |
| Genes with transmembrane helices | 1180 | 28.12 |
| Genes associated to PKS or NRPS | 10 | 0.23 |
| Genes associated to mobilome | 2198 | 51.80 |
| Genes associated to toxin/antitoxin | 111 | 2.61 |
| Genes associated to resistance genes | 0 | 0 |
| Genes with paralogues (E value 1e−10) | 1317 | 31.03 |
| Genes larger than 5000 nt | 2 | 0.05 |
COGs, Clusters of Orthologous Groups database; CRISPR, clustered regularly interspaced short palindromic repeat; NRPS, nonribosomal peptide synthase; PKS, polyketide synthase.
Total is based on either size of genome (bp) or total number of protein coding genes in annotated genome.
Number of genes associated with the 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 180 | 4.36 | Translation |
| A | 0 | 0.00 | RNA processing and modification |
| K | 297 | 7.19 | Transcription |
| L | 185 | 4.48 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 36 | 0.87 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 111 | 2.69 | Defense mechanisms |
| T | 186 | 4.50 | Signal transduction mechanisms |
| M | 173 | 4.19 | Cell wall/membrane biogenesis |
| N | 64 | 1.55 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 44 | 1.07 | Intracellular trafficking and secretion |
| O | 99 | 2.40 | Posttranslational modification, protein turnover, chaperones |
| C | 156 | 3.78 | Energy production and conversion |
| G | 351 | 8.50 | Carbohydrate transport and metabolism |
| E | 279 | 6.76 | Amino acid transport and metabolism |
| F | 86 | 2.08 | Nucleotide transport and metabolism |
| H | 97 | 2.35 | Coenzyme transport and metabolism |
| I | 85 | 2.06 | Lipid transport and metabolism |
| P | 218 | 5.28 | Inorganic ion transport and metabolism |
| Q | 73 | 1.77 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 520 | 12.59 | General function prediction only |
| S | 326 | 7.89 | Function unknown |
| — | 374 | 8.91 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on either size of genome (bp) or total number of protein coding genes in annotated genome.
Closely related species with Bacillus niameyensis
| Name of species | Similarity (%) | Accession No. |
|---|---|---|
| 100 | LK985389 | |
| 94.88 | KP120947.1 | |
| 94.45 | ||
| 93.50 | ||
| 94.60 | KP696715.1 | |
| 95.56 | NR_025786.1 |
Number of orthologous proteins shared between genomes (upper right)a
| 4,130 | 1,511 | 1,769 | 1,648 | 1,800 | 1,658 | 1,846 | 832 | |
| 61.24 | 4,486 | 1,940 | 1,730 | 1,886 | 1,601 | 2,040 | 838 | |
| 61.8 | 62.60 | 4,142 | 1,956 | 2,634 | 2,062 | 3,039 | 971 | |
| 65.06 | 61.82 | 62.01 | 4,688 | 1,842 | 1,608 | 2,069 | 824 | |
| 59.99 | 62.25 | 67.18 | 61.3 | 4,142 | 2,074 | 2,728 | 946 | |
| 61.44 | 60.15 | 62.84 | 59.95 | 61.08 | 4,789 | 2,149 | 957 | |
| 61.82 | 62.67 | 88.49 | 62.03 | 67.17 | 63.04 | 5,578 | 1,012 | |
| 54.21 | 51.46 | 52.78 | 52.44 | 50.8 | 54.49 | 52.8 | 4,152 |
Average percentage similarity of nucleotides corresponding to orthologous proteins shared between genomes (lower left) and numbers of proteins per genome.
Fig. 7Distribution of functional classes of predicted genes according to clusters of orthologous groups of proteins.
Pairwise comparison of Bacillus niameyensis with eight other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a[27], [28]
| 100% ± 00 | 2.53% ± 0.18 | 2.53% ± 0.20 | 2.54% ± 0.22 | 2.54% ± 0.13 | 2.54% ± 0.20 | 2.53% ± 0.22 | 2.53% ± 0.17 | |
| 100% ± 00 | 2.54% ± 0.19 | 2.53% ± 0.17 | 2.55% ± 0.18 | 2.53% ± 0.17 | 2.54% ± 0.21 | 2.53% ± 0.19 | ||
| 100% ± 00 | 2.53% ± 0.19 | 2.60% ± 0.22 | 2.55% ± 0.20 | 3.04% ± 0.11 | 2.53% ± 0.18 | |||
| 100% ± 00 | 2.53% ± 0.18 | 2.53% ± 0.17 | 2.53% ± 0.19 | 2.52% ± 0.18 | ||||
| 100% ± 00 | 2.54% ± 0.15 | 2.60% ± 0.21 | 2.53% ± 0.18 | |||||
| 100% ± 00 | 2.56% ± 0.16 | 2.53% ± 0.18 | ||||||
| 100% ± 00 | 2.53% ± 0.18 | |||||||
| 100% ± 00 |
DDH, DNA-DNA hybridization; GGDC, genome-to-genome distance; HSP, high-scoring pair.
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with the 16S rRNA (Fig. 1) and phylogenetic analyses as well as GGDC results.