| Literature DB >> 26933503 |
M Tidjani Alou1, J Rathored1, J-C Lagier1, S Khelaifia1, N Labas1, C Sokhna2, A Diallo2, D Raoult3, G Dubourg1.
Abstract
Massilibacterium senegalense gen. nov., sp. nov., strain mt8(T), is the type strain of Massilibacterium gen. nov., a new genus within the Bacillaceae family. This Gram-negative facultative anaerobic rod was isolated from the gut microbiota of a severely malnourished boy. Its phenotypic description is hereby presented with a complete annotation of its genome sequence. This genome is 5 697 950 bp long and contains 5615 protein-coding genes and 178 RNA genes, among which are 40 rRNA genes.Entities:
Keywords: Culturomics; Massilibacterium senegalense; genome; gut microbiota; taxonogenomics
Year: 2016 PMID: 26933503 PMCID: PMC4765772 DOI: 10.1016/j.nmni.2016.01.010
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Classification and general features of Massilibacterium senegalense strain mt8T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: mt8 | |
| Gram stain | Negative |
| Cell shape | Rod |
| Motility | Motile |
| Sporulation | sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 1Phylogenetic tree highlighting the position of Massilibacterium senegalense strain mt8T relative to other close strains. The respective GenBank accession numbers for 16S rRNA genes are indicated in parenthesis. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using the maximum-likelihood method within the MEGA6. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree. Paenibacillus polymyxa strain KCTC3717 was used as an outgroup. The scale bar represents a 1% nucleotide sequence divergence.
Fig. 2Reference mass spectrum from Massilibacterium senegalense strain mt8T. Spectra from 12 individual colonies were compared and a reference spectrum was generated.
Fig. 3Gel view comparing Massilibacterium senegalense strain mt8T to other species within the Bacillaceae family. The gel view displays the raw spectra of loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the link between the color in which a peak is displayed and the peak intensity in arbitrary units. Displayed species are indicated on the left.
Fig. 4Gram staining of Massilibacterium senegalense strain mt8T.
Fig. 5Transmission electron microscopy of Massilibacterium senegalense strain mt8T using Tecnai G20 transmission electron microscope (FEI Company). Scale bar = 500 nm.
Differential characteristics of Massilibacterium senegalense strain mt8T, Bacillus halodurans DSM 497, Bacillus acidicola DSM 14745T, Bacillus wakoensis DSM 2521T, Bacillus hemicellulosilyticus DSM 16731T, Bacillus cellulosilyticus DSM 2522T, Bacillus akibai ATCC 43226T, Bacillus mannanilyticus DSM 16130T, Bacillus okuhidensis DSM 13666T, Bacillus sonorensis DSM 13779T[33], [34], [35], [36], [37]
| Property | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 1.7–1.9 | 0.5–0.6 | 1.0–1.6 | 0.5–0.8 | 0.3–0.5 | 0.6–0.8 | 0.6–0.8 | 0.6–0.8 | 0.5–1.0 | 1.0 |
| Oxygen requirement | + | + | + | + | + | + | + | + | + | + |
| Gram stain | − | NA | + | + | +/− | + | + | +/− | +/− | + |
| Salt requirement | − | + | − | − | − | − | − | − | − | − |
| Motility | + | NA | + | + | + | + | + | + | + | + |
| Endospore formation | + | + | + | + | + | + | + | + | + | + |
| Indole | − | NA | − | − | − | − | − | − | NA | NA |
| Production of: | ||||||||||
| Alkaline phosphatase | − | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| Catalase | − | NA | + | + | + | + | + | + | + | + |
| Oxidase | − | NA | − | − | − | − | − | − | + | NA |
| Nitrate reductase | + | − | − | + | − | + | + | − | + | + |
| Urease | − | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| β-Galactosidase | − | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| N-acetyl-glucosamine | + | + | NA | NA | NA | NA | NA | NA | + | NA |
| Acid from: | ||||||||||
| | − | + | − | − | − | − | + | − | + | + |
| Ribose | − | − | + | − | − | − | − | − | NA | NA |
| Mannose | + | + | + | + | + | + | + | + | − | NA |
| Mannitol | − | + | + | + | + | − | + | + | + | + |
| Sucrose | + | + | + | + | + | + | + | + | NA | NA |
| | − | + | + | + | + | + | + | + | NA | + |
| | + | + | + | + | + | + | + | + | NA | NA |
| | + | + | + | + | + | + | + | + | NA | NA |
| | − | + | +/− | − | + | + | − | − | + | NA |
| Habitat | Human gut | Soil | Acidic peat bogs | Industry | Industry | Industry | Industry | Industry | Water | Desert |
NA, not available.
Fig. 6Graphical circular map of chromosome. From outside to center: genes on forward strain coloured by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), G+C content and G+C skew.
Nucleotide content and gene count levels of the genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 5 697 950 | 100 |
| G+C content (%) | 2 034 168 | 35.7 |
| Coding region (bp) | 4 442 019 | 77.95 |
| Total genes | 5793 | 100 |
| RNA genes | 178 | 3.07 |
| Protein-coding genes | 5615 | 96.92 |
| Genes with function prediction | 4262 | 73.57 |
| Genes assigned to COGs | 3838 | 66.25 |
| Genes with peptide signals | 210 | 3.62 |
| Genes with transmembrane helices | 504 | 8.70 |
| CRISPR repeats | 0 | 0 |
| ORFan genes | 208 | 3.59 |
| Genes associated with PKS or NRPS | 13 | 0.22 |
| No. of antibiotic resistance genes | 0 | 0 |
COGs, Clusters of Orthologous Groups database; CRISPR, clustered regularly interspaced short palindromic repeat.
Total is based on either size of genome (bp) or total number of protein-coding genes in annotated genome.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 298 | 10.63 | Translation |
| A | 0 | 0.0 | RNA processing and modification |
| K | 334 | 11.91 | Transcription |
| L | 406 | 14.48 | Replication, recombination and repair |
| B | 2 | 0.07 | Chromatin structure and dynamics |
| D | 68 | 2.43 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 80 | 2.85 | Defense mechanisms |
| T | 252 | 8.99 | Signal transduction mechanisms |
| M | 224 | 7.99 | Cell wall/membrane biogenesis |
| N | 132 | 4.71 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 98 | 3.50 | Intracellular trafficking and secretion |
| O | 190 | 6.78 | Posttranslational modification, protein turnover, chaperones |
| C | 248 | 8.84 | Energy production and conversion |
| G | 140 | 4.99 | Carbohydrate transport and metabolism |
| E | 370 | 13.20 | Amino acid transport and metabolism |
| F | 138 | 4.92 | Nucleotide transport and metabolism |
| H | 226 | 8.06 | Coenzyme transport and metabolism |
| I | 190 | 6.78 | Lipid transport and metabolism |
| P | 286 | 10.20 | Inorganic ion transport and metabolism |
| Q | 92 | 3.28 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 602 | 21.47 | General function prediction only |
| S | 432 | 15.41 | Function unknown |
| — | 424 | 7.31 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Genome comparison of species closely related to Massilibacterium senegalense strain mt8T
| Organism | INSDC | Size | G+C% | Protein-coding genes | Total genes |
|---|---|---|---|---|---|
| CTRN00000000.1 | 5.69 | 35.6 | 5615 | 5793 | |
| CP001878.2 | 3.86 | 40.3 | 3704 | 3841 | |
| CP010820.1 | 4.84 | 37.6 | 4548 | 4764 | |
| BAUT00000000.1 | 5.53 | 38.3 | 3912 | 4460 | |
| JHVN00000000.1 | 3.36 | 43.0 | 3245 | 3400 | |
| BA000004.3 | 4.2 | 43.7 | 3903 | 4076 | |
| ACWF00000000.1 | 3.24 | 40.7 | 2832 | 3235 |
Fig. 7Distribution of functional classes of predicted genes according to clusters of orthologous groups of protein.
Number of orthologous proteins shared between genomes (upper right)a
| 4335 | 1496 | 1368 | 1868 | 1615 | 1959 | 1457 | |
| 64.28 | 4767 | 1244 | 1356 | 1456 | 1464 | 1325 | |
| 65.99 | 65.59 | 5615 | 1263 | 1318 | 1321 | 1231 | |
| 70.67 | 64.46 | 66.35 | 4576 | 1495 | 1819 | 1358 | |
| 65.38 | 64.71 | 66.37 | 65.28 | 3463 | 1611 | 1456 | |
| 68.48 | 63.43 | 65.40 | 68.03 | 65.66 | 4066 | 1425 | |
| 65.06 | 65.14 | 65.95 | 64.98 | 68.03 | 65.00 | 3294 |
Average percentage similarity of nucleotides corresponding to orthologous proteins shared between genomes (lower left) and numbers of proteins per genome.
Pairwise comparison of Massilibacterium senegalense strain mt8T with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a (upper right)
| 100% | 29.1% ± 2.54 | 26.3% ± 2.55 | 21.2% ± 2.59 | 21.9% ± 2.53 | 27.5% ± 2.56 | 25.7% ± 2.54 | |
| 100% | 29.5% ± 2.54 | 26.1% ± 2.54 | 23.2% ± 2.53 | 30.8% ± 2.54 | 31.9% ± 2.54 | ||
| 100% | 25% ± 2.54 | 21% ± 2.54 | 24.4% ± 2.55 | 30.4% ± 2.54 | |||
| 100% | 19.5% ± 2.53 | 23.3% ± 2.56 | 28.3% ± 2.53 | ||||
| 100% | 22.2% ± 2.53 | 21.6% ± 2.55 | |||||
| 100% | 25.9% ± 2.54 | ||||||
| 100% |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with the 16S rRNA (Fig. 3) and phylogenomic analyses as well as GGDC results.