| Literature DB >> 28053703 |
M Tidjani Alou1, J Rathored1, J-C Lagier2, S Khelaifia1, C Michelle1, C Sokhna3, A Diallo3, A B Diallo4, P-E Fournier1, D Raoult5, S Edouard1.
Abstract
Rubeoparvulum massiliense strain mt6T was isolated from the gut microbiota of a severely malnourished boy from Senegal and consisted of facultative anaerobic, spore-forming, nonmotile and Gram-negative rods. R. massiliense showed a 92% similarity with the 16S rRNA of Bacillus mannanilyticus. The genome of strain mt6T is 2 843 796 bp long with a 43.75% G+C content. It contains 2735 protein-coding genes and 76 RNA genes, among which are nine rRNA genes.Entities:
Keywords: Culturomics; Rubeoparvulum massiliense; genome; gut microbiota; taxonogenomics
Year: 2016 PMID: 28053703 PMCID: PMC5198730 DOI: 10.1016/j.nmni.2016.11.003
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Phylogenetic tree highlighting position of Rubeoparvulum massiliense strain mt6T relative to other close strains. Respective GenBank accession numbers for 16S rRNA genes are indicated in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using maximum-likelihood method within MEGA6 software. Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Salinimicrobium catena strain HY1 was used as outgroup. Scale bar represents 5% nucleotide sequence divergence.
Classification and general features of Rubeoparvulum massiliense strain mt6T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: mt6 | |
| Gram stain | Negative |
| Cell shape | Rod |
| Motility | Motile |
| Sporulation | Sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 2Gram staining of Rubeoparvulum massiliense strain mt6T (= CSUR P1473 = DSM 100479).
Fig. 3Transmission electron microscopy of Rubeoparvulum massiliense strain mt6T (= CSUR P1473 = DSM 100479) using Tecnai G20 transmission electron microscope (FEI Company) at operating voltage of 60 kV. Scale bar = 500 nm.
Fig. 4Reference mass spectrum from Rubeoparvulum massiliense strain mt6T (= CSUR P1473 = DSM 100479). Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 5Gel view comparing Rubeoparvulum massiliense strain mt6T (= CSUR P1473 = DSM 100479) to other species within Bacillaceae family. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. x-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peak is displayed, with peak intensity in arbitrary units. Displayed species are indicated at left.
Differential characteristics of Rubeoparvulum massiliense strain mt6T CSUR P1473 = DSM 100479, Bacillus mannanilyticus strain AM-001T DSM 16130T, Tepidibacillus fermentans strain STGHT DSM 23802T, Pullulanibacillus uraniitolerans strain UG-2T DSM 19429T, Alkalibacillus haloalkaliphilus DSM 5271T, Tenuibacillus halotolerans strain YIM 94025T KCTC 33046T, Thalassobacillus devorans strain G-19.1T DSM 16966T, Salinibacillus aidingensis strain 25-7T JCM 12389T, Salinibacillus kushneri strain 8-2T JCM 12390T, Ornithinibacillus bavariensis strain WSBC 24001T DSM 15681T[34], [35], [36], [37], [38], [39], [40], [41]
| Property | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 1.0 | 0.6–0.8 | 0.3 | 1.0 | 0.3–0.5 | 0.2–0.3 | 1.0–1.2 | 0.4–0.6 | 0.3–0.5 | 0.4 |
| Oxygen requirement | +/− | + | − | + | + | + | + | + | + | + |
| Gram stain | − | +/− | + | + | − | + | + | + | + | + |
| Salt requirement | − | − | + | + | + | + | + | + | + | + |
| Indole | − | − | NA | NA | NA | − | − | NA | NA | − |
| Production of: | ||||||||||
| Catalase | − | + | − | + | + | + | + | + | + | + |
| Oxidase | + | − | + | − | − | + | − | +/− | +/− | + |
| Nitrate reductase | + | − | + | − | − | − | + | − | − | − |
| Urease | + | NA | NA | NA | − | NA | − | − | − | − |
| Acid from: | ||||||||||
| − | − | − | + | NA | − | − | + | + | − | |
| Ribose | − | NA | NA | + | + | NA | − | NA | NA | + |
| Mannose | + | + | NA | + | + | NA | + | + | + | − |
| Mannitol | − | + | NA | NA | − | − | + | + | + | − |
| Sucrose | − | + | − | + | NA | + | + | + | + | − |
| − | + | + | + | − | + | + | + | + | + | |
| − | + | + | + | + | + | + | + | + | − | |
| + | + | + | + | − | − | + | + | + | − | |
| + | − | − | + | NA | − | − | + | + | − | |
| Habitat | Human stool | Industry | Gas storage | Mill tailing effluent | Salt lake | Salt lake | Hypersaline environments | Neutral saline lake | Neutral saline lake | Pasteurized milk |
+, positive result; −, negative result; NA, data not available.
Cellular fatty acid composition (%) of Rubeoparvulum massiliense strain mt6T
| Fatty acid | Name | Mean relative % |
|---|---|---|
| 15:0 anteiso | 12-Methyl-tetradecanoic acid | 25.2 ± 0.3 |
| 13:0 anteiso | 10-Methyl-dodecanoic acid | 15.2 ± 0.2 |
| 16:0 | Hexadecanoic acid | 12.0 ± 0.6 |
| 18:1n9 | 9-Octadecenoic acid | 9.3 ± 0.4 |
| 13:0 iso | 11-Methyl-dodecanoic acid | 7.1 ± 0.1 |
| 18:0 | Octadecanoic acid | 6.3 ± 0.1 |
| 15:0 iso | 13-Methyl-tetradecanoic acid | 5.8 ± 0.1 |
| 14:0 iso | 12-Methyl-tridecanoic acid | 5.6 ± 0.3 |
| 18:2n6 | 9,12-Octadecadienoic acid | 5.3 ± 0.2 |
| 5:0 iso | 3-Methyl-butanoic acid | 1.7 ± 0.1 |
| 14:0 | Tetradecanoic acid | 1.6 ± 0.1 |
| 11:0 anteiso | 8-Methyl-decanoic acid | 1.1 ± 0.1 |
| 18:1n6 | 12-Octadecenoic acid | TR |
| 15:0 | Pentadecanoic acid | TR |
| 12:0 iso | 10-Methyl-undecanoic acid | TR |
| 17:0 | Heptadecanoic acid | TR |
| 18:1n7 | 11-Octadecenoic acid | TR |
| 13:0 | Tridecanoic acid | TR |
| 15:0 3-OH anteiso | 3-Hydroxy-12-methyl-tetradecanoic acid | TR |
| 11:0 iso | 9-Methyl-decanoic acid | TR |
| 16:0 iso | 14-Methyl-pentadecanoic acid | TR |
| 10:0 | Decanoic acid | TR |
| 16:1n7 | 9-Hexadecenoic acid | TR |
| 20:4n6 | 5,8,11,14-Eicosatetraenoic acid | TR |
| 17:0 anteiso | 14-Methyl-hexadecanoic acid | TR |
| 12:0 | Dodecanoic acid | TR |
| 17:0 iso | 15-Methyl-hexadecanoic acid | TR |
TR, trace amounts <1%.
Mean peak area percentage.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 2 843 796 | 100 |
| G+C content (%) | 1 244 365 | 43.75 |
| Coding region (bp) | 2 498 460 | 87.85 |
| Total genes | 2811 | 100 |
| RNA genes | 76 | 2.70 |
| Protein-coding genes | 2735 | 97.29 |
| Genes with function prediction | 1873 | 66.63 |
| Genes assigned to COGs | 862 | 30.66 |
| Genes with peptide signals | 319 | 11.34 |
| No. of pseudogenes | 13 | 0.46 |
| CRISPR repeats | 0 | 0 |
| No. genes with Pfam-A domains | 2557 | 90.96 |
| ORFans genes | 233 | 8.28 |
| Genes associated with PKS or NRPS | 12 | 0.42 |
| No. of antibiotic resistance genes | 0 | 0 |
COGs, Clusters of Orthologous Groups database.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Fig. 6Graphical circular map of chromosome. From outside to centre, genes on forward strain are coloured by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), G+C content and G+C skew. COGs, Clusters of Orthologous Groups database.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 150 | 5.48 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 141 | 5.15 | Transcription |
| L | 139 | 5.08 | Replication, recombination and repair |
| B | 1 | 0.03 | Chromatin structure and dynamics |
| D | 28 | 1.02 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 50 | 1.82 | Defense mechanisms |
| T | 90 | 3.29 | Signal transduction mechanisms |
| M | 82 | 2.99 | Cell wall/membrane biogenesis |
| N | 49 | 1.79 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 39 | 1.42 | Intracellular trafficking and secretion |
| O | 82 | 2.99 | Posttranslational modification, protein turnover, chaperones |
| C | 129 | 4.71 | Energy production and conversion |
| G | 81 | 2.96 | Carbohydrate transport and metabolism |
| E | 227 | 8.29 | Amino acid transport and metabolism |
| F | 62 | 2.26 | Nucleotide transport and metabolism |
| H | 76 | 2.77 | Coenzyme transport and metabolism |
| I | 66 | 2.41 | Lipid transport and metabolism |
| P | 155 | 5.66 | Inorganic ion transport and metabolism |
| Q | 30 | 1.09 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 273 | 9.98 | General function prediction only |
| S | 189 | 6.91 | Function unknown |
| — | 862 | 31.51 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Genome comparison of closely related species to Rubeoparvulum massiliense strain mt6T
| Organism | INSDC | Size (Mb) | G+C (%) | Protein-coding genes | Total genes |
|---|---|---|---|---|---|
| CVPE00000000 | 2.84 | 43.75 | 2735 | 2811 | |
| BAMO00000000.1 | 4.53 | 39.6 | 3846 | 4454 | |
| JATL00000000.1 | 5.51 | 54.2 | 5047 | 5297 | |
| APBN00000000.1 | 5.16 | 52.0 | 4817 | 5039 | |
| ATNE00000000.1 | 4.43 | 58.4 | 4072 | 4269 | |
| AFCE00000000.1 | 2.9 | 47.6 | 2741 | 2969 |
INSDC, International Nucleotide Sequence Database Collaboration.
Numbers of orthologous protein shared between genomes (upper right)a
| 1605 | 1606 | 1533 | 1178 | 1079 | ||
| 60.62 | 2713 | 2625 | 1368 | 1286 | ||
| 61.19 | 73.07 | 2645 | 1376 | 1316 | ||
| 59.20 | 75.04 | 75.55 | 135 | 1269 | ||
| 66.10 | 62.97 | 63.15 | 63.13 | 1039 | ||
| 63.12 | 61.17 | 61.57 | 60.30 | 62.87 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Rubeoparvulum massiliense strain mt6T with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length),a upper right
| 100% ± 00 | 32.9% ± 2.52 | 32.2% ± 2.52 | 31.6% ± 2.52 | 26.2% ± 2.52 | 26.3% ± 2.52 | |
| 100% ± 00 | 18.8% ± 2.70 | 19.4% ± 2.73 | 30% ± 2.52 | 35.4% ± 2.52 | ||
| 100% ± 00 | 20.2% ± 2.80 | 29.9% ± 2.52 | 30.7% ± 2.52 | |||
| 100% ± 00 | 35.1% ± 2.52 | 23.2% ± 2.52 | ||||
| 100% ± 00 | 27.2% ± 2.52 | |||||
| 100% ± 00 |
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1) and phylogenomic analyses as well as GGDC results.
Fig. 7Distribution of functional classes of predicted genes according COGs of protein. COGs, Clusters of Orthologous Groups database.