| Literature DB >> 28289546 |
E Guilhot1, M T Alou1, J-C Lagier1, N Labas1, C Couderc1, J Delerce1, A Diallo2, C Sokhna2, P-E Fournier1, D Raoult3, S Khelaifia1.
Abstract
Using the culturomics strategy, a wide range of anaerobic bacteria was discovered including Anaeromassilibacillus senegalensis strain mt9T (= CSUR P1511 = DSM 102954), isolated from the gut microbiota of a 1-year-old Senegalese patient with kwashiorkor. This Gram-negative strain is a strictly anaerobic, spore-forming rod motile by a polar flagellum. The 3 511 289 bp long genome of this strain contains 3046 protein-coding and 49 RNA genes, including 45 tRNA and four rRNA genes, and exhibits a G+C content of 52.94%. Here we describe the features of this organism, together with the complete genome sequence and annotation.Entities:
Keywords: Anaeromassilibacillus senegalensis; culturomics; genome; human gut; taxonogenomics
Year: 2017 PMID: 28289546 PMCID: PMC5338722 DOI: 10.1016/j.nmni.2017.01.009
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Classification and general features of Anaeromassilibacillus senegalensis strain mt9T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum | |
| Class | |
| Order | |
| Family | |
| Genus | |
| Species | |
| Type strain | mt9T |
| Gram stain | Negative |
| Cell shape | Rod |
| Motility | Motile |
| Sporulation | sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 1Phylogenetic tree highlighting position of Anaeromassilibacillus senegalensis strain mt9T relative to other type strains. 16S rRNA sequences accession numbers are indicated in parentheses. Sequences were aligned by Muscle 3.8.31 with default parameters, and phylogenetic inferences were obtained using neighbour-joining method with 500 bootstrap replicates within MEGA6 software. Only bootstraps >95% are shown. Scale bar represents 0.01% nucleotide sequence divergence.
Fig. 2Reference mass spectrum from Anaeromassilibacillus senegalensis strain mt9T.
Fig. 3Gel view comparing Anaeromassilibacillus senegalensis strain mt9T to closely related genera. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. X-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. The color bar and the right y-axis indicate the relation between the color, the peak displayed and the peak intensity in arbitrary units. Displayed species are indicated on the left.
Fig. 4Gram staining of Anaeromassilibacillus senegalensis strain mt9T.
Fig. 5Electron micrographs of Anaeromassilibacillus senegalensis strain mt9T using Tecnai G20 transmission electron microscope (FEI Company) at operating voltage of 200 keV. Scale bar = 500 nm.
Differential characteristics of Anaeromassilibacillus senegalensis DSM 102954, Clostridium thermobutyricum DSM 4928, Clostridium cellulolyticum H10, Clostridium leptum DSM 753, Clostridium sporosphaeroides DSM 1294, and Ethanoligenens harbinense YUAN-3 [26], [27], [28]
| Property | ||||||
|---|---|---|---|---|---|---|
| Cell diameter (μm) | 0.9 | 0.5–0.6 | 1.5 | 0.6–0.8 | 0.4–0.8 | 1.6–2.1 |
| Oxygen requirement | Strictly anaerobic | Strictly anaerobic | Strictly anaerobic | Strictly anaerobic | Strictly anaerobic | Strictly anaerobic |
| Gram stain | Negative | Positive | Positive | Positive | Positive | Negative |
| Motility | Motile | Nonmotile | Motile | Nonmotile | Motile | Nonmotile |
| Endospore formation | + | + | + | + | − | + |
| Indole | − | − | − | NA | + | − |
| Production of: | ||||||
| Alkaline phosphatase | + | NA | NA | NA | − | + |
| Catalase | − | NA | − | NA | − | − |
| Oxidase | − | NA | NA | NA | − | − |
| Nitrate reductase | − | − | − | NA | − | − |
| Urease | − | − | − | NA | + | NA |
| β-galactosidase | + | NA | NA | NA | − | − |
| | + | NA | NA | NA | + | − |
| Acid from: | ||||||
| | − | − | NA | NA | − | + |
| Ribose | + | − | NA | w | + | + |
| Mannose | − | − | NA | − | + | − |
| Mannitol | − | − | NA | − | − | NA |
| Sucrose | − | NA | NA | w | + | − |
| | − | NA | NA | NA | + | + |
| | − | NA | NA | NA | + | + |
| | + | NA | NA | NA | + | + |
| | − | NA | NA | NA | + | + |
| G+C content (%) | 52.94 | 53.5 | 41 | 51 | 47.8 | 37 |
| Habitat | Human gut | Human gut | Compost | Human gut | Environment | Environment |
+, positive result; −, negative result; w, weakly positive result; NA, data not available.
Cellular fatty acid composition (%)
| Fatty acid | Name | Mean relative % |
|---|---|---|
| 16:0 | Hexadecanoic acid | 32.3 ± 1.4 |
| 14:0 | Tetradecanoic acid | 26.5 ± 0.9 |
| 18:1n9 | 9-Octadecenoic acid | 25.8 ± 2.2 |
| 18:1n7 | 11-Octadecenoic acid | 4.0 ± 0.6 |
| 18:2n6 | 9,12-Octadecadienoic acid | 3.0 ± 0.4 |
| 18:0 | Octadecanoic acid | 2.5 ± 0.3 |
| 15:0 | Pentadecanoic acid | 1.5 ± 0.4 |
| 16:1n7 | 9-Hexadecenoic acid | 1.2 ± 0.0 |
| 14:1 | Tetradecanoic acid | TR |
| 12:0 | Dodecanoic acid | TR |
| 15:0 iso | 13-methyl-Tetradecanoic acid | TR |
| 13:0 | Tridecanoic acid | TR |
| 17:0 anteiso | 14-methyl-Hexadecanoic acid | TR |
| 15:0 anteiso | 12-methyl-tetradecanoic acid | TR |
TR, trace amounts <1%.
Mean peak area percentage.
Fig. 6Graphical circular map of genome. From outside to centre: contigs (red/grey), COGs category of genes on forward strand (three circles), genes on forward strand (blue circle), genes on reverse strand (red circle), COGs category on reverse strand (three circles), G+C content. COGs, Clusters of Orthologous Groups database.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 3 511 289 | 100 |
| G+C content (%) | 1 858 158 | 52.93 |
| Coding region (bp) | 3 051 966 | 86.92 |
| Total genes | 3095 | 100 |
| RNA genes | 49 | 1.58 |
| Protein-coding genes | 3046 | 100 |
| Genes with function prediction | 1773 | 58.20 |
| Genes assigned to COGs | 1526 | 50.09 |
| Genes with peptide signals | 413 | 13.55 |
| Genes with transmembrane helices | 578 | 18.97 |
| ORFans genes | 400 | 13.13 |
| Genes associated with PKS or NRPS | 5 | 0.16 |
| No. of antibiotic resistance genes | 0 | 0 |
COGs, Clusters of Orthologous Groups database.; NRPS, nonribosomal peptide synthase; PKS, polyketide synthase.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 142 | 4.66 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 197 | 6.47 | Transcription |
| L | 102 | 3.35 | Replication, recombination and repair |
| B | 1 | 0.03 | Chromatin structure and dynamics |
| D | 19 | 0.62 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 54 | 1.77 | Defense mechanisms |
| T | 59 | 1.94 | Signal transduction mechanisms |
| M | 80 | 2.63 | Cell wall/membrane biogenesis |
| N | 5 | 0.16 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 22 | 0.72 | Intracellular trafficking and secretion |
| O | 49 | 1.61 | Posttanslational modification, protein turnover, chaperones |
| C | 81 | 2.66 | Energy production and conversion |
| G | 112 | 3.68 | Carbohydrate transport and metabolism |
| E | 146 | 4.79 | Amino acid transport and metabolism |
| F | 49 | 1.61 | Nucleotide transport and metabolism |
| H | 69 | 2.27 | Coenzyme transport and metabolism |
| I | 38 | 1.25 | Lipid transport and metabolism |
| P | 77 | 2.53 | Inorganic ion transport and metabolism |
| Q | 20 | 0.66 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 216 | 7.09 | General function prediction only |
| S | 124 | 4.07 | Function unknown |
| — | 1520 | 49.90 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Genome comparison of closely related species to Anaeromassilibacillus senegalensis strain mt9T
| Organism | INSDC | Size (Mb) | G+C (%) | Protein-coding genes |
|---|---|---|---|---|
| 3.51 | 52.93 | 3046 | ||
| 4.47 | 53.19 | 4723 | ||
| 4.07 | 37.40 | 3390 | ||
| 3.81 | 61.07 | 4278 | ||
| 4.90 | 35.72 | 3892 | ||
| 4.40 | 37.13 | 3867 | ||
| 6.15 | 35.55 | 5082 | ||
| 3.01 | 55.56 | 2701 | ||
| 3.32 | 56.82 | 2756 | ||
| 2.81 | 50.25 | 2482 |
INSDC, International Nucleotide Sequence Database Collaboration.
Numbers of orthologous proteins shared between genomes (upper right)a
| 795 | 760 | 866 | 817 | 734 | 906 | 701 | 800 | 773 | ||
| 52.27 | 745 | 787 | 1348 | 824 | 822 | 601 | 1328 | 1345 | ||
| 62.04 | 58.15 | 841 | 760 | 729 | 729 | 630 | 757 | 735 | ||
| 62.70 | 57.07 | 67.67 | 789 | 766 | 818 | 681 | 780 | 772 | ||
| 53.17 | 66.65 | 58.44 | 57.65 | 826 | 818 | 600 | 1534 | 1269 | ||
| 63.64 | 55.08 | 63.42 | 64.45 | 55.74 | 744 | 605 | 809 | 792 | ||
| 67.12 | 55.30 | 62.08 | 62.83 | 55.64 | 62.71 | 665 | 817 | 786 | ||
| 65.36 | 52.95 | 61.97 | 62.78 | 53.97 | 63.62 | 63.58 | 603 | 584 | ||
| 53.41 | 66.63 | 58.47 | 57.76 | 87.31 | 55.69 | 55.95 | 53.76 | 1247 | ||
| 53.18 | 77.09 | 58.65 | 57.72 | 66.68 | 55.75 | 55.76 | 53.51 | 66.81 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Anaeromassilibacillus senegalensis with nine other speciesa
| 100% ± 00 | 23.1% ± 2.5 | 29.0% ± 2.5 | 19.6% ± 2.4 | 30.4% ± 2.5 | 30.0% ± 2.5 | 28.5% ± 2.5 | 23.2% ± 2.5 | 25.6% ± 2.5 | 21.0% ± 2.4 | |
| 100% ± 00 | 25.7% ± 2.5 | 20.5% ± 2.4 | 26.8% ± 2.5 | 27.1% ± 2.5 | 26.6% ± 2.5 | 27.5% ± 2.5 | 19.6% ± 2.4 | 25.5% ± 2.5 | ||
| 100% ± 00 | 25.8% ± 2.5 | 32.2% ± 2.5 | 28.1% ± 2.5 | 29.1% ± 2.5 | 25.7% ± 2.4 | 24.7% ± 2.5 | 27.6% ± 2.5 | |||
| 100% ± 00 | 27.5% ± 2.4 | 25.7% ± 2.5 | 29.0% ± 2.5 | 18.8% ± 2.4 | 20.4% ± 2.4 | 21.0% ± 2.4 | ||||
| 100% ± 00 | 23.6% ± 2.5 | 21.0% ± 2.4 | 25.9% ± 2.5 | 28.4% ± 2.5 | 29.5% ± 2.5 | |||||
| 100% ± 00 | 23.6% ± 2.5 | 26.4% ± 2.5 | 25.8% ± 2.5 | 26.1% ± 2.5 | ||||||
| 100% ± 00 | 27.1% ± 2.5 | 25.2% ± 2.5 | 26.2% ± 2.5 | |||||||
| 100% ± 00 | 20.1% ± 2.4 | 22.2% ± 2.5 | ||||||||
| 100% ± 00 | 28.7% ± 2.5 | |||||||||
| 100% ± 00 |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring pair.
Comparison made using GGDC, formula 2 (DDH estimates based on identities/HSP length). Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA and phylogenomic analyses as well as GGDC results.
Fig. 7Distribution of functional classes of predicted genes according to clusters of orthologous groups of proteins of Anaeromassilibacillus senegalensis strain mt9T and other strains.