| Literature DB >> 24501648 |
Mamadou Bhoye Keita1, Seydina M Diene1, Catherine Robert1, Didier Raoult2, Pierre-Edouard Fournier1, Fadi Bittar1.
Abstract
Strain G2(T) sp. nov. is the type strain of B. massiliogorillae, a proposed new species within the genus Bacillus. This strain, whose genome is described here, was isolated in France from the fecal sample of a wild western lowland gorilla from Cameroon. B. massiliogorillae is a facultative anaerobic, Gram-variable, rod-shaped bacterium. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 5,431,633 bp long genome (1 chromosome but no plasmid) contains 5,179 protein-coding and 98 RNA genes, including 91 tRNA genes.Entities:
Keywords: Bacillus massiliogorillae; culturomics; genome; taxonogenomics
Year: 2013 PMID: 24501648 PMCID: PMC3910557 DOI: 10.4056/sigs.4388124
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain G2T
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Gram stain | Variable | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Sporulating | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 37°C | IDA | |
| MIGS-6.3 | Salinity | Growth in BHI medium + 2% NaCl | IDA |
| MIGS-22 | Oxygen requirement | Facultative anaerobic | IDA |
| Carbon source | Unknown | NAS | |
| Energy source | Unknown | NAS | |
| MIGS-6 | Habitat | Gorilla gut | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Unknown | NAS |
| MIGS-4 | Geographic location | Cameroon | IDA |
| MIGS-5 | Sample collection time | July 2011 | IDA |
| MIGS-4.1 | Latitude | Unknown | NAS |
| MIGS-4.1 | Longitude | Unknown | NAS |
| MIGS-4.3 | Depth | Unknown | NAS |
| MIGS-4.4 | Altitude | Unknown | NAS |
Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [21]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree highlighting the position of strain G2T relative to other type strains within the genus. GenBank accession numbers are indicated in parentheses. Sequences were aligned using CLUSTAL X (V2), and phylogenetic inferences obtained using the maximum-likelihood method within the MEGA 5 software [22]. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree. was used as outgroup. The scale bar represents a 2% nucleotide sequence divergence.
Figure 2Gram staining of strain G2T
Figure 3Transmission electron microscopy of strain G2T, using a Morgani 268D (Philips) at an operating voltage of 60kV. The scale bar represents 1 µm.
Differential phenotypic characteristics between sp. nov. strain G2T and phylogenetically close species.
| | | | | |
|---|---|---|---|---|
| Cell diameter (µm) | 0.87-1.2 | 0.7-0.9 | 0.9-1 | 0.5-0.8 |
| Oxygen requirement | aerobic | aerobic | facultative anaerobic | aerobic |
| Gram stain | var | var | var | var |
| Salt requirement | < 5% | <7% | <10% | <7% |
| Motility | + | v | + | + |
| Endospore formation | + | + | + | + |
| Alkaline phosphatase | + | na | na | na |
| Acid phosphatase | + | na | na | na |
| Catalase | + | + | + | + |
| Oxidase | - | - | na | na |
| Nitrate reductase | - | na | + | na |
| Urease | + | na | na | w |
| α-galactosidase | - | na | na | na |
| β- galactosidase | - | na | + | + |
| β-glucuronidase | - | na | na | na |
| α -glucosidase | + | na | na | na |
| N-acetyl- β -glucosaminidase | + | na | na | na |
| Indole | - | na | na | na |
| Esterase | + | na | na | na |
| Esterase lipase | + | na | na | na |
| Naphthyl-AS-BI-phosphohydrolase | - | na | na | na |
| Phenylalanine arylamidase | - | na | na | na |
| Leucine arylamidase | - | na | na | na |
| Cystine arylamidase | - | na | na | na |
| Valine arylamidase | - | na | na | na |
| Glycine arylamidase | - | na | na | na |
| D-mannose | - | - | na | + |
| Amygdalin | + | - | na | v |
| L-Arabinose | - | - | + | + |
| Cellobiose | + | - | na | + |
| Lactose | + | - | + | + |
| D-xylose | - | -/w | na | + |
| Glucose | + | na | + | + |
| Mannitol | - | na | + | + |
| Arabinose | - | na | + | + |
| Xylose | - | na | + | + |
| Glycerol | - | na | + | + |
| D-Galactose | - | na | na | + |
| Starch | + | na | na | + |
| gorilla gut | soil | soil and lowland marshes | environment and fish gut |
var: variable, +: positive result, -: negative result, na: data not available, w: weak positive result
Figure 4Reference mass spectrum from strain G2T. Spectra from 12 individual colonies were compared and a reference spectrum was generated.
Figure 5Gel view comparing G2T spectra with other members of the genus ( and ). The Gel View displays the raw spectra of all loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the relation between the color a peak is displayed with and the peak intensity in arbitrary units.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | 454 paired-end 3- kb libraries |
| MIGS-29 | Sequencing platform | 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 13× |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal |
| EMBL Date of Release | April 18, 2013 | |
| EMBL ID | CAVL000000000 | |
| MIGS-13 | Project relevance | Study of the gorilla gut microbiome |
The number of orthologous proteins shared between genomes†
| | | | | |
|---|---|---|---|---|
| | 70.15 | 69.28 | 69.66 | |
| 1,936 | | 68.74 | 68.46 | |
| 1,966 | 1,962 | | 69.86 | |
| 1,877 | 1,873 | 1,903 | |
†Lower left triangle- shared orthologous, upper right triangle- average percentage similarity of nucleotides corresponding to orthologous proteins shared between genomes, bold- number of proteins per genome
Figure 6Graphical circular map of the genome. From outside in: contigs (red / grey), COG category of genes on the forward strand (three circles), genes on forward strand (blue circle), genes on the reverse strand (red circle), COG category on the reverse strand (three circles), GC content. The inner-most circle shows GC skew, purple and olive indicating negative and positive values, respectively.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 180 | 3.48 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 438 | 8.46 | Transcription |
| L | 191 | 3.69 | Replication, recombination and repair |
| B | 2 | 0.04 | Chromatin structure and dynamics |
| D | 42 | 0.81 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 110 | 2.12 | Defense mechanisms |
| T | 275 | 5.31 | Signal transduction mechanisms |
| M | 182 | 3.51 | Cell wall/membrane biogenesis |
| N | 88 | 1.7 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 63 | 1.22 | Intracellular trafficking and secretion |
| O | 130 | 2.51 | Posttranslational modification, protein turnover, chaperones |
| C | 293 | 5.66 | Energy production and conversion |
| G | 247 | 4.77 | Carbohydrate transport and metabolism |
| E | 474 | 9.15 | Amino acid transport and metabolism |
| F | 110 | 2.12 | Nucleotide transport and metabolism |
| H | 177 | 3.42 | Coenzyme transport and metabolism |
| I | 188 | 3.63 | Lipid transport and metabolism |
| P | 300 | 5.79 | Inorganic ion transport and metabolism |
| Q | 133 | 2.57 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 664 | 12.82 | General function prediction only |
| S | 344 | 6.64 | Function unknown |
| - | 1,269 | 24.50 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome
Nucleotide content and gene count levels of the genome
| | | |
|---|---|---|
| Genome size (bp) | 5,431,633 | 100 |
| Coding region (bp) | 4,561,287 | 83.98 |
| G+C content (bp) | 1,898,498 | 34.95 |
| Total genes | 5,276 | 100 |
| RNA genes | 98 | 1.84 |
| Protein-coding genes | 5,179 | 98.63 |
| Genes with function prediction | 3,801 | 73.39 |
| Genes assigned to COGs | 3,910 | 75.49 |
| Genes with peptide signals | 610 | 11.78 |
| Genes with transmembrane helices | 1,347 | 26.01 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome