| Literature DB >> 27087892 |
Linda Hadjadj1, Jaishriram Rathored1, Mamadou Bhoye Keita1, Caroline Michelle1, Anthony Levasseur1, Didier Raoult2, Pierre-Edouard Fournier1, Jean-Marc Rolain1, Fadi Bittar1.
Abstract
Strain G3(T) (CSUR P207 = DSM 26203) was isolated from the fecal sample of a wild gorilla (Gorilla gorilla subsp gorilla) from Cameroon. It is a Gram-positive, facultative anaerobic short rod. This strain exhibits a 16S rRNA sequence similarity of 98.2 % with Microbacterium thalassium, the closest validly published Microbacterium species and member of the family Microbacteriaceae. Moreover, it shows a low MALDI-TOF-MS score (1.1 to 1.3) that does not allow any identification. Thus, it is likely that this strain represents a new species. Here we describe the phenotypic features of this organism, the complete genome sequence and annotation. The 3,692,770 bp long genome (one chromosome but no plasmid) contains 3,505 protein-coding and 61 RNA genes, including 4 rRNA genes. In addition, digital DNA-DNA hybridization values for the genome of the strain G3(T) against the closest Microbacterium genomes range between 19.7 to 20.5, once again confirming its new status as a new species. On the basis of these polyphasic data, consisting of phenotypic and genomic analyses, we propose the creation of Microbacterium gorillae sp. nov. that contains the strain G3(T).Entities:
Keywords: Culturomics; Genome; Gorilla stool sample; Microbacterium gorillae; Taxonomo-genomics
Year: 2016 PMID: 27087892 PMCID: PMC4832456 DOI: 10.1186/s40793-016-0152-z
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Microbacterium gorillae strain G3T
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain: | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | IDA | ||
| Type strain: G3T | IDA | ||
| Gram stain | Positive | IDA | |
| Cell shape | Rod-shaped | IDA | |
| Motility | Non-motile | IDA | |
| Sporulation | Non-sporulating | IDA | |
| Temperature range | Mesophilic | IDA | |
| Optimum temperature | 25 °C | IDA | |
| pH range; Optimum | Not determined | ||
| Carbon source | Varied (see Additional file | IDA | |
| MIGS-6 | Habitat | Gorilla gut | IDA |
| MIGS-6.3 | Salinity | 2 % NaCl | IDA |
| MIGS-22 | Oxygen requirement | Facultative anaerobic | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Unknown | |
| Biosafety level | 2 | NAS | |
| Isolation | Gorilla feces | IDA | |
| MIGS-4 | Geographic location | Cameroon | IDA |
| MIGS-5 | Sample collection time | July 2011 | IDA |
| MIGS-4.1 | Latitude | 2° 47′ 2.1768″ | IDA |
| MIGS-4.2 | Longitude | 13° 1′ 49.6986″ | IDA |
| MIGS-4.4 | Altitude | >600 m above sea level | IDA |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement
(i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [37]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements
Fig. 1Phylogenetic tree highlighting the position of Microbacterium gorillae strain G3T relative to other type strains within the Microbacterium genus using 16S rRNA gene. GenBank accession numbers are indicated in parentheses. Sequences were aligned using MUSCLE. Alignments were then cleaned from highly divergent blocks using Gblocks version 0.91b [38]. Maximum likelihood (ML) phylogenetic tree was generated using RAxML [39], employing the GTR GAMMA substitution model with 500 bootstraps. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 500 times to generate a majority consensus tree. Corynebacterium diphtheriae was used as outgroup. The scale bar represents a rate of substitution per nucleotide position of 0.02. (T) indicates that the sequence used in the tree is from the type strain of the species.* indicates the strains used in the tree have a sequenced genome. # indicates that a sequenced genome is available for this species but not for the strain used to build the tree
Fig. 2Gram staining (left panel) and Transmission electron microscopy using a Morgani 268D (Philips) at an operating voltage of 60 kV (right panel) of M. gorillae strain G3T. The scale bar represents 500 nm
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Mate pair and paired end |
| MIGS-29 | Sequencing platforms | MiSeq-Illumina |
| MIGS-31.2 | Fold coverage | 213X |
| MIGS-30 | Assemblers | Spades |
| MIGS-32 | Gene calling method | Prodigal |
| Locus Tag | BN1193 | |
| GenBank ID | CDAR00000000 | |
| GenBank Date of Release | November 04, 2014 | |
| GOLD ID | Gp0025154 | |
| BIOPROJECT | PRJEB7582 | |
| MIGS-13 | Source Material Identifier | G3T |
| Project relevance | DSM 26203, CSUR P207 |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 3,692,770 | 100 |
| DNA coding (bp) | 3,396,745 | 92 |
| DNA G + C (bp) | 2,558,287 | 69.3 |
| DNA scaffolds | 14 | |
| Total genes | 3,566 | 100 |
| Protein coding genes | 3,505 | 98.3 |
| RNA genes | 61 | 1.71 |
| Pseudo genes | 226 | 6.33 |
| Genes in internal clusters | ND | ND |
| Genes with function prediction | 2,412 | 68.8 |
| Genes assigned to COGs | 2,202 | 62.8 |
| Genes with Pfam domains | 0 | 0 |
| Genes with signals peptides | 365 | 10.4 |
| Genes with transmembrane helices | 843 | 24.1 |
| CRISPR repeats | 0 | 0 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
ND: Not determined
Fig. 3Graphical circular map of the Microbacterium gorillae strain G3 T chromosome. The outer two circles show open reading frames oriented in the forward (colored by COG categories) and reverse (colored by COG categories) directions, respectively. The third circle shows the RNA genes (tRNAs green, rRNAs red). The fourth circle shows the G + C% content plot. The inner-most circle shows GC skew, purple indicating negative values whereas olive for positive values
Number of genes associated with the 25 general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 149 | 4.25 | Translation |
| A | 1 | 0.03 | RNA processing and modification |
| K | 269 | 7.67 | Transcription |
| L | 109 | 3.11 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 16 | 0.46 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 41 | 1.17 | Defense mechanisms |
| T | 75 | 2.14 | Signal transduction mechanisms |
| M | 82 | 2.34 | Cell wall/membrane biogenesis |
| N | 1 | 0.03 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 24 | 0.68 | Intracellular trafficking and secretion |
| O | 66 | 1.88 | Posttranslational modification, protein turnover, chaperones |
| C | 150 | 4.28 | Energy production and conversion |
| G | 257 | 7.33 | Carbohydrate transport and metabolism |
| E | 325 | 9.27 | Amino acid transport and metabolism |
| F | 69 | 1.97 | Nucleotide transport and metabolism |
| H | 83 | 2.37 | Coenzyme transport and metabolism |
| I | 151 | 4.31 | Lipid transport and metabolism |
| P | 184 | 5.25 | Inorganic ion transport and metabolism |
| Q | 95 | 2.71 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 410 | 11.70 | General function prediction only |
| S | 145 | 4.14 | Function unknown |
| - | 1303 | 37.17 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome
Genomic comparison of M. gorillae sp. nov., strain G3T with other Microbacterium species.
| Species |
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
|
|
| 1,593 | 1,658 | 1,269 | 1,396 | 1,390 | 1,416 | 1,498 | 1,497 |
|
| 75.91 |
| 2,111 | 1,390 | 1,511 | 1,461 | 1,595 | 1,685 | 1,684 |
|
| 75.22 | 85.33 |
| 1,429 | 1,581 | 1,549 | 1,634 | 1,755 | 1,734 |
|
| 75.39 | 76.16 | 75.51 |
| 1,296 | 1,191 | 1,324 | 1446 | 1,349 |
|
| 75.80 | 76.59 | 76.07 | 76.05 |
| 1414 | 1,602 | 1,638 | 1,580 |
|
| 76.41 | 76.99 | 76.50 | 76.34 | 77.94 |
| 1,395 | 1,433 | 1,512 |
|
| 75.66 | 76.36 | 75.90 | 76.43 | 78.49 | 77.34 |
| 1,710 | 1,632 |
|
| 75.64 | 76.48 | 75.84 | 76.30 | 77.64 | 77.64 | 77.52 |
| 1,723 |
|
| 75.85 | 76.89 | 76.34 | 76.53 | 78.06 | 78.60 | 77.82 | 78.10 |
|
The numbers of orthologous proteins shared between genomes (upper right triangle), average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left triangle) and numbers of proteins per genome (bold)