| Literature DB >> 26649148 |
Rita Aghnatios1, Caroline Cayrou1, Marc Garibal1, Catherine Robert1, Said Azza1, Didier Raoult1, Michel Drancourt1.
Abstract
Gemmata massiliana is a new Planctomycetes bacterium isolated from a hospital water network in France, using a new culture medium. It is an aerobic microorganism with optimal growth at pH 8, at 30 °C and salinity ≤ 1.25 % NaCl. G. massiliana is resistant to β-lactam antibiotics, due to lack of peptidoglycan in its cell wall.G. massiliana shares a 97 % 16S rRNA gene sequence similarity with the nearest species, Gemmata obscuriglobus; and 99 % similarity with unnamed soil isolates. Its 9,249,437-bp genome consists in one chromosome and no detectable plasmid and has a 64.07 % G + C content, 32.94 % of genes encoding for hypothetical proteins. The genome contains an incomplete 19.6-kb phage sequence, 26 CRISPRs, 3 CAS and 15 clusters of secondary metabolites. G. massiliana genome increases knowledge of a poorly known world of bacteria.Entities:
Keywords: Culture; Gemmata; Gemmata massiliana; Gemmata obscuriglobus; Genome; Hospital water network; Planctomycetes
Year: 2015 PMID: 26649148 PMCID: PMC4672568 DOI: 10.1186/s40793-015-0103-0
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Gemmata massiliana strain IIL30T
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain: | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | IDA | ||
| Type strain: IIL30 CSUR P189T | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Coccus | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Nonsporulating | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 30°C | IDA | |
| pH range; Optimum | 6–8; 8 | IDA | |
| Carbon source | Unknown | ||
| GS-6 | Habitat | Hospital water | IDA |
| MIGS-6.3 | Salinity | 0 % NaCl (w/v) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Free-living | IDA |
| MIGS-14 | Pathogenicity | Unknown | |
| MIGS-4 | Geographic location | France/Marseille | IDA |
| MIGS-5 | Sample collection | October 2011 | IDA |
| MIGS-4.1 | Latitude | 43.3 | IDA |
| MIGS-4.2 | Longitude | 5.4 | |
| MIGS-4.4 | Altitude | Unknown |
a Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [45]
Fig. 1Gemmata massiliana phylogenetic position amongst other Planctomycetes species. The following Phylogenetic tree shows G. massiliana strain CSUR P189T position relatively to Gemmata obscuriglobus, to undescribed Gemmata species and to other Planctomycetes. Sequences were aligned using CLUSTALW [39], and phylogenetic inferences obtained using the neighbor-joining method within the MEGA software [40] only bootstrap values ≥95 % are indicated at nodes. T. maritima (M21774) was used as an outgroup. The scale bar represents a 5 % nucleotide sequence divergence
Fig. 2Gram staining of G. massiliana strain CSUR P189T. The bar scale reprensents 10 μm
Fig. 3G. massiliana transmission electron microscopy. We observed two size-different populations of G. massiliana strain CSUR P189T, using a Morgagni 268D (Philips) at an operating voltage of 60kV. The scale bar represents 500 nm
Fig. 4G. massiliana two cell variants as demonstrated by flow cytometry. Representative histogram of the two cell variants in a G. massiliana sample shown in purple, was generated by a BD LSRFortessa (Le Pont de Claix, France) and FACSDiva software (version 6.2)
Fig. 5Reference mass spectrum of G. Massiliana strain CSUR P189T. This reference spectrum was generated from 10 spectra corresponding to 10 deposited colonies of G. massiliana
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | 454 paired end 3-kb and shotgun XL+ libraries |
| MIGS-29 | Sequencing platforms | 454 Roche Titanium |
| MIGS-31.2 | Fold coverage | 22.4× |
| MIGS-30 | Assemblers | Newbler version 2.8 |
| MIGS-32 | Gene calling method | CLC genomics workbench 6.0.1 |
| Genbank ID | CBXA000000000.1 | |
| Genbank Date of Release | January 20,2014 | |
| GOLD ID | Gp0033443 | |
| BIOPROJECT | PRJEB621 | |
| MIGS-13 | Source Material Identifier | DSM 26013 |
| Project relevance | Medical, hospital water network |
Fig. 6Graphical circular map of the chromosome. From outside to the center: Genes on the forward strand colored by COG categories (only genes assigned to COG), genes on the reverse strand colored by COG categories (only gene assigned to COG), RNA genes (tRNAs green, rRNAs red), GC content and GC skew
Genome statistics
| Attribute | Value | % of total |
|---|---|---|
| Genome size (bp) | 9,249,437 | 100 |
| DNA coding (bp) | 7,995,313 | 86.44 |
| DNA G + C (bp) | 5,925,688 | 64.07 |
| DNA scaffolds | 22 | 100 |
| Total genes | 8,065 | 100 |
| Protein-coding genes | 7,985 | 99.01 |
| RNA genes | 80 | 0.99 |
| Pseudo genes | NA | - |
| Genes in internal clusters | 3497 | 43.79 |
| Genes with function prediction | 3,237 | 40.53 |
| Genes assigned to COGs | 2,443 | 30.59 |
| Genes with Pfam domains | 6,088 | 76.24 |
| Genes with signal peptides | 1,979 | 24.78 |
| Genes with transmembrane helices | 1,515 | 18.97 |
| CRISPR repeats | 26 | 100 |
The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 168 | 2.10 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.01 | RNA processing and modification |
| K | 360 | 4.51 | Transcription |
| L | 404 | 5.06 | Replication, recombination and repair |
| B | 2 | 0.3 | Chromatin structure and dynamics |
| D | 28 | 0.35 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 98 | 1.23 | Defense mechanisms |
| T | 419 | 5.25 | Signal transduction mechanisms |
| M | 260 | 3.26 | Cell wall/membrane biogenesis |
| N | 94 | 1.18 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 1 | 0.01 | Extracellular structures |
| U | 99 | 1.24 | Intracellular trafficking and secretion |
| O | 170 | 2.13 | Posttranslational modification, protein turnover, chaperones |
| C | 240 | 3.01 | Energy production and conversion |
| G | 240 | 3.01 | Carbohydrate transport and metabolism |
| E | 277 | 3.47 | Amino acid transport and metabolism |
| F | 63 | 0.79 | Nucleotide transport and metabolism |
| H | 130 | 1.63 | Coenzyme transport and metabolism |
| I | 125 | 1.57 | Lipid transport and metabolism |
| P | 188 | 2.35 | Inorganic ion transport and metabolism |
| Q | 143 | 1.79 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 766 | 9.59 | General function prediction only |
| S | 368 | 4.61 | Function unknown |
| - | 4136 | 51.80 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated geno