| Literature DB >> 25780503 |
Carmen Scheuner1, Brian J Tindall1, Megan Lu2, Matt Nolan3, Alla Lapidus3, Jan-Fang Cheng3, Lynne Goodwin2, Sam Pitluck3, Marcel Huntemann3, Konstantinos Liolios3, Ioanna Pagani3, Konstantinos Mavromatis3, Natalia Ivanova3, Amrita Pati3, Amy Chen4, Krishna Palaniappan4, Cynthia D Jeffries5, Loren Hauser5, Miriam Land5, Romano Mwirichia6, Manfred Rohde7, Birte Abt1, John C Detter2, Tanja Woyke3, Jonathan A Eisen8, Victor Markowitz4, Philip Hugenholtz9, Markus Göker1, Nikos C Kyrpides10, Hans-Peter Klenk1.
Abstract
Planctomyces brasiliensis Schlesner 1990 belongs to the order Planctomycetales, which differs from other bacterial taxa by several distinctive features such as internal cell compartmentalization, multiplication by forming buds directly from the spherical, ovoid or pear-shaped mother cell and a cell wall consisting of a proteinaceous layer rather than a peptidoglycan layer. The first strains of P. brasiliensis, including the type strain IFAM 1448(T), were isolated from a water sample of Lagoa Vermelha, a salt pit near Rio de Janeiro, Brasil. This is the second completed genome sequence of a type strain of the genus Planctomyces to be published and the sixth type strain genome sequence from the family Planctomycetaceae. The 6,006,602 bp long genome with its 4,811 protein-coding and 54 RNA genes is a part of the G enomic E ncyclopedia of Bacteria and Archaea project. Phylogenomic analyses indicate that the classification within the Planctomycetaceae is partially in conflict with its evolutionary history, as the positioning of Schlesneria renders the genus Planctomyces paraphyletic. A re-analysis of published fatty-acid measurements also does not support the current arrangement of the two genera. A quantitative comparison of phylogenetic and phenotypic aspects indicates that the three Planctomyces species with type strains available in public culture collections should be placed in separate genera. Thus the genera Gimesia, Planctopirus and Rubinisphaera are proposed to accommodate P. maris, P. limnophilus and P. brasiliensis, respectively. Pronounced differences between the reported G + C content of Gemmata obscuriglobus, Singulisphaera acidiphila and Zavarzinella formosa and G + C content calculated from their genome sequences call for emendation of their species descriptions. In addition to other features, the range of G + C values reported for the genera within the Planctomycetaceae indicates that the descriptions of the family and the order should be emended.Entities:
Keywords: GEBA; Gram-negative; Halotolerant; Non-peptidoglycan bacteria; Planctomycetales; Planctomycetes; Stalked bacteria; Taxonomic descriptions
Year: 2014 PMID: 25780503 PMCID: PMC4334474 DOI: 10.1186/1944-3277-9-10
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of relative to the other species within the family . The tree was inferred from 1,343 aligned characters of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion as previously described [15]. Rooting was done initially using the midpoint method [17] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers above the branches are support values from 400 ML bootstrap replicates (left) and from 1,000 Maximum-Parsimony bootstrap replicates (right) if larger than 60% [15]. Lineages with type strain genome sequencing projects registered in GOLD [18] as unpublished are marked with one star, those listed as published (as well as the target organism) with two stars [19-22].
Classification and general features of DSM 5305 in accordance with the MIGS recommendations[23] as published by the Genome Standards Consortium [24]
| | Current classification | Domain | TAS [ |
| Phylum | TAS [ | ||
| Class " | TAS [ | ||
| Order " | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain DSM 5305 | TAS [ | ||
| | Gram stain | negative | TAS [ |
| | Cell shape | sphere shaped | TAS [ |
| | Motility | motile | TAS [ |
| | Sporulation | none | TAS [ |
| | Temperature range | mesophile | TAS [ |
| | Optimum temperature | 27°C-35°C | TAS [ |
| | Salinity | halotolerant | TAS [ |
| MIGS-22 | Oxygen requirement | aerobe | TAS [ |
| | Carbon source | several sugars, such as D-cellobiose, maltose, trehalose | TAS [ |
| | Energy metabolism | chemoorganotroph | TAS [ |
| MIGS-6 | Habitat | marine fresh water | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | not reported | NAS |
| | Biosafety level | 1 | NAS [ |
| | Isolation | water from salt pit | TAS [ |
| MIGS-4 | Geographic location | Lagoa Vermelha, Brazil | TAS [ |
| MIGS-5 | Sample collection time | November 1982 | TAS [ |
| MIGS-4.1 | Latitude | −22.929 | NAS |
| MIGS-4.2 | Longitude | −42.390 | NAS |
| MIGS-4.3 | Depth | surface water | TAS [ |
| MIGS-4.4 | Altitude | 0 m, sea level | NAS |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [34].
Figure 2Scanning-electron micrograph of DSM 5305 highlighting stalks and crateriform structures on the cell surface.
Genome sequencing project information
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, one 454 PE library (14 kb insert size), one Illumina library |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 124.8 × Illumina; 91.0 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.0.00.20-PostRelease-10-28-2008-g-3.4.6, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| | INSDC ID | CP002546 |
| | Genbank Date of Release | March 2, 2011 |
| | GOLD ID | Gc01674 |
| | NCBI project ID | 47863 |
| | Database: IMG-GEBA | 2503707005 |
| MIGS-13 | Source material identifier | DSM 5305 |
| Project relevance | Tree of Life, GEBA |
Genome statistics
| Genome size (bp) | 6,006,602 | 100.00% |
| DNA coding region (bp) | 5,145,779 | 85.67% |
| DNA G + C content (bp) | 3,390,645 | 56.45% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 4,865 | 100.00% |
| RNA genes | 54 | 1.11% |
| rRNA operons | 2 | |
| Protein-coding genes | 4,811 | 98.89% |
| Pseudo genes | 61 | 1.25% |
| Genes with function prediction | 2,800 | 57.55% |
| Genes in paralog clusters | 2,351 | 48.32% |
| Genes assigned to COGs | 3,220 | 66.19% |
| Genes assigned Pfam domains | 3,439 | 70.69% |
| Genes with signal peptides | 645 | 13.26% |
| Genes with transmembrane helices | 1,210 | 24.87% |
| CRISPR repeats | 2 |
Figure 3Graphical circular map of chromosome. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew (purple/olive).
Number of genes associated with the general COG functional categories
|
| |||
|---|---|---|---|
| J | 149 | 4.0 | Translation, ribosomal structure and biogenesis |
| A | 3 | 0.0 | RNA processing and modification |
| K | 216 | 5.7 | Transcription |
| L | 223 | 5.9 | Replication, recombination and repair |
| B | 4 | 0.1 | Chromatin structure and dynamics |
| D | 32 | 0.9 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 83 | 2.2 | Defense mechanisms |
| T | 213 | 5.7 | Signal transduction mechanisms |
| M | 237 | 6.3 | Cell wall/membrane/envelope biogenesis |
| N | 193 | 5.1 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 250 | 6.7 | Intracellular trafficking, secretion, and vesicular transport |
| O | 140 | 3.7 | Posttranslational modification, protein turnover, chaperones |
| C | 166 | 4.4 | Energy production and conversion |
| G | 190 | 5.1 | Carbohydrate transport and metabolism |
| E | 208 | 5.6 | Amino acid transport and metabolism |
| F | 61 | 1.6 | Nucleotide transport and metabolism |
| H | 131 | 3.5 | Coenzyme transport and metabolism |
| I | 98 | 2.6 | Lipid transport and metabolism |
| P | 258 | 6.9 | Inorganic ion transport and metabolism |
| Q | 68 | 1.8 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 461 | 12.3 | General function prediction only |
| S | 356 | 9.5 | Function unknown |
| - | 1,645 | 33.8 | Not in COGs |
Genomic G + C content of the type strains, including as outgroup
|
| |||
|---|---|---|---|
| 53.6 - 57.4 [ | 57.04* | AANZ00000000 (67); NZ_AANZ00000000 (4)* | |
| 64.4 ± 1.0 [ | 67.18 | ABGO00000000 (923) | |
| 62.2 [ | 62.49 [ | CP002353, CP002354 | |
| 73.0 [ | 73.22 | AP012338, AP012339 | |
| 56.4 ± 0.4 - 57.4 ± 0.3 [ | 57.46 [ | CP001848 | |
| 55.1 - 57.7 [ | 56.45 | CP002546 | |
| 53.24 ± 0.59 [ | 53.68 [ | CP001744, CP001745 | |
| 50.5 [ | 50.45 | ABCE00000000 (126) | |
| 55.0 [ | 55.40 [ | BX119912 | |
| 56.3 [ | 55.66* | AHZR00000000 (112); NZ_AHZR00000000 (24)* | |
| 59.9 [ | 62.23* | AGRX00000000 (113); CP003364 - CP003367* | |
| 62.5 [ | 59.10* | IMG ID: 2548877000 (594); NZ_AIAB00000000 (106)* |
*Denote G + C content values calculated from the updated version of the genome sequence, which is also marked with an asterisk.
Figure 4Phylogenetic tree inferred from the MARE-filtered supermatrix under the maximum likelihood (ML) criterion [82] and rooted with . The branches are scaled in terms of the expected number of substitutions per site. Numbers above the branches (from left to right) are bootstrapping support values [87] (if larger than 60%) from ML/MP MARE-filtered supermatrix; ML/MP unfiltered (full) supermatrix; ML/MP core-genes supermatrix; ML/MP gene-content matrix; ML/MP ortholog-content matrix. Values larger than 95% are shown in bold; dots indicate branches with maximum support under all settings.
Pairwise comparison of the four species using GGDC, formula 2 (DDH estimates based on identities / HSP length)*
| 100.00% | 18.60% ± 2.27 | 21.40% ± 2.34 | 20.90% ± 2.33 | |
| 18.60% ± 2.27 | 100.00% | 24.10% ± 2.39 | 23.20% ± 2.38 | |
| 21.40% ± 2.34 | 24.10% ± 2.39 | 100.00% | 22.00% ± 2.35 | |
| 20.90% ± 2.33 | 23.20% ± 2.38 | 22.00% ± 2.35 | 100.00% |
*The confidence intervals indicate the inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size); see [91] for details. The distance formulas are explained in [89]; formula 2 is recommended, particularly for draft genomes.
Figure 5Venn diagram depicting the intersections of sets of homologous proteins of , , and . Their cardinalities are given in parentheses; for the total number of proteins see Table 3 and the resources listed in Table 5. The Venn diagram was calculated with the corresponding R package [92].
Figure 6Heat map of fatty-acid measurements (A[63], B [65], C [36], D [66], E [68], F [55], G [67], H [70], I [71], J [48]]) from the analyzed (see above). The heat map was calculated with the R package opm [94] with arcsine-square root transformation and Ward clustering applied. Asterisks denote fatty-acid profiles whose sum deviates more than five percent from 100%.
MaSH values calculated for the ML trees inferred in this study
| 0.027 | ~ | ~ | ~ | ~ | ~ | |
| 0.032 | ~ | ~ | ~ | ~ | ~ | |
| 0.072 | 0.414 | 0.272 | 0.287 | 0.166 | 0.185 | |
| 0.074 | 0.382 | 0.240 | 0.245 | 0.193 | 0.215 | |
| 0.086 | ~ | ~ | ~ | ~ | ~ | |
| 0.090 | ~ | ~ | ~ | ~ | ~ | |
| 0.100 | 0.422 | 0.305 | 0.309 | 0.177 | 0.209 | |
| 0.101 | 0.378 | 0.272 | 0.267 | 0.164 | 0.171 | |
| 0.119 | 0.346 | 0.280 | 0.275 | 0.213 | 0.236 | |
| 0.121 | 0.480 | 0.338 | 0.333 | 0.198 | 0.220 | |
| 0.125 | 0.539 | 0.353 | 0.382 | 0.379 | 0.347 | |
| 0.245 | 0.633 | 0.579 | 0.522 | 0.381 | 0.448 | |
| 0.297 | 0.658 | 0.523 | 0.522 | 0.410 | 0.382 | |
| 0.308 | 0.734 | 0.682 | 0.588 | 0.427 | 0.495 | |
| 0.639 | 1.258 | 1.118 | 1.082 | 0.558 | 0.649 |
*Indicate organisms that were only present in the 16S rRNA gene tree.
Polyamines[96 ] and polar lipids [48,65,70,95] for several as reported in the literature
| | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| NR2 | NR | NR | NR | + | - | + | - | + | - | |
| NR | NR | NR | NR | - | - | - | - | + | - | |
| - | - | - | + | + | - | - | - | + | - | |
| - | - | - | + | + | - | + | - | + | - | |
| - | - | - | + | NR | NR | NR | NR | NR | NR | |
| - | - | + | + | + | - | - | - | + | + | |
| - | - | + | + | + | + | - | - | + | - | |
| - | + | + | - | + | + | - | - | + | - | |
| - | - | - | + | + | + | + | + | + | + | |
| + | + | - | + | + | + | + | + | + | - | |
| NR | NR | NR | NR | - | - | + | - | + | - | |
| NR | NR | NR | NR | - | - | + | - | + | - | |
1Phospholipids: DPG Diphosphatidylglygerol; Glyl Glycolipid; PC Phosphatidylcholine; DPE Phosphatidyl-dimethylethanolamine; PG Phosphatidylglycerol; MPE Phosphatidyl-monomethylethanolamine.
2Not reported in the literature.