| Literature DB >> 25197452 |
Michael Visser1, Sofiya N Parshina2, Joana I Alves3, Diana Z Sousa4, Inês A C Pereira5, Gerard Muyzer6, Jan Kuever7, Alexander V Lebedinsky2, Jasper J Koehorst8, Petra Worm1, Caroline M Plugge1, Peter J Schaap8, Lynne A Goodwin9, Alla Lapidus10, Nikos C Kyrpides11, Janine C Detter12, Tanja Woyke11, Patrick Chain9, Karen W Davenport9, Stefan Spring13, Manfred Rohde14, Hans Peter Klenk13, Alfons J M Stams4.
Abstract
Desulfotomaculum nigrificans and D. carboxydivorans are moderately thermophilic members of the polyphyletic spore-forming genus Desulfotomaculum in the family Peptococcaceae. They are phylogenetically very closely related and belong to 'subgroup a' of the Desulfotomaculum cluster 1. D. nigrificans and D. carboxydivorans have a similar growth substrate spectrum; they can grow with glucose and fructose as electron donors in the presence of sulfate. Additionally, both species are able to ferment fructose, although fermentation of glucose is only reported for D. carboxydivorans. D. nigrificans is able to grow with 20% carbon monoxide (CO) coupled to sulfate reduction, while D. carboxydivorans can grow at 100% CO with and without sulfate. Hydrogen is produced during growth with CO by D. carboxydivorans. Here we present a summary of the features of D. nigrificans and D. carboxydivorans together with the description of the complete genome sequencing and annotation of both strains. Moreover, we compared the genomes of both strains to reveal their differences. This comparison led us to propose a reclassification of D. carboxydivorans as a later heterotypic synonym of D. nigrificans.Entities:
Keywords: Clostridiales; Peptococcaceae; Thermophilic spore-forming anaerobes; carboxydotrophic; sulfate reduction
Year: 2014 PMID: 25197452 PMCID: PMC4149029 DOI: 10.4056/sigs.4718645
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Neighbor joining tree based on 16S rRNA sequences showing the phylogenetic affiliation of and related species divided in the subgroups of cluster 1. DSM 574 and DSM 14880 are in bold type. The sequences of different were used as outgroup, but were pruned from the tree. Closed circles represent bootstrap values between 75 and 100%. The scale bar represents 10% sequence divergence.
Figure 2Scanning electron microscopic photograph of DSM 574
Figure 3Scanning electron microscopic photograph of DSM 14880
Classification and general features of DSM 574 according to the MIGS recommendations [10]
| MIGS ID | Property | Term | Evidence |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Gram stain | negative, with a Gram-positive cell wall structure | ||
| Cell shape | rods, rounded ends, sometimes paired | TAS [ | |
| Motility | Slight tumbling, peritrichous flagella | TAS [ | |
| Sporulation | oval, terminal or subterminal, slightly swelling the cell | TAS [ | |
| Temperature range | 30-70 °C | TAS [ | |
| Optimum temperature | 55 °C | TAS [ | |
| Carbon source | glucose and other carbohydrates | TAS [ | |
| Energy source | heterotrophic | TAS [ | |
| Electron acceptor | sulfate, thiosulfate and sulfite. | TAS [ | |
| MIGS-6 | Habitat | soils, compost heaps, thermal spring | TAS [ |
| MIGS-6.3 | Salinity | not reported | |
| MIGS-22 | Oxygen | obligate anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | TAS [ |
| MIGS-4 | Geographic location | Delft, The Netherlands | |
| MIGS-5 | Sample collection time | ||
| MIGS-4.1 | Latitude | 52.011 | |
| MIGS-4.2 | Longitude | 4.360 | |
| MIGS-4.3 | Depth | not reported |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [22].
Classification and general features of DSM 14880 according to the MIGS recommendations [10]
| MIGS ID | Property | Term | Evidence |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Gram stain | negative, with a Gram-positive cell wall structure | TAS [ | |
| Cell shape | rods, rounded ends, sometimes paired. | TAS [ | |
| Motility | twisting and tumbling motion | TAS [ | |
| Sporulation | oval, terminal or subterminal | TAS [ | |
| Temperature range | 30-68°C | TAS [ | |
| Optimum temperature | 55°C | TAS [ | |
| Carbon source | 100% CO, with and without sulfate | TAS [ | |
| Energy source | hydrogenogenic and heterotrophic growth | TAS [ | |
| Electron acceptor | sulfate, thiosulfate and sulfite. | TAS [ | |
| MIGS-6 | Habitat | Paper mill waste water sludge | |
| MIGS-6.3 | Salinity | 0-17 g NaCl l-1 | TAS [ |
| MIGS-22 | Oxygen | obligate anaerobe | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | none | |
| MIGS-4 | Geographic location | Eerbeek, the Netherlands | TAS [ |
| MIGS-5 | Sample collection time | 1999 | TAS [ |
| MIGS-4.1 | Latitude | 52.104217 | TAS [ |
| MIGS-4.2 | Longitude | 6.060133 | TAS [ |
| MIGS-4.3 | Depth | not reported |
Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [22].
Genome sequencing project information of DSM 574 and DSM 14880.
| | | | |
|---|---|---|---|
| MIGS-31 | Finishing quality | Permanent draft | Finished |
| MIGS-28 | Libraries used | Three genomic libraries: 454 standard library, 454 PE libraries (7kb insert size), one Illumina library | Four genomic libraries: one 454 pyrosequence standard library, two 454 PE libraries (4kb and 11 kb insert size), one Illumina library |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 462.8 × Illumina; 35.2 × pyrosequence | 116.8 × Illumina; 50.6 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3-PreRelease-June 30,2009, VELVET version 1.0.13, phrap version SPS - 4.24 | Newbler version 2.3-PreRelease-June 30, 2009, VELVET version 1.0.13, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP | Prodigal 1.4, GenePRIMP |
| INSDC ID | AEVP00000000 | CP002736.1 | |
| Genome Database release | December 10, 2010 | August 13, 2012 | |
| Genbank Date of Release | February 17, 2011 | May 23, 2011 | |
| MIGS-13 | GOLD ID | Gi03933 | Gc01783 |
| Project relevance | Obtain insight into the phylogenetic and physiological diversity of | Obtain insight into the phylogenetic and physiological diversity of |
Genome statistics of DSM 574 (A) and DSM 14880 (B)
| A. Genome (total) | B. Genome (total) | |||
|---|---|---|---|---|
| Value | % of total | Value | % of total | |
| Genome size (bp) | 3,052,787 | 100 | 2,892,255 | 100.00 |
| DNA coding region (bp) | 2,595,629 | 85.02 | 2457154 | 84.96 |
| DNA G+C content (bp) | 1,412,511 | 46.28 | 1,348,537 | 46.63 |
| Total genes | 3,112 | 100 | 2,844 | 100 |
| RNA genes | 98 | 3.15 | 97 | 3.41 |
| Protein-coding genes | 3,014 | 96.85 | 2,747 | 96.59 |
| Genes in paralog clusters | 1,542 | 49.55 | 1,363 | 47.93 |
| Genes assigned to COGs | 2,340 | 75.19 | 2,174 | 76.44 |
| Pseudo genes | 137 | 4.40 | 88 | 3.09 |
| Genes with signal peptides | 582 | 18.70 | 504 | 17.72 |
| Genes with transmembrane helices | 721 | 23.17 | 647 | 22.75 |
Number of DSM 574 and DSM 14880 genes associated with the general COG functional categories.
| DSM 574 | DSM 14880 | ||||
|---|---|---|---|---|---|
| Code | Description | Value | %agea | Value | %agea |
| J | Translation | 153 | 5.97 | 152 | 3.39 |
| A | RNA processing and modification | 1 | 0.04 | 0 | 0.00 |
| K | Transcription | 153 | 5.97 | 139 | 5.85 |
| L | Replication, recombination and repair | 210 | 8.20 | 172 | 7.23 |
| B | Chromatin structure and dynamics | 1 | 0.04 | 1 | 0.04 |
| D | Cell cycle control, mitosis and meiosis | 45 | 1.76 | 45 | 1.89 |
| Y | Nuclear structure | 0 | 0.00 | 0 | 0.00 |
| V | Defense mechanisms | 22 | 0.86 | 22 | 0.93 |
| T | Signal transduction mechanisms | 171 | 6.71 | 148 | 6.22 |
| M | Cell wall/membrane biogenesis | 132 | 5.15 | 126 | 5.3 |
| N | Cell motility | 70 | 2.73 | 68 | 2.86 |
| Z | Cytoskeleton | 0 | 0.00 | 0 | 0.00 |
| W | Extracellular structures | 0 | 0.00 | 0 | 0.00 |
| U | Intracellular trafficking and secretion | 65 | 2.54 | 64 | 2.69 |
| O | Posttranslational modification, protein turnover, chaperones | 83 | 3.24 | 85 | 3.57 |
| C | Energy production and conversion | 217 | 8.47 | 211 | 8.87 |
| G | Carbohydrate transport and metabolism | 125 | 4.88 | 98 | 4.12 |
| E | Amino acid transport and metabolism | 224 | 8.74 | 216 | 9.08 |
| F | Nucleotide transport and metabolism | 62 | 2.42 | 60 | 2.52 |
| H | Coenzyme transport and metabolism | 134 | 5.23 | 133 | 5.59 |
| I | Lipid transport and metabolism | 40 | 1.56 | 36 | 1.51 |
| P | Inorganic ion transport and metabolism | 104 | 4.06 | 101 | 4.25 |
| Q | Secondary metabolites biosynthesis, transport and catabolism | 29 | 1.13 | 27 | 1.14 |
| R | General function prediction only | 261 | 10.19 | 250 | 10.51 |
| S | Function unknown | 241 | 9.41 | 224 | 9.42 |
| - | Not in COGs | 772 | 24.81 | 670 | 23.56 |
a) The total is based on the total number of protein coding genes in the annotated genome.
Description of genes present in the variable regions depicted in Figure 4.
| | |
|---|---|
| 1A | Transposases, recombinases, transport proteins, isomerases, histidine kinase and threonine dehydrogenase |
| 1B | Transposases, recombinases, resolvase and alcohol dehydrogenase |
| 1C | Helicases, DNA-methylation, endonuclease and recombinase |
| 1D | TRAP transporter, Threonine dehydrogenase, 2 keto-4-petnenoate hydratase, sugar kinase, aldolase, glycerol dehydrogenase and mannonate dehydratase |
| 1E | Pilus assembly, proteases and hypothetical proteins dominate this variable region |
| 1F | Protease, DNA methylase, RNA polymerase, recombinase, cytochrome c biogenesis, Fe2+ transport system and many hypothetical proteins |
| 1G | Transposase, secretory protein secB, nucleotide sugar dehydrogenase, glycosyltransferase, sugar epimerase, O-antigen ligase and copper amine oxidase |
| 1H | Pyruvate ferredoxin oxidareductase, transport proteins, sugar phosphate permease, threonine dehydrogenase, transporsase, DNA methylase and endonuclease |
| 1I | Growth inhibitor protein, terminase, phage portal protein, secretory protein, recombinase and many hypothetical proteins |
| 2A | Endonuclease, DNA methylase, transposase, ATP binding protein, ATPase, threonine kinase, pyridoxamine 5’phosphate oxidase, ferric reductase, many hypothetical proteins and the CODH-ECH complex |
| 2B | CRISPR-Cas |
| 2C | DNA-helicases, -methyltransferase, and -replication protein, restriction protein and many hypothetical proteins |
| 2D | Urea metabolism |
| 2E | Mainly transport proteins and agmatinase |
| 2F | Alpha ribazole phosphatase, metal dependent phosphohydrolase, phenylacetate-CoA ligase, methyltransferase, amine oxidase, aldehyde dehydrogenase, transposase, phage tail component and many hypothetical proteins |
| 2G | Pilus associated proteins |
| 2H | Recombinase, integrase, AAA ATPase, restriction modification system, deoxyribonuclease |
| 2I | Many transferase proteins |
Figure 5Organization of cooS and neighboring genes in DSM 574 (A) and DSM 14880 (B). Abbreviations: HP, hypothetical protein; TR, transcriptional regulator.
Figure 6Schematic representation of putative formate dehydrogenases in the genome of DSM 574 and DSM 14880 (A). Including the hypothesized electron interaction of the putative extracellular membrane bound formate dehydrogenase with the putative fumarate reductase (B). The electron acceptor fumarate is reduced to succinate by using formate as an electron donor. Gene locus tag numbers and α-, β-, and γ-subunits are depicted. Moreover, predicted iron-sulfur clusters and other metal-binding sites are indicated.
Figure 7Venn diagram showing a comparison of the protein coding genes of DSM 574 and DSM 14880. The number of overlapping protein coding genes is given inside the areas of the circles and the total number of derived protein sequences used for each strain is shown in parentheses. The figure was created using the program Venn diagram plotter available from the Pacific Northwest National Laboratory Software Distribution Center [48].