| Literature DB >> 22768363 |
Birte Abt, Cliff Han, Carmen Scheuner, Megan Lu, Alla Lapidus, Matt Nolan, Susan Lucas, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Roxanne Tapia, Lynne A Goodwin, Sam Pitluck, Konstantinos Liolios, Ioanna Pagani, Natalia Ivanova, Konstantinos Mavromatis, Natalia Mikhailova, Marcel Huntemann, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Evelyne-Marie Brambilla, Manfred Rohde, Stefan Spring, Sabine Gronow, Markus Göker, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, John C Detter.
Abstract
Spirochaeta coccoides Dröge et al. 2006 is a member of the genus Spirochaeta Ehrenberg 1835, one of the oldest named genera within the Bacteria. S. coccoides is an obligately anaerobic, Gram-negative, non-motile, spherical bacterium that was isolated from the hindgut contents of the termite Neotermes castaneus. The species is of interest because it may play an important role in the digestion of breakdown products from cellulose and hemicellulose in the termite gut. Here we provide a taxonomic re-evaluation for strain SPN1(T), and based on physiological and genomic characteristics, we propose its reclassification as a novel species in the genus Sphaerochaeta, a recently published sister group of the Spirochaeta. The 2,227,296 bp long genome of strain SPN1(T) with its 1,866 protein-coding and 58 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Gram-negative; Sphaerochaeta; Spirochaetaceae; chemoorganotrophic; di- and oligosaccharide-degrading; mesophilic; non-motile; obligately anaerobic; termite hindgut
Year: 2012 PMID: 22768363 PMCID: PMC3388779 DOI: 10.4056/sigs.2796069
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of S. coccoides relative to the other type strains within the family Spirochaetaceae. The tree was inferred from 1,360 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood criterion [11]. Rooting was done initially using the midpoint method [12] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 500 ML bootstrap replicates [13] (left) and from 1,000 maximum parsimony bootstrap replicates [14] (right) if larger than 60% if. Lineages with type strain genome sequencing projects registered in GOLD [15] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks (see [16-19], CP002696 for Treponema brennaborense, CP002903 for S. thermophila, and CP002868 for S. caldaria). Also, genomes that are finished but are missing a second asterisk are S. africana CP003282, S. pleomorpha CP003155 and S. globosa CP002541.
Classification and general features of S. coccoides SPN1T according to the MIGS recommendations [20] and the NamesforLife database [21].
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain SPN1 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | coccoid | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 30°C | TAS [ | |
| Salinity | not reported | ||
| MIGS-22 | Oxygen requirement | obligately anaerobic | TAS [ |
| Carbon source | pentoses (arabinose, xylose), oligosaccharides (maltose, cellobiose, maltotriose, maltotetraose), | TAS [ | |
| Energy metabolism | chemoorganotroph | TAS [ | |
| MIGS-6 | Habitat | digestive tract of lower dry-wood termites | TAS [ |
| MIGS-15 | Biotic relationship | host associated commensal | TAS [ |
| MIGS-14 | Pathogenicity | none | TAS [ |
| Biosafety level | 1 | TAS [ | |
| Isolation | hindgut of | TAS [ | |
| MIGS-4 | Geographic location | not reported | |
| MIGS-5 | Sample collection time | 2005 or before | TAS [ |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [27]. If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or an expert or reputable institution mentioned in the acknowledgements.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 960.0 × Illumina; 40.0 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3, Velvet version 0.7.63, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002659 | |
| Genbank Date of Release | April 27, 2011 | |
| GOLD ID | Gc01739 | |
| NCBI project ID | 48121 | |
| Database: IMG-GEBA | 2503904012 | |
| MIGS-13 | Source material identifier | DSM 17374 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Value | % of Total | |
|---|---|---|
| Genome size (bp) | 2,227,296 | 100.00% |
| DNA coding region (bp) | 2,003,786 | 89.96% |
| DNA G+C content (bp) | 1,126,077 | 50.56% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 1,924 | 100.00% |
| RNA genes | 58 | 3.01% |
| rRNA operons | 3 | |
| Protein-coding genes | 1,866 | 96.99% |
| Pseudo genes | 44 | 2.29% |
| Genes with function prediction | 1,434 | 74.53% |
| Genes in paralog clusters | 733 | 38.10% |
| Genes assigned to COGs | 1,528 | 76.72% |
| Genes assigned Pfam domains | 1,518 | 78.90% |
| Genes with signal peptides | 314 | 16.32% |
| Genes with transmembrane helices | 524 | 27.23% |
| CRISPR repeats | 4 |
Figure 2Graphical map of the chromosome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| Code | value | %age | Description |
|---|---|---|---|
| J | 143 | 8.5 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 118 | 7.0 | Transcription |
| L | 99 | 5.9 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 58 | 3.5 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 25 | 1.5 | Defense mechanisms |
| T | 59 | 3.5 | Signal transduction mechanisms |
| M | 46 | 2.7 | Cell wall/membrane/envelope biogenesis |
| N | 2 | 0.1 | Cell motility |
| Z | 39 | 2.4 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 19 | 1.1 | Intracellular trafficking, secretion, and vesicular transport |
| O | 54 | 3.2 | Posttranslational modification, protein turnover, chaperones |
| C | 77 | 4.6 | Energy production and conversion |
| G | 260 | 15.5 | Carbohydrate transport and metabolism |
| E | 160 | 9.5 | Amino acid transport and metabolism |
| F | 58 | 3.6 | Nucleotide transport and metabolism |
| H | 42 | 2.5 | Coenzyme transport and metabolism |
| I | 44 | 2.6 | Lipid transport and metabolism |
| P | 58 | 3.6 | Inorganic ion transport and metabolism |
| Q | 15 | 0.9 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 193 | 11.5 | General function prediction only |
| S | 109 | 6.5 | Function unknown |
| - | 396 | 20.6 | Not in COGs |
Result (p-values) from the test of individual links with ParaFit
| Species | p-value, constraint 1 | p-value, constraint 2 |
|---|---|---|
| 0.2882 | 0.0038 | |
| 0.2844 | 0.0230 | |
| 0.2754 | 0.0201 | |
| 0.2076 | 0.0080 | |
| 0.2001 | 0.0149 | |
| 0.1905 | 0.0145 | |
| 0.1830 | 0.0280 | |
| 0.1806 | 0.0124 | |
| 0.1765 | 0.0490 | |
| 0.1749 | 0.0278 | |
| 0.1656 | 0.0241 | |
| 0.1654 | 0.0039 | |
| 0.1592 | 0.0454 | |
| 0.1384 | 0.0539 | |
| 0.1327 | 0.0025 | |
| 0.0863 | 0.0217 | |
| 0.0716 | 0.0010 |
Result (p-values) from the test of individual links with ParaFit for the species with an insignificant result (α = 0.05) in the first approach. The comparison was done between an unconstrained ML tree and the first, genus-based constraint (second column) or the second constraint, based on a revised classification of the group (third column). Note that with a single exception the phylogenetic conflict was resolved by assigning S. coccoides to Sphaerochaeta [4] and three other Spirochaeta species to Treponema [46].
Pairwise comparison of S. coccoides with both Sphaerochaeta species and S. smaragdinae using the GGDC-Genome-to-Genome Distance Calculator.
| | | | ||
|---|---|---|---|---|
| 4.5 | 83.2 | 3.7 | ||
| 3.9 | 83.3 | 3.3 | ||
| 1.2 | 84.6 | 1.0 | ||
| 14.2 | 82.0 | 11.7 | ||
| 1.3 | 84.6 | 1.1 |
Figure 3Phylogenetic tree inferred from completely sequenced genomes of the Spirochaeta type strains. The tree was inferred from 140,413 aligned amino acid characters under the maximum likelihood (ML) criterion and rooted with Leptospira. The branches are scaled in terms of the expected number of substitutions per site. Numbers above the branches are bootstrapping support values (if larger than 60%) from (i) maximum-likelihood supermatrix analysis; (ii) maximum-parsimony supermatrix analysis; (iii) maximum-likelihood gene-content analysis; (iv) maximum-parsimony gene-content analysis. INSDC accession numbers are given in square brackets. Note that the placement of Borrelia is probably caused by long-branch attraction. For further details see the text.
Typical features of reference taxa.
| | | | |
|---|---|---|---|
| Cell shape | coccoid, spherical, not spiral | coccoid, spherical, pleomorphic; not helical or spiral | helical or spiral; spherical bodies under unfavorable growth conditions |
| Cell size | 0.5-2.0 µm | 0.4-2.5 µm | 0.2-0.75 by 5-250 µm |
| Motility | non-motile | non-motile | motile |
| Flagellation | no flagella | no flagella | 2 periplasmic flagella (exception: |
| T-optimum | 30 °C | mesophilic | 25-68 °C |
| pH-optimum | 7.4 | neutrophilic | |
| Oxygen requirement | anaerobe | anaerobe | obligately anaerobe or facultatively anaerobe |
| Fermentation products | acetate, ethanol, formate | acetate, ethanol, formate | acetate, ethanol, CO2, H2 |
| G+C content | 56.6-57.4 mol% [ | 45-48 mol% | 51-65 mol% [ |
On the basis of the above mentioned physiological and phylogenetic characteristics of strain SPN1T, its reclassification into the genus Sphaerochaeta is proposed. The inclusion of Sphaerochaeta in Spirochaetaceae also makes an emendation of the family necessary, as its previous description excludes features specifically found in Sphaerochaeta.