| Literature DB >> 24501624 |
Thomas Kruse1, Julien Maillard2, Lynne Goodwin3, Tanja Woyke4, Hazuki Teshima3, David Bruce3, Chris Detter3, Roxanne Tapia3, Cliff Han3, Marcel Huntemann4, Chia-Lin Wei4, James Han4, Amy Chen4, Nikos Kyrpides4, Ernest Szeto4, Victor Markowitz4, Natalia Ivanova4, Ioanna Pagani4, Amrita Pati4, Sam Pitluck4, Matt Nolan4, Christof Holliger2, Hauke Smidt1.
Abstract
Dehalobacter restrictus strain PER-K23 (DSM 9455) is the type strain of the species Dehalobacter restrictus. D. restrictus strain PER-K23 grows by organohalide respiration, coupling the oxidation of H2 to the reductive dechlorination of tetra- or trichloroethene. Growth has not been observed with any other electron donor or acceptor, nor has fermentative growth been shown. Here we introduce the first full genome of a pure culture within the genus Dehalobacter. The 2,943,336 bp long genome contains 2,826 protein coding and 82 RNA genes, including 5 16S rRNA genes. Interestingly, the genome contains 25 predicted reductive dehalogenase genes, the majority of which appear to be full length. The reductive dehalogenase genes are mainly located in two clusters, suggesting a much larger potential for organohalide respiration than previously anticipated.Entities:
Keywords: Dehalobacter restrictus type strain; PCE; TCE; anaerobe; organohalide respiration; reductive dehalogenases
Year: 2013 PMID: 24501624 PMCID: PMC3910700 DOI: 10.4056/sigs.3787426
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain PER-K23 according to MIGS recommendations [7].
| | | | |
|---|---|---|---|
| Domain | TAS [ | ||
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Current classification | Order | TAS [ | |
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain PER-K23 | |||
| Gram stain | Negative | TAS [ | |
| Cell shape | Straight rod | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Not observed | TAS [ | |
| Temperature range | 10-37 °C | TAS [ | |
| Optimum temperature | 25-30 °C | TAS [ | |
| Carbon source | Acetate, yeast extract | TAS [ | |
| Energy source | H2 as sole electron donor | TAS [ | |
| Terminal electron receptor | PCE and TCE | TAS [ | |
| MIGS-6 | Habitat | Anaerobic river sediment | TAS [ |
| MIGS-6.3 | Salinity | Not tested | |
| MIGS-22 | Oxygen | Strictly anaerobic | [ |
| MIGS-15 | Biotic relationship | Free living | [ |
| MIGS-14 | Pathogenicity | None known | |
| MIGS-4 | Geographic location | River Rhine, near Wageningen, The Netherlands | [ |
| MIGS-5 | Sample collection time | 1992 |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [19].
Figure 1Phylogenetic tree highlighting the position of relative to phylogenetically closely related organisms. 16S rRNA sequences were retrieved from Genbank (NCBI), and accession numbers are given in parentheses. Strains from which a full genome sequence are available are indicated with an asterisk. Phylogenetic analysis was done using the MEGA5 software package [20]. Sequences were aligned using the MUSCLE algorithm before a neighbor joining tree was constructed and validated with 1,000 bootstraps [21,22]. The reference bar indicates 2% sequence divergence.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Closed genome |
| MIGS-28 | Libraries used | Two genomic libraries, one paired-end 454 library and one Illumina library. |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium and Illumina GAii |
| MIGS-31.2 | Fold coverage | 8.5 for 454 and 120 for Illumina |
| MIGS-30 | Assemblers | Newbler version 2.3, VELVET, version 1.0.13 and phrap, version SPS – 4.24 |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| Genome Database release | December 28th, 2011 | |
| Genbank ID | PRJNA66209 | |
| Genbank Date of Release | ||
| GOLD ID | Gi05571 | |
| Project relevance | Type strain, Bioremediation, Biotechnology |
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 2,943,336 | 100.00 |
| DNA coding region (bp) | 2,473,591 | 84.04 |
| DNA G+C content (bp) | 1,311,589 | 44.56 |
| Total genesb | 2,908 | 100.00 |
| RNA genes | 82 | 2.82 |
| Protein-coding genes | 2,826 | 97.18 |
| Genes in paralog clusters | 1,174 | 40.37 |
| Genes assigned to COGs | 2,127 | 73.14 |
| Genes with signal peptides | 756 | 26.00 |
| Genes with transmembrane helices | 755 | 25.96 |
| Paralogous groups | 356 | 40.37 |
| Reductive dehalogenasesc | 25 | 0.86 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
b Also includes 143 pseudogenes.
c Including pseudogenes
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 149 | 6.4 | Translation |
| A | -- | -- | RNA processing and modification |
| K | 176 | 7.6 | Transcription |
| L | 180 | 7.8 | Replication, recombination and repair |
| B | 1 | 0.1 | Chromatin structure and dynamics |
| D | 44 | 1.9 | Cell cycle control, cell division, chromosome partitioning |
| Y | -- | -- | Nuclear structure |
| V | 39 | 1.7 | Defense mechanisms |
| T | 166 | 7.2 | Signal transduction mechanisms |
| M | 148 | 6.4 | Cell wall/membrane biogenesis |
| N | 71 | 3.1 | Cell motility |
| Z | -- | -- | Cytoskeleton |
| W | -- | -- | Extracellular structures |
| U | 73 | 3.1 | Intracellular trafficking and secretion |
| O | 84 | 3.6 | Posttranslational modification, protein turnover, chaperones |
| C | 169 | 7.3 | Energy production and conversion |
| G | 64 | 2.8 | Carbohydrate transport and metabolism |
| E | 166 | 7.2 | Amino acid transport and metabolism |
| F | 59 | 2.5 | Nucleotide transport and metabolism |
| H | 118 | 5.1 | Coenzyme transport and metabolism |
| I | 40 | 1.7 | Lipid transport and metabolism |
| P | 105 | 4.5 | Inorganic ion transport and metabolism |
| Q | 27 | 1.2 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 241 | 10.4 | General function prediction only |
| S | 203 | 8.7 | Function unknown |
| - | 781 | 26.9a | Not in COGs |
a Percentage of the total number of protein coding genes in the annotated genome.
Reductive dehalogenase paralogs encoded in the genome of strain PER-K23
| | | |
|---|---|---|
| Dehre_0785 | No | |
| Dehre_0793 | No | |
| Dehre_0806 | No | |
| Dehre_0808 | No | N-terminally truncated |
| Dehre_0815 | No | N-terminally truncated |
| Dehre_0820 | No | |
| Dehre_0826 | No | |
| Dehre_0830 | No | |
| Dehre_0832 | (98.7; Dhb965) | |
| Dehre_0835 | (98.7; Dhb968) | |
| Dehre_0990 | (100; Dhb84) | |
| Dehre_1408 | (99.6; Dhb1133) | |
| Dehre_2012 | (96.6; Dhb1238) | C-terminally truncatedc |
| Dehre_2022 | No | |
| Dehre_2026 | No | |
| Dehre_2031 | No | |
| Dehre_2037 | No | |
| Dehre_2039 | No | |
| Dehre_2044 | No | |
| Dehre_2052 | No | |
| Dehre_2058 | No | C-terminally truncated |
| Dehre_2064 | No | |
| Dehre_2065 | No | |
| Dehre_2398 | (90.4; Dhb490) | PceA d |
| Dehre_2792 | Noe | Partial sequence |
a RdhA paralogs are listed in order of their position in the genome. Light grey indicates RdhA paralogs belonging to rdh cluster A (Dehre_785-835) and dark grey rdh cluster B (Dehre_2012-2065).
b Orthology defined as more than 90% pairwise identity at the amino acid level, as suggested in [40]. Identity percentage based on full length RDHs and locus tag of the corresponding genes in strain E1 are given in brackets [3]. Identity percentages were calculated using MatGat [41].
c For the comparison, a manually curated version of Dehre_2012 was used, i.e. the entire gene without the annotated frame-shift mutation.
d Dehre_2398 corresponds to the biochemically characterized PCE reductive dehalogenase (PceA) [42].
e The sequence is conserved between the two strains, but no gene is annotated at this position in D. sp. Strain E1.
Figure 2Circular map of the chromosome of strain PER-K23. Labeling from the outside circle towards the inside circle. Numbers outside the map indicate nucleotide positions; Circles 1 and 2: predicted coding sequences, including pseudogenes, on the forward and reverse strand, respectively (colored by COG categories); Circle 3: RNA genes (tRNAs green, rRNAs red, other RNAs black); Circle 4: Position of reductive dehalogenase genes, in red, both functional and truncated, A and B indicate two rdh clusters; Circle 5: Position of transposases including inactive derivatives, in green; Circle 6: Position of genes related to corrinoid synthesis and uptake, in blue; Circle 7: GC content (peaks out/inside the circle indicate above or below average GC content, respectively: Circle 8: GC skew, calculated as (G-C)/(G+C), purple or olive indicates values lower or higher than 1, respectively.