| Literature DB >> 27721915 |
Tiffany S Louie1, Donato Giovannelli2, Nathan Yee3, Priya Narasingarao1, Valentin Starovoytov4, Markus Göker5, Hans-Peter Klenk6, Elke Lang5, Nikos C Kyrpides7, Tanja Woyke8, Elisabetta Bini9, Max M Häggblom1.
Abstract
Sedimenticola selenatireducens strain AK4OH1T (= DSM 17993T = ATCC BAA-1233T) is a microaerophilic bacterium isolated from sediment from the Arthur Kill intertidal strait between New Jersey and Staten Island, NY. S. selenatireducens is Gram-negative and belongs to the Gammaproteobacteria. Strain AK4OH1T was the first representative of its genus to be isolated for its unique coupling of the oxidation of aromatic acids to the respiration of selenate. It is a versatile heterotroph and can use a variety of carbon compounds, but can also grow lithoautotrophically under hypoxic and anaerobic conditions. The draft genome comprises 4,588,530 bp and 4276 predicted protein-coding genes including genes for the anaerobic degradation of 4-hydroxybenzoate and benzoate. Here we report the main features of the genome of S. selenatireducens strain AK4OH1T.Entities:
Keywords: 4-hydroxybenzoate; Anaerobe; Gammaproteobacteria; Sedimenticola selenatireducens; Selenate respiration
Year: 2016 PMID: 27721915 PMCID: PMC5052931 DOI: 10.1186/s40793-016-0191-5
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic analysis highlighting the position of Sedimenticola selenatireducens strain AK4OH1T relative to its closest neighbors based on the 16S rRNA gene. The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [29]. The tree with the highest log likelihood (-3985.1130) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 1276 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. The strains and their corresponding GenBank accession numbers for 16S rRNA genes are listed in parentheses. The genome accession number and locus tag of strain AK4OH1T are NZ_ATZE00000000.1 and A3GODRAFT_03746. (T = type strain). Bar: 0.01 substitutions per nucleotide position. C. okenii was used as an outgroup
Fig. 2Electron micrograph of cells of S. selenatireducens strain AK4OH1T. Bar, 1 μm
Classification and general features of Sedimenticola selenatireducens strain AK4OH1T according to the MIGS recommendations [18]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain: AK4OH1T | |||
| Gram stain | negative | TAS [ | |
| Cell shape | rod (1.5 μm long, 0.5 μm wide) | TAS [ | |
| Motility | motile at some growth stages | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophile | TAS [ | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | 7 | TAS [ | |
| Carbon source | benzoate, 3-hydroxybenzoate, 4-hydroxybenzoate, acetate, formate, pyruvate, methyl-pyruvate, L-lactate, D- and L-malate, propionate, fumarate, succinate, methyl-succinate, bromo-succinate, p-hydroxyphenylacetic acid, cysteine | TAS [ | |
| MIGS-6 | Habitat | estuarine sediment | TAS [ |
| MIGS-6.3 | Salinity | 1.1-2.3 % NaCl (w/v) | TAS [ |
| MIGS-22 | Oxygen requirement | anaerobe-microaerophile | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | unknown | NAS |
| MIGS-4 | Geographic location | Hudson River estuary, Arthur Kill, intertidal strait NY/NJ, USA | TAS [ |
| MIGS-5 | Sample collection | 1995 | TAS [ |
| MIGS-4.1 | Latitude | 40°586′N | TAS [ |
| MIGS-4.2 | Longitude | 74°207′E | TAS [ |
| MIGS-4.3 | Depth | surface sediment | TAS [ |
| MIGS-4.4 | Altitude | sea level | TAS [ |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [36]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Level 2: High-Quality Draft |
| MIGS-28 | Libraries used | Illumina std PE IIOC |
| MIGS 29 | Sequencing platforms | Illumina |
| MIGS 31.2 | Fold coverage | 273× |
| MIGS 30 | Assemblers | ALLPATHS v. R37654 |
| MIGS 32 | Gene calling method | Prodigal 2.5 |
| Locus Tag | A3GO | |
| Genbank ID | ATZE00000000.1 | |
| GenBank Date of Release | 06/18/14 | |
| GOLD ID | Gp0013295 | |
| BIOPROJECT ID | PRJNA165429 | |
| MIGS 13 | Source Material Identifier | AK4OH1T |
| Project relevance | Bioremediation, environmental, biogeochemical cycling of Se, Genomic Encyclopedia of Bacteria and Archaea (GEBA) |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 4,588,530 | 100.00 |
| DNA coding (bp) | 4,041,165 | 88.07 |
| DNA G + C (bp) | 2,597,447 | 56.61 |
| DNA scaffolds | 37 | 100.00 |
| Total genesb | 4331 | 100.00 |
| Protein coding genes | 4276 | 98.73 |
| RNA genes | 55 | 1.27 |
| Genes with function prediction | 3440 | 79.43 |
| Genes assigned to COGs | 2832 | 65.39 |
| Genes with Pfam domains | 3595 | 83.01 |
| Genes with signal peptides | 383 | 8.84 |
| Genes with transmembrane helices | 1143 | 26.39 |
| CRISPR repeats | 1 | - |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
b no pseudogenes found
Number of genes associated with general COG functional categories
| Code | Value | %age | Description |
|---|---|---|---|
| J | 205 | 6.48 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 180 | 5.69 | Transcription |
| L | 117 | 3.70 | Replication, recombination and repair |
| B | 2 | 0.06 | Chromatin structure and dynamics |
| D | 41 | 1.30 | Cell cycle control, Cell division, chromosome partitioning |
| V | 66 | 2.09 | Defense mechanisms |
| T | 244 | 7.71 | Signal transduction mechanisms |
| M | 160 | 5.06 | Cell wall/membrane biogenesis |
| N | 120 | 3.79 | Cell motility |
| U | 49 | 1.55 | Intracellular trafficking and secretion |
| O | 207 | 6.54 | Posttranslational modification, protein turnover, chaperones |
| C | 339 | 10.71 | Energy production and conversion |
| G | 116 | 3.67 | Carbohydrate transport and metabolism |
| E | 244 | 7.71 | Amino acid transport and metabolism |
| F | 57 | 1.80 | Nucleotide transport and metabolism |
| H | 166 | 5.24 | Coenzyme transport and metabolism |
| I | 148 | 4.68 | Lipid transport and metabolism |
| P | 187 | 5.91 | Inorganic ion transport and metabolism |
| Q | 76 | 2.40 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 211 | 6.67 | General function prediction only |
| S | 175 | 5.53 | Function unknown |
| - | 1499 | 34.61 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Fig. 3Phylogenetic analysis highlighting the relation of Sedimenticola selenatireducens strain AK4OH1T genes to known DMSO reductases by Maximum Likelihood method. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model [37]. The tree with the highest log likelihood (-17325.9218) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using a JTT model. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 13 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 724 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. GenBank accession numbers are listed in parentheses. Bar = 0.5 substitutions per nucleotide position