Literature DB >> 27721915

High-quality draft genome sequence of Sedimenticola selenatireducens strain AK4OH1T, a gammaproteobacterium isolated from estuarine sediment.

Tiffany S Louie1, Donato Giovannelli2, Nathan Yee3, Priya Narasingarao1, Valentin Starovoytov4, Markus Göker5, Hans-Peter Klenk6, Elke Lang5, Nikos C Kyrpides7, Tanja Woyke8, Elisabetta Bini9, Max M Häggblom1.   

Abstract

Sedimenticola selenatireducens strain AK4OH1T (= DSM 17993T = ATCC BAA-1233T) is a microaerophilic bacterium isolated from sediment from the Arthur Kill intertidal strait between New Jersey and Staten Island, NY. S. selenatireducens is Gram-negative and belongs to the Gammaproteobacteria. Strain AK4OH1T was the first representative of its genus to be isolated for its unique coupling of the oxidation of aromatic acids to the respiration of selenate. It is a versatile heterotroph and can use a variety of carbon compounds, but can also grow lithoautotrophically under hypoxic and anaerobic conditions. The draft genome comprises 4,588,530 bp and 4276 predicted protein-coding genes including genes for the anaerobic degradation of 4-hydroxybenzoate and benzoate. Here we report the main features of the genome of S. selenatireducens strain AK4OH1T.

Entities:  

Keywords:  4-hydroxybenzoate; Anaerobe; Gammaproteobacteria; Sedimenticola selenatireducens; Selenate respiration

Year:  2016        PMID: 27721915      PMCID: PMC5052931          DOI: 10.1186/s40793-016-0191-5

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Selenium (Se) is an intriguing element in that microbes actively metabolize it through reduction, oxidation, methylation and demethylation reactions, using some of these to conserve energy. Of particular interest is the process of dissimilatory Se reduction, where the Se oxyanion, selenate [Se(VI)], is sequentially reduced to selenite [Se(IV)] and further to insoluble elemental Se(0). The ability to respire selenate/selenite is comparatively rare, nonetheless, is found in phylogenetically diverse anaerobes [1]. SeRB display a tremendous phylogenetic diversity, and yet the metabolic function seems to be conserved (or alternatively horizontally dispersed) in these unrelated groups. Furthermore, the physiologies of the known selenate-respiring bacteria appear to vary greatly. For example, they are able to couple growth to a wide range of electron acceptors such as arsenate, [2, 3] cobalt oxide (Co(III)) [4], and tellurite [5] to name a few. SeRB have been isolated from a variety of different locations. A few examples are: in California in the San Joaquin Valley [6], from estuarine sediment in NJ [7], from a glass manufacturing plant in Japan [8], and from the dead sea [9]. type strain AK4OH1T (= DSM 17993 = ATCC BA-1233) is a member of the isolated from estuarine sediment for its unique ability to couple the oxidation of aromatic acids to selenate respiration. The genus currently includes seven cultivated strains of which two species have been named and described: strain AK4OH1T, the type strain of the type species for this genus [10], strain CUZ [11], strain SIP-G1 [12], sp. strain Ke4OH1 [7], and sp. strain NSS [11]. Here we summarize the physiological features of AK4OH1T and provide a description of its genome.

Organism information

Classification and features

strain AK4OH1T was isolated from estuarine sediment in the New York-New Jersey harbor estuary (40°586′N, 74°207′E) [10]. The position of strain AK4OH1T relative to its phylogenetic neighbors is shown in Fig. 1. strain CUZ [11] is the closest relative to strain AK4OH1T with a 16S rRNA gene similarity of 100 %, yet interestingly, it has not been found to respire selenate. In addition to these two, there are five other cultivated strains of the genus : strain SIP-G1T [12], sp. strain NSS [11], and sp. strain Ke4OH1 [7]. The isolate TT-Z (accession number AM292414) [13] groups among the strains (Fig. 1) suggesting that it is part of the genus. The isolate IR (accession number AF521582) groups closely with strain AK4OH1T and strain CUZ, and its position in the phylogenetic tree suggests that it is a member of the species.
Fig. 1

Phylogenetic analysis highlighting the position of Sedimenticola selenatireducens strain AK4OH1T relative to its closest neighbors based on the 16S rRNA gene. The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [29]. The tree with the highest log likelihood (-3985.1130) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 1276 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. The strains and their corresponding GenBank accession numbers for 16S rRNA genes are listed in parentheses. The genome accession number and locus tag of strain AK4OH1T are NZ_ATZE00000000.1 and A3GODRAFT_03746. (T = type strain). Bar: 0.01 substitutions per nucleotide position. C. okenii was used as an outgroup

Phylogenetic analysis highlighting the position of Sedimenticola selenatireducens strain AK4OH1T relative to its closest neighbors based on the 16S rRNA gene. The evolutionary history was inferred by using the Maximum Likelihood method based on the Tamura-Nei model [29]. The tree with the highest log likelihood (-3985.1130) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using the Maximum Composite Likelihood (MCL) approach. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 15 nucleotide sequences. All positions containing gaps and missing data were eliminated. There were a total of 1276 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. The strains and their corresponding GenBank accession numbers for 16S rRNA genes are listed in parentheses. The genome accession number and locus tag of strain AK4OH1T are NZ_ATZE00000000.1 and A3GODRAFT_03746. (T = type strain). Bar: 0.01 substitutions per nucleotide position. C. okenii was used as an outgroup Cells of strain AK4OH1T are Gram-negative and rod-shaped [10] (Fig. 2 and Table 1). The strain can grow heterotrophically or lithoautotrophically under hypoxic and anaerobic conditions [12]. Motility is observed during early to mid-exponential growth on liquid MB2216 medium, but not in late exponential phase, and cell morphology varies depending on growth conditions [10, 12].
Fig. 2

Electron micrograph of cells of S. selenatireducens strain AK4OH1T. Bar, 1 μm

Table 1

Classification and general features of Sedimenticola selenatireducens strain AK4OH1T according to the MIGS recommendations [18]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [31]
Phylum Proteobacteria TAS [32]
Class Gammaproteobacteria TAS [33, 34]
Genus Sedimenticola TAS [10, 35]
Species Sedimenticola selenatireducens TAS [10, 35]
Type strain: AK4OH1T
Gram stainnegativeTAS [10]
Cell shaperod (1.5 μm long, 0.5 μm wide)TAS [10]
Motilitymotile at some growth stagesTAS [12]
SporulationnoneTAS [10]
Temperature rangemesophileTAS [10]
Optimum temperature28 °CTAS [10]
pH range; Optimum7TAS [10]
Carbon sourcebenzoate, 3-hydroxybenzoate, 4-hydroxybenzoate, acetate, formate, pyruvate, methyl-pyruvate, L-lactate, D- and L-malate, propionate, fumarate, succinate, methyl-succinate, bromo-succinate, p-hydroxyphenylacetic acid, cysteineTAS [10, 12]
MIGS-6Habitatestuarine sedimentTAS [10]
MIGS-6.3Salinity1.1-2.3 % NaCl (w/v)TAS [10]
MIGS-22Oxygen requirementanaerobe-microaerophileTAS [10, 12]
MIGS-15Biotic relationshipfree-livingTAS [10]
MIGS-14PathogenicityunknownNAS
MIGS-4Geographic locationHudson River estuary, Arthur Kill, intertidal strait NY/NJ, USATAS [10]
MIGS-5Sample collection1995TAS [10]
MIGS-4.1Latitude40°586′NTAS [10]
MIGS-4.2Longitude74°207′ETAS [10]
MIGS-4.3Depthsurface sedimentTAS [10]
MIGS-4.4Altitudesea levelTAS [10]

a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [36]

Electron micrograph of cells of S. selenatireducens strain AK4OH1T. Bar, 1 μm Classification and general features of Sedimenticola selenatireducens strain AK4OH1T according to the MIGS recommendations [18] a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [36] Strain AK4OH1T is able to utilize benzoate, 3-hydroxybenzoate, 4-hydroxybenzoate, acetate, formate, fumarate, L-lactate, D- and L-malate, pyruvate, methyl-pyruvate, propionate, succinate, methyl-succinate, bromo-succinate, p-hydroxyphenylacetic acid, α-ketoglutaric acid, arabinose, lyxose, ribose, xylose, D-galactonic acid-γ-lactone, α-hydroxy-glutaric acid-γ-lactone, L-alanine, L-glutamic acid, L-serine, tyramine, and phenylethylamine [10, 12].

Chemotaxonomic data

The predominant cellular fatty acids in strain AK4OH1T are C16:0 (61.9 %), C16:1 ω7c (14.4 %), C18:0 (8.4 %), and C18:1 ω7c (7.2 %) [10].

Genome sequencing information

Genome project history

strain AK4OH1T was selected for sequencing in 2011 based on its phylogenetic position [14, 15] and is part of the study Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes project (KMG-I) [16]. The goal of the KMG-I study was to increase the coverage of sequenced reference microbial genomes [17]. The Quality Draft (QD) assembly and annotation were made available for public access on June 18, 2014. Table 2 presents the project information and its association with MIGS version 2.0 compliance [18]. The NCBI accession number for the Bioproject is PRJNA165429. The genome accession number is ATZE00000000.1 consisting of 41 contigs (ATZE01000001-ATZE01000041) and 37 scaffolds.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityLevel 2: High-Quality Draft
MIGS-28Libraries usedIllumina std PE IIOC
MIGS 29Sequencing platformsIllumina
MIGS 31.2Fold coverage273×
MIGS 30AssemblersALLPATHS v. R37654
MIGS 32Gene calling methodProdigal 2.5
Locus TagA3GO
Genbank IDATZE00000000.1
GenBank Date of Release06/18/14
GOLD IDGp0013295
BIOPROJECT IDPRJNA165429
MIGS 13Source Material IdentifierAK4OH1T
Project relevanceBioremediation, environmental, biogeochemical cycling of Se, Genomic Encyclopedia of Bacteria and Archaea (GEBA)
Project information

Growth conditions and genomic DNA preparation

strain AK4OH1T was grown in mineral salt medium at 28 °C with 10 mM Na2SeO4 as electron acceptor and 250 μM 4-hydroxybenzoate as carbon source, as previously described [10]. Genomic DNA was isolated from 0.5 g of cell paste using JetFlex Genomic DNA Purification Kit (GENOMED) as recommended by the manufacturer.

Genome sequencing and assembly

Sequencing was achieved using an Illumina [19] platform using a std paired-end library obtaining 273× fold coverage. The sequencing was done at the DOE Joint Genome Institute. ALLPATHS assembly software [20] was used to obtain 41 final contigs. Quality check and assembly statistics were performed at JGI. The raw sequences were screened against contaminants and 0.1 % of the reads were removed.

Genome annotation

Gene calling was performed using Prodigal 2.5 [21]. The genome sequence was analyzed using the Joint Genome Institute IMG system [22]. Ribosomal RNAs were predicted based upon sequence similarity, using BLAST, against the non-redundant nucleotide database and/or using Infernal and Rfam models. tRNA genes were found using tRNAscan-SE [23]. The predicted CDS were searched using the NCBI non-redundant protein database. The major metabolic pathways and predicted protein set were searched using KEGG, SwissProt, COG, Pfam, and InterPro protein databases implemented in the IMG. Additional gene prediction analysis and manual functional annotation were performed within IMG and using Artemis software (release 13.0, Sanger Institute).

Genome properties

The high quality draft genome sequence consists of 37 scaffolds that account for a total of 4,588,530 bp with a 56.6 % G + C content. In total, 4331 genes were predicted, 4276 of which are protein-coding genes, 55 RNA genes, and no pseudogenes. The majority of the predicted genes (79 %) were assigned a predicted function. The properties and statistics of the genome are summarized in Table 3 and Table 4.
Table 3

Genome statistics

AttributeValue% of Totala
Genome size (bp)4,588,530100.00
DNA coding (bp)4,041,16588.07
DNA G + C (bp)2,597,44756.61
DNA scaffolds37100.00
Total genesb 4331100.00
Protein coding genes427698.73
RNA genes551.27
Genes with function prediction344079.43
Genes assigned to COGs283265.39
Genes with Pfam domains359583.01
Genes with signal peptides3838.84
Genes with transmembrane helices114326.39
CRISPR repeats1-

a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome

b no pseudogenes found

Table 4

Number of genes associated with general COG functional categories

CodeValue%ageDescription
J2056.48Translation, ribosomal structure and biogenesis
A10.03RNA processing and modification
K1805.69Transcription
L1173.70Replication, recombination and repair
B20.06Chromatin structure and dynamics
D411.30Cell cycle control, Cell division, chromosome partitioning
V662.09Defense mechanisms
T2447.71Signal transduction mechanisms
M1605.06Cell wall/membrane biogenesis
N1203.79Cell motility
U491.55Intracellular trafficking and secretion
O2076.54Posttranslational modification, protein turnover, chaperones
C33910.71Energy production and conversion
G1163.67Carbohydrate transport and metabolism
E2447.71Amino acid transport and metabolism
F571.80Nucleotide transport and metabolism
H1665.24Coenzyme transport and metabolism
I1484.68Lipid transport and metabolism
P1875.91Inorganic ion transport and metabolism
Q762.40Secondary metabolites biosynthesis, transport and catabolism
R2116.67General function prediction only
S1755.53Function unknown
-149934.61Not in COGs

The total is based on the total number of protein coding genes in the genome

Genome statistics a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome b no pseudogenes found Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome

Insights from the genome sequence

The respiratory flexibility of anaerobic prokaryotes allowing them to employ different terminal electron acceptors for respiration enables these organisms to thrive in dynamic redox environments. Among the enzymes that catalyze oxidation-reduction reactions of metals and metalloids are those that are highly conserved and belong to the DMSO reductase family [24]. Key members of the DMSO family of reductases, which transfer electrons to a variety of substrates that act as terminal electron acceptors for energy generation, are nitrate reductases (Nar, Nap, Nas), arsenate reductase (Arr), selenate reductase (Ser), and chlorate reductase (Clr), among others. strain AK4OH1T can use nitrate, nitrite and selenate as the terminal electron acceptors for anaerobic growth, while using the electron donors acetate, lactate, pyruvate, benzoate, 3-hydroxybenzoate, and 4-hydroxybenzoate [10]. Chlorate and perchlorate can be used as electron acceptors when peptone is used as an energy source [12]. (Micro-)aerobic growth with oxygen as electron-acceptor and peptones as electron-donor is also detected [12] Within the AK4OH1T genome, there are several likely DMSO reductases. Figure 3 shows the grouping of AK4OH1T genes with closely matching, known, DMSO reductases. A3GODRAFT_03903 groups closely with the NapA, from sp. QH-2. A3GODRAFT_01428 clusters together with the NarG of K-12 MG1655. Both of these genes are organized in gene clusters similar to known nap and nar operons [25]. BLAST searches of the AK4OH1T genome using arsenate reductases showed no genes with significant similarity. This agrees with strain AK4OH1’s inability to respire arsenate [10]. A3GODRAFT_02603 and A3GODRAFT_03351 from strain AK4OH1T cluster closely with the chlorate reductase from sp. J5-51 and with the selenate reductase from . A3GODRAFT_02603, which groups closest with ClrA, resembles the gene organization of a clr operon [26]. While the only well-studied respiratory selenate reductase, serA, is from , A3GODRAFT_03351 and its neighboring genes follow the same organization as found with serABDC [27]. Gene A3GODRAFT_04296 clusters together with the perchlorate reductase from aromatica, and appears to have the same gene organization as a pcr operon [28].
Fig. 3

Phylogenetic analysis highlighting the relation of Sedimenticola selenatireducens strain AK4OH1T genes to known DMSO reductases by Maximum Likelihood method. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model [37]. The tree with the highest log likelihood (-17325.9218) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using a JTT model. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 13 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 724 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. GenBank accession numbers are listed in parentheses. Bar = 0.5 substitutions per nucleotide position

Phylogenetic analysis highlighting the relation of Sedimenticola selenatireducens strain AK4OH1T genes to known DMSO reductases by Maximum Likelihood method. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model [37]. The tree with the highest log likelihood (-17325.9218) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using a JTT model. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 13 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 724 positions in the final dataset. Evolutionary analyses were conducted in MEGA6 [30]. GenBank accession numbers are listed in parentheses. Bar = 0.5 substitutions per nucleotide position

Conclusions

The complete genome of the estuarine bacterium AK4OH1T provides a stronger foundation from which to learn more about the process of dissimilatory selenate reduction. As AK4OH1T was the first organism isolated capable of coupling the respiration of selenate to the oxidation of benzoic acids, its genome also provides a starting point for learning more about this unique capability.
  34 in total

1.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

2.  The rapid generation of mutation data matrices from protein sequences.

Authors:  D T Jones; W R Taylor; J M Thornton
Journal:  Comput Appl Biosci       Date:  1992-06

3.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-11       Impact factor: 2.747

Review 4.  Ecology and biotechnology of selenium-respiring bacteria.

Authors:  Y V Nancharaiah; P N L Lens
Journal:  Microbiol Mol Biol Rev       Date:  2015-03       Impact factor: 11.056

5.  Sedimenticola thiotaurini sp. nov., a sulfur-oxidizing bacterium isolated from salt marsh sediments, and emended descriptions of the genus Sedimenticola and Sedimenticola selenatireducens.

Authors:  Beverly E Flood; Daniel S Jones; Jake V Bailey
Journal:  Int J Syst Evol Microbiol       Date:  2015-05-05       Impact factor: 2.747

6.  Sedimenticola selenatireducens, gen. nov., sp. nov., an anaerobic selenate-respiring bacterium isolated from estuarine sediment.

Authors:  Priya Narasingarao; Max M Häggblom
Journal:  Syst Appl Microbiol       Date:  2006-01-20       Impact factor: 4.022

7.  ALLPATHS: de novo assembly of whole-genome shotgun microreads.

Authors:  Jonathan Butler; Iain MacCallum; Michael Kleber; Ilya A Shlyakhter; Matthew K Belmonte; Eric S Lander; Chad Nusbaum; David B Jaffe
Journal:  Genome Res       Date:  2008-03-13       Impact factor: 9.043

8.  Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees.

Authors:  K Tamura; M Nei
Journal:  Mol Biol Evol       Date:  1993-05       Impact factor: 16.240

9.  Enrichment and isolation of Bacillus beveridgei sp. nov., a facultative anaerobic haloalkaliphile from Mono Lake, California, that respires oxyanions of tellurium, selenium, and arsenic.

Authors:  S M Baesman; J F Stolz; T R Kulp; Ronald S Oremland
Journal:  Extremophiles       Date:  2009-06-18       Impact factor: 2.395

10.  Genomic Encyclopedia of Type Strains, Phase I: The one thousand microbial genomes (KMG-I) project.

Authors:  Nikos C Kyrpides; Tanja Woyke; Jonathan A Eisen; George Garrity; Timothy G Lilburn; Brian J Beck; William B Whitman; Phil Hugenholtz; Hans-Peter Klenk
Journal:  Stand Genomic Sci       Date:  2013-12-17
View more
  1 in total

1.  Taxonomic and functional heterogeneity of the gill microbiome in a symbiotic coastal mangrove lucinid species.

Authors:  Shen Jean Lim; Brenton G Davis; Danielle E Gill; Jillian Walton; Erika Nachman; Annette Summers Engel; Laurie C Anderson; Barbara J Campbell
Journal:  ISME J       Date:  2018-12-05       Impact factor: 10.302

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.