Literature DB >> 26512310

Partial genome sequence of the haloalkaliphilic soda lake bacterium Thioalkalivibrio thiocyanoxidans ARh 2(T).

Tom Berben1, Dimitry Y Sorokin2, Natalia Ivanova3, Amrita Pati3, Nikos Kyrpides3, Lynne A Goodwin3, Tanja Woyke3, Gerard Muyzer1.   

Abstract

Thioalkalivibrio thiocyanoxidans strain ARh 2(T) is a sulfur-oxidizing bacterium isolated from haloalkaline soda lakes. It is a motile, Gram-negative member of the Gammaproteobacteria. Remarkable properties include the ability to grow on thiocyanate as the sole energy, sulfur and nitrogen source, and the capability of growth at salinities of up to 4.3 M total Na(+). This draft genome sequence consists of 61 scaffolds comprising 2,765,337 bp, and contains 2616 protein-coding and 61 RNA-coding genes. This organism was sequenced as part of the Community Science Program of the DOE Joint Genome Institute.

Entities:  

Keywords:  Haloalkaliphilic; Soda lakes; Sulfur-oxidizing bacteria; Thiocyanate

Year:  2015        PMID: 26512310      PMCID: PMC4624188          DOI: 10.1186/s40793-015-0078-x

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Soda lakes are found in many arid zones across the world, such as the Kulunda Steppe in Russia, North-Eastern China, the Rift Valley in Africa, and in arid parts of North America, i.e. California and Nevada. The defining characteristics of these lakes are the abundance of carbonate/bicarbonate anions rather than chloride and their moderate to high salinities. This makes soda lakes a unique habitat with stable, alkaline pH values above nine and up to 11 [1]. Despite the high salinity and alkalinity, soda lakes harbor a rich microbial diversity that is responsible for highly active elemental cycles. Aside from the carbon cycle, the sulfur cycle is of great importance in these lakes [2], yet little is known about their precise biogeochemistry and dynamics [3]. A better understanding of these processes will lead to improved insights into the ecology and biogeochemical cycling in soda lakes. Additionally, sulfur-cycling extremophilic prokaryotes have important applications in bioremediation [4] and more detailed knowledge of their physiology may improve industrial waste processing. For these reasons, we have sequenced more than 70 strains belonging to the genus , a dominant cultivated group of chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria in soda lakes worldwide. Here we present the partial genome sequence of ARh 2T.

Organism information

Classification and features

ARh 2T forms motile vibrio-like cells of approximately 0.5–0.6 by 0.8–1.4 μm (basic properties are summarized in Table 1). The cells grown with thiocyanate as electron source have a remarkably extended periplasm (Fig. 1). It is a Gram-negative bacterium belonging to the (Fig. 2). The species description is based on four strains (ARh 2, ARh 3, ARh 4 and ARh 5) that were isolated from sediment samples of South-Western Siberian, Kenyan and Egyptian soda lakes. Strain ARh 2 is a type strain of the species. As a chemolithoautotroph, ARh 2T derives energy from the oxidation of inorganic sulfur compounds, such as sulfide, thiosulfate, thiocyanate, elemental sulfur and polysulfides. The most interesting properties are its ability to grow on thiocyanate as the sole source of energy, sulfur and nitrogen and its ability to grow in saturated soda brines brines with thiosulfate as energy source [5].
Table 1

Classification and general features of Thioalkalivibrio thiocyanoxidans ARh 2T [12]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [13]
Phylum Proteobacteria TAS [14, 15]
Class Gammaproteobacteria TAS [15, 16]
Order Chromatiales TAS [15, 17]
Family Ectothiorhodospiraceae TAS [18]
Genus Thioalkalivibrio TAS [19]
Species Thioalkalivibrio thiocyanoxidans TAS [5]
Type strain: ARh 2T (DSM 13532)
Gram stainNegativeTAS [5, 19]
Cell shapeVibriosTAS [5]
MotilityMotileTAS [5]
SporulationNon-sporulatingNAS
Temperature rangeMesophilicTAS [5]
Optimum temperature35–37 °CTAS [5]
pH range; Optimum8.5–10.5TAS [5]
Carbon sourceInorganic carbonTAS [5]
MIGS-6HabitatSoda lakesTAS [5]
MIGS-6.3Salinity0.3–4.3 M Na+ TAS [5]
MIGS-22Oxygen requirementAerobeTAS [5]
MIGS-15Biotic relationshipFree-livingNAS
MIGS-14PathogenicityNon-pathogenicNAS
MIGS-4Geographic locationKenyaTAS [5]
MIGS-5Sample collection1999TAS [5]
MIGS-4.1LatitudeNot reported
MIGS-4.2LongitudeNot reported
MIGS-4.4AltitudeNot reported

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [20]

Fig. 1

Thin section electron microscopy photograph of cells of strain ARh 2T grown with thiocyanate in batch culture at pH 9.8 and 0.6 M total Na+. OM - outer cell membrane; CM - cytoplasmic membrane; P - periplasm; C - cytoplasm

Fig. 2

Phylogenetic tree based on 16S rRNA sequences comprising the Thioalkalivibrio type strains and several other members of the Ectothiorhodospiraceae family. Black dots mark nodes with a bootstrap value between 90 and 100 %. 16S rRNA sequences of members of the Alphaproteobacteria were used as the outgroup, but pruned from the tree. The tree was constructed using ARB [21] and bootstrap values calculated using MEGA6 [22]

Classification and general features of Thioalkalivibrio thiocyanoxidans ARh 2T [12] aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [20] Thin section electron microscopy photograph of cells of strain ARh 2T grown with thiocyanate in batch culture at pH 9.8 and 0.6 M total Na+. OM - outer cell membrane; CM - cytoplasmic membrane; P - periplasm; C - cytoplasm Phylogenetic tree based on 16S rRNA sequences comprising the Thioalkalivibrio type strains and several other members of the Ectothiorhodospiraceae family. Black dots mark nodes with a bootstrap value between 90 and 100 %. 16S rRNA sequences of members of the Alphaproteobacteria were used as the outgroup, but pruned from the tree. The tree was constructed using ARB [21] and bootstrap values calculated using MEGA6 [22]

Genome sequencing information

Genome project history

ARh 2T was sequenced as part of a project aimed at sequencing a large number of isolates. The goal of this project is to enable the study of the genomic diversity of the dominant genus of sulfur-oxidizing bacteria in soda lakes. ARh 2T was selected for its ability to grow in salt-saturated brines (4.3 M Na+) and for its ability to grow on thiocyanate as the sole energy, sulfur and nitrogen source. The permanent draft genome we present here consists of approximately 2.8 million basepairs divided over 61 scaffolds. Sequencing was performed at the Joint Genome Institute under project 1008667. The genome sequence was released in Genbank on December 25, 2014. An overview of the project is given in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityImproved high-quality draft
MIGS-28Libraries usedIllumina standard fragment, 270 bp
MIGS 29Sequencing platformsIllumina HiSeq 2000
MIGS 31.2Fold coverage1819
MIGS 30AssemblersVelvet 1.1.04 [7], ALLPATHS R39750 [8]
MIGS 32Gene calling methodProdigal [9], GenePRIMP [10]
Locus TagG372
Genbank IDARQK00000000
GenBank Date of Release2014-12-25
GOLD IDGp0025980
BIOPROJECTPRJNA185302
IMG submission ID12214
MIGS 13Source Material IdentifierDSM 13532
Project relevanceBiotechnology
Project information

Growth conditions and genomic DNA extraction

ARh 2T (DSM 13532) was cultured in a standard buffer containing sodium carbonate and bicarbonate at pH 10. The total salt concentration was 0.6 M Na+ [6]. The energy source was thiosulfate, at a concentration of 40 mM. After harvesting, the cells were stored at −80 °C for further processing. Genomic DNA was extracted using a chloroform-phenol-isoamylalcohol mixture and precipitated with ethanol. After vacuum drying, the pellet was dissolved in water and the quantity and quality of the DNA determined using the JGI-provided Mass Standard Kit.

Genome sequencing and assembly

This strain was sequenced as part of the Community Science Program of the US Department of Energy Joint Genome Institute. The Illumina HiSeq 2000 platform was used for sequencing, with a depth of 1819X. More details regarding the library construction and sequencing are available at the JGI website. Reads were filtered using DUK and assembled using Velvet 1.1.04 [7]. Pseudoreads (1–3 Kb) were generated from the Velvet output using wgsim and reassembled using ALLPATHS-LG r42328 [8]. The final assembly consists of 61 scaffolds.

Genome annotation

Genes were predicted using Prodigal [9], followed by a round of manual curation using GenePRIMP [10] to detect pseudogenes. The resulting predicted genes were translated and annotated using the NCBI NR database in combination with the UniProt, TIGRFam, Pfam, KEGG, COG and InterPro databases and tRNAScanSE [11] for tRNA prediction. Ribosomal RNAs were detected using models built from SILVA. Further annotation was performed using the Integrated Microbial Genomes platform. All annotation data is freely available there, with IMG submission ID 12214.

Genome properties

The final draft of the genome comprises 2.8 million base pairs in 61 scaffolds, with a G + C percentage of 66.18 %. The gene calling and annotation pipeline detected 2677 genes, of which 2616 code for proteins. Basic statistics concerning the genome sequence are shown in Table 3. In total, 70 % of the genes could be assigned functional categories based on COGs (see Table 4).
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)2,765,337100.00
DNA coding (bp)2,496,80990.29
DNA G + C (bp)1,829,98466.18
DNA scaffolds61100.00
Total genes2677100.00
Protein coding genes261697.72
RNA genes612.28
Pseudo genesNot determinedNot determined
Genes in internal clustersNot determinedNot determined
Genes with function prediction223083.30
Genes assigned to COGs188570.41
Genes with Pfam domains179978.94
Genes with signal peptides2178.11
Genes with transmembrane helices65524.47
CRISPR repeats1100.00
Table 4

Number of genes associated with the 25 general COG functional categories

CodeValue% ageDescription
J1487.09Translation, ribosomal structure and biogenesis
A10.05RNA processing and modification
K703.36Transcription
L984.70Replication, recombination and repair
B20.10Chromatin structure and dynamics
D321.53Cell cycle control, Cell division, chromosome partitioning
V291.39Defense mechanisms
T1055.03Signal transduction mechanisms
M1537.33Cell wall/membrane biogenesis
N733.50Cell motility
U723.45Intracellular trafficking and secretion
O1095.23Posttranslational modification, protein turnover, chaperones
C1487.09Energy production and conversion
G823.93Carbohydrate transport and metabolism
E1456.95Amino acid transport and metabolism
F602.88Nucleotide transport and metabolism
H1316.28Coenzyme transport and metabolism
I863.02Lipid transport and metabolism
P1055.03Inorganic ion transport and metabolism
Q371.77Secondary metabolites biosynthesis, transport and catabolism
R22810.93General function prediction only
S1959.35Function unknown
-79229.59Not in COGs

The total is based on the total number of protein coding genes in the genome

Genome statistics Number of genes associated with the 25 general COG functional categories The total is based on the total number of protein coding genes in the genome

Conclusions

Sequencing of the genome of ARh 2T is an important step towards a more comprehensive understanding of the mechanism by which this organism can adapt to extremely high salinity. In addition, it will provide important information on the role of this organism in the carbon and sulfur cycles of natural and engineered environments, in particular in the degradation of thiocyanate.
  15 in total

1.  ARB: a software environment for sequence data.

Authors:  Wolfgang Ludwig; Oliver Strunk; Ralf Westram; Lothar Richter; Harald Meier; Arno Buchner; Tina Lai; Susanne Steppi; Gangolf Jobb; Wolfram Förster; Igor Brettske; Stefan Gerber; Anton W Ginhart; Oliver Gross; Silke Grumann; Stefan Hermann; Ralf Jost; Andreas König; Thomas Liss; Ralph Lüssmann; Michael May; Björn Nonhoff; Boris Reichel; Robert Strehlow; Alexandros Stamatakis; Norbert Stuckmann; Alexander Vilbig; Michael Lenke; Thomas Ludwig; Arndt Bode; Karl-Heinz Schleifer
Journal:  Nucleic Acids Res       Date:  2004-02-25       Impact factor: 16.971

2.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

3.  High-quality draft assemblies of mammalian genomes from massively parallel sequence data.

Authors:  Sante Gnerre; Iain Maccallum; Dariusz Przybylski; Filipe J Ribeiro; Joshua N Burton; Bruce J Walker; Ted Sharpe; Giles Hall; Terrance P Shea; Sean Sykes; Aaron M Berlin; Daniel Aird; Maura Costello; Riza Daza; Louise Williams; Robert Nicol; Andreas Gnirke; Chad Nusbaum; Eric S Lander; David B Jaffe
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-27       Impact factor: 11.205

4.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

5.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

6.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

7.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

8.  Thioalkalivibrio thiocyanoxidans sp. nov. and Thioalkalivibrio paradoxus sp. nov., novel alkaliphilic, obligately autotrophic, sulfur-oxidizing bacteria capable of growth on thiocyanate, from soda lakes.

Authors:  Dimitry Yu Sorokin; Tatyana P Tourova; Anatoly M Lysenko; Lubov L Mityushina; J Gijs Kuenen
Journal:  Int J Syst Evol Microbiol       Date:  2002-03       Impact factor: 2.747

9.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

10.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

View more
  2 in total

1.  Analysis of the Genes Involved in Thiocyanate Oxidation during Growth in Continuous Culture of the Haloalkaliphilic Sulfur-Oxidizing Bacterium Thioalkalivibrio thiocyanoxidans ARh 2T Using Transcriptomics.

Authors:  Tom Berben; Cherel Balkema; Dimitry Y Sorokin; Gerard Muyzer
Journal:  mSystems       Date:  2017-12-26       Impact factor: 6.496

2.  Comparative Genome Analysis of Three Thiocyanate Oxidizing Thioalkalivibrio Species Isolated from Soda Lakes.

Authors:  Tom Berben; Lex Overmars; Dimitry Y Sorokin; Gerard Muyzer
Journal:  Front Microbiol       Date:  2017-02-28       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.